BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038692
(443 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255574954|ref|XP_002528383.1| set domain protein, putative [Ricinus communis]
gi|223532171|gb|EEF33976.1| set domain protein, putative [Ricinus communis]
Length = 500
Score = 609 bits (1571), Expect = e-172, Method: Compositional matrix adjust.
Identities = 312/504 (61%), Positives = 355/504 (70%), Gaps = 69/504 (13%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAPPHSATESPGSDSAV-------VKTLALT-G 52
MPDL N+ S+TLTRC+S PP ++ + + VKTL
Sbjct: 1 MPDLGNI-------SITLTRCSSLKPLSPPPPLNLDAQSDSTTIHSLCSDSVKTLVKDCN 53
Query: 53 EEENVCANG------------NGHSVRVMKRCRGAKNISG----LEDHVAAWVKKKMELG 96
++N+ N S+RV+++ R L+D+V W +KKM+ G
Sbjct: 54 WDQNLLLPIPIPTPIPIEERCNDRSIRVLRKKRKKGGGGANGKCLDDYVRVWAQKKMDSG 113
Query: 97 VPQSNCSLPFLVGAKKMIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRN 156
V S C LPFLV +KK+ EC CHR IY GEEV CSVR CGGV+HFICVKE +SNP+
Sbjct: 114 VSNSRCFLPFLVSSKKIAECLVCHRLIYPGEEVLCSVRNCGGVFHFICVKETFHVSNPKK 173
Query: 157 FKCPQHACFICRQRLQWRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDK 216
FKCPQHACF+C+Q+ WRCVRC +ASHD+C PWPD+VIHL D P RAVCWRHP WLLDK
Sbjct: 174 FKCPQHACFVCKQKFDWRCVRCLMASHDRCGPWPDKVIHLIDPPRRAVCWRHPTNWLLDK 233
Query: 217 QEVFCRLPLPYADEEFKIDLTWKDLMENKVGPPP-------------------------- 250
+EVF RLPLPY +EEFKID TWK LMENK+ PPP
Sbjct: 234 KEVFSRLPLPYVEEEFKIDATWKGLMENKLEPPPYVHIRRNVYLVKKKRDDVDNDVGCTS 293
Query: 251 ------------VQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKG 298
VQ ISCS+ACHCSE C NRPFRKEKKIKIVKTEFCGWGVE EPINKG
Sbjct: 294 CSSSCCEDCVCRVQCISCSRACHCSENCTNRPFRKEKKIKIVKTEFCGWGVETVEPINKG 353
Query: 299 EFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDP 358
+FIIEYIGEVIDDA+CEQRLWDMKY+GVQNFYMCEIRKDFTIDATFKGN SRFLNHSCDP
Sbjct: 354 DFIIEYIGEVIDDAVCEQRLWDMKYKGVQNFYMCEIRKDFTIDATFKGNSSRFLNHSCDP 413
Query: 359 NCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRK 418
NCILEKWQVEGETRVGVFAARSIK GEPLTYDYRFVQFGPEV+C+CGA +C GYLGTKRK
Sbjct: 414 NCILEKWQVEGETRVGVFAARSIKVGEPLTYDYRFVQFGPEVRCHCGAPNCHGYLGTKRK 473
Query: 419 IGKLELCWGSKRKRSSTACLAIIT 442
I KL +CWG+KR+RSSTACLAI+T
Sbjct: 474 ICKLNICWGAKRRRSSTACLAIVT 497
>gi|224143888|ref|XP_002325110.1| SET domain protein [Populus trichocarpa]
gi|222866544|gb|EEF03675.1| SET domain protein [Populus trichocarpa]
Length = 516
Score = 593 bits (1528), Expect = e-167, Method: Compositional matrix adjust.
Identities = 296/443 (66%), Positives = 329/443 (74%), Gaps = 61/443 (13%)
Query: 62 NGHSVRVMKRCRGAK---------NISG-----LEDHVAAWVKKKMELGVPQSNCSLPFL 107
NG S+RV+KR ++ NIS L+D+V WV+KKM+ GV QS C LPFL
Sbjct: 74 NGSSIRVLKRTSASRLGCKKGFNNNISSNNGKCLDDYVRVWVQKKMDAGVSQSRCLLPFL 133
Query: 108 VGAKKMIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRNFKCPQHACFIC 167
VGAKKM+EC C R IY GE + CSVR C GVYH CV E LG+SN R FKCPQH CF C
Sbjct: 134 VGAKKMVECLFCCRSIYPGEGMQCSVRNCQGVYHLTCVVEGLGVSNLRKFKCPQHECFTC 193
Query: 168 RQRLQWRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDK---------QE 218
+ + WRCVRCT+ASH++CAPW D V++LK+QPGRAVCWRHP W LDK +E
Sbjct: 194 KGKFHWRCVRCTVASHNQCAPWSDEVVYLKNQPGRAVCWRHPTNWRLDKKHVVPATDIEE 253
Query: 219 VFCRLPLPYADEEFKIDLTWKDLMENKVGPPP---------------------------- 250
+FCRLPLPY DEEFKIDLTWKDL ENK+ PPP
Sbjct: 254 IFCRLPLPYIDEEFKIDLTWKDLTENKLEPPPYVHIRRNVYLVKKKRDDSDGDVGCTNCS 313
Query: 251 ----------VQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEF 300
VQ ISCSKAC C ETC NRPFRKEKKIKIVKTEFCGWGVEAAEP+NKG+F
Sbjct: 314 STCCENCVCRVQCISCSKACRCPETCTNRPFRKEKKIKIVKTEFCGWGVEAAEPLNKGDF 373
Query: 301 IIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNC 360
IIEYIGEVIDD LCEQRLWDMKY+GVQNFYMCEIRKDFTIDATFKGN SRFLNHSC PNC
Sbjct: 374 IIEYIGEVIDDKLCEQRLWDMKYKGVQNFYMCEIRKDFTIDATFKGNSSRFLNHSCKPNC 433
Query: 361 ILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIG 420
ILEKW VEGETRVGVFAA SI+ GEPLTYDYRFV+FGPEVKCYCGA +CQGYLGTKRKI
Sbjct: 434 ILEKWDVEGETRVGVFAAGSIRVGEPLTYDYRFVRFGPEVKCYCGAPNCQGYLGTKRKIA 493
Query: 421 KLELCWGSKRKRSSTACLAIITL 443
KL + WG+KRKR+STAC+AIIT+
Sbjct: 494 KLNIGWGAKRKRTSTACVAIITM 516
>gi|449457959|ref|XP_004146715.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like [Cucumis
sativus]
Length = 502
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 298/507 (58%), Positives = 352/507 (69%), Gaps = 69/507 (13%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAPPH---SATESPGSDSAVVKTLA-------L 50
MPDL NL+LS S LTLT + +LKPL PH S +S + +L+ L
Sbjct: 1 MPDLGNLSLSQS---LTLTP-SPNLKPLLHPHLPSSFLSHKSLNSLPIDSLSTQPFQCSL 56
Query: 51 TGEEENVCANG----NGHSVRVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSLPF 106
E+++ A + H+ ++ R + LEDHV +WV++K G+PQS CS PF
Sbjct: 57 PNPEDDLPAPTQIRVSTHTRNLLDRSKRNYPAKNLEDHVKSWVRRKQHSGLPQSRCSFPF 116
Query: 107 LVGAKKMIE--CRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRNFKCPQHAC 164
L G +KM+E C CHR+I+ GEEV CS+R CGGVYH C K+ LG S R FKCPQH C
Sbjct: 117 LQGTRKMVEKECLCCHRYIFPGEEVECSIRDCGGVYHVTCAKDNLGFSGHRKFKCPQHEC 176
Query: 165 FICRQRLQWRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLL---------D 215
FIC+QRL W+C+RC IASHDKCAPWPD+VIHLK+QPGRA+CWRHP W + D
Sbjct: 177 FICKQRLHWKCIRCNIASHDKCAPWPDKVIHLKNQPGRAICWRHPTDWRVDLKHAVSTSD 236
Query: 216 KQEVFCRLPLPYADEEFKIDLTWKDLMENKVGPPP------------------------- 250
+EVF RLPLPY DEEFKID+TWKD E K+ PPP
Sbjct: 237 LEEVFLRLPLPYVDEEFKIDITWKD-AEKKMEPPPYTHIRRNIYLVKKKRDTGVADGLGC 295
Query: 251 --------------VQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPIN 296
VQ ISCS+ACHC +TC NRPFRKEKK+KIVKTE CGWGVEAAE I
Sbjct: 296 TNCKTECSDDCVCRVQCISCSRACHCRDTCTNRPFRKEKKVKIVKTELCGWGVEAAESIG 355
Query: 297 KGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSC 356
KGEF+IEYIGEVIDDALCEQRLWDMKY+G++NFYMCEIRKDFTIDATFKGN SRFLNHSC
Sbjct: 356 KGEFVIEYIGEVIDDALCEQRLWDMKYKGMKNFYMCEIRKDFTIDATFKGNASRFLNHSC 415
Query: 357 DPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
DPNC LEKWQVEGETRVGVFAARSI+ GEPLTYDYRFVQFGPEVKC+CGA +CQ YLGTK
Sbjct: 416 DPNCSLEKWQVEGETRVGVFAARSIEVGEPLTYDYRFVQFGPEVKCHCGAPNCQRYLGTK 475
Query: 417 RKIGKLELCWGSKRKRSSTACLAIITL 443
++I ++L WG+KR+R+S A + I +
Sbjct: 476 KRISAIDLFWGTKRRRTSAARITGIIV 502
>gi|225445763|ref|XP_002272781.1| PREDICTED: histone-lysine N-methyltransferase ASHR3 [Vitis
vinifera]
gi|297743704|emb|CBI36587.3| unnamed protein product [Vitis vinifera]
Length = 494
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 297/500 (59%), Positives = 342/500 (68%), Gaps = 63/500 (12%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAPP--HSATESPGSDSAVVKTLALTGEEENVC 58
MPDL NL SSS + +RC +LKPL+ SP S + + ++ +
Sbjct: 1 MPDLGNLLNSSSLTL---SRC-HNLKPLSDSCDSGGPLSPNSVNWEQRLRFPLSPKDEIE 56
Query: 59 ANGNGHSVRVMKRCRGAK--------NISGLEDHVAAWVKKKMELGVPQSNCSLPFLVGA 110
+G +RV+KR RG N L+DHV AW KKME G +S CSLPF+ GA
Sbjct: 57 CEKSGVGIRVLKRTRGGSLDRSKKLSNGKSLDDHVKAWADKKMESGATKSQCSLPFMSGA 116
Query: 111 KKMIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRNFKCPQHACFICRQR 170
++ EC CH FIY GEEV C++ GC GVYH C K LG S R FKCPQHACF+C++R
Sbjct: 117 SRLDECLVCHSFIYPGEEVSCTIDGCQGVYHLKCAKNELGFSTKRKFKCPQHACFVCKRR 176
Query: 171 LQWRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDK---------QEVFC 221
+CVRC IASH KCAPWP+ + L+++ G+AVCWRHP W LDK +E+F
Sbjct: 177 SYLQCVRCIIASHQKCAPWPEEMTLLRNRSGQAVCWRHPTDWRLDKKHAAPTSDIEEIFS 236
Query: 222 RLPLPYADEEFKIDLTWKDLMENKVGPPP------------------------------- 250
RLPLPY +EEFKIDLTWKD +ENK+ P P
Sbjct: 237 RLPLPYDEEEFKIDLTWKDTVENKMEPTPYVHIRRNVYLVKKKRDDAADGIGCTNCSSVC 296
Query: 251 -------VQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIE 303
VQ ISCSK+CHCSE C NRPFRKEKKIKIVKTE CGWGV+AAE INKG+F+IE
Sbjct: 297 SENCVCRVQCISCSKSCHCSENCTNRPFRKEKKIKIVKTELCGWGVDAAESINKGDFVIE 356
Query: 304 YIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILE 363
YIGEVIDDALCE+RLWDMK RG QNFYMCEIRKDFTIDATFKGN SRFLNHSCDPNC LE
Sbjct: 357 YIGEVIDDALCERRLWDMKDRGDQNFYMCEIRKDFTIDATFKGNASRFLNHSCDPNCKLE 416
Query: 364 KWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLE 423
KWQVEGETRVGVFAARSIKAGEPLTYDYRFV+FGPEVKC+CGA SC GYLGTK+KI K+E
Sbjct: 417 KWQVEGETRVGVFAARSIKAGEPLTYDYRFVRFGPEVKCHCGAPSCHGYLGTKKKIAKVE 476
Query: 424 LCWGSKRKRSSTACLAIITL 443
LCWGSKRKRSS A+IT+
Sbjct: 477 LCWGSKRKRSSY--FALITI 494
>gi|356514491|ref|XP_003525939.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like [Glycine
max]
Length = 459
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 286/489 (58%), Positives = 333/489 (68%), Gaps = 79/489 (16%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAP----PHSATESPGSDSAVVKTLALTGEEEN 56
MPDL NL+LS+ LTL+RC+S+ AP P++ + PGS+ A
Sbjct: 1 MPDLGNLSLSAP---LTLSRCSSNDALDAPVKTLPNAFSSEPGSEPAF------------ 45
Query: 57 VCANGNGHSVRVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSLPFLVGAKKMIEC 116
RV+KR RG+ + + +++ LPFLVGA M EC
Sbjct: 46 --------EPRVLKRTRGSLDRVKKPPIADKAFQDRLK--------DLPFLVGAPMMEEC 89
Query: 117 RACHRFIYHGE-EVFCSVRGCGGVYHFICVKERLGISNPRNFKCPQHACFICRQRLQWRC 175
C FIY G+ E+ CSVRGC YH C K+ +G SN + FKCPQH CFIC+ + Q+RC
Sbjct: 90 CFCCHFIYPGDDELLCSVRGCDARYHSECAKDAVGASNLKKFKCPQHVCFICKLKKQFRC 149
Query: 176 VRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDKQEVFCRLPLPYADEEFKID 235
VRC IA H KCAPW D V+ LKD PG+AVCWRHP+ W LD++EVFCRLPLP+ EEFKID
Sbjct: 150 VRCKIAFHSKCAPWSDSVVQLKDHPGQAVCWRHPSDWRLDRKEVFCRLPLPFISEEFKID 209
Query: 236 LTWKDLMENKVGPPP-----------------------------------------VQYI 254
TWKD M+NK+ P VQ I
Sbjct: 210 FTWKD-MDNKMEQPSPYVHIRRNIYLVKKKRSDADDGAGCTSCSSTSTCSDDCVCRVQCI 268
Query: 255 SCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
SCSKAC CSE CNNRPFRKEKKIKIVKTE CGWGVEAAE I+KG FIIEYIGEVIDDALC
Sbjct: 269 SCSKACRCSENCNNRPFRKEKKIKIVKTELCGWGVEAAETIDKGGFIIEYIGEVIDDALC 328
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
E+RLWDMKYRGVQNFYMCEIRKDFTIDATFKGN SRFLNHSCDPNC+LEKWQV+GETRVG
Sbjct: 329 EKRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNTSRFLNHSCDPNCVLEKWQVDGETRVG 388
Query: 375 VFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKRSS 434
VFAA SI+AGEPLTYDYRFVQFGPEVKC+CGA++CQG+LGTK+KIGKL+L WGSKRKR+S
Sbjct: 389 VFAACSIEAGEPLTYDYRFVQFGPEVKCHCGAANCQGFLGTKKKIGKLDLYWGSKRKRTS 448
Query: 435 TA-CLAIIT 442
A C+ ++T
Sbjct: 449 KASCITLVT 457
>gi|297802948|ref|XP_002869358.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315194|gb|EFH45617.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 497
Score = 527 bits (1358), Expect = e-147, Method: Compositional matrix adjust.
Identities = 278/503 (55%), Positives = 329/503 (65%), Gaps = 90/503 (17%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAPPH--SATESPGSDSAVVKTLALTGEEENVC 58
M DL N++LS+S + LT C S L + P + +SP EN+
Sbjct: 1 MLDLGNMSLSAS---VALTCCPSFLPAASGPELSKSIDSP----------------ENIA 41
Query: 59 ANGNG-HS--------VRVMKRCRGAKNISG-------------LEDHVAAWVKKKMELG 96
+ NG HS VR +K G +G LEDHV WVK+++ G
Sbjct: 42 GDCNGKHSPMIPPEEEVRDIKTANGVTAFTGKQNPSDRVKKGLILEDHVKDWVKRRVASG 101
Query: 97 VPQSNCSLPFLVGAKKMIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRN 156
V ++ C LPFLVGAKKM++C CH+ +Y GEE+ CSVRGC G YH +C K+ LG S
Sbjct: 102 VSETRCCLPFLVGAKKMVDCLVCHKPVYPGEELSCSVRGCQGAYHSLCAKDSLGFSKSSK 161
Query: 157 FKCPQHACFICRQRLQWRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDK 216
F+CPQH CF+C+QR QWRCV+C +A+HDK +PW ++HLKDQPGRAVCWRHP W LDK
Sbjct: 162 FRCPQHDCFVCKQRTQWRCVKCPMAAHDKHSPWSKEILHLKDQPGRAVCWRHPTNWRLDK 221
Query: 217 ---------QEVFCRLPLPYADEEFKIDLTWKDLMENKVGPP------------------ 249
+EVFC+LPLPY +EEFKIDLTWKD + PP
Sbjct: 222 KHAVAQSEMEEVFCQLPLPYVEEEFKIDLTWKDSVVKDDLPPYVHIRRNIYFVKKKRDNA 281
Query: 250 -------------------PVQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVE 290
VQ ISCSK C C ETC NRPFRKEKKI+IVKTE CGWGVE
Sbjct: 282 NDGVGCTNCGPTCCRSCVCRVQCISCSKGCRCPETCGNRPFRKEKKIRIVKTEHCGWGVE 341
Query: 291 AAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSR 350
AAE INK +FI+EYIGEVI DA CEQRLWDMK++G+++FYMCEI+KDFTIDATFKGN SR
Sbjct: 342 AAESINKEDFIVEYIGEVISDAQCEQRLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASR 401
Query: 351 FLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQ 410
FLNHSC PNC+LEKWQVEGETRVGVFAAR I+AGEPLTYDYRFVQFGPEVKC CG+ SCQ
Sbjct: 402 FLNHSCSPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRFVQFGPEVKCNCGSESCQ 461
Query: 411 GYLGTKRKIGK-LELCWGSKRKR 432
GYLGTKRK L + WG+KR+R
Sbjct: 462 GYLGTKRKEPNCLAVSWGAKRRR 484
>gi|18417683|ref|NP_567859.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
gi|75164864|sp|Q949T8.1|ASHR3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR3; AltName:
Full=ASH1-related protein 3; AltName: Full=Protein SET
DOMAIN GROUP 4; AltName: Full=Protein stamen loss
gi|15292921|gb|AAK92831.1| unknown protein [Arabidopsis thaliana]
gi|20465681|gb|AAM20309.1| unknown protein [Arabidopsis thaliana]
gi|56201422|dbj|BAD72877.1| stamen loss [Arabidopsis thaliana]
gi|332660421|gb|AEE85821.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
Length = 497
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 276/488 (56%), Positives = 326/488 (66%), Gaps = 60/488 (12%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAPPHSA--TESPGSDSAVV--KTLALTGEEEN 56
M DL N+++S+S + LT C S L + P A SP + + K L + EE
Sbjct: 1 MLDLGNMSMSAS---VALTCCPSFLPAASGPELAKSINSPENLAGECNGKHLPMIPPEEE 57
Query: 57 V----CANGNGHSVRVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSLPFLVGAKK 112
V ANG R K L+DHV WVK+++ GV +S C LPFLVGAKK
Sbjct: 58 VKDIKIANGVTAFTRKQNPSDRVKKGFVLDDHVKDWVKRRVASGVSESTCFLPFLVGAKK 117
Query: 113 MIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRNFKCPQHACFICRQRLQ 172
M++C CH+ +Y GE++ CSVRGC G YH +C KE LG S FKCPQH CF+C+QR Q
Sbjct: 118 MVDCLVCHKPVYPGEDLSCSVRGCQGAYHSLCAKESLGFSKSSKFKCPQHECFVCKQRTQ 177
Query: 173 WRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDK---------QEVFCRL 223
WRCV+C +A+HDK +PW ++HLKDQPGRAVCWRHP W LD +EVFC+L
Sbjct: 178 WRCVKCPMAAHDKHSPWSKEILHLKDQPGRAVCWRHPTDWRLDTKHAVAQSEIEEVFCQL 237
Query: 224 PLPYADEEFKIDLTWKDLMENKVGPPP--------------------------------- 250
PLPY +EEFKIDL WKD + K PP
Sbjct: 238 PLPYVEEEFKIDLAWKDSVV-KEDPPSYVHIRRNIYLVKKKRDNANDGVGCTNCGPNCDR 296
Query: 251 -----VQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYI 305
VQ ISCSK C C E+C NRPFRKEKKIKIVKTE CGWGVEAAE INK +FI+EYI
Sbjct: 297 SCVCRVQCISCSKGCSCPESCGNRPFRKEKKIKIVKTEHCGWGVEAAESINKEDFIVEYI 356
Query: 306 GEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
GEVI DA CEQRLWDMK++G+++FYMCEI+KDFTIDATFKGN SRFLNHSC+PNC+LEKW
Sbjct: 357 GEVISDAQCEQRLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCNPNCVLEKW 416
Query: 366 QVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGK-LEL 424
QVEGETRVGVFAAR I+AGEPLTYDYRFVQFGPEVKC CG+ +CQGYLGTKRK L +
Sbjct: 417 QVEGETRVGVFAARQIEAGEPLTYDYRFVQFGPEVKCNCGSENCQGYLGTKRKEPNCLVV 476
Query: 425 CWGSKRKR 432
WG+KR+R
Sbjct: 477 SWGAKRRR 484
>gi|2980780|emb|CAA18207.1| putative protein [Arabidopsis thaliana]
gi|7269987|emb|CAB79804.1| putative protein [Arabidopsis thaliana]
Length = 477
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 252/454 (55%), Positives = 308/454 (67%), Gaps = 25/454 (5%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAPPHSA--TESPGSDSAVV--KTLALTGEEEN 56
M DL N+++S+S + LT C S L + P A SP + + K L + EE
Sbjct: 14 MLDLGNMSMSAS---VALTCCPSFLPAASGPELAKSINSPENLAGECNGKHLPMIPPEEE 70
Query: 57 V----CANGNGHSVRVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSLPFLVGAKK 112
V ANG R K L+DHV WVK+++ GV +S C LPFLVGAKK
Sbjct: 71 VKDIKIANGVTAFTRKQNPSDRVKKGFVLDDHVKDWVKRRVASGVSESTCFLPFLVGAKK 130
Query: 113 MIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRNFKCPQH--------AC 164
M++C CH+ +Y GE++ CSVRGC G YH +C KE LG S FKCPQH C
Sbjct: 131 MVDCLVCHKPVYPGEDLSCSVRGCQGAYHSLCAKESLGFSKSSKFKCPQHFLIEWSMQEC 190
Query: 165 FICRQRLQWRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDKQEVFCRLP 224
F+C+QR QWRCV+C +A+HDK +PW ++HLKDQPGRAVCWRHP W LD + F
Sbjct: 191 FVCKQRTQWRCVKCPMAAHDKHSPWSKEILHLKDQPGRAVCWRHPTDWRLDTKVDFLVYR 250
Query: 225 LPYADEEFKIDLTW-KDLMENKVGPPPVQYISCSKA----CHCSETCNNRPFRKEKKIKI 279
E DL + + L+ ++ + +++ C C E+C NRPFRKEKKIKI
Sbjct: 251 FHKILLEASYDLYFHRVLLTGLTKSLLIEQHAVAQSEIEGCSCPESCGNRPFRKEKKIKI 310
Query: 280 VKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFT 339
VKTE CGWGVEAAE INK +FI+EYIGEVI DA CEQRLWDMK++G+++FYMCEI+KDFT
Sbjct: 311 VKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQRLWDMKHKGMKDFYMCEIQKDFT 370
Query: 340 IDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPE 399
IDATFKGN SRFLNHSC+PNC+LEKWQVEGETRVGVFAAR I+AGEPLTYDYRFVQFGPE
Sbjct: 371 IDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRFVQFGPE 430
Query: 400 VKCYCGASSCQGYLGTKRKIGK-LELCWGSKRKR 432
VKC CG+ +CQGYLGTKRK L + WG+KR+R
Sbjct: 431 VKCNCGSENCQGYLGTKRKEPNCLVVSWGAKRRR 464
>gi|449530608|ref|XP_004172286.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like, partial
[Cucumis sativus]
Length = 285
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 181/268 (67%), Positives = 203/268 (75%), Gaps = 40/268 (14%)
Query: 215 DKQEVFCRLPLPYADEEFKIDLTWKDLMENKVGPPP------------------------ 250
D +EVF RLPLPY DEEFKID+TWKD E K+ PPP
Sbjct: 19 DLEEVFLRLPLPYVDEEFKIDITWKD-AEKKMEPPPYTHIRRNIYLVKKKRDTGVADGLG 77
Query: 251 ---------------VQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPI 295
VQ ISCS+ACHC +TC NRPFRKEKK+KIVKTE CGWGVEAAE I
Sbjct: 78 CTNCKTECSDDCVCRVQCISCSRACHCRDTCTNRPFRKEKKVKIVKTELCGWGVEAAESI 137
Query: 296 NKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHS 355
KGEF+IEYIGEVIDDALCEQRLWDMKY+G++NFYMCEIRKDFTIDATFKGN SRFLNHS
Sbjct: 138 GKGEFVIEYIGEVIDDALCEQRLWDMKYKGMKNFYMCEIRKDFTIDATFKGNASRFLNHS 197
Query: 356 CDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGT 415
CDPNC LEKWQVEGETRVGVFAARSI+ GEPLTYDYRFVQFGPEVKC+CGA +CQ YLGT
Sbjct: 198 CDPNCSLEKWQVEGETRVGVFAARSIEVGEPLTYDYRFVQFGPEVKCHCGAPNCQRYLGT 257
Query: 416 KRKIGKLELCWGSKRKRSSTACLAIITL 443
K++I ++L WG+KR+R+S A + I +
Sbjct: 258 KKRISAIDLFWGTKRRRTSAARITGIIV 285
>gi|357142828|ref|XP_003572707.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASHR3-like [Brachypodium distachyon]
Length = 466
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 191/387 (49%), Positives = 228/387 (58%), Gaps = 74/387 (19%)
Query: 81 LEDHVAAWVKKKMELGVPQSNCSLPFLVGAKKMIECRACHRFIYHGEEVFCSVRGCGGVY 140
LEDH W +K+ G +C LPFL GA K + +YH +
Sbjct: 12 LEDHAKEWAARKVASGAALHHCVLPFLTGAPKAADGYGLS--MYH--------QLMNSTC 61
Query: 141 HFICVKERLGISNPRNFKCPQHACFICRQRLQWRCVRCTIASHDKCAPWPDRVIHLKDQP 200
HF + + A F+ ++R A+H KCAPWP +IHLKD
Sbjct: 62 HFY-------------LQMNKAAWFVSKRRFFGAVDVVQXAAHTKCAPWP--LIHLKDDQ 106
Query: 201 GRAVCWRHPAKWLLDKQ---------EVFCRLPLPYADEEFKIDLTWKDLMENKVGPPPV 251
G A+CWRHP+ WLL + EVFCRLPLPY +EEF ID T +D PP
Sbjct: 107 GSAICWRHPSNWLLQNENADLTNNIEEVFCRLPLPYVNEEFNIDSTIRDFTAIVCKPPSY 166
Query: 252 QYI----------------------------------------SCSKACHCSETCNNRPF 271
+I SCSK+CHCS+ C N+PF
Sbjct: 167 IHIRRNVYLVKKKRADSSAETGCTNCRADSVCKDDCECRGLSMSCSKSCHCSDLCTNKPF 226
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
RK+KKIKIVK+E CGWG A EP+ K +FIIEY+GEVIDDA CEQRLW+MK RG +NFYM
Sbjct: 227 RKDKKIKIVKSEGCGWGAIALEPLEKCDFIIEYVGEVIDDATCEQRLWEMKRRGDKNFYM 286
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
CEI KD+TIDATFKGN SRFLNHSCDPNC LEKWQV+GETRVGVFA+R I+ GEPLTYDY
Sbjct: 287 CEISKDYTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRFIEVGEPLTYDY 346
Query: 392 RFVQFGPEVKCYCGASSCQGYLGTKRK 418
RFV FG +VKC+CGA SCQGYLG++ K
Sbjct: 347 RFVHFGEKVKCHCGAKSCQGYLGSQLK 373
>gi|326522849|dbj|BAJ88470.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 394
Score = 343 bits (881), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 171/288 (59%), Positives = 197/288 (68%), Gaps = 51/288 (17%)
Query: 180 IASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDK---------QEVFCRLPLPYADE 230
A+H KCAPWP +IHLKD G A+CWRHP+ WLL +EVFCRLPLPY +E
Sbjct: 20 FAAHTKCAPWP--LIHLKDDQGSAICWRHPSNWLLQNENADLTNNIEEVFCRLPLPYVNE 77
Query: 231 EFKIDLTWKDLMENKVGPPPVQYI------------------------------------ 254
EF ID T ++ PP +I
Sbjct: 78 EFNIDSTIRNFTAIVCKPPSYTHIRRNVYLVKKKRADSSAETGCTNCKSDSVCKDDCECR 137
Query: 255 ----SCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
SCSK+CHCS+ C+N+PFRK+KKIKIVK+E CGWG A EP+ KG+FIIEY+GEVI+
Sbjct: 138 GLSMSCSKSCHCSDLCSNKPFRKDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVIN 197
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
DA CEQRLWDMK RG +NFYMCEI KDFTIDATFKGN SRFLNHSCDPNC LEKWQVEGE
Sbjct: 198 DATCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGE 257
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRK 418
TRVGVFA+R I+ GEPLTYDYRFV FG +VKC+CGA SCQGYLG++ K
Sbjct: 258 TRVGVFASRIIEVGEPLTYDYRFVHFGEKVKCHCGAKSCQGYLGSQLK 305
>gi|168025972|ref|XP_001765507.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683357|gb|EDQ69768.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 993
Score = 301 bits (770), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 168/390 (43%), Positives = 219/390 (56%), Gaps = 65/390 (16%)
Query: 107 LVGAKKMIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRNFK--CPQH-- 162
L + M C C + E+V CS C VYHF C + +S K CPQH
Sbjct: 592 LFPSNYMEVCFECKGSLNSAEKVVCSRNDCDRVYHFSCTVDLSDVSWSCEGKLVCPQHMF 651
Query: 163 ------------------------------ACFIC---RQRLQWRCVRCTIASHDKCAPW 189
C+ C R WRC C++A+H+ C P
Sbjct: 652 SADVEWTIHSAQDSGVPICEAVEDTLAEGLVCYACGKSRGAKLWRCQYCSLATHESCIPC 711
Query: 190 PDRVIHLKDQPGRAVCWRHPAK---------WLLDKQEVFCRLPLPYADEEFKIDLTW-K 239
P+ ++PG A+CWRH + W D Q++F LP+P +EF + + K
Sbjct: 712 PEVTTLFHNKPGWAICWRHGSDIDNKLKTPTW--DCQDIFRHLPVPEVQQEFSLAPEFIK 769
Query: 240 DLMENKVGPPPVQYI----------SCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGV 289
++MEN PPP +I SCSK C CS+ C N+PFRK+K++K+ KT CGWG
Sbjct: 770 EVMENDREPPPYFFIRRSILVYLSMSCSKDCKCSDKCCNKPFRKDKRLKVSKTAHCGWGA 829
Query: 290 EAAEPINKGEFIIEYIGEVIDDALCEQRLWDMK-YRGVQNFYMCEIRKDFTIDATFKGNF 348
+ I K EF+IEY GEVIDDA+CE+RLW+MK R + NFYMCEI KDF IDAT KGN
Sbjct: 830 FTSVAIKKDEFVIEYTGEVIDDAMCEKRLWEMKGRRSICNFYMCEIAKDFIIDATRKGNA 889
Query: 349 SRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASS 408
SR+LNHSC PNC LEKW+V+GETRVGVFA R+I AGE LTYDY++V+FGP VKC CGA +
Sbjct: 890 SRYLNHSCQPNCRLEKWRVDGETRVGVFAGRNIIAGEELTYDYKYVEFGPNVKCRCGAPN 949
Query: 409 CQGYLG-----TKRKIGKLELCWGSKRKRS 433
C+G +G ++ + WG++ KRS
Sbjct: 950 CRGVIGERLISNRKSNNNFPIKWGAQHKRS 979
>gi|326487338|dbj|BAJ89653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 147/242 (60%), Positives = 169/242 (69%), Gaps = 40/242 (16%)
Query: 217 QEVFCRLPLPYADEEFKIDLTWKDLMENKVGPPPVQYI---------------------- 254
+EVFCRLPLPY +EEF ID T ++ PP +I
Sbjct: 10 EEVFCRLPLPYVNEEFNIDSTIRNFTAIVCKPPSYTHIRRNVYLVKKKRADSSAETGCTN 69
Query: 255 ------------------SCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPIN 296
SCSK+CHCS+ C+N+PFRK+KKIKIVK+E CGWG A EP+
Sbjct: 70 CKSDSVCKDDCECRGLSMSCSKSCHCSDLCSNKPFRKDKKIKIVKSEGCGWGAVALEPLE 129
Query: 297 KGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSC 356
KG+FIIEY+GEVI+DA CEQRLWDMK RG +NFYMCEI KDFTIDATFKGN SRFLNHSC
Sbjct: 130 KGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSC 189
Query: 357 DPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
DPNC LEKWQVEGETR GVFA+R I+ GEPLTYDYRFV FG +VKC+CGA SCQGYLG++
Sbjct: 190 DPNCKLEKWQVEGETRAGVFASRIIEVGEPLTYDYRFVHFGEKVKCHCGAKSCQGYLGSQ 249
Query: 417 RK 418
K
Sbjct: 250 LK 251
>gi|242065740|ref|XP_002454159.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
gi|241933990|gb|EES07135.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
Length = 328
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/241 (58%), Positives = 166/241 (68%), Gaps = 40/241 (16%)
Query: 218 EVFCRLPLPYADEEFKIDLTWKDLMENKVGPPPVQYI----------------------- 254
EVFCRLPLPY +E+ I+ T + + PPP I
Sbjct: 1 EVFCRLPLPYVNEDLNIESTLAEYAQAVYKPPPYTSIRRNVYLIKKKRTSVRVDIGCTNC 60
Query: 255 -----------------SCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINK 297
SCSK C CS+ C NRPFRK+KKIKIVKT+ CGWG A EP+ +
Sbjct: 61 RADSTCKEDCEFRGISTSCSKNCRCSDLCTNRPFRKDKKIKIVKTKRCGWGAVALEPLER 120
Query: 298 GEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCD 357
G+F+IEY+GEVIDDA CEQRLWD++YRG +NFYMCEI KDFTIDATFKGN SRFLNHSC+
Sbjct: 121 GDFVIEYVGEVIDDATCEQRLWDIRYRGDKNFYMCEISKDFTIDATFKGNVSRFLNHSCE 180
Query: 358 PNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKR 417
PNC LEKWQV+GETRVGVFA+RSIK GEPLTYDYRFV FG +VKC+C A +CQGYLG++
Sbjct: 181 PNCKLEKWQVDGETRVGVFASRSIKVGEPLTYDYRFVHFGEKVKCHCEAVNCQGYLGSQI 240
Query: 418 K 418
K
Sbjct: 241 K 241
>gi|222623224|gb|EEE57356.1| hypothetical protein OsJ_07493 [Oryza sativa Japonica Group]
Length = 384
Score = 278 bits (711), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 127/166 (76%), Positives = 144/166 (86%)
Query: 253 YISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
Y+SCSK CHCS+ C N+PFRK+KKIK VKT+ CGWG + EP+ KG+FIIEY+GEVI+DA
Sbjct: 121 YMSCSKNCHCSDMCTNKPFRKDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDA 180
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CEQRLWDMK RG +NFYMCEI KDFTIDATFKGN SRFLNHSCDPNC LEKWQV+GETR
Sbjct: 181 TCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETR 240
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRK 418
VGVFA+RSI+ GE LTYDYRFV FG +VKCYCGA +CQGYLG + K
Sbjct: 241 VGVFASRSIQVGEHLTYDYRFVHFGEKVKCYCGAQNCQGYLGNQIK 286
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 3/41 (7%)
Query: 81 LEDHVAAWVKKKMELGVPQSNCSLPFLVG---AKKMIECRA 118
E++V W KK LGVP C LPFL G AKK + RA
Sbjct: 58 FEEYVKEWKAKKAALGVPAGRCELPFLTGTPKAKKRPDSRA 98
>gi|297599580|ref|NP_001047405.2| Os02g0611300 [Oryza sativa Japonica Group]
gi|255671080|dbj|BAF09319.2| Os02g0611300 [Oryza sativa Japonica Group]
Length = 263
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 126/165 (76%), Positives = 143/165 (86%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+SCSK CHCS+ C N+PFRK+KKIK VKT+ CGWG + EP+ KG+FIIEY+GEVI+DA
Sbjct: 1 MSCSKNCHCSDMCTNKPFRKDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDAT 60
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CEQRLWDMK RG +NFYMCEI KDFTIDATFKGN SRFLNHSCDPNC LEKWQV+GETRV
Sbjct: 61 CEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRV 120
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRK 418
GVFA+RSI+ GE LTYDYRFV FG +VKCYCGA +CQGYLG + K
Sbjct: 121 GVFASRSIQVGEHLTYDYRFVHFGEKVKCYCGAQNCQGYLGNQIK 165
>gi|238013758|gb|ACR37914.1| unknown [Zea mays]
Length = 252
Score = 271 bits (694), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 123/165 (74%), Positives = 144/165 (87%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+SCSK C CS+ C NRPFRK+KKIKIVKT+ CGWG A EP+ +G+F+IEY+GEVIDDA
Sbjct: 1 MSCSKNCRCSDLCTNRPFRKDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDAT 60
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CEQRLWD++ RG +NFYMCEI KDFTIDATFKGN SRFLNHSC+PNC LEKWQV+GETRV
Sbjct: 61 CEQRLWDIRRRGDKNFYMCEISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRV 120
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRK 418
GVFA+RSI+ GEPLTYDYRFV FG +VKC+C A +CQGYLG++ K
Sbjct: 121 GVFASRSIEVGEPLTYDYRFVHFGEKVKCHCEAVNCQGYLGSQIK 165
>gi|42407424|dbj|BAD10031.1| SET domain protein-like [Oryza sativa Japonica Group]
Length = 437
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 115/199 (57%), Positives = 152/199 (76%), Gaps = 9/199 (4%)
Query: 216 KQEVFCRLPLPYADEEFKIDLTWKDLMENKVGPPPVQYISCSKACHCSETCNNRPFRKEK 275
K+E F RLPLP+ +EF ID K+ ++N PPP Y+ H C N+PFR++K
Sbjct: 238 KKEAFRRLPLPHTFQEFNIDPIKKEELDNGTEPPP--YV------HMKHECTNKPFRRQK 289
Query: 276 KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIR 335
KI+IVKT++CGWG A E I K +F+IE++GEVIDD CE+RL DM+ RG +NFYMC+++
Sbjct: 290 KIEIVKTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDMRRRGDKNFYMCKVK 349
Query: 336 KDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQ 395
KDF IDATFKGN RF NHSC+PNC L+KWQV G+TR+GVFA+++I+ GEPLTYDYRF Q
Sbjct: 350 KDFVIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIEVGEPLTYDYRFEQ 409
Query: 396 -FGPEVKCYCGASSCQGYL 413
+GPE++C+CGA +CQG +
Sbjct: 410 HYGPEIECFCGAQNCQGNM 428
>gi|357141465|ref|XP_003572234.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like
[Brachypodium distachyon]
Length = 347
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 123/239 (51%), Positives = 158/239 (66%), Gaps = 41/239 (17%)
Query: 216 KQEVFCRLPLPYADEEFKIDLTWKDLMENKVGPP-------------------------- 249
++E F +LPLPY D EF ID + +E+ PP
Sbjct: 109 RKEAFRQLPLPYTDREFNIDPIKTEELESLTKPPTYVLLKRSILCMIHKCDGDAIEGGCT 168
Query: 250 ----PVQ----------YISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPI 295
P+ +ISCS+AC CS C NRPFR+EK+I++VKT+ CGWGV A E I
Sbjct: 169 DCDPPLACKTMCSCRSVWISCSRACKCSNECTNRPFRREKRIEVVKTQHCGWGVVALESI 228
Query: 296 NKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHS 355
KG+F+IE++GEVIDD CE+RL DMK RG QNFYMC++ K+F IDATF+GN RFLNHS
Sbjct: 229 QKGDFVIEFVGEVIDDVTCEERLEDMKRRGDQNFYMCKVNKNFVIDATFRGNACRFLNHS 288
Query: 356 CDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQ-FGPEVKCYCGASSCQGYL 413
C+PNC LEKWQV G+TR+GVFA+++I+ G+PLTY YRF Q FGP ++C CGA++CQG L
Sbjct: 289 CEPNCQLEKWQVNGKTRLGVFASQAIEVGKPLTYSYRFKQHFGPRMECLCGAANCQGKL 347
>gi|218191149|gb|EEC73576.1| hypothetical protein OsI_08033 [Oryza sativa Indica Group]
Length = 163
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 109/141 (77%), Positives = 124/141 (87%)
Query: 253 YISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
Y+SCSK CHCS+ C N+PFRK+KKIK VKT+ CGWG + EP+ KG+FIIEY+GEVI+DA
Sbjct: 5 YMSCSKNCHCSDMCTNKPFRKDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDA 64
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CEQRLWDMK RG +NFYMCEI KDFTIDATFKGN SRFLNHSCDPNC LEKWQV+GETR
Sbjct: 65 TCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETR 124
Query: 373 VGVFAARSIKAGEPLTYDYRF 393
VGVFA+RSI+ GE LTYDYR+
Sbjct: 125 VGVFASRSIQVGEHLTYDYRY 145
>gi|414870328|tpg|DAA48885.1| TPA: putative SET-domain containing protein family [Zea mays]
Length = 266
Score = 238 bits (606), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 121/229 (52%), Positives = 150/229 (65%), Gaps = 30/229 (13%)
Query: 215 DKQEVFCRLPLPYADEEFKIDLTWKDLMENKVGPPPVQY----------ISCSK------ 258
D +E F RLPLPY D+EF ID K ++++ PPP + I+C
Sbjct: 38 DIKEAFRRLPLPYIDQEFNIDPINKQDLKSESEPPPYKKSEGRPIKGGCINCDHGSTCGS 97
Query: 259 -------------ACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYI 305
C CS C+N+PFR+EKKIKIVKT CGWG A E I K +F+IE++
Sbjct: 98 SCSCRSSMTSCSLGCPCSVKCSNKPFRREKKIKIVKTRQCGWGAIALETIGKDDFVIEFV 157
Query: 306 GEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
GEVIDDA+CE RL DM+ R QNFYMC++ KDF ID TFKGN RF NHSC PNC LEKW
Sbjct: 158 GEVIDDAMCEDRLQDMRQRRDQNFYMCKVDKDFVIDPTFKGNACRFFNHSCQPNCRLEKW 217
Query: 366 QVEGETRVGVFAARSIKAGEPLTYDYRF-VQFGPEVKCYCGASSCQGYL 413
QV G+TR+GVFA+++I+ G PLTY+YRF FGPE +C+CGA +CQG L
Sbjct: 218 QVNGKTRLGVFASQTIEVGMPLTYNYRFRTSFGPEKECFCGAPNCQGKL 266
>gi|449529911|ref|XP_004171941.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like, partial
[Cucumis sativus]
Length = 229
Score = 231 bits (588), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 118/231 (51%), Positives = 150/231 (64%), Gaps = 20/231 (8%)
Query: 1 MPDLANLALSSSSSSLTLTRCASSLKPLAPPH---SATESPGSDSAVVKTLA-------L 50
MPDL NL+LS S LTLT + +LKPL PH S +S + +L+ L
Sbjct: 1 MPDLGNLSLSQS---LTLTP-SPNLKPLLHPHLPSSFLSHKSLNSLPIDSLSTQPFQCSL 56
Query: 51 TGEEENVCANG----NGHSVRVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSLPF 106
E+++ A + H+ ++ R + LEDHV +WV++K G+PQS CS PF
Sbjct: 57 PNPEDDLPAPTQIRVSTHTRNLLDRSKRNYPAKNLEDHVKSWVRRKQHSGLPQSRCSFPF 116
Query: 107 LVGAKKMIE--CRACHRFIYHGEEVFCSVRGCGGVYHFICVKERLGISNPRNFKCPQHAC 164
L G +KM+E C CHR+I+ GEEV CS+R CGGVYH C K+ LG S R FKCPQH C
Sbjct: 117 LQGTRKMVEKECLCCHRYIFPGEEVECSIRDCGGVYHVTCAKDNLGFSGHRKFKCPQHEC 176
Query: 165 FICRQRLQWRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKWLLD 215
FIC+QRL W+C+RC IASHDKCAPWPD+VIHLK+QPGRA+CWRHP W +D
Sbjct: 177 FICKQRLHWKCIRCNIASHDKCAPWPDKVIHLKNQPGRAICWRHPTDWRVD 227
>gi|212721872|ref|NP_001132822.1| uncharacterized protein LOC100194312 [Zea mays]
gi|194695492|gb|ACF81830.1| unknown [Zea mays]
gi|413923000|gb|AFW62932.1| putative SET-domain containing protein family [Zea mays]
Length = 418
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 102/137 (74%), Positives = 120/137 (87%)
Query: 282 TEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTID 341
T+ CGWG A EP+ +G+F+IEY+GEVIDDA CEQRLWD++ RG +NFYMCEI KDFTID
Sbjct: 195 TKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIRRRGDKNFYMCEISKDFTID 254
Query: 342 ATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK 401
ATFKGN SRFLNHSC+PNC LEKWQV+GETRVGVFA+RSI+ GEPLTYDYRFV FG +VK
Sbjct: 255 ATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIEVGEPLTYDYRFVHFGEKVK 314
Query: 402 CYCGASSCQGYLGTKRK 418
C+C A +CQGYLG++ K
Sbjct: 315 CHCEAVNCQGYLGSQIK 331
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 53/92 (57%)
Query: 71 RCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSLPFLVGAKKMIECRACHRFIYHGEEVF 130
R + + + LE+ V+ W KK GVP +C LPFL GA +ECR C + IY GEE+
Sbjct: 103 RTQASASAPTLEERVSEWAAKKAAAGVPTHHCVLPFLTGAPMAVECRLCSKIIYVGEEIK 162
Query: 131 CSVRGCGGVYHFICVKERLGISNPRNFKCPQH 162
CSV+ C ++H CV E +F+CPQH
Sbjct: 163 CSVKKCPQMFHLKCVAEDTSNFTVESFRCPQH 194
>gi|224095256|ref|XP_002310367.1| SET domain protein [Populus trichocarpa]
gi|222853270|gb|EEE90817.1| SET domain protein [Populus trichocarpa]
Length = 281
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 102/189 (53%), Positives = 134/189 (70%), Gaps = 12/189 (6%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C +C+N+PF R+ KK+K+V+TE CG G+ A E I +GEF+IEY+GEVIDD
Sbjct: 5 SCSSGCKCGSSCSNKPFQHRQVKKMKLVQTEKCGSGIVADEDIKQGEFVIEYVGEVIDDK 64
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+RLW MK+ G NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR
Sbjct: 65 TCEERLWKMKHCGETNFYLCEINRDMVIDATYKGNKSRYINHSCSPNTEMQKWIIDGETR 124
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKR 432
+G+FA R I+ GE LTYDY+FVQFG + C+CG+S C+ LG K SK K
Sbjct: 125 IGIFATRDIRKGEHLTYDYQFVQFGADQDCHCGSSGCRKKLGVK----------PSKPKM 174
Query: 433 SSTACLAII 441
SS A L ++
Sbjct: 175 SSDAALKLV 183
>gi|224117806|ref|XP_002331636.1| SET domain protein [Populus trichocarpa]
gi|222874032|gb|EEF11163.1| SET domain protein [Populus trichocarpa]
Length = 351
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 102/189 (53%), Positives = 133/189 (70%), Gaps = 12/189 (6%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C +C+N+PF R KK+K+V+TE CG G+ A E I +GEF+IEY+GEVIDD
Sbjct: 91 SCSSGCKCGSSCSNKPFQHRPLKKMKLVQTEKCGSGIVADEDIKQGEFVIEYVGEVIDDN 150
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+RLW MK+RG NFY+CEI ++ IDAT+KGN SR++NHSC PN ++KW ++GETR
Sbjct: 151 TCEERLWKMKHRGETNFYLCEINRNMVIDATYKGNKSRYINHSCSPNTEMQKWIIDGETR 210
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKR 432
+G+FA I+ GE LTYDY+FVQFG + C+CGAS C+ LG K SK K
Sbjct: 211 IGIFATHDIRKGEHLTYDYQFVQFGADQDCHCGASGCRKKLGVK----------PSKPKM 260
Query: 433 SSTACLAII 441
SS A L ++
Sbjct: 261 SSDAALKLV 269
>gi|297737225|emb|CBI26426.3| unnamed protein product [Vitis vinifera]
Length = 438
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 97/164 (59%), Positives = 123/164 (75%), Gaps = 2/164 (1%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C +C N+PF R KK+K+V+TE CG G+ A E I +GEF+IEY+GEVIDD
Sbjct: 177 SCSSGCKCGTSCLNKPFQSRPVKKMKMVETEKCGSGIVADEDIKQGEFVIEYVGEVIDDK 236
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE RLW MK+ G NFY+CEI +D IDAT+KGN SR++NHSCDPN ++KW+++GETR
Sbjct: 237 TCEDRLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCDPNTEMQKWRIDGETR 296
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
+G+FA R IK GE LTYDY+FVQFG + C+CGA C+ LG K
Sbjct: 297 IGIFATRDIKRGEHLTYDYQFVQFGADQDCHCGAVGCRRKLGVK 340
>gi|359489946|ref|XP_002268035.2| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Vitis
vinifera]
Length = 377
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 97/164 (59%), Positives = 123/164 (75%), Gaps = 2/164 (1%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C +C N+PF R KK+K+V+TE CG G+ A E I +GEF+IEY+GEVIDD
Sbjct: 100 SCSSGCKCGTSCLNKPFQSRPVKKMKMVETEKCGSGIVADEDIKQGEFVIEYVGEVIDDK 159
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE RLW MK+ G NFY+CEI +D IDAT+KGN SR++NHSCDPN ++KW+++GETR
Sbjct: 160 TCEDRLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCDPNTEMQKWRIDGETR 219
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
+G+FA R IK GE LTYDY+FVQFG + C+CGA C+ LG K
Sbjct: 220 IGIFATRDIKRGEHLTYDYQFVQFGADQDCHCGAVGCRRKLGVK 263
>gi|218201888|gb|EEC84315.1| hypothetical protein OsI_30811 [Oryza sativa Indica Group]
Length = 360
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 100/175 (57%), Positives = 125/175 (71%), Gaps = 2/175 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+PF R +K K++KTE CG GV A E I KGEF+IEY+GEVIDD
Sbjct: 87 CSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDIKKGEFVIEYVGEVIDDRT 146
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CEQRLW MK +G NFY+CE+ + IDAT KGN SRF+NHSC+PN ++KW VEGETRV
Sbjct: 147 CEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRV 206
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGS 428
G+FA R IK GE LTYDY+FVQFG + C+CG+S+C+ LG + + + L G+
Sbjct: 207 GIFALRDIKTGEELTYDYKFVQFGADQDCHCGSSNCRKMLGITKPVNSIVLHNGN 261
>gi|222641285|gb|EEE69417.1| hypothetical protein OsJ_28789 [Oryza sativa Japonica Group]
Length = 360
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 100/175 (57%), Positives = 125/175 (71%), Gaps = 2/175 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+PF R +K K++KTE CG GV A E I KGEF+IEY+GEVIDD
Sbjct: 87 CSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDIKKGEFVIEYVGEVIDDRT 146
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CEQRLW MK +G NFY+CE+ + IDAT KGN SRF+NHSC+PN ++KW VEGETRV
Sbjct: 147 CEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRV 206
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGS 428
G+FA R IK GE LTYDY+FVQFG + C+CG+S+C+ LG + + + L G+
Sbjct: 207 GIFALRDIKTGEELTYDYKFVQFGADQDCHCGSSNCRKMLGITKPVNSIVLHNGN 261
>gi|115478464|ref|NP_001062827.1| Os09g0307800 [Oryza sativa Japonica Group]
gi|51091678|dbj|BAD36461.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
gi|51091893|dbj|BAD36704.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
gi|113631060|dbj|BAF24741.1| Os09g0307800 [Oryza sativa Japonica Group]
Length = 340
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 100/175 (57%), Positives = 125/175 (71%), Gaps = 2/175 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+PF R +K K++KTE CG GV A E I KGEF+IEY+GEVIDD
Sbjct: 87 CSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDIKKGEFVIEYVGEVIDDRT 146
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CEQRLW MK +G NFY+CE+ + IDAT KGN SRF+NHSC+PN ++KW VEGETRV
Sbjct: 147 CEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRV 206
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGS 428
G+FA R IK GE LTYDY+FVQFG + C+CG+S+C+ LG + + + L G+
Sbjct: 207 GIFALRDIKTGEELTYDYKFVQFGADQDCHCGSSNCRKMLGITKPVNSIVLHNGN 261
>gi|302825340|ref|XP_002994293.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
gi|300137824|gb|EFJ04637.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
Length = 820
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 93/167 (55%), Positives = 123/167 (73%), Gaps = 2/167 (1%)
Query: 253 YISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
Y SCS +C C C N PF+K +++++ +TE CGWG+ A E I +G+F+IEYIGEVID
Sbjct: 152 YTSCSSSCACEGKCENLPFQKRDGRRLRLKETENCGWGLFADENIERGDFLIEYIGEVID 211
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
D CE+RLWD+K RG NFY+CE+ D IDATFKGN SRF+NHSC+PN L KWQ +GE
Sbjct: 212 DKTCEERLWDLKERGENNFYLCEVGHDKVIDATFKGNMSRFINHSCNPNAQLRKWQCDGE 271
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKR 417
R+GVFA I G+ +TYDY+++QFG E +C+CG+ +C+G LG R
Sbjct: 272 LRIGVFAVSRILKGQEITYDYKYIQFGTEQQCHCGSKNCKGILGGSR 318
>gi|302795285|ref|XP_002979406.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
gi|300153174|gb|EFJ19814.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
Length = 274
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 94/167 (56%), Positives = 123/167 (73%), Gaps = 2/167 (1%)
Query: 253 YISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
Y SCS +C C C N PF+K +++++ +TE CGWG+ A E I +G+F+IEYIGEVID
Sbjct: 61 YTSCSSSCACEGKCENLPFQKRDGRRLRLKETENCGWGLFADENIERGDFLIEYIGEVID 120
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
D CE+RLWD+K RG NFY+CE+ D IDATFKGN SRF+NHSCDPN L KWQ +GE
Sbjct: 121 DKTCEERLWDLKERGENNFYLCEVGHDKVIDATFKGNMSRFINHSCDPNAQLRKWQCDGE 180
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKR 417
R+GVFA I G+ +TYDY+++QFG E +C+CG+ +C+G LG R
Sbjct: 181 LRIGVFAVSRILKGQEITYDYKYIQFGTEQQCHCGSKNCKGILGGSR 227
>gi|356559949|ref|XP_003548258.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
max]
Length = 349
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 101/189 (53%), Positives = 132/189 (69%), Gaps = 12/189 (6%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C +C N+PF R KK+K+VKTE CG G+ A E I GEF+IEY+GEVIDD
Sbjct: 93 SCSSGCKCGSSCLNKPFQNRPVKKMKLVKTEKCGSGIVADEDIKLGEFVIEYVGEVIDDK 152
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+RLW+MK+ G NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR
Sbjct: 153 TCEERLWNMKHSGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 212
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKR 432
+G+FA R I+ GE LTYDY+FVQFG + C+CGA+ C+ LG + +K K
Sbjct: 213 IGIFATRDIQKGEHLTYDYQFVQFGADQDCHCGAAECRRKLGVR----------PTKSKL 262
Query: 433 SSTACLAII 441
SS A L ++
Sbjct: 263 SSDAALKLV 271
>gi|449505027|ref|XP_004162355.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
sativus]
Length = 373
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 102/190 (53%), Positives = 131/190 (68%), Gaps = 12/190 (6%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C C N+PF R KK+K+VKTE CG G+ A E I +GEF+IEY+GEVIDD
Sbjct: 93 SCSSGCKCGVLCLNKPFQHRPVKKMKLVKTEKCGSGIVADEDIKQGEFVIEYVGEVIDDK 152
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+RLW+MK+RG NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR
Sbjct: 153 TCEERLWNMKHRGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 212
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKR 432
+G+FA R I GE LTYDY+FVQFG + C+CGA C+ LG + +K K
Sbjct: 213 IGIFATRDIPKGEHLTYDYQFVQFGADQDCHCGAVDCRRKLGVR----------PTKPKT 262
Query: 433 SSTACLAIIT 442
SS A L ++
Sbjct: 263 SSDAALKLVA 272
>gi|449442399|ref|XP_004138969.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
sativus]
Length = 373
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 102/190 (53%), Positives = 131/190 (68%), Gaps = 12/190 (6%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C C N+PF R KK+K+VKTE CG G+ A E I +GEF+IEY+GEVIDD
Sbjct: 93 SCSSGCKCGVLCLNKPFQHRPVKKMKLVKTEKCGSGIVADEDIKQGEFVIEYVGEVIDDK 152
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+RLW+MK+RG NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR
Sbjct: 153 TCEERLWNMKHRGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 212
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKR 432
+G+FA R I GE LTYDY+FVQFG + C+CGA C+ LG + +K K
Sbjct: 213 IGIFATRDIPKGEHLTYDYQFVQFGADQDCHCGAVDCRRKLGVR----------PTKPKT 262
Query: 433 SSTACLAIIT 442
SS A L ++
Sbjct: 263 SSDAALKLVA 272
>gi|356530969|ref|XP_003534051.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
max]
Length = 349
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 101/189 (53%), Positives = 132/189 (69%), Gaps = 12/189 (6%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C +C N+PF R KK+K+VKTE CG G+ A E I GEF+IEY+GEVIDD
Sbjct: 93 SCSSGCKCGSSCLNKPFQNRPVKKMKLVKTEKCGSGIVADEDIKLGEFVIEYVGEVIDDK 152
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+RLW+MK+RG NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR
Sbjct: 153 TCEERLWNMKHRGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 212
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKR 432
+G+FA I+ GE LTYDY+FVQFG + C+CGA+ C+ LG + +K K
Sbjct: 213 IGIFATSDIQKGEHLTYDYQFVQFGADQDCHCGAAECRRKLGVR----------PTKPKL 262
Query: 433 SSTACLAII 441
SS A L ++
Sbjct: 263 SSDATLKLV 271
>gi|357498513|ref|XP_003619545.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
gi|355494560|gb|AES75763.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
Length = 348
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 97/164 (59%), Positives = 121/164 (73%), Gaps = 2/164 (1%)
Query: 255 SCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
SCS C C +C N+ F R KK+K+VKTE CG G+ A E I GEF+IEY+GEVIDD
Sbjct: 89 SCSSGCKCGISCLNKAFQHRPVKKMKLVKTEKCGSGIVADEDIKLGEFVIEYVGEVIDDK 148
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CEQRLW+MK RG NFY+CEI +D IDAT KGN SR++NHSC PN ++KW ++GETR
Sbjct: 149 TCEQRLWNMKDRGETNFYLCEINRDMVIDATNKGNKSRYINHSCCPNTEMQKWIIDGETR 208
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
+G+FA+R IK GE LTYDY+FVQFG + C+CGA C+ LG +
Sbjct: 209 IGIFASRDIKKGEHLTYDYQFVQFGADQDCHCGAVQCRRKLGAR 252
>gi|357116306|ref|XP_003559923.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
[Brachypodium distachyon]
Length = 349
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 101/199 (50%), Positives = 131/199 (65%), Gaps = 13/199 (6%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+ F R KK K++KTE CG+G+ A E I +GEFIIEY+GEVIDD
Sbjct: 94 CSSNCKCENRCANKSFQLRPMKKTKLIKTEKCGFGLVADEGIQQGEFIIEYVGEVIDDRT 153
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE+RLW MK + NFY+CE+ + IDAT KGN SRF+NHSC PN ++KW V+GETRV
Sbjct: 154 CEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFINHSCQPNTEMQKWTVDGETRV 213
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGS----- 428
G+FA IK GE LTYDY+FVQFG + C+CG+S+C+ +GT + + + L G+
Sbjct: 214 GIFALHDIKKGEELTYDYKFVQFGADQDCHCGSSNCRKMVGTSKSVNSIVLHNGNSGSSQ 273
Query: 429 ------KRKRSSTACLAII 441
KRK +S C+ I
Sbjct: 274 DQHIVKKRKTTSDNCIGEI 292
>gi|226493201|ref|NP_001149253.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
gi|194704072|gb|ACF86120.1| unknown [Zea mays]
gi|195625808|gb|ACG34734.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
gi|238014446|gb|ACR38258.1| unknown [Zea mays]
gi|414589294|tpg|DAA39865.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 1 [Zea mays]
gi|414589295|tpg|DAA39866.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 2 [Zea mays]
Length = 339
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 98/182 (53%), Positives = 126/182 (69%), Gaps = 2/182 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C +TC N+ F R KK K++KTE CG G+ A + I KGEF+IEY+GEVIDD
Sbjct: 95 CSSQCECDDTCTNKSFQHRPLKKTKLIKTEKCGHGLVAEDEIKKGEFVIEYVGEVIDDRT 154
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE RLW MK +FY+CE+ + IDAT KGN SRF+NHSC+PN ++KW V+GETRV
Sbjct: 155 CENRLWTMKRLLDTDFYLCEVSSNMVIDATNKGNRSRFINHSCEPNTAMQKWTVDGETRV 214
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKRS 433
G+FA R IK GE LTYDY+FVQFG C+CG+S+C+ LGT + G + K+KR
Sbjct: 215 GIFALRDIKIGEELTYDYKFVQFGAAQVCHCGSSNCRKMLGTTKYSGSSQNHHAKKKKRK 274
Query: 434 ST 435
++
Sbjct: 275 TS 276
>gi|168031756|ref|XP_001768386.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680311|gb|EDQ66748.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 615
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 96/172 (55%), Positives = 124/172 (72%), Gaps = 3/172 (1%)
Query: 253 YISCSKACHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+++CS C C ++C N PF+K K+K VKTE CGWG+ A E I G F++EY+GEVID
Sbjct: 300 FMTCSSNCGCGDSCTNLPFQKLPGSKMKAVKTERCGWGLVADEDIKAGSFLVEYVGEVID 359
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
D CE+RLW MK +G NFYMCEI ++ IDATFKGN SRF+NHSC PN L+KW ++GE
Sbjct: 360 DQTCEERLWAMKKQGEMNFYMCEISREMVIDATFKGNLSRFINHSCQPNSELQKWDIDGE 419
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGK 421
TR+GVFA IK G+ +TYDY+F+QFG + + C+CGA C+G LG K K
Sbjct: 420 TRIGVFAITDIKRGDFVTYDYQFIQFGSKNQHCHCGAPECRGQLGKPMKQQK 471
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 75/173 (43%), Gaps = 19/173 (10%)
Query: 110 AKKMIECRACHRF-IYHGEEVFCSVRGCGGVYHFICVKERLGI----SNPRNFKCPQHAC 164
A+ + C C F + V C+ R CG YH C K + I ++ CP+H C
Sbjct: 75 AENKVRCFKCKAFGSLEVDLVKCAHRNCGKYYHKDCSKGWVRIPPKKTDNGAMVCPRHHC 134
Query: 165 FICRQ-----RLQWRCVRCTIASHDKCAPWPDRVIHLKDQPGRAVCWRHPAKW----LLD 215
CR+ +L RC+ C +A H+ C+P + L+D PG +CW+H +W LD
Sbjct: 135 DACRKCQKNAKLH-RCLYCPVAYHESCSPEGTNL--LEDIPGYLLCWKHDDEWKHEYKLD 191
Query: 216 KQEVFCRLPLPYADEEFKIDLTWK--DLMENKVGPPPVQYISCSKACHCSETC 266
+ + C L K W+ + ME P +CSK C C C
Sbjct: 192 LKGISCSLNFKLDIYLVKKSKRWQQDEKMECLCKPTVGSSETCSKDCLCGVCC 244
>gi|357157974|ref|XP_003577976.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
[Brachypodium distachyon]
Length = 338
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 102/199 (51%), Positives = 131/199 (65%), Gaps = 13/199 (6%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+ F R KK K++KTE CG+G+ A + I KGEFIIEY+GEVIDD
Sbjct: 83 CSSNCKCENRCANKSFQLRPLKKTKLIKTEKCGFGLVADDGIQKGEFIIEYVGEVIDDRT 142
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE+RLW MK + NFY+CE+ + IDAT KGN SRF+NHSC PN ++KW V+GETRV
Sbjct: 143 CEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFINHSCQPNTEMQKWTVDGETRV 202
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGS----- 428
G+FA R IK GE LTYDY+FVQFG + C+CG+S C+ +GT + + + L G+
Sbjct: 203 GIFALRDIKKGEELTYDYKFVQFGADQDCHCGSSKCRKMVGTSKSVNSIILHNGNSGSSQ 262
Query: 429 ------KRKRSSTACLAII 441
KRK +S C+ I
Sbjct: 263 DQHIVKKRKITSDNCIGEI 281
>gi|242048842|ref|XP_002462165.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
gi|241925542|gb|EER98686.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
Length = 341
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 122/182 (67%), Gaps = 2/182 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+ F R KK K++KTE CG G+ A + I KGEF+IEY+GEVIDD
Sbjct: 96 CSSQCECDNACTNKSFQHRPLKKTKLIKTEKCGHGLVAEDEIKKGEFVIEYVGEVIDDRA 155
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE RLW MK +FY+CE+ + IDAT KGN SRF+NHSC+PN ++KW V+GETRV
Sbjct: 156 CENRLWTMKRLNDTDFYLCEVSSNMVIDATNKGNLSRFINHSCEPNTKMQKWTVDGETRV 215
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKRS 433
G+FA R IK GE LTYDY+FVQFG C+CG+S C+ LGT + G + K+KR
Sbjct: 216 GIFALRDIKIGEELTYDYKFVQFGAAQVCHCGSSKCRKMLGTAKYSGSSQNHHAKKKKRK 275
Query: 434 ST 435
+
Sbjct: 276 TN 277
>gi|326496078|dbj|BAJ90660.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 99/199 (49%), Positives = 131/199 (65%), Gaps = 13/199 (6%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+ F R K K++KTE CG+G+ A + I KGEF+IEY+GEVIDD
Sbjct: 107 CSSNCKCGNNCANKSFQLRPLFKTKLIKTEKCGFGLIAEDEIKKGEFVIEYVGEVIDDRT 166
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE+RLW MK + NFY+CE+ + IDAT KGN SRF+NHSC+PN ++KW V+GETRV
Sbjct: 167 CEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFINHSCEPNTEMQKWTVDGETRV 226
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGS----- 428
G+FA R I+ GE LTYDY+FVQFG + C+CG+S+C+ +GT + + L G+
Sbjct: 227 GIFALRDIERGEELTYDYKFVQFGADQDCHCGSSNCRKMVGTSKSVNSFILHNGNSANSQ 286
Query: 429 ------KRKRSSTACLAII 441
KRK +S C+ I
Sbjct: 287 DQHDMKKRKTTSDNCIGEI 305
>gi|18406465|ref|NP_566010.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
gi|94707125|sp|Q945S8.2|ASHH3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH3; AltName:
Full=ASH1 homolog 3; AltName: Full=Protein SET DOMAIN
GROUP 7
gi|15028059|gb|AAK76560.1| unknown protein [Arabidopsis thaliana]
gi|20197070|gb|AAC23419.2| expressed protein [Arabidopsis thaliana]
gi|20259301|gb|AAM14386.1| unknown protein [Arabidopsis thaliana]
gi|330255289|gb|AEC10383.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
Length = 363
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/176 (55%), Positives = 121/176 (68%), Gaps = 15/176 (8%)
Query: 256 CSKACHCS-------------ETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEF 300
C CHC CNN+PF R KK+K+++TE CG G+ A E I GEF
Sbjct: 82 CGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKLIQTEKCGSGIVAEEEIEAGEF 141
Query: 301 IIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNC 360
IIEY+GEVIDD CE+RLW MK+RG NFY+CEI +D IDAT KGN SR++NHSC+PN
Sbjct: 142 IIEYVGEVIDDKTCEERLWKMKHRGETNFYLCEITRDMVIDATHKGNKSRYINHSCNPNT 201
Query: 361 ILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
++KW ++GETR+G+FA R IK GE LTYDY+FVQFG + C+CGA C+ LG K
Sbjct: 202 QMQKWIIDGETRIGIFATRGIKKGEHLTYDYQFVQFGADQDCHCGAVGCRRKLGVK 257
>gi|297824409|ref|XP_002880087.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297325926|gb|EFH56346.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 363
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 97/176 (55%), Positives = 121/176 (68%), Gaps = 15/176 (8%)
Query: 256 CSKACHCS-------------ETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEF 300
C CHC CNN+PF R KK+K+++TE CG G+ A E I GEF
Sbjct: 82 CGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKLIQTEKCGSGIVAEEEIKPGEF 141
Query: 301 IIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNC 360
IIEY+GEVIDD CE+RLW MK+RG NFY+CEI +D IDAT KGN SR++NHSC+PN
Sbjct: 142 IIEYVGEVIDDKTCEERLWKMKHRGETNFYLCEITRDMVIDATHKGNKSRYINHSCNPNT 201
Query: 361 ILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
++KW ++GETR+G+FA R IK GE LTYDY+FVQFG + C+CGA C+ LG K
Sbjct: 202 QMQKWIIDGETRIGIFATRGIKKGEHLTYDYQFVQFGADQDCHCGAVGCRRKLGVK 257
>gi|414589296|tpg|DAA39867.1| TPA: putative histone-lysine N-methyltransferase family protein
[Zea mays]
Length = 343
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 99/186 (53%), Positives = 126/186 (67%), Gaps = 6/186 (3%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C +TC N+ F R KK K++KTE CG G+ A + I KGEF+IEY+GEVIDD
Sbjct: 95 CSSQCECDDTCTNKSFQHRPLKKTKLIKTEKCGHGLVAEDEIKKGEFVIEYVGEVIDDRT 154
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE RLW MK +FY+CE+ + IDAT KGN SRF+NHSC+PN ++KW V+GETRV
Sbjct: 155 CENRLWTMKRLLDTDFYLCEVSSNMVIDATNKGNRSRFINHSCEPNTAMQKWTVDGETRV 214
Query: 374 GVFAARSIKAGEPLTYD----YRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELCWGSK 429
G+FA R IK GE LTYD YRFVQFG C+CG+S+C+ LGT + G + K
Sbjct: 215 GIFALRDIKIGEELTYDYNIMYRFVQFGAAQVCHCGSSNCRKMLGTTKYSGSSQNHHAKK 274
Query: 430 RKRSST 435
+KR ++
Sbjct: 275 KKRKTS 280
>gi|162460550|ref|NP_001105653.1| LOC542662 [Zea mays]
gi|24021802|gb|AAN41254.1| SET domain protein 110 [Zea mays]
gi|195652527|gb|ACG45731.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
Length = 342
Score = 197 bits (501), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 93/167 (55%), Positives = 115/167 (68%), Gaps = 2/167 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+ F R K K++KTE CG G+ A + I KGEF+IEY+GEVIDD
Sbjct: 98 CSSQCECDIACTNKSFQHRPLTKTKLIKTEKCGHGLVAEDEIKKGEFVIEYVGEVIDDRT 157
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE RLW MK +FY+CE+ + IDAT KGN SRF+NHSC+PN ++KW V+GETRV
Sbjct: 158 CENRLWTMKRLDDTDFYLCEVSSNMVIDATNKGNLSRFINHSCEPNTAMQKWTVDGETRV 217
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIG 420
G+FA R IK GE LTYDY+FVQFG C+CG+S C+ LGT + G
Sbjct: 218 GIFALRDIKIGEELTYDYKFVQFGAAQVCHCGSSKCRKMLGTTKYSG 264
>gi|414884958|tpg|DAA60972.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 1 [Zea mays]
gi|414884959|tpg|DAA60973.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 2 [Zea mays]
Length = 337
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 93/167 (55%), Positives = 115/167 (68%), Gaps = 2/167 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C C C N+ F R K K++KTE CG G+ A + I KGEF+IEY+GEVIDD
Sbjct: 98 CSSQCECDIACTNKSFQHRPLTKTKLIKTEKCGHGLVAEDEIKKGEFVIEYVGEVIDDRT 157
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE RLW MK +FY+CE+ + IDAT KGN SRF+NHSC+PN ++KW V+GETRV
Sbjct: 158 CENRLWTMKRLDDTDFYLCEVSSNMVIDATNKGNLSRFINHSCEPNTAMQKWTVDGETRV 217
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIG 420
G+FA R IK GE LTYDY+FVQFG C+CG+S C+ LGT + G
Sbjct: 218 GIFALRDIKIGEELTYDYKFVQFGAAQVCHCGSSKCRKMLGTTKYSG 264
>gi|222640644|gb|EEE68776.1| hypothetical protein OsJ_27487 [Oryza sativa Japonica Group]
Length = 405
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 91/191 (47%), Positives = 127/191 (66%), Gaps = 14/191 (7%)
Query: 217 QEVFCRLPLPYADEEFKIDLTWKDLMENKVGPPPVQ---------YISCSKACHCSETCN 267
E F RLPLP+ +EF ID K+ ++N PPP + +++ +
Sbjct: 205 NEAFRRLPLPHTFQEFNIDPIKKEELDNGTEPPPYKIKIIPTGLYWLAALGLVTAQMSAL 264
Query: 268 NRPFRKEKKIK-----IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMK 322
F K+++ +++T++CGWG A E I K +F+IE++GEVIDD CE+RL DM+
Sbjct: 265 TNHFADRKRLRSLSGILLQTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDMR 324
Query: 323 YRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIK 382
RG +NFYMC+++KDF IDATFKGN RF NHSC+PNC L+KWQV G+TR+GVFA+++I+
Sbjct: 325 RRGDKNFYMCKVKKDFVIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIE 384
Query: 383 AGEPLTYDYRF 393
GEPLTYDY F
Sbjct: 385 VGEPLTYDYSF 395
>gi|218201221|gb|EEC83648.1| hypothetical protein OsI_29393 [Oryza sativa Indica Group]
Length = 451
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 90/190 (47%), Positives = 127/190 (66%), Gaps = 14/190 (7%)
Query: 218 EVFCRLPLPYADEEFKIDLTWKDLMENKVGPPPVQ---------YISCSKACHCSETCNN 268
E F RLPLP+ +EF ID K+ ++N PPP + +++ +
Sbjct: 252 EAFRRLPLPHTFQEFNIDPIKKEELDNGTEPPPYKIKIIPTGLYWLAALGLVTAQMSALT 311
Query: 269 RPFRKEKKIK-----IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKY 323
F K+++ +++T++CGWG A E I K +F+IE++GEVIDD CE+RL DM+
Sbjct: 312 NHFADRKRLRSLSGILLQTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDMRR 371
Query: 324 RGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKA 383
RG +NFYMC+++KDF IDATFKGN RF NHSC+PNC L+KWQV G+TR+GVFA+++I+
Sbjct: 372 RGDKNFYMCKVKKDFVIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIEV 431
Query: 384 GEPLTYDYRF 393
GEPLT+DY F
Sbjct: 432 GEPLTFDYSF 441
>gi|15232214|ref|NP_191555.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
thaliana]
gi|75264575|sp|Q9M1X9.1|ASHH4_ARATH RecName: Full=Putative histone-lysine N-methyltransferase ASHH4;
AltName: Full=ASH1 homolog 4; AltName: Full=Protein SET
DOMAIN GROUP 24
gi|7019690|emb|CAB75815.1| putative protein [Arabidopsis thaliana]
gi|332646470|gb|AEE79991.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
thaliana]
Length = 352
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 114/151 (75%), Gaps = 2/151 (1%)
Query: 268 NRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N+PF++ KK+K+V+TE CG+G+ A E IN GEFIIEY+GEVIDD +CE+RLW + ++
Sbjct: 102 NKPFQQRHIKKMKLVQTEKCGYGIVADEDINSGEFIIEYVGEVIDDKICEERLWKLNHKV 161
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
NFY+C+I + IDAT KGN SR++NHSC PN ++KW ++GETR+G+FA R I GE
Sbjct: 162 ETNFYLCQINWNMVIDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGE 221
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
LTYDY+FVQFG + CYCGA C+ LG K
Sbjct: 222 QLTYDYQFVQFGADQDCYCGAVCCRKKLGAK 252
>gi|297817294|ref|XP_002876530.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297322368|gb|EFH52789.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 354
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 85/152 (55%), Positives = 116/152 (76%), Gaps = 3/152 (1%)
Query: 268 NRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV-IDDALCEQRLWDMKYR 324
N+PF++ KK+K+V+TE CG+G+ A E IN GEFIIEY+GEV ID+ +CE+RLW + ++
Sbjct: 103 NKPFQQRHIKKMKLVQTEKCGYGIVADEDINSGEFIIEYVGEVVIDEKICEERLWKLNHK 162
Query: 325 GVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAG 384
+NFY+C+I + IDAT KGN SR++NHSC+PN ++KW ++GETR+G+FA R I G
Sbjct: 163 VEKNFYLCQINWNMVIDATHKGNKSRYINHSCNPNTEMQKWIIDGETRIGIFATRFINKG 222
Query: 385 EPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK 416
E LTYDY+FVQFG + CYCGA C+ LG K
Sbjct: 223 EQLTYDYQFVQFGADQDCYCGAVCCRKKLGAK 254
>gi|348675982|gb|EGZ15800.1| hypothetical protein PHYSODRAFT_263017 [Phytophthora sojae]
Length = 823
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 80/171 (46%), Positives = 110/171 (64%), Gaps = 7/171 (4%)
Query: 253 YISCS-KACHCSE--TCNNRPFRKEKKIKIVKTEFC---GWGVEAAEPINKGEFIIEYIG 306
+ISCS + C + C+NR K +++K V+ E+ G+G+ A E IN GEFIIEY+G
Sbjct: 208 FISCSDETCSAPDLSMCSNRAI-KRRELKSVRVEYIPGPGFGLVANEKINAGEFIIEYVG 266
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EVIDD CE+R+ + G NFYM E+ K+ IDA ++ N SRF+NH CDPN + +KW
Sbjct: 267 EVIDDIECERRMIQYRDNGEVNFYMMELEKNIVIDAKYRSNDSRFINHCCDPNSVTQKWN 326
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKR 417
V+G R+G+FA R+I E +T DY F FG C CG+++C G LG KR
Sbjct: 327 VDGMQRIGIFARRNIAPDEEITIDYNFSHFGEAADCKCGSTACTGKLGVKR 377
>gi|301122693|ref|XP_002909073.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
gi|262099835|gb|EEY57887.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
Length = 751
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 110/172 (63%), Gaps = 8/172 (4%)
Query: 253 YISCS-KACHCSE--TCNNRPFRKEKKIKIVKTEFC----GWGVEAAEPINKGEFIIEYI 305
+ISCS + C + C+NR K +++K ++ E+ G+G+ E IN GEF+IEY+
Sbjct: 174 FISCSDETCSAPDPSVCSNRAI-KRRQLKSMRVEYIPGGPGFGLITNEDINAGEFVIEYV 232
Query: 306 GEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
GEVIDD CE+R+ + G NFYM E+ K+ IDA ++ N SRF+NHSCDPN + +KW
Sbjct: 233 GEVIDDKECERRMITYRDNGEVNFYMMELEKNIVIDAKYRSNDSRFINHSCDPNSVTQKW 292
Query: 366 QVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKR 417
V+G R+G+FA R+I E +T DY F FG C CG+++C G +G KR
Sbjct: 293 NVDGMQRIGIFARRNIAPNEEITIDYNFSHFGEAADCRCGSTACTGKMGLKR 344
>gi|255078696|ref|XP_002502928.1| set domain protein [Micromonas sp. RCC299]
gi|226518194|gb|ACO64186.1| set domain protein [Micromonas sp. RCC299]
Length = 377
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/164 (48%), Positives = 101/164 (61%), Gaps = 8/164 (4%)
Query: 258 KACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C C +C+NRPF K K + TE GWG+ AEP+ G FI+EY+GE++D+ E
Sbjct: 66 RVCPCGPSCSNRPFHQLKSPKTDTLLTENRGWGLFLAEPVKAGTFIVEYVGEILDEHTTE 125
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ--VEGETRV 373
+RLW+ K RG NFY+ E+ + IDA +KGN SRF+N SC PNC +KWQ GETRV
Sbjct: 126 KRLWEDKKRGEDNFYLMEVMPNQCIDARYKGNLSRFINSSCHPNCETQKWQDSATGETRV 185
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPE----VKCYCGASSCQGYL 413
G+FA + I G LTYDY F FG E C CG C+G L
Sbjct: 186 GIFAIQDIPEGTELTYDYNFAHFGGEGTTSFSCMCGHPLCRGTL 229
>gi|291227185|ref|XP_002733567.1| PREDICTED: HSPC069-like [Saccoglossus kowalevskii]
Length = 2376
Score = 159 bits (401), Expect = 4e-36, Method: Composition-based stats.
Identities = 75/167 (44%), Positives = 107/167 (64%), Gaps = 3/167 (1%)
Query: 254 ISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C+ C C + C NR F R+ K++I KT + G+G+ I +G+F++EY+GEV++
Sbjct: 1142 IECTSRCPCGDYCTNRSFQRRENAKVEIFKTPWKGFGLRTCAEIPEGKFVLEYVGEVLNY 1201
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ + R +++Y + D IDAT KGN SRF+NHSCDPNC +KW V G
Sbjct: 1202 SEFKSRTKHYNKDNRKHYYFMALTSDEIIDATKKGNVSRFINHSCDPNCETQKWTVNGHI 1261
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKR 417
RVG F R+I AGE LT+DY+F ++G E KCYCGAS+C+G+LG +
Sbjct: 1262 RVGFFTKRAIPAGEELTFDYQFERYGKEAQKCYCGASNCRGFLGGNK 1308
>gi|427779581|gb|JAA55242.1| Putative histone-lysine n-methyltransferase setd2 [Rhipicephalus
pulchellus]
Length = 2038
Score = 158 bits (400), Expect = 4e-36, Method: Composition-based stats.
Identities = 75/169 (44%), Positives = 107/169 (63%), Gaps = 3/169 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVK--TEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C+NR F+K+ IK+ K TE GWG+ E ++ G F++EY+GEV+
Sbjct: 846 IECGSRCPNGENCSNRRFQKKSYIKVEKFMTEKKGWGLRTLETVSSGTFVMEYVGEVLTP 905
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ Q++Y +R D IDAT KGN SRF+NHSCDPNC +KW V GE
Sbjct: 906 EDFRKRVKQYARDNNQHYYFMALRADEIIDATQKGNVSRFINHSCDPNCETQKWTVNGEL 965
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKI 419
R+G F R ++AGE LT+DY+F ++G E +C+C +S+C+GY+G K+
Sbjct: 966 RIGFFTRRPLRAGEELTFDYQFQRYGKEAQRCHCESSNCRGYIGEDTKM 1014
>gi|284434727|gb|ADB85424.1| putative retrotransposon protein [Phyllostachys edulis]
Length = 1946
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 72/119 (60%), Positives = 90/119 (75%), Gaps = 1/119 (0%)
Query: 279 IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDF 338
+ TE CG+G+ A + I KGEF+IEY+GEVIDD CE+RLW MK + NFY+CE+ +
Sbjct: 106 LTLTEKCGFGLVAEDEIKKGEFVIEYVGEVIDDRTCEERLWKMKRQRDTNFYLCEVSSNM 165
Query: 339 TIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR-FVQF 396
IDAT KGN SRF+NHSC+PN ++KW V+GETRVG+FA R IK GE LTYDY+ VQF
Sbjct: 166 VIDATNKGNMSRFINHSCEPNTEMQKWTVDGETRVGIFALRDIKEGEELTYDYKEAVQF 224
>gi|328715143|ref|XP_001944096.2| PREDICTED: hypothetical protein LOC100167890 [Acyrthosiphon pisum]
Length = 2332
Score = 158 bits (399), Expect = 7e-36, Method: Composition-based stats.
Identities = 83/193 (43%), Positives = 114/193 (59%), Gaps = 11/193 (5%)
Query: 237 TWKDLMENKVGPPP-----VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGV 289
T KD +E K G + I C KAC C N+ F+ + K+ KI+KT G+GV
Sbjct: 851 TRKDWIEKKPGCATECLNRLLNIECGKACVLKSLCTNKQFQNKQFKRTKIIKTADKGYGV 910
Query: 290 EAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFS 349
A E I G F+ EY+GEVID R+ + Y+ N YM +++ D IDAT KGN +
Sbjct: 911 FALEDIPSGTFVDEYMGEVIDQCEMIIRMKKILYK--NNNYMVQLKHDEIIDATRKGNIT 968
Query: 350 RFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFG--PEVKCYCGAS 407
RF+NHSC+PNC+ EKW V GE+R+G F+ I+ GE +T+DY F FG + KCYCG
Sbjct: 969 RFINHSCEPNCVAEKWNVLGESRMGFFSKELIRKGEEITFDYSFEIFGDAAQQKCYCGTP 1028
Query: 408 SCQGYLGTKRKIG 420
C+G++ K + G
Sbjct: 1029 KCRGFISKKSRTG 1041
>gi|241753587|ref|XP_002401135.1| huntingtin interacting protein, putative [Ixodes scapularis]
gi|215508354|gb|EEC17808.1| huntingtin interacting protein, putative [Ixodes scapularis]
Length = 1594
Score = 157 bits (398), Expect = 8e-36, Method: Composition-based stats.
Identities = 75/169 (44%), Positives = 106/169 (62%), Gaps = 3/169 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVK--TEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C ++C+NR F+K+ IK+ K TE GWG+ E + G F++EY+GEV+
Sbjct: 595 IECGSRCPNGDSCSNRRFQKKSYIKVEKFLTEKKGWGLRTVETLASGAFVMEYVGEVLTP 654
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ Q++Y +R D IDAT KGN SRF+NHSCDPNC +KW V GE
Sbjct: 655 EDFRKRVKQYARDNHQHYYFMALRSDEIIDATQKGNVSRFINHSCDPNCETQKWTVNGEL 714
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKI 419
R+G F R ++AGE LT+DY+F ++G E KCYC +S C+G++G K+
Sbjct: 715 RIGFFTRRPLRAGEELTFDYQFQRYGKEAQKCYCESSKCRGFIGEDNKM 763
>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Xenopus
(Silurana) tropicalis]
Length = 1298
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 80/164 (48%), Positives = 103/164 (62%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C + C N+ F K + + KI+KTE GWG+ A I KGEF+ EYIGE+ID+ C
Sbjct: 988 QVCPAGDRCQNQCFNKRQYPETKIIKTEGKGWGLIATRDIKKGEFVNEYIGELIDEEECM 1047
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + +FYM I KD IDA KGNFSRF+NHSC PNC +KW V G+TRVG+
Sbjct: 1048 YRIRHAQENDITHFYMLTIDKDRIIDAGPKGNFSRFMNHSCQPNCETQKWSVNGDTRVGL 1107
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA R I AGE LT++Y G E C CGA +C G+LG + K
Sbjct: 1108 FAVRDIPAGEELTFNYNLDCLGNEKTICRCGAPNCSGFLGDRPK 1151
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 37/86 (43%), Gaps = 16/86 (18%)
Query: 109 GAKKMIECRACHRFIYHGEEVFCSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFIC 167
GA+ C+ +R + C V CG YH C+++ L + R F+CP H C C
Sbjct: 657 GARSCFLCKESNRDVKR-----CIVPHCGKFYHESCLRKYPLAVFESRGFRCPLHRCATC 711
Query: 168 ----------RQRLQWRCVRCTIASH 183
+ RCVRC +A H
Sbjct: 712 YFSNPSNPRASKGKMVRCVRCPLAYH 737
>gi|189237403|ref|XP_973596.2| PREDICTED: similar to AGAP011688-PA [Tribolium castaneum]
gi|270007628|gb|EFA04076.1| hypothetical protein TcasGA2_TC014310 [Tribolium castaneum]
Length = 1569
Score = 157 bits (396), Expect = 1e-35, Method: Composition-based stats.
Identities = 76/164 (46%), Positives = 104/164 (63%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+K + +++ KTE G G+ AA I GEFI+EY+GEV+D
Sbjct: 550 IECGGLCPVGDRCTNKKFQKSQFAPVEVFKTEKKGLGLRAAANIPYGEFILEYVGEVLDP 609
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R D +++Y +R D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 610 EEFDNRADDYSNDKNKHYYFMSLRADAIIDATMKGNISRFINHSCDPNAETQKWTVNGEL 669
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ R+I AGE +T+DYRF ++G E KCYC +S C+G+LG
Sbjct: 670 RIGFFSTRTILAGEEITFDYRFQRYGKEAQKCYCESSLCRGWLG 713
>gi|145353759|ref|XP_001421172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|145357147|ref|XP_001422783.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581408|gb|ABO99465.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144583027|gb|ABP01142.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 503
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/169 (45%), Positives = 102/169 (60%), Gaps = 9/169 (5%)
Query: 254 ISC-SKACHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
SC S+ C C + C+NRP + K KI++TE GWG+ EP+ G FI+EY GE++D
Sbjct: 257 FSCDSRTCPCGDACSNRPLSQLPAPKTKIIRTENRGWGLTLQEPVRAGTFIVEYAGEILD 316
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW--QVE 368
+ C +RLW K G +NFY+ EI ++ IDA FKG+ +RF+N SC PNC ++W
Sbjct: 317 EHECAERLWYDKQSGEENFYLMEISANYVIDAKFKGSIARFINSSCHPNCETQRWVDAST 376
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE----VKCYCGASSCQGYL 413
ETRVG+FA I +G LTYDY F FG E C CG C+G L
Sbjct: 377 NETRVGIFATEDIASGTELTYDYNFAHFGDEKGTSFVCMCGHPKCRGTL 425
>gi|413923002|gb|AFW62934.1| putative SET-domain containing protein family, partial [Zea mays]
Length = 248
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 70/148 (47%), Positives = 95/148 (64%), Gaps = 3/148 (2%)
Query: 71 RCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSLPFLVGAKKMIECRACHRFIYHGEEVF 130
R + + + LE+ V+ W KK GVP +C LPFL GA +ECR C + IY GEE+
Sbjct: 103 RTQASASAPTLEERVSEWAAKKAAAGVPTHHCVLPFLTGAPMAVECRLCSKIIYVGEEIK 162
Query: 131 CSVRGCGGVYHFICVKERLGISNPRNFKCPQHACFICRQRL-QWRCVRCTIASHDKCAPW 189
CSV+ C ++H CV E +F+CPQH C +C+Q++ WRC RCT+A+H KCAPW
Sbjct: 163 CSVKKCPQMFHLKCVAEDTSNFTVESFRCPQHGCMVCKQKMFFWRCGRCTVAAHTKCAPW 222
Query: 190 PDRVIHLKDQPGRAVCWRHPAKWLLDKQ 217
P V+HLK+ G A+CWRHP+ WLL +
Sbjct: 223 P--VVHLKNDQGSAICWRHPSDWLLQNK 248
>gi|147765700|emb|CAN77905.1| hypothetical protein VITISV_024391 [Vitis vinifera]
Length = 290
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 92/139 (66%), Gaps = 17/139 (12%)
Query: 278 KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKD 337
K++ TE CG G+ A E I +GEF+IEY+GEVIDD CE RLW MK+ G NFY+CEI +D
Sbjct: 74 KVMLTEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEDRLWKMKHLGETNFYLCEINRD 133
Query: 338 FTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFG 397
IDAT+KGN +++GETR+G+FA R IK GE LTYDY+FVQFG
Sbjct: 134 MVIDATYKGN-----------------KRIDGETRIGIFATRDIKRGEHLTYDYQFVQFG 176
Query: 398 PEVKCYCGASSCQGYLGTK 416
+ C+CGA C+ LG K
Sbjct: 177 ADQDCHCGAVGCRRKLGVK 195
>gi|198418893|ref|XP_002124393.1| PREDICTED: similar to SET domain containing 2 [Ciona intestinalis]
Length = 2228
Score = 154 bits (389), Expect = 8e-35, Method: Composition-based stats.
Identities = 72/164 (43%), Positives = 104/164 (63%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKI--KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C E C N+ F++++ + ++ +T++ GWG+ A E ++ G ++EY GEV+D
Sbjct: 1040 IECSARCPLGEQCQNKRFQRKQYVPTEVFQTKWKGWGIRATENLSPGMLVMEYCGEVLDL 1099
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R Q+FY + +D IDAT KGN SRF+NHSCDPNC +KW V G
Sbjct: 1100 QEFGRRSLLYSRGNQQHFYFMALSQDEIIDATTKGNTSRFINHSCDPNCETQKWTVNGRL 1159
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG 414
RVG F R I GE +T+DY+F ++G E + CYCG+S+C+GYLG
Sbjct: 1160 RVGFFTMRDINKGEEITFDYQFQRYGKEAQACYCGSSNCRGYLG 1203
>gi|196001997|ref|XP_002110866.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
gi|190586817|gb|EDV26870.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
Length = 1004
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 104/165 (63%), Gaps = 3/165 (1%)
Query: 253 YISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+I C C C + C NR F+ + K+++ KTE G+G+ E + +F++EY GEVID
Sbjct: 99 FIECGSKCSCGKFCTNRRFQMAEYPKVEVFKTEKKGFGLRTLEDLEDNQFVLEYCGEVID 158
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E+R D + ++++Y + + IDA+ KG FSRF+NHSCDPNC+ +KW V G
Sbjct: 159 LREFERRKRDYAKKKIKHYYFMTLSPNEIIDASRKGTFSRFINHSCDPNCVTQKWTVNGM 218
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG 414
R+G F R I A LT+DY+F ++G EV+ CYCG+ C+GYLG
Sbjct: 219 LRIGFFTLRKIPANTELTFDYQFERYGREVQECYCGSEKCRGYLG 263
>gi|198432159|ref|XP_002123225.1| PREDICTED: similar to Wolf-Hirschhorn syndrome candidate 1 protein,
partial [Ciona intestinalis]
Length = 752
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 105/171 (61%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ Y S C E C N+ F+K + ++ KT + GW + A + I KGEF+ EY+GE+
Sbjct: 255 LMYECHSDLCPAGEKCQNQRFQKREYPSSEVFKTSWGGWALRAKDLIKKGEFVSEYVGEL 314
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+D C +R+ D V NFYM I KD IDA KGN+SRF+NHSCDPNC +KW V
Sbjct: 315 VDSEECMRRIEDAHKNNVTNFYMLTIDKDRIIDAGPKGNYSRFMNHSCDPNCETQKWMVN 374
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
G+TRVG+FA R I+ GE L ++Y G + C CG+++C G++G + K
Sbjct: 375 GDTRVGLFALREIQDGEELMFNYNLDCLGNDKTPCMCGSANCSGFIGVRPK 425
>gi|312378119|gb|EFR24776.1| hypothetical protein AND_10404 [Anopheles darlingi]
Length = 2632
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 107/166 (64%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKI--KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C NR F++++ ++ +TE G+G++A+ PI GEFI+EY+GEV++
Sbjct: 1438 IECGSRCTVGERCTNRRFQRQEYAHCQVFRTEKKGFGIQASAPIAPGEFIMEYVGEVLNG 1497
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ +QR +++Y +R D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 1498 SQFDQRAEAYSRDKNKHYYFMALRSDGIIDATTKGNISRFINHSCDPNAETQKWTVNGEL 1557
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
R+G F+ + I GE +T+DY+F ++G + KC+C A +C+G++G K
Sbjct: 1558 RIGFFSTKYILPGEEITFDYQFQRYGRKAQKCFCEAENCRGWIGAK 1603
>gi|340378403|ref|XP_003387717.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like
[Amphimedon queenslandica]
Length = 862
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 105/167 (62%), Gaps = 3/167 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C E C N+ F + ++++KTE GWG++A I++ F++EY GEV
Sbjct: 100 IECGSRCPCGEYCTNKRFTRSSYANVEVIKTEMKGWGLKATCDISRYSFVMEYCGEVCSL 159
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R + +++Y ++ D +DAT KGN SRF+NHSC+PNC +KW V G
Sbjct: 160 EEFERRRNIYEKESRRHYYFMSLKTDEILDATRKGNLSRFINHSCEPNCETQKWTVNGRL 219
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKR 417
RVG FA R I AGE LT+DY+F +FG V KCYCG+ +C+G+LG K+
Sbjct: 220 RVGFFALRHIPAGEELTFDYQFQRFGESVQKCYCGSETCRGFLGAKQ 266
>gi|47225482|emb|CAG11965.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1625
Score = 152 bits (384), Expect = 3e-34, Method: Composition-based stats.
Identities = 71/191 (37%), Positives = 111/191 (58%), Gaps = 3/191 (1%)
Query: 254 ISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C C+NR F R+ + ++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 269 IECSSRCQNGAYCSNRRFQMRQHAEFDVILTENKGWGLRAAKDLPSNTFVLEYCGEVLDH 328
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ + IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 329 KEFKTRVKEYARNKNIHYYFMSLKNNEIIDATLKGNLSRFMNHSCEPNCETQKWTVNGQL 388
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F +++ AG LT+DY+F ++G E KC+CG +C+G+LG + ++ K+
Sbjct: 389 RVGFFTTKAVTAGTELTFDYQFQRYGKEAQKCFCGTPNCRGFLGGENRVSVRAAGGKMKK 448
Query: 431 KRSSTACLAII 441
RS + L +
Sbjct: 449 DRSRKSALTTV 459
>gi|320166309|gb|EFW43208.1| ASHH4 [Capsaspora owczarzaki ATCC 30864]
Length = 453
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/169 (43%), Positives = 109/169 (64%), Gaps = 6/169 (3%)
Query: 251 VQYISCSK-ACHCSETCNNRPFRKE---KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIG 306
+ ++ C + +C + C NR F++ K +K+ +T G+G+ +PI G+F++EY+G
Sbjct: 191 LMFVECVRGSCRMEDRCQNRNFQRHNWTKNLKVFQTPNAGYGLRCTDPIAPGQFVMEYVG 250
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ DA E+R+W Y G N Y E+ K IDA KG +RF+NHSCDPNC +EKW
Sbjct: 251 EVVSDAERERRMWG-PYAGNPNHYFLELEKGVLIDACSKGCDARFINHSCDPNCHVEKWN 309
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLG 414
V GE RVG+FA+R+I E L+YDYRF G + +C+CGA++C+ +G
Sbjct: 310 VNGEFRVGIFASRAIAPNEELSYDYRFETLGEIQQQCWCGAANCRKVIG 358
>gi|158301050|ref|XP_001238385.2| AGAP011688-PA [Anopheles gambiae str. PEST]
gi|157013454|gb|EAU75883.2| AGAP011688-PA [Anopheles gambiae str. PEST]
Length = 2404
Score = 152 bits (383), Expect = 5e-34, Method: Composition-based stats.
Identities = 70/166 (42%), Positives = 106/166 (63%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C NR F++++ ++ +TE G+G++A+ I GEFI+EY+GEV++
Sbjct: 1257 IECGSRCTVGDRCTNRRFQRQEYAHCQVFRTEKKGFGIQASSAIAPGEFIMEYVGEVLNS 1316
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A ++R +++Y +R D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 1317 AQFDERAEAYSREKNKHYYFMALRSDGIIDATTKGNISRFINHSCDPNAETQKWTVNGEL 1376
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
R+G F+ + I GE +T+DY+F ++G + KCYC A SC+G++G K
Sbjct: 1377 RIGFFSTKYILPGEEITFDYQFQRYGRKAQKCYCEAESCRGWIGAK 1422
>gi|308810955|ref|XP_003082786.1| Histone H3 (Lys4) methyltransferase complex, subunit SET1 and
related methyltransferases (ISS) [Ostreococcus tauri]
gi|116061255|emb|CAL56643.1| Histone H3 (Lys4) methyltransferase complex, subunit SET1 and
related methyltransferases (ISS) [Ostreococcus tauri]
Length = 501
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 73/170 (42%), Positives = 99/170 (58%), Gaps = 8/170 (4%)
Query: 257 SKACHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
S+ C C + C+NRP + K KI++TE GWG+ EP+ G FI+EY GE++++A
Sbjct: 256 SRTCPCGDACSNRPLSQLPTPKTKIIRTENRGWGLMLQEPVTAGTFIVEYTGEILNEADV 315
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW--QVEGETR 372
RLW K G +NFY+ EI ++ IDA FKG+ +RF+N C PNC ++W ETR
Sbjct: 316 ANRLWLDKQEGEENFYLMEISTNYVIDAKFKGSLARFINSGCHPNCETQRWVDASTNETR 375
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFG----PEVKCYCGASSCQGYLGTKRK 418
VG+FA I AG LTYDY F FG C CG C+G L ++
Sbjct: 376 VGIFAIEDIPAGTELTYDYHFAHFGDASATSFVCMCGHPKCRGTLDAAKR 425
>gi|321468162|gb|EFX79148.1| hypothetical protein DAPPUDRAFT_319776 [Daphnia pulex]
Length = 1408
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 70/164 (42%), Positives = 107/164 (65%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+K + KI++ TE G G+ A + ++ G+FIIEY+GEVID
Sbjct: 444 IECGPRCQLAARCTNKRFQKRQYGKIEVFNTEKKGVGLRALQDMDPGDFIIEYVGEVIDP 503
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R D +++Y ++ D IDAT +GN SRF+NHSCDPN +KW V G+
Sbjct: 504 REFHRRAKDYAREKNKHYYFMALKSDAIIDATQQGNVSRFINHSCDPNAETQKWTVNGDL 563
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
RVG FA +S+K+G+ +T+DY+F ++G E +CYC +S+C+G++G
Sbjct: 564 RVGFFARKSLKSGDEVTFDYQFQRYGKEAQRCYCESSNCRGWIG 607
>gi|384253874|gb|EIE27348.1| hypothetical protein COCSUDRAFT_55364 [Coccomyxa subellipsoidea
C-169]
Length = 1591
Score = 151 bits (382), Expect = 6e-34, Method: Composition-based stats.
Identities = 81/191 (42%), Positives = 112/191 (58%), Gaps = 15/191 (7%)
Query: 251 VQYISCS-KACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ YI C K C C + C+NRPF + K ++ TE GWGV+A E I +G FI+EY GE
Sbjct: 490 LSYIHCDPKQCPCGDYCSNRPFHLLPQPKTELFLTEDRGWGVKATEHIPRGTFIVEYAGE 549
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ- 366
VI++ C +R+ K G+Q+FYM E+ IDA KGN +RF+N SC PNC +KW
Sbjct: 550 VIEEHECRRRMAQAKVTGLQHFYMMELAPGLIIDARVKGNMARFINSSCAPNCESQKWHD 609
Query: 367 -VEGETRVGVFAARSIKAGEPLTYDYRF-----VQFGPEVKCYCGASSCQGYLGTK---- 416
GE R+G+FAA I+ G L YDY+F Q +C CGA +C+G + T+
Sbjct: 610 AATGEIRIGIFAADDIEPGTELAYDYQFQHAGLAQDAGAYRCMCGAPNCRGTMDTQPERF 669
Query: 417 RKIGK-LELCW 426
+ GK +E+ W
Sbjct: 670 KDFGKRVEVFW 680
>gi|410905477|ref|XP_003966218.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Takifugu
rubripes]
Length = 1950
Score = 151 bits (382), Expect = 6e-34, Method: Composition-based stats.
Identities = 72/191 (37%), Positives = 110/191 (57%), Gaps = 3/191 (1%)
Query: 254 ISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C C+NR F R+ ++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 899 IECSSRCQNGAYCSNRRFQMRQHADFDVILTEDKGWGLRAAKDLPSNTFVLEYCGEVLDH 958
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ + IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 959 KEFKTRVKEYARNKNIHYYFMALKNNEIIDATLKGNLSRFMNHSCEPNCETQKWTVNGQL 1018
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F +++ AG LT+DY+F ++G E KC+CG SC+G+LG + ++ K+
Sbjct: 1019 RVGFFTTKAVTAGTELTFDYQFQRYGKEAQKCFCGTLSCRGFLGGENRVSVRAAGGKMKK 1078
Query: 431 KRSSTACLAII 441
RS + L +
Sbjct: 1079 DRSRKSALTTV 1089
>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
Length = 1048
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 108/171 (63%), Gaps = 5/171 (2%)
Query: 251 VQYISC-SKACHCSETCNNRPFRKEKKIKIV--KTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ YI C +K C+ + C N+ F++ + K++ KTE GWG+ + + +G+ +IEY+GE
Sbjct: 737 MMYIECDAKCCNNGKNCRNQRFQRREYPKLIPFKTEHRGWGLRLGQDVEEGDLVIEYVGE 796
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID A C +R+ + R +FY+ + D +DA K N +RF+NHSCDPNC+ +KW V
Sbjct: 797 VIDGAECRRRIDQYEERNTSSFYILSLGSDTFVDAREKANMARFINHSCDPNCVTQKWNV 856
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRF--VQFGPEVKCYCGASSCQGYLGTK 416
GETRVG+FA R++ G LT+DY + + C+CGA +C G++G K
Sbjct: 857 LGETRVGIFAKRALAKGTELTFDYMLDCLNSVKKTPCHCGAPNCSGFIGVK 907
>gi|328725322|ref|XP_001947915.2| PREDICTED: hypothetical protein LOC100165877 isoform 3
[Acyrthosiphon pisum]
Length = 1568
Score = 151 bits (381), Expect = 8e-34, Method: Composition-based stats.
Identities = 74/171 (43%), Positives = 106/171 (61%), Gaps = 7/171 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID- 310
+ C C C N+ F+ + KK I+KT+ G+G+ A E I KG I EY+GEVID
Sbjct: 483 VECGLGCSLKRYCTNKQFQNKQFKKTNIIKTDNKGYGICAVEDIPKGALISEYVGEVIDY 542
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +C RL +Y+ + YM ++ D ID+T KGN +RF+NHSCDPN + EKW V G+
Sbjct: 543 NEMC-NRLTKKEYKNLN--YMVQLNPDEIIDSTSKGNVTRFINHSCDPNSVGEKWHVLGQ 599
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIG 420
+R+G F+ R I+ GE +T+DY F FG + CYCG+S C+GY+ +I
Sbjct: 600 SRIGFFSTRHIEKGEEITFDYSFQIFGDGAQICYCGSSKCRGYINKSSQIA 650
>gi|328725324|ref|XP_003248429.1| PREDICTED: hypothetical protein LOC100165877 isoform 2
[Acyrthosiphon pisum]
Length = 1384
Score = 150 bits (380), Expect = 1e-33, Method: Composition-based stats.
Identities = 74/171 (43%), Positives = 106/171 (61%), Gaps = 7/171 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID- 310
+ C C C N+ F+ + KK I+KT+ G+G+ A E I KG I EY+GEVID
Sbjct: 483 VECGLGCSLKRYCTNKQFQNKQFKKTNIIKTDNKGYGICAVEDIPKGALISEYVGEVIDY 542
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +C RL +Y+ + YM ++ D ID+T KGN +RF+NHSCDPN + EKW V G+
Sbjct: 543 NEMC-NRLTKKEYKNLN--YMVQLNPDEIIDSTSKGNVTRFINHSCDPNSVGEKWHVLGQ 599
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIG 420
+R+G F+ R I+ GE +T+DY F FG + CYCG+S C+GY+ +I
Sbjct: 600 SRIGFFSTRHIEKGEEITFDYSFQIFGDGAQICYCGSSKCRGYINKSSQIA 650
>gi|440801495|gb|ELR22513.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 981
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 106/172 (61%), Gaps = 4/172 (2%)
Query: 251 VQYISCS-KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V I C K C C C+NR F RK KI T GWG+ A E I KG FIIEY+GE
Sbjct: 381 VLNIECKLKHCPCGTNCSNRQFQLRKYAKIDRFLTGKKGWGLRAREKIPKGTFIIEYVGE 440
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VI +C+ R+ + G++++Y + IDA+ KGN +RF+NHSC+PN KW V
Sbjct: 441 VISTDMCQDRMKYYEEMGLEHYYFLTLDGSECIDASQKGNLARFINHSCNPNAKTHKWTV 500
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
+ E RVG+FA I G+ +T+DY+F +F G + KC+CG ++C+G+LG K K
Sbjct: 501 DKEIRVGIFAEEDIPVGQEITFDYQFERFGGKKQKCFCGETNCRGFLGAKPK 552
>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
Length = 1093
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/178 (43%), Positives = 109/178 (61%), Gaps = 4/178 (2%)
Query: 251 VQYISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ + CS C + C N+ F R+ ++ TE GWG+ A E I G+F+IEY+GE
Sbjct: 863 ILMVECSPGICPAEDKCMNQSFVLRQYPAMEPFHTEGRGWGLRALEDIAAGKFVIEYVGE 922
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
+IDDA ++RL K +NFY I + TIDA KGN SRF+NHSC PNC +KW V
Sbjct: 923 IIDDAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPKGNLSRFMNHSCAPNCETQKWTV 982
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLEL 424
G+TR+G+FA R I++GE LT++Y G K C CGAS+C G++G K + +L +
Sbjct: 983 NGDTRIGLFALRDIESGEELTFNYNLASDGETRKACLCGASNCSGFIGLKAQKQQLSI 1040
>gi|449492020|ref|XP_004174653.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD2 [Taeniopygia guttata]
Length = 2489
Score = 150 bits (379), Expect = 1e-33, Method: Composition-based stats.
Identities = 72/183 (39%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEKK--IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F+K++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1511 IECSSRCPNGDYCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1570
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1571 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1630
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+S+C+GYLG + ++ K+
Sbjct: 1631 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSSNCRGYLGGENRVSIRAAGGKMKK 1690
Query: 431 KRS 433
+RS
Sbjct: 1691 ERS 1693
>gi|355729169|gb|AES09787.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Mustela putorius furo]
Length = 596
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 102/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +++KTE GWG+ I KGEF+ EY+GE+
Sbjct: 279 LQYECHPQVCPAGERCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGEL 338
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 339 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 398
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA R I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 399 GDVRVGLFALRDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPK 449
>gi|196013861|ref|XP_002116791.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
gi|190580769|gb|EDV20850.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
Length = 725
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 74/165 (44%), Positives = 104/165 (63%), Gaps = 3/165 (1%)
Query: 257 SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
S C + C N+ F+K + K +I K+ CGWG+++AE I G ++EY+GE++++ C
Sbjct: 423 SSMCPAGDKCQNQRFQKLEYAKSEIFKSNQCGWGLKSAEDIYAGTLVVEYVGELLNEKTC 482
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
QR+ + +G +NFYM I KD IDA KGN +RF+NHSC PNC KW V G T +G
Sbjct: 483 YQRIKMAQSKGEKNFYMLNIDKDVIIDAGQKGNLARFMNHSCQPNCETHKWTVNGLTCIG 542
Query: 375 VFAARSIKAGEPLTYDYRFVQFG-PEVKCYCGASSCQGYLGTKRK 418
+FA IK GE LT+DYR G + +C+CG+ C+ YLG K K
Sbjct: 543 LFAIDDIKQGEELTFDYRLHAVGNDQAECHCGSKLCRKYLGAKIK 587
>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Takifugu rubripes]
Length = 1329
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 79/171 (46%), Positives = 100/171 (58%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F R KI+KT GWG+ I KGEF+ EYIGE+
Sbjct: 997 LQYECHPQVCPSGERCGNQDFTQRLYPDTKIIKTPGKGWGLITLRDIKKGEFVNEYIGEL 1056
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ + V NFYM I KD IDA KGN+SRF+NHSC PNC +KW V
Sbjct: 1057 IDEEECRARIKYAQENNVTNFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVN 1116
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
G+TRVG+FA + AG LT++Y G E C CGA +C G+LG + K
Sbjct: 1117 GDTRVGLFAICDVPAGTELTFNYNLDCLGNEKTACCCGAPNCSGFLGDRPK 1167
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 31/65 (47%), Gaps = 13/65 (20%)
Query: 131 CSVRGCGGVYHFICVK-ERLGISNPRNFKCPQHACFIC-----------RQRLQWRCVRC 178
C V CG YH C++ L + + + F+CP H C C + RL RC+RC
Sbjct: 691 CHVPHCGKFYHEACIRLNPLTVFDNKGFRCPLHTCLGCCCSSRTKHKPSKGRL-MRCLRC 749
Query: 179 TIASH 183
+A H
Sbjct: 750 PVAYH 754
>gi|327289513|ref|XP_003229469.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Anolis
carolinensis]
Length = 2579
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 71/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C E C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1545 IECSSRCPNGEHCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1604
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1605 KEFKTRVKEYARSKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1664
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1665 RVGFFTTKMVPSGSELTFDYQFQRYGKEAQKCFCGSTNCRGYLGGENRVSIRAAGGKMKK 1724
Query: 431 KRS 433
+RS
Sbjct: 1725 ERS 1727
>gi|270014006|gb|EFA10454.1| hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
Length = 1740
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 73/163 (44%), Positives = 104/163 (63%), Gaps = 3/163 (1%)
Query: 259 ACHCSETCNNRPFRKEKKIKIV--KTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C + CNN+ F K + +V +T + GWG++ PI KG+F+IEY+GE+ID+ ++
Sbjct: 1342 VCPAGDRCNNQCFEKREYPPLVPHRTLYRGWGLKTLAPIRKGQFVIEYVGEMIDEQEYQR 1401
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ M + +N+Y I KD +DA KGN +RF+NHSCDPNC +KW V G+TRVG+F
Sbjct: 1402 RVQKMHEQKEENYYFLTIDKDRMLDAGPKGNVARFMNHSCDPNCETQKWTVNGDTRVGLF 1461
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRK 418
A I AG LT++Y G E K C+CGA +C G++G K K
Sbjct: 1462 ANCDIPAGTELTFNYNLECIGKEKKICHCGAPNCSGFIGVKVK 1504
>gi|91090902|ref|XP_973711.1| PREDICTED: similar to NSD1 [Tribolium castaneum]
Length = 1795
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 73/163 (44%), Positives = 104/163 (63%), Gaps = 3/163 (1%)
Query: 259 ACHCSETCNNRPFRKEKKIKIV--KTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C + CNN+ F K + +V +T + GWG++ PI KG+F+IEY+GE+ID+ ++
Sbjct: 1397 VCPAGDRCNNQCFEKREYPPLVPHRTLYRGWGLKTLAPIRKGQFVIEYVGEMIDEQEYQR 1456
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ M + +N+Y I KD +DA KGN +RF+NHSCDPNC +KW V G+TRVG+F
Sbjct: 1457 RVQKMHEQKEENYYFLTIDKDRMLDAGPKGNVARFMNHSCDPNCETQKWTVNGDTRVGLF 1516
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRK 418
A I AG LT++Y G E K C+CGA +C G++G K K
Sbjct: 1517 ANCDIPAGTELTFNYNLECIGKEKKICHCGAPNCSGFIGVKVK 1559
>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Callithrix jacchus]
Length = 1400
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/165 (47%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GEVID+ C
Sbjct: 1079 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECM 1138
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1139 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1198
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1199 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1243
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 728 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 787
Query: 180 IASH 183
+A H
Sbjct: 788 VAYH 791
>gi|241998002|ref|XP_002433644.1| set domain protein, putative [Ixodes scapularis]
gi|215495403|gb|EEC05044.1| set domain protein, putative [Ixodes scapularis]
Length = 729
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 100/158 (63%), Gaps = 3/158 (1%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C E C N+ F R+ ++ +++ E GWG+ + + G+F++EY+GE+I++ CE+R
Sbjct: 555 CPAGEHCKNQHFLRREYAQVTVIRAEGRGWGLRTDQALTAGDFVMEYVGEIINEQECERR 614
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
L + NFY + +D IDA +GN SRF+NHSCDPNC +KW V G+TRVG+FA
Sbjct: 615 LSRLHLEHSSNFYFLTLDRDRIIDAGPRGNLSRFMNHSCDPNCETQKWTVNGDTRVGIFA 674
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLG 414
R I G LT++Y G E +KC CGAS+C GY+G
Sbjct: 675 IRDIAPGTELTFNYNLDCRGNERIKCACGASNCSGYMG 712
>gi|156391978|ref|XP_001635826.1| predicted protein [Nematostella vectensis]
gi|156222924|gb|EDO43763.1| predicted protein [Nematostella vectensis]
Length = 348
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/189 (41%), Positives = 112/189 (59%), Gaps = 12/189 (6%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKI--VKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C+ C C + C NR F++ KIK+ KTE GWGV+ E + + +F+IEY GEV++
Sbjct: 54 IECNHRCPCGDLCTNRRFQEGCKIKVEVFKTEKKGWGVKTLEDLEQNQFVIEYCGEVMNY 113
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R + +++Y +R D IDAT KG+ SRF+NHSC+PNC+ +KW V G
Sbjct: 114 RDFQSRAQRYDRQKRRHYYFMTLRADEIIDATLKGSISRFINHSCEPNCVTQKWTVNGLL 173
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG---------TKRKIGK 421
R+G F R+IKAGE LT+DY+ ++G + CYC + SC+G +G T KI +
Sbjct: 174 RIGFFTLRTIKAGEELTFDYQLQRYGKIAQTCYCESPSCRGIIGGEKHTPLKTTVEKIDE 233
Query: 422 LELCWGSKR 430
LE G R
Sbjct: 234 LERLIGDHR 242
>gi|432114829|gb|ELK36567.1| Putative histone-lysine N-methyltransferase NSD2 [Myotis davidii]
Length = 1037
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 76/168 (45%), Positives = 101/168 (60%), Gaps = 3/168 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KIV+T+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 658 QVCPAGEACQNQSFSKRQYPETKIVRTDGKGWGLVATRDIRKGEFVNEYVGELIDEEECM 717
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 718 ARIKHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 777
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIGKL 422
FA I AG LT++Y G E C CGA +C G+LG + K +
Sbjct: 778 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGAPNCSGFLGDRPKTSTV 825
>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Ovis aries]
Length = 1273
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 101/165 (61%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E+C N+ F K + + KIV+T+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 990 QVCPAGESCQNQCFTKRQYPETKIVRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1049
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1050 ARIKRAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1109
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1110 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1154
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV+ L + R F+CP H+C C + RCVRC
Sbjct: 674 CVVSQCGKFYHEACVRRFPLTVFESRGFRCPLHSCLSCHASNPSNPRPSKGKMLRCVRCP 733
Query: 180 IASH 183
+A H
Sbjct: 734 VAYH 737
>gi|334333796|ref|XP_001375978.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Monodelphis
domestica]
Length = 2592
Score = 148 bits (374), Expect = 4e-33, Method: Composition-based stats.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEKK--IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1550 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1609
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1610 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1669
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1670 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1729
Query: 431 KRS 433
+RS
Sbjct: 1730 ERS 1732
>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
Length = 1461
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 100/166 (60%), Gaps = 3/166 (1%)
Query: 258 KACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + KI++T GWG+ + I KGEF+ EY+GE+ID+ C
Sbjct: 1135 QVCPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECR 1194
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + +FYM I KD IDA KGN+SRF+NHSC PNC +KW V G+TRVG+
Sbjct: 1195 SRIRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGL 1254
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIG 420
FA I AG LT++Y G E C CGA +C G+LG + K G
Sbjct: 1255 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGAPNCSGFLGDRPKNG 1300
Score = 38.1 bits (87), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 35/89 (39%), Gaps = 19/89 (21%)
Query: 114 IECRACHRFIY--------HGEEVFCSVRGCGGVYHFICVK-ERLGISNPRNFKCPQHAC 164
I C AC ++ GE C CG YH CV+ L + R F+CP H C
Sbjct: 797 ILCTACSTGVHVCFTCKKSEGEVRRCCALHCGRFYHEACVRLSALTVFENRGFRCPLHTC 856
Query: 165 FICR----------QRLQWRCVRCTIASH 183
C + RC+RC +A H
Sbjct: 857 LSCHYSGRAACKATKGKMMRCLRCPVAYH 885
>gi|354548180|emb|CCE44916.1| hypothetical protein CPAR2_407180 [Candida parapsilosis]
Length = 810
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 110/166 (66%), Gaps = 3/166 (1%)
Query: 256 CSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
++ C C + C N+ F+K + + + +TE G+G++A +PI++G+FI EYIGEVID+
Sbjct: 88 TNRHCLCGDDCQNQRFQKRQYADVSVFQTELKGYGLKANKPISEGQFIYEYIGEVIDEGA 147
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
QR+ + + ++FY ++ D IDAT KG+ +RF+NHSC+PN ++KW V + R+
Sbjct: 148 FRQRMIEYDMKNYKHFYFMMLKPDAFIDATEKGSLARFVNHSCNPNAFVDKWVVGDKLRM 207
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRK 418
G+FA R+I GE +T+DY ++G + + CYCG +C ++G K++
Sbjct: 208 GIFAKRNIAKGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGKKQ 253
>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Callithrix jacchus]
Length = 1365
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/165 (47%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GEVID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
latipes]
Length = 1597
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 71/169 (42%), Positives = 103/169 (60%), Gaps = 3/169 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K + ++VKTE GWG++ + + KG+F+ EY+GEV
Sbjct: 1272 LQYECHPQVCPAGDNCENQNFTKRLYAETEVVKTEGRGWGLQTNQALRKGDFVAEYVGEV 1331
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID C+QR+ V NFYM + KD IDA KGN +RF+NHSC+PNC +KW V
Sbjct: 1332 IDSEECQQRIKRAHENHVTNFYMLTLTKDRVIDAGPKGNSARFINHSCNPNCETQKWTVN 1391
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK 416
G+ R+G+FA I+AG LT++Y G C+CG+ +C G+LG +
Sbjct: 1392 GDVRIGIFALCDIEAGTELTFNYNLHCVGNRRTSCHCGSENCSGFLGVQ 1440
>gi|417406466|gb|JAA49891.1| Putative histone-lysine n-methyltransferase nsd3-like isoform 3
[Desmodus rotundus]
Length = 1438
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 108/191 (56%), Gaps = 7/191 (3%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIGKLELCWG 427
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K C
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPKSA----CVS 1295
Query: 428 SKRKRSSTACL 438
+ +R+ A L
Sbjct: 1296 ATEERAKNAKL 1306
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Saimiri
boliviensis boliviensis]
Length = 1368
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/165 (47%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GEVID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|224080887|ref|XP_002197925.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Taeniopygia
guttata]
Length = 1435
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 76/173 (43%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1117 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1176
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1177 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1236
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1237 GDIRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPKTA 1289
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQW---------RCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C RC+RC +
Sbjct: 762 CSVSTCGKFYHEACVRKFATALFESRGFRCPQHCCTACSMDKDMHKASKGRMARCLRCPV 821
Query: 181 ASH 183
A H
Sbjct: 822 AYH 824
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/180 (42%), Positives = 106/180 (58%), Gaps = 7/180 (3%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + ++I +T CGWG+ A I KG F+ EY+GEVID+ C
Sbjct: 1846 QVCAAGERCQNQAFTKRQYTTVEIYRTLSCGWGLRAVSDIKKGAFVSEYVGEVIDEEECR 1905
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN +RF+NHSC PNC +KW V G+TRVG+
Sbjct: 1906 ARIRHAQEHDICNFYMLTLDKDRVIDAGPKGNQARFMNHSCQPNCETQKWTVNGDTRVGL 1965
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK----IGKLELCWGSKR 430
FA + I GE LT++Y G + C CGA +C G+LG + K K++L G +R
Sbjct: 1966 FALQDIAKGEELTFNYNLECRGNGKTVCKCGAPNCSGFLGVRPKNQPPAEKVKLKEGKRR 2025
>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
Length = 1461
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 100/166 (60%), Gaps = 3/166 (1%)
Query: 258 KACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + KI++T GWG+ + I KGEF+ EY+GE+ID+ C
Sbjct: 1135 QVCPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECR 1194
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + +FYM I KD IDA KGN+SRF+NHSC PNC +KW V G+TRVG+
Sbjct: 1195 SRIRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGL 1254
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIG 420
FA I AG LT++Y G E C CGA +C G+LG + K G
Sbjct: 1255 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGAPNCSGFLGDRPKNG 1300
Score = 38.1 bits (87), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 35/89 (39%), Gaps = 19/89 (21%)
Query: 114 IECRACHRFIY--------HGEEVFCSVRGCGGVYHFICVK-ERLGISNPRNFKCPQHAC 164
I C AC ++ GE C CG YH CV+ L + R F+CP H C
Sbjct: 797 ILCTACSTGVHVCFTCKKSEGEVRRCCALHCGRFYHEACVRLSALTVFENRGFRCPLHTC 856
Query: 165 FICR----------QRLQWRCVRCTIASH 183
C + RC+RC +A H
Sbjct: 857 LSCHYSGRAACKATKGKMMRCLRCPVAYH 885
>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Bos taurus]
gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
taurus]
gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
Length = 1365
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 101/165 (61%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E+C N+ F K + + KIV+T+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGESCQNQCFTKRQYPETKIVRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKRAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVSQCGKFYHEACVRRFPLTVFESRGFRCPLHSCLSCHASNPSNPRPSKGKMLRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|291409090|ref|XP_002720827.1| PREDICTED: WHSC1L1 protein [Oryctolagus cuniculus]
Length = 1435
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1117 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1176
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1177 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1236
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1237 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPK 1287
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 762 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 821
Query: 181 ASH 183
A H
Sbjct: 822 AYH 824
>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Otolemur garnettii]
Length = 1438
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1290
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|350593412|ref|XP_003483678.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
NSD3-like [Sus scrofa]
Length = 1438
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPK 1290
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|149634094|ref|XP_001506476.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Ornithorhynchus anatinus]
Length = 1437
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/173 (43%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPKTA 1291
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 15/85 (17%)
Query: 114 IECRACHRFIYH----GEEV-FCSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC 167
+EC H + G++V CSV CG YH CV+ I R F+CPQH C C
Sbjct: 742 VECETGHHTCFSCKVPGKDVKRCSVSACGKFYHEACVRIFPTAIFEARGFRCPQHCCSAC 801
Query: 168 RQRLQ---------WRCVRCTIASH 183
RC++C IA H
Sbjct: 802 SMEKDIHKASKGRMMRCLKCPIAYH 826
>gi|348534024|ref|XP_003454503.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oreochromis
niloticus]
Length = 2253
Score = 148 bits (374), Expect = 5e-33, Method: Composition-based stats.
Identities = 72/191 (37%), Positives = 112/191 (58%), Gaps = 3/191 (1%)
Query: 254 ISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C C+NR F ++ +++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1032 IECSSRCLNGAYCSNRRFQMKQHADFEVILTEDKGWGLRAAKDLAPNTFVLEYCGEVLDH 1091
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ + IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1092 KEFKTRVKEYARNKNIHYYFMSLKNNEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1151
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F +++ AG LT+DY+F ++G E KC+CGA SC+G+LG + ++ K+
Sbjct: 1152 RVGFFTTKAVTAGTELTFDYQFQRYGKEAQKCFCGAPSCRGFLGGENRVSVRAAGGKMKK 1211
Query: 431 KRSSTACLAII 441
RS + L +
Sbjct: 1212 DRSRKSALTTV 1222
>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Otolemur garnettii]
Length = 1389
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1071 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1130
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1131 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1190
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1191 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1241
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|363729887|ref|XP_418510.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gallus gallus]
Length = 2554
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 71/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F+K++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1525 IECSSRCPNGDYCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1584
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1585 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1644
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1645 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1704
Query: 431 KRS 433
+RS
Sbjct: 1705 ERS 1707
>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Nasonia
vitripennis]
Length = 1382
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 81/187 (43%), Positives = 109/187 (58%), Gaps = 4/187 (2%)
Query: 251 VQYISCS-KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ I CS C S C N+ F RK +K TE GWG+ + EPI G+FIIEY+GE
Sbjct: 979 ILMIECSPDTCPASTKCQNQLFVQRKYPAMKPAHTEERGWGLVSLEPIKHGQFIIEYVGE 1038
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A + RL K R +N+Y I IDA KGN SRF+NHSC PNC +KW+V
Sbjct: 1039 VIDEAEYKLRLQQKKERKNENYYFLTIDNSRMIDAEPKGNLSRFMNHSCQPNCETQKWKV 1098
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCW 426
G+TR+G+FA R I+ GE LT++Y G K C C A +C G++G K ++ L
Sbjct: 1099 NGDTRIGLFALRDIEPGEELTFNYNLACDGETRKPCLCKAPNCSGFIGLKVPKQQMSLVQ 1158
Query: 427 GSKRKRS 433
K +++
Sbjct: 1159 QKKIEKA 1165
>gi|354484245|ref|XP_003504300.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cricetulus
griseus]
gi|344236054|gb|EGV92157.1| Histone-lysine N-methyltransferase SETD2 [Cricetulus griseus]
Length = 2412
Score = 148 bits (373), Expect = 6e-33, Method: Composition-based stats.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEKK--IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1376 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1435
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1436 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1495
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1496 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1555
Query: 431 KRS 433
+RS
Sbjct: 1556 ERS 1558
>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
scrofa]
Length = 1338
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1017 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1076
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1077 ARIRRAQEHDITRFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1136
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1137 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1181
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV+ L + R F+CP H+C C + RCVRC
Sbjct: 703 CVVSQCGKFYHEACVRRFPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 762
Query: 180 IASH 183
+A H
Sbjct: 763 VAYH 766
>gi|448524136|ref|XP_003868932.1| Set2 protein [Candida orthopsilosis Co 90-125]
gi|380353272|emb|CCG26028.1| Set2 protein [Candida orthopsilosis]
Length = 806
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 109/166 (65%), Gaps = 3/166 (1%)
Query: 256 CSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
++ C C + C N+ F+K + + + +TE G+G++A +PI++G+FI EYIGEVID+
Sbjct: 88 TNRHCLCGDDCQNQRFQKRQYADVSVFQTELKGYGLKANKPISEGQFIYEYIGEVIDEGA 147
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
QR+ + + ++FY ++ D IDAT KG+ +RF+NHSC+PN ++KW V + R+
Sbjct: 148 FRQRMIEYDVKNYKHFYFMMLKPDAFIDATEKGSLARFVNHSCNPNAFVDKWVVGDKLRM 207
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRK 418
G+FA R I GE +T+DY ++G + + CYCG +C ++G K++
Sbjct: 208 GIFAKRKIAKGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGKKQ 253
>gi|358056897|dbj|GAA97247.1| hypothetical protein E5Q_03924 [Mixia osmundae IAM 14324]
Length = 949
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 105/161 (65%), Gaps = 3/161 (1%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
+C C + C NR F++++ I+IV T G+GV AAE + K F+ EYIGEV+ ++
Sbjct: 251 SCRCGKYCTNRRFQRKQYANIQIVDTGMKGFGVRAAEDMLKDAFVYEYIGEVVGAGQLQK 310
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ D G+++FY ++++ IDAT KGN RFLNHSC PNC + KW V + R+G+F
Sbjct: 311 RMKDYYEEGIEHFYFMALQREEFIDATKKGNKGRFLNHSCSPNCYVSKWVVGEKMRMGIF 370
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I+AGE LT++Y ++G E + CYCG ++C G++G K
Sbjct: 371 TKRKIQAGEELTFNYNVDRYGHEAQPCYCGEANCVGFIGGK 411
>gi|432099958|gb|ELK28852.1| Histone-lysine N-methyltransferase NSD3 [Myotis davidii]
Length = 1641
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 79/191 (41%), Positives = 108/191 (56%), Gaps = 7/191 (3%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1323 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1382
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1383 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1442
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIGKLELCWG 427
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K C
Sbjct: 1443 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPK----STCVA 1498
Query: 428 SKRKRSSTACL 438
+ +R+ A L
Sbjct: 1499 TTEERARNAKL 1509
Score = 41.6 bits (96), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ + + F+CPQH C C RC+RC +
Sbjct: 959 CSVGACGKFYHEACVRKFPTAVFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 1018
Query: 181 ASH 183
A H
Sbjct: 1019 AYH 1021
>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Nasonia
vitripennis]
Length = 1317
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 81/187 (43%), Positives = 109/187 (58%), Gaps = 4/187 (2%)
Query: 251 VQYISCS-KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ I CS C S C N+ F RK +K TE GWG+ + EPI G+FIIEY+GE
Sbjct: 914 ILMIECSPDTCPASTKCQNQLFVQRKYPAMKPAHTEERGWGLVSLEPIKHGQFIIEYVGE 973
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A + RL K R +N+Y I IDA KGN SRF+NHSC PNC +KW+V
Sbjct: 974 VIDEAEYKLRLQQKKERKNENYYFLTIDNSRMIDAEPKGNLSRFMNHSCQPNCETQKWKV 1033
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCW 426
G+TR+G+FA R I+ GE LT++Y G K C C A +C G++G K ++ L
Sbjct: 1034 NGDTRIGLFALRDIEPGEELTFNYNLACDGETRKPCLCKAPNCSGFIGLKVPKQQMSLVQ 1093
Query: 427 GSKRKRS 433
K +++
Sbjct: 1094 QKKIEKA 1100
>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
Length = 1366
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 100/166 (60%), Gaps = 3/166 (1%)
Query: 258 KACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + KI++T GWG+ + I KGEF+ EY+GE+ID+ C
Sbjct: 1040 QVCPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECR 1099
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + +FYM I KD IDA KGN+SRF+NHSC PNC +KW V G+TRVG+
Sbjct: 1100 SRIRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGL 1159
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIG 420
FA I AG LT++Y G E C CGA +C G+LG + K G
Sbjct: 1160 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGAPNCSGFLGDRPKNG 1205
Score = 38.1 bits (87), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 35/89 (39%), Gaps = 19/89 (21%)
Query: 114 IECRACHRFIY--------HGEEVFCSVRGCGGVYHFICVK-ERLGISNPRNFKCPQHAC 164
I C AC ++ GE C CG YH CV+ L + R F+CP H C
Sbjct: 702 ILCTACSTGVHVCFTCKKSEGEVRRCCALHCGRFYHEACVRLSALTVFENRGFRCPLHTC 761
Query: 165 FICR----------QRLQWRCVRCTIASH 183
C + RC+RC +A H
Sbjct: 762 LSCHYSGRAACKATKGKMMRCLRCPVAYH 790
>gi|348527268|ref|XP_003451141.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oreochromis
niloticus]
Length = 1605
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 71/169 (42%), Positives = 102/169 (60%), Gaps = 3/169 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C ++C N+ F K + +++KTE GWG+ + + KG+F+ EY+GEV
Sbjct: 1279 LQYECHPQVCPAGDSCENQCFSKRLYAETEVIKTEGRGWGLRTNQALKKGDFVTEYVGEV 1338
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID C+QR+ V NFYM + KD IDA KGN SRF+NHSC PNC +KW V
Sbjct: 1339 IDSEECQQRIKRAHENHVTNFYMLTLTKDRVIDAGPKGNSSRFINHSCSPNCETQKWTVN 1398
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK 416
G+ R+G+FA I+AG LT++Y G C+CG+ +C G+LG +
Sbjct: 1399 GDVRIGIFALCDIEAGTELTFNYNLHCVGNRRTSCHCGSDNCSGFLGVQ 1447
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 39/76 (51%), Gaps = 16/76 (21%)
Query: 126 GEEVF-CSVRGCGGVYHFICVKERLGISNPR--NFKCPQHACFIC----------RQRLQ 172
G EV CSV GCG YH CV++ G ++ F CPQH+C C + RL
Sbjct: 915 GREVTRCSVSGCGCYYHEDCVRKLPGTTSSPGGGFSCPQHSCSTCCLERDPQRASKGRL- 973
Query: 173 WRCVRCTIASH--DKC 186
RC+RC +A H D C
Sbjct: 974 IRCIRCPLAYHTSDSC 989
>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
Length = 1440
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1122 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1181
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1182 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1241
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1242 GDIRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPK 1292
Score = 41.6 bits (96), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 30/63 (47%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC---------RQRLQWRCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C + RC+RC +
Sbjct: 755 CSVSTCGKFYHEACVRKFATALFESRGFRCPQHCCTACSMDKDIHKASKGRMVRCLRCPV 814
Query: 181 ASH 183
A H
Sbjct: 815 AFH 817
>gi|410923178|ref|XP_003975059.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Takifugu
rubripes]
Length = 1499
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 72/169 (42%), Positives = 100/169 (59%), Gaps = 3/169 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K + ++VKT GWG++A +P+ KGEF+ EY+GEV
Sbjct: 1171 LQYECHPQVCPAGDNCENQGFTKRLYAETEVVKTADRGWGLKANQPLKKGEFVTEYVGEV 1230
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID C+QR+ + NFYM + KD IDA KGN SRF+NHSC PNC +KW V
Sbjct: 1231 IDAEECQQRIKRAHENHMTNFYMLTLTKDRVIDAAQKGNLSRFINHSCSPNCETQKWTVN 1290
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK 416
G+ +G+FA I AG LT++Y G C CG+ +C G+LG +
Sbjct: 1291 GDVHIGLFALCDIDAGTELTFNYNLHCVGNRRTTCNCGSDNCSGFLGVQ 1339
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/75 (42%), Positives = 40/75 (53%), Gaps = 14/75 (18%)
Query: 126 GEEVF-CSVRGCGGVYHFICVKERLGISNPR--NFKCPQHACFIC--RQRLQ-------W 173
G+EV CSV GCG YH CV++ LG ++ F CPQH C C + LQ
Sbjct: 807 GQEVTRCSVSGCGCYYHEDCVRKLLGTTSSPGGGFCCPQHICSTCCLERDLQRASKGRLM 866
Query: 174 RCVRCTIASH--DKC 186
RC+RC +A H D C
Sbjct: 867 RCIRCPVAYHTGDSC 881
>gi|432952957|ref|XP_004085262.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like, partial
[Oryzias latipes]
Length = 1167
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 79/171 (46%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K + KI+KT GWG+ A I KG+F+ EYIGE+
Sbjct: 931 LQYECHPQVCPSGERCCNQDFTKRLYPETKIIKTPGKGWGLVALRDIKKGKFVNEYIGEL 990
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ + NFYM I KD IDA KGN+SRF+NHSC PNC +KW V
Sbjct: 991 IDEEECRARIKYAHENNITNFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVN 1050
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
G+TRVG+FA I AG LT++Y G E C CGA +C G+LG + K
Sbjct: 1051 GDTRVGLFAVCHIPAGTELTFNYNLDCLGNEKTICRCGAPNCSGFLGDRPK 1101
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 37/80 (46%), Gaps = 15/80 (18%)
Query: 116 CRACHRFIYHGEEVFCSVRGCGGVYHFICVK-ERLGISNPRNFKCPQHACFIC------- 167
C AC + GE C V CG YH C++ L + + + +CP HAC C
Sbjct: 612 CFACKQ--SEGEVRRCHVVHCGKFYHEACIRLNPLTVFDNKGLRCPLHACLSCHYGSRTK 669
Query: 168 ----RQRLQWRCVRCTIASH 183
+ RL RC+RC +A H
Sbjct: 670 HKSTKGRL-MRCLRCPVAYH 688
>gi|326921432|ref|XP_003206963.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD2-like [Meleagris gallopavo]
Length = 2147
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 71/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F+K++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1332 IECSSRCPNGDFCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1391
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1392 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1451
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1452 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1511
Query: 431 KRS 433
+RS
Sbjct: 1512 ERS 1514
>gi|440891718|gb|ELR45266.1| Histone-lysine N-methyltransferase SETD2, partial [Bos grunniens
mutus]
Length = 2533
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1496 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1555
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1556 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1615
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1616 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1675
Query: 431 KRS 433
+RS
Sbjct: 1676 ERS 1678
>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 6
[Pan troglodytes]
gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 7
[Pan troglodytes]
gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 8
[Pan troglodytes]
gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
10 [Pan troglodytes]
gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
paniscus]
gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
Length = 1365
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Gorilla gorilla gorilla]
Length = 1365
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
Length = 1365
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|296225059|ref|XP_002758501.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Callithrix
jacchus]
Length = 2510
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1473 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1532
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1533 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1592
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1593 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1652
Query: 431 KRS 433
+RS
Sbjct: 1653 ERS 1655
>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Protein trithorax-5;
AltName: Full=Wolf-Hirschhorn syndrome candidate 1
protein; Short=WHSC1
gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
Length = 1365
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
catus]
Length = 1300
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 979 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1038
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1039 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1098
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1099 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1143
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV++ L + R F+CP H+C C + RCVRC
Sbjct: 665 CVVSQCGKFYHEACVRKYPLTVFESRGFRCPLHSCVTCHASNPSNPRPSKGKMMRCVRCP 724
Query: 180 IASH 183
+A H
Sbjct: 725 VAYH 728
>gi|12697312|emb|CAC28350.1| putative chromatin modulator [Homo sapiens]
Length = 1437
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1289
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 31/67 (46%), Gaps = 10/67 (14%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASHDKCA 187
A H + A
Sbjct: 825 AYHSEDA 831
>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Sarcophilus harrisii]
Length = 1464
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 77/164 (46%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1045 QVCPAGEQCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECM 1104
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1105 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1164
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1165 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPK 1208
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 731 CVVSQCGKFYHEACVKKYHLTVFESRGFRCPLHSCVSCHVTNPSNPRSSKGKMMRCVRCP 790
Query: 180 IASH 183
+A H
Sbjct: 791 VAYH 794
>gi|119914792|ref|XP_589886.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
Length = 2547
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1510 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1569
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1570 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1629
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1630 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1689
Query: 431 KRS 433
+RS
Sbjct: 1690 ERS 1692
>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
Length = 1365
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Pongo abelii]
Length = 1365
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|356507632|ref|XP_003522568.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
max]
Length = 2081
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 104/160 (65%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C C + C+N+ F+K K +K K G+G++A E + +G+F+IEY+GEV+D E
Sbjct: 1207 TCPCGDRCSNQQFQKHKYASLKWFKCGKKGYGLKAIEDVAQGQFLIEYVGEVLDMQTYEA 1266
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + +G ++FY + IDA+ KGN RF+NHSCDPNC EKW V GE +G+F
Sbjct: 1267 RQREYALKGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLF 1326
Query: 377 AARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSCQGYLG 414
A R++K E LT+DY +V+ FG KCYCG+S+C+GY+G
Sbjct: 1327 ALRNVKKDEELTFDYNYVRVFGAAAKKCYCGSSNCRGYIG 1366
>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
fascicularis]
Length = 1365
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.0 bits (97), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|402860278|ref|XP_003894560.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Papio anubis]
Length = 2521
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1484 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1543
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1544 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1603
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1604 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1663
Query: 431 KRS 433
+RS
Sbjct: 1664 ERS 1666
>gi|297671474|ref|XP_002813857.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pongo abelii]
Length = 2563
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1526 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1585
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1586 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1645
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1646 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1705
Query: 431 KRS 433
+RS
Sbjct: 1706 ERS 1708
>gi|432092361|gb|ELK24976.1| Histone-lysine N-methyltransferase SETD2 [Myotis davidii]
Length = 2865
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1830 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1889
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1890 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1949
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1950 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 2009
Query: 431 KRS 433
+RS
Sbjct: 2010 ERS 2012
>gi|197313748|ref|NP_054878.5| histone-lysine N-methyltransferase SETD2 [Homo sapiens]
gi|296452963|sp|Q9BYW2.3|SETD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD2; AltName:
Full=HIF-1; AltName: Full=Huntingtin yeast partner B;
AltName: Full=Huntingtin-interacting protein 1;
Short=HIP-1; AltName: Full=Huntingtin-interacting protein
B; AltName: Full=Lysine N-methyltransferase 3A; AltName:
Full=SET domain-containing protein 2; Short=hSET2;
AltName: Full=p231HBP
Length = 2564
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1527 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1586
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1587 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1646
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1647 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1706
Query: 431 KRS 433
+RS
Sbjct: 1707 ERS 1709
>gi|410220670|gb|JAA07554.1| SET domain containing 2 [Pan troglodytes]
gi|410261336|gb|JAA18634.1| SET domain containing 2 [Pan troglodytes]
gi|410295964|gb|JAA26582.1| SET domain containing 2 [Pan troglodytes]
gi|410339683|gb|JAA38788.1| SET domain containing 2 [Pan troglodytes]
Length = 2564
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1527 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1586
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1587 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1646
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1647 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1706
Query: 431 KRS 433
+RS
Sbjct: 1707 ERS 1709
>gi|296474690|tpg|DAA16805.1| TPA: Wolf-Hirschhorn syndrome candidate 1 protein-like [Bos taurus]
Length = 2547
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1510 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1569
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1570 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1629
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1630 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1689
Query: 431 KRS 433
+RS
Sbjct: 1690 ERS 1692
>gi|73985747|ref|XP_864158.1| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 11 [Canis
lupus familiaris]
Length = 2562
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1525 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1584
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1585 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1644
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1645 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1704
Query: 431 KRS 433
+RS
Sbjct: 1705 ERS 1707
>gi|332216412|ref|XP_003257344.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Nomascus
leucogenys]
Length = 2499
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1512 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1571
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1572 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1631
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1632 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1691
Query: 431 KRS 433
+RS
Sbjct: 1692 ERS 1694
>gi|13699811|ref|NP_075447.1| histone-lysine N-methyltransferase NSD3 isoform long [Homo sapiens]
gi|74761342|sp|Q9BZ95.1|NSD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
Full=Nuclear SET domain-containing protein 3; AltName:
Full=Protein whistle; AltName: Full=WHSC1-like 1 isoform
9 with methyltransferase activity to lysine; AltName:
Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1;
Short=WHSC1-like protein 1
gi|12642817|gb|AAK00355.1| putative protein WHSC1L1l [Homo sapiens]
gi|75517229|gb|AAI01718.1| WHSC1L1 protein, isoform long [Homo sapiens]
gi|109731704|gb|AAI13470.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
gi|119583726|gb|EAW63322.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_c [Homo
sapiens]
gi|168277388|dbj|BAG10672.1| histone-lysine N-methyltransferase NSD3 [synthetic construct]
Length = 1437
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1289
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|402878017|ref|XP_003902703.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Papio anubis]
Length = 1438
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1290
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|348554221|ref|XP_003462924.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Cavia
porcellus]
Length = 1438
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1290
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 33/69 (47%), Gaps = 11/69 (15%)
Query: 126 GEEV-FCSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WR 174
GE+V CSV CG YH CV++ I + F+CPQH C C R
Sbjct: 759 GEDVKRCSVGACGKFYHEACVRKFSTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMR 818
Query: 175 CVRCTIASH 183
C+RC +A H
Sbjct: 819 CLRCPVAYH 827
>gi|426340342|ref|XP_004034089.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gorilla gorilla
gorilla]
Length = 2564
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1527 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1586
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1587 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1646
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1647 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1706
Query: 431 KRS 433
+RS
Sbjct: 1707 ERS 1709
>gi|397495290|ref|XP_003818492.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pan paniscus]
Length = 2564
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1527 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1586
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1587 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1646
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1647 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1706
Query: 431 KRS 433
+RS
Sbjct: 1707 ERS 1709
>gi|296222021|ref|XP_002757008.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Callithrix jacchus]
Length = 1438
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1290
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|397521373|ref|XP_003830771.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pan
paniscus]
Length = 1437
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1289
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|395739588|ref|XP_002819060.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pongo
abelii]
Length = 1438
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1290
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|380814048|gb|AFE78898.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca mulatta]
gi|383419477|gb|AFH32952.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca mulatta]
Length = 1438
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1290
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|375294191|ref|NP_001069063.2| histone-lysine N-methyltransferase NSD3 [Bos taurus]
gi|359080579|ref|XP_003588019.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
taurus]
Length = 1439
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K ++++TE GWG+ I KGEF+ EY+GE+
Sbjct: 1121 LQYECHPQVCPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGEL 1180
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1181 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1240
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1241 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1291
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQW---------RCVRCTI 180
CSV CG YH CV++ + + F+CPQH C C RC+RC +
Sbjct: 766 CSVSACGKFYHEACVRKFPTAVFESKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPV 825
Query: 181 ASH 183
A H
Sbjct: 826 AYH 828
>gi|410307858|gb|JAA32529.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
Length = 1437
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1289
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|281343603|gb|EFB19187.1| hypothetical protein PANDA_000629 [Ailuropoda melanoleuca]
Length = 2535
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1498 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1557
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1558 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1617
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1618 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1677
Query: 431 KRS 433
+RS
Sbjct: 1678 ERS 1680
>gi|12697314|emb|CAC28351.1| Putative Chromatin modulator [Homo sapiens]
Length = 1388
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1070 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1129
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1130 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1189
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1190 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1240
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 31/67 (46%), Gaps = 10/67 (14%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASHDKCA 187
A H + A
Sbjct: 825 AYHSEDA 831
>gi|301754075|ref|XP_002912890.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Ailuropoda
melanoleuca]
Length = 2549
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1512 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1571
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1572 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1631
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1632 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1691
Query: 431 KRS 433
+RS
Sbjct: 1692 ERS 1694
>gi|114619713|ref|XP_519713.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 6 [Pan
troglodytes]
gi|410226716|gb|JAA10577.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
gi|410266866|gb|JAA21399.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
gi|410353855|gb|JAA43531.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
Length = 1437
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1289
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|92098122|gb|AAI15007.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
Length = 1437
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1289
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|395516140|ref|XP_003762252.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Sarcophilus
harrisii]
Length = 2570
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1529 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1588
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1589 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1648
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1649 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1708
Query: 431 KRS 433
+RS
Sbjct: 1709 ERS 1711
>gi|397521377|ref|XP_003830773.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3 [Pan
paniscus]
Length = 1388
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1070 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1129
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1130 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1189
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1190 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1240
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|281346901|gb|EFB22485.1| hypothetical protein PANDA_005493 [Ailuropoda melanoleuca]
Length = 926
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 102/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +++KTE GWG+ I KGEF+ EY+GE+
Sbjct: 608 LQYECHPQVCPAGERCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGEL 667
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C+ R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 668 IDEEECKLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 727
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 728 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPK 778
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 39/85 (45%), Gaps = 15/85 (17%)
Query: 114 IECR----ACHRFIYHGEEV-FCSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC 167
+EC+ AC G +V CSV CG YH CV++ I + F+CPQH C C
Sbjct: 222 VECKTGQHACFSCKVSGTDVKRCSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSAC 281
Query: 168 ---------RQRLQWRCVRCTIASH 183
+ RC+RC +A H
Sbjct: 282 SVEKDIYKASKGRMMRCLRCPVAYH 306
>gi|417406999|gb|JAA50136.1| Putative clathrin coat binding protein/huntingtin [Desmodus rotundus]
Length = 2557
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1520 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1579
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1580 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1639
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1640 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1699
Query: 431 KRS 433
+RS
Sbjct: 1700 ERS 1702
>gi|403268536|ref|XP_003926329.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Saimiri
boliviensis boliviensis]
Length = 2057
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1020 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1079
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1080 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1139
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1140 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1199
Query: 431 KRS 433
+RS
Sbjct: 1200 ERS 1202
>gi|431905124|gb|ELK10179.1| Histone-lysine N-methyltransferase SETD2 [Pteropus alecto]
Length = 2482
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1445 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1504
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1505 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1564
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1565 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1624
Query: 431 KRS 433
+RS
Sbjct: 1625 ERS 1627
>gi|426359420|ref|XP_004046973.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Gorilla gorilla
gorilla]
Length = 1397
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1079 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1138
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1139 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1198
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1199 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1249
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 725 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 784
Query: 181 ASH 183
A H
Sbjct: 785 AYH 787
>gi|148677064|gb|EDL09011.1| mCG15806 [Mus musculus]
Length = 2034
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 998 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1057
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1058 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1117
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1118 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1177
Query: 431 KRS 433
+RS
Sbjct: 1178 ERS 1180
>gi|355559685|gb|EHH16413.1| hypothetical protein EGK_11693 [Macaca mulatta]
Length = 2343
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1306 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1365
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1366 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1425
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1426 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1485
Query: 431 KRS 433
+RS
Sbjct: 1486 ERS 1488
>gi|114619717|ref|XP_001170799.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 4 [Pan
troglodytes]
Length = 1388
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1070 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1129
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1130 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1189
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1190 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1240
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|114586572|ref|XP_516423.2| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 3 [Pan
troglodytes]
Length = 2549
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1512 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1571
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1572 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1631
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1632 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1691
Query: 431 KRS 433
+RS
Sbjct: 1692 ERS 1694
>gi|380812066|gb|AFE77908.1| histone-lysine N-methyltransferase SETD2 [Macaca mulatta]
Length = 2565
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1528 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1587
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1588 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1647
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1648 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1707
Query: 431 KRS 433
+RS
Sbjct: 1708 ERS 1710
>gi|351699801|gb|EHB02720.1| Histone-lysine N-methyltransferase NSD3 [Heterocephalus glaber]
Length = 1439
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1121 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1180
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1181 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1240
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1241 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1291
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFSTAIFESKGFRCPQHCCSACSMEKDIYKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|109040979|ref|XP_001113652.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like isoform 2
[Macaca mulatta]
Length = 2550
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1513 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1572
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1573 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1632
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1633 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1692
Query: 431 KRS 433
+RS
Sbjct: 1693 ERS 1695
>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
Length = 1235
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+
Sbjct: 917 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGEL 976
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 977 IDEEECRLRIQRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1036
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1037 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPK 1087
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ + + F+CPQH C C RC+RC +
Sbjct: 619 CSVSACGKFYHEACVRKFPTAVFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 678
Query: 181 ASH 183
A H
Sbjct: 679 AYH 681
>gi|432909264|ref|XP_004078147.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oryzias
latipes]
Length = 1665
Score = 147 bits (370), Expect = 1e-32, Method: Composition-based stats.
Identities = 70/191 (36%), Positives = 112/191 (58%), Gaps = 3/191 (1%)
Query: 254 ISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C C+NR F ++ +++ T+ GWG+ AA+ + F++EY GEV+D
Sbjct: 608 IECSSRCLNGAYCSNRRFQMKQHADFEVILTDDKGWGLRAAKEMAPNTFVLEYCGEVLDH 667
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ + IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 668 KEFKTRVKEYARNKNIHYYFMSLKNNEIIDATLKGNCSRFMNHSCEPNCETQKWTVNGQL 727
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F +++ AG LT+DY+F ++G E KC+CGA SC+G+LG + ++ K+
Sbjct: 728 RVGFFTTKAVAAGTELTFDYQFQRYGKEAQKCFCGAPSCRGFLGGENRVSVRAAGGKMKK 787
Query: 431 KRSSTACLAII 441
R+ + L +
Sbjct: 788 DRNRKSALTTV 798
>gi|338714932|ref|XP_001495700.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Equus caballus]
Length = 2064
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1027 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1086
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1087 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1146
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1147 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1206
Query: 431 KRS 433
+RS
Sbjct: 1207 ERS 1209
>gi|348582642|ref|XP_003477085.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cavia
porcellus]
Length = 2565
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1528 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1587
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1588 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1647
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1648 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1707
Query: 431 KRS 433
+RS
Sbjct: 1708 ERS 1710
>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
carolinensis]
Length = 1438
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K + +I+KT+ GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGEGCQNQCFTKRLYPEAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCHPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFAVCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPKTA 1292
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 30/63 (47%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC---------RQRLQWRCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C + RC+RC I
Sbjct: 764 CSVNACGKFYHEACVRKFATTVFESRGFRCPQHCCTSCSVDKDIHKASKGRMVRCLRCPI 823
Query: 181 ASH 183
A H
Sbjct: 824 AYH 826
>gi|358419376|ref|XP_003584219.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
taurus]
Length = 1439
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K ++++TE GWG+ I KGEF+ EY+GE+
Sbjct: 1121 LQYECHPQVCPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGEL 1180
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1181 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1240
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1241 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1291
Score = 42.0 bits (97), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQW---------RCVRCTI 180
CSV CG YH CV++ + + F+CPQH C C RC+RC +
Sbjct: 766 CSVSACGKFYHEACVRKFPTAVFESKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPV 825
Query: 181 ASH 183
A H
Sbjct: 826 AYH 828
>gi|426256406|ref|XP_004021831.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Ovis
aries]
Length = 1439
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K ++++TE GWG+ I KGEF+ EY+GE+
Sbjct: 1121 LQYECHPQVCPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGEL 1180
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1181 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1240
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1241 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1291
Score = 41.6 bits (96), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQW---------RCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 766 CSVSTCGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPV 825
Query: 181 ASH 183
A H
Sbjct: 826 AYH 828
>gi|410951014|ref|XP_003982197.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Felis catus]
Length = 2064
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1027 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1086
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1087 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1146
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1147 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1206
Query: 431 KRS 433
+RS
Sbjct: 1207 ERS 1209
>gi|359078405|ref|XP_002697155.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
Length = 1448
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 450 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 509
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 510 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 569
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 570 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 629
Query: 431 KRS 433
+RS
Sbjct: 630 ERS 632
>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus norvegicus]
Length = 1346
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1025 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1084
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1085 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1144
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1145 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1189
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 711 CMVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 770
Query: 180 IASH 183
+A H
Sbjct: 771 VAYH 774
>gi|197927225|ref|NP_001074809.2| SET domain containing 2 [Mus musculus]
Length = 2537
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1501 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1560
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1561 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1620
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1621 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1680
Query: 431 KRS 433
+RS
Sbjct: 1681 ERS 1683
>gi|109658484|gb|AAI17163.1| SET domain containing 2 [Homo sapiens]
gi|109658962|gb|AAI17165.1| SET domain containing 2 [Homo sapiens]
Length = 2061
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1024 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1083
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1084 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1143
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1144 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1203
Query: 431 KRS 433
+RS
Sbjct: 1204 ERS 1206
>gi|12697196|emb|CAC28349.1| huntingtin interacting protein 1 [Homo sapiens]
gi|50512435|gb|AAT77612.1| HSPC069 isoform a [Homo sapiens]
Length = 2061
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1024 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1083
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1084 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1143
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1144 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1203
Query: 431 KRS 433
+RS
Sbjct: 1204 ERS 1206
>gi|440907576|gb|ELR57709.1| Histone-lysine N-methyltransferase NSD3 [Bos grunniens mutus]
Length = 1446
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K ++++TE GWG+ I KGEF+ EY+GE+
Sbjct: 1130 LQYECHPQVCPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGEL 1189
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1190 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1249
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1250 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1300
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQW---------RCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 766 CSVSACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPV 825
Query: 181 ASH 183
A H
Sbjct: 826 AYH 828
>gi|431902251|gb|ELK08752.1| Histone-lysine N-methyltransferase NSD3 [Pteropus alecto]
Length = 1322
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1079 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1138
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1139 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1198
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1199 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1249
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH C+++ I + F+CPQH C C RC+RC +
Sbjct: 764 CSVGACGKFYHEACIRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 823
Query: 181 ASH 183
A H
Sbjct: 824 AYH 826
>gi|20521978|dbj|BAB21823.2| KIAA1732 protein [Homo sapiens]
Length = 1915
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 878 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 937
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 938 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 997
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 998 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1057
Query: 431 KRS 433
+RS
Sbjct: 1058 ERS 1060
>gi|118101386|ref|XP_001232891.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Gallus
gallus]
Length = 1436
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KT+ GWG+ I KGEF+ EY+GE+
Sbjct: 1118 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGEL 1177
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1178 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1237
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1238 GDIRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPKTA 1290
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC---------RQRLQWRCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C + RC RC I
Sbjct: 763 CSVSACGKFYHEACVRKFATALFESRGFRCPQHCCSACSVDKDIHKASKGRMVRCFRCPI 822
Query: 181 ASH 183
A H
Sbjct: 823 AYH 825
>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
Length = 1366
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1045 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1104
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1105 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1164
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1165 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1209
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 731 CVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 790
Query: 180 IASH 183
+A H
Sbjct: 791 VAYH 794
>gi|344281620|ref|XP_003412576.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Loxodonta africana]
Length = 1438
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1290
Score = 42.0 bits (97), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGTCGKFYHETCVRKFPTAIFESKGFRCPQHCCSACSMEKDIYKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Monodelphis domestica]
Length = 1366
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/164 (46%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1045 QVCPAGEQCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECM 1104
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1105 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1164
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1165 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPK 1208
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 731 CVVSQCGKFYHEACVKKYHLTVFESRGFRCPLHSCVSCHVTNPSNPRSSKGKMMRCVRCP 790
Query: 180 IASH 183
+A H
Sbjct: 791 VAYH 794
>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
Length = 1421
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/164 (46%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 887 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 946
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 947 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1006
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1007 FALCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPK 1050
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 618 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 677
Query: 180 IASH 183
+A H
Sbjct: 678 VAYH 681
>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
syndrome candidate 1 protein homolog; Short=WHSC1
Length = 1365
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
Length = 1346
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1025 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1084
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1085 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1144
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1145 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1189
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 711 CVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 770
Query: 180 IASH 183
+A H
Sbjct: 771 VAYH 774
>gi|296222023|ref|XP_002757009.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
[Callithrix jacchus]
Length = 1389
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1071 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1130
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1131 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1190
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1191 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1241
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Cavia porcellus]
Length = 1366
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV++ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVRKFPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|344276291|ref|XP_003409942.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Loxodonta
africana]
Length = 2551
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 113/183 (61%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA + F++EY GEV+D
Sbjct: 1513 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAARDLPSNTFVLEYCGEVLDH 1572
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1573 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1632
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1633 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1692
Query: 431 KRS 433
+RS
Sbjct: 1693 ERS 1695
>gi|157824020|ref|NP_001101659.1| histone-lysine N-methyltransferase SETD2 [Rattus norvegicus]
gi|149018436|gb|EDL77077.1| kinesin family member 9 (predicted) [Rattus norvegicus]
Length = 2294
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1258 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1317
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1318 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1377
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1378 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1437
Query: 431 KRS 433
+RS
Sbjct: 1438 ERS 1440
>gi|119585214|gb|EAW64810.1| SET domain containing 2, isoform CRA_f [Homo sapiens]
Length = 2342
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1305 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1364
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1365 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1424
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1425 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1484
Query: 431 KRS 433
+RS
Sbjct: 1485 ERS 1487
>gi|326932813|ref|XP_003212507.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 1
[Meleagris gallopavo]
Length = 1436
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KT+ GWG+ I KGEF+ EY+GE+
Sbjct: 1118 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGEL 1177
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1178 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1237
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1238 GDIRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPKTA 1290
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC---------RQRLQWRCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C + RC RC I
Sbjct: 763 CSVSACGKFYHEACVRKFATALFESRGFRCPQHCCSACSVDKDIHKASKGRMVRCFRCPI 822
Query: 181 ASH 183
A H
Sbjct: 823 AYH 825
>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
Length = 1364
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1043 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1102
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1103 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1162
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1163 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1207
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV++ L + R F+CP H+C C + RCVRC
Sbjct: 729 CVVSQCGKFYHEACVRKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 788
Query: 180 IASH 183
+A H
Sbjct: 789 VAYH 792
>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
Length = 1298
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 977 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1036
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1037 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1096
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1097 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1141
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 663 CVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 722
Query: 180 IASH 183
+A H
Sbjct: 723 VAYH 726
>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
glaber]
Length = 1372
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1050 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1109
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1110 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1169
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1170 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1214
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV+ L + R F+CP H+C C + RCVRC
Sbjct: 743 CVVTQCGKFYHDACVRRYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 802
Query: 180 IASH 183
+A H
Sbjct: 803 VAYH 806
>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Cavia porcellus]
Length = 1367
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1045 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1104
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1105 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1164
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1165 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1209
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV++ L + R F+CP H+C C + RCVRC
Sbjct: 731 CVVTQCGKFYHEACVRKFPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 790
Query: 180 IASH 183
+A H
Sbjct: 791 VAYH 794
>gi|60688116|gb|AAH90954.1| SETD2 protein [Homo sapiens]
Length = 1845
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 808 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 867
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 868 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 927
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 928 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 987
Query: 431 KRS 433
+RS
Sbjct: 988 ERS 990
>gi|118101388|ref|XP_424390.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Gallus
gallus]
Length = 1386
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KT+ GWG+ I KGEF+ EY+GE+
Sbjct: 1068 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGEL 1127
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1128 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1187
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1188 GDIRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPKTA 1240
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC---------RQRLQWRCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C + RC RC I
Sbjct: 763 CSVSACGKFYHEACVRKFATALFESRGFRCPQHCCSACSVDKDIHKASKGRMVRCFRCPI 822
Query: 181 ASH 183
A H
Sbjct: 823 AYH 825
>gi|355746723|gb|EHH51337.1| hypothetical protein EGM_10693 [Macaca fascicularis]
Length = 2343
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 113/183 (61%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA + F++EY GEV+D
Sbjct: 1306 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAARDLPSNTFVLEYCGEVLDH 1365
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1366 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1425
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1426 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1485
Query: 431 KRS 433
+RS
Sbjct: 1486 ERS 1488
>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
(predicted) [Rattus norvegicus]
Length = 1298
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 977 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1036
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1037 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1096
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1097 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1141
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 663 CMVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 722
Query: 180 IASH 183
+A H
Sbjct: 723 VAYH 726
>gi|119585209|gb|EAW64805.1| SET domain containing 2, isoform CRA_a [Homo sapiens]
Length = 1538
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1024 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1083
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1084 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1143
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1144 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1203
Query: 431 KRS 433
+RS
Sbjct: 1204 ERS 1206
>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Cricetulus griseus]
Length = 1365
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/164 (46%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPK 1207
Score = 42.0 bits (97), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHATNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|119585211|gb|EAW64807.1| SET domain containing 2, isoform CRA_c [Homo sapiens]
Length = 1819
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1305 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1364
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1365 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1424
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1425 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1484
Query: 431 KRS 433
+RS
Sbjct: 1485 ERS 1487
>gi|403294386|ref|XP_003938171.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Saimiri
boliviensis boliviensis]
Length = 1438
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 100/171 (58%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1239
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G C+CGA +C G+LG + K
Sbjct: 1240 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTACHCGADNCSGFLGVRPK 1290
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Canis lupus familiaris]
Length = 1364
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1043 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1102
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1103 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1162
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1163 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1207
Score = 41.6 bits (96), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV++ L + R F+CP H+C C + RCVRC
Sbjct: 729 CVVSQCGKFYHEACVRKYPLAVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 788
Query: 180 IASH 183
+A H
Sbjct: 789 VAYH 792
>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus griseus]
Length = 1344
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/164 (46%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1023 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1082
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1083 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1142
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1143 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPK 1186
Score = 42.0 bits (97), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 709 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHATNPSNPRPSKGKMMRCVRCP 768
Query: 180 IASH 183
+A H
Sbjct: 769 VAYH 772
>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
Length = 1363
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1042 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1101
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1102 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1161
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1162 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1206
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV++ L + R F+CP H+C C + RCVRC
Sbjct: 728 CVVSQCGKFYHEACVRKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 787
Query: 180 IASH 183
+A H
Sbjct: 788 VAYH 791
>gi|326932815|ref|XP_003212508.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 2
[Meleagris gallopavo]
Length = 1386
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KT+ GWG+ I KGEF+ EY+GE+
Sbjct: 1068 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGEL 1127
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1128 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1187
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1188 GDIRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGAENCSGFLGVRPKTA 1240
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC---------RQRLQWRCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C + RC RC I
Sbjct: 763 CSVSACGKFYHEACVRKFATALFESRGFRCPQHCCSACSVDKDIHKASKGRMVRCFRCPI 822
Query: 181 ASH 183
A H
Sbjct: 823 AYH 825
>gi|50512437|gb|AAT77613.1| HSPC069 isoform b [Homo sapiens]
Length = 1211
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 1024 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 1083
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 1084 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 1143
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 1144 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 1203
Query: 431 KRS 433
+RS
Sbjct: 1204 ERS 1206
>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Equus caballus]
Length = 1365
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1164 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1208
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV++ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVSQCGKFYHEACVRKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
Length = 857
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 536 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 595
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 596 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 655
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 656 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 700
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 222 CVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 281
Query: 180 IASH 183
+A H
Sbjct: 282 VAYH 285
>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Papio anubis]
Length = 1013
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 692 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 751
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 752 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 811
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 812 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 856
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 378 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 437
Query: 180 IASH 183
+A H
Sbjct: 438 VAYH 441
>gi|443722431|gb|ELU11300.1| hypothetical protein CAPTEDRAFT_160470, partial [Capitella teleta]
Length = 282
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/165 (43%), Positives = 98/165 (59%), Gaps = 3/165 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C + C N+ F++ K + + E GWG+ A ++ G F++EY+GEV+D
Sbjct: 31 IECGSRCPCGDMCTNKRFQRRHYAKTEPFRAEVKGWGLRATSDLSSGVFVMEYVGEVLDY 90
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
R + Y + D IDAT KGN SRF+NHSCDPNC +KW V G+
Sbjct: 91 PNFRLRCKQYAEDNHTHHYFMALNGDEIIDATQKGNTSRFINHSCDPNCETQKWTVNGQL 150
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGT 415
RVG F RSI AG LT+DY+F Q+G E+ +C+CGA SC+G +GT
Sbjct: 151 RVGFFTLRSIPAGTELTFDYQFEQYGSEIQRCFCGADSCRGIIGT 195
>gi|328865276|gb|EGG13662.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1418
Score = 146 bits (369), Expect = 2e-32, Method: Composition-based stats.
Identities = 73/168 (43%), Positives = 106/168 (63%), Gaps = 6/168 (3%)
Query: 253 YISC-SKACHCSETCNNRPFRKEKKIKIV--KTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
Y C S C + C N+ F+K++ +I+ E GWG++A + I FIIEY GEVI
Sbjct: 1033 YFECHSGYCIHGDRCRNQRFQKQQYCRILPFSAEKKGWGLKAVDNIGAKTFIIEYCGEVI 1092
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
C +R+ + + + FY + + +DA+ KGN +RF+NHSCDPNC +KW V+G
Sbjct: 1093 SKQKCLERMTESESE--KYFYFLTLDRLECLDASRKGNLARFINHSCDPNCETQKWNVDG 1150
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
E R+G+FA R IK GE LT+DY + +FG + CYCGA++C+G+LG K
Sbjct: 1151 EVRIGIFAIRDIKRGEELTFDYNYERFGTSKQVCYCGAANCRGFLGEK 1198
>gi|118348872|ref|XP_001007909.1| SET domain containing protein [Tetrahymena thermophila]
gi|89289676|gb|EAR87664.1| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 3527
Score = 146 bits (369), Expect = 2e-32, Method: Composition-based stats.
Identities = 72/162 (44%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 256 CSKACHCSETCNNRPFRKEKKIKI--VKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C +C C E C NR F+K + + V T GWG+ A + I KG FII+Y GEV D
Sbjct: 182 CDLSCRCGELCQNRRFQKHQDACVYPVPTRGKGWGLCAGQFIPKGTFIIQYTGEVFDINS 241
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
E Y Y+ +I ++ ID T+KGN +RF+NHSCDPNCI +KW V GE +
Sbjct: 242 SEGIKRCKDYSRSTCTYLMKIDRNEVIDPTYKGNLARFINHSCDPNCITQKWHVLGEICI 301
Query: 374 GVFAARSIKAGEPLTYDYRFVQFG-PEVKCYCGASSCQGYLG 414
G+F+ + I+ E LT+DY+F F P KC C A+ C+GYLG
Sbjct: 302 GIFSIKDIQEDEELTFDYQFDSFKTPLTKCLCQAAKCKGYLG 343
>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
leucogenys]
Length = 780
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/164 (46%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 539 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 598
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 599 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 658
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 659 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPK 702
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 225 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 284
Query: 180 IASH 183
+A H
Sbjct: 285 VAYH 288
>gi|452846178|gb|EME48111.1| hypothetical protein DOTSEDRAFT_167709 [Dothistroma septosporum
NZE10]
Length = 963
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C C NR F+++K + ++KTE G+G+ A + + G+F+ EYIGEVI +
Sbjct: 186 MECVGDCGCGSKCQNRRFQRKKYADVTVIKTEKKGYGLRADKELRPGDFVYEYIGEVIGE 245
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ +R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 246 NVFRRRMQQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVNDKL 305
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+F R+I+AGE L ++Y ++G + + CYCG +C GY+G GK + G+K
Sbjct: 306 RMGIFVERNIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGYIG-----GKTQTERGTKL 360
Query: 431 KRSSTACLAI 440
+ L I
Sbjct: 361 SHTIIEALGI 370
>gi|242073096|ref|XP_002446484.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
gi|241937667|gb|EES10812.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
Length = 521
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/185 (42%), Positives = 109/185 (58%), Gaps = 4/185 (2%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++V+TE GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 68 CRCGVYCKNQRFQKCQYARTRLVRTEGRGWGLVADENIMAGQFVIEYCGEVISWKESKRR 127
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ Y+ + D +IDAT KGNF+RF+NHSC PNC KW V GE RVG+FA
Sbjct: 128 AQAYETQGLKDAYIIYLNADESIDATRKGNFARFINHSCQPNCETRKWNVLGEVRVGIFA 187
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK-RKIGKLELCWGSKRKRSST 435
+ I G L+YDY F +G V+C CGA+SC G+LG K R + W R S
Sbjct: 188 KQDIPFGTELSYDYNFEWYGGVMVRCLCGAASCSGFLGAKSRGFQEATYLWEDDDDRFSV 247
Query: 436 ACLAI 440
+ +
Sbjct: 248 ENIPL 252
>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Anolis carolinensis]
Length = 2106
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 100/162 (61%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG++A I KGEF+ EY+GE+ID+ C R
Sbjct: 1356 CPAGNRCQNQCFTKRQYPEVEIFRTLARGWGLQAKRDIKKGEFVNEYVGELIDEEECRAR 1415
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1416 IRHAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1475
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
++KAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1476 ITNVKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1517
>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Taeniopygia guttata]
Length = 1651
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 101/162 (62%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C E C N+ F K + +++I +T GWG++A I KGEF+ EY+GE+ID+ C R
Sbjct: 654 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 713
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 714 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 773
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 774 LVNIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 815
>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
Length = 851
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 530 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 589
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 590 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 649
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 650 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 694
Score = 41.6 bits (96), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 216 CVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 275
Query: 180 IASH 183
+A H
Sbjct: 276 VAYH 279
>gi|328768890|gb|EGF78935.1| hypothetical protein BATDEDRAFT_90118 [Batrachochytrium
dendrobatidis JAM81]
Length = 1361
Score = 145 bits (367), Expect = 3e-32, Method: Composition-based stats.
Identities = 72/168 (42%), Positives = 104/168 (61%), Gaps = 4/168 (2%)
Query: 254 ISCS-KACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS + C C NR F+ + I++ + G+G+ A E I G FIIEY GEVI
Sbjct: 217 LECSAEDCPTGNACQNRRFQLCQYSPIQVARAGSKGFGIYARENIAGGAFIIEYCGEVIP 276
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+L +R+ + Q+FY ++KD IDA+ KGN SR+LNHSCDPNC L+KW V
Sbjct: 277 ASLFGKRITEHSNNSAQHFYFMSLKKDEYIDASKKGNLSRYLNHSCDPNCSLQKWLVGDT 336
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKR 417
R+G+FA R+I LT+DY+F ++G + + CYCGA++C G++G R
Sbjct: 337 IRIGLFALRAIPKNAELTFDYKFERYGSKAQECYCGAAACTGFIGGNR 384
>gi|126303359|ref|XP_001372863.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Monodelphis
domestica]
Length = 1435
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/173 (42%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1117 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1176
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ + NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1177 IDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1236
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1237 GDIRVGLFALCDIPAGVELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPKTA 1289
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 39/85 (45%), Gaps = 15/85 (17%)
Query: 114 IECRACHRFIYH----GEEV-FCSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC 167
EC+ HR + G++V CSV CG YH CV++ + R F+CPQH C C
Sbjct: 741 TECKTGHRTCFSCKIPGKDVKRCSVPACGKFYHEACVRKFATAVFESRGFRCPQHCCSTC 800
Query: 168 RQRLQ---------WRCVRCTIASH 183
RC+RC IA H
Sbjct: 801 SMEKDIHKASKGRMMRCLRCPIAYH 825
>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Takifugu rubripes]
Length = 1169
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 76/180 (42%), Positives = 105/180 (58%), Gaps = 7/180 (3%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + ++I +T CGWG+ A I KG F+IEY+GEVID+ C
Sbjct: 758 QVCAAGERCQNQAFTKREYTPVEIYRTLSCGWGLRAVSDIKKGAFVIEYVGEVIDEEECR 817
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN +RF+NH C PNC +KW V G+TRVG+
Sbjct: 818 ARIKHAQENDIFNFYMLTLDKDRIIDAGPKGNQARFMNHCCQPNCETQKWTVNGDTRVGL 877
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK----IGKLELCWGSKR 430
FA + I G+ L ++Y G + C CGA +C G+LG + K KL+L G +R
Sbjct: 878 FALQDIPKGKELNFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNQPSAEKLKLKEGKRR 937
>gi|395507428|ref|XP_003758026.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Sarcophilus harrisii]
Length = 1437
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/173 (42%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ + NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1239 GDVRVGLFALCDIPAGVELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPKTA 1291
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C RC+RC I
Sbjct: 765 CSVTACGKFYHEACVRKFATAVFESRGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPI 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|395507430|ref|XP_003758027.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Sarcophilus harrisii]
Length = 1389
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/173 (42%), Positives = 101/173 (58%), Gaps = 3/173 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1071 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1130
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ + NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1131 IDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1190
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIG 420
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1191 GDVRVGLFALCDIPAGVELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPKTA 1243
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ + R F+CPQH C C RC+RC I
Sbjct: 765 CSVTACGKFYHEACVRKFATAVFESRGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPI 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|94732456|emb|CAK03662.1| novel protein similar to vertebrate Wolf-Hirschhorn syndrome
candidate 1 (WHSC1) [Danio rerio]
Length = 728
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 100/166 (60%), Gaps = 3/166 (1%)
Query: 258 KACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + KI++T GWG+ + I KGEF+ EY+GE+ID+ C
Sbjct: 402 QVCPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECR 461
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + +FYM I KD IDA KGN+SRF+NHSC PNC +KW V G+TRVG+
Sbjct: 462 SRIRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGL 521
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIG 420
FA I AG LT++Y G E C CGA +C G+LG + K G
Sbjct: 522 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGAPNCSGFLGDRPKNG 567
>gi|6841376|gb|AAF29041.1|AF161554_1 HSPC069 [Homo sapiens]
Length = 591
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 67/170 (39%), Positives = 109/170 (64%), Gaps = 3/170 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 77 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 136
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 137 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 196
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIG 420
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++
Sbjct: 197 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVS 246
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 71/164 (43%), Positives = 98/164 (59%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C + C N+ F K + ++I +T CGWG+ I KG FI EY+GEVID+ C
Sbjct: 1627 QVCAAGDRCQNQSFTKRQYTPVEIFRTLSCGWGLRGLSDIRKGAFISEYVGEVIDEEECR 1686
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN +RF+NHSC PNC +KW V G+TRVG+
Sbjct: 1687 ARIRHAQENDICNFYMLTLDKDRIIDAGPKGNQARFMNHSCQPNCETQKWTVNGDTRVGL 1746
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
FA + + GE LT++Y G + C CGA +C G+LG + K
Sbjct: 1747 FALQDVPKGEELTFNYNLECRGNGKTACKCGAPNCSGFLGVRPK 1790
>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
Length = 742
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 421 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 480
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 481 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 540
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 541 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 585
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 107 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 166
Query: 180 IASH 183
+A H
Sbjct: 167 VAYH 170
>gi|157821603|ref|NP_001099560.1| histone-lysine N-methyltransferase NSD3 [Rattus norvegicus]
gi|149057818|gb|EDM09061.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (predicted) [Rattus
norvegicus]
Length = 1396
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/170 (44%), Positives = 99/170 (58%), Gaps = 3/170 (1%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 1080 QYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELI 1139
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 1140 DEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 1199
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CGA +C G+LG + K
Sbjct: 1200 DVRVGLFALCDIPAGMELTFNYNLDCLGNGRTVCHCGADNCSGFLGVRPK 1249
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C +C+RC I
Sbjct: 766 CSVSVCGKFYHEACVRKFPTAIFESKGFRCPQHCCSSCSMEKDIHKASKGRMMKCLRCPI 825
Query: 181 ASH 183
A H
Sbjct: 826 AYH 828
>gi|348520760|ref|XP_003447895.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Oreochromis niloticus]
Length = 1167
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 78/171 (45%), Positives = 100/171 (58%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K + KI+KT GWG+ I KGEF+ EYIGE+
Sbjct: 829 LQYECHPQVCPSGERCCNQDFTKRLYPETKIIKTPGKGWGLICLRDIKKGEFVNEYIGEL 888
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ + +FYM I KD IDA KGN+SRF+NHSC PNC +KW V
Sbjct: 889 IDEEECRARIKYAHENNITDFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVN 948
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
G+TRVG+FA I AG LT++Y G E C CGA +C G+LG + K
Sbjct: 949 GDTRVGLFAVCDIPAGTELTFNYNLDCLGNEKTVCRCGAPNCSGFLGDRPK 999
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 15/75 (20%)
Query: 126 GEEVFCSVRGCGGVYHFICVK-ERLGISNPRNFKCPQHACFIC-----------RQRLQW 173
GE C V CG YH CV+ L + + + F+CP H C C + RL
Sbjct: 518 GEVRRCHVPHCGKFYHEACVRLNTLTVFDNKGFRCPLHTCLSCHYGCRTKHKSNKGRL-M 576
Query: 174 RCVRCTIASH--DKC 186
RC+RC +A H D+C
Sbjct: 577 RCLRCPVAYHIGDQC 591
>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
Length = 715
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 394 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 453
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 454 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 513
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 514 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 558
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 55/136 (40%), Gaps = 20/136 (14%)
Query: 63 GHSVRVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSL----PFLVGAKKMIECRA 118
G +V R R +++ E+ + + +++ + SL FL+G C+
Sbjct: 13 GLVSKVGNRARCFSSVAASEEELLDFSGSELQFNSCSLHLSLHPFFNFLLGIHSCFVCKE 72
Query: 119 CHRFIYHGEEVFCSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR--------- 168
+ C V CG YH CVK+ L + R F+CP H+C C
Sbjct: 73 SKTDVKR-----CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRP 127
Query: 169 -QRLQWRCVRCTIASH 183
+ RCVRC +A H
Sbjct: 128 SKGKMMRCVRCPVAYH 143
>gi|356518575|ref|XP_003527954.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
max]
Length = 2037
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 102/160 (63%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C C + C+N+ F+K K +K K G+G++A E + +G+F+IEY+GEV+D E
Sbjct: 1163 TCPCGDRCSNQQFQKHKYASLKWFKCGKKGYGLKAIENVAQGQFLIEYVGEVLDMQAYEA 1222
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + +G ++FY + IDA+ KGN RF+NHSCDPNC EKW V GE +G+F
Sbjct: 1223 RQREYALKGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLF 1282
Query: 377 AARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSCQGYLG 414
A R IK E LT+DY +V+ FG KCYCG+ +C+GY+G
Sbjct: 1283 ALRDIKKDEELTFDYNYVRVFGAAAKKCYCGSPNCRGYIG 1322
>gi|157278865|gb|AAI15212.1| Whsc1 protein [Danio rerio]
Length = 486
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 100/166 (60%), Gaps = 3/166 (1%)
Query: 258 KACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + KI++T GWG+ + I KGEF+ EY+GE+ID+ C
Sbjct: 160 QVCPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECR 219
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + +FYM I KD IDA KGN+SRF+NHSC PNC +KW V G+TRVG+
Sbjct: 220 SRIRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGL 279
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIG 420
FA I AG LT++Y G E C CGA +C G+LG + K G
Sbjct: 280 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGAPNCSGFLGDRPKNG 325
>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Gallus gallus]
Length = 2412
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 101/162 (62%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C E C N+ F K + +++I +T GWG++A I KGEF+ EY+GE+ID+ C R
Sbjct: 1640 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 1699
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1700 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 1759
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1760 IVNIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1801
>gi|340959559|gb|EGS20740.1| hypothetical protein CTHT_0025760 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 939
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 66/160 (41%), Positives = 104/160 (65%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C E C N+ F++++ K+ ++KTE G+G+ A + +FI EY+GEVI++ R
Sbjct: 163 CNCGEGCQNQRFQRKQYAKVSVIKTEKKGYGLRADTDLQPNDFIYEYVGEVINEPTFRSR 222
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K+ +DAT KGN RF NHSC+PNC ++KW V + R+G+FA
Sbjct: 223 MLKYDKEGIKHFYFMSLTKNEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFA 282
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
AR IKAGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 283 ARYIKAGEELVFNYNVDRYGADPQPCYCGEPNCVGFIGGK 322
>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
Length = 713
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 392 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 451
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 452 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 511
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 512 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 556
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 20/132 (15%)
Query: 67 RVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSL----PFLVGAKKMIECRACHRF 122
+V R R +++ E+ + + +++ + SL FL+G C+
Sbjct: 15 KVGNRARCFSSVAASEEELLDFSGSELQFNSCSLHLSLHPFFNFLLGIHSCFVCKESKTD 74
Query: 123 IYHGEEVFCSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRL 171
+ C V CG YH CVK+ L + R F+CP H+C C +
Sbjct: 75 VKR-----CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGK 129
Query: 172 QWRCVRCTIASH 183
RCVRC +A H
Sbjct: 130 MMRCVRCPVAYH 141
>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
Full=Nuclear SET domain-containing protein 3; AltName:
Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
homolog; Short=WHSC1-like protein 1
Length = 1439
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 99/170 (58%), Gaps = 3/170 (1%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 1123 QYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELI 1182
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 1183 DEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 1242
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CGA +C G+LG + K
Sbjct: 1243 DVRVGLFALCDIPAGMELTFNYNLDCLGNGRTVCHCGADNCSGFLGVRPK 1292
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVSVCGKFYHEACVRKFPTAIFESKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Macaca mulatta]
Length = 713
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 392 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 451
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 452 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 511
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 512 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 556
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 54/132 (40%), Gaps = 20/132 (15%)
Query: 67 RVMKRCRGAKNISGLEDHVAAWVKKKMELGVPQSNCSL----PFLVGAKKMIECRACHRF 122
+V R R N++ E+ + + +++ + SL FL+G C+
Sbjct: 15 KVGNRARCFSNVAVSEEELLDFSGSELQFNSCSLHLSLHPFFNFLLGIHSCFVCKESKTD 74
Query: 123 IYHGEEVFCSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRL 171
+ C V CG YH CVK+ L + R F+CP H+C C +
Sbjct: 75 VKR-----CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGK 129
Query: 172 QWRCVRCTIASH 183
RCVRC +A H
Sbjct: 130 MMRCVRCPVAYH 141
>gi|431897323|gb|ELK06585.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
Length = 502
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 76/164 (46%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI++T+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 114 QVCPAGEFCQNQCFTKRQYPETKIIRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEDECM 173
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 174 ARIKRAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 233
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 234 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPK 277
>gi|384484414|gb|EIE76594.1| hypothetical protein RO3G_01298 [Rhizopus delemar RA 99-880]
Length = 291
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 101/166 (60%), Gaps = 6/166 (3%)
Query: 258 KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
K C C C+N F+K++ K+++ KT+ GWG+ E I KG F+IEY GE+I LC
Sbjct: 127 KHCPCGPKCSNNRFQKKQRKYKLEVFKTDQRGWGLRTLEDIKKGAFVIEYRGEIISQKLC 186
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
E+R+ Y NFY E K IDA KG +RF+NHSCDPNC +EKW GE G
Sbjct: 187 EERMC-TDYVNENNFYFLEYSKGEVIDACTKGTEARFINHSCDPNCHIEKWSYRGEAHFG 245
Query: 375 VFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
VFA++ I A L+YDY F F E + C+CG+ SC+G +G K+K
Sbjct: 246 VFASKDIPAYSELSYDYNFSTFNVENEQMCHCGSESCRGTIGKKKK 291
>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
construct]
Length = 1446
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 99/170 (58%), Gaps = 3/170 (1%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 1130 QYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELI 1189
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 1190 DEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 1249
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CGA +C G+LG + K
Sbjct: 1250 DVRVGLFALCDIPAGMELTFNYNLDCLGNGRTVCHCGADNCSGFLGVRPK 1299
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 775 CSVSVCGKFYHEACVRKFPTAIFESKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPV 834
Query: 181 ASH 183
A H
Sbjct: 835 AYH 837
>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like, partial [Meleagris gallopavo]
Length = 2336
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 101/162 (62%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C E C N+ F K + +++I +T GWG++A I KGEF+ EY+GE+ID+ C R
Sbjct: 1628 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 1687
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1688 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 1747
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1748 IVNIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1789
>gi|296808927|ref|XP_002844802.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238844285|gb|EEQ33947.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 969
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 113/193 (58%), Gaps = 8/193 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C + C C + C N+ F+K + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 172 MECVQNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLQPNDFIFEYIGEVINE 231
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 232 PQFRKRMIQYDEEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKL 291
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R +KAGE L ++Y ++G + + CYCG ++C GY+G GK + +K
Sbjct: 292 RMGIFAERHVKAGEELVFNYNVDRYGADPQPCYCGEANCLGYIG-----GKTQTERATKL 346
Query: 431 KRSSTACLAIITL 443
++ L I L
Sbjct: 347 SNATIEALGIDDL 359
>gi|390342260|ref|XP_003725626.1| PREDICTED: uncharacterized protein LOC578079 isoform 1
[Strongylocentrotus purpuratus]
Length = 3023
Score = 145 bits (365), Expect = 5e-32, Method: Composition-based stats.
Identities = 71/173 (41%), Positives = 101/173 (58%), Gaps = 17/173 (9%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
V + C C C + C NR F+K + ++ + TE G G++A E + EF++EY+GEV
Sbjct: 1637 VLMLECGSRCPCGDYCTNRRFQKRENARVGVFYTEEKGHGLKAKEELKDNEFVMEYVGEV 1696
Query: 309 IDDALCEQRLWDMKYRGVQ-------NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCI 361
++ + K+R Q +FY ++ D IDAT KGN SRF+NHSCDPNC
Sbjct: 1697 LN-------FHEFKHRAKQYSKDKNLHFYFMALKSDEIIDATEKGNVSRFMNHSCDPNCE 1749
Query: 362 LEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYL 413
+KW V G+ RVG F R +K GE LT+DY+F +G E KC CG+ C+G +
Sbjct: 1750 TQKWTVNGQLRVGFFTKRQVKPGEELTFDYQFEVYGQEAQKCLCGSEKCRGVI 1802
>gi|390342258|ref|XP_783359.3| PREDICTED: uncharacterized protein LOC578079 isoform 2
[Strongylocentrotus purpuratus]
Length = 3024
Score = 145 bits (365), Expect = 5e-32, Method: Composition-based stats.
Identities = 71/173 (41%), Positives = 101/173 (58%), Gaps = 17/173 (9%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
V + C C C + C NR F+K + ++ + TE G G++A E + EF++EY+GEV
Sbjct: 1637 VLMLECGSRCPCGDYCTNRRFQKRENARVGVFYTEEKGHGLKAKEELKDNEFVMEYVGEV 1696
Query: 309 IDDALCEQRLWDMKYRGVQ-------NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCI 361
++ + K+R Q +FY ++ D IDAT KGN SRF+NHSCDPNC
Sbjct: 1697 LN-------FHEFKHRAKQYSKDKNLHFYFMALKSDEIIDATEKGNVSRFMNHSCDPNCE 1749
Query: 362 LEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYL 413
+KW V G+ RVG F R +K GE LT+DY+F +G E KC CG+ C+G +
Sbjct: 1750 TQKWTVNGQLRVGFFTKRQVKPGEELTFDYQFEVYGQEAQKCLCGSEKCRGVI 1802
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 101/169 (59%), Gaps = 5/169 (2%)
Query: 257 SKACHCSETCNNRPFRKEKKIKIV----KTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
S+ C + C N+ F+K + + + + GWG++ + I KG+F+ EY+GE++D+
Sbjct: 1303 SQVCPAGDNCQNQRFQKMQYPETIPFRTDEKGRGWGLKTTQDIKKGDFVHEYVGELVDEE 1362
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
C +R+ + + NFYM I KD IDA KGN +RF+NHSCDPNC KW + +TR
Sbjct: 1363 TCRERIKKCQQLDIDNFYMLTIDKDHVIDAGPKGNLARFMNHSCDPNCETMKWTILPDTR 1422
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIG 420
VG+FA R I AG LT++Y G E KC CGA +C GY+G + +
Sbjct: 1423 VGLFAKRDITAGSELTFNYNLDCLGNEKKKCECGAKNCSGYIGVRPRTA 1471
>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
Length = 1381
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 99/170 (58%), Gaps = 3/170 (1%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 1065 QYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELI 1124
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 1125 DEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 1184
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CGA +C G+LG + K
Sbjct: 1185 DVRVGLFALCDIPAGMELTFNYNLDCLGNGRTVCHCGADNCSGFLGVRPK 1234
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 750 CSVSVCGKFYHEACVRKFPTAIFESKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPV 809
Query: 181 ASH 183
A H
Sbjct: 810 AYH 812
>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
guttata]
Length = 1339
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/164 (46%), Positives = 99/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1048 QVCPAGERCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECM 1107
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1108 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1167
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGA +C G+LG + K
Sbjct: 1168 FAVCDIPAGTELTFNYNLDCLGNEKTVCKCGAPNCSGFLGDRPK 1211
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 734 CVVSHCGKFYHEACVKKFHLTVFENRGFRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCP 793
Query: 180 IASH 183
+A H
Sbjct: 794 VAYH 797
>gi|12642795|gb|AAK00344.1|AF330040_1 IL-5 promoter REII-region-binding protein [Homo sapiens]
gi|119602961|gb|EAW82555.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_g [Homo sapiens]
gi|133777178|gb|AAH94825.2| Wolf-Hirschhorn syndrome candidate 1 [Homo sapiens]
Length = 584
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 263 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 322
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 323 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 382
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 383 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 427
>gi|380797995|gb|AFE70873.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
[Macaca mulatta]
Length = 421
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 100 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 159
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 160 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 219
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 220 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 264
>gi|260941922|ref|XP_002615127.1| hypothetical protein CLUG_05142 [Clavispora lusitaniae ATCC 42720]
gi|238851550|gb|EEQ41014.1| hypothetical protein CLUG_05142 [Clavispora lusitaniae ATCC 42720]
Length = 717
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 67/161 (41%), Positives = 105/161 (65%), Gaps = 3/161 (1%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
+C C + C N+ F+K++ +K+++TE G+G+ A + +G F+ EYIGEVID+
Sbjct: 83 SCTCGQNCQNQRFQKKQYSSVKVIQTEKKGYGLVAQADVPEGSFVYEYIGEVIDEKTFRA 142
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ D R +++FY + KD IDAT KG+ +RF NHSC PN ++KW V + R+G+F
Sbjct: 143 RMLDYDRRNLRHFYFMMLTKDAFIDATEKGSLARFCNHSCSPNAYVDKWVVGDKLRMGIF 202
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
A R I AGE +T+DY ++G + + CYCG+ +C G++G K
Sbjct: 203 ARRRILAGEEITFDYNVDRYGAQQQPCYCGSPNCMGWIGGK 243
>gi|327300913|ref|XP_003235149.1| SET and WW domain-containing protein [Trichophyton rubrum CBS
118892]
gi|326462501|gb|EGD87954.1| SET and WW domain-containing protein [Trichophyton rubrum CBS
118892]
Length = 971
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 106/171 (61%), Gaps = 3/171 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F+K + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 172 MECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFIFEYIGEVINE 231
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 232 PQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKL 291
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGK 421
R+G+FA + IKAGE L ++Y ++G + + CYCG ++C GY+G K + G+
Sbjct: 292 RMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYCGEANCLGYIGGKTQTGR 342
>gi|26347387|dbj|BAC37342.1| unnamed protein product [Mus musculus]
Length = 601
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 280 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 339
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 340 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 399
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 400 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 444
>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Cricetulus griseus]
Length = 1387
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 99/170 (58%), Gaps = 3/170 (1%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 1071 QYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRSIKKGEFVNEYVGELI 1130
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 1131 DEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 1190
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CG+ +C G+LG + K
Sbjct: 1191 DVRVGLFAICDIPAGMELTFNYNLDCLGNGRTVCHCGSDNCSGFLGVRPK 1240
Score = 42.0 bits (97), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 33/69 (47%), Gaps = 11/69 (15%)
Query: 126 GEEV-FCSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WR 174
GE+V CSV CG YH CV++ I + F+CPQH C C R
Sbjct: 753 GEDVKRCSVSVCGKFYHEACVRKFPTAIFESKGFRCPQHCCSSCSMEKDIHKASKGRMMR 812
Query: 175 CVRCTIASH 183
C+RC +A H
Sbjct: 813 CLRCPVAYH 821
>gi|190349638|gb|ACE75882.1| multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP [Homo sapiens]
Length = 704
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 383 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 442
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 443 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 502
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 503 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 547
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 69 CVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCP 128
Query: 180 IASH 183
+A H
Sbjct: 129 VAYH 132
>gi|242019388|ref|XP_002430143.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus
humanus corporis]
gi|212515234|gb|EEB17405.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus
humanus corporis]
Length = 1448
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 69/165 (41%), Positives = 103/165 (62%), Gaps = 3/165 (1%)
Query: 253 YISCSKACHCSETCNNRPFRKEKKI--KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
++ C C + C N+ F+K + KI KTE G+G+ A E ++ FI+EY+GEV++
Sbjct: 538 FLECGSRCALGDRCTNKRFQKLQYANCKIFKTEKKGFGLRAEEDLSGNTFIMEYVGEVVN 597
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+R+ ++FY ++ D IDAT KGN SRF+NHSCDPN +KW + GE
Sbjct: 598 QKEFGRRVKMYAKENNKHFYFMALKGDAVIDATNKGNISRFINHSCDPNAETQKWTINGE 657
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
RVG F R + AGE +T+DY+F ++G + KCYC AS+C+G++G
Sbjct: 658 LRVGFFTRRFVAAGEEITFDYQFQRYGKQAQKCYCEASNCRGWIG 702
>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Cricetulus griseus]
Length = 1436
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 99/170 (58%), Gaps = 3/170 (1%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 1120 QYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRSIKKGEFVNEYVGELI 1179
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 1180 DEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 1239
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CG+ +C G+LG + K
Sbjct: 1240 DVRVGLFAICDIPAGMELTFNYNLDCLGNGRTVCHCGSDNCSGFLGVRPK 1289
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 33/69 (47%), Gaps = 11/69 (15%)
Query: 126 GEEV-FCSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WR 174
GE+V CSV CG YH CV++ I + F+CPQH C C R
Sbjct: 759 GEDVKRCSVSVCGKFYHEACVRKFPTAIFESKGFRCPQHCCSSCSMEKDIHKASKGRMMR 818
Query: 175 CVRCTIASH 183
C+RC +A H
Sbjct: 819 CLRCPVAYH 827
>gi|62531333|gb|AAH93421.1| Whsc1 protein [Danio rerio]
Length = 320
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 98/158 (62%), Gaps = 3/158 (1%)
Query: 266 CNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKY 323
C N+ F K + KI++T GWG+ + I KGEF+ EY+GE+ID+ C R+ + +
Sbjct: 2 CQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSRIRNAQE 61
Query: 324 RGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKA 383
+ +FYM I KD IDA KGN+SRF+NHSC PNC +KW V G+TRVG+FA I A
Sbjct: 62 NDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPA 121
Query: 384 GEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIG 420
G LT++Y G E C CGA +C G+LG + K G
Sbjct: 122 GTELTFNYNLDCLGNEKTVCRCGAPNCSGFLGDRPKNG 159
>gi|355729163|gb|AES09785.1| Wolf-Hirschhorn syndrome candidate 1 [Mustela putorius furo]
Length = 409
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 89 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 148
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 149 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 208
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 209 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 253
>gi|302677464|ref|XP_003028415.1| hypothetical protein SCHCODRAFT_70386 [Schizophyllum commune H4-8]
gi|300102103|gb|EFI93512.1| hypothetical protein SCHCODRAFT_70386 [Schizophyllum commune H4-8]
Length = 883
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 102/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
CHC + C N+ F++++ I+IVKTE G+G+ A + I + FI EY+G+V++ ++R
Sbjct: 129 CHCRQFCQNQRFQRKEYAPIEIVKTEKKGFGLRAEQDIQRDTFIYEYVGDVVNSPSFKKR 188
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ D G+Q+FY ++KD IDAT G RF NHSC PNC + KW V R+G+FA
Sbjct: 189 MRDYAKEGIQHFYFMMLQKDEFIDATKSGGIGRFANHSCSPNCYVAKWTVGDRVRMGIFA 248
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+IK E LT++Y ++G + + CYCG +C GY+G K
Sbjct: 249 KRNIKKYEELTFNYNVDRYGHQAQTCYCGEPNCVGYIGGK 288
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ E I KGEF+ EY+GE+ID+ C R
Sbjct: 1822 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTQEDIRKGEFVNEYVGELIDEEECMAR 1881
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+FA
Sbjct: 1882 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1941
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1942 VCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGVRPK 1983
>gi|47226564|emb|CAG08580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1404
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/169 (42%), Positives = 100/169 (59%), Gaps = 3/169 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K + ++VKT GWG++A +PI KGEF+IEY+GEV
Sbjct: 1109 LQYECHPQVCPAGDNCENQCFTKRLYAETEVVKTADRGWGLKANQPIKKGEFVIEYVGEV 1168
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID C+QR+ + NFYM + KD IDA KGN SRF+NHSC PNC +KW V
Sbjct: 1169 IDAEECQQRIKRAHENHMTNFYMLTLTKDRVIDAGQKGNLSRFINHSCSPNCETQKWTVN 1228
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK 416
G+ +G+FA I+ LT++Y G C CG+ +C G+LG +
Sbjct: 1229 GDVHIGLFALCDIETDTELTFNYNLHCVGNRRATCNCGSDNCSGFLGVQ 1277
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Query: 126 GEEVF-CSVRGCGGVYHFICVKERLGISNPR--NFKCPQHACFIC 167
G EV CSV GCG YH CV++ LG ++ F CPQH C C
Sbjct: 771 GREVTRCSVSGCGCFYHEDCVRKLLGTTSSPGGGFCCPQHICSTC 815
>gi|238883879|gb|EEQ47517.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 846
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 103/164 (62%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
++ C C + C N+ F R+ K+K+++TE G+G+ A + I + +FI EYIGEVID+
Sbjct: 121 VNRNCLCGDDCQNQRFQNRQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDEIS 180
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
QR+ + R +++FY + D IDAT KG+ RF+NHSC+PN ++KW V R+
Sbjct: 181 FRQRMIEYDLRHLKHFYFMMLSNDSFIDATEKGSLGRFINHSCNPNAFVDKWHVGDRLRM 240
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 241 GIFAKRKISRGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGK 284
>gi|68483319|ref|XP_714401.1| likely histone lysine methyltransferase Set2p [Candida albicans
SC5314]
gi|68483410|ref|XP_714361.1| likely histone lysine methyltransferase Set2p [Candida albicans
SC5314]
gi|74585304|sp|Q59XV0.1|SET2_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|46435920|gb|EAK95292.1| likely histone lysine methyltransferase Set2p [Candida albicans
SC5314]
gi|46435962|gb|EAK95333.1| likely histone lysine methyltransferase Set2p [Candida albicans
SC5314]
Length = 844
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 103/164 (62%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
++ C C + C N+ F R+ K+K+++TE G+G+ A + I + +FI EYIGEVID+
Sbjct: 121 VNRNCLCGDDCQNQRFQNRQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDEIS 180
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
QR+ + R +++FY + D IDAT KG+ RF+NHSC+PN ++KW V R+
Sbjct: 181 FRQRMIEYDLRHLKHFYFMMLSNDSFIDATEKGSLGRFINHSCNPNAFVDKWHVGDRLRM 240
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 241 GIFAKRKISRGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGK 284
>gi|225562106|gb|EEH10386.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
Length = 1000
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F++ + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 183 MECVGDCGCGDECENQRFQRRQYADVSVIKTEKKGYGLRAESDLRPNQFIFEYIGEVINE 242
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
L +R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 243 TLFRKRMLSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 302
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 303 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 357
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 358 SNATIEALGI 367
>gi|31418293|gb|AAH53454.1| Whsc1 protein, partial [Mus musculus]
Length = 558
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 100/165 (60%), Gaps = 3/165 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 237 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 296
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 297 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 356
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 357 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 401
>gi|403072167|pdb|4FMU|A Chain A, Crystal Structure Of Methyltransferase Domain Of Human Set
Domain- Containing Protein 2 Compound: Pr-Snf
gi|407944022|pdb|4H12|A Chain A, The Crystal Structure Of Methyltransferase Domain Of Human
Set Domain- Containing Protein 2 In Complex With
S-Adenosyl-L-Homocysteine
Length = 278
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 114/183 (62%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F++++ ++++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 94 IECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDH 153
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ D IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 154 KEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQL 213
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
RVG F + + +G LT+DY+F ++G E KC+CG+++C+GYLG + ++ K+
Sbjct: 214 RVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKK 273
Query: 431 KRS 433
+RS
Sbjct: 274 ERS 276
>gi|325091551|gb|EGC44861.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 1000
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F++ + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 183 MECVGDCGCGDECENQRFQRRQYAHVSVIKTEKKGYGLRAESDLRPNQFIFEYIGEVINE 242
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
L +R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 243 TLFRKRMLSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 302
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 303 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 357
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 358 SNATIEALGI 367
>gi|326674803|ref|XP_003200208.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Danio
rerio]
Length = 1428
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 69/183 (37%), Positives = 109/183 (59%), Gaps = 3/183 (1%)
Query: 254 ISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C C+NR F ++ +++ TE GWG+ AA+ + F++EY GEV+D
Sbjct: 558 IECSSRCLNGAYCSNRRFQMKQHADYEVILTESKGWGLRAAKDLQPNTFVLEYCGEVLDH 617
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R+ + ++Y ++ + IDAT KGN SRF+NHSC+PNC +KW V G+
Sbjct: 618 REFKARVKEYARNKNIHYYFMALKNNEIIDATLKGNCSRFMNHSCEPNCETQKWTVNGQL 677
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G F +++ AG LT+DY+F ++G E KC+CGA SC+G +G + ++ +R
Sbjct: 678 RIGFFTTKAVTAGTELTFDYQFQRYGKEAQKCFCGAPSCRGLIGGETRVSVRAAGGKKQR 737
Query: 431 KRS 433
+RS
Sbjct: 738 ERS 740
>gi|340515729|gb|EGR45981.1| predicted protein [Trichoderma reesei QM6a]
Length = 933
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 114/191 (59%), Gaps = 9/191 (4%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS + +C C NR F++++ + ++KTE G+G+ A P+ +FI EYIGEVI+
Sbjct: 156 MECSADSANCGGGCQNRRFQRKQYADVTVIKTEKKGFGLRANSPLEPNDFIYEYIGEVIN 215
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ G+++FY + K+ +DAT KGN RF NHSC PNC ++KW V +
Sbjct: 216 EPTFRRRMLQYDEEGIKHFYFMSLNKNEFVDATKKGNLGRFCNHSCSPNCFVDKWVVGDK 275
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSK 429
R+G+FA R I+AGE L ++Y ++G E + CYCG ++C G++G GK + +K
Sbjct: 276 LRMGIFALRKIQAGEELVFNYNVDRYGAEPQPCYCGEANCVGFIG-----GKTQTERATK 330
Query: 430 RKRSSTACLAI 440
++ L I
Sbjct: 331 LPTATVEALGI 341
>gi|315048637|ref|XP_003173693.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
gi|311341660|gb|EFR00863.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
Length = 967
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 113/193 (58%), Gaps = 8/193 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C + C C + C N+ F+K + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 172 MECVQNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANADLRPNDFIFEYIGEVINE 231
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 232 PQFRKRMIQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKL 291
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA + IKAGE L ++Y ++G + + CYCG ++C GY+G GK + +K
Sbjct: 292 RMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYCGEANCLGYIG-----GKTQTERATKL 346
Query: 431 KRSSTACLAIITL 443
++ L I L
Sbjct: 347 SNATIEALGIDDL 359
>gi|302664281|ref|XP_003023773.1| hypothetical protein TRV_02085 [Trichophyton verrucosum HKI 0517]
gi|291187786|gb|EFE43155.1| hypothetical protein TRV_02085 [Trichophyton verrucosum HKI 0517]
Length = 973
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 112/193 (58%), Gaps = 8/193 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F+K + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 173 MECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFIYEYIGEVINE 232
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 233 PQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKL 292
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA + IKAGE L ++Y ++G + + CYCG ++C GY+G GK + +K
Sbjct: 293 RMGIFAEKHIKAGEELVFNYNVDRYGADPQPCYCGEANCLGYIG-----GKTQTERATKL 347
Query: 431 KRSSTACLAIITL 443
++ L I L
Sbjct: 348 SNATIEALGIDDL 360
>gi|451994892|gb|EMD87361.1| hypothetical protein COCHEDRAFT_1144880 [Cochliobolus
heterostrophus C5]
Length = 923
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C C N+ F++++ + ++KTE G+G+ A + + GEF+ EYIGEVID+
Sbjct: 185 MECVGDCGCGRRCQNQRFQRKQYANVAVIKTEKKGFGLRANKDMAPGEFVFEYIGEVIDE 244
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 245 RTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKL 304
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F R +KAGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 305 RMGIFVERQVKAGEELVFNYNVDRYGADPQPCYCGEPNCSGFIGGK 350
>gi|451846131|gb|EMD59442.1| hypothetical protein COCSADRAFT_258710 [Cochliobolus sativus
ND90Pr]
Length = 923
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C C N+ F++++ + ++KTE G+G+ A + + GEF+ EYIGEVID+
Sbjct: 185 MECVGDCGCGRRCQNQRFQRKQYANVAVIKTEKKGFGLRANKDMAPGEFVFEYIGEVIDE 244
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 245 RTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKL 304
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F R +KAGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 305 RMGIFVERQVKAGEELVFNYNVDRYGADPQPCYCGEPNCSGFIGGK 350
>gi|189195894|ref|XP_001934285.1| histone-lysine N-methyltransferase [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980164|gb|EDU46790.1| histone-lysine N-methyltransferase [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 948
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 105/166 (63%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C TC N+ F++++ + ++KTE G+G+ A + + G+F+ EYIGEVID+
Sbjct: 185 MECVGDCSCGRTCQNQRFQRKQYADVTVIKTEKKGFGLRANKNMVPGDFVFEYIGEVIDE 244
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 245 RTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKL 304
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F R+++AGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 305 RMGIFVERNVRAGEELVFNYNVDRYGADPQPCYCGEPNCSGFIGGK 350
>gi|361129227|gb|EHL01139.1| putative Histone-lysine N-methyltransferase, H3 lysine-36 specific
[Glarea lozoyensis 74030]
Length = 839
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 110/174 (63%), Gaps = 3/174 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ ++ + C C +TC N+ F+++ + ++KT+ G+G+ A ++ G+F+ EYIGEV
Sbjct: 95 IDGVNNAYKCKCGDTCQNQRFQRQDYADVSVIKTDKKGYGLRANTDLDAGDFVFEYIGEV 154
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I++ +R + G+++FY + KD +DAT KGN RF NHSC+PNC ++KW V+
Sbjct: 155 INEPTFRKRTIEYHKEGIKHFYFMSLGKDEFVDATKKGNLGRFCNHSCNPNCFVDKWIVK 214
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGK 421
+ R+G+FA R I+ GE L ++Y ++G + + CYCG +C G++G K + +
Sbjct: 215 DKLRMGIFAQRPIQKGEELVFNYNVDRYGADPQPCYCGEPNCGGFMGGKTQTDR 268
>gi|332026544|gb|EGI66662.1| Histone-lysine N-methyltransferase SETD2 [Acromyrmex echinatior]
Length = 1841
Score = 143 bits (361), Expect = 1e-31, Method: Composition-based stats.
Identities = 69/164 (42%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + K ++ +TE G+G+ A I GEFI+EY+GEV+D
Sbjct: 849 IECGPRCVVGDRCTNKRFQNCQYAKCEVFRTEKKGFGLRAVVDIMAGEFIMEYVGEVVDP 908
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R + +++Y ++ D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 909 KDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDPNAETQKWTVNGEL 968
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F + I AGE +T+DY F ++G E KCYC A +C+G++G
Sbjct: 969 RIGFFNKKFIAAGEEITFDYHFQRYGKEAQKCYCEALNCRGWIG 1012
>gi|326488341|dbj|BAJ93839.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1070
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 102/166 (61%), Gaps = 5/166 (3%)
Query: 254 ISCSK-ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C+K C C E C+N+ F++ KI+ + G+G++ E +++G F+IEY+GEV+D
Sbjct: 320 IECAKRTCPCEEQCSNQQFQRRNYAKIEWFHSGKKGYGLQLLEEVSEGRFLIEYVGEVLD 379
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E R D +G ++FY + IDA KGN RF+NHSC PNC EKW V GE
Sbjct: 380 ITTYESRQRDYASKGKKHFYFMALDGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGE 439
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLG 414
+G+FA R+IK GE LT+DY +V+ KC+CG + C+GY+G
Sbjct: 440 VCIGIFAMRNIKKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYIG 485
>gi|302505469|ref|XP_003014441.1| hypothetical protein ARB_07003 [Arthroderma benhamiae CBS 112371]
gi|291178262|gb|EFE34052.1| hypothetical protein ARB_07003 [Arthroderma benhamiae CBS 112371]
Length = 973
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 112/193 (58%), Gaps = 8/193 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F+K + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 173 MECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFIYEYIGEVINE 232
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 233 PQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKL 292
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA + IKAGE L ++Y ++G + + CYCG ++C GY+G GK + +K
Sbjct: 293 RMGIFAEKHIKAGEELVFNYNVDRYGADPQPCYCGEANCLGYIG-----GKTQTERATKL 347
Query: 431 KRSSTACLAIITL 443
++ L I L
Sbjct: 348 SNATIEALGIDDL 360
>gi|393236319|gb|EJD43868.1| hypothetical protein AURDEDRAFT_114444 [Auricularia delicata
TFB-10046 SS5]
Length = 882
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F K + +I IV+TE G+G+ AA + K +FI EY+GEV+++ +R
Sbjct: 128 CRCGVHCQNQRFAKRQYAQIHIVQTEKKGFGLRAATDLKKDDFIYEYVGEVVNNTQFMKR 187
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY ++K+ IDAT KG RF NHSC+PNC + KW V R+G+FA
Sbjct: 188 MREYADEGIRHFYFMMLQKEEFIDATKKGGIGRFANHSCNPNCYVAKWTVGKRIRMGIFA 247
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R++KAGE LT++Y ++G E + CYCG +C G++G K
Sbjct: 248 QRNVKAGEELTFNYNVDRYGHEAQTCYCGEPNCVGFIGGK 287
>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
Length = 2421
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/164 (43%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C +E C N+ F K + +++I +T GWG+ I KG+F+ EY+GEVID+ C
Sbjct: 1981 QVCPAAERCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECR 2040
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN +RF+NHSC PNC +KW V G+TRVG+
Sbjct: 2041 SRIRHAQDNNICNFYMLTLDKDRIIDAGPKGNEARFMNHSCQPNCETQKWTVNGDTRVGL 2100
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G + C CGAS+C G+LG + K
Sbjct: 2101 FALIDIAAGTELTFNYNLECLGNRKTVCKCGASNCSGFLGLRPK 2144
>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
Length = 2055
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/165 (43%), Positives = 99/165 (60%), Gaps = 3/165 (1%)
Query: 257 SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
S+ C E C N+ F K + +++I +T GWG+ + I KG F+ EY+GEVID+ C
Sbjct: 1605 SQVCPAGERCQNQSFTKRQYTEVEIFRTLSRGWGLRSISDIKKGAFVNEYVGEVIDEEEC 1664
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
R+ + + + NFYM + KD IDA KGN SRF+NHSC PNC +KW V G+TRVG
Sbjct: 1665 RSRIKNAQDNDICNFYMLTLDKDRIIDAGPKGNESRFMNHSCQPNCETQKWTVNGDTRVG 1724
Query: 375 VFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+FA I G LT++Y G + C CGA +C G+LG + K
Sbjct: 1725 LFALEDIPKGVELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1769
>gi|241952364|ref|XP_002418904.1| SET domain-containing protein, putative; histone methyltransferase,
putative; histone-lysine N-methyltransferase, putative
[Candida dubliniensis CD36]
gi|223642243|emb|CAX44211.1| SET domain-containing protein, putative [Candida dubliniensis CD36]
Length = 837
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 108/173 (62%), Gaps = 7/173 (4%)
Query: 251 VQYISC----SKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEY 304
+ I+C +K C C + C N+ F+ + K+K+++TE G+G+ A + I + +FI EY
Sbjct: 112 INRITCVECINKNCLCGDDCQNQRFQNCQYSKVKVIQTELKGYGLIAEQDIEENQFIYEY 171
Query: 305 IGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEK 364
IGEVID++ QR+ + R +++FY + D IDAT KG+ RF+NHSC+PN ++K
Sbjct: 172 IGEVIDESSFRQRMIEYDLRHLKHFYFMMLSNDSFIDATQKGSLGRFINHSCNPNAFVDK 231
Query: 365 WQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
W V R+G+FA R I GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 232 WHVGDRLRMGIFAKRKILRGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGK 284
>gi|403333784|gb|EJY66014.1| hypothetical protein OXYTRI_13823 [Oxytricha trifallax]
Length = 1936
Score = 143 bits (361), Expect = 2e-31, Method: Composition-based stats.
Identities = 76/189 (40%), Positives = 109/189 (57%), Gaps = 21/189 (11%)
Query: 250 PVQYISCSKAC--------HCSETCNNRPFRKEKKIKI--------VKTEFCGWGVEAAE 293
P+Q +C + C C++TC + F K ++ ++ KTE G+G+ A E
Sbjct: 117 PLQSFNCGERCINRCVSTECCAQTCPSGAFCKNRRFQLHQNAYVFPAKTEKKGYGLFAGE 176
Query: 294 PINKGEFIIEYIGEV--IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRF 351
I KG FI++Y+GEV +D L +QR+ ++YR Y+ + ID T+ GN +RF
Sbjct: 177 FIPKGTFIMQYVGEVFSVDTDLGQQRV--LEYRKSTCTYLMRTTNNEVIDPTYVGNVARF 234
Query: 352 LNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFG-PEVKCYCGASSCQ 410
+NHSC+PNC +KW V GE VG+F+ R I E L++DY+F F P KCYCG S C+
Sbjct: 235 INHSCEPNCETQKWNVLGEVCVGIFSLRDIHENEELSFDYQFDFFKTPFTKCYCGTSKCK 294
Query: 411 GYLGTKRKI 419
GYLG KI
Sbjct: 295 GYLGVLSKI 303
>gi|366998103|ref|XP_003683788.1| hypothetical protein TPHA_0A02720 [Tetrapisispora phaffii CBS 4417]
gi|357522083|emb|CCE61354.1| hypothetical protein TPHA_0A02720 [Tetrapisispora phaffii CBS 4417]
Length = 707
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 99/158 (62%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F+K++ I I KTE G+GV A + + +FI EY GEVI++ +RL
Sbjct: 99 CGDNCENQKFQKKEYADISIFKTELKGYGVRANSDLEENQFIYEYKGEVIEENEFRERLI 158
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ RG ++FY ++ IDAT KG +RF NHSC+PN + KW+V G+ R+G+FA R
Sbjct: 159 EYDQRGFKHFYFMMLQSGEFIDATIKGALARFCNHSCNPNAYVNKWEVAGKLRMGIFAKR 218
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G KCYCG S+C GY+G K
Sbjct: 219 RISKGEEITFDYNVDRYGATAQKCYCGESNCIGYMGGK 256
>gi|344238567|gb|EGV94670.1| Histone-lysine N-methyltransferase NSD3 [Cricetulus griseus]
Length = 620
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 99/170 (58%), Gaps = 3/170 (1%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 304 QYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRSIKKGEFVNEYVGELI 363
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ V NFYM + KD IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 364 DEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 423
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CG+ +C G+LG + K
Sbjct: 424 DVRVGLFAICDIPAGMELTFNYNLDCLGNGRTVCHCGSDNCSGFLGVRPK 473
>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Amphimedon queenslandica]
Length = 1171
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/175 (40%), Positives = 105/175 (60%), Gaps = 4/175 (2%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKI--KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
+ CS + C + C N+ + + + + T GWG++ ++ G+F+IEY+GE++
Sbjct: 751 FTECSTENCPAGDKCQNQRMLRNESVPTQTFYTGNRGWGLKTMRSLSPGDFVIEYVGEIV 810
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D A ++RL + V +FY + ++ IDA K N +RF+NHSCDPNC +KW V G
Sbjct: 811 DMAAVQERLKKTQEASVSSFYFLTLERNLIIDARVKSNHARFINHSCDPNCETQKWTVNG 870
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLE 423
ETR+G+FA + IK LT+DY+F G E K C CGA +C G+LG K K KL+
Sbjct: 871 ETRIGIFAIKDIKEDTELTFDYQFDCLGNEKKACLCGAQNCSGFLGEKPKQEKLK 925
>gi|344230287|gb|EGV62172.1| SET domain-containing protein [Candida tenuis ATCC 10573]
Length = 721
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/161 (42%), Positives = 103/161 (63%), Gaps = 3/161 (1%)
Query: 259 ACHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
AC+C + C N+ F++ I + +TE G+GV A +++G FI EYIGEVID+A +
Sbjct: 91 ACNCGDRCQNQRFQRCEYAAISVFQTEKKGYGVRADTALDEGSFIYEYIGEVIDEATFRR 150
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ D R +++FY ++KD IDAT KG +RF+NHSC PN ++KW V + R+G+F
Sbjct: 151 RMVDYDSRQLRHFYFMMLKKDAFIDATEKGALARFVNHSCSPNAYVDKWVVGDKLRMGIF 210
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
A R I GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 211 AKRHIARGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGK 251
>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
magnipapillata]
Length = 1214
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 257 SKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
+K C C N+ +K KK K E GWG+ A I +GEF+IEY+GE+ID+ C
Sbjct: 830 AKTCPAGNLCQNQQIQKNESKKCHPFKCEGRGWGLMADTDIKQGEFVIEYVGELIDEETC 889
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
+R+ + + + ++Y I KD IDA KGN SRF+NHSC+PNC +KW V GE RV
Sbjct: 890 HRRVREYHEKDIFDYYFLTIDKDNIIDAYPKGNMSRFMNHSCNPNCETQKWTVNGEIRVA 949
Query: 375 VFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGT 415
+FA R IK GE L ++Y G + K C CGA +C G+LG
Sbjct: 950 LFATRDIKMGEELCFNYNLDSLGNDKKQCKCGAVNCSGFLGV 991
>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
gallus]
Length = 1369
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/164 (45%), Positives = 99/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + +I+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1049 QVCPAGERCQNQCFTKREYPETEIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECM 1108
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1109 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1168
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGA +C G+LG + K
Sbjct: 1169 FAVCDIPAGTELTFNYNLDCLGNEKTVCKCGAPNCSGFLGDRPK 1212
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 735 CVVSHCGKFYHEACVKKFHLTVFESRGFRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCP 794
Query: 180 IASH 183
+A H
Sbjct: 795 VAYH 798
>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Meleagris gallopavo]
Length = 1348
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/164 (45%), Positives = 99/164 (60%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + +I+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1028 QVCPAGERCQNQCFTKREYPETEIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECM 1087
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1088 ARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1147
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G E C CGA +C G+LG + K
Sbjct: 1148 FAVCDIPAGTELTFNYNLDCLGNEKTVCKCGAPNCSGFLGDRPK 1191
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 714 CVVSHCGKFYHEACVKKFHLTVFENRGFRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCP 773
Query: 180 IASH 183
+A H
Sbjct: 774 VAYH 777
>gi|326481316|gb|EGE05326.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS
127.97]
Length = 970
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 112/193 (58%), Gaps = 8/193 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F+K + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 173 MECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFIFEYIGEVINE 232
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 233 PQFRKRMIQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKL 292
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA + IKAGE L ++Y ++G + + CYCG ++C GY+G GK + +K
Sbjct: 293 RMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYCGEANCLGYIG-----GKTQTERATKL 347
Query: 431 KRSSTACLAIITL 443
++ L I L
Sbjct: 348 SNATIEALGIDDL 360
>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
echinatior]
Length = 1304
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 101/170 (59%), Gaps = 4/170 (2%)
Query: 251 VQYISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ + CS C C N+ F R+ ++ T GWG+ E I G+F+IEY+GE
Sbjct: 921 ILMVECSPGICPAGAKCMNQSFVLRQYPAMEPFHTMGRGWGLRTLEDIKTGQFVIEYVGE 980
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
+IDDA ++RL K +NFY I + TIDA KGN SRF+NHSC PNC +KW V
Sbjct: 981 IIDDAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPKGNLSRFMNHSCAPNCETQKWTV 1040
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+TR+G+FA R I++GE LT++Y G K C CGA +C G++G K
Sbjct: 1041 NGDTRIGLFALRDIESGEELTFNYNLASDGETRKACLCGAPNCSGFIGLK 1090
>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
Length = 1275
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/163 (42%), Positives = 98/163 (60%), Gaps = 3/163 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 503 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 562
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 563 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 622
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKI 419
IKAG LT++Y G + C CGA +C G+LG + K+
Sbjct: 623 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKV 665
>gi|297602734|ref|NP_001052807.2| Os04g0429100 [Oryza sativa Japonica Group]
gi|255675465|dbj|BAF14721.2| Os04g0429100, partial [Oryza sativa Japonica Group]
Length = 612
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 100/162 (61%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++VKTE GWG+ A E I G+F++EY GEVI ++R
Sbjct: 163 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 222
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+ + Y+ + D +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 223 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 282
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
+ I G L+YDY F F G V+C CGA SC G+LG K +
Sbjct: 283 KQDIPIGTELSYDYNFEWFGGAMVRCLCGAGSCSGFLGAKSR 324
>gi|357619110|gb|EHJ71815.1| putative huntingtin interacting protein [Danaus plexippus]
Length = 225
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/174 (41%), Positives = 104/174 (59%), Gaps = 19/174 (10%)
Query: 254 ISCSKACHCSETCNNRPFRKEKK--IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C+ C E C NR F+K++ +K+ + G GVEA I GEF++EY+GEV+D
Sbjct: 54 IECNSRCPVGERCTNRRFQKKENGPLKVFYADKKGCGVEATTDITNGEFLMEYVGEVLD- 112
Query: 312 ALCEQRLWDMKYRGVQNF--------YMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILE 363
+D Y+ Q + Y ++ D IDAT KGN SRF+NHSC+PN +
Sbjct: 113 -------YDQFYKRAQAYSDDNNLHHYFMSLKGDTVIDATLKGNISRFINHSCEPNAETQ 165
Query: 364 KWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
KW V GE R+G F+ R I AGE +T+DY+F +FG +CYCGA +C+G++G +
Sbjct: 166 KWTVNGELRIGFFSKREISAGEEITFDYQFQRFGKVAQRCYCGAENCRGWIGAE 219
>gi|328716964|ref|XP_001950932.2| PREDICTED: hypothetical protein LOC100161826, partial [Acyrthosiphon
pisum]
Length = 1530
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 98/161 (60%), Gaps = 4/161 (2%)
Query: 258 KACHCSETCNNRPFRKEKKIKIVK--TEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
K C + CNN+ F K +V T+ GWG++ E I +G F+IEY+G+VID+ +
Sbjct: 1129 KVCPAGDKCNNQRFEKTLYPAMVPFLTKGRGWGLKTLEDIKEGSFVIEYVGDVIDEEEFQ 1188
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R +M R QN+Y I TIDA KGN SRF+NHSC+PNC+ +KW V G+TR+G+
Sbjct: 1189 RRCLEMHQRNEQNYYFLTIDNSRTIDAGPKGNLSRFMNHSCEPNCVTQKWTVNGDTRIGL 1248
Query: 376 FAARSIKAGEPLTYDYRFVQ-FGPEVK-CYCGASSCQGYLG 414
FA + I G L +DYR G E K C CGA+ C ++G
Sbjct: 1249 FALQDIPTGTELVFDYRLQSCAGVEKKPCQCGATRCSKFIG 1289
>gi|326468707|gb|EGD92716.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 970
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 112/193 (58%), Gaps = 8/193 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F+K + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 173 MECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFIFEYIGEVINE 232
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 233 PQFRKRMIQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKL 292
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA + IKAGE L ++Y ++G + + CYCG ++C GY+G GK + +K
Sbjct: 293 RMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYCGEANCLGYIG-----GKTQTERATKL 347
Query: 431 KRSSTACLAIITL 443
++ L I L
Sbjct: 348 SNATIEALGIDDL 360
>gi|3540208|gb|AAC34358.1| Hypothetical protein [Arabidopsis thaliana]
Length = 1767
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 102/160 (63%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C + C+N+ F+K K +K + + G+G+ E + +G+F+IEY+GEV+D E
Sbjct: 1008 TCPAGDLCSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYET 1067
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + ++G ++FY + + IDA KGN RF+NHSC+PNC EKW V GE VG+F
Sbjct: 1068 RQKEYAFKGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIF 1127
Query: 377 AARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSCQGYLG 414
+ + +K G+ LT+DY +V+ FG KCYCG+S C+GY+G
Sbjct: 1128 SMQDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHCRGYIG 1167
>gi|403167549|ref|XP_003327326.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
gi|375167080|gb|EFP82907.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
Length = 974
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 70/172 (40%), Positives = 106/172 (61%), Gaps = 5/172 (2%)
Query: 260 CHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C E C N+ F+K I+IV T G+G+ + K F+ EY+GEVI +R
Sbjct: 181 CRCREKCQNQRFQKRLYAPIEIVLTPKKGFGMRLQADVPKDTFVYEYLGEVIGVKALHKR 240
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
L D G+++FY E++KD IDAT KG F RFLNHSC+PNC + KW V + R+G+F
Sbjct: 241 LKDYGQEGIKHFYFMELQKDQYIDATKKGGFGRFLNHSCNPNCYIGKWVVGRQLRMGIFT 300
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKLELCW 426
R+++ GE LT++Y ++G E + C+CG ++C G+LG T+ +G ++ +
Sbjct: 301 KRAVRGGEELTFNYNVDRYGHEAQECFCGEANCVGFLGGKTQTDLGAMDELY 352
>gi|240254387|ref|NP_177854.6| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
gi|157734196|gb|ABV68921.1| SDG8 [Arabidopsis thaliana]
gi|332197839|gb|AEE35960.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
Length = 1805
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 102/160 (63%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C + C+N+ F+K K +K + + G+G+ E + +G+F+IEY+GEV+D E
Sbjct: 1008 TCPAGDLCSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYET 1067
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + ++G ++FY + + IDA KGN RF+NHSC+PNC EKW V GE VG+F
Sbjct: 1068 RQKEYAFKGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIF 1127
Query: 377 AARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSCQGYLG 414
+ + +K G+ LT+DY +V+ FG KCYCG+S C+GY+G
Sbjct: 1128 SMQDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHCRGYIG 1167
>gi|52545752|emb|CAH56331.1| hypothetical protein [Homo sapiens]
Length = 881
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 110 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 169
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 170 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 229
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 230 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 271
>gi|336469811|gb|EGO57973.1| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Neurospora tetrasperma FGSC 2508]
gi|350290514|gb|EGZ71728.1| histone-lysine N-methyltransferase, H3 lysine-36 specific, partial
[Neurospora tetrasperma FGSC 2509]
Length = 957
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 102/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C C N+ F++++ + ++KTE G+G+ A + +FI EYIGEVI++ R
Sbjct: 165 CNCGSGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDFIFEYIGEVINEPTFRSR 224
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K +DAT KGN RF NHSCDPNC ++KW V + R+G+FA
Sbjct: 225 MVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCDPNCYVDKWVVGDKLRMGIFA 284
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+IKAGE L ++Y ++G + + CYCG ++C G++G K
Sbjct: 285 GRAIKAGEELVFNYNVDRYGADPQPCYCGEANCTGFIGGK 324
>gi|302839691|ref|XP_002951402.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
nagariensis]
gi|300263377|gb|EFJ47578.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
nagariensis]
Length = 2345
Score = 142 bits (359), Expect = 3e-31, Method: Composition-based stats.
Identities = 76/185 (41%), Positives = 105/185 (56%), Gaps = 15/185 (8%)
Query: 257 SKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
+K C C E C+NR ++ K ++ TE GWGV EP++KG FIIEY GEVIDD
Sbjct: 1550 AKLCPCGELCSNRSLHLLRQPKTEVFLTENRGWGVRTMEPLSKGAFIIEYAGEVIDDREL 1609
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ--VEGETR 372
+R+ + G +FY+ E+ IDA KGN +R +N SCDPNC +KW GE R
Sbjct: 1610 GRRMEHARMNGEPHFYIMELAAGLYIDARRKGNIARLINSSCDPNCETQKWHDASTGEIR 1669
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFG------PEVKCYCGASSCQGYLG----TKRKIG-K 421
VG+FA+R I GE L YDY F +G C CG+ +C+G + +R +G +
Sbjct: 1670 VGIFASRDIPPGEELVYDYFFSTYGAIKQSAASFVCMCGSKNCRGTMDLHPEKRRDLGRR 1729
Query: 422 LELCW 426
+E+ W
Sbjct: 1730 VEVFW 1734
>gi|145327721|ref|NP_001077836.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
gi|332197840|gb|AEE35961.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
Length = 1501
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 102/160 (63%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C + C+N+ F+K K +K + + G+G+ E + +G+F+IEY+GEV+D E
Sbjct: 1008 TCPAGDLCSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYET 1067
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + ++G ++FY + + IDA KGN RF+NHSC+PNC EKW V GE VG+F
Sbjct: 1068 RQKEYAFKGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIF 1127
Query: 377 AARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSCQGYLG 414
+ + +K G+ LT+DY +V+ FG KCYCG+S C+GY+G
Sbjct: 1128 SMQDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHCRGYIG 1167
>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
Length = 1562
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 69/171 (40%), Positives = 100/171 (58%), Gaps = 3/171 (1%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C+N+ F K +++KT GWG++ + + KG+F++EY+GE+
Sbjct: 1232 LQYECHPQVCPAGDRCHNQCFSKRLYPDTEVIKTTGRGWGLKTKQDLKKGDFVMEYVGEL 1291
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID C+QR+ V NFYM + KD IDA KGN SRF+NHSC PNC +KW V
Sbjct: 1292 IDSEECKQRIRTANENHVTNFYMLTLTKDRVIDAGPKGNLSRFMNHSCSPNCETQKWTVN 1351
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ R+G+F I A LT++Y G C+CG+ +C G+LG K K
Sbjct: 1352 GDVRIGLFTLCDISADTELTFNYNLDCLGNGRTSCHCGSENCSGFLGVKPK 1402
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 10/71 (14%)
Query: 123 IYHGEEVFCSVRGCGGVYHFICVKERLG-ISNPRNFKCPQHACFIC--RQRLQ------- 172
+ G+ CSV GCG YH CV++ G S+ + +CPQH+C C + LQ
Sbjct: 867 VTEGDMKRCSVNGCGRYYHETCVRKYTGSASDTKGLRCPQHSCATCCLDRDLQKAGKGRM 926
Query: 173 WRCVRCTIASH 183
RC+RC +A H
Sbjct: 927 MRCIRCPVAYH 937
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/174 (41%), Positives = 101/174 (58%), Gaps = 3/174 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1928 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1987
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1988 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2047
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
IKAG LT++Y G + C CGA +C G+LG + K + SKR
Sbjct: 2048 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNQPIATEEKSKR 2101
>gi|94707110|sp|Q2LAE1.1|ASHH2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH2; AltName:
Full=ASH1 homolog 2; AltName: Full=H3-K4-HMTase; AltName:
Full=Histone H3-K36 methyltransferase 8;
Short=H3-K36-HMTase 8; AltName: Full=Protein EARLY
FLOWERING IN SHORT DAYS; AltName: Full=Protein SET DOMAIN
GROUP 8
gi|85036158|gb|ABC69038.1| SDG8 [Arabidopsis thaliana]
Length = 1759
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 102/160 (63%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C + C+N+ F+K K +K + + G+G+ E + +G+F+IEY+GEV+D E
Sbjct: 1008 TCPAGDLCSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYET 1067
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + ++G ++FY + + IDA KGN RF+NHSC+PNC EKW V GE VG+F
Sbjct: 1068 RQKEYAFKGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIF 1127
Query: 377 AARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSCQGYLG 414
+ + +K G+ LT+DY +V+ FG KCYCG+S C+GY+G
Sbjct: 1128 SMQDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHCRGYIG 1167
>gi|149239909|ref|XP_001525830.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449953|gb|EDK44209.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 822
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 103/164 (62%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
++ C C E C N+ F+K++ + + +TE G+G+ A + +G+FI EYIGEVID+
Sbjct: 89 INRHCSCGENCQNQRFQKKQYADVSVFQTELKGYGLRANTQLREGDFIYEYIGEVIDEPT 148
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
Q++ + + ++FY ++ D IDAT KG+ +RF+NHSC PN ++KW V R+
Sbjct: 149 FRQKMIEYDLKQYKHFYFMMLKNDAFIDATEKGSLARFVNHSCSPNAFVDKWVVADRLRM 208
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I AGE +T+DY ++G + + CYCG +C ++G K
Sbjct: 209 GIFAKRDIMAGEEITFDYNVDRYGAQSQPCYCGEPNCLKFMGGK 252
>gi|405952170|gb|EKC20012.1| Histone-lysine N-methyltransferase SETD2 [Crassostrea gigas]
Length = 1451
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 102/165 (61%), Gaps = 3/165 (1%)
Query: 253 YISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
YI C C C E C N+ F+K++ ++ T++ G G+ A + G+F++EY+GEV+D
Sbjct: 460 YIECGNRCPCGEYCTNKRFQKKQYADVEAFVTDWKGMGLRATAALQPGDFVMEYVGEVLD 519
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ R+ G ++ Y + D IDA++KGN SR++NHSCDPNC +KW V G
Sbjct: 520 YKQFKSRVKQQAKMGQEHHYFMALNSDEVIDASYKGNVSRYMNHSCDPNCETQKWTVNGV 579
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
RVG F ++++ L +DY+F ++G E KC+CG+ +C+G++G
Sbjct: 580 LRVGFFVKKAVEPLTELNFDYQFERYGKEAQKCFCGSENCRGFIG 624
>gi|157112020|ref|XP_001657377.1| huntingtin interacting protein [Aedes aegypti]
gi|108878208|gb|EAT42433.1| AAEL006013-PA [Aedes aegypti]
Length = 2367
Score = 142 bits (358), Expect = 3e-31, Method: Composition-based stats.
Identities = 68/164 (41%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C N+ F+K + ++ +TE G+G++A+ I G+FI+EY+GEV++
Sbjct: 1249 IECGSRCTIGERCTNKRFQKLEYANCQVFRTEKKGFGIQASTEIVPGDFIMEYVGEVLNS 1308
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
++R Q++Y +R D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 1309 EQFDERAELYSKEKNQHYYFMALRSDAIIDATTKGNISRFINHSCDPNAETQKWTVNGEL 1368
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F + I GE +T+DY+F ++G KCYC A +C G++G
Sbjct: 1369 RIGFFCTKYIMPGEEITFDYQFQRYGRRAQKCYCEAENCTGWIG 1412
>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
Length = 2214
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 74/178 (41%), Positives = 102/178 (57%), Gaps = 3/178 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + +++I +T GWG+ I KG+F+ EY+GEVID+ C
Sbjct: 1806 QVCPAGEKCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECR 1865
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN +RF+NH C PNC +KW V G+TRVG+
Sbjct: 1866 SRIRHAQENDICNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVNGDTRVGL 1925
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIGKLELCWGSKRKR 432
FA +KAG LT++Y G + C CGA +C G+LG + K G K KR
Sbjct: 1926 FALVDVKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNNPPSDDKGRKMKR 1983
>gi|367027906|ref|XP_003663237.1| methyltransferase-like protein [Myceliophthora thermophila ATCC
42464]
gi|347010506|gb|AEO57992.1| methyltransferase-like protein [Myceliophthora thermophila ATCC
42464]
Length = 950
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 107/167 (64%), Gaps = 4/167 (2%)
Query: 254 ISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C S C+C E C N+ F++++ K+ ++KTE G+G+ A + +F+ EY+GEVI+
Sbjct: 153 IECVSGDCNCGEGCQNQRFQRKQYAKVSVIKTEKKGFGLRADTDLQANDFVYEYVGEVIN 212
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ R+ G+++FY + K+ +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 213 EPTFRSRMVKYDREGIKHFYFMSLTKNEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEK 272
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I+AGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 273 LRMGIFAGRAIRAGEELVFNYNVDRYGADPQPCYCGEPNCVGFIGGK 319
>gi|320586545|gb|EFW99215.1| set and ww domain containing protein [Grosmannia clavigera kw1407]
Length = 1051
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 102/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C ETC N+ F++++ + ++KT G+G+ A + +F+ EYIGEVI++ +R
Sbjct: 159 CNCGETCQNQRFQRKQYAAVSVIKTAKKGFGLRADVALRANDFVYEYIGEVINEPAFRRR 218
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K +DAT KGN RF NHSC PNC ++KW V + R+G+FA
Sbjct: 219 MLQYDEEGIRHFYFMSLTKHEFVDATRKGNLGRFCNHSCRPNCYVDKWVVGEKLRMGIFA 278
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
+R+I AGE L +DY ++G E + CYCG +C G++G K
Sbjct: 279 SRAIAAGEELVFDYNVDRYGAEPQPCYCGEPNCSGFIGGK 318
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1660 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1719
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1720 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1779
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1780 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1821
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1822 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1881
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1882 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1941
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1942 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1983
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1656 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1715
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1716 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1775
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1776 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1817
>gi|444323962|ref|XP_004182621.1| hypothetical protein TBLA_0J01040 [Tetrapisispora blattae CBS 6284]
gi|387515669|emb|CCH63102.1| hypothetical protein TBLA_0J01040 [Tetrapisispora blattae CBS 6284]
Length = 745
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 100/163 (61%), Gaps = 3/163 (1%)
Query: 257 SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
SK +C + C N+ F+K + I I +TE G+GV A E + +FI EYIGEVIDD
Sbjct: 76 SKCNYCGDNCQNQRFQKHEYSNISIFQTEMKGFGVRANENLEINQFIYEYIGEVIDDDEF 135
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
+R+ + RG ++FY ++ IDAT KGN RF NHSC+PN + KW V + ++G
Sbjct: 136 HKRMINYDQRGEKHFYFMMLKSGEFIDATEKGNLGRFCNHSCNPNAYVNKWLVGDKLKMG 195
Query: 375 VFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTK 416
+FA R I GE +T+DY ++G KCYCG ++C G+LG K
Sbjct: 196 IFAKRPIDKGEEITFDYNVDRYGASPQKCYCGEANCLGFLGGK 238
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1656 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1715
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1716 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1775
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1776 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1817
>gi|384499018|gb|EIE89509.1| hypothetical protein RO3G_14220 [Rhizopus delemar RA 99-880]
Length = 962
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 108/168 (64%), Gaps = 4/168 (2%)
Query: 251 VQYISCS-KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ ++ C+ + C C C NR F R+ ++ +++TE G+G+ A + FI+EYIGE
Sbjct: 139 MMFMECTAEDCPCGRYCRNRRFQLRQFARVDVIRTEKKGFGLRALTDLPTNSFIMEYIGE 198
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VI + +R + + G++++Y ++ D IDAT KG +RF+NHSC+PNC+ +KW V
Sbjct: 199 VIPNQEFIRRTKEYEASGLEHYYFMTLKTDEIIDATKKGCLARFINHSCNPNCVTQKWVV 258
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG 414
R+G+F R IKAGE LT+DY+F ++G + + CYCG +C+G++G
Sbjct: 259 GKNMRIGIFTNRGIKAGEELTFDYKFERYGAQAQVCYCGEFACKGFIG 306
>gi|398407533|ref|XP_003855232.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
gi|339475116|gb|EGP90208.1| histone methyltransferase [Zymoseptoria tritici IPO323]
Length = 799
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C+C C N F+++K + ++KTE G+G+ + +FI EYIGEVI +
Sbjct: 100 MECVADCNCGSKCQNMRFQRKKYANVDVIKTEKKGYGLRTQTDLRPNDFIFEYIGEVIGE 159
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ +R+ G+++FY + K +DAT +GN RF NHSC+PNC ++KW V +
Sbjct: 160 NVFRRRMQQYDEDGIKHFYFMSLTKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGDKL 219
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I+AGE L ++Y ++G E + CYCG ++C GY+G K
Sbjct: 220 RMGIFAERAIQAGEELVFNYNVDRYGAEPQPCYCGEANCTGYIGGK 265
>gi|294658433|ref|XP_460767.2| DEHA2F09350p [Debaryomyces hansenii CBS767]
gi|218511782|sp|Q6BM04.2|SET2_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|202953126|emb|CAG89108.2| DEHA2F09350p [Debaryomyces hansenii CBS767]
Length = 731
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 109/170 (64%), Gaps = 4/170 (2%)
Query: 251 VQYISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V + CS K C C C N+ F+K++ + +++TE G+G+ A E I++ FI EYIGE
Sbjct: 83 VTSVECSNKFCTCGNDCQNQRFQKKQYANVTVIQTELKGYGLRANEDISESSFIYEYIGE 142
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+ +R+ D + + +FY ++KD IDAT KG+ +RF NHSC+PN ++KW V
Sbjct: 143 VIDEESFRKRMIDYDTKKLIHFYFMMLKKDSFIDATMKGSLARFCNHSCNPNAYVDKWVV 202
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
+ R+G+F+ R+I+ GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 203 GEKLRMGIFSKRNIQKGEEITFDYNVDRYGAQSQPCYCGEPNCIKWMGGK 252
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1657 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1716
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1717 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1776
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1777 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1818
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1926 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1985
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1986 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2045
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2046 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2087
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1925 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1984
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1985 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2044
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2045 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2086
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1927 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1986
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1987 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2046
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2047 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2088
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1657 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1716
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1717 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1776
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1777 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1818
>gi|85086731|ref|XP_957740.1| histone H3 lysine 36 (K36) methyltransferase [Neurospora crassa
OR74A]
gi|74614418|sp|Q7RZU4.1|SET2_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|28918835|gb|EAA28504.1| histone H3 lysine 36 (K36) methyltransferase [Neurospora crassa
OR74A]
Length = 954
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C C N+ F++++ + ++KTE G+G+ A + +FI EYIGEVI++ R
Sbjct: 165 CNCGSGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDFIFEYIGEVINEPTFRSR 224
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K +DAT KGN RF NHSCDPNC ++KW V + R+G+FA
Sbjct: 225 MVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCDPNCYVDKWVVGDKLRMGIFA 284
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+IKAGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 285 GRAIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIGGK 324
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1659 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1718
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1719 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1778
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1779 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1820
>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
Length = 1069
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 298 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 357
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 358 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 417
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 418 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 459
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1925 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1984
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1985 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2044
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2045 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2086
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1619 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1678
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1679 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1738
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1739 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1780
>gi|115446669|ref|NP_001047114.1| Os02g0554000 [Oryza sativa Japonica Group]
gi|50725771|dbj|BAD33302.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|113536645|dbj|BAF09028.1| Os02g0554000 [Oryza sativa Japonica Group]
Length = 637
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 101/166 (60%), Gaps = 5/166 (3%)
Query: 254 ISCSK-ACHCSETCNNRPFRKEKKIKIVK--TEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C+K C C E C+N+ F++ K+ K T G+G++ E +++G F+IEY+GEV+D
Sbjct: 171 IECTKRTCPCGEHCSNQQFQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLD 230
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E R +G ++FY + IDA KGN RF+NHSC PNC EKW V GE
Sbjct: 231 ITAYESRQRYYASKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGE 290
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLG 414
+G+FA R+IK GE LT+DY +V+ KC+CG + C+GY+G
Sbjct: 291 VCIGIFAMRNIKKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYIG 336
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1928 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1987
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1988 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2047
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2048 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2089
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1928 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1987
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1988 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2047
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2048 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2089
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1924 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1983
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1984 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2043
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2044 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2085
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1926 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1985
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1986 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2045
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2046 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2087
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1569 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1628
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1629 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1688
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1689 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1730
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1655 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1714
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1715 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1774
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1775 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1816
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1925 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1984
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1985 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2044
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2045 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2086
>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
Length = 1181
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 152/348 (43%), Gaps = 73/348 (20%)
Query: 131 CSVRGCGGVYHFICVK-----ERLGISNPRNFKCPQHACFIC------------RQRLQW 173
C++ GCG YH C++ + N CP H C C R L
Sbjct: 757 CALSGCGKYYHIHCLRLFPQHKITSTPNSSTLFCPYHTCHTCVSDDPRTNATATRGSL-V 815
Query: 174 RCVRCTIASHD--KCAPWPDRVIHLKDQPGRAVCWRHPAKWLLDKQEVFCRLPLPYADEE 231
RC++C + H KC P +++ +C +H + E A E
Sbjct: 816 RCIKCPSSYHPDAKCVPAGSQLLTT----STMICPKHSLEQCSINAEKA------RAQES 865
Query: 232 FKIDLTWKDLM-----ENKVGPP-------------------PVQYISCS---------- 257
DL++K M NK P P C
Sbjct: 866 APDDLSFKPPMYVKIKSNKYVAPLRGRNAARDEEEDSICECKPSDTDPCGLDSNCINRAL 925
Query: 258 ------KACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVI 309
K C ++C N+ F +++ + GWG+ A E I++G+F+IEY+GEVI
Sbjct: 926 LVECNPKTCPAGDSCQNQCFERKQYPALAAKRIPNKGWGLVAQEDIHQGQFVIEYVGEVI 985
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
+ +R+ + + +N+Y + + TIDA KGN +RF+NHSC+PNC W+V G
Sbjct: 986 NGEELARRIKQKQEQKDENYYFLTVDSELTIDAGPKGNLARFINHSCEPNCETLLWKVGG 1045
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
VG+FA + +KAGE LT++Y F FG + K C+CGA+ C G +G K
Sbjct: 1046 SQSVGLFALKDLKAGEELTFNYNFETFGDQKKICHCGAAKCSGLIGQK 1093
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1928 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1987
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1988 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2047
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2048 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2089
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1924 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1983
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1984 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2043
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2044 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2085
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1929 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1988
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1989 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2048
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2049 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2090
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1641 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1700
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1701 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1760
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1761 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1802
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1924 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1983
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1984 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2043
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2044 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2085
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1620 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1679
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1680 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1739
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1740 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1781
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1926 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1985
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1986 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2045
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2046 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2087
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1926 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1985
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1986 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2045
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2046 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2087
>gi|218194855|gb|EEC77282.1| hypothetical protein OsI_15918 [Oryza sativa Indica Group]
Length = 472
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 100/162 (61%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++VKTE GWG+ A E I G+F++EY GEVI ++R
Sbjct: 64 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 123
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+ + Y+ + D +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 124 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 183
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
+ I G L+YDY F F G V+C CGA SC G+LG K +
Sbjct: 184 KQDIPIGTELSYDYNFEWFGGAMVRCLCGAGSCSGFLGAKSR 225
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + ++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1928 CPAGGRCQNQCFSKRQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1987
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1988 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2047
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2048 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2089
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1928 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1987
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1988 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2047
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2048 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2089
>gi|116309320|emb|CAH66406.1| OSIGBa0093L02.2 [Oryza sativa Indica Group]
Length = 519
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/179 (43%), Positives = 104/179 (58%), Gaps = 4/179 (2%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++VKTE GWG+ A E I G+F++EY GEVI ++R
Sbjct: 70 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 129
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+ + Y+ + D +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 130 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 189
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTK-RKIGKLELCWGSKRKRSS 434
+ I G L+YDY F F G V+C CGA SC G+LG K R + W R S
Sbjct: 190 KQDIPIGTELSYDYNFEWFGGAMVRCLCGAGSCSGFLGAKSRGFQEATYLWEDDDDRFS 248
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1928 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1987
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1988 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2047
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2048 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2089
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1926 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1985
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1986 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2045
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2046 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2087
Score = 38.1 bits (87), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 76/180 (42%), Gaps = 30/180 (16%)
Query: 33 SATESPGSDSAVVKTLALTGEEENVCAN-GNGHSVRVMKRCRGAKNI-SGLEDHVAAWVK 90
S+ E+PGS+ ++ +E V + N H + K+ +G + + L+++V +
Sbjct: 1492 SSKETPGSEGELMTHRTAASPKETVEESVENDHGMPASKKLQGERGGGAALKENVCQNCE 1551
Query: 91 KKMELGVPQSNCSLPFLVGAKKMIE-------CRACHRFIY-------HGEEV-FCSVRG 135
K EL + ++ C F + + E C C I+ GE+V C +
Sbjct: 1552 KLGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLPL 1611
Query: 136 CGGVYHFICV-KERLGISNPRNFKCPQHACFIC-----------RQRLQWRCVRCTIASH 183
CG YH CV K + + F+C H C C + RL RCVRC +A H
Sbjct: 1612 CGKFYHEECVQKYPPTVMQNKGFRCSLHICITCHAANPASVSASKGRL-MRCVRCPVAYH 1670
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1917 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1976
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1977 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2036
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2037 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2078
>gi|440633543|gb|ELR03462.1| hypothetical protein GMDG_06195 [Geomyces destructans 20631-21]
Length = 977
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 110/184 (59%), Gaps = 8/184 (4%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C C N+ F++++ K+ ++KTE G+G+ A ++ +FI EYIGEV+++ +R
Sbjct: 184 CNCGPNCQNQRFQRKEFSKVSVIKTEKKGYGLRADTDLSAHDFIFEYIGEVVNEPTLRRR 243
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+FA
Sbjct: 244 MRQYDEEGIKHFYFMSLSKSEFVDATIKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFA 303
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSSTA 436
R+IK GE L ++Y ++G + + CYC +C G+LG GK + +K ++
Sbjct: 304 ERNIKVGEELVFNYNVDRYGADPQPCYCAEPNCTGFLG-----GKTQTERATKLSHTTIE 358
Query: 437 CLAI 440
L I
Sbjct: 359 ALGI 362
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1929 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1988
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1989 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2048
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2049 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2090
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1738 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1797
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1798 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1857
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1858 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1899
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 80/182 (43%), Gaps = 34/182 (18%)
Query: 33 SATESPGSDSAVV--KTLALTGE--EENVCANGNGHSVRVMKRCRGAKNISGLEDHVAAW 88
SA E+PGS+ ++ +T A E EE + + + S + M+ RG + L+++V
Sbjct: 1304 SAKETPGSEGELMTHRTAASPKENIEEGIEHDASMSSSKKMQGERGGG--AALKENVCQN 1361
Query: 89 VKKKMELGVPQSNCSLPFLVGAKKMIE-------CRACHRFIY-------HGEEV-FCSV 133
+K EL + ++ C F + + E C C I+ GE+V C +
Sbjct: 1362 CEKVGELLLCEAQCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLL 1421
Query: 134 RGCGGVYHFICVKERL-GISNPRNFKCPQHACFIC-----------RQRLQWRCVRCTIA 181
CG YH CV++ + + F+C H C C + RL RCVRC +A
Sbjct: 1422 PLCGKFYHEECVQKYPPTVMQNKGFRCSLHICITCHAANPASVSASKGRL-MRCVRCPVA 1480
Query: 182 SH 183
H
Sbjct: 1481 YH 1482
>gi|222628880|gb|EEE61012.1| hypothetical protein OsJ_14832 [Oryza sativa Japonica Group]
Length = 518
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/179 (43%), Positives = 104/179 (58%), Gaps = 4/179 (2%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++VKTE GWG+ A E I G+F++EY GEVI ++R
Sbjct: 69 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 128
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+ + Y+ + D +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 129 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 188
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTK-RKIGKLELCWGSKRKRSS 434
+ I G L+YDY F F G V+C CGA SC G+LG K R + W R S
Sbjct: 189 KQDIPIGTELSYDYNFEWFGGAMVRCLCGAGSCSGFLGAKSRGFQEATYLWEDDDDRFS 247
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + ++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1820 CPAGGRCQNQCFSKRQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1879
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1880 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1939
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1940 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1981
>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Xenopus (Silurana) tropicalis]
Length = 2440
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 98/162 (60%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C E C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1744 CPAGERCQNQAFSKRQYPEVEIFRTLSRGWGLRCRTDIKKGEFVNEYVGEMIDEEECRAR 1803
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + + NFYM + KD IDA KGNF+RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1804 IRYAQEQDITNFYMLTLDKDRVIDAGPKGNFARFMNHCCQPNCETQKWTVNGDTRVGLFA 1863
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKA LT++Y G + C CGA +C G+LG + K
Sbjct: 1864 LCDIKAXVELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1905
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + ++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1556 CPAGGRCQNQCFSKRQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1615
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1616 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1675
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1676 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1717
>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Bos taurus]
Length = 1470
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 700 CPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 759
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 760 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 819
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 820 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 861
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + ++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1615 CPAGGRCQNQCFSKRQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1674
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1675 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1734
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1735 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1776
>gi|328767162|gb|EGF77213.1| hypothetical protein BATDEDRAFT_91931 [Batrachochytrium
dendrobatidis JAM81]
Length = 779
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 73/177 (41%), Positives = 99/177 (55%), Gaps = 7/177 (3%)
Query: 245 KVGPPPVQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFC---GWGVEAAEPINKGEFI 301
K G PP + C C + C NR FR+ ++I+ ++ + G+G+ PI G I
Sbjct: 532 KDGSPPCE---CGDNCPMGDCCTNRAFRRNEQIEKLRVFYAPNRGFGLYTDVPIKAGVLI 588
Query: 302 IEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCI 361
IEY GE+I A C +R D Y G +N Y E +D KG +RF NHSCDPNC
Sbjct: 589 IEYRGEIISTAKCIER-NDTIYSGQKNHYFLEYGNGLVLDGCRKGTIARFANHSCDPNCH 647
Query: 362 LEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRK 418
+EKW V E RVG+FA +I G LTYDYRF +G CYCG+ +C+G++ +K
Sbjct: 648 VEKWYVGTEFRVGIFATNNISVGSELTYDYRFDSYGQMQPCYCGSQNCRGFICINKK 704
>gi|396478086|ref|XP_003840449.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
gi|312217021|emb|CBX96970.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
Length = 962
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 105/168 (62%), Gaps = 3/168 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C C N+ F +++ + ++KT+ G+G+ A + + G+F+ EYIGEVID+
Sbjct: 211 MECVGDCTCGRKCQNQRFLRKQYADVTVIKTDKKGFGLRANKDMAPGDFVFEYIGEVIDE 270
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ + G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 271 KTFRRRMLQYDHEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKL 330
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRK 418
R+G+F R ++AGE L ++Y ++G + + CYCG +C G++G K++
Sbjct: 331 RMGIFVERRVQAGEELVFNYNVDRYGADPQPCYCGEPNCSGFIGGKQQ 378
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 96/164 (58%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
C C C ++ R+ ++I +T GWG+ I KGEF+ EY+GE+ID+ C
Sbjct: 1823 CPAGVRCQNQCFSK--RQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECR 1880
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+
Sbjct: 1881 ARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGL 1940
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
FA IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1941 FALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1984
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 21/88 (23%)
Query: 116 CRACHRFIY-------HGEEV-FCSVRGCGGVYHFICV-KERLGISNPRNFKCPQHACFI 166
C CH I+ GE+V C + CG YH CV K ++ + F+CP H C
Sbjct: 1481 CNECHTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVTQNKGFRCPLHICIT 1540
Query: 167 C-----------RQRLQWRCVRCTIASH 183
C + RL RCVRC +A H
Sbjct: 1541 CHAANPANVSASKGRL-MRCVRCPVAYH 1567
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 96/164 (58%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
C C C ++ R+ ++I +T GWG+ I KGEF+ EY+GE+ID+ C
Sbjct: 1823 CPAGVRCQNQCFSK--RQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECR 1880
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+
Sbjct: 1881 ARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGL 1940
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
FA IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1941 FALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1984
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 21/88 (23%)
Query: 116 CRACHRFIY-------HGEEV-FCSVRGCGGVYHFICV-KERLGISNPRNFKCPQHACFI 166
C CH I+ GE+V C + CG YH CV K ++ + F+CP H C
Sbjct: 1481 CNECHTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVTQNKGFRCPLHICIT 1540
Query: 167 C-----------RQRLQWRCVRCTIASH 183
C + RL RCVRC +A H
Sbjct: 1541 CHAANPANVSASKGRL-MRCVRCPVAYH 1567
>gi|212535752|ref|XP_002148032.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
gi|210070431|gb|EEA24521.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
Length = 900
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C + C N+ F++++ + ++KT G+G+ A I +FI EYIGEVI++
Sbjct: 195 IECLNDCSCGQDCQNQRFQRKEYANVTVIKTAKKGFGLRAESDIQPHQFIYEYIGEVINE 254
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A +R+ G+++FY + K +DAT KGN +RF NHSC+PNC ++KW V +
Sbjct: 255 ANFRRRMIQYDKEGIKHFYFMSLNKGEFVDATKKGNLARFCNHSCNPNCYVDKWVVGEKL 314
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 315 RMGIFAERYIQAGEELVFNYNVDRYGADPQPCYCGEPNCSGFIG-----GKTQTERATKL 369
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 370 SNATIEALGI 379
>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
Length = 1279
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 69/168 (41%), Positives = 100/168 (59%), Gaps = 7/168 (4%)
Query: 258 KACHCSETCNNRPFRKEKKIKIVK--TEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
K C + C NR F++ + K++ T+ GWG++A E I +G+F+IEY+GE+ID C
Sbjct: 1044 KTCPVKDKCQNRRFQRRQYPKLIPFLTQSKGWGLKAGEDIAEGQFVIEYVGEIIDATECR 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+RL + +FY+ + +DA K N +RF+NHSC PNC +KW V GETRVG+
Sbjct: 1104 RRLAASQAANDHSFYILSLSGSSFVDARNKANLARFINHSCGPNCETQKWNVLGETRVGI 1163
Query: 376 FAARSIKAGEPLTYDYRFVQFGP--EVKCYCGASSCQGYLGTKRKIGK 421
FA I G LT+DY+ G C+CGASSC+G + K+G+
Sbjct: 1164 FAKEDIPKGTELTFDYQLDSLGSRGRTTCHCGASSCRGVI---EKLGR 1208
Score = 45.1 bits (105), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 34/69 (49%), Gaps = 16/69 (23%)
Query: 131 CSVRGCGGVYHFICVKERLGISNPR------NFKCPQHACFICR--QRLQW---RCVRCT 179
CSVR CG YH C+ +NPR +FKCP H C C Q + RC+RC
Sbjct: 742 CSVRNCGKKYHRACIS-----NNPRAALKDNSFKCPLHKCANCTYPQASTYPLVRCIRCP 796
Query: 180 IASHDKCAP 188
IA H C P
Sbjct: 797 IAYHTCCVP 805
>gi|302141761|emb|CBI18964.3| unnamed protein product [Vitis vinifera]
Length = 1958
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 71/160 (44%), Positives = 100/160 (62%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C C + C+N+ F+K K+K K G+G++ + I++G+F+IEY+GEV+D E
Sbjct: 1247 TCPCGDLCSNQQFQKRGYAKLKWFKCGKKGYGLQLQQDISQGQFLIEYVGEVLDLQTYEA 1306
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + RG ++FY + IDA KGN RF+NHSCDPNC EKW V GE +G+F
Sbjct: 1307 RQKEYASRGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLF 1366
Query: 377 AARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSCQGYLG 414
A R IK GE +T+DY +V+ FG KC CG+ C+GY+G
Sbjct: 1367 ALRDIKKGEEVTFDYNYVRVFGAAAKKCVCGSPQCRGYIG 1406
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 96/164 (58%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
C C C ++ R+ ++I +T GWG+ I KGEF+ EY+GE+ID+ C
Sbjct: 1926 CPAGVRCQNQCFSK--RQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECR 1983
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+
Sbjct: 1984 ARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGL 2043
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
FA IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2044 FALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2087
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 21/88 (23%)
Query: 116 CRACHRFIY-------HGEEV-FCSVRGCGGVYHFICV-KERLGISNPRNFKCPQHACFI 166
C CH I+ GE+V C + CG YH CV K ++ + F+CP H C
Sbjct: 1584 CNECHTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVTQNKGFRCPLHICIT 1643
Query: 167 C-----------RQRLQWRCVRCTIASH 183
C + RL RCVRC +A H
Sbjct: 1644 CHAANPANVSASKGRL-MRCVRCPVAYH 1670
>gi|365982940|ref|XP_003668303.1| hypothetical protein NDAI_0B00260 [Naumovozyma dairenensis CBS 421]
gi|343767070|emb|CCD23060.1| hypothetical protein NDAI_0B00260 [Naumovozyma dairenensis CBS 421]
Length = 795
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 97/158 (61%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ + I KT+ G+GV A E I G+FI EYIGEVID+ +R+
Sbjct: 90 CGNDCQNQRFQKKQYSNVSIFKTKLKGYGVRANENIENGQFIYEYIGEVIDEIQFRERMI 149
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D + ++FY ++ IDAT KG+ +RF NHSC PN + KW+V G+ R+G+FA R
Sbjct: 150 DYDLKKFKHFYFMMLQNGQFIDATIKGSLARFCNHSCSPNAYVNKWEVAGKLRMGIFAKR 209
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G KCYC +C G+LG K
Sbjct: 210 KILKGEEITFDYNVDRYGATAQKCYCDEPNCIGFLGGK 247
>gi|383860108|ref|XP_003705533.1| PREDICTED: uncharacterized protein LOC100883855 [Megachile
rotundata]
Length = 1766
Score = 141 bits (355), Expect = 8e-31, Method: Composition-based stats.
Identities = 67/164 (40%), Positives = 101/164 (61%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + K ++ +TE G+G+ A + GEFI+EY+GEV+D
Sbjct: 790 IECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMADMLAGEFIMEYVGEVVDP 849
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R + +++Y ++ D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 850 KDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNVSRFINHSCDPNSETQKWTVNGEL 909
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F + I AGE +T+DY F ++G E KC+C A++C+G++G
Sbjct: 910 RIGFFNKKFIAAGEEITFDYHFQRYGKEAQKCFCEAANCRGWIG 953
>gi|119182346|ref|XP_001242314.1| hypothetical protein CIMG_06210 [Coccidioides immitis RS]
Length = 1003
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C ++C N+ F++ + K+ ++KTE G+G+ A + EFI EYIGEVI++
Sbjct: 172 MECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFIFEYIGEVINE 231
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 232 PQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 291
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 292 RMGIFAERYIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 346
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 347 SHATIEALGI 356
>gi|256074584|ref|XP_002573604.1| huntingtin interacting protein-related [Schistosoma mansoni]
Length = 1575
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 102/171 (59%), Gaps = 8/171 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVKTEFC-----GWGVEAAEPINKGEFIIEYIGEV 308
I C C + C+NR F + ++ F GWG+ A + + KG F+IEY+GEV
Sbjct: 429 IECGLHCAAGDFCSNRQF--QMRLYAPTRPFYAGKDKGWGLMATDNVKKGSFVIEYVGEV 486
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID + +R+ + G + Y + D IDA KGN++RF+NHSC+PNC+ +KW V
Sbjct: 487 IDFSEFRRRIRRYERLGHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVN 546
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFG-PEVKCYCGASSCQGYLGTKRK 418
GE R+G FA I +G+ +T DY+FVQ+G E KCYCGAS+C G +G K
Sbjct: 547 GEIRIGFFAKEDIPSGQEVTIDYQFVQYGVSEQKCYCGASTCSGIMGATSK 597
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + ++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1617 CPAGVRCQNQCFSKRQYPDVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1676
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1677 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1736
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1737 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1778
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 21/88 (23%)
Query: 116 CRACHRFIY-------HGEEV-FCSVRGCGGVYHFICV-KERLGISNPRNFKCPQHACFI 166
C CH I+ GE+V C + CG YH CV K ++ + F+CP H C
Sbjct: 1275 CNECHTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECVQKYPPTVTQNKGFRCPLHICIT 1334
Query: 167 C-----------RQRLQWRCVRCTIASH 183
C + RL RCVRC +A H
Sbjct: 1335 CHAANPANVSASKGRL-MRCVRCPVAYH 1361
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1823 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKIDIKKGEFVNEYVGELIDEEECRAR 1882
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1883 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1942
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1943 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1984
>gi|303319123|ref|XP_003069561.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240109247|gb|EER27416.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 1011
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C ++C N+ F++ + K+ ++KTE G+G+ A + EFI EYIGEVI++
Sbjct: 180 MECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFIFEYIGEVINE 239
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 240 PQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 299
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 300 RMGIFAERYIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 354
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 355 SHATIEALGI 364
>gi|150865134|ref|XP_001384227.2| hypothetical protein PICST_31672 [Scheffersomyces stipitis CBS
6054]
gi|149386390|gb|ABN66198.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 732
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 66/160 (41%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C + C N+ F+K + + + +TE G+G+ A + I +G FI EYIGEVID+A R
Sbjct: 103 CGCGKNCQNQRFQKRQYASVSVFQTELKGYGLRADDVIPEGGFIYEYIGEVIDEASFRAR 162
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ D + ++FY ++ D IDAT KG+ +RF NHSC PN ++KW V + R+G+FA
Sbjct: 163 MIDYDSKNFKHFYFMMLKNDSFIDATIKGSLARFCNHSCSPNAFVDKWVVGDKLRMGIFA 222
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
RSI GE +T+DY ++G + + CYCG ++C ++G K
Sbjct: 223 KRSISKGEEITFDYNVDRYGAQSQPCYCGEANCIKFMGGK 262
>gi|442570025|sp|Q1DU03.2|SET2_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|392865212|gb|EAS30985.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Coccidioides immitis RS]
Length = 1011
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C ++C N+ F++ + K+ ++KTE G+G+ A + EFI EYIGEVI++
Sbjct: 180 MECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFIFEYIGEVINE 239
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 240 PQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 299
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 300 RMGIFAERYIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 354
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 355 SHATIEALGI 364
>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
Length = 1179
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 74/172 (43%), Positives = 105/172 (61%), Gaps = 4/172 (2%)
Query: 256 CSKACHCSETCNNRPFRKEKKIKIVK--TEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
CS C + C NR F K ++K T++ G G++A + I KG FIIEYIGE+I
Sbjct: 970 CSSKCLNGDKCQNRRFTKRHYPPLLKFQTDWGGNGLKAKKLIPKGSFIIEYIGEIISHDE 1029
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
RL + GV N+Y+ E+ IDA +GN +RF+NHSCDPNC ++ W V+G+TR+
Sbjct: 1030 SRIRLEESAKIGVTNYYILELDNLRMIDAGPRGNIARFINHSCDPNCGIDPWIVQGDTRI 1089
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRKIGKLE 423
G+F+ R I+ GE LT++Y+ Q E KC CG+ +C G++G K K K E
Sbjct: 1090 GIFSKRDIQEGEELTFNYQLQQSSDEGKTKCLCGSKNCAGFIGDKVKNEKSE 1141
>gi|330946631|ref|XP_003306794.1| hypothetical protein PTT_20031 [Pyrenophora teres f. teres 0-1]
gi|311315535|gb|EFQ85104.1| hypothetical protein PTT_20031 [Pyrenophora teres f. teres 0-1]
Length = 961
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 104/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C C N+ F++++ + ++KTE G+G+ A + + G+F+ EYIGEVID+
Sbjct: 186 MECVGDCSCGRRCQNQRFQRKQYADVTVIKTEKKGFGLRANKNMVPGDFVFEYIGEVIDE 245
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 246 RTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKL 305
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F R+++AGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 306 RMGIFVERNVRAGEELVFNYNVDRYGADPQPCYCGEPNCSGFIGGK 351
>gi|295665754|ref|XP_002793428.1| histone-lysine N-methyltransferase [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226278342|gb|EEH33908.1| histone-lysine N-methyltransferase [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 1001
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F++ + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 182 MECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADSDLRPHQFIFEYIGEVINE 241
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A +R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 242 ASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 301
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 302 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 356
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 357 SNATIEALGI 366
>gi|156390320|ref|XP_001635219.1| predicted protein [Nematostella vectensis]
gi|156222310|gb|EDO43156.1| predicted protein [Nematostella vectensis]
Length = 690
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 72/177 (40%), Positives = 105/177 (59%), Gaps = 5/177 (2%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEKKIKI--VKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+Q+ + C + C N+ F+K + + + GWG+ + + I KG F+IEY+GE+
Sbjct: 352 LQFECVASRCPAGDKCQNQRFQKRQYVDCEPFRAHSRGWGLRSKQAIKKGTFVIEYVGEL 411
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
IDDA C +R+ K N+YM I KD IDA GN SRF+NHSC PNC +KW V
Sbjct: 412 IDDATCRERV--KKGDDDTNYYMLTIDKDCIIDAGPMGNLSRFMNHSCYPNCETQKWTVN 469
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIGKLEL 424
GE RVG+F +R +++ E LT+DY G E KC+CG+ +C G+LG + K ++
Sbjct: 470 GEVRVGLFTSRDVESQEELTFDYCLDCHGNEKKKCHCGSQNCSGFLGVRPKTQNAQM 526
>gi|320041043|gb|EFW22976.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 972
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C ++C N+ F++ + K+ ++KTE G+G+ A + EFI EYIGEVI++
Sbjct: 141 MECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFIFEYIGEVINE 200
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 201 PQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 260
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 261 RMGIFAERYIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 315
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 316 SHATIEALGI 325
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1926 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKIDIKKGEFVNEYVGELIDEEECRAR 1985
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1986 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 2045
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 2046 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 2087
>gi|320169023|gb|EFW45922.1| ASHH3 [Capsaspora owczarzaki ATCC 30864]
Length = 885
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 67/164 (40%), Positives = 103/164 (62%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
K C C C N+ F R+ +++ +TE G G+ E + G+F+IEY+G+V+ + +
Sbjct: 180 KRCPCGTYCTNQRFQNRQYPAMEVFRTEKKGNGLMVLEDLAPGQFLIEYVGDVVHNREFK 239
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R R +FY + D IDAT +G+ SRF+NHSC+PNC +KW V+ RVG+
Sbjct: 240 KRTKSYHERQYDHFYFMTLSSDEVIDATVRGSISRFINHSCEPNCETQKWVVDRRIRVGI 299
Query: 376 FAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRK 418
FA ++IKAG +T+DY+F +F E + CYCGA SC+G +G K++
Sbjct: 300 FAKKAIKAGTEITFDYKFERFSDEGQACYCGAPSCKGIIGGKKQ 343
>gi|328781326|ref|XP_003249962.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Apis mellifera]
Length = 1218
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 111/204 (54%), Gaps = 17/204 (8%)
Query: 251 VQYISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ + CS C CNN+ F R+ ++ T GWG+ + E I G+F+IEY+GE
Sbjct: 800 ILLVECSPGICPAGPKCNNQAFVRRQYPAMEPFHTIGRGWGLRSLEHIKAGQFVIEYVGE 859
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A ++RL K +NFY I + TIDA KGN SRF+NHSC PNC +KW V
Sbjct: 860 VIDEAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPKGNLSRFMNHSCSPNCETQKWTV 919
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK---------- 416
G+TR+G+FA I+ GE LT++Y G K C CGAS+C G++G K
Sbjct: 920 NGDTRIGLFALCDIEPGEELTFNYNLACDGETRKPCLCGASNCSGFIGLKVQKPQITTPS 979
Query: 417 ---RKIGKLELCWGSKRKRSSTAC 437
+KI K + KR R C
Sbjct: 980 IQQKKIEKFDKIKRQKRSRKHVLC 1003
>gi|327354316|gb|EGE83173.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 1159
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 110/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F++ + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 184 MECVGDCGCGDECENQRFQRRQYANVTVIKTEKKGYGLRADSDLRPNQFIFEYIGEVINE 243
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 244 GNFRKRMRSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 303
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 304 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 358
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 359 SNATIEALGI 368
>gi|226291164|gb|EEH46592.1| histone-lysine N-methyltransferase [Paracoccidioides brasiliensis
Pb18]
Length = 980
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F++ + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 181 MECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADSDLRPHQFIFEYIGEVINE 240
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A +R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 241 ASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 300
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 301 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 355
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 356 SNATIEALGI 365
>gi|406867230|gb|EKD20268.1| histone H3 lysine 36 (K36) methyltransferase [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 1038
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/184 (37%), Positives = 110/184 (59%), Gaps = 8/184 (4%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C C N+ F++++ + + KTE G+G+ A ++ +FI EYIGEVI++ +R
Sbjct: 192 CNCGSGCQNQRFQQKQYADVSVFKTEKKGYGLRANVDLDANDFIFEYIGEVINEPTFRRR 251
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+FA
Sbjct: 252 TVQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFA 311
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSSTA 436
R+IKAGE L ++Y ++G + + CYCG ++C GY+G GK + +K ++
Sbjct: 312 ERNIKAGEELVFNYNVDRYGADPQPCYCGEANCTGYIG-----GKTQTERATKLSHATIE 366
Query: 437 CLAI 440
L I
Sbjct: 367 ALGI 370
>gi|350855153|emb|CCD58126.1| huntingtin interacting protein-related [Schistosoma mansoni]
Length = 887
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 102/171 (59%), Gaps = 8/171 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVKTEFC-----GWGVEAAEPINKGEFIIEYIGEV 308
I C C + C+NR F + ++ F GWG+ A + + KG F+IEY+GEV
Sbjct: 257 IECGLHCAAGDFCSNRQF--QMRLYAPTRPFYAGKDKGWGLMATDNVKKGSFVIEYVGEV 314
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID + +R+ + G + Y + D IDA KGN++RF+NHSC+PNC+ +KW V
Sbjct: 315 IDFSEFRRRIRRYERLGHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVN 374
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFG-PEVKCYCGASSCQGYLGTKRK 418
GE R+G FA I +G+ +T DY+FVQ+G E KCYCGAS+C G +G K
Sbjct: 375 GEIRIGFFAKEDIPSGQEVTIDYQFVQYGVSEQKCYCGASTCSGIMGATSK 425
>gi|156051364|ref|XP_001591643.1| hypothetical protein SS1G_07089 [Sclerotinia sclerotiorum 1980]
gi|154704867|gb|EDO04606.1| hypothetical protein SS1G_07089 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 971
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 111/186 (59%), Gaps = 8/186 (4%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
K C+C C N+ F++ + K+ ++KT+ G+G+ A ++ +FI EYIGEVI++
Sbjct: 187 KDCNCGVACQNQRFQRRQYAKVSVIKTDKKGYGLRANTDLHPDDFIFEYIGEVINEPTFR 246
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+
Sbjct: 247 RRTIQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGI 306
Query: 376 FAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSS 434
FA R+IKAGE L ++Y ++G + + CYCG +C G++G GK + +K ++
Sbjct: 307 FAERAIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKLSHAT 361
Query: 435 TACLAI 440
L I
Sbjct: 362 IEALGI 367
>gi|225679434|gb|EEH17718.1| histone-lysine N-methyltransferase [Paracoccidioides brasiliensis
Pb03]
Length = 976
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F++ + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 181 MECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADSDLRPHQFIFEYIGEVINE 240
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A +R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 241 ASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 300
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 301 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 355
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 356 SNATIEALGI 365
>gi|213406581|ref|XP_002174062.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212002109|gb|EEB07769.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 779
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 67/165 (40%), Positives = 105/165 (63%), Gaps = 4/165 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C+ + C+C +C N+ F+K+ + +++TE G+G+ A + KG F+ EYIGEVI
Sbjct: 134 IECTDENCYCGPSCQNQRFQKKMYADVDVIQTEKKGFGLRANSYLTKGTFVYEYIGEVIP 193
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ + RG+++FY ++K IDAT KG+ +RF NHSC PNC ++KW V +
Sbjct: 194 EVRFRKRMREYDERGIRHFYFMMLQKGEYIDATVKGSLARFCNHSCRPNCYVDKWVVGNK 253
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG 414
R+G+F R I+ GE LT+DY ++G + + CYCG C GY+G
Sbjct: 254 LRMGIFCKRDIQKGEELTFDYNVDRYGAQAQPCYCGEDCCLGYIG 298
>gi|119498557|ref|XP_001266036.1| SET and WW domain protein [Neosartorya fischeri NRRL 181]
gi|119414200|gb|EAW24139.1| SET and WW domain protein [Neosartorya fischeri NRRL 181]
Length = 967
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++++ + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 191 IECVGDCSCGAECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFIFEYVGEVINE 250
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A +R+ G+++FY + + +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 251 AQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 310
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R+I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 311 RMGIFAERTIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTDRATKL 365
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 366 SNATIEALGI 375
>gi|449300569|gb|EMC96581.1| hypothetical protein BAUCODRAFT_481348 [Baudoinia compniacensis
UAMH 10762]
Length = 992
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 101/166 (60%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C+ C C C N+ F +++ + ++KTE G+G+ + +FI EYIGEVI +
Sbjct: 187 MECAADCGCGAACQNQRFLRKQFADVTVIKTEKKGYGLRTNTNLKANDFIFEYIGEVIGE 246
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ +R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 247 NVFRRRMQQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKL 306
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R I+AGE L ++Y ++G E + CYCG +C GY+G K
Sbjct: 307 RMGIFAERKIQAGEELVFNYNVDRYGAEPQPCYCGEPNCTGYIGGK 352
>gi|357163489|ref|XP_003579748.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like
[Brachypodium distachyon]
Length = 517
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 78/185 (42%), Positives = 105/185 (56%), Gaps = 4/185 (2%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++VKTE GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 66 CRCGVYCKNQRFQKCQYARTRLVKTEGRGWGLLAEENIMAGQFVIEYCGEVISWKEAKRR 125
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+ Y+ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 126 SQAYEDQGLMEAYIIYLNTAESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 185
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTK-RKIGKLELCWGSKRKRSST 435
+ I G L+YDY F FG V+C CGA+SC G+LG K R + W R S
Sbjct: 186 KQDIPIGMELSYDYNFEWFGGAIVRCLCGAASCSGFLGAKSRGFQEATYLWEDDDDRFSV 245
Query: 436 ACLAI 440
+ I
Sbjct: 246 ENIPI 250
>gi|239609047|gb|EEQ86034.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
Length = 897
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 110/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F++ + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 184 MECVGDCGCGDECENQRFQRRQYANVTVIKTEKKGYGLRADSDLRPNQFIFEYIGEVINE 243
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 244 GNFRKRMRSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 303
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 304 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 358
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 359 SNATIEALGI 368
>gi|340726897|ref|XP_003401788.1| PREDICTED: hypothetical protein LOC100652142 [Bombus terrestris]
Length = 1777
Score = 140 bits (353), Expect = 1e-30, Method: Composition-based stats.
Identities = 67/164 (40%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + K ++ +TE G+G+ A + GEFI+EY+GEV+D
Sbjct: 798 IECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAGEFIMEYVGEVVDP 857
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R + +++Y ++ D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 858 KDFRRRAKEYSKDKNKHYYFMALKSDQIIDATLKGNVSRFINHSCDPNSETQKWTVNGEL 917
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F + I AGE +T+DY F ++G E KC+C A +C+G++G
Sbjct: 918 RIGFFNKKFIAAGEEITFDYHFQRYGKEAQKCFCEAPNCRGWIG 961
>gi|67903676|ref|XP_682094.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
gi|74592565|sp|Q5ASA5.1|SET2_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|40740923|gb|EAA60113.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
gi|259482964|tpe|CBF77940.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-36 specific (EC
2.1.1.43)(SET domain-containing protein 2)
[Source:UniProtKB/Swiss-Prot;Acc:Q5ASA5] [Aspergillus
nidulans FGSC A4]
Length = 980
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 110/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++ + + ++KTE G+G+ A E + +FI EY+GEVI++
Sbjct: 218 IECMGDCGCGPDCQNQRFQRREYANVAVIKTEKKGYGLRAEEDLRPHQFIFEYVGEVINE 277
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 278 GPFHRRMRQYDAEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 337
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 338 RMGIFAERHIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 392
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 393 SNATIEALGI 402
>gi|258571704|ref|XP_002544655.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237904925|gb|EEP79326.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1013
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C ++C N+ F++ + + ++KTE G+G+ + +FI EYIGEVI++
Sbjct: 181 MECVGDCGCGDSCQNQRFQQREYANVSVIKTEKKGYGLRTDSDLRPNQFIFEYIGEVINE 240
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ + G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 241 PQFRRRMINYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 300
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 301 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 355
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 356 SNATIEALGI 365
>gi|367049654|ref|XP_003655206.1| hypothetical protein THITE_2118632 [Thielavia terrestris NRRL 8126]
gi|347002470|gb|AEO68870.1| hypothetical protein THITE_2118632 [Thielavia terrestris NRRL 8126]
Length = 931
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C + C N+ F++++ K+ ++KTE G+G+ A + +F+ EY+GEVI++ R
Sbjct: 155 CNCGDGCQNQRFQRKQYAKVSVIKTEKKGFGLRADTDLQPNDFVYEYVGEVINEPTFRSR 214
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K+ +DAT KGN RF NHSC+PNC ++KW V + R+G+FA
Sbjct: 215 MLKYDKEGIKHFYFMSLTKNEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFA 274
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I+AGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 275 GRRIRAGEELVFNYNVDRYGADPQPCYCGEPNCVGFIGGK 314
>gi|70998620|ref|XP_754032.1| SET and WW domain protein [Aspergillus fumigatus Af293]
gi|74672926|sp|Q4WTT2.1|SET2_ASPFU RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|66851668|gb|EAL91994.1| SET and WW domain protein [Aspergillus fumigatus Af293]
Length = 966
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++++ + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 191 IECVGDCSCGAECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFIFEYVGEVINE 250
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A +R+ G+++FY + + +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 251 AQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 310
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R+I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 311 RMGIFAERAIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTDRATKL 365
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 366 SNATIEALGI 375
>gi|261189306|ref|XP_002621064.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239591641|gb|EEQ74222.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
Length = 897
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 110/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C + C N+ F++ + + ++KTE G+G+ A + +FI EYIGEVI++
Sbjct: 184 MECVGDCGCGDECENQRFQRRQYANVTVIKTEKKGYGLRADSDLRPNQFIFEYIGEVINE 243
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 244 GNFRKRMRSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 303
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 304 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 358
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 359 SNATIEALGI 368
>gi|358396308|gb|EHK45689.1| hypothetical protein TRIATDRAFT_241288 [Trichoderma atroviride IMI
206040]
Length = 931
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 114/191 (59%), Gaps = 9/191 (4%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS A +C C NR F++++ + ++KTE G+G+ A + +FI EYIGEVI+
Sbjct: 152 MECSANAGNCGGGCQNRRFQRKQYADVTVIKTEKKGFGLRANAMLEPNDFIYEYIGEVIN 211
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ G+++FY + K+ +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 212 EPTFRRRMLQYDDEGIKHFYFMSLNKNEFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDK 271
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSK 429
R+G+FA R I+AGE L ++Y ++G E + CYCG ++C G++G GK + +K
Sbjct: 272 LRMGIFALRKIQAGEELVFNYNVDRYGAEPQPCYCGETNCVGFIG-----GKTQTERATK 326
Query: 430 RKRSSTACLAI 440
++ L I
Sbjct: 327 LPTATVEALGI 337
>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
Length = 2363
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/177 (41%), Positives = 101/177 (57%), Gaps = 3/177 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + +++I +T GWG+ I KG+F+ EY+GEVID+ C
Sbjct: 1929 QVCPAGERCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECR 1988
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN +RF+NH C PNC +KW V G+TRVG+
Sbjct: 1989 SRIRHAQENDICNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVSGDTRVGL 2048
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIGKLELCWGSKRK 431
FA I AG LT++Y G + C CGA +C G+LG + K G KR+
Sbjct: 2049 FALVDIPAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNNPPSEDKGRKRR 2105
>gi|348667102|gb|EGZ06928.1| hypothetical protein PHYSODRAFT_565808 [Phytophthora sojae]
Length = 1540
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/168 (41%), Positives = 103/168 (61%), Gaps = 5/168 (2%)
Query: 253 YISCSKACHCSETCNNRPFRKEKK---IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
Y+ CS +C + C N+ + ++ ++ KTE G+GV + I + + EY+GE+I
Sbjct: 111 YVECSPSCAAGQYCRNQRLQHPERYPQLEPFKTEHKGYGVRTRQHIGQLSIVGEYVGEII 170
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D +RL + R NFY + IDA KG+F+RF+NHSC+PNC EKW V+G
Sbjct: 171 DQKELARRLKSVP-RHELNFYYLLLAPGVYIDARNKGSFTRFVNHSCEPNCKTEKWTVKG 229
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK 416
ETR+ V A R I+ GE LT+DY++ G ++KCYCGA SC+G +GT+
Sbjct: 230 ETRIAVIALRDIEVGEELTFDYQWKALGSRQIKCYCGAPSCKGVIGTQ 277
>gi|159126234|gb|EDP51350.1| SET and WW domain protein [Aspergillus fumigatus A1163]
Length = 966
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++++ + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 191 IECVGDCSCGPECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFIFEYVGEVINE 250
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A +R+ G+++FY + + +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 251 AQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 310
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R+I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 311 RMGIFAERAIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTDRATKL 365
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 366 SNATIEALGI 375
>gi|154292496|ref|XP_001546822.1| hypothetical protein BC1G_14887 [Botryotinia fuckeliana B05.10]
gi|347836524|emb|CCD51096.1| hypothetical protein [Botryotinia fuckeliana]
Length = 953
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 111/186 (59%), Gaps = 8/186 (4%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
K C+C C N+ F++++ K+ ++KT+ G+G+ A + +FI EYIGEVI++
Sbjct: 187 KDCNCGLDCQNQRFQRKQYAKVSVIKTDKKGYGLRANTDLQPDDFIFEYIGEVINEPTFR 246
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+
Sbjct: 247 RRTVQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGI 306
Query: 376 FAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSS 434
FA R+IKAGE L ++Y ++G + + CYCG +C G++G GK + +K ++
Sbjct: 307 FAERAIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKLPHAT 361
Query: 435 TACLAI 440
L I
Sbjct: 362 IEALGI 367
>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
Length = 1059
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 888 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 947
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 948 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1007
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1008 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1049
>gi|242793892|ref|XP_002482258.1| SET and WW domain protein [Talaromyces stipitatus ATCC 10500]
gi|218718846|gb|EED18266.1| SET and WW domain protein [Talaromyces stipitatus ATCC 10500]
Length = 891
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 112/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C + C N+ F++++ + ++KT G+G+ A I+ +FI EYIGEVI++
Sbjct: 186 IECLNDCSCGQDCQNQRFQRKEYANVTVIKTAKKGFGLRAESDIHPHQFIYEYIGEVINE 245
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN +RF NHSC+PNC ++KW V +
Sbjct: 246 VSFRRRMIQYDKEGIKHFYFMSLNKGEFVDATKKGNLARFCNHSCNPNCYVDKWVVGEKL 305
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 306 RMGIFAERYIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 360
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 361 SNATIEALGI 370
>gi|146419774|ref|XP_001485847.1| hypothetical protein PGUG_01518 [Meyerozyma guilliermondii ATCC
6260]
Length = 707
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 105/164 (64%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+K+C C + C N+ F+K++ + +++TE G+G+ A E IN+ FI EY+GEVI++
Sbjct: 87 VNKSCGCGDDCQNQRFQKKEYANVSVIQTELKGYGLRANEEINESGFIYEYVGEVINEES 146
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R+ + + +FY ++KD IDAT KG+ +RF NHSC PN ++KW V + R+
Sbjct: 147 FRKRMVEYDEKKFPHFYFMMLKKDSFIDATIKGSLARFCNHSCSPNAYVDKWVVGDKLRM 206
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I+AGE +T+DY ++G + + CYCG +C +G K
Sbjct: 207 GIFAKRLIQAGEEITFDYNVDRYGAQSQPCYCGEPNCIKVMGGK 250
>gi|412991390|emb|CCO16235.1| unnamed protein product [Bathycoccus prasinos]
Length = 825
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 102/171 (59%), Gaps = 8/171 (4%)
Query: 251 VQYISCSKACHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+Y ++ C C + C+NRPF+ + K+K+ TE G+G+ + + +G+FI+EY+GE+
Sbjct: 470 TRYTCDTRVCPCGDDCSNRPFQHLPQPKVKVQLTENRGYGLFLQQDVFEGDFIVEYMGEI 529
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW--Q 366
+D+ C +RL K + NFY+ EI IDA F GN +RF+N SC PNC ++W
Sbjct: 530 VDEEECTRRLLACKGKNEPNFYLMEITPSQIIDARFCGNNARFINSSCHPNCETQRWVDA 589
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPE----VKCYCGASSCQGYL 413
ETRVG+FA IK+G LTYDY F FG E C+CG C+G L
Sbjct: 590 STNETRVGIFATEDIKSGTELTYDYNFAHFGGEGTTSFTCFCGHPMCKGTL 640
>gi|413937237|gb|AFW71788.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
Length = 1815
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 99/166 (59%), Gaps = 5/166 (3%)
Query: 254 ISCSK-ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C K C C E C+N+ F++ K++ + G+G++ E + +G F+IEY+GEV+D
Sbjct: 1044 IECVKRTCPCGEQCSNQKFQRRSYAKLRWFHSGKKGYGLQLQEDVTEGRFLIEYVGEVLD 1103
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E R +G ++FY + IDA KGN RF+NHSC PNC EKW V GE
Sbjct: 1104 ITSYESRQRYYACKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCCTEKWMVNGE 1163
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLG 414
+G+FA RSIK GE LT+DY +V+ KC+CG + C+GYLG
Sbjct: 1164 VCIGIFALRSIKKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYLG 1209
>gi|190345522|gb|EDK37420.2| hypothetical protein PGUG_01518 [Meyerozyma guilliermondii ATCC
6260]
Length = 707
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 105/164 (64%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+K+C C + C N+ F+K++ + +++TE G+G+ A E IN+ FI EY+GEVI++
Sbjct: 87 VNKSCGCGDDCQNQRFQKKEYANVSVIQTELKGYGLRANEEINESGFIYEYVGEVINEES 146
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R+ + + +FY ++KD IDAT KG+ +RF NHSC PN ++KW V + R+
Sbjct: 147 FRKRMVEYDEKKFPHFYFMMLKKDSFIDATIKGSLARFCNHSCSPNAYVDKWVVGDKLRM 206
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I+AGE +T+DY ++G + + CYCG +C +G K
Sbjct: 207 GIFAKRLIQAGEEITFDYNVDRYGAQSQPCYCGEPNCIKVMGGK 250
>gi|356535766|ref|XP_003536414.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
max]
gi|34529091|dbj|BAC85636.1| unnamed protein product [Homo sapiens]
Length = 480
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 98/162 (60%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
CHC C N+ F+K K K+ KTE GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 66 CHCDILCKNQKFQKCEYAKTKLFKTEGRGWGLLADEDIKAGQFVIEYCGEVISWKEAKRR 125
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ ++ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 126 SQAYENQGLKDAFIIFLNVSESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFA 185
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
I G L YDY F F G +V+C CGA C G+LG K +
Sbjct: 186 KHDIPIGTELAYDYNFEWFGGAKVRCLCGALKCSGFLGAKSR 227
>gi|452985541|gb|EME85297.1| hypothetical protein MYCFIDRAFT_83386 [Pseudocercospora fijiensis
CIRAD86]
Length = 835
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 104/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C+C C N+ F++++ + +++TE G+G+ A + +FI EYIGEVI +
Sbjct: 106 MECVGDCNCGIACQNQRFQRKQYANVSVIQTEKKGYGLRANTDLRANDFIFEYIGEVIGE 165
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ +++ +G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 166 NVFRRKMQQYDEQGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKL 225
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R IKAGE L ++Y ++G E + CYCG +C G++G K
Sbjct: 226 RMGIFAERKIKAGEELVFNYNVDRYGAEPQPCYCGEPNCTGFIGGK 271
>gi|170039780|ref|XP_001847701.1| huntingtin interacting protein [Culex quinquefasciatus]
gi|167863380|gb|EDS26763.1| huntingtin interacting protein [Culex quinquefasciatus]
Length = 2379
Score = 139 bits (351), Expect = 2e-30, Method: Composition-based stats.
Identities = 67/164 (40%), Positives = 101/164 (61%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C N+ F+K + ++ +TE G+G++A+ I GEFI+EY+GEV++
Sbjct: 1307 IECGSRCVVGERCTNKRFQKLEYANTQVFRTEKKGFGIQASTEIAPGEFIMEYVGEVLNS 1366
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
++R +++Y +R D IDAT KGN SRF+NHSCD N +KW V GE
Sbjct: 1367 EQFDERAELYSKEKNKHYYFMALRSDAIIDATTKGNISRFINHSCDANAETQKWTVNGEL 1426
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ + I GE +T+DY+F ++G + KCYC A +C G++G
Sbjct: 1427 RIGFFSTKYILPGEEVTFDYQFQRYGRKAQKCYCEADNCTGWIG 1470
>gi|336266160|ref|XP_003347849.1| hypothetical protein SMAC_06682 [Sordaria macrospora k-hell]
gi|380091782|emb|CCC10510.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 907
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C C N+ F++++ + ++KTE G+G+ A + +FI EYIGEVI++ R
Sbjct: 165 CNCGAGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDFIFEYIGEVINEPTFRSR 224
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+FA
Sbjct: 225 MIKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFA 284
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+IKAGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 285 GRAIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIGGK 324
>gi|242061944|ref|XP_002452261.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
gi|241932092|gb|EES05237.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
Length = 1840
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 101/171 (59%), Gaps = 5/171 (2%)
Query: 254 ISCSK-ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C+K C C E C+N+ F++ K++ + G+G++ E + +G F+IEY+GEV+D
Sbjct: 1107 IECAKRTCPCGEQCSNQKFQRRSYAKLRWFYSGKKGYGLQLQEDVTEGRFLIEYVGEVLD 1166
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E R +G ++FY + IDA KGN RF+NHSC PNC EKW V GE
Sbjct: 1167 ITSYESRQRYYASKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGE 1226
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLGTKRKI 419
+G+F+ R+IK GE LT+DY +V+ KC+CG + C+GYLG I
Sbjct: 1227 VCIGIFSLRNIKKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYLGGDVSI 1277
>gi|342872314|gb|EGU74695.1| hypothetical protein FOXB_14795 [Fusarium oxysporum Fo5176]
Length = 919
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 109/167 (65%), Gaps = 4/167 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS +A +C+ C N+ F++++ + ++KTE G+G+ A + +F+ EYIGEVI+
Sbjct: 142 MECSAEAGNCAGGCQNQRFQRKQYANVSVIKTEKKGFGLRADSDLQANDFVFEYIGEVIN 201
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ G+++FY + K +DAT KGN+ RF NHSC+PNC ++KW V +
Sbjct: 202 EPTFRRRMMQYDEEGIKHFYFMSLNKSEFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDK 261
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA+R I++GE L ++Y ++G + + CYCG ++C G++G K
Sbjct: 262 LRMGIFASRKIQSGEELVFNYNVDRYGADPQPCYCGEANCVGFIGGK 308
>gi|413937236|gb|AFW71787.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
Length = 1756
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/161 (42%), Positives = 97/161 (60%), Gaps = 4/161 (2%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C C E C+N+ F++ K++ + G+G++ E + +G F+IEY+GEV+D E
Sbjct: 990 RTCPCGEQCSNQKFQRRSYAKLRWFHSGKKGYGLQLQEDVTEGRFLIEYVGEVLDITSYE 1049
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R +G ++FY + IDA KGN RF+NHSC PNC EKW V GE +G+
Sbjct: 1050 SRQRYYACKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGI 1109
Query: 376 FAARSIKAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLG 414
FA RSIK GE LT+DY +V+ KC+CG + C+GYLG
Sbjct: 1110 FALRSIKKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYLG 1150
>gi|384500869|gb|EIE91360.1| hypothetical protein RO3G_16071 [Rhizopus delemar RA 99-880]
Length = 883
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 64/168 (38%), Positives = 107/168 (63%), Gaps = 4/168 (2%)
Query: 251 VQYISC-SKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ ++ C ++ C C C NR F+ + ++ +++TE G+G+ A ++ FI+EYIGE
Sbjct: 222 MMFMECIAQDCPCGRLCRNRRFQLGQYARVDVIRTEKKGYGLRALTDLSSNSFIMEYIGE 281
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VI R + +G +++Y ++ D IDAT KG +RF+NHSC PNC+ +KW +
Sbjct: 282 VITQNEFLHRTREYDAQGFKHYYFMTLKNDEIIDATRKGCLARFMNHSCRPNCVTQKWVI 341
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
+ R+G+F +R+IKAGE LT+DY+F ++G KC+CG +C+G++G
Sbjct: 342 GKKMRIGIFTSRNIKAGEELTFDYKFERYGAVAQKCFCGEVNCKGFIG 389
>gi|345480373|ref|XP_001606723.2| PREDICTED: hypothetical protein LOC100123115 [Nasonia vitripennis]
Length = 1746
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 101/166 (60%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + ++ +TE G+G+ A + G+FI+EY+GEV+D
Sbjct: 822 IECGSRCVVGDRCTNKRFQNCEYANCEVFRTEKKGFGLRATTNLEAGDFIMEYVGEVLDP 881
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R + +++Y ++ D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 882 KDFRKRAKEYSKDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDPNAETQKWTVNGEL 941
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
R+G F + + AGE +T+DY F ++G E KC+C A++C+G++G K
Sbjct: 942 RIGFFNKKFVAAGEEITFDYHFQRYGKEAQKCFCEATNCRGWIGDK 987
>gi|121712644|ref|XP_001273933.1| SET and WW domain protein [Aspergillus clavatus NRRL 1]
gi|119402086|gb|EAW12507.1| SET and WW domain protein [Aspergillus clavatus NRRL 1]
Length = 968
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 110/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++ + + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 191 IECVGDCSCGANCQNQRFQRREYANVAVIKTEKKGFGLRAETDLQPHQFIFEYVGEVINE 250
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 251 GHFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 310
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R+I+AGE L ++Y ++G + + CYCG +C G++G GK + +K
Sbjct: 311 RMGIFAERAIRAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GKTQTERATKL 365
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 366 SNATIEALGI 375
>gi|224063022|ref|XP_002300966.1| SET domain protein [Populus trichocarpa]
gi|222842692|gb|EEE80239.1| SET domain protein [Populus trichocarpa]
Length = 605
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 96/160 (60%), Gaps = 4/160 (2%)
Query: 259 ACHCSETCNNRPFRKEKKIKIV--KTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C C + C+N+ F+K K+ + G+G+ E I +G+F+IEY+GEV+D E
Sbjct: 109 TCPCGDLCSNQQFQKHNYAKMTWDRCGKKGFGLRLEEDITRGQFLIEYVGEVLDVHAYEA 168
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R + +G ++FY + IDA KGN RF+NHSCDPNC EKW V GE +G+F
Sbjct: 169 RQKEYASKGHKHFYFMTLDGSEVIDACVKGNLGRFINHSCDPNCRTEKWVVNGEICIGLF 228
Query: 377 AARSIKAGEPLTYDYRFVQF--GPEVKCYCGASSCQGYLG 414
A R IK GE +T+DY +V+ +CYCG+ CQGY+G
Sbjct: 229 ALRDIKKGEEVTFDYNYVRVVGAAAKRCYCGSPQCQGYIG 268
>gi|425766575|gb|EKV05181.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
[Penicillium digitatum PHI26]
gi|425781728|gb|EKV19675.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
[Penicillium digitatum Pd1]
Length = 887
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C+ C C C N+ F+K++ + ++KTE G+G+ A ++ GE I EY+GEV+ +
Sbjct: 154 IECAGDCGCGSDCQNQRFQKQQFAPVSVIKTEKKGFGLRAERNLDPGELIYEYVGEVVGE 213
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT +GN RF NHSC+PNC ++KW V +
Sbjct: 214 QQFRKRMRQYDEEGIKHFYFMSLNKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGEKL 273
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R+++AGE L ++Y ++G + + CYCG C G++G G+ + +K
Sbjct: 274 RMGIFAERAVQAGEELVFNYNVDRYGADPQPCYCGEPMCTGFIG-----GRTQTERATKL 328
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 329 SNATIEALGI 338
>gi|378732919|gb|EHY59378.1| histone-lysine N-methyltransferase SETD2 [Exophiala dermatitidis
NIH/UT8656]
Length = 1034
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 108/190 (56%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C+K C C C N+ F + + ++KTE G+G+ + +FI EYIGE I +
Sbjct: 203 MECAKDCGCGPACQNQRFLRRLWANVSVIKTEKKGYGLRTNVDLRPHDFIFEYIGETIPE 262
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K IDAT KGN RF NHSC+PNC ++KW V +
Sbjct: 263 NTFRKRMRQYDEEGIKHFYFMSLSKGEFIDATKKGNLGRFCNHSCNPNCYVDKWVVGDKL 322
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG S+C G++G GK + G+K
Sbjct: 323 RMGIFAERYIKAGEELVFNYNVDRYGADPQPCYCGESNCTGFIG-----GKTQTERGTKL 377
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 378 SAATIEALGI 387
>gi|169596857|ref|XP_001791852.1| hypothetical protein SNOG_01198 [Phaeosphaeria nodorum SN15]
gi|111069727|gb|EAT90847.1| hypothetical protein SNOG_01198 [Phaeosphaeria nodorum SN15]
Length = 940
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 256 CSKACHCSETCNN-RPFRKE-KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+ C C C N R FRK+ + ++KT G+G+ A + + G+F+ EYIGEVID+
Sbjct: 186 VADGCSCGTKCQNQRFFRKQYADVTVIKTAKKGFGLRANQDMFPGDFVFEYIGEVIDERT 245
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+
Sbjct: 246 FRRRMIQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRM 305
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKRKR 432
G+F R +KAGE L ++Y ++G + + CYCG +C GY+G GK + +K
Sbjct: 306 GIFVERKVKAGEELVFNYNVDRYGADPQPCYCGEPNCTGYIG-----GKTQSDNATKLSH 360
Query: 433 SSTACLAI 440
++ L I
Sbjct: 361 ATIEALGI 368
>gi|147816102|emb|CAN75267.1| hypothetical protein VITISV_028162 [Vitis vinifera]
Length = 460
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C CS C N+ F+K + K K+ +TE GWG+ A E I G FIIEY GEVI +R
Sbjct: 143 CPCSVHCKNQRFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARER 202
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+G+ + Y+ + IDAT G+ +RF+NHSC+PNC KW V GE R+G+FA
Sbjct: 203 SLAYASQGINDAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFA 262
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
R I G LTYDY F + G +V C CGA+SC G+LG K +
Sbjct: 263 MRDISIGTELTYDYNFQWYGGAKVHCLCGATSCCGFLGAKSR 304
>gi|359493195|ref|XP_003634538.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 413
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/167 (44%), Positives = 100/167 (59%), Gaps = 4/167 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C+ + C CS C N+ F+K + K K+ + E GWG+ A E I GEF++EY GEVI
Sbjct: 90 IECTPRYCPCSIHCKNQRFQKREYAKTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVIS 149
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
R +G+++ Y+ + IDAT KGN +RF+NHSC PNC KW V GE
Sbjct: 150 RTEARGRSQVYVSQGLKDVYIIPLNARECIDATKKGNLARFINHSCQPNCETMKWSVLGE 209
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTK 416
RVG+FA R+I G LTY Y F + G +V+C CGA+ C G+LG K
Sbjct: 210 DRVGIFALRNISVGTELTYSYNFEWYSGAKVRCLCGATRCSGFLGGK 256
>gi|358381100|gb|EHK18776.1| hypothetical protein TRIVIDRAFT_57632 [Trichoderma virens Gv29-8]
Length = 924
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 112/191 (58%), Gaps = 9/191 (4%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS A +C C NR F++++ + ++KTE G+G+ + +FI EYIGEVI+
Sbjct: 152 MECSADAGNCGGGCQNRRFQRKQYADVTVIKTEKKGFGLRTNSALEPNDFIYEYIGEVIN 211
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ G+++FY + K+ +DAT KGN RF NHSC PNC ++KW V +
Sbjct: 212 EPTFRRRMLQYDEEGIKHFYFMSLNKNEFVDATKKGNLGRFCNHSCSPNCFVDKWVVGDK 271
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSK 429
R+G+FA R I+AGE L ++Y ++G E + CYCG ++C G++G GK + +K
Sbjct: 272 LRMGIFALRKIRAGEELVFNYNVDRYGAEPQPCYCGETNCVGFIG-----GKTQTERATK 326
Query: 430 RKRSSTACLAI 440
++ L I
Sbjct: 327 LPTTTVEALGI 337
>gi|357149500|ref|XP_003575133.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like
[Brachypodium distachyon]
Length = 1022
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/170 (43%), Positives = 103/170 (60%), Gaps = 13/170 (7%)
Query: 254 ISCSK-ACHCSETCNNRPFRKEKKIKIVKTEFC----GWGVEAAEPINKGEFIIEYIGEV 308
I C+K C C E C+N+ F++ KI + FC G+G++ E + +G F+IEY+GEV
Sbjct: 222 IECTKRTCPCGEYCSNQQFQRRSYAKI--SWFCSGKKGFGLQLKEEVTEGRFLIEYVGEV 279
Query: 309 IDDAL--CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
+D C QR + K G ++FY + IDA KGN RF+NHSC PNC EKW
Sbjct: 280 LDITAYECRQRYYASK--GQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWM 337
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLG 414
V GE +G+FA R+IK GE LT+DY +V+ KC+CG + C+GY+G
Sbjct: 338 VNGEVCIGIFAMRNIKKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYIG 387
>gi|449446403|ref|XP_004140961.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Cucumis
sativus]
Length = 497
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 76/164 (46%), Positives = 102/164 (62%), Gaps = 4/164 (2%)
Query: 266 CNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKY 323
C N+ F+K K K+ KTE GWG+ A E I G+FIIEY GEVI ++R +
Sbjct: 72 CRNQRFQKCEYAKTKLFKTEGRGWGLLADENIKNGQFIIEYCGEVISWKEAKRRSHTYEN 131
Query: 324 RGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKA 383
+G+++ Y+ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA + I
Sbjct: 132 QGLKDAYIISLNASESIDATRKGSLARFINHSCFPNCETRKWNVLGEIRVGIFAKQDISI 191
Query: 384 GEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRKIGKLELCW 426
G L YDY F + G +V+C CGASSC G+LG K + G LE +
Sbjct: 192 GTELAYDYNFEWYGGAKVRCLCGASSCSGFLGAKSR-GFLEDTY 234
>gi|359493197|ref|XP_003634539.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 413
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C CS C N+ F+K + K K+ +TE GWG+ A E I G FIIEY GEVI +R
Sbjct: 96 CPCSVHCKNQRFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARER 155
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+G+ + Y+ + IDAT G+ +RF+NHSC+PNC KW V GE R+G+FA
Sbjct: 156 SLAYASQGINDAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFA 215
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
R I G LTYDY F + G +V C CGA+SC G+LG K +
Sbjct: 216 MRDISIGTELTYDYNFQWYGGAKVHCLCGATSCCGFLGAKSR 257
>gi|296081207|emb|CBI18233.3| unnamed protein product [Vitis vinifera]
Length = 370
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/167 (44%), Positives = 100/167 (59%), Gaps = 4/167 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C+ + C CS C N+ F+K + K K+ + E GWG+ A E I GEF++EY GEVI
Sbjct: 47 IECTPRYCPCSIHCKNQRFQKREYAKTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVIS 106
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
R +G+++ Y+ + IDAT KGN +RF+NHSC PNC KW V GE
Sbjct: 107 RTEARGRSQVYVSQGLKDVYIIPLNARECIDATKKGNLARFINHSCQPNCETMKWSVLGE 166
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTK 416
RVG+FA R+I G LTY Y F + G +V+C CGA+ C G+LG K
Sbjct: 167 DRVGIFALRNISVGTELTYSYNFEWYSGAKVRCLCGATRCSGFLGGK 213
>gi|224084984|ref|XP_002307459.1| SET domain protein [Populus trichocarpa]
gi|222856908|gb|EEE94455.1| SET domain protein [Populus trichocarpa]
Length = 594
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 98/161 (60%), Gaps = 6/161 (3%)
Query: 259 ACHCSETCNNRPFRKEKKIKIVKTEFCG---WGVEAAEPINKGEFIIEYIGEVIDDALCE 315
C C + C+N+ F+K K+ E CG +G+ E I++G+F+IEY+GEV+D E
Sbjct: 143 TCPCGDHCSNQQFQKRNYAKMT-WERCGKKGFGLRLDEDISRGQFLIEYVGEVLDVHAYE 201
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R D +G ++FY + IDA KGN RF+NHSCDPNC EKW V GE +G+
Sbjct: 202 ARQKDYASKGHKHFYFMTLDGSEVIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGL 261
Query: 376 FAARSIKAGEPLTYDYRFVQF--GPEVKCYCGASSCQGYLG 414
FA R IK GE +T+DY +V+ +CYCG+ C+GY+G
Sbjct: 262 FALRDIKMGEEVTFDYNYVRVVGAAAKRCYCGSPQCRGYIG 302
>gi|281206847|gb|EFA81031.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 1363
Score = 139 bits (349), Expect = 4e-30, Method: Composition-based stats.
Identities = 68/168 (40%), Positives = 103/168 (61%), Gaps = 6/168 (3%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIV--KTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
+I C + C + C N+ F++++ I + GWG++A E I+ +F+IEY GEVI
Sbjct: 958 FIECEPSNCKLGKHCKNQRFQRQEYALIAPFNAKKKGWGLKAKEKISAHQFVIEYCGEVI 1017
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
A R+ + G + FY + +DA+ KGN +RF+NHSCDPNC +KW V+G
Sbjct: 1018 TRAQSMDRMREAD--GEKYFYFLTLDSKEVLDASRKGNLARFINHSCDPNCETQKWSVDG 1075
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
ETR+G+FA + I+AG LT+DY + + G + CYCG+ +C+ YLG K
Sbjct: 1076 ETRIGIFALKDIEAGTELTFDYNYERVGSSKQSCYCGSVNCREYLGNK 1123
>gi|116207846|ref|XP_001229732.1| hypothetical protein CHGG_03216 [Chaetomium globosum CBS 148.51]
gi|121787978|sp|Q2H988.1|SET2_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|88183813|gb|EAQ91281.1| hypothetical protein CHGG_03216 [Chaetomium globosum CBS 148.51]
Length = 894
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 104/167 (62%), Gaps = 4/167 (2%)
Query: 254 ISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C S C+C E C N+ F++++ + ++KTE G+G+ + +F+ EY+GEVI+
Sbjct: 104 IECVSGDCNCGEGCENQRFQRKQYANVSVIKTEKKGFGLRTDADLQANDFVFEYVGEVIN 163
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ R G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 164 EPTFRNRTVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDK 223
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I+AGE L ++Y ++G + + CYCG S+C G++G K
Sbjct: 224 LRMGIFATRAIRAGEELVFNYNVDRYGADPQPCYCGESNCVGFIGGK 270
>gi|297842509|ref|XP_002889136.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
lyrata]
gi|297334977|gb|EFH65395.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
lyrata]
Length = 1766
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/185 (37%), Positives = 107/185 (57%), Gaps = 18/185 (9%)
Query: 248 PPPVQYISCSK--------------ACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEA 291
PPP + C + C + C+N+ F+K K +K + + G+G+
Sbjct: 983 PPPDGRLGCGEECLNRMLNIECLQGTCPAGDLCSNQQFQKRKYVKFERFQSGKKGYGLRL 1042
Query: 292 AEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRF 351
E + +G+F+IEY+GEV+D + R + +G ++FY + + IDA KGN RF
Sbjct: 1043 LEDVREGQFLIEYVGEVLDMQSYDTRQKEYACKGQKHFYFMTLNGNEVIDAGAKGNLGRF 1102
Query: 352 LNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQ-FGPEV-KCYCGASSC 409
+NHSC+PNC EKW V GE VG+F+ + +K G+ LT+DY +V+ FG KCYCG+S C
Sbjct: 1103 INHSCEPNCRTEKWMVNGEICVGIFSMKDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHC 1162
Query: 410 QGYLG 414
+GY+G
Sbjct: 1163 RGYIG 1167
>gi|296081208|emb|CBI18234.3| unnamed protein product [Vitis vinifera]
Length = 862
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C CS C N+ F+K + K K+ +TE GWG+ A E I G FIIEY GEVI +R
Sbjct: 58 CPCSVHCKNQRFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARER 117
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+G+ + Y+ + IDAT G+ +RF+NHSC+PNC KW V GE R+G+FA
Sbjct: 118 SLAYASQGINDAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFA 177
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
R I G LTYDY F + G +V C CGA+SC G+LG K +
Sbjct: 178 MRDISIGTELTYDYNFQWYGGAKVHCLCGATSCCGFLGAKSR 219
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 92/162 (56%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K+ +TE GWG+ A E I G F+IEY GEVI R
Sbjct: 481 CRCGLFCKNQRFQKCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGR 540
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
G+++ ++ + IDAT KG+ RF+NHSC PNC KW V GE RVG+FA
Sbjct: 541 SQVYASLGLKDAFIISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFA 600
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
+ I G L Y+Y F + G +V+C CGA SC G+LG K +
Sbjct: 601 KQDISIGTELAYNYNFEWYGGAKVRCLCGAISCSGFLGAKSR 642
>gi|344302485|gb|EGW32759.1| hypothetical protein SPAPADRAFT_54781 [Spathaspora passalidarum
NRRL Y-27907]
Length = 718
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 105/170 (61%), Gaps = 4/170 (2%)
Query: 251 VQYISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V + C ++ C C C N+ F+K + + + +TE G+G+ A I +G+FI EY GE
Sbjct: 86 VTSVECMNRHCLCGNDCQNQRFQKREYAAVSVFQTELKGYGLRADLEIGEGQFIYEYTGE 145
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A QR+ + + ++FY ++ D IDAT +G+ +RF+NHSC+PN ++KW V
Sbjct: 146 VIDEATFRQRMVEYDQKSFKHFYFMMLKSDSFIDATVRGSLARFVNHSCNPNAYVDKWVV 205
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
+ R+G+FA R I GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 206 GDKLRMGIFAKRKIARGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGK 255
>gi|358253063|dbj|GAA51760.1| histone-lysine N-methyltransferase NSD1/2 [Clonorchis sinensis]
Length = 1596
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 69/168 (41%), Positives = 93/168 (55%), Gaps = 4/168 (2%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE---KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ Y C + C N+ F K ++ GWG++ PI G F+ EYIG+
Sbjct: 1022 LHYECLPSVCPNGDACQNQRFTKRLYPRQRPFWTGSERGWGLKTLVPIKAGSFVNEYIGD 1081
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
+ID+ +RL V N+YM ++ IDA KGN SRF+NH CDPN +KW V
Sbjct: 1082 LIDEEEANRRLRFAHENNVTNYYMMKLDAQRIIDAGPKGNLSRFMNHCCDPNLNTQKWTV 1141
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLG 414
G+ R+G+FA R I AGE LT+DY FV G E + C CGA +C G+LG
Sbjct: 1142 NGDNRIGLFAVRDIAAGEELTFDYNFVALGQERLNCRCGAENCTGFLG 1189
>gi|296418221|ref|XP_002838740.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634701|emb|CAZ82931.1| unnamed protein product [Tuber melanosporum]
Length = 892
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 101/158 (63%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F++ + + ++KTE G+G+ A I+ F+ EY+GEVID+ +R+
Sbjct: 174 CGQDCQNQRFQQRQYADVSVIKTEKKGFGLRANSDISINTFLYEYVGEVIDENKFRRRME 233
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
G+++FY + K+ IDAT KG +RF NHSC+PNC ++KW V + R+G+FA R
Sbjct: 234 KYDSEGIKHFYFMSLGKNEFIDATKKGGLARFCNHSCNPNCFIDKWVVGEKLRMGIFAKR 293
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
++KAGE L +DY ++G E + CYCG +C GY+G K
Sbjct: 294 NVKAGEELVFDYNVDRYGAEPQTCYCGEFNCLGYIGGK 331
>gi|255943807|ref|XP_002562671.1| Pc20g01100 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587406|emb|CAP85439.1| Pc20g01100 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 688
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C+ C C C N+ F+K + + ++KTE G+G+ A + ++ GE I EY+GEV+ +
Sbjct: 155 IECAGECGCGPDCQNQRFQKRQFAPVSVIKTEKKGFGLRAEKNLDPGELIYEYVGEVVGE 214
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT +GN RF NHSC+PNC ++KW V +
Sbjct: 215 QQFRKRMRQYDEEGIKHFYFMSLNKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGEKL 274
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R+++AGE L ++Y ++G + + CYCG C G++G G+ + +K
Sbjct: 275 RMGIFAERAVQAGEELVFNYNVDRYGADPQPCYCGEPMCTGFIG-----GRTQTERATKL 329
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 330 SNATIEALGI 339
>gi|255575537|ref|XP_002528669.1| set domain protein, putative [Ricinus communis]
gi|223531892|gb|EEF33708.1| set domain protein, putative [Ricinus communis]
Length = 495
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 101/162 (62%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + K ++ KTE GWG+ A E I G+FIIEY GEVI ++R
Sbjct: 66 CRCGIFCKNQRFQKCEYFKTRLFKTEGRGWGLLADEDIKAGQFIIEYCGEVISWKEAKRR 125
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ ++ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 126 SQAYERQGLKDAFIISLNSSESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFA 185
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
+ I G L YDY F + G +V+C CG++SC G+LG K +
Sbjct: 186 KQDISIGTELAYDYNFEWYGGAKVRCLCGSASCSGFLGAKSR 227
>gi|256084142|ref|XP_002578291.1| SET domain protein [Schistosoma mansoni]
Length = 1746
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 62/132 (46%), Positives = 84/132 (63%), Gaps = 1/132 (0%)
Query: 286 GWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFK 345
GWG++ I GEF+ EYIG++ID+ +RL + N+YM ++ IDA K
Sbjct: 1038 GWGLKTMIAIRAGEFVNEYIGDLIDEDEANRRLRFAHENNITNYYMMKLDSQRIIDAGPK 1097
Query: 346 GNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYC 404
GN SRF+NHSCDPN +KW V G+ R+G+FA R I GE LT++Y FV G E + C C
Sbjct: 1098 GNLSRFMNHSCDPNLNTQKWTVNGDNRIGLFAVRDISVGEELTFNYNFVALGQERLNCRC 1157
Query: 405 GASSCQGYLGTK 416
GAS+C G+LG +
Sbjct: 1158 GASNCVGFLGAR 1169
>gi|195012609|ref|XP_001983710.1| GH16034 [Drosophila grimshawi]
gi|193897192|gb|EDV96058.1| GH16034 [Drosophila grimshawi]
Length = 2059
Score = 138 bits (347), Expect = 6e-30, Method: Composition-based stats.
Identities = 65/166 (39%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C N+ F++ + ++ +TE G G+ A + GEFI+EY+GEVID
Sbjct: 1135 IECGPLCTNGERCTNKRFQQHQGWPCRVFRTEKKGCGITAELQMPSGEFIMEYVGEVIDS 1194
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R D IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1195 EEFERRQHLYSEDRNRHYYFMALRSDSIIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1254
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
R+G F+ ++I GE +T+DY++ ++G + +CYC +++C+G++G +
Sbjct: 1255 RIGFFSLKTIMPGEEITFDYQYQRYGRDAQRCYCESANCRGWIGNE 1300
>gi|452836869|gb|EME38812.1| hypothetical protein DOTSEDRAFT_83490 [Dothistroma septosporum
NZE10]
Length = 1236
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 68/165 (41%), Positives = 106/165 (64%), Gaps = 15/165 (9%)
Query: 265 TCNNRPF----RKEKK-------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
TC+NRPF R+ KK +++VKT+ G+G+ + + I+EY GE+I +
Sbjct: 587 TCSNRPFAEIDRRLKKGGSFDIGVEVVKTDKRGFGIRSTRSFRPDQIIMEYTGEIISEGE 646
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
C++R+ + +Y+ N+Y+ E+ + ID T KG+ +RF+NH+C+PNC ++ ++V G R+
Sbjct: 647 CQRRMRE-EYKDKPNYYLMELERGLVIDGT-KGSMARFINHACEPNCTVKMFRVNGVARM 704
Query: 374 GVFAARS-IKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
GVFA +S I GE LTYDY F FG + CYCG ++C+GYLG K
Sbjct: 705 GVFAGKSGIMTGEELTYDYNFDNFGESRQNCYCGTTNCRGYLGPK 749
>gi|350421470|ref|XP_003492853.1| PREDICTED: hypothetical protein LOC100746901 [Bombus impatiens]
Length = 1777
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 67/164 (40%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + K ++ +TE G+G+ A + GEFI+EY+GEV+D
Sbjct: 798 IECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAGEFIMEYVGEVVDP 857
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R + +++Y ++ D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 858 KDFRRRAKEYSKDKNKHYYFMALKSDQIIDATLKGNVSRFINHSCDPNSETQKWTVNGEL 917
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F + I AGE +T+DY F ++G E KC+C A +C+G++G
Sbjct: 918 RIGFFNKKFIAAGEEITFDYHFQRYGKEAQKCFCEAPNCRGWIG 961
>gi|395325049|gb|EJF57478.1| hypothetical protein DICSQDRAFT_111705 [Dichomitus squalens
LYAD-421 SS1]
Length = 866
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 98/160 (61%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++++ I+IVKTE G+G+ AA I K FI EY+G+V+ +R
Sbjct: 169 CRCRGYCRNQRFQRKQYAPIEIVKTEMKGFGLRAAADIPKDTFIYEYVGDVVSQPSFIKR 228
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY ++KD IDAT +G RF NHSC+PNC + KW V R+G+FA
Sbjct: 229 MRQYAEEGIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCYVAKWTVGSHVRMGIFA 288
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R IK E LT++Y ++G E + CYCG +C GY+G K
Sbjct: 289 NRVIKKDEELTFNYNVDRYGHEAQPCYCGEPNCVGYIGGK 328
>gi|195392836|ref|XP_002055060.1| GJ19006 [Drosophila virilis]
gi|194149570|gb|EDW65261.1| GJ19006 [Drosophila virilis]
Length = 2101
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 67/164 (40%), Positives = 101/164 (61%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C N+ F+ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1039 IECGPLCTNGERCTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDS 1098
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1099 EEFERRQHIYSRDRNRHYYFMALRGEAIIDATAKGNISRYINHSCDPNAETQKWTVNGEL 1158
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ ++I GE +T+DY++ ++G + +CYC AS+C+G++G
Sbjct: 1159 RIGFFSVKTIMPGEEITFDYQYQRYGRDAQRCYCEASNCRGWIG 1202
>gi|414587222|tpg|DAA37793.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
Length = 489
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/185 (40%), Positives = 107/185 (57%), Gaps = 4/185 (2%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++V+T GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 44 CRCGVYCKNQRFQKCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRR 103
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +++ Y+ + D +IDAT KGN +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 104 AQAYETQCLKDAYIIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFA 163
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK-RKIGKLELCWGSKRKRSST 435
++I G L+YDY F +G V+C CGA+SC G+LG K R + W R S
Sbjct: 164 KQNIPFGTELSYDYNFEWYGGVMVRCLCGAASCSGFLGAKSRGFQEATYLWEDDDDRFSV 223
Query: 436 ACLAI 440
+ +
Sbjct: 224 ENIPL 228
>gi|357627347|gb|EHJ77076.1| hypothetical protein KGM_14526 [Danaus plexippus]
Length = 1912
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 79/202 (39%), Positives = 106/202 (52%), Gaps = 16/202 (7%)
Query: 233 KIDLTWKDLME-----NKVGP--PPVQ------YISCSKACHCSETCNNRPFRKE--KKI 277
KIDL L + N V P P Q C C + CNNR F K ++
Sbjct: 1606 KIDLEESSLTQCECDPNDVDPCGPYTQCLNRMLLTECGPTCRAGDRCNNRAFEKRLYPRL 1665
Query: 278 KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKD 337
+T GWG+ + + G+F+IEY+GE+ID+ +R+ +NFY + K+
Sbjct: 1666 GPYRTPHRGWGLRTMQDLRAGQFVIEYVGELIDEEEFRRRMNRKHEVRDENFYFLTLDKE 1725
Query: 338 FTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFG 397
IDA KGN +RF+NHSC+PNC +KW V G+ RVG+FA R I A LT++Y G
Sbjct: 1726 RMIDAGPKGNLARFMNHSCEPNCETQKWTVLGDVRVGLFALRDIPANSELTFNYNLETSG 1785
Query: 398 PEVK-CYCGASSCQGYLGTKRK 418
E K C CGA C GY+G K K
Sbjct: 1786 IEKKRCMCGAKRCSGYIGAKPK 1807
>gi|414587221|tpg|DAA37792.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
Length = 503
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/185 (40%), Positives = 107/185 (57%), Gaps = 4/185 (2%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++V+T GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 58 CRCGVYCKNQRFQKCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRR 117
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +++ Y+ + D +IDAT KGN +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 118 AQAYETQCLKDAYIIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFA 177
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK-RKIGKLELCWGSKRKRSST 435
++I G L+YDY F +G V+C CGA+SC G+LG K R + W R S
Sbjct: 178 KQNIPFGTELSYDYNFEWYGGVMVRCLCGAASCSGFLGAKSRGFQEATYLWEDDDDRFSV 237
Query: 436 ACLAI 440
+ +
Sbjct: 238 ENIPL 242
>gi|322709340|gb|EFZ00916.1| histone H3 lysine 36 (K36) methyltransferase [Metarhizium
anisopliae ARSEF 23]
Length = 894
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 101/164 (61%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+ +C C N+ F+++ + ++KT+ G+G+ P+ +FI EY+GEVI++
Sbjct: 151 VASGSNCGGGCQNQRFQRKLWADVAVIKTDKKGYGLRTESPLQPNDFIYEYVGEVINEPT 210
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+
Sbjct: 211 FRRRMLQYDEEGIKHFYFMSLSKSEFVDATRKGNLGRFCNHSCNPNCYVDKWVVGDKLRM 270
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I+AGE L ++Y ++G E + CYCG S+C G++G K
Sbjct: 271 GIFALRKIQAGEELVFNYNVDRYGAEPQPCYCGESNCVGFIGGK 314
>gi|162463380|ref|NP_001105665.1| SET domain-containing protein SET102 [Zea mays]
gi|22121720|gb|AAM89289.1| SET domain-containing protein SET102 [Zea mays]
gi|414587223|tpg|DAA37794.1| TPA: SET domain-containing protein SET102 [Zea mays]
Length = 513
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/185 (40%), Positives = 107/185 (57%), Gaps = 4/185 (2%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + ++V+T GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 68 CRCGVYCKNQRFQKCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRR 127
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +++ Y+ + D +IDAT KGN +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 128 AQAYETQCLKDAYIIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFA 187
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK-RKIGKLELCWGSKRKRSST 435
++I G L+YDY F +G V+C CGA+SC G+LG K R + W R S
Sbjct: 188 KQNIPFGTELSYDYNFEWYGGVMVRCLCGAASCSGFLGAKSRGFQEATYLWEDDDDRFSV 247
Query: 436 ACLAI 440
+ +
Sbjct: 248 ENIPL 252
>gi|194766778|ref|XP_001965501.1| GF22528 [Drosophila ananassae]
gi|190619492|gb|EDV35016.1| GF22528 [Drosophila ananassae]
Length = 2414
Score = 137 bits (346), Expect = 8e-30, Method: Composition-based stats.
Identities = 65/164 (39%), Positives = 103/164 (62%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1401 IECGPLCSNGDRCTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDS 1460
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1461 EEFERRQHLYSRDRKRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1520
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ ++I+ GE +T+DY++ ++G + +CYC A++C+G++G
Sbjct: 1521 RIGFFSVKTIQPGEEITFDYQYQRYGRDAQRCYCEATNCRGWIG 1564
>gi|390596531|gb|EIN05933.1| hypothetical protein PUNSTDRAFT_54761 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 813
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 68/172 (39%), Positives = 105/172 (61%), Gaps = 5/172 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F R+ I+IV+TE G+G+ A E I K +FI EY+G+V+ ++R
Sbjct: 147 CRCGAHCRNQRFNRREYAPIEIVQTEKKGFGLRAREDIRKDQFIYEYVGDVVSHPSFKKR 206
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY ++KD IDAT +G RF NHSC PNC + KW V R+G+FA
Sbjct: 207 MREYAQEGIRHFYFMMLQKDEYIDATKRGGIGRFANHSCSPNCYVAKWTVGTHVRMGIFA 266
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKLELCW 426
+R I+ E LT++Y ++G + + CYCG +C G+LG T+ IG ++ +
Sbjct: 267 SRHIRQHEELTFNYNVDRYGHDAQPCYCGEPNCVGFLGGKTQTDIGAMDDLY 318
>gi|328790605|ref|XP_003251435.1| PREDICTED: hypothetical protein LOC100578450 [Apis mellifera]
Length = 1394
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 67/164 (40%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + K ++ +TE G+G+ A + GEFI+EY+GEV+D
Sbjct: 415 IECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAGEFIMEYVGEVVDP 474
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R + +++Y ++ D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 475 KDFRRRAKEYSKDKNKHYYFMALKSDQIIDATMKGNVSRFINHSCDPNSETQKWTVNGEL 534
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F + I AGE +T+DY F ++G E KC+C A +C+G++G
Sbjct: 535 RIGFFNKKFIAAGEEITFDYHFQRYGKEAQKCFCEAPNCRGWIG 578
>gi|380019005|ref|XP_003693408.1| PREDICTED: uncharacterized protein LOC100869667 [Apis florea]
Length = 1392
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 67/164 (40%), Positives = 100/164 (60%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFR--KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + K ++ +TE G+G+ A + GEFI+EY+GEV+D
Sbjct: 415 IECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAGEFIMEYVGEVVDP 474
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R + +++Y ++ D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 475 KDFRRRAKEYSKDKNKHYYFMALKSDQIIDATMKGNVSRFINHSCDPNSETQKWTVNGEL 534
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F + I AGE +T+DY F ++G E KC+C A +C+G++G
Sbjct: 535 RIGFFNKKFIAAGEEITFDYHFQRYGKEAQKCFCEAPNCRGWIG 578
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 95/160 (59%), Gaps = 3/160 (1%)
Query: 264 ETCNNRPFRKEKKIKI--VKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDM 321
E C N+ F+K +K GWG+ A I KG+F+ EY+GE++D+ C +R+
Sbjct: 1220 EKCQNQRFQKRAYPDSCQMKVSHRGWGLVAMVDIKKGDFVNEYVGELVDEEECRRRIKQA 1279
Query: 322 KYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSI 381
+ +FY + KD IDA KGN SRF+NHSC PNC +KW V G+TRVG+FA R+I
Sbjct: 1280 HEENITDFYFLTLDKDRIIDAGPKGNLSRFMNHSCQPNCETQKWTVNGDTRVGLFAIRNI 1339
Query: 382 KAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIG 420
AG ++++Y G E K C CGA +C G++G + K
Sbjct: 1340 AAGNEISFNYNLDCLGNEKKRCECGAPNCSGFIGVRPKTA 1379
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 31/66 (46%), Gaps = 15/66 (22%)
Query: 131 CSVRGCGGVYHFICVKERLGIS--NPRNFKCPQHACFIC-----------RQRLQWRCVR 177
C V CG YH C++ R ++ + R F CP HAC C R RL RCVR
Sbjct: 894 CHVPVCGKYYHEGCIR-RFPLTRFDSRGFTCPLHACVACFADNPKSTKASRGRL-MRCVR 951
Query: 178 CTIASH 183
C A H
Sbjct: 952 CPTAYH 957
>gi|353232109|emb|CCD79464.1| putative set domain protein [Schistosoma mansoni]
Length = 1503
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 62/132 (46%), Positives = 84/132 (63%), Gaps = 1/132 (0%)
Query: 286 GWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFK 345
GWG++ I GEF+ EYIG++ID+ +RL + N+YM ++ IDA K
Sbjct: 1038 GWGLKTMIAIRAGEFVNEYIGDLIDEDEANRRLRFAHENNITNYYMMKLDSQRIIDAGPK 1097
Query: 346 GNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYC 404
GN SRF+NHSCDPN +KW V G+ R+G+FA R I GE LT++Y FV G E + C C
Sbjct: 1098 GNLSRFMNHSCDPNLNTQKWTVNGDNRIGLFAVRDISVGEELTFNYNFVALGQERLNCRC 1157
Query: 405 GASSCQGYLGTK 416
GAS+C G+LG +
Sbjct: 1158 GASNCVGFLGAR 1169
>gi|50553372|ref|XP_504097.1| YALI0E18260p [Yarrowia lipolytica]
gi|74633562|sp|Q6C5G5.1|SET2_YARLI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|49649966|emb|CAG79692.1| YALI0E18260p [Yarrowia lipolytica CLIB122]
Length = 768
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 109/186 (58%), Gaps = 8/186 (4%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C + C N+ F+ +K + ++ TE G+G+ A + I GEF+ EY+GEVID+ ++R
Sbjct: 75 VRCCKGCQNKRFQGKKYASVDVISTEKKGFGLRATKDIAAGEFVYEYVGEVIDEPTFKER 134
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+GV++FY ++K IDAT KG RF NHSC PN +EKW V R+G+FA
Sbjct: 135 TAIYTTQGVKHFYFMMLQKGEFIDATAKGGLGRFCNHSCAPNGHVEKWVVGKRLRMGIFA 194
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSSTA 436
+R I+ GE +T+DY ++G E + CYCG +C G+LG GK + SK + TA
Sbjct: 195 SRHIQRGEEVTFDYNVDRYGAEAQACYCGEKNCVGFLG-----GKTQTESASKVSGTLTA 249
Query: 437 CLAIIT 442
L + +
Sbjct: 250 ALGLTS 255
>gi|355708046|gb|AES03147.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
furo]
Length = 261
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 69/161 (42%), Positives = 96/161 (59%), Gaps = 3/161 (1%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C
Sbjct: 100 VCPAGGRCQNQCFTKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRA 159
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+F
Sbjct: 160 RIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLF 219
Query: 377 AARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK 416
A IKAG LT++Y G + C CGA +C G+LG +
Sbjct: 220 ALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVR 260
>gi|409043202|gb|EKM52685.1| hypothetical protein PHACADRAFT_211907 [Phanerochaete carnosa
HHB-10118-sp]
Length = 841
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 99/160 (61%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEKK--IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++++ I+IV+TE G+G+ A + + K FI EY+G+V++ +R
Sbjct: 171 CRCRSHCQNQRFQRKQHAPIEIVQTEKKGFGLRAGQDLRKDTFIYEYVGDVVNHPSLLKR 230
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+Q+FY ++KD IDAT +G RF NHSC+PNC + KW V R+G+FA
Sbjct: 231 MRQYGEEGIQHFYFMALQKDEFIDATKRGGIGRFANHSCNPNCYVAKWSVGKRVRMGIFA 290
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+IK E LT++Y ++G E + CYCG C G++G K
Sbjct: 291 NRNIKKNEELTFNYNVDRYGHEPQMCYCGEDKCVGFIGGK 330
>gi|323451944|gb|EGB07819.1| hypothetical protein AURANDRAFT_6525 [Aureococcus anophagefferens]
Length = 160
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/163 (44%), Positives = 100/163 (61%), Gaps = 10/163 (6%)
Query: 254 ISCSKA-CHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C+ A C + C NR F + K+ + K + CGWG+ A EPI K F++E +GE+I
Sbjct: 1 IECNPATCPMGDKCQNRCFAAKLGSKVSVEKADQCGWGLFAGEPIAKRAFVVEALGELIS 60
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIR----KDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
+ ++RL + G +++YM K IDAT KGN +RF NHSCDP+C LEKW+
Sbjct: 61 EEEAQERLATARANGDEDYYMLAASDAATKGLVIDATRKGNEARFANHSCDPSCRLEKWR 120
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSC 409
+ R G+FA RS+K GE LTYDYR+ F +CYCGA++C
Sbjct: 121 CGSQDRYGIFALRSVKPGEQLTYDYRWASF---ERCYCGAANC 160
>gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti]
gi|108873380|gb|EAT37605.1| AAEL010414-PA [Aedes aegypti]
Length = 1480
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/183 (41%), Positives = 104/183 (56%), Gaps = 8/183 (4%)
Query: 258 KACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
K+C E C N+ F K + + GWG+ A E I +G+F+IEY+GEVI + E
Sbjct: 1201 KSCPAGELCQNQCFEKRQYPSLAARRIPQKGWGLVAQEDIRQGQFVIEYVGEVISNEELE 1260
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+RL + +N+Y + + TIDA KGN +RF+NHSC+PNC W V G VG+
Sbjct: 1261 RRLQHKVAQKDENYYFLTVDSELTIDAGPKGNLARFINHSCEPNCETMLWTVGGAQSVGL 1320
Query: 376 FAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGS----KR 430
FA IKAGE LT++Y F E K C+C AS C G++G K + LE GS KR
Sbjct: 1321 FAIMDIKAGEELTFNYNFESKSDEKKVCHCNASKCSGFIGQKYR-PPLESASGSASTGKR 1379
Query: 431 KRS 433
++S
Sbjct: 1380 RKS 1382
>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Crassostrea gigas]
Length = 1079
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/162 (41%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C E C+N+ F++ + KTE GWG+ I KG+F+ EY+GE+ID+ ++R
Sbjct: 694 CQCGEKCHNQRFQRREYPDCTPFKTEGRGWGLRTNVDIKKGQFVHEYVGELIDEEEVKRR 753
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + + N+YM + K+ IDA KGN SRF+NHSC PNC +KW G+ RVG+FA
Sbjct: 754 IDESHENNISNYYMLTLDKNRVIDAGPKGNLSRFMNHSCAPNCETQKWTANGDVRVGLFA 813
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
I AG LT++Y G + KC CGA C G+LG + K
Sbjct: 814 IYDIPAGTELTFNYNLECLGNDKTKCNCGAELCSGFLGVRPK 855
>gi|195171947|ref|XP_002026763.1| GL27000 [Drosophila persimilis]
gi|194111702|gb|EDW33745.1| GL27000 [Drosophila persimilis]
Length = 944
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 30 IECGPLCTNGDRCTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPAGEFIMEYVGEVIDS 89
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT +GN SR++NHSCDPN +KW V GE
Sbjct: 90 EEFERRQHRYSKDRNRHYYFMALRGEAIIDATMRGNISRYINHSCDPNAETQKWTVNGEL 149
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
R+G F+ ++I GE +T+DY++ ++G + +CYC A++C+G++GT+
Sbjct: 150 RIGFFSLKNILPGEEITFDYQYQRYGRDAQRCYCEAANCRGWIGTE 195
>gi|255725552|ref|XP_002547705.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240135596|gb|EER35150.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 782
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 103/164 (62%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+K C C E C N+ F+K+ K+ + +TE G+G+++ E IN+ +FI EYIGEVI++
Sbjct: 126 INKHCSCGEDCENQRFQKKLYSKVSVFQTELKGYGLKSEEIINENQFIYEYIGEVINETD 185
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R+ + ++FY + D IDAT KG +RF NHSC+PN ++KW V + R+
Sbjct: 186 FRKRMIEYDLENYKHFYFMMLDHDSFIDATKKGCLARFANHSCNPNAYVDKWVVGDKLRM 245
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I+ GE +T+DY ++G + + CYCG +C ++G K
Sbjct: 246 GIFAKRRIEKGEEITFDYNVDRYGAQSQPCYCGEPNCIKFMGGK 289
>gi|297842431|ref|XP_002889097.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
lyrata]
gi|297334938|gb|EFH65356.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
lyrata]
Length = 525
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K++K E GWG+ A E I G+FIIEY GEVI ++R
Sbjct: 72 CPCGVYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEDIKAGQFIIEYCGEVISWKEAKRR 131
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ GV++ Y+ + IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 132 AQTYETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFA 191
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
SI L YDY F + G +V+C CGA +C G+LG K +
Sbjct: 192 KESISPRTELAYDYNFEWYGGAKVRCLCGAVACSGFLGAKSR 233
>gi|392563363|gb|EIW56542.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 785
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/160 (41%), Positives = 99/160 (61%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++++ I+IVKTE G+G+ A E I K FI EY+G+V+ +R
Sbjct: 106 CRCRSYCRNQRFQRKQYAPIEIVKTEMKGFGLRAGENIPKDAFIYEYVGDVVSQPSFLKR 165
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY ++KD IDAT +G RF NHSC+PNC + KW V R+G+FA
Sbjct: 166 MRQYAEEGIRHFYFMMLQKDEYIDATKRGGIGRFANHSCNPNCYVAKWTVGDHVRMGIFA 225
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+IK E LT++Y ++G + + CYCG +C G++G K
Sbjct: 226 NRTIKQHEELTFNYNVDRYGHDAQICYCGEPNCVGFIGGK 265
>gi|145479919|ref|XP_001425982.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393054|emb|CAK58584.1| unnamed protein product [Paramecium tetraurelia]
Length = 1384
Score = 137 bits (344), Expect = 1e-29, Method: Composition-based stats.
Identities = 71/162 (43%), Positives = 98/162 (60%), Gaps = 7/162 (4%)
Query: 260 CHCSETCNNRPFRKEKKIKI--VKTEFCGWGVEAAEPINKGEFIIEYIGEV--IDDALCE 315
C C+E C NR F+K + ++ G G+ A E I KG+FI++Y+GE+ I+ A
Sbjct: 112 CPCAEQCKNRRFQKHDDACVYPLRCGGKGMGLFAGERILKGQFIMQYVGEIFQINSAFGR 171
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R+ +Y Y+ ++ ID T KGN +RF+NHSC+PNCI EKW V GE +G+
Sbjct: 172 RRV--QEYSKSTCTYLMKLNNQEVIDPTSKGNLARFINHSCEPNCITEKWNVLGEVCIGI 229
Query: 376 FAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTK 416
FA R I E LT+DY+F F P KC CGA+ C+GYLG K
Sbjct: 230 FAIRDINEDEELTFDYQFDVFHTPLTKCLCGANKCKGYLGLK 271
>gi|353234454|emb|CCA66479.1| related to SET2-Histone methyltransferase [Piriformospora indica
DSM 11827]
Length = 799
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 107/193 (55%), Gaps = 20/193 (10%)
Query: 256 CSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINK---------------- 297
+ C C C N+ F+K + I +VKTE G+G+ AA PI +
Sbjct: 133 VEEECRCGVFCQNQRFQKRQYANIHVVKTEKKGYGLRAASPIKRYLIFSSADASTPHMVS 192
Query: 298 -GEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSC 356
EFI EYIGE+I++ E+R+ + G+++FY ++K IDAT KG F RF NHSC
Sbjct: 193 SDEFIYEYIGEIINERTLERRMDNYGDEGIEHFYFMMLQKGEFIDATKKGGFGRFANHSC 252
Query: 357 DPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGT 415
+PNC + +W V+G R+G+FA R I E LT++Y ++G + CYCG +C G++G
Sbjct: 253 NPNCYVARWVVDGGLRMGIFAKRDILKDEELTFNYNADRYGHAAQPCYCGEPNCVGFIGG 312
Query: 416 KRKIGKLELCWGS 428
K + + W +
Sbjct: 313 KTQTDVAGMDWTT 325
>gi|380492750|emb|CCF34378.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 951
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 63/161 (39%), Positives = 103/161 (63%), Gaps = 3/161 (1%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
A +C + C N+ F++++ + ++KTE G+G+ A + +FI EYIGEVI++ +
Sbjct: 158 AGNCGDGCQNQRFQRKEYANVSVIKTEKKGFGLRADVDLQANDFIFEYIGEVINEPTFRR 217
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+F
Sbjct: 218 RMVQYDDEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIF 277
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
++R IKAGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 278 SSRRIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIGGK 318
>gi|24641786|ref|NP_572888.2| Set2, isoform A [Drosophila melanogaster]
gi|22832197|gb|AAF48273.2| Set2, isoform A [Drosophila melanogaster]
Length = 2362
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 102/164 (62%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1388 IECGPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDS 1447
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1448 EEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1507
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ + I+ GE +T+DY+++++G + +CYC A++C+G++G
Sbjct: 1508 RIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEAANCRGWIG 1551
>gi|281360813|ref|NP_001162740.1| Set2, isoform B [Drosophila melanogaster]
gi|118582047|sp|Q9VYD1.2|C1716_DROME RecName: Full=Probable histone-lysine N-methyltransferase CG1716
gi|92109778|gb|ABE73213.1| LD27386p [Drosophila melanogaster]
gi|272506087|gb|ACZ95275.1| Set2, isoform B [Drosophila melanogaster]
Length = 2313
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 102/164 (62%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1339 IECGPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDS 1398
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1399 EEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1458
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ + I+ GE +T+DY+++++G + +CYC A++C+G++G
Sbjct: 1459 RIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEAANCRGWIG 1502
>gi|340506525|gb|EGR32648.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 978
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/164 (45%), Positives = 101/164 (61%), Gaps = 7/164 (4%)
Query: 256 CSKACHCSETCNNRPFRKEKK--IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C+ +C C + C NR F+K + + T GWG+ A + I KG FII+YIGEV D
Sbjct: 27 CNLSCRCGDLCQNRRFQKHEDACVYPYPTSGKGWGLCAGQYIPKGTFIIQYIGEVFDINS 86
Query: 314 CE--QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E +R+ D Y Y+ +I K+ ID TFKGN +RF+NHSCDPNCI +KW V GE
Sbjct: 87 EEGIKRVKD--YSRSTCTYLMKIDKNEVIDPTFKGNLARFINHSCDPNCITQKWHVLGEI 144
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLG 414
+G+FA ++IK + LT+DY+F + P KC CG C+GYLG
Sbjct: 145 CIGIFAIKNIKEDDELTFDYQFDSYKTPLTKCLCGNVKCKGYLG 188
>gi|195352880|ref|XP_002042939.1| GM11634 [Drosophila sechellia]
gi|194126986|gb|EDW49029.1| GM11634 [Drosophila sechellia]
Length = 1965
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 102/164 (62%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1244 IECGPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDS 1303
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1304 EEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1363
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ + I+ GE +T+DY+++++G + +CYC A++C+G++G
Sbjct: 1364 RIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEATNCRGWIG 1407
>gi|328856222|gb|EGG05344.1| hypothetical protein MELLADRAFT_78094 [Melampsora larici-populina
98AG31]
Length = 1098
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 106/176 (60%), Gaps = 7/176 (3%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
K C C + C N+ F+K + I+IV TE G+GV + F+ EYIGEV+ + +
Sbjct: 264 KECRCLQMCQNQRFQKRQYAPIEIVATERKGFGVRLKSDVPADSFVYEYIGEVVGEKAFQ 323
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R+ + G+++FY +++++ IDAT KG RFLNHSC+PNC + KW V R+G+
Sbjct: 324 RRIKEYAQEGLKHFYFMQLQREEYIDATKKGGLGRFLNHSCNPNCYIGKWVVGRHLRMGI 383
Query: 376 FAARSIKAGEPLTYDY---RFVQFGPEVKCYCGASSCQGYLG--TKRKIGKLELCW 426
F R++K GE LT++Y R+ Q +C+CG + C G+LG T+ +G ++ +
Sbjct: 384 FTKRAVKGGEELTFNYNVDRYGQVYEAQECFCGEAQCVGFLGGKTQTDVGAMDELY 439
>gi|167526044|ref|XP_001747356.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774191|gb|EDQ87823.1| predicted protein [Monosiga brevicollis MX1]
Length = 734
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/172 (40%), Positives = 99/172 (57%), Gaps = 6/172 (3%)
Query: 254 ISCSKA-CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ C+ A C C C NR +K++ +++I KTE GWG+ A EPI KG+FI EY GEV D
Sbjct: 94 VECNVARCPCGNKCRNRRLQKQQHARVEIFKTEKKGWGLRALEPIRKGDFIYEYCGEVFD 153
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
A+ +R + G ++Y + D IDAT KG SRF+NHSCDPN +KW V G
Sbjct: 154 QAVFRERQLEYAQEGRFHYYFMSLSADTVIDATRKGAVSRFINHSCDPNAETQKWTVGGV 213
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKL 422
R+G F R I E +T+DY++ ++G + GA+ G R++ KL
Sbjct: 214 LRIGFFCIRDIAVNEEITFDYQYERYG---RRDSGATEASSAKGRDRELNKL 262
>gi|310793120|gb|EFQ28581.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 940
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 63/161 (39%), Positives = 103/161 (63%), Gaps = 3/161 (1%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
A +C + C N+ F++++ + ++KTE G+G+ A + +FI EYIGEVI++ +
Sbjct: 151 AGNCGDGCQNQRFQRKQYANVSVIKTEKKGFGLRANVDLQANDFIFEYIGEVINEPTFRR 210
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+F
Sbjct: 211 RMVQYDDEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIF 270
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
++R IKAGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 271 SSRLIKAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIGGK 311
>gi|15150415|gb|AAK84931.1| SD01656p [Drosophila melanogaster]
Length = 1443
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 102/164 (62%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 469 IECGPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDS 528
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 529 EEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGEL 588
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ + I+ GE +T+DY+++++G + +CYC A++C+G++G
Sbjct: 589 RIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCEAANCRGWIG 632
>gi|312072804|ref|XP_003139232.1| hypothetical protein LOAG_03647 [Loa loa]
gi|307765598|gb|EFO24832.1| hypothetical protein LOAG_03647 [Loa loa]
Length = 1422
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/167 (40%), Positives = 100/167 (59%), Gaps = 6/167 (3%)
Query: 253 YISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
Y C C C NR F ++ K+++ GWG+ AAEP+ G FIIEYIGEVID
Sbjct: 578 YTECGSRCPSGVGCANRRFHNKQYAKVEVFNAGVKGWGLRAAEPLEPGRFIIEYIGEVID 637
Query: 311 --DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R + + V ++ M ++ IDAT KGN SRF+NHSCDPNC +KW V+
Sbjct: 638 AEEMIRRGRRYGKDPKHVHHYLMA-LKNGAVIDATAKGNVSRFINHSCDPNCESQKWTVD 696
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
+ RVG F + I GE + +DY+ ++G + +C+CGA++C+G +G
Sbjct: 697 RQLRVGFFVIKPIALGEEIVFDYQLERYGRKAQRCFCGAANCRGRIG 743
>gi|315364634|pdb|3OOI|A Chain A, Crystal Structure Of Human Histone-Lysine
N-Methyltransferase Nsd1 Set Domain In Complex With
S-Adenosyl-L-Methionine
Length = 232
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/159 (43%), Positives = 95/159 (59%), Gaps = 3/159 (1%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C
Sbjct: 74 VCPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRA 133
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ + + NFYM + KD IDA KGN++RF+NH C PNC +KW V G+TRVG+F
Sbjct: 134 RIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLF 193
Query: 377 AARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLG 414
A IKAG LT++Y G + C CGA +C G+LG
Sbjct: 194 ALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLG 232
>gi|224082490|ref|XP_002306713.1| SET domain protein [Populus trichocarpa]
gi|222856162|gb|EEE93709.1| SET domain protein [Populus trichocarpa]
Length = 495
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 101/162 (62%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + K ++ KTE GWG+ A E I G+FIIEY GEVI ++R
Sbjct: 66 CPCGVYCKNQRFQKFEYAKTQLFKTEGRGWGLLADEEIKAGQFIIEYCGEVISWKEAKKR 125
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ ++ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 126 SQVYENQGLKDAFIISLNSTESIDATKKGSLARFINHSCQPNCETRKWTVLGEIRVGIFA 185
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
++I G L YDY F + G +V+C CGA +C G+LG K +
Sbjct: 186 KQNISIGTELAYDYNFEWYGGAKVRCLCGAVNCSGFLGAKSR 227
>gi|198467361|ref|XP_001354372.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
gi|198149208|gb|EAL31425.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
Length = 2918
Score = 136 bits (342), Expect = 2e-29, Method: Composition-based stats.
Identities = 65/166 (39%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F+ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1888 IECGPLCTNGDRCTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPAGEFIMEYVGEVIDS 1947
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT +GN SR++NHSCDPN +KW V GE
Sbjct: 1948 EEFERRQHRYSKDRNRHYYFMALRGEAIIDATMRGNISRYINHSCDPNAETQKWTVNGEL 2007
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
R+G F+ ++I GE +T+DY++ ++G + +CYC A++C+G++GT+
Sbjct: 2008 RIGFFSLKNILPGEEITFDYQYQRYGRDAQRCYCEAANCRGWIGTE 2053
>gi|171686378|ref|XP_001908130.1| hypothetical protein [Podospora anserina S mat+]
gi|170943150|emb|CAP68803.1| unnamed protein product [Podospora anserina S mat+]
Length = 919
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 100/160 (62%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+C C N+ F++++ + ++KT+ G+G+ A + +FI EYIGEVI++ R
Sbjct: 160 CNCGPGCQNQRFQRKQYADVSVIKTDKKGFGLRANRNLQPNDFIFEYIGEVINEPTFRNR 219
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+FA
Sbjct: 220 MIKYDREGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFA 279
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I+AGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 280 GRPIRAGEELVFNYNVDRYGADPQPCYCGEQNCVGFIGGK 319
>gi|145540026|ref|XP_001455703.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423511|emb|CAK88306.1| unnamed protein product [Paramecium tetraurelia]
Length = 1371
Score = 136 bits (342), Expect = 2e-29, Method: Composition-based stats.
Identities = 71/162 (43%), Positives = 98/162 (60%), Gaps = 7/162 (4%)
Query: 260 CHCSETCNNRPFRKEKKIKI--VKTEFCGWGVEAAEPINKGEFIIEYIGEV--IDDALCE 315
C C+E C NR F+K + ++ G G+ A E I KG+FI++Y+GE+ I+ A
Sbjct: 112 CPCAEQCKNRRFQKHDDACVYPLRCGGKGMGLFAGERILKGQFIMQYVGEIFQINSAFGR 171
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R+ +Y Y+ ++ ID T KGN +RF+NHSC+PNCI EKW V GE +G+
Sbjct: 172 RRV--QEYSKSTCTYLMKLNNQEVIDPTSKGNLARFINHSCEPNCITEKWNVLGEVCIGI 229
Query: 376 FAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTK 416
FA R I E LT+DY+F F P KC CGA+ C+GYLG K
Sbjct: 230 FAIRDINEDEELTFDYQFDVFHTPLTKCLCGANKCKGYLGLK 271
>gi|195566590|ref|XP_002106863.1| GD17127 [Drosophila simulans]
gi|194204255|gb|EDX17831.1| GD17127 [Drosophila simulans]
Length = 2246
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/188 (36%), Positives = 111/188 (59%), Gaps = 8/188 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1339 IECGPLCSNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDS 1398
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1399 EEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1458
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G F+ + I+ GE +T+DY+++++G + +CYC +++C+G++G G+ + G +
Sbjct: 1459 RIGFFSVKPIQPGEEITFDYQYLRYGRDAQRCYCESTNCRGWIG-----GEPDSDEGEQL 1513
Query: 431 KRSSTACL 438
+TA L
Sbjct: 1514 DEKATAML 1521
>gi|453087448|gb|EMF15489.1| hypothetical protein SEPMUDRAFT_161660 [Mycosphaerella populorum
SO2202]
Length = 966
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 97/166 (58%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C C NR F+ ++ + ++KTE G+G+ A + +FI EYIGEVI +
Sbjct: 183 MECVGDCGCGAGCQNRRFQHKQYANVSVIKTEKKGYGLRANTELQANDFIFEYIGEVIGE 242
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
RL G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 243 KTFRNRLHQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKL 302
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R I AGE L ++Y ++G + + CYC +C G++G K
Sbjct: 303 RMGIFAERKIHAGEELVFNYNVDRYGADPQPCYCDEPNCTGFIGGK 348
>gi|195376627|ref|XP_002047094.1| GJ13235 [Drosophila virilis]
gi|194154252|gb|EDW69436.1| GJ13235 [Drosophila virilis]
Length = 2005
Score = 136 bits (342), Expect = 2e-29, Method: Composition-based stats.
Identities = 65/166 (39%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1050 IECGPLCTYGERCTNKRFQQHQGWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDS 1109
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1110 EEFERRQHLYSEDRNRHYYFMALRGEAIIDATTKGNISRYINHSCDPNAETQKWTVNGEL 1169
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
R+G F+ ++I GE +T+DY++ ++G + +CYC +++C+G++G +
Sbjct: 1170 RIGFFSVKTILPGEEITFDYQYQRYGRDAQRCYCESANCRGWIGNE 1215
>gi|195478285|ref|XP_002100470.1| GE17076 [Drosophila yakuba]
gi|194187994|gb|EDX01578.1| GE17076 [Drosophila yakuba]
Length = 2397
Score = 136 bits (342), Expect = 2e-29, Method: Composition-based stats.
Identities = 65/164 (39%), Positives = 101/164 (61%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1405 IECGPLCTNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDS 1464
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1465 EEFERRQHLYSKDRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1524
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ + I+ GE +T+DY++ ++G + +CYC A++C+G++G
Sbjct: 1525 RIGFFSVKPIQPGEEITFDYQYQRYGRDAQRCYCEAANCRGWIG 1568
>gi|195126250|ref|XP_002007587.1| GI12297 [Drosophila mojavensis]
gi|193919196|gb|EDW18063.1| GI12297 [Drosophila mojavensis]
Length = 1972
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1026 IECGPLCSYGDRCTNKRFQQHQGWPCRVFRTEKKGCGITAELQIQPGEFIMEYVGEVIDS 1085
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1086 EEFERRQHLYSEDRNRHYYFMALRGEAIIDATTKGNISRYINHSCDPNAETQKWTVNGEL 1145
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
R+G F+ ++I GE +T+DY++ ++G + +CYC +++C+G++G +
Sbjct: 1146 RIGFFSVKTIMPGEEITFDYQYQRYGRDAQRCYCESANCRGWIGNE 1191
>gi|402087032|gb|EJT81930.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 962
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 65/159 (40%), Positives = 101/159 (63%), Gaps = 3/159 (1%)
Query: 261 HCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRL 318
+C E C N+ F+ ++ K+ ++KTE G+G+ A ++ +F+ EYIGEVI + L +RL
Sbjct: 162 NCGEGCQNQRFQAKQYAKVSVIKTEKKGFGLRADTDLDANDFVFEYIGEVIGEELFRRRL 221
Query: 319 WDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA 378
+ +++FY + + +DAT KGN RF NHSC+PNC ++KW V + R+G+FA
Sbjct: 222 MKYDDQRLEHFYFMSLTRTEYVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFAL 281
Query: 379 RSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTK 416
R+IKAGE L ++Y ++G +C+CG SC G LG K
Sbjct: 282 RAIKAGEELCFNYNVDRYGANPQRCHCGEPSCSGTLGGK 320
>gi|317138698|ref|XP_001817081.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Aspergillus oryzae RIB40]
Length = 849
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++++ ++ ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 184 IECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFIYEYVGEVINE 243
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT +GN RF NHSC+PNC ++KW V +
Sbjct: 244 GQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGEKL 303
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G G+ + +K
Sbjct: 304 RMGIFAERDIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GRTQTERATKL 358
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 359 SNATIEALGI 368
>gi|6143888|gb|AAF04434.1|AC010718_3 unknown protein; 29143-26659 [Arabidopsis thaliana]
Length = 528
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K++K E GWG+ A E I G+FI+EY GEVI ++R
Sbjct: 72 CPCGVYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKR 131
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ GV++ Y+ + IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 132 AQTYETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFA 191
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
SI L YDY F + G +V+C CGA +C G+LG K +
Sbjct: 192 KESISPRTELAYDYNFEWYGGAKVRCLCGAVACSGFLGAKSR 233
>gi|342319484|gb|EGU11432.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
[Rhodotorula glutinis ATCC 204091]
Length = 844
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 104/172 (60%), Gaps = 5/172 (2%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + I+IV+TE G+GV A I+ F+ EY+GEVI +++
Sbjct: 276 CRCGRHCQNQRFQKRQYAPIEIVQTEKKGFGVRAGADISADTFVYEYVGEVIGPQPFQRK 335
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY + +D IDAT KG RFLNHSC+PNC++ KW V + R+G+F
Sbjct: 336 MKEYANEGIKHFYFMALDRDVFIDATKKGGKGRFLNHSCNPNCVVAKWTVGRKMRMGIFT 395
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKLELCW 426
R IK E LT++Y ++G + CYCG +C GY+G T+ +G ++ +
Sbjct: 396 KRDIKMHEELTFNYNVDRYGHVAQECYCGEPNCVGYIGGKTQTDLGGMDDLY 447
>gi|121806929|sp|Q2UTN6.1|SET2_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|83764935|dbj|BAE55079.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 965
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++++ ++ ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 202 IECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFIYEYVGEVINE 261
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT +GN RF NHSC+PNC ++KW V +
Sbjct: 262 GQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGEKL 321
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G G+ + +K
Sbjct: 322 RMGIFAERDIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GRTQTERATKL 376
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 377 SNATIEALGI 386
>gi|194895514|ref|XP_001978270.1| GG17783 [Drosophila erecta]
gi|190649919|gb|EDV47197.1| GG17783 [Drosophila erecta]
Length = 2384
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 101/164 (61%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1392 IECGPLCTNGARCTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDS 1451
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1452 EEFERRQHLYSKDRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1511
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ + I+ GE +T+DY++ ++G + +CYC A++C+G++G
Sbjct: 1512 RIGFFSVKPIQPGEEITFDYQYQRYGRDAQRCYCEAANCRGWIG 1555
>gi|383864320|ref|XP_003707627.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Megachile rotundata]
Length = 1302
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 79/204 (38%), Positives = 109/204 (53%), Gaps = 17/204 (8%)
Query: 251 VQYISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ + CS C CNN+ F R+ ++ T GWG+ + E I G+F+IEY+GE
Sbjct: 881 ILLVECSPGICPAGPKCNNQAFVRRQYPAMEPFHTVARGWGLRSLEFIKAGQFVIEYVGE 940
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A ++RL K +NFY I + IDA KGN SRF+NHSC PNC +KW V
Sbjct: 941 VIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPKGNLSRFMNHSCSPNCETQKWTV 1000
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK---------- 416
G+TR+G+FA I+ GE LT++Y G K C CGA +C G++G K
Sbjct: 1001 NGDTRIGLFALCDIEPGEELTFNYNLACDGETRKPCLCGAPNCSGFIGLKVQKPQATTPS 1060
Query: 417 ---RKIGKLELCWGSKRKRSSTAC 437
+K+ K + KR R C
Sbjct: 1061 IQQKKLEKFDKMKRQKRSRKHVPC 1084
>gi|134083984|emb|CAK49139.1| unnamed protein product [Aspergillus niger]
Length = 734
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 109/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++ + + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 197 IECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFIFEYVGEVINE 256
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 257 GQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 316
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G G+ + +K
Sbjct: 317 RMGIFAERHIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GRTQTERATKL 371
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 372 SNATIEALGI 381
>gi|350634064|gb|EHA22428.1| hypothetical protein ASPNIDRAFT_51069 [Aspergillus niger ATCC 1015]
Length = 962
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 109/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++ + + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 197 IECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFIFEYVGEVINE 256
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 257 GQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 316
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G G+ + +K
Sbjct: 317 RMGIFAERHIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GRTQTERATKL 371
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 372 SNATIEALGI 381
>gi|389633989|ref|XP_003714647.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
gi|351646980|gb|EHA54840.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
gi|440474559|gb|ELQ43296.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
gi|440479738|gb|ELQ60486.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
Length = 946
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 64/159 (40%), Positives = 100/159 (62%), Gaps = 3/159 (1%)
Query: 261 HCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRL 318
+C + C N+ F++++ + ++KTE G+G+ A + +F+ EYIGEVI + L RL
Sbjct: 161 NCGDGCQNQRFQRKQYANVSVIKTENKGYGLRADANLEPNDFVFEYIGEVIGEELFRSRL 220
Query: 319 WDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA 378
+ +++FY + + +DAT KGN RF NHSC+PNC ++KW V + R+G+FA
Sbjct: 221 MKYDTQRLEHFYFMSLTRTEYVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAM 280
Query: 379 RSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTK 416
R+IKAGE L ++Y ++G +CYCG S+C G LG K
Sbjct: 281 RAIKAGEELCFNYNVDRYGANPQRCYCGESNCSGILGGK 319
>gi|22330671|ref|NP_177797.2| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|42572135|ref|NP_974158.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|75243465|sp|Q84WW6.1|ASHH1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH1; AltName:
Full=ASH1 homolog 1; AltName: Full=Protein SET DOMAIN
GROUP 26
gi|25054844|gb|AAN71912.1| unknown protein [Arabidopsis thaliana]
gi|225898088|dbj|BAH30376.1| hypothetical protein [Arabidopsis thaliana]
gi|332197758|gb|AEE35879.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|332197759|gb|AEE35880.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
Length = 492
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K++K E GWG+ A E I G+FI+EY GEVI ++R
Sbjct: 72 CPCGVYCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKR 131
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ GV++ Y+ + IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 132 AQTYETHGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFA 191
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
SI L YDY F + G +V+C CGA +C G+LG K +
Sbjct: 192 KESISPRTELAYDYNFEWYGGAKVRCLCGAVACSGFLGAKSR 233
>gi|430813766|emb|CCJ28924.1| unnamed protein product [Pneumocystis jirovecii]
Length = 809
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 101/166 (60%), Gaps = 3/166 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C+ C+C E C NR F+ ++ I ++KT+ G+G+ A + G+FI EY+GEVID+
Sbjct: 184 VECTDDCNCGEDCQNRRFKLKQYSNIDVIKTKKKGYGIRANSDMESGQFIYEYVGEVIDE 243
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ +++FY +++ IDAT KG +RFLNHSC PNC ++KW V +
Sbjct: 244 RKFRKRMKIYANEDIKHFYFMMLQRGEYIDATKKGGLARFLNHSCSPNCYVDKWVVGTKL 303
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F R+I GE LT+DY ++G + CYC C G++G K
Sbjct: 304 RMGIFCKRNILKGEELTFDYNVDRYGNTAQPCYCEEPGCIGFIGGK 349
>gi|238503484|ref|XP_002382975.1| SET and WW domain protein [Aspergillus flavus NRRL3357]
gi|220690446|gb|EED46795.1| SET and WW domain protein [Aspergillus flavus NRRL3357]
gi|391863367|gb|EIT72678.1| clathrin coat binding protein [Aspergillus oryzae 3.042]
Length = 947
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 111/190 (58%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++++ ++ ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 184 IECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFIYEYVGEVINE 243
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT +GN RF NHSC+PNC ++KW V +
Sbjct: 244 GQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGEKL 303
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G G+ + +K
Sbjct: 304 RMGIFAERDIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GRTQTERATKL 358
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 359 SNATIEALGI 368
>gi|358373289|dbj|GAA89888.1| SET and WW domain protein [Aspergillus kawachii IFO 4308]
Length = 963
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 109/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++ + + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 198 IECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFIFEYVGEVINE 257
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 258 GQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 317
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G G+ + +K
Sbjct: 318 RMGIFAERHIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GRTQTERATKL 372
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 373 SNATIEALGI 382
>gi|325187666|emb|CCA22203.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 1414
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/181 (40%), Positives = 106/181 (58%), Gaps = 7/181 (3%)
Query: 253 YISCSKACHCSETCNNRPFRKEKK---IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
Y+ CS AC E C N+ ++ +K ++ KT G+G+ E I +G + EY+GE+I
Sbjct: 323 YVECSAACEAREFCQNKRLQQPEKFPSLEAFKTLKKGFGIRTKENIRQGTIVGEYVGEII 382
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D +RL + R NFY +R IDA +G+ +RF+NHSCDPNC EKW VEG
Sbjct: 383 DQNELNRRLCAIG-RHELNFYYLLLRPGVYIDARNRGSLTRFVNHSCDPNCKTEKWTVEG 441
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIGKLELCWGS 428
+TR+ V A R I+ G LT+DY++ G ++ C+C A++C G++G KI LE S
Sbjct: 442 DTRIAVVALRHIEFGTELTFDYQWKSLGSRQLACHCNAANCSGFIGG--KIDVLEDNIAS 499
Query: 429 K 429
K
Sbjct: 500 K 500
>gi|402217077|gb|EJT97159.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 711
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 99/160 (61%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++++ I+IV T+ G+G+ AA + FI EYIGEVI+ +R
Sbjct: 94 CRCRSHCQNQRFQRKQYADIEIVSTDKKGYGLRAATDLQPDSFIYEYIGEVINQTTFLKR 153
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY ++++ IDAT KG RF NHSC+PNC + KW V R+G+FA
Sbjct: 154 MHEYADEGIKHFYFMMLQREEYIDATKKGGIGRFANHSCNPNCYVAKWVVGPRVRMGIFA 213
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I GE LT++Y ++G + + CYCG ++C G++G K
Sbjct: 214 KREISRGEELTFNYNVDRYGHDAQPCYCGEANCVGFIGGK 253
>gi|357443579|ref|XP_003592067.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
gi|355481115|gb|AES62318.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
Length = 491
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K+ KTE GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 79 CPCDIHCKNQKFQKCEYAKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRR 138
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ ++ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 139 SHTYEIQGLKDAFIISLNASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFA 198
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
+I G L YDY F F G +V+C CGA C +LG K +
Sbjct: 199 LENIPIGTELAYDYNFEWFGGAKVRCLCGALKCSEFLGAKSR 240
>gi|170097553|ref|XP_001879996.1| histone methyltransferase [Laccaria bicolor S238N-H82]
gi|164645399|gb|EDR09647.1| histone methyltransferase [Laccaria bicolor S238N-H82]
Length = 880
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 96/160 (60%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + I IV TE G+G+ A E + + FI EY+G+V++ ++R
Sbjct: 204 CRCRSYCQNQRFQKREYANIDIVLTEKKGFGLRAEEDLPRDTFIYEYVGDVVNPTSFKKR 263
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ D G+Q+FY ++KD IDAT G RF NHSC PNC + KW + R+G+FA
Sbjct: 264 MRDYALEGIQHFYFMMLQKDEFIDATKGGGIGRFANHSCSPNCYVAKWTIGHHVRMGIFA 323
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R IK E LT++Y ++G + + CYCG +C G++G K
Sbjct: 324 KRDIKQYEELTFNYNVDRYGHQAQICYCGEPNCVGFIGGK 363
>gi|317037088|ref|XP_001398411.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Aspergillus niger CBS 513.88]
Length = 853
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 109/190 (57%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C C C N+ F++ + + ++KTE G+G+ A + +FI EY+GEVI++
Sbjct: 197 IECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFIFEYVGEVINE 256
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 257 GQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKL 316
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R I+AGE L ++Y ++G + + CYCG +C G++G G+ + +K
Sbjct: 317 RMGIFAERHIQAGEELVFNYNVDRYGADPQPCYCGEPNCTGFIG-----GRTQTERATKL 371
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 372 SNATIEALGI 381
>gi|402594990|gb|EJW88916.1| SET domain-containing protein [Wuchereria bancrofti]
Length = 1425
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/167 (40%), Positives = 99/167 (59%), Gaps = 6/167 (3%)
Query: 253 YISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
Y C C C NR F ++ K+++ GWG+ AAEP+ G FIIEY+GEVID
Sbjct: 581 YTECGSRCPSGAGCANRRFHNKQYAKVEVFNAGIKGWGLRAAEPLEPGRFIIEYVGEVID 640
Query: 311 --DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R + + V ++ M ++ IDAT KGN SRF+NHSCDPNC +KW V
Sbjct: 641 AEEMIRRGRRYGKDPKHVHHYLMA-LKNGAVIDATAKGNVSRFINHSCDPNCESQKWTVN 699
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
+ RVG F + I GE + +DY+ ++G + +C+CGA++C+G +G
Sbjct: 700 RQLRVGFFVIKPIALGEEIVFDYQLERYGRKAQRCFCGAANCRGRIG 746
>gi|356576073|ref|XP_003556159.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
max]
Length = 480
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 72/162 (44%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K+ KTE GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 66 CPCGVLCKNQKFQKCEYAKTKLFKTEGRGWGLLADEDIKAGQFVIEYCGEVISWKEAKRR 125
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ ++ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 126 SQAYENQGLKDAFIICLNASESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFA 185
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
I G L YDY F F G +V+C CGA C G+LG K +
Sbjct: 186 KHDIPIGNELAYDYNFEWFGGAKVRCLCGALKCSGFLGAKSR 227
>gi|170587756|ref|XP_001898640.1| SET domain containing protein [Brugia malayi]
gi|158593910|gb|EDP32504.1| SET domain containing protein [Brugia malayi]
Length = 1449
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/167 (40%), Positives = 99/167 (59%), Gaps = 6/167 (3%)
Query: 253 YISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
Y C C C NR F ++ K+++ GWG+ AAEP+ G FIIEY+GEVID
Sbjct: 608 YTECGSRCPSGAGCANRRFHNKQYAKVEVFNAGIKGWGLRAAEPLEPGRFIIEYVGEVID 667
Query: 311 --DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R + + V ++ M ++ IDAT KGN SRF+NHSCDPNC +KW V
Sbjct: 668 AEEMIRRGRRYGKDPKHVHHYLMA-LKNGAVIDATAKGNVSRFINHSCDPNCESQKWTVN 726
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
+ RVG F + I GE + +DY+ ++G + +C+CGA++C+G +G
Sbjct: 727 RQLRVGFFVIKPIALGEEIVFDYQLERYGRKAQRCFCGAANCRGRIG 773
>gi|189534132|ref|XP_001923087.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Danio rerio]
Length = 773
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/164 (42%), Positives = 96/164 (58%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + +++ +T GWG+ I KG FI EY+GEVID+ C
Sbjct: 376 QVCPAGERCQNQCFIKRQYCQVETFRTLSRGWGLRCVHDIKKGGFISEYVGEVIDEEECR 435
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN +RF+NH C PNC +KW V G+TRVG+
Sbjct: 436 ARIKHAQENNIGNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVNGDTRVGL 495
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
F+ I AG LT++Y G + C CGAS+C G+LG + K
Sbjct: 496 FSLTDIPAGTELTFNYNLECLGNGKTVCKCGASNCSGFLGVRPK 539
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 38/87 (43%), Gaps = 17/87 (19%)
Query: 116 CRACHRFIYHGEEVF-CSVRGCGGVYHFICVKERL-GISNPRNFKCPQHACFIC------ 167
C AC R GE+V C V GCG YH C + R F+CP HAC C
Sbjct: 46 CFACKRL---GEDVRRCMVPGCGKFYHGECAASHAPTVPLNRAFRCPLHACLSCFILNPA 102
Query: 168 ----RQRLQWRCVRCTIASH--DKCAP 188
+ RC+RC +A H D C P
Sbjct: 103 NPSVAKGQLTRCIRCPVAYHSSDFCIP 129
>gi|346321276|gb|EGX90876.1| histone H3 lysine 36 (K36) methyltransferase [Cordyceps militaris
CM01]
Length = 905
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 108/167 (64%), Gaps = 4/167 (2%)
Query: 254 ISCSK-ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS+ A +C C N+ F++++ + ++KTE G+G+ ++ +FI EYIGEVI+
Sbjct: 155 MECSETAGNCGGGCQNQRFQRKQYADVCVIKTEKKGYGLRTDAELSAHDFIFEYIGEVIN 214
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+A +R+ +G+++FY + K+ +DAT KGN RF NHSC PNC ++KW + +
Sbjct: 215 EATFRRRMLQYDQQGIKHFYFMSLSKNEFVDATRKGNLGRFCNHSCVPNCYVDKWVIGDK 274
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I+AGE L ++Y ++G + + CYCG +C G++G K
Sbjct: 275 LRMGIFALRTIRAGEELVFNYNVDRYGADPQPCYCGEPTCVGFIGGK 321
>gi|347972366|ref|XP_316738.5| AGAP004656-PA [Anopheles gambiae str. PEST]
gi|333469400|gb|EAA11974.5| AGAP004656-PA [Anopheles gambiae str. PEST]
Length = 1259
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 75/182 (41%), Positives = 106/182 (58%), Gaps = 6/182 (3%)
Query: 257 SKACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVIDDALC 314
SK C E+C+N+ F K + F G+G+ A E + G+F+IEY+GEVI+
Sbjct: 1002 SKTCPAKESCSNQRFTKRIYPALEVRFFSDKGFGLVALEDLKSGQFVIEYVGEVINSEEF 1061
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV- 373
++R+ M+ N+Y + D TIDA KGN SRF+NHSC+PNC +KW + GETRV
Sbjct: 1062 DRRVMMMQAAKETNYYFLTVEPDLTIDAGPKGNVSRFINHSCEPNCETQKWTI-GETRVI 1120
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK-RKIGKLELCWGSKRK 431
G+FA + I AGE LT++Y G + C CGA C G++G K R K ++ K +
Sbjct: 1121 GLFAIKDINAGEELTFNYNLESLGNNKRVCLCGAGKCSGFIGEKYRPPNKKDIVISMKSE 1180
Query: 432 RS 433
RS
Sbjct: 1181 RS 1182
>gi|47221386|emb|CAF97304.1| unnamed protein product [Tetraodon nigroviridis]
Length = 722
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 72/177 (40%), Positives = 100/177 (56%), Gaps = 3/177 (1%)
Query: 258 KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F R+ +++I +T GWG+ I KG F+ EY+GEVID+ C
Sbjct: 424 QVCPAGERCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGHFVSEYVGEVIDEEECR 483
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + + NFYM + KD IDA KGN +RF+NH C PNC +KW V G+TRVG+
Sbjct: 484 ARIRHAQENDICNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVSGDTRVGL 543
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIGKLELCWGSKRK 431
FA + AG LT++Y G + C CGA +C G+LG + K G KR+
Sbjct: 544 FALVDVPAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNNPPSEDKGRKRR 600
>gi|147846734|emb|CAN80636.1| hypothetical protein VITISV_017995 [Vitis vinifera]
Length = 278
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 99/167 (59%), Gaps = 4/167 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C+ + C CS C N+ F+K + K K+ + E GWG+ A E I GEF++EY GEVI
Sbjct: 90 IECTPRYCPCSIHCKNQRFQKREYAKTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVIS 149
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
R +G+++ Y+ + IDAT KGN +RF+NHSC PNC KW V GE
Sbjct: 150 RTEARGRSQVYVSQGLKDVYIIPLNARECIDATKKGNLARFINHSCQPNCETMKWSVLGE 209
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFG-PEVKCYCGASSCQGYLGTK 416
RVG+FA R+I G LTY Y F + +V+C CGA+ C G+LG K
Sbjct: 210 DRVGIFALRNISVGTELTYSYNFEWYSXAKVRCLCGATRCSGFLGGK 256
>gi|255078218|ref|XP_002502689.1| set domain protein [Micromonas sp. RCC299]
gi|226517954|gb|ACO63947.1| set domain protein [Micromonas sp. RCC299]
Length = 1065
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 99/168 (58%), Gaps = 4/168 (2%)
Query: 254 ISCSKA-CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ C A C C + C N+ F+K+ K+ I +T G G+ + + KG+FIIEYIGEV+
Sbjct: 148 VECDPAFCPCGDGCQNQKFQKKDYAKLDIQRTGRKGHGLFTKQALKKGQFIIEYIGEVLH 207
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ R G +++Y + TIDA +GN RFLNHSCDPNC +KW V GE
Sbjct: 208 EDEYRSRKARYDDEGRRHYYFMTLSSSETIDAAERGNAGRFLNHSCDPNCETQKWMVNGE 267
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKR 417
+G++A I AG+ LT+DY F ++G +KC+CG S C G++G +
Sbjct: 268 LCIGIYALTDIDAGDELTFDYNFERYGDNPIKCFCGTSRCGGWIGAPK 315
>gi|340718068|ref|XP_003397494.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Bombus
terrestris]
Length = 1238
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 73/170 (42%), Positives = 100/170 (58%), Gaps = 4/170 (2%)
Query: 251 VQYISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ + CS C C N+ F R+ ++ T GWG+ + E I G+F+IEY+GE
Sbjct: 816 ILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGRGWGLRSLEHIKAGQFVIEYVGE 875
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A ++RL K +NFY I + IDA KGN SRF+NHSC PNC +KW V
Sbjct: 876 VIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPKGNLSRFMNHSCSPNCETQKWTV 935
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+TR+G+FA I+ GE LT++Y G K C CGAS+C G++G K
Sbjct: 936 NGDTRIGLFALCDIECGEELTFNYNLACDGETRKPCLCGASNCSGFIGLK 985
>gi|385301625|gb|EIF45804.1| putative histone lysine methyltransferase set2p [Dekkera
bruxellensis AWRI1499]
Length = 695
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 97/174 (55%), Gaps = 3/174 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++ + I I TE G+G+ A I FIIEY GEVID+ + R
Sbjct: 101 CSCGXRCQNQRFQRNQMADISIFLTEHKGYGMRANCDIPANTFIIEYKGEVIDEQAYKIR 160
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
G+++FY I+ IDAT KG+ RF NHSCDPN EKW V R+G+FA
Sbjct: 161 KEAYAKEGIKHFYFMMIQDGQIIDATKKGSLGRFCNHSCDPNAYXEKWVVNKRYRMGIFA 220
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R I GE +T+DY ++G E KCYCGA +C GYLG K + L L + R
Sbjct: 221 KRKIPKGEEITFDYNVDRYGAEPQKCYCGAKNCVGYLGGKTQTEVLRLLTYAAR 274
>gi|357443507|ref|XP_003592031.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
gi|355481079|gb|AES62282.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
Length = 479
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K+ KTE GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 67 CPCDIHCKNQKFQKCEYAKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRR 126
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ ++ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 127 SHTYEIQGLKDAFIISLNASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFA 186
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
I G L YDY F F G +V+C CGA C +LG K +
Sbjct: 187 LEDIPIGTELAYDYNFEWFGGAKVRCLCGALKCSEFLGAKSR 228
>gi|164655887|ref|XP_001729072.1| hypothetical protein MGL_3860 [Malassezia globosa CBS 7966]
gi|159102961|gb|EDP41858.1| hypothetical protein MGL_3860 [Malassezia globosa CBS 7966]
Length = 707
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 96/158 (60%), Gaps = 3/158 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++ K+ IVKT G+G+ A + + EF+ EYIGEVI+ +R
Sbjct: 29 CRCGVHCANQRFQQHAYAKVDIVKTPDKGYGLRAQTDLGRDEFVYEYIGEVINHDTFMRR 88
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ K +++FY +++D IDAT +G SRF+NHSC PNC + KW V R+G+FA
Sbjct: 89 MAQYKTEHIEHFYFMMLQRDEYIDATKRGGRSRFINHSCSPNCYVSKWHVGRHVRMGIFA 148
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG 414
R I AGE LT++Y ++G + + CYCG +C G LG
Sbjct: 149 KRRILAGEELTFNYNVDRYGNDPQPCYCGEPNCVGTLG 186
>gi|391338508|ref|XP_003743600.1| PREDICTED: probable histone-lysine N-methyltransferase Mes-4-like
[Metaseiulus occidentalis]
Length = 890
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 100/173 (57%), Gaps = 4/173 (2%)
Query: 253 YISCSKACHCSETCNNRPF--RKEKKIKIVKT-EFCGWGVEAAEPINKGEFIIEYIGEVI 309
Y+ CS C E C NR F R + K+ T + GWG++A E + K +F+IEYIGE+I
Sbjct: 714 YVECSDDCAQREKCKNRRFQERAYRATKVFLTPDGRGWGLQAVEDLKKDDFVIEYIGEII 773
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D C++RL + G N Y E ++ IDA GN +RF+NHSC+PN EKW V
Sbjct: 774 DSKECDRRLSQLHAIGDPNVYFLEADANYIIDAGPAGNEARFINHSCNPNLRTEKWTVND 833
Query: 370 ETRVGVFAARSIKAGEPLTYDYRF-VQFGPEVKCYCGASSCQGYLGTKRKIGK 421
+ VG+FA R+I AGE LT++Y + E C C AS+C G LG RK K
Sbjct: 834 KLCVGLFAKRNIAAGEELTFNYNMQCRDVNENICLCNASNCVGTLGISRKQSK 886
>gi|320583119|gb|EFW97335.1| Histone methyltransferase [Ogataea parapolymorpha DL-1]
Length = 942
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 96/160 (60%), Gaps = 3/160 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F+ + + I KTE G+G+ A E I FI+EY GE++D+ L ++R
Sbjct: 100 CGDKCQNQRFQNNEIAPVSIFKTEHKGYGMRADEDIYPHTFIVEYKGEIVDNELYKERKE 159
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
G+++FY I+ + IDAT K + RF NHSCDPN +EKW V R+G+FA R
Sbjct: 160 QYSNEGIKHFYFMMIQDNEIIDATKKASIGRFCNHSCDPNAYVEKWVVNKRFRMGIFAKR 219
Query: 380 SIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
I GE +T+DY ++G E KCYCGA +C G +G K +
Sbjct: 220 KIIKGEEITFDYNVDRYGAEPQKCYCGAPNCLGVMGGKTQ 259
>gi|145533713|ref|XP_001452601.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420300|emb|CAK85204.1| unnamed protein product [Paramecium tetraurelia]
Length = 1137
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 101/162 (62%), Gaps = 7/162 (4%)
Query: 260 CHCSETCNNRPFRKEKK--IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV--IDDALCE 315
C C+E C NR F+K + + +K G G+ A + I KG+FI++YIGE+ I+ AL
Sbjct: 47 CPCAELCKNRRFQKHEDACVYPLKCGGKGMGLYAGDRIQKGQFIMQYIGEIFQINSALGR 106
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R+ + Y Y+ ++ ID T KGN +RF+NHSC+PNCI EKW V GE +G+
Sbjct: 107 RRVQE--YSKSTCTYLMKLNNQEVIDPTTKGNLARFINHSCEPNCITEKWNVLGEVCIGI 164
Query: 376 FAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTK 416
FA+R I E LT+DY+F F P KC CGA+ C+GYLG K
Sbjct: 165 FASRDINEDEELTFDYQFDVFHTPLTKCLCGAAKCKGYLGLK 206
>gi|46122361|ref|XP_385734.1| hypothetical protein FG05558.1 [Gibberella zeae PH-1]
Length = 1051
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 104/164 (63%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
++ +C+ C N+ F++++ + ++KTE G+G+ A + +F+ EYIGEVI++
Sbjct: 285 SAEGGNCAGGCQNQRFQRKQYANVSVIKTEKKGFGLRADSDLQPNDFVFEYIGEVINEPT 344
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R+ G+++FY + K +DAT KGN+ RF NHSC+PNC ++KW V + R+
Sbjct: 345 FRRRMIQYDEEGIKHFYFMSLNKSEFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDKLRM 404
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+F +R I++GE L ++Y ++G + + CYCG +C G++G K
Sbjct: 405 GIFTSRKIQSGEELVFNYNVDRYGADPQPCYCGEPNCVGFIGGK 448
>gi|401625193|gb|EJS43214.1| set2p [Saccharomyces arboricola H-6]
Length = 736
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 97/158 (61%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT++ G+GV A E I +FI EY GEVI++ +RL
Sbjct: 105 CGNDCQNQRFQKKQYSPIAIFKTKYKGYGVRAEEDIEANQFIYEYKGEVIEEMEFRERLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+G+ R+G+FA +
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKGKLRMGIFAQK 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|242017983|ref|XP_002429463.1| set domain protein, putative [Pediculus humanus corporis]
gi|212514395|gb|EEB16725.1| set domain protein, putative [Pediculus humanus corporis]
Length = 2269
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 70/167 (41%), Positives = 97/167 (58%), Gaps = 3/167 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+ C N+ F K + ++ +TE GWG+ + I KG F+IEY+GEV+DD ++R
Sbjct: 1682 CNAGTRCKNQSFEKREYPPLEPYRTERRGWGLRSTVDIPKGSFVIEYVGEVVDDEEFKRR 1741
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + N+Y I KD IDA KGN +RF+NHSC PNC +KW V +TRVG+FA
Sbjct: 1742 MKRKQETMDNNYYFLTIDKDRIIDAGPKGNLARFMNHSCSPNCETQKWTVNMDTRVGLFA 1801
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLE 423
I AG LT++Y G E K C+C +C G++G K K K E
Sbjct: 1802 LEDIPAGTELTFNYNLDCVGNEKKACHCEEENCSGFIGVKVKNDKTE 1848
>gi|168044865|ref|XP_001774900.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673794|gb|EDQ60312.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1980
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/161 (41%), Positives = 96/161 (59%), Gaps = 4/161 (2%)
Query: 258 KACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C C C N+ F+K +++ + G G+ A E I +G FIIEY+GEV+D E
Sbjct: 884 QQCPCGPFCTNQQFQKRLYANVELFRCGKKGHGLRALENIPRGTFIIEYVGEVLDMPSFE 943
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R + ++FY + + IDA KGN RF+NHSC+PNC EKW V+GE +G+
Sbjct: 944 ARQKEYSMNSQKHFYFMTLSANEIIDACSKGNLGRFINHSCEPNCQTEKWMVDGEVCIGL 1003
Query: 376 FAARSIKAGEPLTYDYRFVQFG--PEVKCYCGASSCQGYLG 414
FA R +K GE +T+DY FV+ G KC CGA+ C+G++G
Sbjct: 1004 FAIRDVKKGEEVTFDYNFVRVGGADAKKCECGANKCRGFIG 1044
>gi|195155589|ref|XP_002018686.1| GL25818 [Drosophila persimilis]
gi|194114839|gb|EDW36882.1| GL25818 [Drosophila persimilis]
Length = 476
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 98/164 (59%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C N+ F+ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 148 IECGPLCSNGERCTNKRFQLNQCWPCRVFRTEKKGCGISAELAIPAGEFIMEYVGEVIDS 207
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A E+R +++Y +R IDAT GN SRF+NHSCDPN +KW V GE
Sbjct: 208 AEFERRQHRYAEGRNRHYYFMALRGGAIIDATMGGNISRFMNHSCDPNAETQKWTVNGEL 267
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG 414
R+G+F+ +SI GE +T+DYR+ + + CYC A++C+G+LG
Sbjct: 268 RIGLFSVKSIMPGEEITFDYRYQPYDRIAQPCYCEAANCRGWLG 311
>gi|357443463|ref|XP_003592009.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
gi|355481057|gb|AES62260.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
Length = 616
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 96/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K+ KTE GWG+ A E I G+F+IEY GEVI ++R
Sbjct: 204 CPCDIHCKNQKFQKCEYAKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRR 263
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ ++ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 264 SHTYEIQGLKDAFIISLNASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFA 323
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
I G L YDY F F G +V+C CGA C +LG K +
Sbjct: 324 LEDIPIGTELAYDYNFEWFGGAKVRCLCGALKCSEFLGAKSR 365
>gi|408394323|gb|EKJ73531.1| hypothetical protein FPSE_06149 [Fusarium pseudograminearum CS3096]
Length = 911
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 106/167 (63%), Gaps = 4/167 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS + +C+ C N+ F++++ + ++KTE G+G+ A + +F+ EYIGEVI+
Sbjct: 142 MECSAEGGNCAGGCQNQRFQRKQYANVSVIKTEKKGFGLRADSDLQPNDFVFEYIGEVIN 201
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ G+++FY + K +DAT KGN+ RF NHSC+PNC ++KW V +
Sbjct: 202 EPTFRRRMIQYDEEGIKHFYFMSLNKSEFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDK 261
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F +R I++GE L ++Y ++G + + CYCG +C G++G K
Sbjct: 262 LRMGIFTSRKIQSGEELVFNYNVDRYGADPQPCYCGEPNCVGFIGGK 308
>gi|410516927|sp|Q4IB50.2|SET2_GIBZE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
Length = 911
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 106/167 (63%), Gaps = 4/167 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS + +C+ C N+ F++++ + ++KTE G+G+ A + +F+ EYIGEVI+
Sbjct: 142 MECSAEGGNCAGGCQNQRFQRKQYANVSVIKTEKKGFGLRADSDLQPNDFVFEYIGEVIN 201
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ G+++FY + K +DAT KGN+ RF NHSC+PNC ++KW V +
Sbjct: 202 EPTFRRRMIQYDEEGIKHFYFMSLNKSEFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDK 261
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F +R I++GE L ++Y ++G + + CYCG +C G++G K
Sbjct: 262 LRMGIFTSRKIQSGEELVFNYNVDRYGADPQPCYCGEPNCVGFIGGK 308
>gi|313226807|emb|CBY21952.1| unnamed protein product [Oikopleura dioica]
Length = 216
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 96/163 (58%), Gaps = 2/163 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKI--KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ CS C C N+ F+K K KI + G G++A I +G+FIIEY+GEV+
Sbjct: 27 VECSSRCRTGNRCTNKNFQKRKYAPAKIFWAKGKGHGLKATAFIPRGKFIIEYLGEVVSA 86
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
++R + G Q+ Y E+ + TIDA KG SRF+NHSC+PN +KW V G
Sbjct: 87 KEFKKRSHEYARSGKQHHYFMELSRQATIDAYHKGAISRFINHSCEPNSETQKWTVNGLL 146
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
R+G FA R I+ E +T+DY+F+ FG KC CGA SC+G +G
Sbjct: 147 RIGFFAIRDIQPEEEITFDYQFIHFGQGQKCLCGAPSCRGIIG 189
>gi|156847005|ref|XP_001646388.1| hypothetical protein Kpol_2001p35 [Vanderwaltozyma polyspora DSM
70294]
gi|156117064|gb|EDO18530.1| hypothetical protein Kpol_2001p35 [Vanderwaltozyma polyspora DSM
70294]
Length = 693
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 97/158 (61%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F+K++ I + KT+ G+GV A + I +FI EY GEVI++ +RL
Sbjct: 99 CGDNCQNQRFQKKEYADISVFKTKMKGYGVRANKDIEANDFIYEYKGEVIEEDEFRERLI 158
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D RG ++FY ++ IDAT KG+ +RF NHSC+PN + KW V G+ ++G+FA R
Sbjct: 159 DYDERGFKHFYFMMLQNAEFIDATIKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFAKR 218
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G KCYC +C G+LG K
Sbjct: 219 DIYKGEEITFDYNVDRYGATAQKCYCEEPNCIGFLGGK 256
>gi|326438002|gb|EGD83572.1| hypothetical protein PTSG_04178 [Salpingoeca sp. ATCC 50818]
Length = 1398
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 68/166 (40%), Positives = 95/166 (57%), Gaps = 4/166 (2%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE---KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ Y C C C C N+ F++ IK T G+GV A I KG F+IEY GE
Sbjct: 355 LMYSECGPDCPCGRQCTNKRFQQRAWCTAIKRAPTPGKGYGVFATAYIPKGTFVIEYTGE 414
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
++ + QR + YR ++F+ + + IDA G+ +RF+NHSCDPNC +EKW V
Sbjct: 415 IMTSSAFTQRANTL-YRARKHFHCLNLDRGLVIDAGQAGSEARFINHSCDPNCHIEKWNV 473
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYL 413
G R GVFA+R IK E L+YDY F F ++ C CGA++C+G +
Sbjct: 474 NGHWRAGVFASRDIKGDEELSYDYNFHNFNEKLVCRCGAANCRGEI 519
>gi|346971461|gb|EGY14913.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
Length = 913
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 63/161 (39%), Positives = 99/161 (61%), Gaps = 3/161 (1%)
Query: 259 ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
A +C + C N+ F+++ + ++KTE G+G+ A + +FI EYIGEVI++ +
Sbjct: 151 AGNCGDGCQNQRFQRKAYADVSVIKTEKKGFGLRANVDMQANDFIFEYIGEVINEPTFRR 210
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+F
Sbjct: 211 RMVQYDDEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIF 270
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I AGE L ++Y ++G + + CYCG +C GY+G K
Sbjct: 271 TLRGIAAGEELVFNYNVDRYGADPQPCYCGEPNCTGYIGGK 311
>gi|195108992|ref|XP_001999076.1| GI23270 [Drosophila mojavensis]
gi|193915670|gb|EDW14537.1| GI23270 [Drosophila mojavensis]
Length = 1433
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 95/163 (58%), Gaps = 4/163 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
CH C N+ F RK ++++V G+G+ EPI +G+FIIEY+GEVI+ A ++R
Sbjct: 1194 CHAGTRCENQLFELRKSPRLEVVYMNERGFGLVCREPIKEGDFIIEYVGEVINHAEFQRR 1253
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +NFY + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1254 MAQKTKNRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTVNCNNRVGLFA 1313
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
+ I LT++Y + K C+CGA C G +G K K
Sbjct: 1314 IKDIPENTELTFNYLWDDLMNNGKKACFCGAKRCSGEIGGKLK 1356
>gi|47216786|emb|CAG03790.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1443
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 102/200 (51%), Gaps = 32/200 (16%)
Query: 251 VQYISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWG---------VEAAEP----- 294
+QY + C E C N+ F R KI+KT GWG V A P
Sbjct: 1040 LQYECHPQVCPSGERCCNQDFTQRLYPDTKIIKTPGKGWGLITLRDIKKVSARRPGSPVP 1099
Query: 295 ---------------INKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFT 339
+ +GEF+ EYIGE+ID+ C R+ + + NFYM I KD
Sbjct: 1100 VFLPVGRRVGTSWSDVTQGEFVNEYIGELIDEEECRARIKYAQENNITNFYMLTIDKDRI 1159
Query: 340 IDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPE 399
IDA KGN+SRF+NHSC PNC +KW V G+TRVG+FA I AG LT++Y G E
Sbjct: 1160 IDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNE 1219
Query: 400 -VKCYCGASSCQGYLGTKRK 418
C CGA +C G+LG + K
Sbjct: 1220 KTVCCCGAPNCSGFLGDRPK 1239
Score = 38.1 bits (87), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 31/65 (47%), Gaps = 13/65 (20%)
Query: 131 CSVRGCGGVYHFICVK-ERLGISNPRNFKCPQHACFIC-----------RQRLQWRCVRC 178
C V CG YH C++ L + + + F+CP H C C + RL RC+RC
Sbjct: 734 CHVPHCGKFYHEACIRLNPLTVFDNKGFRCPLHTCLGCCYGNRTKPKSTKGRL-MRCLRC 792
Query: 179 TIASH 183
+A H
Sbjct: 793 PVAYH 797
>gi|71013616|ref|XP_758647.1| hypothetical protein UM02500.1 [Ustilago maydis 521]
gi|74702635|sp|Q4PBL3.1|SET2_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|46098305|gb|EAK83538.1| hypothetical protein UM02500.1 [Ustilago maydis 521]
Length = 972
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 103/167 (61%), Gaps = 4/167 (2%)
Query: 254 ISCSKA-CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I CS + C + C N+ F + + + IV+TE G+G+ A + I K FI EY+GEV++
Sbjct: 222 IECSASKCRWGKQCRNQRFHRRQYVDVDIVQTEKKGFGLRACQDIPKETFIYEYVGEVMN 281
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
QR+ + G+++FY ++ + +DAT KG RF+NHSC+PNC + KWQV
Sbjct: 282 QTTFLQRMQQYRIEGIRHFYFMMLQPNEYLDATKKGGKGRFINHSCNPNCAVSKWQVGKH 341
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I+ GE LT++Y ++G + + C+CG +C G LG K
Sbjct: 342 LRMGIFAKRNIQKGEELTFNYNVDRYGNDAQECFCGEPNCVGTLGGK 388
>gi|195448204|ref|XP_002071555.1| GK25076 [Drosophila willistoni]
gi|194167640|gb|EDW82541.1| GK25076 [Drosophila willistoni]
Length = 2217
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 101/164 (61%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F++ + ++ +TE G G+ A I GEFI+EY+GEVID
Sbjct: 1135 IECGPLCTNGDRCTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDA 1194
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1195 EEFERRQHLYSKDRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGEL 1254
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ ++I GE +T+DY++ ++G + +CYC A +C+G++G
Sbjct: 1255 RIGFFSVKTILPGEEITFDYQYQRYGRDAQRCYCEAINCRGWIG 1298
>gi|198451130|ref|XP_001358254.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
gi|198131348|gb|EAL27392.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
Length = 1541
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 97/163 (59%), Gaps = 4/163 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C + C NR F RK ++ +V G+G+ EPI +G+FIIEY+GEVI+ ++R
Sbjct: 1327 CRCGDRCENRMFETRKSPRMDVVYMNARGFGLVCREPIAEGDFIIEYVGEVINQEEFQRR 1386
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +NFY + K+F IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1387 MLRKQKDRDENFYFLGVEKEFIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTNRVGLFA 1446
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
+ I A LT++Y + + K CYCG+ C G +G K K
Sbjct: 1447 IQDIPAETELTFNYLWDDLLNDKKKACYCGSERCSGEIGGKLK 1489
>gi|342209884|gb|AEL16989.1| ASH1-like protein [Phaseolus vulgaris]
Length = 481
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 97/162 (59%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K+ KTE GWG+ A E + G+F+IEY GEVI ++R
Sbjct: 66 CPCDILCKNQKFQKCEYAKTKLFKTEGRGWGLLAGEDLKAGQFVIEYCGEVISWKEAKRR 125
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +G+++ ++ + +IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA
Sbjct: 126 SQAYENQGLKDAFIICLNASESIDATRKGSLARFINHSCRPNCETRKWNVLGEIRVGIFA 185
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
+ G L YDY F F G +V+C CGA C G+LG K +
Sbjct: 186 KHDVPIGTELAYDYNFEWFGGAKVRCLCGALKCSGFLGAKSR 227
>gi|388858360|emb|CCF48089.1| related to SET2-Histone methyltransferase [Ustilago hordei]
Length = 985
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 104/167 (62%), Gaps = 4/167 (2%)
Query: 254 ISCSKA-CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I CS + C + C N+ F + + + IV+TE G+G+ AA+ I K F+ EY+GEV++
Sbjct: 196 IECSSSKCRWGKLCRNQRFHRRQYVNVDIVQTEKKGFGLRAAQDIPKETFVYEYVGEVMN 255
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+R+ + G+++FY ++ + +DAT KG RF+NHSC+PNC + KWQV
Sbjct: 256 QPTFLERMQQYRQEGIRHFYFMMLQPNEYLDATKKGGKGRFINHSCNPNCSVSKWQVGKH 315
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I+ GE LT++Y ++G + + C+CG +C G LG K
Sbjct: 316 LRMGIFAKRNIQKGEELTFNYNVDRYGNDAQECFCGEPNCVGTLGGK 362
>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 2 [Bombus impatiens]
Length = 1239
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 72/170 (42%), Positives = 99/170 (58%), Gaps = 4/170 (2%)
Query: 251 VQYISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ + CS C C N+ F R+ ++ T GWG+ + E I G+F+IEY+GE
Sbjct: 817 ILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGRGWGLRSLEHIKAGQFVIEYVGE 876
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A ++RL K +NFY I + IDA KGN SRF+NHSC PNC +KW V
Sbjct: 877 VIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPKGNLSRFMNHSCSPNCETQKWTV 936
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+TR+G+FA I+ GE LT++Y G K C CGA +C G++G K
Sbjct: 937 NGDTRIGLFALCDIERGEELTFNYNLACDGETRKPCLCGAPNCSGFIGLK 986
>gi|255716246|ref|XP_002554404.1| KLTH0F04510p [Lachancea thermotolerans]
gi|238935787|emb|CAR23967.1| KLTH0F04510p [Lachancea thermotolerans CBS 6340]
Length = 686
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 96/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K + I + +T+ G+GV A + I EFI EYIGEVI + R+
Sbjct: 97 CGNDCRNQRFQKREYADIAVFQTQKKGYGVRAQKDIEAHEFIYEYIGEVIAEDEFRDRMI 156
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D +RG+++FY ++ IDAT KG+ +RF NHSC+PN + KW V G+ R+G+FA R
Sbjct: 157 DYDHRGLKHFYFMMLQTGEFIDATVKGSLARFCNHSCNPNAYVNKWAVAGKLRMGIFANR 216
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE +T+DY ++G + CYC +C G+LG K
Sbjct: 217 KILKGEEITFDYNVDRYGATAQPCYCDEPNCLGFLGGK 254
>gi|336374580|gb|EGO02917.1| hypothetical protein SERLA73DRAFT_47203 [Serpula lacrymans var.
lacrymans S7.3]
Length = 776
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 99/160 (61%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++ + I IV+TE G+G+ AA +++ FI EY+G+V+ +R
Sbjct: 110 CRCRSFCQNQRFQRREYANINIVQTEKKGYGLRAAADLHRDAFIYEYVGDVVSHPSFVKR 169
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +G+++FY ++KD IDAT +G RF NHSC+PNC + KW + R+G+FA
Sbjct: 170 MREYGEQGIRHFYFMMLQKDEFIDATKRGGMGRFANHSCNPNCYVAKWTIGNHVRMGIFA 229
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+I E LT++Y ++G E + CYCG + C G++G K
Sbjct: 230 NRNIVKDEELTFNYNVDRYGHEAQPCYCGEAKCVGFIGGK 269
>gi|432881031|ref|XP_004073771.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Oryzias
latipes]
Length = 2798
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 115/210 (54%), Gaps = 24/210 (11%)
Query: 251 VQYISCSKA-CHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIG 306
+ Y CS + C C E C+N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+G
Sbjct: 1926 MSYAECSPSTCPCGEQCDNQHIQRHEWVQCLERFRAEGKGWGIRTKEPLRAGQFIIEYLG 1985
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ + R+ + +Y Y + ID+ GN +RF+NHSCDPNC ++KW
Sbjct: 1986 EVVSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWS 2044
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK-------- 416
V G R+G+FA + + +G LTYDY F F E + C CG+ SC+G +G K
Sbjct: 2045 VNGVYRIGLFALKDVSSGTELTYDYNFHSFNTEEQQACKCGSESCRGIIGGKSQRINGLP 2104
Query: 417 ------RKIGKLELCWGSK---RKRSSTAC 437
R++G+L+ SK +KR S +C
Sbjct: 2105 GKTGGARRLGRLKEKRKSKHQFKKRVSGSC 2134
>gi|168009924|ref|XP_001757655.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691349|gb|EDQ77712.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1715
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 68/161 (42%), Positives = 95/161 (59%), Gaps = 4/161 (2%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C C C N+ F+K +++ + G G+ A E I +G FIIEY+GEV+D E
Sbjct: 722 QQCPCGPLCTNQQFQKRTYANVELFRCGKKGHGLRALENIPRGTFIIEYVGEVLDMPSFE 781
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R + ++FY + + IDA KGN RF+NHSC+PNC EKW V+GE +G+
Sbjct: 782 ARQKEYSMNSQKHFYFMTLSANEIIDACNKGNLGRFINHSCEPNCQTEKWMVDGEVCIGL 841
Query: 376 FAARSIKAGEPLTYDYRFVQFG--PEVKCYCGASSCQGYLG 414
FA R IK E +T+DY FV+ G KC CGAS C+G++G
Sbjct: 842 FAIRDIKEREEVTFDYNFVRVGGADAKKCECGASKCRGFIG 882
>gi|366997071|ref|XP_003678298.1| hypothetical protein NCAS_0I02880 [Naumovozyma castellii CBS 4309]
gi|342304169|emb|CCC71956.1| hypothetical protein NCAS_0I02880 [Naumovozyma castellii CBS 4309]
Length = 726
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 95/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K + I I KT+ G+GV A I EFI EY GEVID+ + R+
Sbjct: 91 CGNDCQNQRFQKSQYADISIFKTKMKGYGVRANADIETNEFIYEYTGEVIDEEIFRDRMI 150
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ + ++FY ++ IDAT KG+ +RF NHSC+PN + KW+V G+ R+G+FA+R
Sbjct: 151 EYDEKKFKHFYFMMLQNCEFIDATLKGSLARFCNHSCNPNAYVNKWEVAGKLRMGIFASR 210
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G KCYC +C G+LG K
Sbjct: 211 KIIKGEEITFDYNVDRYGATAQKCYCEEPNCIGFLGGK 248
>gi|407926722|gb|EKG19683.1| WW/Rsp5/WWP [Macrophomina phaseolina MS6]
Length = 972
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 108/190 (56%), Gaps = 8/190 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C C C C N+ F +++ + + +T+ G+G+ A + +F+ EYIGEVI++
Sbjct: 185 MECVGDCGCGPDCQNQRFLRKQYADVTVFRTDKKGYGLRANTDLKPNDFVYEYIGEVINE 244
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 245 RAFRGRMVQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCFVDKWVVGEKL 304
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRKIGKLELCWGSKR 430
R+G+FA R IKAGE L ++Y ++G + + CYCG ++C G++G GK + +K
Sbjct: 305 RMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCGEANCTGFIG-----GKTQTDNATKL 359
Query: 431 KRSSTACLAI 440
++ L I
Sbjct: 360 SHATIEALGI 369
>gi|68611259|emb|CAD41011.3| OSJNBa0042L16.10 [Oryza sativa Japonica Group]
Length = 1153
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/180 (40%), Positives = 100/180 (55%), Gaps = 21/180 (11%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPI------------------NKGE 299
C C C N+ F+K + ++VKTE GWG+ A E I G+
Sbjct: 249 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMVTEFTLILWSANVVKYIQAGQ 308
Query: 300 FIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPN 359
F++EY GEVI ++R + +G+ + Y+ + D +IDAT KG+ +RF+NHSC PN
Sbjct: 309 FVMEYCGEVISWKEAKRRSQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPN 368
Query: 360 CILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
C KW V GE RVG+FA + I G L+YDY F F G V+C CGA SC G+LG K +
Sbjct: 369 CETRKWNVLGEVRVGIFAKQDIPIGTELSYDYNFEWFGGAMVRCLCGAGSCSGFLGAKSR 428
>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 1 [Bombus impatiens]
Length = 1230
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/170 (42%), Positives = 99/170 (58%), Gaps = 4/170 (2%)
Query: 251 VQYISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ + CS C C N+ F R+ ++ T GWG+ + E I G+F+IEY+GE
Sbjct: 808 ILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGRGWGLRSLEHIKAGQFVIEYVGE 867
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A ++RL K +NFY I + IDA KGN SRF+NHSC PNC +KW V
Sbjct: 868 VIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPKGNLSRFMNHSCSPNCETQKWTV 927
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+TR+G+FA I+ GE LT++Y G K C CGA +C G++G K
Sbjct: 928 NGDTRIGLFALCDIERGEELTFNYNLACDGETRKPCLCGAPNCSGFIGLK 977
>gi|194907101|ref|XP_001981487.1| GG12082 [Drosophila erecta]
gi|190656125|gb|EDV53357.1| GG12082 [Drosophila erecta]
Length = 1441
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/187 (39%), Positives = 109/187 (58%), Gaps = 6/187 (3%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F RK ++++V G+G+ EPI +G+F+IEY+GEVI+ A ++R
Sbjct: 1221 CKAGSRCENQMFEQRKSPRLEVVYMNERGFGLVNREPIAEGDFVIEYVGEVINHAEFQRR 1280
Query: 318 LWDMKYRGV-QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
+ + K RG +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+F
Sbjct: 1281 M-EQKQRGRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGLF 1339
Query: 377 AARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSS 434
A + I A LT++Y + K C+CGA+ C G +G K K G ++ K +
Sbjct: 1340 AIKDIPANTELTFNYLWDDLMNNSKKACFCGATRCSGEIGGKLKDGAVKETNAQTAKGKA 1399
Query: 435 TACLAII 441
+A L I
Sbjct: 1400 SAKLKQI 1406
>gi|336387462|gb|EGO28607.1| hypothetical protein SERLADRAFT_359892 [Serpula lacrymans var.
lacrymans S7.9]
Length = 780
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 99/160 (61%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++ + I IV+TE G+G+ AA +++ FI EY+G+V+ +R
Sbjct: 110 CRCRSFCQNQRFQRREYANINIVQTEKKGYGLRAAADLHRDAFIYEYVGDVVSHPSFVKR 169
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +G+++FY ++KD IDAT +G RF NHSC+PNC + KW + R+G+FA
Sbjct: 170 MREYGEQGIRHFYFMMLQKDEFIDATKRGGMGRFANHSCNPNCYVAKWTIGNHVRMGIFA 229
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+I E LT++Y ++G E + CYCG + C G++G K
Sbjct: 230 NRNIVKDEELTFNYNVDRYGHEAQPCYCGEAKCVGFIGGK 269
>gi|302406220|ref|XP_003000946.1| histone-lysine N-methyltransferase [Verticillium albo-atrum
VaMs.102]
gi|261360204|gb|EEY22632.1| histone-lysine N-methyltransferase [Verticillium albo-atrum
VaMs.102]
Length = 936
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 98/159 (61%), Gaps = 3/159 (1%)
Query: 261 HCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRL 318
+C + C N+ F+++ + ++KTE G+G+ A + +FI EYIGEVI++ +R+
Sbjct: 153 NCGDGCQNQRFQRKAYADVSVIKTEKKGFGLRANVDMQANDFIFEYIGEVINEPTFRRRM 212
Query: 319 WDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA 378
G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+F
Sbjct: 213 VQYDDEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFTL 272
Query: 379 RSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I AGE L ++Y ++G + + CYCG +C GY+G K
Sbjct: 273 RGIAAGEELVFNYNVDRYGADPQPCYCGEPNCTGYIGGK 311
>gi|194746360|ref|XP_001955648.1| GF16138 [Drosophila ananassae]
gi|190628685|gb|EDV44209.1| GF16138 [Drosophila ananassae]
Length = 1460
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/183 (38%), Positives = 103/183 (56%), Gaps = 4/183 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C + C NR F R ++++V G+G+ EPI +G F+IEY+GEVI+ A ++R
Sbjct: 1226 CRAGDRCENRMFEQRISPRLEVVYMNERGFGLVCREPIAEGTFVIEYVGEVINHAEFQER 1285
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
L + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1286 LIQKQRNRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCVHRVGIFA 1345
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSST 435
+ I A LT++Y + K C+CGA+ C G +G K K G + K S+
Sbjct: 1346 IKDIPANTELTFNYLWDDLMNNSKKACFCGATRCSGEIGGKLKDGSAKETTSQSAKGKSS 1405
Query: 436 ACL 438
+ L
Sbjct: 1406 SKL 1408
>gi|389743273|gb|EIM84458.1| hypothetical protein STEHIDRAFT_82382 [Stereum hirsutum FP-91666
SS1]
Length = 1040
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 97/160 (60%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++ + I IV+TE G+G+ AA + K FI EY+G+VI +R
Sbjct: 281 CRCKSYCRNQRFQRLQYAPIDIVQTEKKGFGLRAAADLRKDTFIYEYVGDVISHPTFVKR 340
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY ++KD IDAT +G RF NHSC+PNC + KW V R+G+FA
Sbjct: 341 MREYAEEGIRHFYFMMLQKDEYIDATKRGGIGRFANHSCNPNCYVAKWTVGSRIRMGIFA 400
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I+A E LT++Y ++G + + CYCG C G++G K
Sbjct: 401 NRFIRANEELTFNYNVDRYGHDAQPCYCGEPKCVGFIGGK 440
>gi|407918366|gb|EKG11637.1| hypothetical protein MPH_11130 [Macrophomina phaseolina MS6]
Length = 980
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 108/194 (55%), Gaps = 23/194 (11%)
Query: 258 KACHCSETCNNRPFRKEKK------------------IKIVKTEFCGWGVEAAEPINKGE 299
++C+ C NR F +K +++++T G GV A P ++ +
Sbjct: 469 RSCNLGPNCGNRQFNNLQKRYKDDTRQGKYFKSFNVGVEVMETPDRGHGVRAMRPFHQDQ 528
Query: 300 FIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPN 359
++EYIGE+I ++R+ ++ Y+ + FY+ ID ++GN +RF+NHSCDPN
Sbjct: 529 IVVEYIGEIITQQESDRRVNEV-YKDHKAFYLMNFYDKLIIDG-YRGNVARFVNHSCDPN 586
Query: 360 CILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKI 419
C +EKW V GE R+ +FA R+I GE LT+ Y F +G E CYCGA +C G +G + +
Sbjct: 587 CRMEKWTVNGEQRMALFANRNIMTGEELTWHYNFESYGKEQPCYCGAWNCAGVIGRRNRF 646
Query: 420 GKL--ELCWGSKRK 431
KL E G+KRK
Sbjct: 647 -KLPKEPAAGNKRK 659
>gi|302903800|ref|XP_003048936.1| hypothetical protein NECHADRAFT_106311 [Nectria haematococca mpVI
77-13-4]
gi|256729870|gb|EEU43223.1| hypothetical protein NECHADRAFT_106311 [Nectria haematococca mpVI
77-13-4]
Length = 928
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 104/167 (62%), Gaps = 4/167 (2%)
Query: 254 ISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS +A +C C N+ F++++ + ++KTE G+G+ A + +F+ EYIGEVI+
Sbjct: 152 MECSAEAGNCGGGCQNQRFQRKQYANVSVIKTEKKGFGLRADADLQPNDFVYEYIGEVIN 211
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ +R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V +
Sbjct: 212 EPTFRRRMLQYDEEGIKHFYFMSLNKSEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDK 271
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F R I++GE L ++Y ++G + + CYCG +C G++G K
Sbjct: 272 LRMGIFTLRKIQSGEELVFNYNVDRYGADPQPCYCGEPNCVGFIGGK 318
>gi|254566589|ref|XP_002490405.1| Histone methyltransferase [Komagataella pastoris GS115]
gi|238030201|emb|CAY68124.1| Histone methyltransferase [Komagataella pastoris GS115]
gi|328350800|emb|CCA37200.1| histone-lysine N-methyltransferase SETD2 [Komagataella pastoris CBS
7435]
Length = 722
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 68/165 (41%), Positives = 100/165 (60%), Gaps = 4/165 (2%)
Query: 254 ISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C S C C + C N+ F+K++ I + +TE G+GV A I + FI EY+GEVID
Sbjct: 98 IECISGTCSCGDDCQNQRFQKKQYAPIAVFETEKKGYGVRAQADIRQDAFIYEYLGEVID 157
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
++ +R + +G+++FY ++K IDAT KG RF NHSC PN ++KW+V +
Sbjct: 158 ESTFRKRKENYDNQGLEHFYFMMLQKGEFIDATAKGGLGRFCNHSCRPNAYVDKWEVGNK 217
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLG 414
R+G+FA R I GE + +DY ++G KCYCG +C G+LG
Sbjct: 218 LRMGIFAKREIYKGEEICFDYNVDRYGANPQKCYCGEDNCIGFLG 262
>gi|429858497|gb|ELA33313.1| set and ww domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 833
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 61/159 (38%), Positives = 100/159 (62%), Gaps = 3/159 (1%)
Query: 261 HCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRL 318
+C + C N+ F++++ + ++KT+ G+G+ A + +FI EYIGEVI++ +R+
Sbjct: 152 NCGDGCQNQRFQRKQYANVSVIKTDKKGFGLRANVDMQANDFIFEYIGEVINEPTFRRRM 211
Query: 319 WDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA 378
G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+G+F +
Sbjct: 212 SQYDDEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFTS 271
Query: 379 RSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R IKAGE L ++Y ++G + CYCG +C G++G K
Sbjct: 272 RKIKAGEELVFNYNVDRYGANPQPCYCGEVNCTGFIGGK 310
>gi|343427717|emb|CBQ71244.1| related to SET2-Histone methyltransferase [Sporisorium reilianum
SRZ2]
Length = 971
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 103/167 (61%), Gaps = 4/167 (2%)
Query: 254 ISCSKA-CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I CS + C + C N+ F + + + IV+TE G+G+ AA+ I K F+ EY+GEV++
Sbjct: 215 IECSSSKCRWGKQCRNQRFHRRQYVDVDIVQTEKKGFGLRAAQDIPKESFVYEYVGEVMN 274
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
R+ + G+++FY ++ + +DAT KG RF+NHSC+PNC + KWQV
Sbjct: 275 QPTFLDRMQQYRVEGIRHFYFMMLQPNEYLDATKKGGKGRFINHSCNPNCSVSKWQVGKH 334
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I GE LT++Y ++G + + C+CG ++C G LG K
Sbjct: 335 IRMGIFAKRNILQGEELTFNYNVDRYGNDAQECFCGEANCVGTLGGK 381
>gi|195130337|ref|XP_002009608.1| GI15146 [Drosophila mojavensis]
gi|193908058|gb|EDW06925.1| GI15146 [Drosophila mojavensis]
Length = 1885
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/164 (38%), Positives = 101/164 (61%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C N+ F++ + ++ +T+ G G+ A I GEFI+EY+GEVID
Sbjct: 959 IECGPLCTYGDRCTNKRFQQHQGWPCRVFRTKKKGCGITAEMLIPPGEFIMEYVGEVIDS 1018
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
E+R +++Y +R + IDAT KGN SR++NHSCDPN +KW V GE
Sbjct: 1019 EEFERRQHHYSQIRNRHYYFMALRGEAIIDATVKGNISRYINHSCDPNAETQKWTVNGEL 1078
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G F+ ++I GE +T+DY++ ++G + +CYC + +C+G++G
Sbjct: 1079 RIGFFSVKTILPGEEITFDYQYQRYGRDAQRCYCESENCRGWIG 1122
>gi|322799945|gb|EFZ21071.1| hypothetical protein SINV_04653 [Solenopsis invicta]
Length = 1817
Score = 132 bits (333), Expect = 2e-28, Method: Composition-based stats.
Identities = 68/164 (41%), Positives = 95/164 (57%), Gaps = 11/164 (6%)
Query: 254 ISCSKACHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C + C NR F+ K ++ +TE G+G+ A I GEFI+EY+GEV+D
Sbjct: 839 IECGPRCVVGDRCTNRRFQNCDYAKCEVFRTEKKGFGLRAVVNITVGEFIMEYVGEVVDP 898
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R + +++Y ++ D IDAT KGN SRF+NHSCDPN +KW V GE
Sbjct: 899 KDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDPNAETQKWTVNGEL 958
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R I AGE +T+DY F ++G E KCYC A +C+G++G
Sbjct: 959 R--------IAAGEEITFDYHFQRYGKEAQKCYCEAPNCRGWIG 994
>gi|300124007|emb|CBK25278.2| unnamed protein product [Blastocystis hominis]
Length = 223
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 71/174 (40%), Positives = 99/174 (56%), Gaps = 4/174 (2%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIV--KTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
Y+ C + C C + C N+ FR+ + K+V E GWGV I +G FIIEY+GEVI
Sbjct: 39 YVECDPSTCPCGDDCTNQMFRRHEYQKLVLFYDEQKGWGVRTDVFIPRGTFIIEYVGEVI 98
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
E R K +G + Y + D IDAT KGN SRF+NHSCDPNC ++KW
Sbjct: 99 SQKESEYRRQVGKRQGQMHMYYMSLAPDQLIDATDKGNASRFINHSCDPNCEIQKWATSS 158
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRKIGKL 422
VG+FA R I GE +T+DY+F + G + C+CG+ C+ LG +++ +
Sbjct: 159 TYSVGIFAIRDIIPGEEITFDYQFERIGNGAIPCFCGSPKCRHILGKPKEVPNI 212
>gi|322697133|gb|EFY88916.1| histone H3 lysine 36 (K36) methyltransferase [Metarhizium acridum
CQMa 102]
Length = 895
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 99/164 (60%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+ +C C N+ F+++ + ++KT+ G+G+ + +FI EY+GEVI++
Sbjct: 151 VASGSNCGGGCQNQRFQRKLWADVAVIKTDKKGYGLRTESSLQPNDFIYEYVGEVINEPT 210
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R+ G+++FY + K +DAT KGN RF NHSC+PNC ++KW V + R+
Sbjct: 211 FRRRMLQYDEEGIKHFYFMSLSKSEFVDATRKGNLGRFCNHSCNPNCYVDKWVVGDKLRM 270
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R I+AGE L ++Y ++G E + CYCG +C G++G K
Sbjct: 271 GIFALRKIQAGEELVFNYNVDRYGAEPQPCYCGEQNCVGFIGGK 314
>gi|66828443|ref|XP_647576.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60475584|gb|EAL73519.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 898
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 104/170 (61%), Gaps = 6/170 (3%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
Y+ C+ + C + C N+ F++++ IK T GWG+ A E I + +FI+EY GEVI
Sbjct: 592 YVECNIEHCELGKKCTNQRFQRKQYSNIKPAFTGKKGWGLIANEDIEEKQFIMEYCGEVI 651
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
C +R+ + + FY + +DA+ +GN +RF+NHSCDPNC +KW V G
Sbjct: 652 SKQTCLRRMKEAENEKF--FYFLTLDSKECLDASKRGNLARFMNHSCDPNCETQKWTVGG 709
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTKRK 418
E ++G+FA + I G LT+DY + +FG + + CYCG+ +C+GYLG K K
Sbjct: 710 EVKIGIFAIKPIPKGTELTFDYNYERFGAQKQECYCGSVNCRGYLGQKSK 759
>gi|426193608|gb|EKV43541.1| hypothetical protein AGABI2DRAFT_76781 [Agaricus bisporus var.
bisporus H97]
Length = 851
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ ++++ I+IV TE G+G+ A E + K FI EY+G+V++ ++R
Sbjct: 148 CRCRNYCQNQRLQRKEYAPIEIVLTEKKGYGLRAEEDLPKDAFIYEYVGDVVNPISFKKR 207
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY ++KD IDAT G RF NHSC+PNC + KW V + R+G+FA
Sbjct: 208 MREYGQEGIRHFYFMMLQKDEFIDATKNGGIGRFANHSCNPNCYVAKWTVGDKVRMGIFA 267
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R++K E LT++Y ++G + + C+CG ++C GY+G K
Sbjct: 268 KRTVKKHEELTFNYNVDRYGHQAQTCFCGEANCVGYIGGK 307
>gi|222623047|gb|EEE57179.1| hypothetical protein OsJ_07116 [Oryza sativa Japonica Group]
Length = 1963
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 109/215 (50%), Gaps = 15/215 (6%)
Query: 215 DKQEVFCRLPLPYADEEFKIDLTWKDLMENKV----------GPPPVQYISCSKACHC-- 262
DK C +P D+E +L D ++ G P + + H
Sbjct: 1220 DKTFANCSIPQEKTDDEINAELGLSDASADEANGDGSNSKASGEPNFALLRSNLFLHRNR 1279
Query: 263 -SETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDM 321
+++ + R K+ T G+G++ E +++G F+IEY+GEV+D E R
Sbjct: 1280 RTQSIDEFQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLDITAYESRQRYY 1339
Query: 322 KYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSI 381
+G ++FY + IDA KGN RF+NHSC PNC EKW V GE +G+FA R+I
Sbjct: 1340 ASKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNI 1399
Query: 382 KAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLG 414
K GE LT+DY +V+ KC+CG + C+GY+G
Sbjct: 1400 KKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYIG 1434
>gi|307106603|gb|EFN54848.1| hypothetical protein CHLNCDRAFT_35816, partial [Chlorella
variabilis]
Length = 239
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 98/167 (58%), Gaps = 12/167 (7%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
+++ T GWGV A I +G FI+EY GEVIDD C +R D K R +FYM E+
Sbjct: 1 MEVYLTPDKGWGVRAMAFIPRGTFIVEYAGEVIDDKECSRRAEDAKARNEPHFYMMEMAP 60
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG--ETRVGVFAARSIKAGEPLTYDYRFV 394
IDA KGN +R LN SCDPNC +KW G E RVG+F+ R + GE LTYDY+F
Sbjct: 61 GLIIDARSKGNLARLLNSSCDPNCETQKWHDAGNSEVRVGIFSLRDVLPGEELTYDYQFQ 120
Query: 395 QFG-----PEVKCYCGASSCQGYLGTK----RKIG-KLELCWGSKRK 431
FG +C CGA +C+G + T+ R G ++++ W ++R+
Sbjct: 121 HFGLAAAAGAYRCKCGAPNCRGTMDTQPERTRDFGRRVDVWWPAERR 167
>gi|409074525|gb|EKM74921.1| hypothetical protein AGABI1DRAFT_123473 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 881
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ ++++ I+IV TE G+G+ A E + K FI EY+G+V++ ++R
Sbjct: 170 CRCRNYCQNQRLQRKEYAPIEIVLTEKKGYGLRAEEDLPKDAFIYEYVGDVVNPISFKKR 229
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY ++KD IDAT G RF NHSC+PNC + KW V + R+G+FA
Sbjct: 230 MREYGQEGIRHFYFMMLQKDEFIDATKNGGIGRFANHSCNPNCYVAKWTVGDKVRMGIFA 289
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R++K E LT++Y ++G + + C+CG ++C GY+G K
Sbjct: 290 KRTVKKHEELTFNYNVDRYGHQAQTCFCGEANCVGYIGGK 329
>gi|403419215|emb|CCM05915.1| predicted protein [Fibroporia radiculosa]
Length = 861
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/166 (39%), Positives = 103/166 (62%), Gaps = 9/166 (5%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGE------FIIEYIGEVIDD 311
C C C N+ F++++ I+IV+TE G+G+ AA+ ++KG+ FI EY+G+V+
Sbjct: 181 CRCRSYCQNQRFQRKEYAPIEIVQTEKKGFGLRAAQDLHKGQPYYRDAFIYEYLGDVVSQ 240
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+R+ G+++FY ++KD IDAT +G RF NHSC+PNC + KW V
Sbjct: 241 PSFLKRMRLYAEEGIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCYVAKWTVGHHV 300
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R IK E LT++Y ++G + + CYCG S+C G++G K
Sbjct: 301 RMGIFANRDIKKDEELTFNYNVDRYGHDAQPCYCGESNCVGFIGGK 346
>gi|218190961|gb|EEC73388.1| hypothetical protein OsI_07633 [Oryza sativa Indica Group]
Length = 1906
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 109/215 (50%), Gaps = 15/215 (6%)
Query: 215 DKQEVFCRLPLPYADEEFKIDLTWKDLMENKV----------GPPPVQYISCSKACHC-- 262
DK C +P D+E +L D ++ G P + + H
Sbjct: 1235 DKTFADCSIPQEKTDDEINAELGLSDASADEANGDGSNSKASGEPNFALLRSNLFLHRNR 1294
Query: 263 -SETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDM 321
+++ + R K+ T G+G++ E +++G F+IEY+GEV+D E R
Sbjct: 1295 RTQSIDEFQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLDITAYESRQRYY 1354
Query: 322 KYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSI 381
+G ++FY + IDA KGN RF+NHSC PNC EKW V GE +G+FA R+I
Sbjct: 1355 ASKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNI 1414
Query: 382 KAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLG 414
K GE LT+DY +V+ KC+CG + C+GY+G
Sbjct: 1415 KKGEELTFDYNYVRVSGAAPQKCFCGTAKCRGYIG 1449
>gi|301100332|ref|XP_002899256.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
gi|262104173|gb|EEY62225.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
Length = 1501
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 102/168 (60%), Gaps = 5/168 (2%)
Query: 253 YISCSKACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
Y+ CS +C ++ C N+ + ++ ++ KTE G+GV + I + + EY+GE+I
Sbjct: 100 YVECSASCPAAQYCRNQRLQHPERYPLLEPFKTEQKGYGVRTRQHIPQLSIVGEYVGEII 159
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D +RL + R NFY + IDA KG+F+RF+NHSC+PNC EKW V+G
Sbjct: 160 DQKELARRLKSVP-RHELNFYYLLLAPGVYIDARNKGSFTRFVNHSCEPNCKTEKWTVKG 218
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTK 416
ETR+ V A R I+ GE LT+DY++ G ++ C CGA SC+G +GT+
Sbjct: 219 ETRIAVSALRDIEVGEELTFDYKWKALGSRQITCCCGAPSCKGVIGTQ 266
>gi|195395005|ref|XP_002056127.1| GJ10771 [Drosophila virilis]
gi|194142836|gb|EDW59239.1| GJ10771 [Drosophila virilis]
Length = 1430
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 95/163 (58%), Gaps = 4/163 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
CH C N+ F RK ++++V G+G+ EPI +G+FIIEY+GEVI+ A ++R
Sbjct: 1209 CHAGRQCENQLFELRKSPRLEVVYMNERGFGLVCREPIAEGDFIIEYVGEVINHAEFQRR 1268
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +NFY + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1269 MAQKTRDRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFA 1328
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
+ I LT++Y + K C+CGA C G +G K K
Sbjct: 1329 IKDIPENTELTFNYLWDDLMNNGKKACFCGAKRCSGEIGGKLK 1371
>gi|299741036|ref|XP_001834167.2| histone-lysine N-methyltransferase [Coprinopsis cinerea
okayama7#130]
gi|298404525|gb|EAU87663.2| histone-lysine N-methyltransferase [Coprinopsis cinerea
okayama7#130]
Length = 826
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 97/160 (60%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K + I+IV TE G+G+ A + I K FI EY+G+V+ ++R
Sbjct: 175 CRCRSHCQNQRFQKRQYANIEIVLTEKKGYGLRAEDDIPKDSFIYEYVGDVVSPNSFKKR 234
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+Q+FY ++KD IDAT G RF NHSC+PNC + KW V R+G+F+
Sbjct: 235 MREYAEEGIQHFYFMMLQKDEFIDATKSGGIGRFANHSCNPNCYVAKWTVGDHVRMGIFS 294
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I+ E LT++Y ++G + + CYCG +C G++G K
Sbjct: 295 KRLIRKHEELTFNYNVDRYGHQAQTCYCGEPNCIGFIGGK 334
>gi|198476699|ref|XP_002132425.1| GA25456 [Drosophila pseudoobscura pseudoobscura]
gi|198137811|gb|EDY69827.1| GA25456 [Drosophila pseudoobscura pseudoobscura]
Length = 483
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/164 (41%), Positives = 97/164 (59%), Gaps = 3/164 (1%)
Query: 254 ISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I C C E C N+ F+ + ++ TE G G+ A I GEFI+EY+GEVID
Sbjct: 148 IECGPLCSNGERCTNKRFQLNQCWPCRVFHTEKKGCGITAELAIPAGEFIMEYVGEVIDS 207
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
A E+R +++Y +R IDAT GN SRF+NHSCDPN +KW V GE
Sbjct: 208 AEFERRQHRYAEGRNRHYYFMALRGGAIIDATMGGNISRFMNHSCDPNAETQKWTVNGEL 267
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG 414
R+G+F+ ++I GE +T+DYR+ + + CYC A++C+G+LG
Sbjct: 268 RIGLFSVKTIMPGEEITFDYRYQPYDRIAQPCYCEAANCRGWLG 311
>gi|412985970|emb|CCO17170.1| predicted protein [Bathycoccus prasinos]
Length = 1058
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 258 KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
K C CS C N+ F +K K++++ ++ G G+ + E I GEF++EY+GEV+ + E
Sbjct: 216 KFCPCSTACKNQRFSRKKFKELEVKRSSKKGHGLFSEENIKPGEFVLEYVGEVLHEEAYE 275
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R ++FY + TIDAT +G RFLNHSCDPNC +KW V+GE +GV
Sbjct: 276 ERKRQYAREKRRHFYFMTLSSSETIDATERGGVGRFLNHSCDPNCETQKWMVKGELCIGV 335
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLG 414
FA R I+ GE +T DY+F ++G + ++C+C ++C G++G
Sbjct: 336 FALREIQPGEEITIDYKFERYGEKPMRCFCDTAACCGWIG 375
>gi|195503632|ref|XP_002098733.1| GE10528 [Drosophila yakuba]
gi|194184834|gb|EDW98445.1| GE10528 [Drosophila yakuba]
Length = 1441
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 105/186 (56%), Gaps = 4/186 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F RK ++++V G+G+ EPI +G+F+IEY+GEVI+ A ++R
Sbjct: 1221 CKAGNRCENQMFEQRKSPRLEVVYMNERGFGLVNREPIAEGDFVIEYVGEVINHAEFQRR 1280
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1281 MEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGLFA 1340
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSST 435
+ I LT++Y + K C+CGA+ C G +G K K G ++ K ++
Sbjct: 1341 IKDIPVNTELTFNYLWDDLMNNSKKACFCGATRCSGEIGGKLKDGAVKESNAQTAKGKAS 1400
Query: 436 ACLAII 441
A L I
Sbjct: 1401 AKLKQI 1406
>gi|290992999|ref|XP_002679121.1| set domain-containing protein [Naegleria gruberi]
gi|284092736|gb|EFC46377.1| set domain-containing protein [Naegleria gruberi]
Length = 153
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 98/143 (68%), Gaps = 3/143 (2%)
Query: 275 KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEI 334
KKI++ + G+GV A E I+KG FIIEY+GEV+D+ + +QR Y G +++Y +
Sbjct: 7 KKIEVKRAGEKGFGVFAKEKISKGTFIIEYVGEVLDEKIYKQR--QEAYDGERHYYFLSV 64
Query: 335 RKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFV 394
+ IDA+ KGN +RF+NHSCDPN +L+KW V ++R+GVFA R I+ GE +T+DY
Sbjct: 65 GTNQIIDASKKGNNARFINHSCDPNSVLQKWTVGHQSRIGVFALRDIEKGEEVTFDYAME 124
Query: 395 QFGPEV-KCYCGASSCQGYLGTK 416
+G KCYCG+++C+G + +K
Sbjct: 125 CYGVSFQKCYCGSANCRGTITSK 147
>gi|448089486|ref|XP_004196819.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
gi|448093762|ref|XP_004197850.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
gi|359378241|emb|CCE84500.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
gi|359379272|emb|CCE83469.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
Length = 744
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 63/164 (38%), Positives = 102/164 (62%), Gaps = 3/164 (1%)
Query: 256 CSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+K C + C N+ F+K++ + + +TE G+G+ A + I FI EY GEVID+
Sbjct: 89 TNKYGTCGKDCQNQRFQKKQYASVAVFQTEMKGYGLRAEKDIPDSSFIYEYTGEVIDEEA 148
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R+ + +++FY +++D IDAT KG+ +RF NHSC+PN ++KW V + R+
Sbjct: 149 FRKRMVQYDEKNIKHFYFMMLKQDSFIDATRKGSLARFCNHSCNPNAYVDKWVVGEKLRM 208
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
G+FA R+IK GE +T+DY ++G + + CYCG +C +LG K
Sbjct: 209 GIFAKRTIKRGEEITFDYNVDRYGAQKQPCYCGEPNCIKWLGGK 252
>gi|254586457|ref|XP_002498796.1| ZYRO0G18766p [Zygosaccharomyces rouxii]
gi|238941690|emb|CAR29863.1| ZYRO0G18766p [Zygosaccharomyces rouxii]
Length = 680
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 64/158 (40%), Positives = 96/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C E C N+ F++++ I + +T+ G+GV A I +FI EY+G+VI + + RL
Sbjct: 95 CGEDCQNQRFQRKQYADIAVFQTKLKGYGVRAQSDIEPHQFIYEYMGDVIPEDVFRDRLV 154
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D +G Q+FY ++ IDAT KG+ +RF NHSC+PN + KW V G+ ++G+FA R
Sbjct: 155 DYDQKGYQHFYFMMLQNGEFIDATVKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFAHR 214
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G KCYC +C G+LG K
Sbjct: 215 KILKGEEITFDYNVDRYGATAQKCYCEEPNCIGFLGGK 252
>gi|195143973|ref|XP_002012971.1| GL23881 [Drosophila persimilis]
gi|194101914|gb|EDW23957.1| GL23881 [Drosophila persimilis]
Length = 1466
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 97/163 (59%), Gaps = 4/163 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C + C NR F RK ++ +V G+G+ EPI +G+FIIEY+GEVI+ ++R
Sbjct: 1252 CRCGDRCENRMFETRKSPRMDVVYMNARGFGLVCREPIAEGDFIIEYVGEVINQEEFQRR 1311
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +NFY + K+F IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1312 MLRKQKDRDENFYFLGVEKEFIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTHRVGLFA 1371
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
+ I A LT++Y + + K C+CG+ C G +G K K
Sbjct: 1372 IQDIPAETELTFNYLWDDLLNDKKKACHCGSERCSGEIGGKLK 1414
>gi|302307921|ref|NP_984733.2| AEL128Cp [Ashbya gossypii ATCC 10895]
gi|442570024|sp|Q757Y8.2|SET2_ASHGO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|299789232|gb|AAS52557.2| AEL128Cp [Ashbya gossypii ATCC 10895]
gi|374107951|gb|AEY96858.1| FAEL128Cp [Ashbya gossypii FDAG1]
Length = 684
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 95/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F+K++ I + +TE G+GV A I EFI EYIGEVI +A R+
Sbjct: 95 CGDDCQNQRFQKKEYADIAVFQTEKKGYGVRAERDIEANEFIYEYIGEVISEADFRDRMV 154
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D RG ++FY ++ IDAT +G +RF NHSC+PN + KW V G+ ++G+FA R
Sbjct: 155 DYDMRGFKHFYFMMLQAGEFIDATERGCLARFCNHSCNPNAYVSKWDVAGKLKMGIFAHR 214
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE +T+DY ++G + CYC +C G+LG K
Sbjct: 215 KILKGEEITFDYNVDRYGATAQPCYCDEPNCIGFLGGK 252
>gi|390355933|ref|XP_784903.3| PREDICTED: uncharacterized protein LOC579712 isoform 3
[Strongylocentrotus purpuratus]
Length = 3326
Score = 130 bits (328), Expect = 9e-28, Method: Composition-based stats.
Identities = 69/172 (40%), Positives = 98/172 (56%), Gaps = 8/172 (4%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+Q+ S +C C + C N+ ++ ++ TE GWGV +PI FIIEY+GE
Sbjct: 2504 IQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPIRHSSFIIEYLGE 2563
Query: 308 VID-DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
VI L ++ L D +Y+ ++ Y + ID GN RF+NHSC+PNC ++KW
Sbjct: 2564 VISVKELWKRALDDYQYQ--KHHYCLNLDGGMVIDGYRYGNEGRFVNHSCNPNCEMQKWM 2621
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
V G R+G+FA R I+ GE LTYDY F F E + C CG +C+GY+G K
Sbjct: 2622 VNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQECNCGHETCRGYIGGK 2673
>gi|345781638|ref|XP_003432154.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Canis
lupus familiaris]
Length = 742
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 101/200 (50%), Gaps = 32/200 (16%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINK----------- 297
+QY + C E C N+ F K +I+KTE GWG+ I K
Sbjct: 460 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKVAWGTTLGIWK 519
Query: 298 ------------------GEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFT 339
GEF+ EY+GE+ID+ C R+ V NFYM + KD
Sbjct: 520 NSVKQDTFVSLLETDNKQGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRI 579
Query: 340 IDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGP- 398
IDA KGN+SRF+NHSC+PNC +KW V G+ RVG+FA I AG LT++Y G
Sbjct: 580 IDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFALCDIPAGMELTFNYNLDCLGNG 639
Query: 399 EVKCYCGASSCQGYLGTKRK 418
+C+CGA +C G+LG + K
Sbjct: 640 RTECHCGAENCSGFLGVRPK 659
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQR---------LQWRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 96 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIYKASKGRMMRCLRCPV 155
Query: 181 ASH 183
A H
Sbjct: 156 AYH 158
>gi|390355937|ref|XP_003728662.1| PREDICTED: uncharacterized protein LOC579712 isoform 2
[Strongylocentrotus purpuratus]
Length = 3111
Score = 130 bits (328), Expect = 9e-28, Method: Composition-based stats.
Identities = 69/172 (40%), Positives = 98/172 (56%), Gaps = 8/172 (4%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+Q+ S +C C + C N+ ++ ++ TE GWGV +PI FIIEY+GE
Sbjct: 2272 IQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPIRHSSFIIEYLGE 2331
Query: 308 VID-DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
VI L ++ L D +Y+ ++ Y + ID GN RF+NHSC+PNC ++KW
Sbjct: 2332 VISVKELWKRALDDYQYQ--KHHYCLNLDGGMVIDGYRYGNEGRFVNHSCNPNCEMQKWM 2389
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
V G R+G+FA R I+ GE LTYDY F F E + C CG +C+GY+G K
Sbjct: 2390 VNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQECNCGHETCRGYIGGK 2441
>gi|390355935|ref|XP_003728661.1| PREDICTED: uncharacterized protein LOC579712 isoform 1
[Strongylocentrotus purpuratus]
Length = 3164
Score = 130 bits (328), Expect = 1e-27, Method: Composition-based stats.
Identities = 69/172 (40%), Positives = 98/172 (56%), Gaps = 8/172 (4%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+Q+ S +C C + C N+ ++ ++ TE GWGV +PI FIIEY+GE
Sbjct: 2325 IQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPIRHSSFIIEYLGE 2384
Query: 308 VID-DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
VI L ++ L D +Y+ ++ Y + ID GN RF+NHSC+PNC ++KW
Sbjct: 2385 VISVKELWKRALDDYQYQ--KHHYCLNLDGGMVIDGYRYGNEGRFVNHSCNPNCEMQKWM 2442
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
V G R+G+FA R I+ GE LTYDY F F E + C CG +C+GY+G K
Sbjct: 2443 VNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQECNCGHETCRGYIGGK 2494
>gi|452822434|gb|EME29453.1| chondroitin-glucuronate 5-epimerase [Galdieria sulphuraria]
Length = 1135
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/169 (42%), Positives = 100/169 (59%), Gaps = 8/169 (4%)
Query: 251 VQYISCS-KACHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V + C + C C N+ F+K ++K+ + GWG++AAE + KG FIIEY GE
Sbjct: 788 VSFTECHPEYCRTGSKCQNQRFQKCEYARVKLFQAGERGWGLKAAEFLPKGTFIIEYQGE 847
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID E+R +Y G ++FY + D IDA+ K N +RF+NHSC PNC EKW V
Sbjct: 848 VIDTEEYERR--KRRYAGERHFYFMSLDSDHMIDASRKSNMARFINHSCQPNCHTEKWTV 905
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFG---PEVKCYCGASSCQGYL 413
GE VG+FA++ I+AG L +DY + G V+CYCGA C+ +L
Sbjct: 906 LGEPCVGIFASQDIEAGTELVFDYNVDRKGVGEESVRCYCGAPKCRNWL 954
>gi|390461100|ref|XP_003732597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 3
[Callithrix jacchus]
Length = 1389
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/165 (44%), Positives = 94/165 (56%), Gaps = 14/165 (8%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GEVID+ C
Sbjct: 1079 QVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECM 1138
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ +D IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1139 ARI-----------KHAHENQDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1187
Query: 376 FAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
FA I AG LT++Y G E C CGAS+C G+LG + K
Sbjct: 1188 FAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSGFLGDRPKT 1232
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKER-LGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CVK+ L + R F+CP H+C C + RCVRC
Sbjct: 728 CVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCP 787
Query: 180 IASH 183
+A H
Sbjct: 788 VAYH 791
>gi|363756006|ref|XP_003648219.1| hypothetical protein Ecym_8108 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891419|gb|AET41402.1| Hypothetical protein Ecym_8108 [Eremothecium cymbalariae
DBVPG#7215]
Length = 682
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 95/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F R+ I + +TE G+GV A + I EFI EYIGEVI ++ +R+
Sbjct: 95 CGDNCQNQRFQGRQYADIAVFQTEKKGYGVRAEKDIEANEFIYEYIGEVISESEFRERMV 154
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D RG ++FY ++ IDAT KG +RF NHSC+PN + KW V G+ ++G+FA R
Sbjct: 155 DYDVRGYKHFYFMMLQTGEFIDATEKGCLARFCNHSCNPNAYVSKWDVAGKLKMGIFANR 214
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE +T+DY ++G + CYC +C G+LG K
Sbjct: 215 KIFKGEEITFDYNVDRYGATAQPCYCDEPNCIGFLGGK 252
>gi|195352984|ref|XP_002042990.1| GM16309 [Drosophila sechellia]
gi|194127055|gb|EDW49098.1| GM16309 [Drosophila sechellia]
Length = 1418
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/181 (38%), Positives = 101/181 (55%), Gaps = 4/181 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C NR F RK ++++V G+G+ EPI G+F+IEY+GEVI+ A ++R
Sbjct: 1208 CKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAAGDFVIEYVGEVINHAEFQRR 1267
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1268 MEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFA 1327
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSST 435
+ I LT++Y + K C+CGA C G +G K K ++ K+ R +
Sbjct: 1328 IKDIPVNTELTFNYLWDDLMNNSKKACFCGAKRCSGEIGGKLKDDAIKAHAKLKQMRRAK 1387
Query: 436 A 436
A
Sbjct: 1388 A 1388
>gi|410080658|ref|XP_003957909.1| hypothetical protein KAFR_0F01770 [Kazachstania africana CBS 2517]
gi|372464496|emb|CCF58774.1| hypothetical protein KAFR_0F01770 [Kazachstania africana CBS 2517]
Length = 711
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/166 (39%), Positives = 94/166 (56%), Gaps = 3/166 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C E C N+ F+K++ I + +TE G+GV + EFI EYIGEVID+ RL
Sbjct: 99 CGENCQNQRFQKKQYADISVFQTEMKGYGVRCNTDLEANEFIYEYIGEVIDEESFRDRLI 158
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
++FY ++ IDAT KG RF NHSC+PN + KW V G+ ++G+F+ R
Sbjct: 159 KYDEMHFKHFYFMMLQNGQFIDATRKGALGRFCNHSCNPNAYVNKWVVNGKLKMGIFSKR 218
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLEL 424
I+ GE +T+DY ++G KCYCG +C G+LG K + L
Sbjct: 219 KIQKGEEITFDYNVDRYGANAQKCYCGEPNCIGFLGGKTQTDSASL 264
>gi|323451560|gb|EGB07437.1| hypothetical protein AURANDRAFT_27977 [Aureococcus anophagefferens]
Length = 224
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/172 (42%), Positives = 100/172 (58%), Gaps = 10/172 (5%)
Query: 254 ISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I C A C + C NR F R K+ + K CG G+ A EPI +G F++E +GE+I
Sbjct: 48 IECDPATCPMGDKCQNRCFAARLGSKVSVEKAGRCGRGLFAREPIPEGAFVVEALGELIS 107
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIR----KDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
+ ++RL + G +++YM K IDAT KGN R+ NHSCDP+C LEKW+
Sbjct: 108 EEEAQERLATARANGDEHYYMLAASDAATKGLVIDATRKGNEFRWANHSCDPSCRLEKWR 167
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRK 418
+ R G+ A RSIK GE LTYDYR+ F +CYCGA++C G + +K
Sbjct: 168 CGSQDRYGIVALRSIKPGEELTYDYRWASF---ERCYCGAANCVGVMDGVKK 216
>gi|17552320|ref|NP_498039.1| Protein SET-2, isoform c [Caenorhabditis elegans]
gi|351058302|emb|CCD65736.1| Protein SET-2, isoform c [Caenorhabditis elegans]
Length = 1510
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 89/149 (59%), Gaps = 1/149 (0%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N FRK K IK ++ GWG+ A E I E I+EYIG+ I + E+R + RG
Sbjct: 1363 INQLKFRK-KMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEEREKAYERRG 1421
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I IDAT +GNF+RF+NHSC PNC + +EGE R+ +++ IK GE
Sbjct: 1422 IGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGE 1481
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA +C+GYL
Sbjct: 1482 EITYDYKFPIEDDKIDCLCGAKTCRGYLN 1510
>gi|326676505|ref|XP_692254.4| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Danio
rerio]
Length = 2933
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 65/172 (37%), Positives = 100/172 (58%), Gaps = 7/172 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C CS+ C+N+ +K + ++ + + E GWG+ +P+ G+FIIEY+GEV
Sbjct: 2035 YAECSPSTCPCSDQCDNQRIQKHEWVQCLERFRAEGKGWGIRTKQPLRAGQFIIEYLGEV 2094
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y Y + ID+ GN +RF+NHSC+PNC ++KW V
Sbjct: 2095 VSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMGNEARFVNHSCEPNCEMQKWSVN 2153
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
G R+G+FA + I +G LTYDY F F E + C CG+ C+G +G K K
Sbjct: 2154 GVYRIGLFALKDINSGTELTYDYNFHSFNTEEQQVCKCGSEGCRGIIGGKSK 2205
>gi|17552318|ref|NP_498040.1| Protein SET-2, isoform a [Caenorhabditis elegans]
gi|30173238|sp|Q18221.2|SET2_CAEEL RecName: Full=Probable histone-lysine N-methyltransferase set-2;
AltName: Full=SET domain-containing protein 2
gi|351058300|emb|CCD65734.1| Protein SET-2, isoform a [Caenorhabditis elegans]
Length = 1507
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 89/149 (59%), Gaps = 1/149 (0%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N FRK K IK ++ GWG+ A E I E I+EYIG+ I + E+R + RG
Sbjct: 1360 INQLKFRK-KMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEEREKAYERRG 1418
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I IDAT +GNF+RF+NHSC PNC + +EGE R+ +++ IK GE
Sbjct: 1419 IGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGE 1478
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA +C+GYL
Sbjct: 1479 EITYDYKFPIEDDKIDCLCGAKTCRGYLN 1507
>gi|25395700|pir||H88444 protein C26E6.12 [imported] - Caenorhabditis elegans
Length = 1802
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 89/149 (59%), Gaps = 1/149 (0%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N FRK K IK ++ GWG+ A E I E I+EYIG+ I + E+R + RG
Sbjct: 1655 INQLKFRK-KMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEEREKAYERRG 1713
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I IDAT +GNF+RF+NHSC PNC + +EGE R+ +++ IK GE
Sbjct: 1714 IGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGE 1773
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA +C+GYL
Sbjct: 1774 EITYDYKFPIEDDKIDCLCGAKTCRGYLN 1802
>gi|400596232|gb|EJP64008.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 900
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 103/167 (61%), Gaps = 4/167 (2%)
Query: 254 ISCSK-ACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS+ A +C C N+ F+++ + ++KT+ G+G+ + +FI EYIGEVI+
Sbjct: 154 MECSETAGNCGGGCQNQRFQRKHYADVSVIKTDKKGYGLRTDADLAANDFIFEYIGEVIN 213
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+A +R+ G+++FY + K+ +DAT KGN RF NHSC PNC ++KW V +
Sbjct: 214 EATFRRRMLQYDQEGIKHFYFMSLNKNEFVDATRKGNLGRFCNHSCVPNCYVDKWVVGDK 273
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R++ AGE L ++Y ++G + + CYCG +C ++G K
Sbjct: 274 LRMGIFALRAVSAGEELVFNYNVDRYGADPQPCYCGEPNCLRFIGGK 320
>gi|47219458|emb|CAG10822.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2598
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 251 VQYISCSKA-CHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIG 306
+ + CS + C C++ C+N+ ++ + ++ + +TE GWG+ +P+ G+FIIEY+G
Sbjct: 1717 MSFAECSPSTCPCADQCDNQRIQRHEWVQCLERFRTEGKGWGIRTKQPLRAGQFIIEYLG 1776
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ + R+ + + N Y + ID+ GN +RF+NHSC+PNC ++KW
Sbjct: 1777 EVVSEQEFRSRMMEQYFSHSGN-YCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWS 1835
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK-RKIGKLE 423
V G R+G+FA I +G LTYDY F F E + C CG+ SC+G +G K ++I L
Sbjct: 1836 VNGVYRIGLFALGEIPSGTELTYDYNFHSFNTEEQQACKCGSESCRGIIGGKSQRINGLP 1895
Query: 424 LCWGSKRK 431
+ G R+
Sbjct: 1896 VKAGGARR 1903
>gi|17552316|ref|NP_498041.1| Protein SET-2, isoform b [Caenorhabditis elegans]
gi|351058301|emb|CCD65735.1| Protein SET-2, isoform b [Caenorhabditis elegans]
Length = 739
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 89/149 (59%), Gaps = 1/149 (0%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N FRK K IK ++ GWG+ A E I E I+EYIG+ I + E+R + RG
Sbjct: 592 INQLKFRK-KMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEEREKAYERRG 650
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I IDAT +GNF+RF+NHSC PNC + +EGE R+ +++ IK GE
Sbjct: 651 IGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGE 710
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA +C+GYL
Sbjct: 711 EITYDYKFPIEDDKIDCLCGAKTCRGYLN 739
>gi|349579035|dbj|GAA24198.1| K7_Set2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 733
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|330443605|ref|NP_012367.2| Set2p [Saccharomyces cerevisiae S288c]
gi|347595709|sp|P46995.2|SET2_YEAST RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=Lysine N-methyltransferase 3;
AltName: Full=SET domain-containing protein 2
gi|329138914|tpg|DAA08635.2| TPA: Set2p [Saccharomyces cerevisiae S288c]
Length = 733
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|24650756|ref|NP_733239.1| Mes-4, isoform A [Drosophila melanogaster]
gi|29427833|sp|Q8MT36.2|MES4_DROME RecName: Full=Probable histone-lysine N-methyltransferase Mes-4;
AltName: Full=Maternal-effect sterile 4 homolog
gi|23172478|gb|AAF56762.2| Mes-4, isoform A [Drosophila melanogaster]
gi|94400569|gb|ABF17912.1| FI01019p [Drosophila melanogaster]
Length = 1427
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 102/186 (54%), Gaps = 4/186 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C NR F RK ++++V G+G+ EPI G+F+IEY+GEVI+ A ++R
Sbjct: 1217 CKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRR 1276
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1277 MEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFA 1336
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSST 435
+ I LT++Y + K C+CGA C G +G K K ++ K+ R +
Sbjct: 1337 IKDIPVNSELTFNYLWDDLMNNSKKACFCGAKRCSGEIGGKLKDDAVKAHAKLKQMRRAK 1396
Query: 436 ACLAII 441
A I
Sbjct: 1397 ASAVRI 1402
>gi|1008361|emb|CAA89464.1| EZL1 [Saccharomyces cerevisiae]
Length = 733
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|442621474|ref|NP_001263029.1| Mes-4, isoform B [Drosophila melanogaster]
gi|440217972|gb|AGB96409.1| Mes-4, isoform B [Drosophila melanogaster]
Length = 1423
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 102/186 (54%), Gaps = 4/186 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C NR F RK ++++V G+G+ EPI G+F+IEY+GEVI+ A ++R
Sbjct: 1213 CKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRR 1272
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1273 MEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFA 1332
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSST 435
+ I LT++Y + K C+CGA C G +G K K ++ K+ R +
Sbjct: 1333 IKDIPVNSELTFNYLWDDLMNNSKKACFCGAKRCSGEIGGKLKDDAVKAHAKLKQMRRAK 1392
Query: 436 ACLAII 441
A I
Sbjct: 1393 ASAVRI 1398
>gi|323354369|gb|EGA86208.1| Set2p [Saccharomyces cerevisiae VL3]
Length = 683
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|388581385|gb|EIM21694.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
Length = 681
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 102/169 (60%), Gaps = 5/169 (2%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C CNN+ F++++ I I+KTE G+G+ A +++ F+IEYIGEV+ +R
Sbjct: 69 CKTGPQCNNQRFQRKQWANIDIIKTEKKGYGLRANVDLDRDTFLIEYIGEVVTQTQFLRR 128
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY ++ + IDAT +GN RF NHSC PNC + KW V ++G+F
Sbjct: 129 MNTYSKEGIKHFYFMMLQNEEFIDATRRGNIGRFANHSCAPNCFVSKWVVGKYVKMGIFT 188
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKLE 423
R I+ GE LT++Y ++G + + CYCG +C G++G T+ IG ++
Sbjct: 189 KRKIEKGEELTFNYNVDRYGHDAQPCYCGEPNCVGFIGGKTQTDIGGMD 237
>gi|151944961|gb|EDN63216.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
gi|190409343|gb|EDV12608.1| SET domain protein 2 [Saccharomyces cerevisiae RM11-1a]
gi|207344135|gb|EDZ71371.1| YJL168Cp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256271359|gb|EEU06424.1| Set2p [Saccharomyces cerevisiae JAY291]
Length = 733
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|365764896|gb|EHN06414.1| Set2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 733
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|406606115|emb|CCH42475.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 815
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/158 (40%), Positives = 100/158 (63%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C E C+N+ F+K++ I +++TE G+GV + I FI EYIGEVID+ ++R+
Sbjct: 151 CGEDCSNQRFQKKQYVDINVIQTEKKGYGVRSQIDIEPDTFIYEYIGEVIDEKSFKKRML 210
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ ++FY ++K +DAT KG+ +RF NHSC+PNC +EKW V + ++G+FA R
Sbjct: 211 EYDELNFKHFYFMMLQKGEFLDATRKGSLARFCNHSCNPNCYVEKWVVGEKLKMGIFAKR 270
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE +T+DY ++G + CYCG +C G++G K
Sbjct: 271 KILKGEEITFDYNVDRYGANAQPCYCGEPNCIGFIGGK 308
>gi|443897443|dbj|GAC74784.1| clathrin coat binding protein [Pseudozyma antarctica T-34]
Length = 925
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 101/167 (60%), Gaps = 4/167 (2%)
Query: 254 ISCSKA-CHCSETCNNRPFRKEKKI--KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
I CS + C + C N+ F + + + IV+TE G+G+ AA I K F+ EY+GEV++
Sbjct: 166 IECSSSKCRWGKLCRNQRFHRRQYVYVDIVQTEKKGFGLRAAHDIPKESFVYEYVGEVMN 225
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
R+ + G+++FY ++ + +DAT KG RF+NHSC+PNC + KWQV
Sbjct: 226 QHTFLDRMQLYRTEGIRHFYFMMLQPNEYLDATKKGGKGRFINHSCNPNCSVSKWQVGKH 285
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+FA R+I GE LT++Y ++G + + C+CG +C G LG K
Sbjct: 286 LRMGIFAKRNIAKGEELTFNYNVDRYGNDAQECFCGEPNCVGTLGGK 332
>gi|21392158|gb|AAM48433.1| RE61305p [Drosophila melanogaster]
Length = 1016
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 102/186 (54%), Gaps = 4/186 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C NR F RK ++++V G+G+ EPI G+F+IEY+GEVI+ A ++R
Sbjct: 806 CKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRR 865
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 866 MEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFA 925
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSST 435
+ I LT++Y + K C+CGA C G +G K K ++ K+ R +
Sbjct: 926 IKDIPVNSELTFNYLWDDLMNNSKKACFCGAKRCSGEIGGKLKDDAVKAHAKLKQMRRAK 985
Query: 436 ACLAII 441
A I
Sbjct: 986 ASAVRI 991
>gi|124513208|ref|XP_001349960.1| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|23615377|emb|CAD52368.1| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 2548
Score = 129 bits (324), Expect = 3e-27, Method: Composition-based stats.
Identities = 71/171 (41%), Positives = 98/171 (57%), Gaps = 12/171 (7%)
Query: 254 ISCSKA-CHCSET-----CNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYI 305
I CSK+ C+ E C NRPFRK K ++I KTE G+GV I GE I EY+
Sbjct: 2090 IQCSKSRCNLPEKIQDRKCFNRPFRKSFVKDLEIKKTEKTGYGVFCKRDIKNGELICEYV 2149
Query: 306 GEVIDDALCEQRLW----DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCI 361
GEV+ E+RL + K + N+Y+ +I KD ID+ KG+ SRF+NHSC PN +
Sbjct: 2150 GEVLGKREFEKRLEVYQEESKKTDMYNWYIIQINKDVYIDSGKKGSISRFINHSCSPNSV 2209
Query: 362 LEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGY 412
+KW V G R+G+FA R I +GE +TY+Y + +C C + +C Y
Sbjct: 2210 SQKWIVRGFYRIGIFALRDIPSGEEITYNYSYNFLFNNFECLCKSPNCMNY 2260
>gi|15488418|gb|AAL01110.1|AF408059_1 ASH1-like protein 1 [Arabidopsis thaliana]
Length = 229
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/144 (45%), Positives = 88/144 (61%), Gaps = 1/144 (0%)
Query: 276 KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIR 335
K K++K E GWG+ A E I G+FI+EY GEVI ++R + GV++ Y+ +
Sbjct: 8 KTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKDAYIISLN 67
Query: 336 KDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQ 395
IDAT KG+ +RF+NHSC PNC KW V GE RVG+FA SI L YDY F
Sbjct: 68 ASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEW 127
Query: 396 F-GPEVKCYCGASSCQGYLGTKRK 418
+ G +V+C CGA +C G+LG K +
Sbjct: 128 YGGAKVRCLCGAVACSGFLGAKSR 151
>gi|395847339|ref|XP_003796336.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
[Otolemur garnettii]
Length = 1427
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 97/171 (56%), Gaps = 14/171 (8%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C E C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ R +N D IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRI----KRAHEN-------SDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1228
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1229 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1279
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|324500837|gb|ADY40382.1| Histone-lysine N-methyltransferase SETD2 [Ascaris suum]
Length = 950
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 98/167 (58%), Gaps = 6/167 (3%)
Query: 253 YISCSKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI- 309
Y C C C+NR ++ K+++ GWG+ A EP+ G+F++EY+GEVI
Sbjct: 148 YTECGSRCPSGARCSNRRLHNKEYAKVEVFYAGVKGWGLRACEPLEPGQFVMEYVGEVIS 207
Query: 310 -DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
++ R + + V ++ M ++ IDAT +GN SRF+NHSCDPNC EKW V+
Sbjct: 208 AEEMRRRVRRYGRDPKHVHHYLMA-LKNGAVIDATIRGNISRFINHSCDPNCRSEKWTVD 266
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLG 414
R+G FA + + GE L +DY+ +G + +CYCGA++C+G +G
Sbjct: 267 RRVRIGFFATKKVAVGEELVFDYQLELYGRKAQRCYCGAANCRGLIG 313
>gi|323332954|gb|EGA74356.1| Set2p [Saccharomyces cerevisiae AWRI796]
Length = 537
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/166 (40%), Positives = 96/166 (57%), Gaps = 3/166 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKIGKLEL 424
I GE +T+DY ++G + KCYC +C G+LG K + L
Sbjct: 225 KILKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGKTQTDAASL 270
>gi|449664137|ref|XP_002169363.2| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Hydra
magnipapillata]
Length = 655
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 104/185 (56%), Gaps = 26/185 (14%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
I C+ C E C NR F+K++ +F N+G F+ EY GEV+D
Sbjct: 283 IECNNRCPTREFCTNRNFQKQE-------DFYE---------NEGSFVSEYCGEVVDYTE 326
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
+R G+ ++Y ++ + IDAT KG+ SRF+NHSCDPNCI +KW V G RV
Sbjct: 327 FHRRTKLYNAEGMNHYYFMTLKTNEIIDATKKGSKSRFINHSCDPNCITQKWTVNGFLRV 386
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKR---------KIGKLE 423
G FA R I+AGE L++DY+F ++G + KCYCGA+ C+G +G ++ KI +L
Sbjct: 387 GFFALRYIEAGEELSFDYQFQRYGEKPQKCYCGAAICRGVIGVEQSGPIPSYIEKISELN 446
Query: 424 LCWGS 428
GS
Sbjct: 447 ELLGS 451
>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
Length = 768
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 105/207 (50%), Gaps = 34/207 (16%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINK------------------ 297
+ C E C N+ F K + ++I +T CGWG+ A I K
Sbjct: 423 QVCAAGERCQNQAFSKRQYTPVEIYRTLSCGWGLRAVSDIKKVRRGDGDGEQLQGVPGVL 482
Query: 298 ---------GEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNF 348
G F+IEY+GEVID+ C R+ + + NFYM + KD IDA KGN
Sbjct: 483 MTSSDATSQGAFVIEYVGEVIDEEECRARIKHAQENDIFNFYMLTLDKDRIIDAGPKGNQ 542
Query: 349 SRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGAS 407
+RF+NH C PNC +KW V G+TRVG+FA + I G+ L ++Y G + C CGA
Sbjct: 543 ARFMNHCCQPNCETQKWTVNGDTRVGLFALQDIPKGKELNFNYNLECLGNGKTVCKCGAP 602
Query: 408 SCQGYLGTKRK----IGKLELCWGSKR 430
+C G+LG + K KL+L G +R
Sbjct: 603 NCSGFLGVRPKNQPSAEKLKLKEGKRR 629
>gi|449544167|gb|EMD35141.1| hypothetical protein CERSUDRAFT_54162 [Ceriporiopsis subvermispora
B]
Length = 788
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 96/160 (60%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++++ I++VKTE G+G+ A + K FI EY+G+V+ +R
Sbjct: 105 CRCRSYCQNQRFQRKEYADIEVVKTEKKGFGLRAGADLRKDTFIYEYVGDVVSQPSFLKR 164
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ +++FY ++KD IDAT +G RF NHSC+PNC + KW + R+G+FA
Sbjct: 165 MRQYAEENIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCYVAKWTIGEHVRMGIFA 224
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R IK E LT++Y ++G + + CYCG +C G++G K
Sbjct: 225 NRYIKKDEELTFNYNVDRYGNDAQPCYCGEPNCVGFIGGK 264
>gi|259147310|emb|CAY80562.1| Set2p [Saccharomyces cerevisiae EC1118]
Length = 733
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
+ GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KLLKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|323337021|gb|EGA78277.1| Set2p [Saccharomyces cerevisiae Vin13]
Length = 733
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
+ GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KLLKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|392298598|gb|EIW09695.1| Set2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 733
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
+ GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KLLKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|358253208|dbj|GAA52505.1| histone-lysine N-methyltransferase SETD2 [Clonorchis sinensis]
Length = 757
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 62/123 (50%), Positives = 82/123 (66%), Gaps = 1/123 (0%)
Query: 297 KGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSC 356
K FIIEY+GEVID A +R+ + G + Y + D IDA KGN++RF+NHSC
Sbjct: 4 KSTFIIEYVGEVIDFAEFRRRIRRYERLGHAHHYFMALESDRFIDAGAKGNWARFVNHSC 63
Query: 357 DPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFG-PEVKCYCGASSCQGYLGT 415
+PNC+ +KW V+GE R+G FA I+AGE +T DY+FVQFG E KCYCG +C G +G+
Sbjct: 64 EPNCVTQKWSVDGEIRIGFFAREDIEAGEEITIDYQFVQFGVSEQKCYCGKPTCSGIMGS 123
Query: 416 KRK 418
K
Sbjct: 124 TSK 126
>gi|145355325|ref|XP_001421914.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582153|gb|ABP00208.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 860
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 260 CHCSETC-NNRPFRKEKKIKIVK-TEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N R R E + V+ T G G+ AAE + GEF++EY GEV+ + ++R
Sbjct: 161 CPCGSACGNQRMSRGESRATTVRRTGKKGHGLFAAERVGAGEFVLEYCGEVLHEEAYKER 220
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G ++Y + TIDAT +GN RFLNHSC PNC +KW V GE +G+FA
Sbjct: 221 KRRYQDEGRSHYYFMTLSSSETIDATIRGNEGRFLNHSCAPNCETQKWMVRGELCIGIFA 280
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLG 414
R I+ GE LT DY+F +FG + +CYC A +C G++G
Sbjct: 281 TRDIEEGEELTIDYKFERFGEKPSRCYCMAGACCGWIG 318
>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
Length = 1541
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 69/171 (40%), Positives = 95/171 (55%), Gaps = 5/171 (2%)
Query: 254 ISCSKA-CHCSETCNNRPF--RKEKKIKIVKT-EFCGWGVEAAEPINKGEFIIEYIGEVI 309
I C A C E C N+ F R+ ++ KT + GWG++ + K +F+ EY+GE+I
Sbjct: 1112 IECHPAGCPAGEKCQNQRFVKRQYPAVESFKTPDGRGWGLKTLVDVKKHDFVYEYVGELI 1171
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ ++R+ V NFYM + K+ IDA K N SRF+NHSC PNC +KW V G
Sbjct: 1172 DEEEVQRRIKKAHEDNVTNFYMLTLDKNRIIDAGPKANMSRFMNHSCQPNCETQKWMVNG 1231
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKI 419
+ RVG+FA I G LT++Y G E C CGA C GY+G + K
Sbjct: 1232 DIRVGLFAMDDIPTGSELTFNYNLDCLGNEKTPCNCGAPICSGYIGVRPKT 1282
Score = 38.1 bits (87), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 14/67 (20%)
Query: 131 CSVRGCGGVYHFICVKERLG-ISNPRNFKCPQHACFIC-------------RQRLQWRCV 176
CS+ CG YH C+++ + + F+CP H C C R R+ RCV
Sbjct: 743 CSIPICGKYYHEDCLRKFPNTVFEAKGFRCPLHVCGTCVAVAGGDVKKVKSRGRILARCV 802
Query: 177 RCTIASH 183
RC A H
Sbjct: 803 RCPTAYH 809
>gi|308485102|ref|XP_003104750.1| CRE-MET-1 protein [Caenorhabditis remanei]
gi|308257448|gb|EFP01401.1| CRE-MET-1 protein [Caenorhabditis remanei]
Length = 1582
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 104/167 (62%), Gaps = 16/167 (9%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFC-GWGVEAAEPINKGEFIIEYIGEVIDDALCEQR- 317
C C N+ F K+K ++ T G G+ A + I KG FIIEY+GEV++ E+R
Sbjct: 682 CPANCKNQRFAKKKYASVEAFHTGTAKGCGLRALKDIKKGRFIIEYVGEVVERDDYEKRK 741
Query: 318 ---LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE---- 370
D K++ + Y+C+ +TIDAT GN SRF+NHSC+PN + EKW V
Sbjct: 742 KKYAADEKHK---HHYLCDTGV-YTIDATVYGNPSRFVNHSCEPNAVCEKWSVPKTPGDI 797
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
+R+G FA +SIKAGE +T+DY+FV +G E +C+CGA SC G++GTK
Sbjct: 798 SRIGFFAKKSIKAGEEITFDYQFVNYGREAQQCFCGAPSCNGWIGTK 844
>gi|195453973|ref|XP_002074027.1| GK14418 [Drosophila willistoni]
gi|194170112|gb|EDW85013.1| GK14418 [Drosophila willistoni]
Length = 1420
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 66/163 (40%), Positives = 95/163 (58%), Gaps = 4/163 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C + C N+ F RK ++ +V G+G+ E I +G+F+IEY+GEVI+ A ++R
Sbjct: 1179 CRAGDRCENQMFEMRKSPRLDVVYMNDRGFGLVCREAIAEGDFVIEYVGEVINHAEFQKR 1238
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 1239 MLQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWSVNCIHRVGLFA 1298
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
+ I A LT++Y + K CYCGA C G +G K K
Sbjct: 1299 IKDIPANTELTFNYLWDDLMNNGKKACYCGAERCSGQIGGKLK 1341
>gi|323348010|gb|EGA82268.1| Set2p [Saccharomyces cerevisiae Lalvin QA23]
Length = 650
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 94/158 (59%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I I KT+ G+GV A + I +FI EY GEVI++ RL
Sbjct: 105 CGNDCQNQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLI 164
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D R ++FY ++ IDAT KG+ +RF NHSC PN + KW V+ + R+G+FA R
Sbjct: 165 DYDQRHFKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQR 224
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
+ GE +T+DY ++G + KCYC +C G+LG K
Sbjct: 225 KLLKGEEITFDYNVDRYGAQAQKCYCEEPNCIGFLGGK 262
>gi|345486764|ref|XP_001606931.2| PREDICTED: histone-lysine N-methyltransferase ash1 [Nasonia
vitripennis]
Length = 1690
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 93/170 (54%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS + C C E C N+ +K ++ TE GWGV EPI GEFI+EY+GEV
Sbjct: 838 FSECSPQLCPCGERCKNQKIQKHDWAPGLQRFMTESKGWGVRTHEPIRTGEFILEYVGEV 897
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + + R+ +Y + Y + ID G RF+NHSC+PNC ++KW V
Sbjct: 898 VSEREFKTRMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVH 956
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G R+ +FA R I AGE LTYDY F F P +C CG+ C+G +G K
Sbjct: 957 GLPRMALFALRDITAGEELTYDYNFALFNPSEGQECRCGSEGCRGVIGGK 1006
>gi|302801428|ref|XP_002982470.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
gi|300149569|gb|EFJ16223.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
Length = 1285
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 66/166 (39%), Positives = 98/166 (59%), Gaps = 5/166 (3%)
Query: 254 ISCS-KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS C C E C+N+ F R+ + +V+ G+G++A E I KG F+IEY+GEV+D
Sbjct: 551 VECSPDTCPCGERCSNQQFGKREYSNVALVRCGKKGFGLKALENIAKGSFVIEYVGEVLD 610
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E R + + ++FY + IDA KGN RF+NHSC+PNC EKW V GE
Sbjct: 611 SRSFELRQKEYARQRQKHFYFMTLNSSEVIDACRKGNLGRFINHSCEPNCQTEKWCVNGE 670
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQF--GPEVKCYCGASSCQGYLG 414
+G+FA R + E +T++Y F + KC+CG++ C+GY+G
Sbjct: 671 ICIGLFAIRDVAKNEEITFNYNFERLYGAAAKKCHCGSAHCRGYIG 716
>gi|393219563|gb|EJD05050.1| hypothetical protein FOMMEDRAFT_105290 [Fomitiporia mediterranea
MF3/22]
Length = 872
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 95/160 (59%), Gaps = 3/160 (1%)
Query: 260 CHCSETC-NNRPFRKE-KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N R RKE ++IV TE G+G+ A I K FI EYIG+V+ +R
Sbjct: 157 CRCRGYCRNQRLQRKEYANVEIVLTEKKGFGLRAGSDITKDAFIYEYIGDVVSQPSFAKR 216
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY ++KD IDAT +G RF NHSC+PNC + KW V R+G++A
Sbjct: 217 MREYAEEGIRHFYFMMLQKDEFIDATKRGGKGRFANHSCNPNCYVAKWTVGDRVRMGIYA 276
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+IK E LT++Y ++G + + CYCG +C G LG K
Sbjct: 277 KRNIKKDEELTFNYNVDRYGHDAQPCYCGEPNCVGVLGGK 316
>gi|313221636|emb|CBY36121.1| unnamed protein product [Oikopleura dioica]
Length = 207
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 62/138 (44%), Positives = 88/138 (63%), Gaps = 2/138 (1%)
Query: 288 GVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGN 347
++A + I KG FIIEYIGE+I RL + GV N+Y+ E+ IDA +GN
Sbjct: 32 ALKAKKLIPKGSFIIEYIGEIISHDESRIRLEESAKIGVTNYYILELDNLRMIDAGPRGN 91
Query: 348 FSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCG 405
+RF+NHSCDPNC ++ W V+G+TR+G+F+ R I+ GE LT++Y+ Q E KC CG
Sbjct: 92 IARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQLQQSSDEGKTKCLCG 151
Query: 406 ASSCQGYLGTKRKIGKLE 423
+ +C G++G K K K E
Sbjct: 152 SKNCAGFIGDKVKNEKSE 169
>gi|443709486|gb|ELU04158.1| hypothetical protein CAPTEDRAFT_93578, partial [Capitella teleta]
Length = 1129
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 92/164 (56%), Gaps = 3/164 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + +T GWG++ I KG+F+ EY+G++ID C+
Sbjct: 867 QMCKAGEKCQNQRFMKREYPDQSAYRTGARGWGLKTNVDIKKGQFVNEYVGDLIDSEECQ 926
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+R+ + NFYM + IDA KGN SRF+NHSCDPN + +KW V G+ RVG+
Sbjct: 927 RRIKAAHDNNIVNFYMLTLDNRRIIDAGPKGNLSRFMNHSCDPNLVTQKWTVNGDVRVGL 986
Query: 376 FAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
FA I AG LT++Y G + C C +++C G+LG K
Sbjct: 987 FAVCDIPAGTELTFNYNLAAVGSTKTVCQCKSANCSGFLGIPPK 1030
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 28/64 (43%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFIC----------RQRLQWRCVRCT 179
CS GCG YH CV R + +F CP HAC C +Q RCVRC
Sbjct: 553 CSSHGCGKFYHMKCVSGIRSAKVDNGHFTCPLHACATCAVDDVYKPKPKQNRLLRCVRCP 612
Query: 180 IASH 183
A H
Sbjct: 613 TAYH 616
>gi|348530060|ref|XP_003452529.1| PREDICTED: hypothetical protein LOC100707110 [Oreochromis niloticus]
Length = 2876
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 107/188 (56%), Gaps = 8/188 (4%)
Query: 251 VQYISCSKA-CHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIG 306
+ + CS + C C + C+N+ ++ + ++ + +TE GWG+ E + G+FIIEY+G
Sbjct: 1989 MSFAECSPSTCPCGDQCDNQHIQRHEWVQCLERFRTEGKGWGIRTKESLRSGQFIIEYLG 2048
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ + R+ + +Y Y + ID+ GN +RF+NHSC+PNC ++KW
Sbjct: 2049 EVVSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWS 2107
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK-RKIGKLE 423
V G R+G+FA + I +G LTYDY F F E + C CG+ SC+G +G K ++I L
Sbjct: 2108 VNGVYRIGLFALKDISSGTELTYDYNFHSFNTEEQQVCKCGSESCRGIIGGKSQRINGLP 2167
Query: 424 LCWGSKRK 431
G R+
Sbjct: 2168 GKSGGTRR 2175
>gi|195574451|ref|XP_002105202.1| GD18047 [Drosophila simulans]
gi|194201129|gb|EDX14705.1| GD18047 [Drosophila simulans]
Length = 567
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 69/181 (38%), Positives = 101/181 (55%), Gaps = 4/181 (2%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C NR F RK ++++V G+G+ EPI G+F+IEY+GEVI+ A ++R
Sbjct: 357 CKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAAGDFVIEYVGEVINHAEFQRR 416
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +N+Y + KDF IDA KGN +RF+NHSC+PNC +KW V RVG+FA
Sbjct: 417 MEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFA 476
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCWGSKRKRSST 435
+ I LT++Y + K C+CGA C G +G K K ++ K+ R +
Sbjct: 477 IKDIPVNTELTFNYLWDDLMNNSKKACFCGAKRCSGEIGGKLKDDAVKAHAKLKQMRRAK 536
Query: 436 A 436
A
Sbjct: 537 A 537
>gi|397521375|ref|XP_003830772.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Pan
paniscus]
Length = 1426
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 97/171 (56%), Gaps = 14/171 (8%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ R +N D IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRI----KRAHEN-------SDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1227
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1228 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1278
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|332825882|ref|XP_003311721.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Pan troglodytes]
Length = 1426
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 97/171 (56%), Gaps = 14/171 (8%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ R +N D IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRI----KRAHEN-------SDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1227
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1228 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1278
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|51849607|dbj|BAD42330.1| hypothetical protein [Nannochloris bacillaris]
Length = 334
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 99/165 (60%), Gaps = 3/165 (1%)
Query: 257 SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
+K C C E C NR F K K++I + G+G+ AAE + G+FI+EY+GEV+++
Sbjct: 115 AKYCPCGERCTNRGFSKRAYAKLEIRRAGAKGFGLFAAEDVKAGQFIVEYVGEVLEEEEY 174
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
+R G +++Y + IDA +G RF+NHSC+PNC +KW V GE +G
Sbjct: 175 ARRKEFYIATGQRHYYFMNVGNGEVIDAARRGGLGRFINHSCEPNCETQKWVVRGELAIG 234
Query: 375 VFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
+FA + AG LT+DY F ++G + +KC CG+ +C+G +G ++
Sbjct: 235 LFALEDVPAGSVLTFDYNFERYGDKPMKCLCGSKACRGVIGGSQE 279
>gi|219841942|gb|AAI43511.1| WHSC1L1 protein [Homo sapiens]
Length = 1426
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 97/171 (56%), Gaps = 14/171 (8%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1119 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ R +N D IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1179 IDEEECRLRI----KRAHEN-------SDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1227
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1228 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1278
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|296222019|ref|XP_002757007.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Callithrix jacchus]
Length = 1427
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 97/171 (56%), Gaps = 14/171 (8%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 1120 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 1179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ R +N D IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 1180 IDEEECRLRI----KRAHEN-------SDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1228
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 1229 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 1279
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICRQRLQ---------WRCVRCTI 180
CSV CG YH CV++ I + F+CPQH C C RC+RC +
Sbjct: 765 CSVGACGKFYHEACVRKFPTAIFESKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPV 824
Query: 181 ASH 183
A H
Sbjct: 825 AYH 827
>gi|359493199|ref|XP_003634540.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 515
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 70/162 (43%), Positives = 92/162 (56%), Gaps = 3/162 (1%)
Query: 260 CHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F+K K K+ +TE GWG+ A E I G F+IEY GEVI R
Sbjct: 80 CRCGLFCKNQRFQKCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGR 139
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
G+++ ++ + IDAT KG+ RF+NHSC PNC KW V GE RVG+FA
Sbjct: 140 SQVYASLGLKDAFIISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFA 199
Query: 378 ARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGTKRK 418
+ I G L Y+Y F + G +V+C CGA SC G+LG K +
Sbjct: 200 KQDISIGTELAYNYNFEWYGGAKVRCLCGAISCSGFLGAKSR 241
>gi|348683753|gb|EGZ23568.1| hypothetical protein PHYSODRAFT_487680 [Phytophthora sojae]
Length = 868
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 67/181 (37%), Positives = 103/181 (56%), Gaps = 10/181 (5%)
Query: 247 GPPPVQYISCSKA-----CHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGE 299
GP P + +K C E C NR ++ + + T GW + EP+ G+
Sbjct: 505 GPAPTPGMKFNKQNRESNCMLGENCGNRALHQKIYPRFEKFHTVEKGWALRVLEPVKAGQ 564
Query: 300 FIIEYIGEVIDDALCEQRLWDMKYRGVQ--NFYMCEIRKDFTIDATFKGNFSRFLNHSCD 357
+IEY+GEVI++ E+RL D + N Y+ E+ K IDA FKG+ SRF+NHSCD
Sbjct: 565 LVIEYVGEVINEEEKERRLLDHAKNSPEDKNMYIMELGKGEYIDARFKGSVSRFINHSCD 624
Query: 358 PNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRF-VQFGPEVKCYCGASSCQGYLGTK 416
PNC L KW+V+G R+ + A + I+ G L+YDY+F + E KC+C +++C+G + +
Sbjct: 625 PNCHLLKWRVKGVNRIAITALKDIEPGTELSYDYQFHTKQAMEWKCHCKSANCRGTMAPE 684
Query: 417 R 417
+
Sbjct: 685 K 685
>gi|345566121|gb|EGX49068.1| hypothetical protein AOL_s00079g289 [Arthrobotrys oligospora ATCC
24927]
Length = 980
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 98/160 (61%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
+C C N+ F+K + + +++TE G+G+ A + G FI EY+GEVI ++ +R
Sbjct: 199 SNCGGQCQNQRFQKREYANVSVIQTEMKGYGLRANTSMEPGTFIYEYVGEVIGESQFRKR 258
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+++FY I+ IDAT +G +RF NHSC+PN ++EKW V G+ R+G+FA
Sbjct: 259 RELYGKEDIKHFYFMSIKVGEYIDATKRGCLARFCNHSCNPNSMVEKWVVGGKLRMGIFA 318
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTK 416
I+AGE LT+DY ++G + KC+CG +C GY+G K
Sbjct: 319 KVKIEAGEELTFDYNVDRYGADPQKCFCGEPNCIGYIGGK 358
>gi|50285531|ref|XP_445194.1| hypothetical protein [Candida glabrata CBS 138]
gi|74638008|sp|Q6FX50.1|SET2_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|49524498|emb|CAG58098.1| unnamed protein product [Candida glabrata]
Length = 716
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 93/158 (58%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K++ I + KT+ G+GV A I +FI EY GEVI++ RL
Sbjct: 123 CGNDCANQRFQKKEYANIAVFKTKMKGYGVRAESDIEINDFIYEYKGEVIEEEEFRDRLV 182
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
D + ++FY ++ IDAT KG+ +RF NHSC+PN + KW V G+ R+G+FA R
Sbjct: 183 DYDQKKFRHFYFMMLQSGEFIDATIKGSLARFCNHSCNPNAYVNKWVVAGKLRMGIFAKR 242
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G KCYC +C G+LG K
Sbjct: 243 KILKGEEITFDYNVDRYGAAAQKCYCEEPNCIGFLGGK 280
>gi|195037347|ref|XP_001990122.1| GH19166 [Drosophila grimshawi]
gi|193894318|gb|EDV93184.1| GH19166 [Drosophila grimshawi]
Length = 1434
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 100/173 (57%), Gaps = 5/173 (2%)
Query: 251 VQYISCS-KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ Y C+ K C+ + C N+ F RK ++++V G+G+ E I +G+FIIEY+GE
Sbjct: 1207 ILYHECNPKYCNAGKKCENQLFELRKSPRLEVVYMNERGFGLVCREAIAEGDFIIEYVGE 1266
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VI+ A ++R+ +NFY + KDF IDA KGN +RF+NHSC+PNC +KW V
Sbjct: 1267 VINHAEFQRRVAQKTNDRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTV 1326
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
RVG+FA + I LT++Y + K C+CGA C G +G K K
Sbjct: 1327 NCINRVGLFAIKDIPENTELTFNYLWDDLMNNGKKACFCGAKRCSGEIGGKLK 1379
>gi|392579013|gb|EIW72140.1| hypothetical protein TREMEDRAFT_25563 [Tremella mesenterica DSM
1558]
Length = 869
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 94/160 (58%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C E C N+ F R+ I +V TE G+G+ AA+ I +G I EYIGEVI + +R
Sbjct: 181 CRAREQCQNQRFGKRQYAPIDVVLTEMKGFGLRAADDIPEGSLIYEYIGEVIKETTFRKR 240
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G+++FY ++K+ IDAT KG RF NHSC+PN ++KW V R+G+FA
Sbjct: 241 MAQYADEGIKHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNSEVQKWVVGRRLRMGIFA 300
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R I E +T++Y ++G + + CYCG +C G +G K
Sbjct: 301 KRDIIKDEEITFNYNVDRYGHDAQPCYCGEPNCVGTIGGK 340
>gi|406606267|emb|CCH42258.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 1074
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 89/139 (64%), Gaps = 3/139 (2%)
Query: 278 KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKD 337
++++T+ G G+ + N G ++EY GEVI E RL + Y+ ++Y + ++
Sbjct: 249 EVLQTDKKGCGLLSIRSFNAGSLVVEYTGEVIHLDEVEHRLNTI-YKESDSYYFLGLEEE 307
Query: 338 FTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQF- 396
+ IDA KG+ +RF NHSCDPN ++KW V GE R+G+FA RSI+AGE +TYDY F F
Sbjct: 308 YVIDAGQKGSVARFANHSCDPNAEMQKWYVNGEPRIGLFAKRSIEAGEEITYDYNFEWFE 367
Query: 397 -GPEVKCYCGASSCQGYLG 414
G KCYCG+ +C G++G
Sbjct: 368 NGEPQKCYCGSKNCHGFIG 386
>gi|441621285|ref|XP_003269632.2| PREDICTED: histone-lysine N-methyltransferase NSD3 [Nomascus
leucogenys]
Length = 505
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 97/171 (56%), Gaps = 14/171 (8%)
Query: 251 VQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+QY + C + C N+ F K +I+KTE GWG+ I KGEF+ EY+GE+
Sbjct: 267 LQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGEL 326
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
ID+ C R+ R +N D IDA KGN+SRF+NHSC+PNC +KW V
Sbjct: 327 IDEEECRLRI----KRAHEN-------SDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 375
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
G+ RVG+FA I AG LT++Y G +C+CGA +C G+LG + K
Sbjct: 376 GDVRVGLFALCDIPAGMELTFNYNLDCLGNGRTECHCGADNCSGFLGVRPK 426
>gi|301629157|ref|XP_002943714.1| PREDICTED: hypothetical protein LOC100496979 [Xenopus (Silurana)
tropicalis]
Length = 1666
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 96/172 (55%), Gaps = 7/172 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E CNN+ ++ + ++ + + E GWG+ EP+ +FIIEY+GEV
Sbjct: 25 FAECSPNTCPCGEQCNNQCIQRHEWVQCLERFRAEGKGWGIRTKEPLKASQFIIEYLGEV 84
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 85 VSETEFRNRTIE-QYHNHSDHYCLSLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 143
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
G R+G++A + + AG LTYDY F F E + C CG C+G +G K +
Sbjct: 144 GVYRIGLYALKDMPAGTELTYDYNFHSFNTEKQQVCKCGVEKCRGIIGGKSQ 195
>gi|451997634|gb|EMD90099.1| hypothetical protein COCHEDRAFT_1225655 [Cochliobolus
heterostrophus C5]
Length = 787
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/187 (40%), Positives = 104/187 (55%), Gaps = 21/187 (11%)
Query: 264 ETCNNRPFRKEKK----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
E CNNRPF + KK ++++ TE G+GV A + I+EY GE+I +
Sbjct: 406 EQCNNRPFAELKKRAKGNRYDYGVEVLNTEDRGFGVRAMRTFEPHQIIVEYAGEIITQSE 465
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE+R+ + Y+ + +Y+ IDAT +G +RF+NHSC+PNC + KW V GE R+
Sbjct: 466 CERRMKQV-YKKDKCYYLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRM 523
Query: 374 GVFA-ARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRKIGKLE------L 424
+FA +R I GE LTYDY F F + +C CG SC+G LG K K E L
Sbjct: 524 ALFAGSRGIMTGEELTYDYNFDPFSQKNIQECRCGTESCRGVLGPKPKKPVEEKSLTSAL 583
Query: 425 CWGSKRK 431
G+KRK
Sbjct: 584 IAGTKRK 590
>gi|268566865|ref|XP_002639833.1| C. briggsae CBR-MET-1 protein [Caenorhabditis briggsae]
Length = 2074
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 103/167 (61%), Gaps = 16/167 (9%)
Query: 262 CSETCNNRPFRKEKKIKIVKTEFCGW----GVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C+ C N+ F K KKI V+ G G+ A + I KG FIIEYIGEV++ E+R
Sbjct: 714 CTAKCKNQRFAK-KKIASVEAYHTGTAKGCGLRALKDIKKGRFIIEYIGEVVERDDYEKR 772
Query: 318 LWDMKY---RGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE---- 370
+KY + ++ Y+C+ +TIDAT GN SRF+NHSCDPN + EKW V
Sbjct: 773 --KLKYAADKKHKHHYLCDT-GIYTIDATVYGNASRFVNHSCDPNAVCEKWSVPKTPGDI 829
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
+R+G FA + IK+GE +T+DY+FV +G + +C CGA SC G++G K
Sbjct: 830 SRIGFFAKKFIKSGEEITFDYQFVNYGRDAQQCLCGAPSCTGWIGEK 876
>gi|330792328|ref|XP_003284241.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
gi|325085814|gb|EGC39214.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
Length = 151
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 88/134 (65%), Gaps = 3/134 (2%)
Query: 286 GWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFK 345
GWG+ + E INKG+F++EY GEVI C R+ + + FY + +DA+ +
Sbjct: 2 GWGLISCENINKGDFVMEYCGEVISKTTCLNRMQENENEKF--FYFLTLNSKECLDASRR 59
Query: 346 GNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFG-PEVKCYC 404
GN +RF+NHSCDPNC +KW V GE ++G+F+ + I+ G LT+DY + +FG + +CYC
Sbjct: 60 GNLARFINHSCDPNCETQKWIVGGEVKIGIFSIKPIEKGTELTFDYNYERFGASKQECYC 119
Query: 405 GASSCQGYLGTKRK 418
G+ +C+G+LG K K
Sbjct: 120 GSKNCRGFLGQKAK 133
>gi|431892339|gb|ELK02779.1| Putative histone-lysine N-methyltransferase ASH1L [Pteropus alecto]
Length = 1291
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 103/184 (55%), Gaps = 8/184 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 445 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 504
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 505 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 563
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK-RKIGKLELC 425
G R+G++A R + AG LTYDY F F E + C CG C+G +G K +++ L
Sbjct: 564 GVYRIGLYALRDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGKSQRVNGLASS 623
Query: 426 WGSK 429
GS+
Sbjct: 624 RGSQ 627
>gi|302798461|ref|XP_002980990.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
gi|300151044|gb|EFJ17691.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
Length = 242
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/166 (39%), Positives = 98/166 (59%), Gaps = 5/166 (3%)
Query: 254 ISCS-KACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
+ CS C C E C+N+ F R+ + +V+ G+G++A E I KG F+IEY+GEV+D
Sbjct: 48 VECSPDTCPCGERCSNQQFGKREYSNVALVRCGKKGFGLKALENIAKGSFVIEYVGEVLD 107
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E R + + ++FY + IDA KGN RF+NHSC+PNC EKW V GE
Sbjct: 108 SRSFELRQKEYARQRQKHFYFMTLNSSEVIDACRKGNLGRFINHSCEPNCQTEKWCVNGE 167
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQF--GPEVKCYCGASSCQGYLG 414
+G+FA R + E +T++Y F + KC+CG++ C+GY+G
Sbjct: 168 ICIGLFAIRDVAKNEEITFNYNFERLYGAAAKKCHCGSAHCRGYIG 213
>gi|296413387|ref|XP_002836395.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630213|emb|CAZ80586.1| unnamed protein product [Tuber melanosporum]
Length = 901
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/167 (40%), Positives = 98/167 (58%), Gaps = 16/167 (9%)
Query: 264 ETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NR F K+ +++VKT CG G+ A + I+EY GE+I
Sbjct: 426 ERCTNRAFADLKERVKSGTKFAEGVEVVKTANCGHGLRATRGFMPNQIIVEYTGEIITQE 485
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
E+R+ ++ Y+ +N+Y+ ++ +DAT +G+ +RF+NHSCDPNC +EKW VEG R
Sbjct: 486 ESERRMVEV-YKDNKNYYLMLFHQNMILDAT-RGSVARFVNHSCDPNCRMEKWLVEGRPR 543
Query: 373 VGVFAARS-IKAGEPLTYDYRFVQFG--PEVKCYCGASSCQGYLGTK 416
+ +FA I+AGE LTYDY F F + C+CGA +C+G LG K
Sbjct: 544 MALFAGDDGIEAGEELTYDYNFNWFTGVSQQTCHCGADNCRGALGKK 590
>gi|301115083|ref|XP_002999311.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
gi|262111405|gb|EEY69457.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
Length = 905
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/181 (37%), Positives = 101/181 (55%), Gaps = 10/181 (5%)
Query: 247 GPPPVQYISCSK-----ACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGE 299
GP P + +K C E C NR ++ + + T GW + EP G+
Sbjct: 499 GPAPTPGMKFNKHNRESNCMLGENCGNRALHQKVYPRFQKFHTVEKGWALRLLEPAKAGQ 558
Query: 300 FIIEYIGEVIDDALCEQRLWDMKYRGVQ--NFYMCEIRKDFTIDATFKGNFSRFLNHSCD 357
+IEY+GEVI++ E+RL D + N Y+ E+ K IDA FKG+ SRF+NHSCD
Sbjct: 559 LVIEYVGEVINEEEKERRLLDHAKNSPEDKNMYIMELGKGEYIDARFKGSVSRFINHSCD 618
Query: 358 PNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRF-VQFGPEVKCYCGASSCQGYLGTK 416
PNC L KW+V+G R+ + A + I+ G L+YDY+F + E KC+C A +C+G + +
Sbjct: 619 PNCHLLKWRVKGVNRIAITALKDIEPGTELSYDYQFHTKQAMEWKCHCKAKNCRGTMAPE 678
Query: 417 R 417
+
Sbjct: 679 K 679
>gi|281201147|gb|EFA75361.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 1458
Score = 126 bits (317), Expect = 2e-26, Method: Composition-based stats.
Identities = 57/141 (40%), Positives = 86/141 (60%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++KKIK ++ WG+ A EPI + +IEYIGEVI + ++R +G+ + Y+
Sbjct: 1317 RKKKIKFSRSAIHDWGLFALEPITAKDMVIEYIGEVIRQKVADEREKRYTKKGIGSSYLF 1376
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I D IDATFKGN +RF+NH CDPNCI + Q + ++ ++A R I GE +TYDY+
Sbjct: 1377 RIDDDTIIDATFKGNLARFINHCCDPNCIAKIIQTGNQKKIVIYAKRDIMIGEEITYDYK 1436
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C C ++ C+G L
Sbjct: 1437 FPIEDVKIPCLCKSAKCRGTL 1457
>gi|451852041|gb|EMD65336.1| hypothetical protein COCSADRAFT_35392 [Cochliobolus sativus ND90Pr]
Length = 787
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/187 (40%), Positives = 104/187 (55%), Gaps = 21/187 (11%)
Query: 264 ETCNNRPF---RKEKK-------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
E CNNRPF RK K ++++ TE G+GV A + I+EY GE+I +
Sbjct: 406 EQCNNRPFAELRKRAKGNRYDYGVEVLNTEDRGFGVRAMRTFEPHQIIVEYAGEIITQSE 465
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE+R+ + Y+ + +Y+ IDAT +G +RF+NHSC+PNC + KW V GE R+
Sbjct: 466 CERRMKQV-YKKDKCYYLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRM 523
Query: 374 GVFA-ARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRKIGKLE------L 424
+FA +R I GE LTYDY F F + +C CG SC+G LG K K E L
Sbjct: 524 ALFAGSRGIMTGEELTYDYNFDPFSQKNIQECRCGTESCRGVLGPKPKKPVEEKSLTSTL 583
Query: 425 CWGSKRK 431
G+KRK
Sbjct: 584 IAGTKRK 590
>gi|392587279|gb|EIW76613.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 793
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 97/160 (60%), Gaps = 3/160 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C C N+ F++++ I++V+TE G+G+ A I K FI EY+G+V+ +R
Sbjct: 105 CRCRNYCQNQRFQRKQYANIEVVQTEKKGFGLRAGSDIPKDAFIYEYVGDVVSHPSFLKR 164
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + G+++FY ++ + IDAT +G RF NHSC+PNC + KW V R+G+FA
Sbjct: 165 MREYAAEGIRHFYFMMLQSNEYIDATKRGGKGRFANHSCNPNCYVAKWTVGPHVRMGIFA 224
Query: 378 ARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R++K E LT++Y ++G + + CYCG C G++G K
Sbjct: 225 KRAVKRNEELTFNYNVDRYGHDPQPCYCGEPQCVGFIGGK 264
>gi|403214852|emb|CCK69352.1| hypothetical protein KNAG_0C02410 [Kazachstania naganishii CBS
8797]
Length = 708
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 96/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F++++ + + +T+ G+GV A E I++ +FI EY+GEVID+ R+
Sbjct: 96 CGKNCQNQRFQRKQYADVMVFQTKMKGYGVLAKEDIDQHQFIYEYMGEVIDEEEFRDRMA 155
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ ++FY ++ IDAT KG +RF NHSC PN + KW V G+ ++G+FA+R
Sbjct: 156 TYDEKKFKHFYFMMLQNGQFIDATMKGCLARFCNHSCSPNAYVNKWVVNGKLKMGIFASR 215
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I E +T+DY ++G KCYCG +C G+LG K
Sbjct: 216 KILKDEEITFDYNVDRYGATAQKCYCGEPNCIGFLGGK 253
>gi|350427843|ref|XP_003494899.1| PREDICTED: hypothetical protein LOC100746434 [Bombus impatiens]
Length = 2217
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 95/170 (55%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS + C C E C N+ +K + ++ TE GWGV + I G FI+EY+GEV
Sbjct: 1405 FSECSPQLCPCGEKCENQKIQKHEWAPGLQKFMTEDKGWGVRTQQSIKSGVFILEYVGEV 1464
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + + R+ +Y + Y + ID G RF+NHSC+PNC ++KW V
Sbjct: 1465 VSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVH 1523
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G R+ +FA+R IK GE LTYDY F F P +C CG+S+C+G +G K
Sbjct: 1524 GLPRMALFASRDIKPGEELTYDYNFALFNPSEGQECRCGSSACRGVIGGK 1573
>gi|340724009|ref|XP_003400378.1| PREDICTED: hypothetical protein LOC100647786 [Bombus terrestris]
Length = 2172
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 95/170 (55%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS + C C E C N+ +K + ++ TE GWGV + I G FI+EY+GEV
Sbjct: 1360 FSECSPQLCPCGEKCENQKIQKHEWAPGLQKFMTEDKGWGVRTQQSIKSGVFILEYVGEV 1419
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + + R+ +Y + Y + ID G RF+NHSC+PNC ++KW V
Sbjct: 1420 VSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVH 1478
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G R+ +FA+R IK GE LTYDY F F P +C CG+S+C+G +G K
Sbjct: 1479 GLPRMALFASRDIKPGEELTYDYNFALFNPSEGQECRCGSSACRGVIGGK 1528
>gi|345565974|gb|EGX48921.1| hypothetical protein AOL_s00079g142 [Arthrobotrys oligospora ATCC
24927]
Length = 1227
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/180 (38%), Positives = 97/180 (53%), Gaps = 16/180 (8%)
Query: 251 VQYISCSKACHCSETCNNRPFRK-----------EKKIKIVKTEFCGWGVEAAEPINKGE 299
V Y C K + C NRPFR + I+I+ T+ G+G+ A +
Sbjct: 321 VMYYECDKNNCSLKDCQNRPFRDLAIRVENDTWFDDGIEIILTQDKGYGLRACRSFGPNQ 380
Query: 300 FIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPN 359
I+EYIGE+I CE+RL Y+ +++Y+ E IDAT +G+ +RF+NHSC PN
Sbjct: 381 IIVEYIGEIITQDECEERLHG-PYKDNESYYLMEFDNSLIIDAT-RGSLARFVNHSCSPN 438
Query: 360 CILEKWQVEGETRVGVFAA-RSIKAGEPLTYDYRFVQF--GPEVKCYCGASSCQGYLGTK 416
C +EKW V G+ R+ +FA I+ GE LTYDY F F G C CG C+G +G +
Sbjct: 439 CKMEKWMVAGQPRMALFAGDEGIEVGEELTYDYNFSWFSGGSTQLCRCGTEQCRGLVGKR 498
>gi|410911836|ref|XP_003969396.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Takifugu
rubripes]
Length = 2782
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 99/172 (57%), Gaps = 7/172 (4%)
Query: 251 VQYISCSKA-CHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIG 306
+ + CS + C ++ C+N+ ++ ++ + +TE GWG+ EP+ G+FIIEY+G
Sbjct: 1951 MSFAECSPSTCPSADQCDNQHIQRHDWVQCLERFRTEGKGWGIRTKEPLRAGQFIIEYLG 2010
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ + R+ + + N Y + ID+ GN +RF+NHSC+PNC ++KW
Sbjct: 2011 EVVSEQEFRSRMMEQYFSHSGN-YCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWS 2069
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
V G R+G+FA I +G LTYDY F F E + C CG+ SC+G +G K
Sbjct: 2070 VNGVYRIGLFALGEIPSGTELTYDYNFHSFNTEEQQACMCGSESCRGIIGGK 2121
>gi|383854492|ref|XP_003702755.1| PREDICTED: uncharacterized protein LOC100877317 [Megachile rotundata]
Length = 2187
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS + C C + C N+ +K + ++ TE GWGV + I G FI+EY+GEV
Sbjct: 1375 FSECSPQLCPCGDKCENQKIQKHEWAPGLQKFMTEDKGWGVRTQQSIKSGVFILEYVGEV 1434
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + + R+ +Y + Y ++ ID G RF+NHSC+PNC ++KW V
Sbjct: 1435 VSEREFKSRMA-TRYANDTHHYCLQLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVH 1493
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G R+ +FA+R IK GE LTYDY F F P +C CG+S+C+G +G K
Sbjct: 1494 GLPRMALFASRDIKPGEELTYDYNFALFNPSEGQECRCGSSACRGVIGGK 1543
>gi|157818737|ref|NP_001101159.1| probable histone-lysine N-methyltransferase ASH1L [Rattus norvegicus]
gi|149048100|gb|EDM00676.1| ash1 (absent, small, or homeotic)-like (Drosophila) (predicted)
[Rattus norvegicus]
Length = 2918
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2069 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2128
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2129 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2187
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2188 GVYRIGLYALKDVPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2237
>gi|344241969|gb|EGV98072.1| putative histone-lysine N-methyltransferase ASH1L [Cricetulus
griseus]
Length = 1546
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 1210 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 1269
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 1270 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 1328
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 1329 GVYRIGLYALKDVLAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 1378
>gi|302846429|ref|XP_002954751.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
gi|300259934|gb|EFJ44157.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
Length = 261
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 78/229 (34%), Positives = 120/229 (52%), Gaps = 19/229 (8%)
Query: 207 RHPAKWLLDKQEVFCRLPLPYADEEFKIDLT-------WKDLMENKVGPPPVQYISCSKA 259
R PA W L K V DE+ D+ W+ + GP + + C +
Sbjct: 4 RIPAVWQLLKDNVLAHRKRKVQDED---DIMICHCKPPWRG--GDGCGPDCINRMLCIEC 58
Query: 260 ----CHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C E C N+ F K ++I + G+G+ A E I G+FIIEYIGEV+++
Sbjct: 59 VPGFCPSEEKCTNQMFSKRMYANLEIRRAGAKGFGLFALEDIKAGQFIIEYIGEVLEEDE 118
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
++R G +++Y I IDA KGN SRF+NHSC+PNC +KW V GE +
Sbjct: 119 YQRRKEYYMSVGQRHYYFMNIGNGEVIDACRKGNISRFINHSCEPNCETQKWLVRGELAI 178
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRKIGK 421
G+FA R I LT+DY F ++G + ++CYCG+++C+ ++G + + +
Sbjct: 179 GLFAVRDIPKDTELTFDYNFERYGDKPMRCYCGSTNCRKFIGGMQVVAR 227
>gi|19115892|ref|NP_594980.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe
972h-]
gi|74626626|sp|O14026.1|SET2_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=Lysine N-methyltransferase 3;
AltName: Full=SET domain-containing protein 2
gi|2408044|emb|CAB16247.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe]
Length = 798
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 95/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C +C N+ F++ + K+ + TE G+G+ A + K F+ EYIGEVI + +R+
Sbjct: 165 CGPSCQNQRFQRHEFAKVDVFLTEKKGFGLRADANLPKDTFVYEYIGEVIPEQKFRKRMR 224
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
G+++FY ++K IDAT +G+ +RF NHSC PNC ++KW V + R+G+F R
Sbjct: 225 QYDSEGIKHFYFMMLQKGEYIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKR 284
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE LT+DY ++G + + CYCG C GY+G K
Sbjct: 285 DIIRGEELTFDYNVDRYGAQAQPCYCGEPCCVGYIGGK 322
>gi|134113727|ref|XP_774448.1| hypothetical protein CNBG0940 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338819404|sp|P0CO29.1|SET2_CRYNB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|50257086|gb|EAL19801.1| hypothetical protein CNBG0940 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 834
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 101/176 (57%), Gaps = 6/176 (3%)
Query: 253 YISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
YI C + C + C+N+ F K + + +V TE G+G+ A+ I I EYIGEV+
Sbjct: 156 YIECIAGECRAGKHCHNQQFSKRQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVV 215
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
+ +R+ G+++FY ++K+ IDAT KG RF NHSC+PNC ++KW V
Sbjct: 216 AEKTFRKRMQQYADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGR 275
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKL 422
R+G+F R + GE +T++Y ++G + + CYCG +C G +G T+ IG +
Sbjct: 276 RLRMGIFTKRDVIKGEEITFNYNVDRYGHDAQTCYCGEPNCVGTIGGKTQTDIGTM 331
>gi|410986774|ref|XP_003999684.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2 [Felis
catus]
Length = 2974
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 102/183 (55%), Gaps = 9/183 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2124 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2183
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2184 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2242
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCW 426
G R+G++A + + AG LTYDY F F E + C CG C+G +G K + ++
Sbjct: 2243 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGKSQ--RMNGLT 2300
Query: 427 GSK 429
GSK
Sbjct: 2301 GSK 2303
>gi|410986772|ref|XP_003999683.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1 [Felis
catus]
Length = 2965
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 102/183 (55%), Gaps = 9/183 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2115 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2174
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2175 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2233
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIGKLELCW 426
G R+G++A + + AG LTYDY F F E + C CG C+G +G K + ++
Sbjct: 2234 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGKSQ--RMNGLT 2291
Query: 427 GSK 429
GSK
Sbjct: 2292 GSK 2294
>gi|355745722|gb|EHH50347.1| hypothetical protein EGM_01160 [Macaca fascicularis]
Length = 2904
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2115 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2174
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2175 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2233
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2234 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2283
>gi|73960946|ref|XP_537251.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
1 [Canis lupus familiaris]
Length = 2965
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2115 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2174
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2175 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2233
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2234 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2283
>gi|405121647|gb|AFR96415.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
grubii H99]
Length = 836
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 101/176 (57%), Gaps = 6/176 (3%)
Query: 253 YISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
YI C C + C+N+ F K++ + +V TE G+G+ A+ I I EYIGEV+
Sbjct: 156 YIECIGGECRAGKHCHNQQFSKKQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVV 215
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
+ +R+ G+++FY ++K+ IDAT KG RF NHSC+PNC ++KW V
Sbjct: 216 AEKTFRKRMQQYADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGR 275
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKL 422
R+G+F R + GE +T++Y ++G + + CYCG +C G +G T+ IG +
Sbjct: 276 RLRMGIFTKRDVIKGEEITFNYNVDRYGHDAQTCYCGEPNCVGTIGGKTQTDIGTM 331
>gi|58269858|ref|XP_572085.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
neoformans JEC21]
gi|338819405|sp|P0CO28.1|SET2_CRYNJ RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|57228321|gb|AAW44778.1| histone-lysine N-methyltransferase, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 834
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 101/176 (57%), Gaps = 6/176 (3%)
Query: 253 YISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
YI C + C + C+N+ F K + + +V TE G+G+ A+ I I EYIGEV+
Sbjct: 156 YIECIAGECRAGKHCHNQQFSKRQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVV 215
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
+ +R+ G+++FY ++K+ IDAT KG RF NHSC+PNC ++KW V
Sbjct: 216 AEKTFRKRMQQYADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGR 275
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG--TKRKIGKL 422
R+G+F R + GE +T++Y ++G + + CYCG +C G +G T+ IG +
Sbjct: 276 RLRMGIFTKRDVIKGEEITFNYNVDRYGHDAQTCYCGEPNCVGTIGGKTQTDIGTM 331
>gi|148683294|gb|EDL15241.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Mus musculus]
Length = 2918
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2069 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2128
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2129 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2187
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2188 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2237
>gi|351696657|gb|EHA99575.1| Putative histone-lysine N-methyltransferase ASH1L [Heterocephalus
glaber]
Length = 2930
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2080 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2139
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2140 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2198
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2199 GVYRIGLYALKDMTAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2248
>gi|198435574|ref|XP_002121834.1| PREDICTED: absent, small, or homeotic discs 1 homolog [Ciona
intestinalis]
Length = 2850
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 98/176 (55%), Gaps = 7/176 (3%)
Query: 251 VQYISCS-KACHCSETCNNRPFRKE---KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIG 306
+ YI CS C C + C NR +K+ K ++ +T GWGV I +G+F++EY+G
Sbjct: 2092 LMYIECSPDTCPCQDKCANRCIQKQQWWKDLERFRTNDRGWGVRTNSDIPEGQFLLEYVG 2151
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ + +R + Y + Y ++ ID N RF+NHSC PNC ++KW
Sbjct: 2152 EVVSEREFRRRTIE-NYNAHNDHYCVQLEAGTVIDGYRLANEGRFVNHSCQPNCEMQKWV 2210
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIG 420
V GE RVG+FA R I + E LTYDY F + + + C CG+S C+G +G K + G
Sbjct: 2211 VNGEYRVGLFAKRPIVSSEELTYDYNFHAYNLDRQQPCRCGSSECRGVIGGKTQRG 2266
>gi|193202392|ref|NP_871842.2| Protein MET-1, isoform b [Caenorhabditis elegans]
gi|25395254|pir||B87754 protein C43E11.3 [imported] - Caenorhabditis elegans
gi|351050565|emb|CCD65167.1| Protein MET-1, isoform b [Caenorhabditis elegans]
Length = 1590
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 76/168 (45%), Positives = 103/168 (61%), Gaps = 18/168 (10%)
Query: 262 CSETCNNRPFRKEKKIKIVKTEFCGW----GVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K KK V+ G G+ A + I KG FIIEYIGEV++ E+R
Sbjct: 654 CQVKCKNQRFAK-KKYAAVEAFHTGTAKGCGLRAVKDIKKGRFIIEYIGEVVERDDYEKR 712
Query: 318 ----LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV---EGE 370
D K++ + Y+C+ +TIDAT GN SRF+NHSCDPN I EKW V G+
Sbjct: 713 KTKYAADKKHK---HHYLCDTGV-YTIDATVYGNPSRFVNHSCDPNAICEKWSVPRTPGD 768
Query: 371 -TRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
RVG F+ R IKAGE +T+DY+FV +G + +C+CG++SC G++G K
Sbjct: 769 VNRVGFFSKRFIKAGEEITFDYQFVNYGRDAQQCFCGSASCSGWIGQK 816
>gi|73622271|ref|NP_619620.3| histone-lysine N-methyltransferase ASH1L [Mus musculus]
gi|341940590|sp|Q99MY8.3|ASH1L_MOUSE RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
Full=ASH1-like protein; AltName: Full=Absent small and
homeotic disks protein 1 homolog
Length = 2958
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2109 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2168
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2169 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2227
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2228 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2277
>gi|119573453|gb|EAW53068.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Homo sapiens]
Length = 2969
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2119 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2179 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2237
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2238 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2287
>gi|117949323|sp|Q9NR48.2|ASH1L_HUMAN RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
Full=ASH1-like protein; Short=huASH1; AltName:
Full=Absent small and homeotic disks protein 1 homolog;
AltName: Full=Lysine N-methyltransferase 2H
Length = 2969
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2119 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2179 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2237
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2238 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2287
>gi|13442965|gb|AAK26242.1|AF247132_1 putative chromatin remodeling factor [Mus musculus]
Length = 2669
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 1820 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 1879
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 1880 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 1938
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 1939 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 1988
>gi|350583322|ref|XP_003125756.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like,
partial [Sus scrofa]
Length = 2824
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 1968 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2027
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2028 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2086
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2087 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2136
>gi|397492363|ref|XP_003817092.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L [Pan paniscus]
Length = 2964
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2114 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2173
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2174 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2232
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2233 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2282
>gi|354478852|ref|XP_003501628.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase ASH1L-like [Cricetulus griseus]
Length = 2962
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2110 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2169
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2170 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2228
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2229 GVYRIGLYALKDVLAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2278
>gi|355558542|gb|EHH15322.1| hypothetical protein EGK_01394 [Macaca mulatta]
Length = 2796
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2039 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2098
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2099 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2157
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2158 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2207
>gi|110349788|ref|NP_060959.2| histone-lysine N-methyltransferase ASH1L [Homo sapiens]
gi|225000936|gb|AAI72595.1| Ash1 (absent, small, or homeotic)-like (Drosophila) [synthetic
construct]
Length = 2964
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2114 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2173
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2174 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2232
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2233 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2282
>gi|301785832|ref|XP_002928328.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Ailuropoda melanoleuca]
Length = 2965
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2115 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2174
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2175 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2233
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2234 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2283
>gi|417515828|gb|JAA53722.1| histone-lysine N-methyltransferase ASH1L [Sus scrofa]
Length = 2951
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2101 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2160
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2161 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2219
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2220 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2269
>gi|338724967|ref|XP_001499134.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
1 [Equus caballus]
Length = 2963
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2113 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2172
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2173 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2231
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2232 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2281
>gi|410226116|gb|JAA10277.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410264036|gb|JAA19984.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410264040|gb|JAA19986.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410306368|gb|JAA31784.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410355463|gb|JAA44335.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
Length = 2964
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2114 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2173
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2174 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2232
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2233 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2282
>gi|403293713|ref|XP_003937857.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Saimiri
boliviensis boliviensis]
Length = 2970
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2120 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2180 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2239 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2288
>gi|380814664|gb|AFE79206.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
gi|383419979|gb|AFH33203.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
Length = 2963
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2113 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2172
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2173 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2231
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2232 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2281
>gi|281338719|gb|EFB14303.1| hypothetical protein PANDA_018255 [Ailuropoda melanoleuca]
Length = 2981
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2131 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2190
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2191 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2249
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2250 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2299
>gi|410033849|ref|XP_003949641.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Pan troglodytes]
Length = 2964
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2114 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2173
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2174 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2232
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2233 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2282
>gi|344286471|ref|XP_003414981.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
[Loxodonta africana]
Length = 2917
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2067 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2126
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2127 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2185
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2186 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2235
>gi|300795068|ref|NP_001179672.1| probable histone-lysine N-methyltransferase ASH1L [Bos taurus]
gi|296489728|tpg|DAA31841.1| TPA: ash1 (absent, small, or homeotic)-like [Bos taurus]
Length = 2965
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2115 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2174
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2175 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2233
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2234 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2283
>gi|330920531|ref|XP_003299046.1| hypothetical protein PTT_09957 [Pyrenophora teres f. teres 0-1]
gi|311327461|gb|EFQ92879.1| hypothetical protein PTT_09957 [Pyrenophora teres f. teres 0-1]
Length = 791
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 96/168 (57%), Gaps = 15/168 (8%)
Query: 264 ETCNNRPFRKEKK----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
E CNNRPF + K+ ++++ TE G+GV A + I+EY GE+I +
Sbjct: 404 EQCNNRPFAELKRRAKGNRYDYGVEVLDTEDRGYGVRAMRTFEPHQIIVEYAGEIITQSE 463
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE+R+ + Y+ + +Y+ IDAT +G +RF+NHSC+PNC + KW V GE R+
Sbjct: 464 CERRMKQV-YKKDKCYYLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRM 521
Query: 374 GVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
+FA R I GE LTYDY F F + +C CG SC+G LG K K
Sbjct: 522 ALFAGPRGIMTGEELTYDYNFDPFSQKNIQECRCGTESCRGVLGPKPK 569
>gi|25141373|ref|NP_491340.2| Protein MET-1, isoform a [Caenorhabditis elegans]
gi|351050564|emb|CCD65166.1| Protein MET-1, isoform a [Caenorhabditis elegans]
Length = 1604
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 76/168 (45%), Positives = 103/168 (61%), Gaps = 18/168 (10%)
Query: 262 CSETCNNRPFRKEKKIKIVKTEFCGW----GVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K KK V+ G G+ A + I KG FIIEYIGEV++ E+R
Sbjct: 668 CQVKCKNQRFAK-KKYAAVEAFHTGTAKGCGLRAVKDIKKGRFIIEYIGEVVERDDYEKR 726
Query: 318 ----LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV---EGE 370
D K++ + Y+C+ +TIDAT GN SRF+NHSCDPN I EKW V G+
Sbjct: 727 KTKYAADKKHK---HHYLCDTGV-YTIDATVYGNPSRFVNHSCDPNAICEKWSVPRTPGD 782
Query: 371 -TRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
RVG F+ R IKAGE +T+DY+FV +G + +C+CG++SC G++G K
Sbjct: 783 VNRVGFFSKRFIKAGEEITFDYQFVNYGRDAQQCFCGSASCSGWIGQK 830
>gi|395845197|ref|XP_003795328.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Otolemur
garnettii]
Length = 2961
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2111 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2170
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2171 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2229
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2230 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2279
>gi|332219957|ref|XP_003259124.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L [Nomascus leucogenys]
Length = 2892
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2114 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2173
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2174 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2232
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2233 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2282
>gi|440903623|gb|ELR54260.1| Putative histone-lysine N-methyltransferase ASH1L [Bos grunniens
mutus]
Length = 2965
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2115 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2174
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2175 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2233
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2234 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2283
>gi|426216789|ref|XP_004002640.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Ovis aries]
Length = 2965
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2115 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2174
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2175 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2233
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2234 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2283
>gi|390476801|ref|XP_002760038.2| PREDICTED: histone-lysine N-methyltransferase ASH1L [Callithrix
jacchus]
Length = 2970
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2120 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2180 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2239 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2288
>gi|348579791|ref|XP_003475662.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase ASH1L-like [Cavia porcellus]
Length = 2964
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2114 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2173
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2174 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2232
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2233 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2282
>gi|7739725|gb|AAF68983.1|AF257305_1 ASH1 [Homo sapiens]
Length = 2969
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2119 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2179 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2237
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2238 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2287
>gi|258573585|ref|XP_002540974.1| predicted protein [Uncinocarpus reesii 1704]
gi|237901240|gb|EEP75641.1| predicted protein [Uncinocarpus reesii 1704]
Length = 727
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 108/194 (55%), Gaps = 25/194 (12%)
Query: 264 ETCNNRPF---RKEKK--------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NRPF R+ K ++++KTE G+GV + N + I+EY GE++
Sbjct: 394 ELCQNRPFSELRRRSKAGGKFNIGVEVIKTEDRGYGVRSNRAFNPNQIIVEYTGEILTQE 453
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+R+ + Y+ +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V G+ R
Sbjct: 454 ECERRMRTV-YKKNDCYYLMYFDQNMVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKPR 511
Query: 373 VGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK---------IG 420
+ +FA I GE LTYDY F + + +C CGA +C+G LG ++K G
Sbjct: 512 MALFAGEDGIMTGEELTYDYNFDPYSQKNVQECRCGAPTCRGVLGPRQKEAFKNKEKDKG 571
Query: 421 KLELCWGSKRKRSS 434
GSKRK S+
Sbjct: 572 SKTRTTGSKRKSSN 585
>gi|322798377|gb|EFZ20101.1| hypothetical protein SINV_07690 [Solenopsis invicta]
Length = 2203
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS + C CS+ C N+ +K + ++ TE GWGV + I G+FI+EY+GEV
Sbjct: 1385 FSECSPQLCPCSDKCENQKIQKHEWSPGLQRFMTEDKGWGVRTQQAIKSGDFILEYVGEV 1444
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + + R+ +Y + Y + ID G RF+NHSC+PNC ++KW V
Sbjct: 1445 VSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVL 1503
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G R+ +FA+R IK GE LTYDY F F P +C CG++ C+G +G K
Sbjct: 1504 GLPRMALFASRDIKPGEELTYDYNFALFNPSEGQQCRCGSNVCRGVIGGK 1553
>gi|321260843|ref|XP_003195141.1| histone-lysine N-methyltransferase [Cryptococcus gattii WM276]
gi|317461614|gb|ADV23354.1| Histone-lysine N-methyltransferase, putative [Cryptococcus gattii
WM276]
Length = 833
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 98/168 (58%), Gaps = 4/168 (2%)
Query: 253 YISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
YI C + C + C+N+ F K++ + +V TE G+G+ A+ I I EYIGEV+
Sbjct: 156 YIECIAGECRAGKHCHNQQFSKKQYANVDVVLTEKKGYGLRASSIIPANTLIYEYIGEVV 215
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
+ +R+ G+++FY ++K+ IDAT KG RF NHSC+PNC ++KW V
Sbjct: 216 AEKTFRKRMQQYADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGR 275
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
R+G+F R + GE +T++Y ++G + + CYCG +C G +G K
Sbjct: 276 RLRMGIFTKRDVVKGEEITFNYNVDRYGHDAQTCYCGEPNCVGTIGGK 323
>gi|430813239|emb|CCJ29409.1| unnamed protein product [Pneumocystis jirovecii]
Length = 375
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 91/142 (64%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++K K+ WG+ A E I+ G+ +IEY+GE++ + + R + +G+ + Y+
Sbjct: 234 RKKQLKFSKSPIHNWGLYAMEHIDMGDMVIEYVGEIVRQTVADIRERQYERQGIGSSYLF 293
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I D +DAT KGN +RF+NHSCDP+C + +VEGE ++ ++A R I+ GE +TYDY+
Sbjct: 294 RIDDDTVVDATKKGNIARFINHSCDPSCTAKIIRVEGEKKIVIYAHRDIEKGEEITYDYK 353
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F ++ C CGA +C+G L
Sbjct: 354 FPIEDVKIPCLCGAKACRGTLN 375
>gi|74140676|dbj|BAC28183.2| unnamed protein product [Mus musculus]
Length = 418
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 62 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 121
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 122 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 180
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 181 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 230
>gi|336372757|gb|EGO01096.1| hypothetical protein SERLA73DRAFT_50848 [Serpula lacrymans var.
lacrymans S7.3]
Length = 260
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 91/146 (62%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K ++ ++ WG+ A E I++GE +IEY+GEVI + ++R + +G+
Sbjct: 114 NQLQTRKKHLRFARSPIHDWGLYAMERISRGEMVIEYVGEVIRAQVADKREKVYERQGIG 173
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I +D +DAT KGN R +NHSCDPNC + + GE ++ ++A + I+ GE +
Sbjct: 174 SSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGEEI 233
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYL 413
TYDY F ++ C CG++ C+GYL
Sbjct: 234 TYDYHFPIEQDKIPCLCGSAKCRGYL 259
>gi|298706866|emb|CBJ25830.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 810
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 92/164 (56%), Gaps = 5/164 (3%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C NR R K+ + GWG++ A + KG I EY+GEVID+A+ E R
Sbjct: 521 CKLGAGCGNRRLQNRVNAKVAPFREAGMGWGLKVAVDVPKGSLIGEYVGEVIDEAMVEHR 580
Query: 318 LWDMKYRGVQN--FYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
+ + + + FY+ E+ + IDA KGN R +NHSC+PNC ++ W + G TR+G+
Sbjct: 581 MAEQRRLRPNDGEFYIMELGQSLFIDAKEKGNLMRLINHSCNPNCDVQAWNIAGYTRLGI 640
Query: 376 FAARSIKAGEPLTYDYRF-VQFGPEVKCYCGASSCQGYLGTKRK 418
+A + + GE L+YDY+F KC CGA +C+G L K +
Sbjct: 641 YAKKDLAKGESLSYDYKFSTNEKARFKCMCGAENCRGTLAPKEQ 684
>gi|307167950|gb|EFN61316.1| Probable histone-lysine N-methyltransferase ASH1L [Camponotus
floridanus]
Length = 2136
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 63/162 (38%), Positives = 91/162 (56%), Gaps = 6/162 (3%)
Query: 260 CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C E C N+ +K + ++ TE GWGV + I G+FI+EY+GEV+ + +
Sbjct: 1335 CPSGEKCENQKIQKHEWSPGLQKFMTEDKGWGVRTQQSIKSGDFILEYVGEVVSEREFKS 1394
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ +Y + Y + ID G RF+NHSC+PNC ++KW V G R+ +F
Sbjct: 1395 RMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVLGLPRMALF 1453
Query: 377 AARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
A+R IK GE LTYDY F F P +C CG+S+C+G +G K
Sbjct: 1454 ASRDIKPGEELTYDYNFALFNPSEGQQCRCGSSACRGVIGGK 1495
>gi|74210080|dbj|BAE21322.1| unnamed protein product [Mus musculus]
Length = 484
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 70/170 (41%), Positives = 95/170 (55%), Gaps = 14/170 (8%)
Query: 252 QYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
QY + C + C N+ F K +++KTE GWG+ I KGEF+ EY+GE+I
Sbjct: 179 QYECHPQVCPAGDRCQNQCFTKRLYPNAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELI 238
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
D+ C R+ R +N D IDA KGN+SRF+NHSC+PNC +KW V G
Sbjct: 239 DEEECRLRI----KRAHEN-------SDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNG 287
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
+ RVG+FA I AG LT++Y G C+CGA +C G+LG + K
Sbjct: 288 DVRVGLFALCDIPAGMELTFNYNLDCLGNGRTVCHCGADNCSGFLGVRPK 337
>gi|242011020|ref|XP_002426255.1| protein MLP1, putative [Pediculus humanus corporis]
gi|212510318|gb|EEB13517.1| protein MLP1, putative [Pediculus humanus corporis]
Length = 2688
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS + C C E C+N+ +K + ++ T+ GWG+ +PI G+FI+EY+GEV
Sbjct: 1920 FAECSPQLCPCKEKCSNQRIQKHEFAPGLQKFMTKNKGWGIRTKQPIKAGDFILEYVGEV 1979
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ D + R+ + Y ++ Y + ID G RF+NHSC+PNC ++KW V
Sbjct: 1980 VSDKEFKDRMASI-YVNDKHHYCLHLDGGLVIDGHRMGGDGRFVNHSCNPNCEMQKWSVN 2038
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G R+ +FA R+I A + LTYDY F F P CYCG + C+G +G K
Sbjct: 2039 GLFRMALFALRNIPAHQELTYDYNFSLFNPAEGQPCYCGTNECRGVIGGK 2088
>gi|93003038|tpd|FAA00102.1| TPA: zinc finger protein [Ciona intestinalis]
Length = 883
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 98/176 (55%), Gaps = 7/176 (3%)
Query: 251 VQYISCS-KACHCSETCNNRPFRKE---KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIG 306
+ YI CS C C + C NR +K+ K ++ +T GWGV I +G+F++EY+G
Sbjct: 92 LMYIECSPDTCPCQDKCANRCIQKQQWWKDLERFRTNDRGWGVRTNSDIPEGQFLLEYVG 151
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ + +R + Y + Y ++ ID N RF+NHSC PNC ++KW
Sbjct: 152 EVVSEREFRRRTIE-NYNAHNDHYCVQLEAGTVIDGYRLANEGRFVNHSCQPNCEMQKWV 210
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRKIG 420
V GE RVG+FA R I + E LTYDY F + + + C CG+S C+G +G K + G
Sbjct: 211 VNGEYRVGLFAKRPIVSSEELTYDYNFHAYNLDRQQPCRCGSSECRGVIGGKTQRG 266
>gi|402856517|ref|XP_003892835.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Papio anubis]
Length = 1277
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 427 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 486
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 487 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 545
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 546 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 595
>gi|332021876|gb|EGI62212.1| Putative histone-lysine N-methyltransferase ASH1L [Acromyrmex
echinatior]
Length = 2178
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 92/162 (56%), Gaps = 6/162 (3%)
Query: 260 CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C C + C N+ +K + ++ TE GWGV + I G+FI+EY+GEV+ + +
Sbjct: 1379 CPCGDKCENQKIQKHEWSPGLQRFMTEDKGWGVRTQQAIKAGDFILEYVGEVVSEREFKS 1438
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ +Y + Y + ID G RF+NHSC+PNC ++KW V G R+ +F
Sbjct: 1439 RMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVLGLPRMALF 1497
Query: 377 AARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
A+R IK GE LTYDY F F P +C CG+++C+G +G K
Sbjct: 1498 ASRDIKPGEELTYDYNFALFNPSEGQQCRCGSNACRGVIGGK 1539
>gi|417407083|gb|JAA50168.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
rotundus]
Length = 2832
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C + C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2112 FAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2171
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2172 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2230
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2231 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2280
>gi|189202688|ref|XP_001937680.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187984779|gb|EDU50267.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 646
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 96/168 (57%), Gaps = 15/168 (8%)
Query: 264 ETCNNRPFRKEKK----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
E CNNRPF + K+ ++++ TE G+GV A + I+EY GE+I +
Sbjct: 374 EQCNNRPFAELKRRSKGNRYDYGVEVLDTEDRGYGVRAMRTFEPHQIIVEYAGEIITQSE 433
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE+R+ + Y+ + +Y+ IDAT +G +RF+NHSC+PNC + KW V GE R+
Sbjct: 434 CERRMKQV-YKKDKCYYLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRM 491
Query: 374 GVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
+FA R I GE LTYDY F F + +C CG SC+G LG K K
Sbjct: 492 ALFAGPRGIMTGEELTYDYNFDPFSQKNIQECRCGTESCRGVLGPKPK 539
>gi|326436327|gb|EGD81897.1| hypothetical protein PTSG_11893 [Salpingoeca sp. ATCC 50818]
Length = 296
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 94/148 (63%), Gaps = 2/148 (1%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ R+ K ++ ++ WG+ A EPI+K E +IEY+GE++ + E R G+
Sbjct: 148 NQFKRRRKLLRFARSMIHEWGLFAQEPIDKDELVIEYVGEIVRQTVAEDRERRYARIGIG 207
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I +D+ IDAT G+ +RF+NHSCD NC + V+G+ R+G+++ R I A E +
Sbjct: 208 SSYLFRIDEDYVIDATRMGSIARFINHSCDANCYAQVVSVDGKKRIGIYSKRPIAANEEI 267
Query: 388 TYDYRF-VQFGP-EVKCYCGASSCQGYL 413
TYDY+F + GP ++ C+CGA +C+G L
Sbjct: 268 TYDYKFPREEGPNKIPCFCGARTCRGTL 295
>gi|291397821|ref|XP_002715465.1| PREDICTED: absent, small, or homeotic 1-like [Oryctolagus cuniculus]
Length = 2961
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C + C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2111 FAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2170
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2171 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2229
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2230 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2279
>gi|320036571|gb|EFW18510.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 742
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 108/191 (56%), Gaps = 21/191 (10%)
Query: 263 SETCNNRPF---RKEKK--------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
SE C NRPF R+ K ++++KTE G+GV + + + I+EY GE++
Sbjct: 376 SELCRNRPFSALRRRAKAGGKFNIGVEVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQ 435
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V G+
Sbjct: 436 EECERRMRTV-YKKNECYYLMYFDQNMVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKP 493
Query: 372 RVGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRKIGKLELCWGS 428
R+ +FA I GE LTYDY F + + +C CGA +C+G LG + K W +
Sbjct: 494 RMALFAGEDGIMTGEELTYDYNFDPYSQKNVQECRCGAPTCRGVLGPRPK-----ESWKN 548
Query: 429 KRKRSSTACLA 439
K K +A A
Sbjct: 549 KDKEKKSAPAA 559
>gi|119192128|ref|XP_001246670.1| hypothetical protein CIMG_00441 [Coccidioides immitis RS]
gi|392864092|gb|EAS35106.2| histone-lysine N-methyltransferase [Coccidioides immitis RS]
Length = 742
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 108/191 (56%), Gaps = 21/191 (10%)
Query: 263 SETCNNRPF---RKEKK--------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
SE C NRPF R+ K ++++KTE G+GV + + + I+EY GE++
Sbjct: 376 SELCRNRPFSALRRRAKAGGKFNIGVEVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQ 435
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V G+
Sbjct: 436 EECERRMRTV-YKKNECYYLMYFDQNMVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKP 493
Query: 372 RVGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRKIGKLELCWGS 428
R+ +FA I GE LTYDY F + + +C CGA +C+G LG + K W +
Sbjct: 494 RMALFAGEDGIMTGEELTYDYNFDPYSQKNVQECRCGAPTCRGVLGPRPK-----ESWKN 548
Query: 429 KRKRSSTACLA 439
K K +A A
Sbjct: 549 KDKEKKSAPAA 559
>gi|417407091|gb|JAA50172.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
rotundus]
Length = 2962
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C + C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2112 FAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2171
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2172 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2230
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2231 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2280
>gi|398388287|ref|XP_003847605.1| hypothetical protein MYCGRDRAFT_97419 [Zymoseptoria tritici IPO323]
gi|339467478|gb|EGP82581.1| hypothetical protein MYCGRDRAFT_97419 [Zymoseptoria tritici IPO323]
Length = 1069
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 115/204 (56%), Gaps = 26/204 (12%)
Query: 251 VQYISCSKACHC-SETCNNRPFR----KEKK-------IKIVKTEFCGWGVEAAEPINKG 298
+QY + C + C+NR F + KK +++VKT G+G+ + N G
Sbjct: 682 MQYECNDQNCRLPASVCSNRAFAQLAARTKKGGLFDVGVEVVKTHERGFGIRSCRSFNAG 741
Query: 299 EFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDP 358
+ I+EY GE+I + C++R+ ++ Y +Y+ E+ ++ ID T KG+ +RF+NHSC+P
Sbjct: 742 QIIMEYTGEIISEGECQRRMREV-YHDKACYYLMELERNLVIDGT-KGSMARFINHSCEP 799
Query: 359 NCILEKWQVEGETRVGVFAARS-IKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
NC + +V G R+ VFA S I GE LTYDY F FG KCYCGA++C+G L +
Sbjct: 800 NCEVRMVKVNGTPRMAVFAGESGIMTGEELTYDYNFDNFGESAQKCYCGAATCRGTLSRR 859
Query: 417 ------RKIGKLELCWGSKRKRSS 434
+K+ K E S+RKR +
Sbjct: 860 LNATEQKKMAKEE----SERKRKA 879
>gi|341896007|gb|EGT51942.1| hypothetical protein CAEBREN_26218 [Caenorhabditis brenneri]
Length = 1670
Score = 123 bits (309), Expect = 1e-25, Method: Composition-based stats.
Identities = 57/141 (40%), Positives = 89/141 (63%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K IK ++ G+G+ A E I + E IIEYIG+ I + ++R + RG+ + Y+
Sbjct: 1529 RKKMIKFARSRIHGYGLYAMETIAQDEMIIEYIGQKIRSLVADEREKAYERRGIGSSYLF 1588
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I + IDAT +GNF+RF+NHSC PNC + +EGE R+ +++ +I GE +TYDY+
Sbjct: 1589 RIDEHTVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRSTINKGEEITYDYK 1648
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C CGA +C+GYL
Sbjct: 1649 FPIEEDKIDCLCGAKTCRGYL 1669
>gi|303313071|ref|XP_003066547.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106209|gb|EER24402.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 742
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 108/191 (56%), Gaps = 21/191 (10%)
Query: 263 SETCNNRPF---RKEKK--------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
SE C NRPF R+ K ++++KTE G+GV + + + I+EY GE++
Sbjct: 376 SELCRNRPFSALRRRAKAGGKFNIGVEVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQ 435
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V G+
Sbjct: 436 EECERRMRTV-YKKNECYYLMYFDQNMVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKP 493
Query: 372 RVGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRKIGKLELCWGS 428
R+ +FA I GE LTYDY F + + +C CGA +C+G LG + K W +
Sbjct: 494 RMALFAGEDGIMTGEELTYDYNFDPYSQKNVQECRCGAPTCRGVLGPRPK-----ESWKN 548
Query: 429 KRKRSSTACLA 439
K K +A A
Sbjct: 549 KDKEKKSAPAA 559
>gi|350631656|gb|EHA20027.1| hypothetical protein ASPNIDRAFT_179076 [Aspergillus niger ATCC
1015]
Length = 853
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 109/192 (56%), Gaps = 18/192 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C E C NR F + K+ ++++KT G+GV + + I+EY GE+
Sbjct: 453 CGVGEECGNRSFEELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYTGEI 512
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 513 ITQTECEKRMRTI-YKHNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 570
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG--TKRKIGKLE 423
G+ R+ +FA R I GE LTYDY F + + +C CG+S+C+G LG K K+ + +
Sbjct: 571 GKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGSSNCRGILGPRPKEKVQRAK 630
Query: 424 LCWGSKRKRSST 435
K K+S+T
Sbjct: 631 EQKVEKSKKSAT 642
>gi|126307634|ref|XP_001366993.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
[Monodelphis domestica]
Length = 2968
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2119 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSC+PNC ++KW V
Sbjct: 2179 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVN 2237
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2238 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFDKCRGIIGGK 2287
>gi|30704948|gb|AAH52194.1| Ash1l protein, partial [Mus musculus]
Length = 963
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 114 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 173
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 174 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 232
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 233 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 282
>gi|395857586|ref|XP_003801172.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Otolemur garnettii]
Length = 1371
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 81/132 (61%), Gaps = 2/132 (1%)
Query: 258 KACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE 315
+ C E C N+ F K + + KI+KT+ GWG+ A I KGEF+ EY+GE+ID+ C
Sbjct: 1044 QVCPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECM 1103
Query: 316 QRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGV 375
R+ + +FYM I KD IDA KGN+SRF+NHSC PNC KW V G+TRVG+
Sbjct: 1104 ARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGL 1163
Query: 376 FAARSIKAGEPL 387
FA I AG L
Sbjct: 1164 FAVCDIPAGTAL 1175
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 11/64 (17%)
Query: 131 CSVRGCGGVYHFICVKE-RLGISNPRNFKCPQHACFICR----------QRLQWRCVRCT 179
C V CG YH CV++ L + R F+CP H+C C + RCVRC
Sbjct: 730 CVVTQCGKFYHETCVRKFPLTVFESRGFRCPLHSCVSCHASNPANPRPSKGKMMRCVRCP 789
Query: 180 IASH 183
+A H
Sbjct: 790 VAYH 793
>gi|328785970|ref|XP_001122948.2| PREDICTED: hypothetical protein LOC727238 [Apis mellifera]
Length = 2172
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 95/170 (55%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS + C C + C N+ +K + ++ TE GWGV + I G FI+EY+GEV
Sbjct: 1360 FSECSPQLCPCGDKCENQKIQKHEWAPGLQKFMTEDKGWGVRTQQSIKSGVFILEYVGEV 1419
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + + R+ +Y + Y + ID G RF+NHSC+PNC ++KW V
Sbjct: 1420 VSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVH 1478
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G R+ +FA+R IK GE LTYDY F F P +C CG+++C+G +G K
Sbjct: 1479 GLPRMALFASRDIKPGEELTYDYNFALFNPSEGQECRCGSNACRGVIGGK 1528
>gi|221058949|ref|XP_002260120.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193810193|emb|CAQ41387.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 2872
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 96/166 (57%), Gaps = 6/166 (3%)
Query: 266 CNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW---- 319
C NRPFR+ K ++I +TE G+GV + I GE I EY+GEV+ E+R+
Sbjct: 2435 CFNRPFRQSAIKDLEIRQTERTGYGVFSKRDIKNGELICEYVGEVLAKKEFEERVEAYQE 2494
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ K + N+Y +I +D ID+ KGN SRF+NHSC PN + +KW V G R+G+FA R
Sbjct: 2495 ESKKTNMYNWYSIQINRDVHIDSRKKGNISRFVNHSCSPNSVSQKWIVRGFYRIGIFAQR 2554
Query: 380 SIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELC 425
I AGE +TY+Y + +C C +++C Y K++ E+
Sbjct: 2555 DIPAGEEITYNYSYNFVFNNFECLCKSANCMNYTLKKKEETDNEMS 2600
>gi|395532129|ref|XP_003768124.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1
[Sarcophilus harrisii]
Length = 2969
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2120 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2179
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSC+PNC ++KW V
Sbjct: 2180 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVN 2238
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2239 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFDKCRGIIGGK 2288
>gi|395532131|ref|XP_003768125.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2
[Sarcophilus harrisii]
Length = 2974
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2125 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2184
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSC+PNC ++KW V
Sbjct: 2185 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVN 2243
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2244 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFDKCRGIIGGK 2293
>gi|317033153|ref|XP_001394944.2| histone-lysine N-methyltransferase (Ash1) [Aspergillus niger CBS
513.88]
Length = 825
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 109/192 (56%), Gaps = 18/192 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C E C NR F + K+ ++++KT G+GV + + I+EY GE+
Sbjct: 425 CGVGEECGNRSFEELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYTGEI 484
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 485 ITQTECEKRMRTI-YKHNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 542
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG--TKRKIGKLE 423
G+ R+ +FA R I GE LTYDY F + + +C CG+S+C+G LG K K+ + +
Sbjct: 543 GKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGSSNCRGILGPRPKEKVQRAK 602
Query: 424 LCWGSKRKRSST 435
K K+S+T
Sbjct: 603 EQKVEKSKKSAT 614
>gi|328768995|gb|EGF79040.1| hypothetical protein BATDEDRAFT_35515 [Batrachochytrium dendrobatidis
JAM81]
Length = 1367
Score = 123 bits (308), Expect = 2e-25, Method: Composition-based stats.
Identities = 57/141 (40%), Positives = 87/141 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+IK K+ WG+ A EPI+ + +IEYIGE+I + + R + G+ + Y+
Sbjct: 1226 RKKRIKFDKSIIHDWGLFAMEPIDANDMVIEYIGEIIRQKVADHREKLYEASGIGSSYLF 1285
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ +D IDAT GN +RF+NH C+PNC + V+G R+ ++A R IK GE LTYDY+
Sbjct: 1286 RVDEDTIIDATKTGNLARFINHCCEPNCNAKVISVDGTKRIVIYANRDIKEGEELTYDYK 1345
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C CGA +C+G L
Sbjct: 1346 FPIEEDKIPCLCGAVNCRGTL 1366
>gi|449490008|ref|XP_004176439.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L-like [Taeniopygia guttata]
Length = 2968
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2119 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2178
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSC+PNC ++KW V
Sbjct: 2179 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVN 2237
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2238 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFDKCRGIIGGK 2287
>gi|328875054|gb|EGG23419.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1359
Score = 123 bits (308), Expect = 2e-25, Method: Composition-based stats.
Identities = 57/141 (40%), Positives = 87/141 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++KKIK ++ WG+ A EPI + +IEYIGEVI + ++R +G+ + Y+
Sbjct: 1218 RKKKIKFERSLIHDWGLFALEPIYARDMVIEYIGEVIRQKVADEREKRYTKQGIGSSYLF 1277
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I D IDATFKGN +RF+NH CDPNC+ + + G+ ++ ++A R I GE LTYDY+
Sbjct: 1278 RIDDDTIIDATFKGNQARFINHCCDPNCMAKVITMGGQKKIIIYAKRDINVGEELTYDYK 1337
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C C ++ C+G L
Sbjct: 1338 FPIEDVKIPCLCKSAKCRGTL 1358
>gi|363742848|ref|XP_422858.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Gallus
gallus]
Length = 2954
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2105 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2164
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSC+PNC ++KW V
Sbjct: 2165 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVN 2223
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2224 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2273
>gi|326933478|ref|XP_003212830.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Meleagris gallopavo]
Length = 2974
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2125 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2184
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSC+PNC ++KW V
Sbjct: 2185 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVN 2243
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2244 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2293
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 91/162 (56%), Gaps = 14/162 (8%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F K + +++I +T GWG+ I KGEF+ EY+GE+ID+ C R
Sbjct: 1569 CPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRAR 1628
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ + +D IDA KGN++RF+NH C PNC +KW V G+TRVG+FA
Sbjct: 1629 I-----------RYAQEHEDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1677
Query: 378 ARSIKAGEPLTYDYRFVQFGP-EVKCYCGASSCQGYLGTKRK 418
IKAG LT++Y G + C CGA +C G+LG + K
Sbjct: 1678 LSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPK 1719
>gi|453080036|gb|EMF08088.1| hypothetical protein SEPMUDRAFT_152375 [Mycosphaerella populorum
SO2202]
Length = 881
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 97/166 (58%), Gaps = 15/166 (9%)
Query: 264 ETCNNRPF----RKEKK-------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
+ C+NRPF R+ K +++VKT+ G GV AA P G+ I+EY GE+I +
Sbjct: 410 DNCSNRPFSELERRRTKGGRYDIGVEVVKTKNRGHGVRAARPFQPGQLIMEYTGEIITED 469
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
C++R+ Y N+++ E+ +D KG+ +RF+NHSCDPNC ++ +V +R
Sbjct: 470 ECQERMA-TTYLNATNYFVMEMENGLILDGN-KGSEARFINHSCDPNCEVKMTRVGQVSR 527
Query: 373 VGVFAARS-IKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
+GV+A + I G+ LTYDY F F + CYCG C+GYLG +
Sbjct: 528 LGVYAGPAGIMTGQELTYDYNFQNFSDHRQACYCGGQHCRGYLGKR 573
>gi|210075797|ref|XP_002143056.1| YALI0D21684p [Yarrowia lipolytica]
gi|199425837|emb|CAG81317.4| YALI0D21684p [Yarrowia lipolytica CLIB122]
Length = 1284
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 100/170 (58%), Gaps = 11/170 (6%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
I+I+ TE G+G+ A GE + EY G+VI + ++RL D +Y N+Y + K
Sbjct: 245 IEIIHTESKGYGLRACRSFAPGELVCEYTGDVISPSEIKRRL-DNEYADSDNYYFLNLEK 303
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARS-IKAGEPLTYDYRFVQ 395
ID+ +G+ +RF+NHSC PNC ++KW V+G RVG+FA ++ G+ LTYDY FV
Sbjct: 304 GAVIDSGLRGSAARFVNHSCMPNCEMQKWFVKGLPRVGLFAGEDGVQNGQELTYDYNFVW 363
Query: 396 F-GPEVK-CYCGASSCQGYLG-------TKRKIGKLELCWGSKRKRSSTA 436
F G + + C+CGAS C+G +G T + + S RKR+STA
Sbjct: 364 FEGAKAQTCHCGASQCRGIIGRRAGEKATTERASSVGAIESSSRKRASTA 413
>gi|299117155|emb|CBN75119.1| histone methyltransferase [Ectocarpus siliculosus]
Length = 2067
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 55/144 (38%), Positives = 89/144 (61%), Gaps = 3/144 (2%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
+ KK++ ++ WGV A EPI G+ +IEY GE+I +A+ ++R + + + YM
Sbjct: 1923 RRKKLRFGRSSVHAWGVFADEPIAAGDLVIEYRGEIIGNAVADKREKQYEDMQIGSDYMF 1982
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ +D +DATFKG+ +RF+NHSCDP+C VEG ++ ++A R + GE L+YDY+
Sbjct: 1983 RVDEDTVVDATFKGSLARFINHSCDPSCTTRIITVEGSKKIVIYAERDVAMGEELSYDYK 2042
Query: 393 FVQFGPE---VKCYCGASSCQGYL 413
F E V C+CG+ C+G++
Sbjct: 2043 FPPEPDEAARVPCHCGSEKCRGFI 2066
>gi|19075312|ref|NP_587812.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe
972h-]
gi|74698592|sp|Q9Y7R4.1|SET1_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component set1; AltName:
Full=Lysine N-methyltransferase 2; AltName: Full=SET
domain-containing protein 1; AltName: Full=Set1 complex
component set1; Short=Set1C component set1; AltName:
Full=Spset1
gi|4704279|emb|CAB41652.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe]
Length = 920
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 88/142 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++ + G+ A E I+K + +IEYIGE+I + + R + G+ + Y+
Sbjct: 779 RKKQLHFGPSRIHTLGLFAMENIDKNDMVIEYIGEIIRQRVADNREKNYVREGIGDSYLF 838
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I +D +DAT KGN +RF+NHSC PNCI +VEG+ ++ ++A R I GE LTYDY+
Sbjct: 839 RIDEDVIVDATKKGNIARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDYK 898
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F + ++ C CGA +C+GYL
Sbjct: 899 FPEEADKIPCLCGAPTCRGYLN 920
>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 977
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 89/142 (62%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+++ + G+ A E I+K + +IEY+GE++ + + R G+ + Y+
Sbjct: 836 RKKQLRFGPSRIHTLGLFAMENIDKNDMVIEYVGEIVRQRVADTRERKYVREGIGDSYLF 895
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I KD +DAT KGN +RF+NHSC PNCI + +VEG ++ ++A R I+ GE LTYDY+
Sbjct: 896 RIDKDAIVDATKKGNIARFINHSCAPNCIAKIIRVEGHQKIVIYADRDIEEGEELTYDYK 955
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F + ++ C CGA +C+GYL
Sbjct: 956 FPEEVDKIPCLCGAPTCRGYLN 977
>gi|324500453|gb|ADY40214.1| Histone-lysine N-methyltransferase lin-59 [Ascaris suum]
Length = 1467
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 66/165 (40%), Positives = 93/165 (56%), Gaps = 9/165 (5%)
Query: 255 SCSKACHCSETCNNRPFRKE--KKIKIVKTEFC-GWGVEAAEPINKGEFIIEYIGEVIDD 311
SC + HCS N R FR+E K+ + +T G GV P+ KG+F+ EY+GEV+
Sbjct: 769 SCGRGVHCS---NKRIFRRECVDKLSLFETSNGRGLGVRTDVPLQKGQFVCEYVGEVVSM 825
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
+ R YR +N Y + + IDA KGN +RF+NHSC PNC +++W V G+
Sbjct: 826 ETFDARNA-HSYRAFRNHYALNLCPGYVIDAYQKGNIARFVNHSCVPNCEMQRWSVNGQH 884
Query: 372 RVGVFAARSIKAGEPLTYDYRFVQFG--PEVKCYCGASSCQGYLG 414
R+G+FA R + GE LTYDY + F C CG +C+G+L
Sbjct: 885 RIGLFALRVVAKGEELTYDYNWDSFDFYGVTPCSCGVPNCRGFLN 929
>gi|378733973|gb|EHY60432.1| histone-lysine N-methyltransferase ASH1L [Exophiala dermatitidis
NIH/UT8656]
Length = 780
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 101/176 (57%), Gaps = 17/176 (9%)
Query: 258 KACHCSET-CNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYI 305
+ C+ SE C NR F ++ ++++KTE G+GV + + I+EY
Sbjct: 433 RNCNLSEELCGNRQFEALRQRVKKGGKYNVGVEVIKTEDRGYGVRSNRTFEPNQIIVEYT 492
Query: 306 GEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
GE+I CE+R+ M Y+G + +Y+ ++ IDAT +G+ +RF+NHSC PNC +EKW
Sbjct: 493 GEIITQEECERRMNTM-YKGNECYYLMLFDQNMIIDAT-RGSIARFVNHSCAPNCRMEKW 550
Query: 366 QVEGETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
V G+ R+ +FA R I GE LTYDY F + + C CGA +C+G LG + K
Sbjct: 551 TVNGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQVCRCGAENCRGVLGPRSK 606
>gi|358369137|dbj|GAA85752.1| histone-lysine N-methyltransferase [Aspergillus kawachii IFO 4308]
Length = 852
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 64/173 (36%), Positives = 100/173 (57%), Gaps = 16/173 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C E C NR F + K+ ++++KT G+GV + + I+EY GE+
Sbjct: 452 CGVGEECGNRSFEELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYTGEI 511
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 512 ITQTECEKRMRTI-YKHNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 569
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
G+ R+ +FA R I GE LTYDY F + + +C CG+S+C+G LG + K
Sbjct: 570 GKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGSSNCRGILGPRPK 622
>gi|159473232|ref|XP_001694743.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158276555|gb|EDP02327.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 1105
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 93/158 (58%), Gaps = 3/158 (1%)
Query: 260 CHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C + C N+ F K ++I + G+G+ A E I G+FIIEYIGEV+++ ++R
Sbjct: 182 CPSEDKCTNQMFSKRMYANLEIRRAGAKGFGLFALEDIKAGQFIIEYIGEVLEEDEYQRR 241
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
G +++Y I IDA KGN SRF+NHSC+PNC +KW V GE +G+FA
Sbjct: 242 KEYYMSVGQRHYYFMNIGNGEVIDACRKGNISRFINHSCEPNCETQKWLVHGELAIGLFA 301
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLG 414
R I LT+DY F ++G + ++CYC + C+ ++G
Sbjct: 302 VRDIPKDTELTFDYNFERYGDKPMRCYCKSGGCRKFIG 339
>gi|50302715|ref|XP_451294.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74607741|sp|Q6CXP5.1|SET2_KLULA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|49640425|emb|CAH02882.1| KLLA0A06600p [Kluyveromyces lactis]
Length = 702
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 64/158 (40%), Positives = 89/158 (56%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C C N+ F+K+ I + KTE G+GV A I FI EYIGEVI + R+
Sbjct: 92 CGTDCQNQRFQKKAYADISVFKTERKGFGVRANSDIEPHNFIYEYIGEVIQEEEFRNRMV 151
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
G ++FY ++ IDAT KG +RF NHSC+PN + KW V G+ ++G+FA R
Sbjct: 152 KYDQMGFKHFYFMMLQTGQFIDATLKGCIARFCNHSCNPNAYVNKWVVNGKLKMGIFANR 211
Query: 380 SIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLGTK 416
I GE +T+DY ++G + CYC +C G+LG K
Sbjct: 212 HISKGEEVTFDYNVDRYGANAQPCYCEEPNCIGFLGGK 249
>gi|395329295|gb|EJF61682.1| hypothetical protein DICSQDRAFT_85722 [Dichomitus squalens LYAD-421
SS1]
Length = 1095
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 93/160 (58%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+S +A N+ ++K ++ ++ WG+ A E IN+G+ +IEY+GEVI +
Sbjct: 935 LSKGEAAAPDSVKYNQLQTRKKHLRFARSPIHDWGLYAMEKINRGDLVIEYVGEVIRAQV 994
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
++R + +G+ + Y+ I +D +DAT KGN R +NHSCDPNC + + GE ++
Sbjct: 995 ADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGEKKI 1054
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYL 413
++A + I+ G +TYDY F ++ C CG++ C+G L
Sbjct: 1055 VIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGTL 1094
>gi|290985403|ref|XP_002675415.1| predicted protein [Naegleria gruberi]
gi|284089011|gb|EFC42671.1| predicted protein [Naegleria gruberi]
Length = 438
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 67/169 (39%), Positives = 97/169 (57%), Gaps = 7/169 (4%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIV--KTEFCGWGVE-AAEPINKGEFIIEYIGEVID 310
I C C+C + C NR F K +++ +++ G GV+ + I KG FI EY+GEVI
Sbjct: 233 IECPTDCNCGDNCQNRVFTKMAYPELICFQSDTKGIGVKCNQDVIKKGTFITEYVGEVIS 292
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E R Y+ + Y + ++ IDAT+ GN +RF+NHSC PN + W V G+
Sbjct: 293 VDKFETRT-KRSYKKSLHHYCMNMNENEIIDATWMGNIARFINHSCAPNARTQTWDVNGQ 351
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK---CYCGASSCQGYLGTK 416
RVG+FA + I GE +TY+Y F+ + E K C CGA +CQG +GT+
Sbjct: 352 NRVGIFAIKDIVKGEEITYNYNFLIYNDETKQQECKCGAPNCQGVIGTR 400
>gi|121704806|ref|XP_001270666.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
clavatus NRRL 1]
gi|119398812|gb|EAW09240.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
clavatus NRRL 1]
Length = 847
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 97/169 (57%), Gaps = 16/169 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F + K+ ++++KTE G+GV + + I+EY GE+
Sbjct: 446 CGLGPDCGNRSFEELKQRTKVGGKYNIGVEVIKTEDRGYGVRSNRTFEPNQVIVEYTGEI 505
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 506 ITQVECEKRMRTL-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 563
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG 414
G+ R+ +FA R I GE LTYDY F + + +C CGA +C+G LG
Sbjct: 564 GKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGAPNCRGILG 612
>gi|330797279|ref|XP_003286689.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
gi|325083363|gb|EGC36818.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
Length = 1340
Score = 122 bits (305), Expect = 5e-25, Method: Composition-based stats.
Identities = 54/141 (38%), Positives = 84/141 (59%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
+ KKIK +++ WG+ A E IN + +IEYIGEVI + ++R +G+ + Y+
Sbjct: 1199 RRKKIKFQRSDIHDWGLFAMETINAKDMVIEYIGEVIRQKVADEREKRYIKKGIGSSYLF 1258
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D IDAT KGN +RF+NH CDPNCI + + + ++ ++A R I GE +TYDY+
Sbjct: 1259 RVDDDTIIDATLKGNLARFINHCCDPNCIAKVLTINNQKKIIIYAKRDINIGEEITYDYK 1318
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C C + C+G L
Sbjct: 1319 FPIEDEKIPCLCKSPKCRGTL 1339
>gi|290981399|ref|XP_002673418.1| predicted protein [Naegleria gruberi]
gi|284087001|gb|EFC40674.1| predicted protein [Naegleria gruberi]
Length = 798
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 105/179 (58%), Gaps = 30/179 (16%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIK--IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+ C++ C CSETC NR +K +K + KT+ GWG+ A + ++KG+F+ Y GE+I D
Sbjct: 630 LECNRDCGCSETCPNRVVQKGSNVKLCVFKTKNRGWGLRANQKLSKGQFVEVYFGELITD 689
Query: 312 ALCEQR-----------LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNC 360
A+ E+R L+D+ + GVQ CE +TID+TF GN +RFLNHSCD N
Sbjct: 690 AIAEKRGERYDRKGLSYLFDLAHGGVQ----CE----YTIDSTFIGNVTRFLNHSCDGN- 740
Query: 361 ILEKWQVEGETR------VGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYL 413
L++ V E R + F R IK GE LT+DY ++ VKC CG+ +C+G+L
Sbjct: 741 -LKQLLVCNEIRDPRYGDIAFFCKRDIKEGEELTFDYEYI-VEKRVKCLCGSKNCKGWL 797
>gi|317455359|pdb|3OPE|A Chain A, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
Methyltransferase
gi|317455360|pdb|3OPE|B Chain B, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
Methyltransferase
Length = 222
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 97/172 (56%), Gaps = 7/172 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 48 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 107
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 108 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 166
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
G R+G++A + + AG LTYDY F F E + C CG C+G +G K +
Sbjct: 167 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGKSQ 218
>gi|358333784|dbj|GAA31138.2| histone-lysine N-methyltransferase SETD1B [Clonorchis sinensis]
Length = 1685
Score = 121 bits (304), Expect = 6e-25, Method: Composition-based stats.
Identities = 59/147 (40%), Positives = 87/147 (59%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++ K+ WG+ A EPI E +IEY+G+V+ ++ E R + +G+
Sbjct: 1539 NQLKFRK-KQLIFAKSPIHAWGLIALEPIAAEEMVIEYVGQVVRKSVAELRERQYEAKGI 1597
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
Y+ I DF IDAT GN RF+NHSC PNC + VEG+ ++ +++ R I E
Sbjct: 1598 GGSYLFRIDDDFVIDATMCGNNGRFINHSCQPNCYAKIITVEGKKKIVIYSKRDINVMEE 1657
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGAS+C+G L
Sbjct: 1658 ITYDYKFPYEEEKIPCQCGASTCRGTL 1684
>gi|324507672|gb|ADY43247.1| Histone-lysine N-methyltransferase set-2 [Ascaris suum]
Length = 539
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 89/150 (59%), Gaps = 2/150 (1%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N +RK K IK ++ GWG+ A EPI E I+EY+G+ I + ++R +G
Sbjct: 391 VNQLKYRK-KMIKFARSRIHGWGLYALEPITPDEMIVEYVGQKIRPTVADEREKRYIRKG 449
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I D IDAT GNF+RF+NHSC PNC + V+GE R+ +++ I G+
Sbjct: 450 MGSSYLFRIDSDNVIDATNMGNFARFINHSCQPNCYAKVVVVDGEKRIVIYSKTPINKGD 509
Query: 386 PLTYDYRF-VQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ ++ C CGA SC+G L
Sbjct: 510 EITYDYKFPIEEEDKIDCLCGAPSCRGTLN 539
>gi|113470951|gb|ABI34877.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Danio rerio]
Length = 129
Score = 121 bits (304), Expect = 6e-25, Method: Composition-based stats.
Identities = 56/129 (43%), Positives = 79/129 (61%), Gaps = 1/129 (0%)
Query: 289 VEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNF 348
++ + + KG+F++EY+GE+ID C+QR+ V NFYM + KD IDA KGN
Sbjct: 1 LKTKQDLKKGDFVMEYVGELIDSEECKQRIRTANENHVTNFYMLTLTKDRVIDAGPKGNL 60
Query: 349 SRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGP-EVKCYCGAS 407
SRF+NHSC PNC +KW V G+ R+G+F I A LT++Y G C+CG+
Sbjct: 61 SRFMNHSCSPNCETQKWTVNGDVRIGLFTLCDISADTELTFNYNLDCLGNGRTSCHCGSE 120
Query: 408 SCQGYLGTK 416
+C G+LG K
Sbjct: 121 NCSGFLGVK 129
>gi|409047697|gb|EKM57176.1| hypothetical protein PHACADRAFT_142398 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1389
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 87/141 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K ++ ++ WG+ A E I++GE +IEY+GE+I + ++R + +G+ + Y+
Sbjct: 1248 RKKHLRFARSPIHDWGLYAMEKISRGEMVIEYVGEIIRAQVADKREKAYERQGIGSSYLF 1307
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I +D +DAT KGN R +NHSCDPNC + + E ++ ++A + I+ G +TYDY
Sbjct: 1308 RIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINSEKKIVIYAKQDIELGSEITYDYH 1367
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C CG++ C+GYL
Sbjct: 1368 FPIEQDKIPCLCGSAKCRGYL 1388
>gi|392560212|gb|EIW53395.1| hypothetical protein TRAVEDRAFT_154887 [Trametes versicolor FP-101664
SS1]
Length = 1014
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 93/160 (58%)
Query: 254 ISCSKACHCSETCNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
+S +A N+ ++K ++ ++ WG+ A E I++GE +IEY+GEVI +
Sbjct: 854 LSKGEAAAPDSVKYNQLQTRKKHLRFARSPIHDWGLYAMEKISRGEMVIEYVGEVIRAQV 913
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
++R + +G+ + Y+ I +D +DAT KGN R +NHSCDPNC + + GE ++
Sbjct: 914 ADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGEKKI 973
Query: 374 GVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYL 413
++A + I+ G +TYDY F ++ C CG++ C+G L
Sbjct: 974 VIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGTL 1013
>gi|299746032|ref|XP_002910994.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
gi|298406870|gb|EFI27500.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
Length = 1614
Score = 121 bits (304), Expect = 7e-25, Method: Composition-based stats.
Identities = 53/142 (37%), Positives = 91/142 (64%), Gaps = 1/142 (0%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K ++ ++ WG+ A E I+KGE +IEY+GEVI + ++R + +G+ + Y+
Sbjct: 1472 RKKHLRFSRSPIHDWGLYAMEKISKGEMVIEYVGEVIRAQVADKREKTYEKQGIGSSYLF 1531
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I ++F +DAT KGN R +NHSCDPNC + + G ++ ++A + I+ GE +TYDY
Sbjct: 1532 RIDEEFVVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGEEITYDYH 1591
Query: 393 F-VQFGPEVKCYCGASSCQGYL 413
F ++ ++ C CG++ C+GYL
Sbjct: 1592 FPIEQDNKIPCLCGSARCRGYL 1613
>gi|390605099|gb|EIN14490.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 164
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 90/146 (61%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K ++ ++ WG+ A E I +GE +IEY+GEVI + ++R + +G+
Sbjct: 18 NQLQTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQVADKREKAYERQGIG 77
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I +D +DAT KGN R +NHSCDPNC + + GE ++ ++A + I+ G+ +
Sbjct: 78 SSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGDEI 137
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYL 413
TYDY F ++ C CG++ C+GYL
Sbjct: 138 TYDYHFPIEQDKIPCLCGSAKCRGYL 163
>gi|389746109|gb|EIM87289.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 191
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 90/146 (61%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K ++ ++ WG+ A E I +GE +IEY+GEVI + ++R + +G+
Sbjct: 45 NQLQTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQIADKREKAYERQGIG 104
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I +D +DAT KGN R +NHSCDPNC + + GE ++ ++A + I+ G+ +
Sbjct: 105 SSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITILGEKKIVIYAKQDIELGDEI 164
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYL 413
TYDY F ++ C CG++ C+GYL
Sbjct: 165 TYDYHFPIEQDKIPCLCGSARCRGYL 190
>gi|242010887|ref|XP_002426189.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
gi|212510240|gb|EEB13451.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
Length = 574
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 92/148 (62%), Gaps = 1/148 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K++ WG+ A EPI E +IEY+G+++ L + R + + RG+
Sbjct: 428 NQLKFRK-KQLKFSKSDIHDWGLFAMEPIAADEMVIEYVGQMVRPFLADFREKEYEKRGI 486
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I + IDAT GN +RF+NHSC+PNC + +EG+ ++ +++ + IK E
Sbjct: 487 GSSYLFRIDLETIIDATKCGNLARFINHSCNPNCYAKIITIEGQKKIVIYSKKDIKVDEE 546
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA+ C+GYL
Sbjct: 547 ITYDYKFPIEEEKIPCLCGAAQCKGYLN 574
>gi|119493172|ref|XP_001263805.1| histone-lysine N-methyltransferase (Ash1), putative [Neosartorya
fischeri NRRL 181]
gi|119411965|gb|EAW21908.1| histone-lysine N-methyltransferase (Ash1), putative [Neosartorya
fischeri NRRL 181]
Length = 839
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 97/169 (57%), Gaps = 16/169 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F + K+ ++++KT G+GV + + I+EY GE+
Sbjct: 444 CGLGPDCGNRSFDELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYTGEI 503
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I A CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 504 ITQAECEKRMRTI-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 561
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG 414
G+ R+ +FA I GE LTYDY F + + +C CGAS+C+G LG
Sbjct: 562 GKPRMALFAGDHGIMTGEELTYDYNFDPYSQKNVQQCRCGASNCRGILG 610
>gi|91077840|ref|XP_971447.1| PREDICTED: similar to set domain protein [Tribolium castaneum]
Length = 1549
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 89/162 (54%), Gaps = 6/162 (3%)
Query: 260 CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C S C N+ ++ + ++ TE GWGV PI GEFI+EY+GEV+ D ++
Sbjct: 797 CPASHKCQNQKIQRHEWAPGLEKFMTENKGWGVRTKLPIKSGEFILEYVGEVVSDQEFKE 856
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ + Y + Y + ID G RF+NHSC PNC ++KW V G+ R+ +F
Sbjct: 857 RMATI-YVNDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALF 915
Query: 377 AARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
A R I++ E LTYDY F F P +C CG+ C+G +G K
Sbjct: 916 ALRDIESSEELTYDYNFSLFNPAEGQECKCGSEMCRGVIGGK 957
>gi|308487582|ref|XP_003105986.1| CRE-SET-2 protein [Caenorhabditis remanei]
gi|308254560|gb|EFO98512.1| CRE-SET-2 protein [Caenorhabditis remanei]
Length = 1505
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 91/164 (55%), Gaps = 16/164 (9%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV---------------ID 310
N FRK K IK ++ GWG+ A E I + E I+EYIG+ I
Sbjct: 1343 INQLKFRK-KMIKFARSRIHGWGLYAMETIAQDEMIVEYIGQTVIVFQNFSSILFHLQIR 1401
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ ++R + RG+ + Y+ I ++ IDAT +GNF+RF+NHSC PNC + +EGE
Sbjct: 1402 SLVADEREKAYERRGIGSSYLFRIDENSVIDATKRGNFARFINHSCQPNCYAKVLTIEGE 1461
Query: 371 TRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
R+ +++ I GE +TYDY+F ++ C CGA +C+GYL
Sbjct: 1462 KRIVIYSRSVINKGEEITYDYKFPIEDDKIDCLCGAKACRGYLN 1505
>gi|71000549|ref|XP_754958.1| histone-lysine N-methyltransferase (Ash1) [Aspergillus fumigatus
Af293]
gi|66852595|gb|EAL92920.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
fumigatus Af293]
gi|159127971|gb|EDP53086.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
fumigatus A1163]
Length = 845
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 97/169 (57%), Gaps = 16/169 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F + K+ ++++KT G+GV + + I+EY GE+
Sbjct: 444 CGLGPGCGNRSFEELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYTGEI 503
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I A CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 504 ITQAECEKRMRTI-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 561
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG 414
G+ R+ +FA I GE LTYDY F + + +C CGAS+C+G LG
Sbjct: 562 GKPRMALFAGDHGIMTGEELTYDYNFDPYSQKNVQQCRCGASNCRGILG 610
>gi|299116101|emb|CBN74517.1| Possible Cdc2-related protein kinase [Ectocarpus siliculosus]
Length = 1898
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 93/157 (59%), Gaps = 6/157 (3%)
Query: 261 HCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRL 318
C C N+ +K + ++++ K + G G++ EP++KG+FI EY+GE+I +R+
Sbjct: 257 QCHRGCRNQRIQKRENAQVEVFKADGKGMGLKVVEPVSKGQFIAEYVGEIITRKELNKRM 316
Query: 319 WDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA 378
+ G + YM ++ D +DA KG +RF+NHSC+P C LE+W G+ R VF+
Sbjct: 317 --ISSAGTRKLYMMQLGDDTYLDAKRKGGIARFVNHSCEPTCRLEQWTAMGQPRCAVFSL 374
Query: 379 RSIKAGEPLTYDYRFV--QFGPEVKCYCGASSCQGYL 413
R++KAGE L++DY++ KC CG+ C+G +
Sbjct: 375 RAMKAGEELSFDYQWEAHHLRENTKCLCGSPQCRGTI 411
>gi|270001477|gb|EEZ97924.1| hypothetical protein TcasGA2_TC000311 [Tribolium castaneum]
Length = 1647
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 89/162 (54%), Gaps = 6/162 (3%)
Query: 260 CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C S C N+ ++ + ++ TE GWGV PI GEFI+EY+GEV+ D ++
Sbjct: 895 CPASHKCQNQKIQRHEWAPGLEKFMTENKGWGVRTKLPIKSGEFILEYVGEVVSDQEFKE 954
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ + Y + Y + ID G RF+NHSC PNC ++KW V G+ R+ +F
Sbjct: 955 RMATI-YVNDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALF 1013
Query: 377 AARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
A R I++ E LTYDY F F P +C CG+ C+G +G K
Sbjct: 1014 ALRDIESSEELTYDYNFSLFNPAEGQECKCGSEMCRGVIGGK 1055
>gi|170095481|ref|XP_001878961.1| histone methyltransferase [Laccaria bicolor S238N-H82]
gi|164646265|gb|EDR10511.1| histone methyltransferase [Laccaria bicolor S238N-H82]
Length = 144
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 88/141 (62%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K ++ ++ WG+ A E I+KGE +IEY+GEVI + E+R + +G+ + Y+
Sbjct: 3 RKKHLRFARSPIHDWGLYAMEKISKGEMVIEYVGEVIRAQVAEKREKTYERQGIGSSYLF 62
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I +D +DAT KGN R +NHSCDPNC + + GE ++ ++A + I+ G+ +TYDY
Sbjct: 63 RIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGDEITYDYH 122
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C CG+ C+G+L
Sbjct: 123 FPFEQDKILCLCGSVKCRGFL 143
>gi|310794022|gb|EFQ29483.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 828
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/167 (40%), Positives = 97/167 (58%), Gaps = 16/167 (9%)
Query: 264 ETCNNRPFR--KEKK---------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NR F +E+K ++++KT G+G+ A + I+EY GE+I D
Sbjct: 442 EHCTNRAFADLQERKSGGGKYRVGVEVIKTSDRGYGIRANRCFEPNQIIMEYTGEIITDE 501
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
C +R+ + KY+ + +Y+ ++ IDAT KG+ +RF+NHSC PNC + KW V G+ R
Sbjct: 502 ECSERMEN-KYKDSKCYYLMSFDQNMIIDAT-KGSIARFVNHSCAPNCRMIKWIVSGQPR 559
Query: 373 VGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
+ +FA + I GE LTYDY F F E KC CGA +C+G LG K
Sbjct: 560 MALFAGDKPIMTGEELTYDYNFSPFSDENVQKCLCGAPNCRGILGPK 606
>gi|449666506|ref|XP_002161122.2| PREDICTED: uncharacterized protein LOC100198749 [Hydra
magnipapillata]
Length = 1403
Score = 120 bits (300), Expect = 2e-24, Method: Composition-based stats.
Identities = 55/147 (37%), Positives = 89/147 (60%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
NN+ R++K ++ K+ WG+ A EPIN E + EY+G+++ + E R + +G+
Sbjct: 1256 NNKLMRRKKALRFSKSSIHNWGLFACEPINADEMVCEYVGQMVRSIVAEIRERRYEKQGI 1315
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ + D IDAT G +RF+NH CDPNC + VEG ++ +++ R+IK GE
Sbjct: 1316 GSSYLFRLDSDSVIDATKDGCNARFINHCCDPNCYAKVILVEGAKKIVIYSRRAIKLGEE 1375
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA+ C+G L
Sbjct: 1376 ITYDYKFPIEDEKIPCLCGAALCRGTL 1402
>gi|449301739|gb|EMC97748.1| hypothetical protein BAUCODRAFT_402857 [Baudoinia compniacensis
UAMH 10762]
Length = 822
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/150 (40%), Positives = 94/150 (62%), Gaps = 6/150 (4%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
++++KT G+GV + G+ I+EY GE+I + C++R+ + Y+ Q +Y+ E+ +
Sbjct: 450 VEVLKTPNRGFGVRSCRTFTPGQIIMEYTGEIISEGECQRRMRE-DYKDKQCYYLMELER 508
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARS-IKAGEPLTYDYRFVQ 395
ID T KG+ +RF+NHSC PNC + +V G R+GVFA + GE LTYDY F
Sbjct: 509 GLIIDGT-KGSMARFINHSCAPNCEVRMLKVNGTPRMGVFAGEGGVSTGEELTYDYNFDN 567
Query: 396 FGPEVK-CYCGASSCQGYLGTKRKIGKLEL 424
FG + CYCGA++C+G+L +++ K EL
Sbjct: 568 FGTTRQACYCGAATCRGFLS--KRLNKEEL 595
>gi|238505954|ref|XP_002384179.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
flavus NRRL3357]
gi|220690293|gb|EED46643.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
flavus NRRL3357]
gi|391868614|gb|EIT77824.1| hypothetical protein Ao3042_05901 [Aspergillus oryzae 3.042]
Length = 789
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 97/169 (57%), Gaps = 16/169 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F + K+ ++++KTE G+GV + + I+EY GE+
Sbjct: 406 CGVGPECGNRNFEELKQRTKAGGKYNIGVEVIKTEDRGYGVRSNRTFEPNQVIVEYTGEI 465
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I A CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 466 ITQAECEKRMRTI-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 523
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG 414
G+ R+ +FA R I G+ LTYDY F + + +C CG+ C+G LG
Sbjct: 524 GKPRMALFAGDRGIMTGDELTYDYNFDPYSQKNVQQCRCGSDRCRGILG 572
>gi|405966542|gb|EKC31816.1| Putative histone-lysine N-methyltransferase ASH1L [Crassostrea gigas]
Length = 2162
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 99/172 (57%), Gaps = 7/172 (4%)
Query: 251 VQYISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIG 306
+ Y C C C E C N+ F K + ++++ T+ G+G+ ++ I+ G+FI+EY+G
Sbjct: 1372 MMYTECDISTCPCQEQCLNQRFHKHEWVSGLEVIVTKDRGYGIRTSDSISNGQFILEYLG 1431
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ +A +R+ + +Y ++ Y + ID GN R++NHSC+PNC ++KW
Sbjct: 1432 EVVSEAEFRRRMTE-EYSQERHHYCLNLDSGAVIDGYRMGNIGRYVNHSCEPNCEMQKWN 1490
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
V G R+G+FA + I LTYDY F F + + C CG+ +C+G +G K
Sbjct: 1491 VNGVYRMGLFALKDISPNMELTYDYNFHSFNVDAQQLCRCGSENCRGVIGGK 1542
>gi|401888851|gb|EJT52799.1| histone-lysine N-methyltransferase [Trichosporon asahii var. asahii
CBS 2479]
Length = 905
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 100/178 (56%), Gaps = 13/178 (7%)
Query: 251 VQYISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ +I C + C + C N+ +++ + +V+TE G+G+ A E I G + EYIGE
Sbjct: 213 IMFIECLAGECKAGKRCRNQRLARKQYAPVDVVQTEKKGFGLRAREDIPAGALVYEYIGE 272
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
V+ + +R+ + G+++FY ++K+ IDAT KG RF NHSC+PNC ++KW V
Sbjct: 273 VVVEKTFRKRMAEYADEGIKHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVV 332
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLG--TKRKIGKLE 423
R+G+F R +K E +T++Y CYCG +C G +G T+ IG ++
Sbjct: 333 GRRMRMGIFTKRDVKKDEEITFNYNI--------CYCGEPNCVGTIGGKTQTDIGGMD 382
>gi|392590566|gb|EIW79895.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 160
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 90/146 (61%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K ++ ++ WG+ A E +++GE +IEY+GEV+ + ++R + +G+
Sbjct: 14 NQLQTRKKHLRFARSPIHDWGLYAMERVSRGEMVIEYVGEVVRAQVADKREKAYERQGIG 73
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I +D +DAT KGN R +NHSCDPNC + GE ++ ++A + I+ G+ +
Sbjct: 74 SSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTARIITISGEKKIVIYAKQDIELGDEI 133
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYL 413
TYDY F ++ C CG++ C+GYL
Sbjct: 134 TYDYHFPIEQDKIPCLCGSAKCRGYL 159
>gi|317151143|ref|XP_001824470.2| histone-lysine N-methyltransferase (Ash1) [Aspergillus oryzae
RIB40]
Length = 796
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 97/169 (57%), Gaps = 16/169 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F + K+ ++++KTE G+GV + + I+EY GE+
Sbjct: 413 CGVGPECGNRNFEELKQRTKAGGKYNIGVEVIKTEDRGYGVRSNRTFEPNQVIVEYTGEI 472
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I A CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 473 ITQAECEKRMRTI-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 530
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG 414
G+ R+ +FA R I G+ LTYDY F + + +C CG+ C+G LG
Sbjct: 531 GKPRMALFAGDRGIMTGDELTYDYNFDPYSQKNVQQCRCGSDRCRGILG 579
>gi|390351134|ref|XP_003727587.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Strongylocentrotus purpuratus]
Length = 282
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 87/148 (58%), Gaps = 1/148 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K IK K+ GWG+ A EPI E +IEY+GE + ++ + R + G+
Sbjct: 136 NQLQFRK-KDIKFCKSSIHGWGLYAMEPIAADEMVIEYVGESVRQSIADSREKAYERMGI 194
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I IDAT GN +RF+NHSC+PNC + VE E ++ +++ ++I G+
Sbjct: 195 GSSYLFRIDAVTIIDATKSGNLARFINHSCNPNCYAKIITVESEKKIVIYSKQTINVGDE 254
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA+ C+G L
Sbjct: 255 ITYDYKFPIEDEKISCLCGAAQCRGTLN 282
>gi|291225527|ref|XP_002732754.1| PREDICTED: Ash1l protein-like [Saccoglossus kowalevskii]
Length = 2643
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/173 (36%), Positives = 92/173 (53%), Gaps = 13/173 (7%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIVK---TEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C + C+N +K + I ++ TE GWG+ + I G+FI+EY+GEV
Sbjct: 1852 FTECSPNLCPCGDHCSNMSIQKHEWITTLERLPTENRGWGIRTKDAIKSGQFILEYVGEV 1911
Query: 309 IDDALCEQRLWDMKYRGVQN---FYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
+ E W + QN Y + ID G RF+NHSC+PNC ++KW
Sbjct: 1912 V----SENEFWQRAMKNYQNRRHHYCLNLNSGMVIDGYRMGCEGRFVNHSCEPNCEMQKW 1967
Query: 366 QVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
V G R+G+FA + I+ G LTYDY F F E + C CG+ C+G++G K
Sbjct: 1968 SVNGVYRIGLFALKDIQPGSELTYDYNFHAFNLETQQECCCGSDKCRGFIGGK 2020
>gi|242813250|ref|XP_002486129.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
stipitatus ATCC 10500]
gi|218714468|gb|EED13891.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
stipitatus ATCC 10500]
Length = 865
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 99/176 (56%), Gaps = 16/176 (9%)
Query: 257 SKACHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYI 305
S C C NR F + K+ ++++KT G+GV + + I+EY
Sbjct: 476 SSNCRVGPNCGNRSFEELKQRTKAGGKYNVGVEVIKTADRGYGVRSNRTFEPNQVIVEYT 535
Query: 306 GEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
GE+I + CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC PNC +EKW
Sbjct: 536 GEIITQSECERRMRTV-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCAPNCRMEKW 593
Query: 366 QVEGETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
V G+ R+ +FA R I GE LTYDY F + + +C CGA +C+G LG + K
Sbjct: 594 TVGGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGAPTCRGVLGPRPK 649
>gi|170573421|ref|XP_001892464.1| SET domain containing protein [Brugia malayi]
gi|158601976|gb|EDP38706.1| SET domain containing protein [Brugia malayi]
Length = 1603
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 251 VQYISCSKACHCSETCNN-RPFRKE--KKIKIVKT-EFCGWGVEAAEPINKGEFIIEYIG 306
V + C +C + C N R FR+E ++++ +T CG GV+ I+KG+FI EYIG
Sbjct: 882 VVLMECGNSCPRNALCTNKRLFRRECVERLRTFQTMNGCGIGVKTDVNIDKGQFICEYIG 941
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ R YR +N Y + F +DA KGN +RF+NHSC PNC +++W
Sbjct: 942 EVVSMETFNIR-SRTDYRYQRNHYALNLCPGFVVDAYHKGNIARFINHSCAPNCEMQRWS 1000
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRF--VQFGPEVKCYCGASSCQGYLG 414
V G R+G+FA R I GE LTYDY + +F C CGA +C+ +L
Sbjct: 1001 VNGHYRIGLFALRGIHEGEELTYDYNWDAFEFDDVTICCCGAXNCRHFLN 1050
>gi|406697594|gb|EKD00852.1| histone-lysine N-methyltransferase [Trichosporon asahii var. asahii
CBS 8904]
Length = 843
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 100/178 (56%), Gaps = 13/178 (7%)
Query: 251 VQYISC-SKACHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ +I C + C + C N+ +++ + +V+TE G+G+ A E I G + EYIGE
Sbjct: 213 IMFIECLAGECKAGKRCRNQRLARKQYAPVDVVQTEKKGFGLRAREDIPAGALVYEYIGE 272
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
V+ + +R+ + G+++FY ++K+ IDAT KG RF NHSC+PNC ++KW V
Sbjct: 273 VVVEKTFRKRMAEYADEGIKHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVV 332
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLG--TKRKIGKLE 423
R+G+F R +K E +T++Y CYCG +C G +G T+ IG ++
Sbjct: 333 GRRMRMGIFTKRDVKKDEEITFNYNI--------CYCGEPNCVGTIGGKTQTDIGGMD 382
>gi|384250559|gb|EIE24038.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 295
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 95/158 (60%), Gaps = 3/158 (1%)
Query: 260 CHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C E C+N+ F K + K+ K G+G+ A + + G+FI+EYIGEV+++ +R
Sbjct: 65 CPCGERCSNQQFSKRQYAKLEKRRAGAKGFGLFATQDLVAGQFIVEYIGEVLEEEEYLRR 124
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
+ G +++Y I IDA KG RF+NHSC+PNC +KW V GE +G++A
Sbjct: 125 KDYYQESGQRHYYFMNIGNGEVIDAARKGALGRFINHSCNPNCETQKWVVRGELAIGLYA 184
Query: 378 ARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLG 414
+ I AG LT+DY F ++G + ++C C A C+G++G
Sbjct: 185 LKDIPAGVELTFDYNFERYGDKPMRCLCEAKVCRGFIG 222
>gi|195496208|ref|XP_002095596.1| GE22484 [Drosophila yakuba]
gi|194181697|gb|EDW95308.1| GE22484 [Drosophila yakuba]
Length = 2215
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 92/170 (54%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIVK---TEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C E C N+ ++ V+ T GWGV PI KG +I+EY+GEV
Sbjct: 1353 YTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEV 1412
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1413 VTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1471
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I+ GE LTYDY F F P C C S C+G +G K
Sbjct: 1472 GLSRMVLFAKRAIEEGEELTYDYNFSLFNPSEGQPCRCNTSQCRGVIGGK 1521
>gi|198414837|ref|XP_002125245.1| PREDICTED: similar to AGAP002246-PA, partial [Ciona intestinalis]
Length = 754
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 52/147 (35%), Positives = 90/147 (61%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K +K ++ GWG+ A E I E +IEY+G+++ + ++R D RG+
Sbjct: 608 NQLMYRKKSVKFKRSHIHGWGLFAEETIGADEMVIEYVGQLVRSLIADRREVDYTRRGIG 667
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I D IDAT GNF+RF+NHSC+P+C + V+G ++ +++ +IK + +
Sbjct: 668 SSYLFRIDSDHIIDATKCGNFARFMNHSCNPSCYAKVIAVDGAKKIVIYSKDTIKPTDEI 727
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYLG 414
TYDY+F ++ C+CGA +C+G L
Sbjct: 728 TYDYKFPIEDVKIPCFCGAPNCRGTLN 754
>gi|115402571|ref|XP_001217362.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114189208|gb|EAU30908.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 790
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 101/173 (58%), Gaps = 16/173 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F + K+ ++++KT G+GV + + + I+EY GE+
Sbjct: 416 CSLGPGCGNRNFNELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFDPNQIIVEYTGEI 475
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I + CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 476 ITQSECEKRMRTI-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVA 533
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
G+ R+ +FA R + GE LTYDY F + + +C CG+++C+G LG + K
Sbjct: 534 GKPRMALFAGDRGVMTGEELTYDYNFDPYSQKNVQQCRCGSANCRGVLGPRPK 586
>gi|157123795|ref|XP_001653916.1| set domain protein [Aedes aegypti]
gi|157123797|ref|XP_001653917.1| set domain protein [Aedes aegypti]
gi|108874200|gb|EAT38425.1| AAEL009666-PB, partial [Aedes aegypti]
gi|108874201|gb|EAT38426.1| AAEL009666-PA, partial [Aedes aegypti]
Length = 2091
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 106/194 (54%), Gaps = 8/194 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
++ CS + C C E C N ++ + ++ TE GWG+ + E + KG FI+EY+GEV
Sbjct: 1280 FVECSPENCPCGERCKNTKIQRHEYAPGLERFMTEQKGWGIRSKEGVRKGLFIMEYLGEV 1339
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + ++R+ + Y + Y + ID G+ RF+NHSC PNC ++KW V
Sbjct: 1340 VTEKEFKERMRTI-YLNDTHHYCLNLTGGLVIDGHRMGSDCRFVNHSCAPNCEMQKWSVN 1398
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK-RKIGKLELC 425
G R+ +FA+R I E LTYDY F F P C CGA C+G +G K +++ L +
Sbjct: 1399 GLFRMALFASRDIPPYEELTYDYNFSLFNPTEGQPCMCGAEQCRGVIGGKSQRVKPLPVA 1458
Query: 426 WGSKRKRSSTACLA 439
+K++ ++ + A
Sbjct: 1459 SEAKKQETTVSTAA 1472
>gi|219111565|ref|XP_002177534.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217412069|gb|EEC51997.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
Length = 144
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 83/139 (59%), Gaps = 3/139 (2%)
Query: 278 KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRL--WDMKYRGVQNFYMCEIR 335
K ++ + GWG+ + I KG+ ++EY+G VID E RL W+ + NFY+ +R
Sbjct: 6 KPLREQGKGWGLVPCDKIGKGDLVLEYVGNVIDAKEKEDRLSEWERDHPNDPNFYIMSLR 65
Query: 336 KDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRF-V 394
+ IDA K N SRF+NHSC PNC L + V G R G+FA R I+AGE L+YDY F
Sbjct: 66 DQWYIDARHKANLSRFINHSCAPNCFLTQINVNGYARNGIFAKRDIQAGEFLSYDYHFDT 125
Query: 395 QFGPEVKCYCGASSCQGYL 413
+ G C CGA SC+G +
Sbjct: 126 KQGDRFVCRCGAKSCRGTM 144
>gi|260833262|ref|XP_002611576.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
gi|229296947|gb|EEN67586.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
Length = 734
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 105/188 (55%), Gaps = 11/188 (5%)
Query: 241 LMENKVGP----PPVQYISCS-KACHCSETCNNRPFRK---EKKIKIVKTEFCGWGVEAA 292
LM + P P + ++ CS C ++ C N+ ++ + ++ + T+ G+GV +
Sbjct: 43 LMHDVTTPLSSAPRMSFMECSPNTCPYADQCANQRIQRHEWDPGLERIVTKDRGYGVRSK 102
Query: 293 EPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFL 352
PI +G FI+EY+GEV+ + +R ++ Y + Y + ID G RF+
Sbjct: 103 TPIPQGNFILEYVGEVVSEQEFRRRTVEI-YHDHNHHYCLNLHSGAVIDGYKYGCEGRFV 161
Query: 353 NHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQ 410
NHSC+PNC ++KW V G R+G+FA R I AGE LTYDY F F E + C CG++ C+
Sbjct: 162 NHSCEPNCEMQKWSVNGVYRIGLFALRDIPAGEELTYDYNFHAFNMEKQQICKCGSAKCR 221
Query: 411 GYLGTKRK 418
G++G K +
Sbjct: 222 GFIGGKNQ 229
>gi|115489550|ref|NP_001067262.1| Os12g0613200 [Oryza sativa Japonica Group]
gi|108862955|gb|ABA99391.2| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113649769|dbj|BAF30281.1| Os12g0613200 [Oryza sativa Japonica Group]
Length = 1212
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 90/142 (63%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+++ +++ WG+ A E I+ +F+IEY+GE+I + + R + G+ + Y+
Sbjct: 1071 RKKRLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQVSDIREDQYEKSGIGSSYLF 1130
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D+ +DAT +G +RF+NHSCDPNC + VEG+ ++ ++A R I AGE LTY+Y+
Sbjct: 1131 RLDDDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKKIVIYAKRRIYAGEELTYNYK 1190
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F ++ C+CG+ C+G +
Sbjct: 1191 FPLEEKKIPCHCGSQRCRGSMN 1212
>gi|212544714|ref|XP_002152511.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
marneffei ATCC 18224]
gi|210065480|gb|EEA19574.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
marneffei ATCC 18224]
Length = 865
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 98/176 (55%), Gaps = 16/176 (9%)
Query: 257 SKACHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYI 305
S C C NR F + K+ ++++KT G+GV + + I+EY
Sbjct: 479 SSNCRVGPNCGNRSFEELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYT 538
Query: 306 GEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
GE+I CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC PNC +EKW
Sbjct: 539 GEIITQNECERRMRSV-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCAPNCRMEKW 596
Query: 366 QVEGETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
V G+ R+ +FA R I GE LTYDY F + + +C CGA +C+G LG + K
Sbjct: 597 TVGGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGAPTCRGVLGPRPK 652
>gi|353243391|emb|CCA74938.1| related to regulatory protein SET1 [Piriformospora indica DSM 11827]
Length = 1224
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 84/141 (59%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+++ ++ WG+ A E I KGE +IEY+GEVI + E+R + G+ + Y+
Sbjct: 1083 RKKQLQFARSPIHDWGLYALERIPKGEMVIEYVGEVIRQQVAEKRERAYERSGIGSSYLF 1142
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I D +DAT GN R +NHSCDPNC + + G+ ++ ++A I G+ +TYDY
Sbjct: 1143 RIDDDLVVDATKIGNLGRLINHSCDPNCTAKIITIGGQKKIVIYAKVDIHPGDEVTYDYH 1202
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C CGA+ C+G+L
Sbjct: 1203 FPIENEKIPCLCGAAKCRGFL 1223
>gi|327286108|ref|XP_003227773.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Anolis carolinensis]
Length = 2957
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 95/170 (55%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ E + G+FIIEY+GEV
Sbjct: 2107 FAECSPNTCPCGEHCCNQRIQRHEWVQCLERFRAEEKGWGIRTKESLKAGQFIIEYLGEV 2166
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSC+PNC ++KW V
Sbjct: 2167 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVN 2225
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTK 416
G R+G++A + + AG LTYDY F F E + C CG C+G +G K
Sbjct: 2226 GVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQLCKCGFEKCRGIIGGK 2275
>gi|307111585|gb|EFN59819.1| hypothetical protein CHLNCDRAFT_18588, partial [Chlorella
variabilis]
Length = 380
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 97/163 (59%), Gaps = 8/163 (4%)
Query: 260 CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C+N+ F R+ K+ + + G+G+ AAE + G+F+IEY+GEV+++ +R
Sbjct: 156 CPSGHRCSNQMFTKREYSKLDVKRAGAKGFGLFAAEDMKAGQFLIEYLGEVLEEEEYHRR 215
Query: 318 ---LWDMKY--RGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
W + G +++Y + IDA+ +GN RF+NHSC+PNC +KW V GE
Sbjct: 216 QGAAWKEYFIETGQRHYYFMNVGNGEVIDASRRGNLGRFINHSCEPNCETQKWVVHGELA 275
Query: 373 VGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLG 414
+G+F I AG LT+DY F ++G + +KC CG+ +C+G +G
Sbjct: 276 IGLFTLEDISAGTELTFDYNFERYGDKPMKCLCGSKNCRGVIG 318
>gi|194751483|ref|XP_001958056.1| GF10724 [Drosophila ananassae]
gi|190625338|gb|EDV40862.1| GF10724 [Drosophila ananassae]
Length = 2257
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 93/170 (54%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIVK---TEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C +E C N+ ++ + V+ TE GWGV PI KG +I+EY+GEV
Sbjct: 1382 YTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTEDKGWGVRTKLPIAKGTYILEYVGEV 1441
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1442 VTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1500
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I GE LTYDY F F P C C C+G +G K
Sbjct: 1501 GLSRMVLFAKRAIAQGEELTYDYNFSLFNPSEGQPCRCRMPQCRGVIGGK 1550
>gi|50546869|ref|XP_500904.1| YALI0B14883p [Yarrowia lipolytica]
gi|74689791|sp|Q6CEK8.1|SET1_YARLI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49646770|emb|CAG83155.1| YALI0B14883p [Yarrowia lipolytica CLIB122]
Length = 1170
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 95/169 (56%), Gaps = 5/169 (2%)
Query: 251 VQYISCSKACHCSET--CNNRPFRKEKK-IKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V I+ K +ET N RK KK +K ++ WG+ A EPI E IIEY+GE
Sbjct: 1002 VADINMQKQLLSTETDVLNFNQLRKRKKPVKFARSAIHNWGLYAIEPIAANEMIIEYVGE 1061
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
V+ + + R G+ + Y+ + + +DAT +G +RF+NH C P+C + +V
Sbjct: 1062 VVRQEIADLREARYMRSGIGSSYLFRVDESTVVDATKRGGIARFINHCCTPSCTAKIIKV 1121
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRF-VQFGPE-VKCYCGASSCQGYLG 414
EG+ R+ ++A+R I A E LTYDY+F + G E + C CGA C+GYL
Sbjct: 1122 EGQKRIVIYASRDIAANEELTYDYKFEKEIGEERIPCLCGAPGCKGYLN 1170
>gi|402585708|gb|EJW79647.1| hypothetical protein WUBG_09444, partial [Wuchereria bancrofti]
Length = 511
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 96/170 (56%), Gaps = 7/170 (4%)
Query: 251 VQYISCSKACHCSETCNN-RPFRKE--KKIKIVKT-EFCGWGVEAAEPINKGEFIIEYIG 306
V + C +C + C N R FR+E ++++ +T CG GV+ I+KG+FI EYIG
Sbjct: 154 VVLMECGSSCPRNAICTNKRLFRRECIERLRTFQTMNGCGIGVKTDVNIDKGQFICEYIG 213
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ R YR +N Y + F +DA KGN +RF+NHSC PNC +++W
Sbjct: 214 EVVSMETFNIR-SRTDYRYQRNHYALNLCPGFVVDAYHKGNIARFINHSCAPNCEMQRWS 272
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRF--VQFGPEVKCYCGASSCQGYLG 414
V G R+G+FA R I GE LTYDY + +F C CGA +C+ +L
Sbjct: 273 VNGHYRIGLFALRGIHEGEELTYDYNWDAFEFDDVTICCCGAPNCRHFLN 322
>gi|302682536|ref|XP_003030949.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
gi|300104641|gb|EFI96046.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
Length = 171
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 89/146 (60%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K ++ ++ WG+ A E I +GE +IEY+GEVI + ++R + +G+
Sbjct: 25 NQLQTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQVADKREATYERQGIG 84
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I ++ +DAT KGN R +NHSCDPNC + + GE ++ ++A R I+ G+ +
Sbjct: 85 SSYLFRIDEEIVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKRDIELGDEI 144
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYL 413
TYDY F ++ C CG + C+G+L
Sbjct: 145 TYDYHFPFEQDKIPCLCGTAKCRGFL 170
>gi|409078063|gb|EKM78427.1| hypothetical protein AGABI1DRAFT_41599 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426194069|gb|EKV44001.1| histone methyltransferase [Agaricus bisporus var. bisporus H97]
Length = 163
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 90/146 (61%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K ++ K+ WG+ A E I +GE +IEY+GEVI A+ ++R + +G+
Sbjct: 17 NQLQTRKKHLRFAKSPIHDWGLYAMERIARGEMVIEYVGEVIRAAVADKREKAYERQGIG 76
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I +D +DAT KGN R +NHSCDPNC + + G ++ ++A + I+ G+ +
Sbjct: 77 SSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGDEI 136
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYL 413
TYDY F ++ C CG++ C+G+L
Sbjct: 137 TYDYHFPFEQDKIPCLCGSAKCRGFL 162
>gi|209880293|ref|XP_002141586.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209557192|gb|EEA07237.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 517
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 102/175 (58%), Gaps = 11/175 (6%)
Query: 253 YISCSK-ACHC----SETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYI 305
YI C+K C S+ C N F R+ K + + T+ G G+ AE I GEF+IEY+
Sbjct: 88 YIECNKDTCQICKQGSKDCGNCRFTLRQYKLVTLFDTKSKGIGLRTAEDIKLGEFVIEYV 147
Query: 306 GEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKW 365
GE+I+ E+R M + ++ Y+ + + IDAT+KGN +RF+NHSC+PN + +KW
Sbjct: 148 GEIINLKQLEERK-KMTSKLTKHIYVFSLGNETYIDATYKGNLARFINHSCEPNLVAQKW 206
Query: 366 QVEGETRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLGTKR 417
V + +VG+F+ + IKAG+ LT+DYRF + +C CG+ C+ + +
Sbjct: 207 FVGSDIKVGLFSLKDIKAGDELTFDYRFGTSISGDQPFECMCGSKLCEKRISNTK 261
>gi|429862858|gb|ELA37465.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
Nara gc5]
Length = 805
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 95/167 (56%), Gaps = 16/167 (9%)
Query: 264 ETCNNRPFR--KEKK---------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NR F +E+K ++++KT G+G+ A + I+EY GE+I D
Sbjct: 425 ENCTNRAFADLQERKAGGGKYRVGVEVIKTADRGYGIRANRCFAPNQIIMEYTGEIITDE 484
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
C R+ + KY+ + +Y+ ++ IDAT G+ +RF+NHSC PNC + KW V G+ R
Sbjct: 485 ECSNRM-ETKYKDNKCYYLMSFDQNMIIDATT-GSIARFVNHSCAPNCRMIKWIVSGQPR 542
Query: 373 VGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
+ +FA + I GE LTYDY F F E KC CGA +C+G LG K
Sbjct: 543 MALFAGDKPIMTGEELTYDYNFSPFSDENVQKCLCGAPNCRGILGPK 589
>gi|195435682|ref|XP_002065808.1| GK20252 [Drosophila willistoni]
gi|194161893|gb|EDW76794.1| GK20252 [Drosophila willistoni]
Length = 2294
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 91/170 (53%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS C + C+N+ + + ++ T GWGV PI KG +I+EY+GEV
Sbjct: 1428 YTECSATNCPSGDKCHNQKIHRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEV 1487
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1488 VTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1546
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R I+ GE LTYDY F F P C CG C+G +G K
Sbjct: 1547 GLSRMVLFAKRQIRQGEELTYDYNFSLFNPSEGQPCRCGMPQCRGVIGGK 1596
>gi|168060592|ref|XP_001782279.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666292|gb|EDQ52951.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 690
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 90/175 (51%), Gaps = 17/175 (9%)
Query: 256 CSKACHCSETCNNRPFRKEKKIK--IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C+ +C C E C NR +K ++K + K+ GW V +A+PI G F+ EYIGEV++D
Sbjct: 515 CNSSCLCREDCQNRVLQKGVRVKLEVFKSRHKGWAVRSAQPIPSGTFVCEYIGEVVNDRE 574
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKD----------FTIDATFKGNFSRFLNHSCDPNCILE 363
QR G Y + D F IDAT GN +RF+NHSC PN I
Sbjct: 575 ANQRGVRYDQDGCSYLYDIDAHLDMSISRAGAKPFVIDATKHGNVARFINHSCAPNLINY 634
Query: 364 KWQVEGE----TRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYL 413
+ VE +G FA R I AGE L YDYR+ G C+CG S+C+G L
Sbjct: 635 EVLVESMDCQLAHIGFFANRDISAGEELAYDYRYKLLPGKGCACHCGVSTCRGRL 689
>gi|170591502|ref|XP_001900509.1| SET domain containing protein [Brugia malayi]
gi|158592121|gb|EDP30723.1| SET domain containing protein [Brugia malayi]
Length = 1056
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 86/149 (57%), Gaps = 1/149 (0%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N FRK K IK ++ GWG+ A E I E I+EYIG+ I + ++R + RG
Sbjct: 909 VNQLKFRK-KLIKFARSRIHGWGLYALEAIAPDEMIVEYIGQKIRPTVADEREKRYERRG 967
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I D IDAT GN +RF+NHSC PNC + V+GE R+ +++ +I G+
Sbjct: 968 MGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGD 1027
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA C+G L
Sbjct: 1028 EITYDYKFPIEEDKIDCLCGAPGCRGSLN 1056
>gi|452822785|gb|EME29801.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 [Galdieria
sulphuraria]
Length = 969
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 89/140 (63%), Gaps = 6/140 (4%)
Query: 278 KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCE--QRLWDMKYRGVQNFYMCEIR 335
K+ ++ G G+ E + EF+IEY GE+I + + ++ +D + G YM +
Sbjct: 833 KVKQSGIQGLGLYTLENLPDEEFVIEYAGELIRPVIADIREKFYDRRKIGC---YMFRLN 889
Query: 336 KDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV-GVFAARSIKAGEPLTYDYRFV 394
DF +DAT KGN++RF+NHSC+PNC + V+G+ +V G+FA R+I AGE LTYDY+F
Sbjct: 890 DDFIVDATMKGNYARFINHSCEPNCRSKIITVDGDKQVIGIFAKRNIAAGEELTYDYQFE 949
Query: 395 QFGPEVKCYCGASSCQGYLG 414
+FG + C CGA +C+G +
Sbjct: 950 EFGETIPCNCGAPNCRGKMN 969
>gi|325180900|emb|CCA15310.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 1258
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 99/176 (56%), Gaps = 8/176 (4%)
Query: 247 GPPPVQYISCSKA---CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFI 301
GP P + + + CH E C NR R + ++ + G+ + A E I G+ +
Sbjct: 603 GPAPTEESTRNNQTFNCHVGEHCGNRALHQRVYPRTELFHSFDKGFALRAKESIQAGQLV 662
Query: 302 IEYIGEVIDDALCEQRLWDM--KYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPN 359
+EY+GEVI++ ++RL + K+ +N Y+ E+ IDA F+G+ SRF+NHSCDPN
Sbjct: 663 MEYVGEVINEQEKDRRLEEHARKHPQDRNMYIMELGNQIYIDARFRGSVSRFINHSCDPN 722
Query: 360 CILEKWQVEGETRVGVFAARSIKAGEPLTYDYRF-VQFGPEVKCYCGASSCQGYLG 414
C L KW+V R+ + A R IK E L+YDYRF + KC+C ++ C+G +
Sbjct: 723 CHLVKWRVCDLDRIAISALRDIKPEEELSYDYRFYTSEALQWKCFCKSAKCRGTMA 778
>gi|291001085|ref|XP_002683109.1| predicted protein [Naegleria gruberi]
gi|284096738|gb|EFC50365.1| predicted protein [Naegleria gruberi]
Length = 147
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 89/147 (60%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K +K K+ WG+ A E I +GE +IEY+GEVI +AL + R + G+
Sbjct: 1 NQLDTRKKNLKFAKSPIHDWGLFALEDIEQGEMVIEYVGEVIREALSDIREKRYEQIGIG 60
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ + D IDAT +GN +RF+NHSCDPNC +V+ + +V ++A R I GE +
Sbjct: 61 SSYLFRLDNDLIIDATKRGNLARFINHSCDPNCCARIIEVDKQKKVCIYALRKILVGEEI 120
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYLG 414
TYDY+F ++ C CG+ C+ +L
Sbjct: 121 TYDYKFPIEESKIPCKCGSQKCKKWLN 147
>gi|320168697|gb|EFW45596.1| Setd1a protein [Capsaspora owczarzaki ATCC 30864]
Length = 1312
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 88/141 (62%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K +K ++ +G+ + E I+ + +IEY+GEVI ++ + R + RG+ + Y
Sbjct: 1171 RKKLLKFQRSGIHAFGLFSQENISANDLVIEYVGEVIRQSISDIREHHYERRGIGSSYFF 1230
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
I +D +DAT+KGN +RF+NH C+PNC + V+G R+ +++ R IK GE +TYDY+
Sbjct: 1231 RIDEDHVVDATYKGNLARFMNHCCEPNCYAKIIMVDGHQRIVIYSKRDIKKGEEITYDYK 1290
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C CGA +C+ +L
Sbjct: 1291 FPYEENKIPCLCGAVNCKKFL 1311
>gi|326476825|gb|EGE00835.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 628
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 103/192 (53%), Gaps = 23/192 (11%)
Query: 266 CNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
C NR F + +K ++++KTE G+GV + + I+EY GE++
Sbjct: 323 CRNRNFSELRKRIKTGGKYNIGVEVIKTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEA 382
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
++R+ + Y+ + FY+ + +D IDAT +G+ +RF+NHSC+PNC +EKW V G+ R+
Sbjct: 383 QRRMKTI-YKKNECFYLMDFDQDMIIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIA 440
Query: 375 VFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG-------TKRKIGKLEL 424
+FA I GE LTYDY F + + +C CG SC+G LG TK K
Sbjct: 441 LFAGDNGIMTGEELTYDYNFDPYSNKNVQECRCGTPSCRGVLGPRPKGKDTKEKSAAKPA 500
Query: 425 CWGSKRKRSSTA 436
G+KRK A
Sbjct: 501 STGTKRKSQGVA 512
>gi|294658913|ref|XP_461254.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
gi|218511781|sp|Q6BKL7.2|SET1_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|202953480|emb|CAG89643.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
Length = 1088
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 91/167 (54%), Gaps = 6/167 (3%)
Query: 254 ISCSKACHCSET--CNNRPFRKEKK-IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
IS K SET N K KK + ++ WG+ A EPI E IIEY+GE I
Sbjct: 922 ISAQKQMLGSETDILNLNALTKRKKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGESIR 981
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ E R G+ + Y+ I ++ +DAT KG +RF+NH C+P+C + +VEG+
Sbjct: 982 QQVAEHRERSYLKTGIGSSYLFRIDENTVVDATKKGGIARFINHCCNPSCTAKIIKVEGK 1041
Query: 371 TRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
R+ ++A R I+A E LTYDY+F ++C CGA C+GYL
Sbjct: 1042 KRIVIYALRDIEANEELTYDYKFEKETNDAERIRCLCGAPGCKGYLN 1088
>gi|116193789|ref|XP_001222707.1| hypothetical protein CHGG_06612 [Chaetomium globosum CBS 148.51]
gi|88182525|gb|EAQ89993.1| hypothetical protein CHGG_06612 [Chaetomium globosum CBS 148.51]
Length = 907
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 98/167 (58%), Gaps = 16/167 (9%)
Query: 266 CNNRPFR----KEKK-------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
C NR F+ + KK +++VKT G+GV + + I+EY GE+I +A C
Sbjct: 482 CQNRAFQDLQERTKKGGRYRVGVEVVKTGDRGYGVRSNRCFEANQIIMEYTGEIITEAEC 541
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
E+R+ + +Y+ + +Y+ ++ IDAT G+ +RF+NHSC PNC + KW V G+ R+
Sbjct: 542 ERRMNE-EYKDNECYYLMSFDQNMIIDATT-GSIARFVNHSCSPNCRMIKWIVAGQPRMA 599
Query: 375 VFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
+FA R I GE LTYDY F F + KC CG+ +C+G LG K K
Sbjct: 600 LFAGDRPIMTGEELTYDYNFDPFSAKNVQKCLCGSPNCRGVLGPKPK 646
>gi|194874189|ref|XP_001973356.1| GG13392 [Drosophila erecta]
gi|190655139|gb|EDV52382.1| GG13392 [Drosophila erecta]
Length = 2215
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 91/170 (53%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIVK---TEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C E C N+ ++ V+ T GWGV PI KG +I+EY+GEV
Sbjct: 1353 YTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEV 1412
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1413 VTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1471
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1472 GLSRMVLFAKRAIEEGEELTYDYNFSLFNPSEGQPCRCNTPQCRGVIGGK 1521
>gi|440633321|gb|ELR03240.1| hypothetical protein GMDG_01223 [Geomyces destructans 20631-21]
Length = 685
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 102/170 (60%), Gaps = 16/170 (9%)
Query: 263 SETCNNRPF-----RKEKK------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+E C NR F R+ K ++++KTE CG+GV A +G+ I+EY GE+I +
Sbjct: 503 AELCGNRAFADLHERRAKGGKYRVGVEVIKTEDCGYGVRANRCFQEGQIIVEYTGEIITE 562
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
C++R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC + KW V G+
Sbjct: 563 PECQRRMRE-DYKNNECYYLMLFDQNMIIDAT-RGSIARFVNHSCEPNCEMVKWIVGGKP 620
Query: 372 RVGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
+ +FA ++ I GE LTYDY+F +C CGA SC+G LG + K
Sbjct: 621 HMALFAGKNPIMTGEELTYDYKFDPISTRNVQECRCGAESCRGVLGPRPK 670
>gi|1335892|gb|AAB01100.1| ASH1 [Drosophila melanogaster]
Length = 2210
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 91/170 (53%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIVK---TEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C E C N+ ++ V+ T GWGV PI KG +I+EY+GEV
Sbjct: 1348 YTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEV 1407
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1408 VTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1466
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1467 GLSRMVLFAKRAIEEGEELTYDYNFSLFNPSEGQPCRCNTPQCRGVIGGK 1516
>gi|393244480|gb|EJD51992.1| histone methyltransferase [Auricularia delicata TFB-10046 SS5]
Length = 153
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 91/146 (62%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K+++ ++ WG+ A E I +GE +IEY+GE++ + ++R + +G+
Sbjct: 7 NQLQTRKKQLRFARSPIHDWGLYAMERIARGEMVIEYVGEIVRAPIADRREKLYERQGIG 66
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I ++ +DAT KGN R +NHSCDPNC + V G ++ ++A + I+ G+ L
Sbjct: 67 SSYLFRIDEELVVDATKKGNLGRLINHSCDPNCTAKIISVNGVKKIVIYAKQDIELGDEL 126
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYL 413
TYDY F + ++ C CGA+ C+G+L
Sbjct: 127 TYDYHFPREEAKIPCLCGAAKCRGFL 152
>gi|386771427|ref|NP_524160.2| absent, small, or homeotic discs 1, isoform B [Drosophila
melanogaster]
gi|386771429|ref|NP_001246834.1| absent, small, or homeotic discs 1, isoform C [Drosophila
melanogaster]
gi|118572535|sp|Q9VW15.3|ASH1_DROME RecName: Full=Histone-lysine N-methyltransferase ash1; AltName:
Full=Absent small and homeotic disks protein 1; AltName:
Full=Lysine N-methyltransferase 2H
gi|383292012|gb|AAF49140.3| absent, small, or homeotic discs 1, isoform B [Drosophila
melanogaster]
gi|383292013|gb|AFH04505.1| absent, small, or homeotic discs 1, isoform C [Drosophila
melanogaster]
Length = 2226
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 91/170 (53%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIVK---TEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C E C N+ ++ V+ T GWGV PI KG +I+EY+GEV
Sbjct: 1364 YTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEV 1423
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1424 VTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1482
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1483 GLSRMVLFAKRAIEEGEELTYDYNFSLFNPSEGQPCRCNTPQCRGVIGGK 1532
>gi|225685245|gb|EEH23529.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 756
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 99/170 (58%), Gaps = 16/170 (9%)
Query: 263 SETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+E C NR F ++ ++++KT G+GV + + I+EY GE+I
Sbjct: 342 AELCRNRSFEDLRQRIKAGGKYNIGVEVIKTADRGYGVRSNRTFAPNQIIVEYTGEIITQ 401
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V G+
Sbjct: 402 KECERRMRTV-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKP 459
Query: 372 RVGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
R+ +FA ++ I GE LTYDY F + + +C CGA +C+G LG K K
Sbjct: 460 RMALFAGKNGITTGEELTYDYNFDPYSQKNVQECRCGAETCRGVLGPKSK 509
>gi|195354202|ref|XP_002043588.1| GM17444 [Drosophila sechellia]
gi|194127756|gb|EDW49799.1| GM17444 [Drosophila sechellia]
Length = 2218
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 91/170 (53%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIKIVK---TEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C E C N+ ++ V+ T GWGV PI KG +I+EY+GEV
Sbjct: 1355 YTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEV 1414
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1415 VTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1473
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1474 GLSRMVLFAKRAIEEGEELTYDYNFSLFNPSEGQPCRCNTPQCRGVIGGK 1523
>gi|295663144|ref|XP_002792125.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279300|gb|EEH34866.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 816
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 99/170 (58%), Gaps = 16/170 (9%)
Query: 263 SETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+E C NR F ++ ++++KT G+GV + + I+EY GE++
Sbjct: 403 AELCRNRSFEDLRQRIKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYTGEIVTQ 462
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V G+
Sbjct: 463 KECERRMRTV-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKP 520
Query: 372 RVGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
R+ +FA ++ I GE LTYDY F + + +C CGA +C+G LG K K
Sbjct: 521 RMALFAGKNGITTGEELTYDYNFDPYSQKNVQECRCGAETCRGVLGPKSK 570
>gi|432887915|ref|XP_004074975.1| PREDICTED: uncharacterized protein LOC101162384 [Oryzias latipes]
Length = 1787
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KKI+ K+ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1641 NQLKFRK-KKIRFCKSNIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEEEGI 1699
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1700 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEE 1759
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1760 ITYDYKFPIEDEKIPCLCGAENCRGTL 1786
>gi|115438388|ref|XP_001218053.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114188868|gb|EAU30568.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 942
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 92/157 (58%), Gaps = 6/157 (3%)
Query: 285 CGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATF 344
C +G+ A + +FI EY+GEVI++ +R+ G+++FY + K +DAT
Sbjct: 236 CVFGMCAETDLRPHQFIFEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATK 295
Query: 345 KGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CY 403
KGN RF NHSC+PNC ++KW V + R+G+FA R I+AGE L ++Y ++G + + CY
Sbjct: 296 KGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERRIQAGEELVFNYNVDRYGADPQPCY 355
Query: 404 CGASSCQGYLGTKRKIGKLELCWGSKRKRSSTACLAI 440
CG +C G++G G+ + +K ++ L I
Sbjct: 356 CGEPNCTGFIG-----GRTQTERATKLSNATIEALGI 387
>gi|380493696|emb|CCF33689.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 833
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 87/143 (60%), Gaps = 5/143 (3%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
++++KT G+G+ A + I+EY GE+I D C +R+ + KY+ + +Y+ +
Sbjct: 480 VEVIKTPDRGYGIRANRCFESNQIIMEYTGEIITDEECSERMEN-KYKNNKCYYLMSFDQ 538
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA-RSIKAGEPLTYDYRFVQ 395
+ IDAT KG+ +RF+NHSC PNC + KW V G+ R+ +FA + I GE LTYDY F
Sbjct: 539 NMIIDAT-KGSIARFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDYNFSP 597
Query: 396 FGPE--VKCYCGASSCQGYLGTK 416
F E C CGA +C+G LG K
Sbjct: 598 FSDENVQTCLCGAPNCRGILGPK 620
>gi|195173304|ref|XP_002027432.1| GL20883 [Drosophila persimilis]
gi|194113284|gb|EDW35327.1| GL20883 [Drosophila persimilis]
Length = 2266
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 92/170 (54%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C +E C N+ ++ + ++ T GWGV PI KG +I+EY+GEV
Sbjct: 1404 YTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTLDKGWGVRTKLPIAKGTYILEYVGEV 1463
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1464 VTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1522
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1523 GLSRMVLFAKRPIEQGEELTYDYNFSLFNPSEGQPCRCNMPQCRGVIGGK 1572
>gi|348532887|ref|XP_003453937.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Oreochromis niloticus]
Length = 1846
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KKI+ K+ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1700 NQLKFRK-KKIRFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEEEGI 1758
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1759 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEE 1818
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1819 ITYDYKFPIEDEKIPCLCGAENCRGTL 1845
>gi|157734198|gb|ABV68922.1| SDG25 [Arabidopsis thaliana]
Length = 1388
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 88/142 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K ++ +++ WG+ A EPI +F+IEY+GE+I ++ E R + G+ + Y+
Sbjct: 1247 RKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSISEIRERQYEKMGIGSSYLF 1306
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I AGE ++Y+Y+
Sbjct: 1307 RLDDGYVLDATKRGGIARFINHSCEPNCYTKIISVEGKKKIFIYAKRHIDAGEEISYNYK 1366
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F ++ C CGA C+G L
Sbjct: 1367 FPLEDDKIPCNCGAPKCRGSLN 1388
>gi|198463835|ref|XP_001352956.2| GA21391 [Drosophila pseudoobscura pseudoobscura]
gi|198151432|gb|EAL30457.2| GA21391 [Drosophila pseudoobscura pseudoobscura]
Length = 2266
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 92/170 (54%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C +E C N+ ++ + ++ T GWGV PI KG +I+EY+GEV
Sbjct: 1404 YTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTLDKGWGVRTKLPIAKGTYILEYVGEV 1463
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1464 VTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1522
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1523 GLSRMVLFAKRPIEQGEELTYDYNFSLFNPSEGQPCRCNMPQCRGVIGGK 1572
>gi|296422581|ref|XP_002840838.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637063|emb|CAZ85029.1| unnamed protein product [Tuber melanosporum]
Length = 1200
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 87/149 (58%), Gaps = 3/149 (2%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ +++K +K ++ WG+ A E I+ G+ IIEY+GE+I + + R G+
Sbjct: 1051 NQLKKRKKPVKFARSAIHNWGLYAMENISAGDMIIEYVGEIIRQQVADMREKKYLKSGIG 1110
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I IDAT G +RF+NHSC PNC + +VEG R+ ++A R I+ E L
Sbjct: 1111 SSYLFRIDDTTVIDATKAGGIARFINHSCTPNCTAKIIKVEGSKRIVIYALRDIRENEEL 1170
Query: 388 TYDYRF---VQFGPEVKCYCGASSCQGYL 413
TYDY+F ++ + C CG+S C+G+L
Sbjct: 1171 TYDYKFERELESEERIPCLCGSSGCKGFL 1199
>gi|66805051|ref|XP_636258.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74852293|sp|Q54HS3.1|SET1_DICDI RecName: Full=Histone-lysine N-methyltransferase set1; AltName:
Full=Histone H3 lysine 4 methyltransferase; AltName:
Full=SET domain-containing protein 1
gi|60464689|gb|EAL62816.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 1486
Score = 117 bits (292), Expect = 2e-23, Method: Composition-based stats.
Identities = 52/141 (36%), Positives = 84/141 (59%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
+ K+IK +++ WG+ A E I+ + +IEYIGEVI + ++R +G+ + Y+
Sbjct: 1345 RRKRIKFERSDIHDWGLFAMETISAKDMVIEYIGEVIRQKVADEREKRYVKKGIGSSYLF 1404
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D IDATFKGN +RF+NH CDPNCI + + + ++ ++A R I GE +TYDY+
Sbjct: 1405 RVDDDTIIDATFKGNLARFINHCCDPNCIAKVLTIGNQKKIIIYAKRDINIGEEITYDYK 1464
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C C + C+ L
Sbjct: 1465 FPIEDVKIPCLCKSPKCRQTL 1485
>gi|147899914|ref|NP_001087630.1| histone-lysine N-methyltransferase SETD1B [Xenopus laevis]
gi|82234463|sp|Q66J90.1|SET1B_XENLA RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|51703454|gb|AAH81016.1| MGC81602 protein [Xenopus laevis]
Length = 1938
Score = 117 bits (292), Expect = 2e-23, Method: Composition-based stats.
Identities = 57/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KKI+ K+ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1792 NQLKFRK-KKIRFCKSHIHDWGLFAMEPIVADEMVIEYVGQNIRQVIADMREKRYEDEGI 1850
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1851 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEE 1910
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1911 ITYDYKFPIEDVKIPCLCGAENCRGTL 1937
>gi|156393989|ref|XP_001636609.1| predicted protein [Nematostella vectensis]
gi|156223714|gb|EDO44546.1| predicted protein [Nematostella vectensis]
Length = 213
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 85/142 (59%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++K K+ WG+ A EPI E +IEY+GEVI A+ + R + RG+ + YM
Sbjct: 72 RKKQLKFAKSSIHDWGLFALEPIAADEMVIEYVGEVIRQAIADYRERCYEERGIGSSYMF 131
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ + IDAT GNF+RF+NH CDPNC + VE ++ +++ R I+ E +TYDY+
Sbjct: 132 RLDETTIIDATTMGNFARFINHCCDPNCYAKVIAVENMKKIVIYSKRDIQVDEEITYDYK 191
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F ++ C CGA C+G L
Sbjct: 192 FPIEDEKIPCLCGAPQCRGTLN 213
>gi|428183837|gb|EKX52694.1| hypothetical protein GUITHDRAFT_65244 [Guillardia theta CCMP2712]
Length = 211
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 96/167 (57%), Gaps = 4/167 (2%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEKKIK--IVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
Y+ C+ C C ++C N+ F+K + ++ + K + GW + E + +G F+IEY+GE++
Sbjct: 45 YVECTPGFCPCGDSCQNQRFQKCQYVRTEVKKVDKRGWALFTMEDVQQGTFVIEYMGEIL 104
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDF-TIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + E+R F + F IDA+ K + RF+NHSCDPNC +++
Sbjct: 105 NRRMYERRKKAYAKEKHTYFMVLNTSPIFEVIDASRKSSMGRFINHSCDPNCHTHRYRSL 164
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGT 415
GE VG+FA R I+ GE +T DY+ KC+CGA +C+G+LG+
Sbjct: 165 GEVVVGIFAKRDIEKGEEITIDYQMFDGAATKKCHCGAKNCKGFLGS 211
>gi|396469794|ref|XP_003838493.1| hypothetical protein LEMA_P114290.1 [Leptosphaeria maculans JN3]
gi|312215061|emb|CBX95014.1| hypothetical protein LEMA_P114290.1 [Leptosphaeria maculans JN3]
Length = 880
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/168 (38%), Positives = 93/168 (55%), Gaps = 15/168 (8%)
Query: 264 ETCNNRPFRKEKK----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
E C NRPF + K+ +++V T G+GV A + I+EY GE+I
Sbjct: 408 EQCGNRPFAELKRRAKGNRYDYGVEVVDTHDRGFGVRAMRTFEPHQIIVEYAGEIITQWE 467
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
CE+R+ + Y+ + +Y+ IDAT +G +RF+NHSC+PNC + KW V GE R+
Sbjct: 468 CERRMKQV-YKKDKCYYLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRM 525
Query: 374 GVFA-ARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
+FA +R + G+ LTYDY F F + C CG SC+G LG K K
Sbjct: 526 ALFAGSRGVMTGDELTYDYNFDPFSQKNIQVCRCGTESCRGVLGPKPK 573
>gi|443726566|gb|ELU13685.1| hypothetical protein CAPTEDRAFT_150651 [Capitella teleta]
Length = 292
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 86/148 (58%), Gaps = 1/148 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K +K K+ WG+ A EPI +GE +IEY+G V+ + + R + G+
Sbjct: 146 NQLKFRK-KNLKFAKSGIHDWGLFALEPIAEGEMVIEYVGAVVRQSTADLREKKYEAMGI 204
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I D IDAT GN +RF+NHSC+PNC+ + VE ++ +++ R I E
Sbjct: 205 GSSYLFRIDHDLIIDATKCGNLARFINHSCNPNCVAKIITVESHKKIVIYSRRDIGVNEE 264
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CG S+C+G L
Sbjct: 265 ITYDYKFPLEDEKIPCLCGTSACRGTLN 292
>gi|260830013|ref|XP_002609956.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
gi|229295318|gb|EEN65966.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
Length = 902
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 100/174 (57%), Gaps = 7/174 (4%)
Query: 251 VQYISCS-KACHCSETCNNRPFRK---EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIG 306
+ ++ CS C ++ C N+ ++ + ++ + T+ G+GV + PI +G FI+EY+G
Sbjct: 85 MSFMECSPNTCPYADQCANQRIQRHEWDPGLERIVTKDRGYGVRSKTPIPQGNFILEYVG 144
Query: 307 EVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
EV+ + +R ++ Y + Y + ID G RF+NHSC+PNC ++KW
Sbjct: 145 EVVSEQEFRRRTVEI-YHDHNHHYCLNLHSGAVIDGYKYGCEGRFVNHSCEPNCEMQKWS 203
Query: 367 VEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK--CYCGASSCQGYLGTKRK 418
V G R+G+FA R I AGE LTYDY F F E + C CG++ C+G++G K +
Sbjct: 204 VNGVYRIGLFALRDIPAGEELTYDYNFHAFNMEKQQICKCGSAKCRGFIGGKNQ 257
>gi|336259450|ref|XP_003344526.1| hypothetical protein SMAC_07534 [Sordaria macrospora k-hell]
gi|380093240|emb|CCC08898.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1314
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 84/146 (57%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1169 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYL 1228
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1229 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDY 1288
Query: 392 RF---VQFGPEVKCYCGASSCQGYLG 414
+F + + C CG ++C+G+L
Sbjct: 1289 KFEREIGSTDRIPCLCGTAACKGFLN 1314
>gi|47223666|emb|CAF99275.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1830
Score = 116 bits (290), Expect = 2e-23, Method: Composition-based stats.
Identities = 57/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KKI+ K+ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1684 NQLKFRK-KKIRFCKSHIHDWGLFALEPIAADEMVIEYVGQNIRQVIADMREKRYEEEGI 1742
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1743 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEE 1802
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1803 ITYDYKFPIEDVKIPCLCGAENCRGTL 1829
>gi|389585110|dbj|GAB67841.1| hypothetical protein PCYB_124070 [Plasmodium cynomolgi strain B]
Length = 2974
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/166 (37%), Positives = 96/166 (57%), Gaps = 6/166 (3%)
Query: 266 CNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW---- 319
C NRPF+ K+++I +TE G+GV I GE I EY+GEV+ E+R+
Sbjct: 2537 CFNRPFKHSAIKELEIKQTERTGFGVFCKRDIKNGELICEYVGEVLGKKEFEERMEAYQE 2596
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ K + N+Y +I +D ID+ KG+ SRF+NHSC PN + +KW V G R+G+FA +
Sbjct: 2597 ESKKTDMYNWYSIQINRDVHIDSRRKGSISRFVNHSCSPNSVSQKWIVRGFYRIGIFAQQ 2656
Query: 380 SIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTKRKIGKLELC 425
I AGE +TY+Y + +C C +++C Y K++ + E+
Sbjct: 2657 DIPAGEEITYNYSYNFVFNNFECLCNSANCMNYNLKKKEETENEMS 2702
>gi|134079644|emb|CAK97070.1| unnamed protein product [Aspergillus niger]
Length = 885
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 105/192 (54%), Gaps = 27/192 (14%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C E C NR F + K+ ++++KT G+GV + + I+EY GE+
Sbjct: 421 CGVGEECGNRSFEELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYTGEI 480
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I CE+R+ + Y+ +N IDAT +G+ +RF+NHSC+PNC +EKW V
Sbjct: 481 ITQTECEKRMRTI-YKHNENMI---------IDAT-RGSIARFVNHSCEPNCRMEKWTVA 529
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG--TKRKIGKLE 423
G+ R+ +FA R I GE LTYDY F + + +C CG+S+C+G LG K K+ + +
Sbjct: 530 GKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGSSNCRGILGPRPKEKVQRAK 589
Query: 424 LCWGSKRKRSST 435
K K+S+T
Sbjct: 590 EQKVEKSKKSAT 601
>gi|336472713|gb|EGO60873.1| hypothetical protein NEUTE1DRAFT_144212 [Neurospora tetrasperma FGSC
2508]
Length = 1282
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 84/145 (57%), Gaps = 3/145 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1137 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYL 1196
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1197 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDY 1256
Query: 392 RF---VQFGPEVKCYCGASSCQGYL 413
+F + + C CG ++C+G+L
Sbjct: 1257 KFEREIGSTDRIPCLCGTAACKGFL 1281
>gi|74697791|sp|Q8X0S9.1|SET1_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|18376303|emb|CAD21415.1| related to regulatory protein SET1 [Neurospora crassa]
Length = 1313
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 84/145 (57%), Gaps = 3/145 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1168 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYL 1227
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1228 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDY 1287
Query: 392 RF---VQFGPEVKCYCGASSCQGYL 413
+F + + C CG ++C+G+L
Sbjct: 1288 KFEREIGSTDRIPCLCGTAACKGFL 1312
>gi|315047188|ref|XP_003172969.1| SUV39H [Arthroderma gypseum CBS 118893]
gi|311343355|gb|EFR02558.1| SUV39H [Arthroderma gypseum CBS 118893]
Length = 703
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 97/170 (57%), Gaps = 12/170 (7%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
++++KTE G+GV + + I+EY GE++ ++R+ + Y+ + FY+ + +
Sbjct: 428 VEVIKTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRRMKTI-YKKNECFYLMDFDQ 486
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARS-IKAGEPLTYDYRFVQ 395
D IDAT +G+ +RF+NHSC+PNC +EKW V G+ R+ +FA + I GE LTYDY F
Sbjct: 487 DMIIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDP 545
Query: 396 FGPE--VKCYCGASSCQGYLG-------TKRKIGKLELCWGSKRKRSSTA 436
+ + +C CG SC+G LG +K K G+KRK A
Sbjct: 546 YSNKNVQECRCGTPSCRGVLGPRPKGKDSKEKSATKSAGAGTKRKSQGVA 595
>gi|344228738|gb|EGV60624.1| histone H3-K4 methyltransferase Set1 [Candida tenuis ATCC 10573]
Length = 1037
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 91/167 (54%), Gaps = 6/167 (3%)
Query: 254 ISCSKACHCSET---CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVID 310
IS K SET N +++K + ++ WG+ A EPI E IIEY+GE I
Sbjct: 871 ISAQKQAIGSETDILSLNTLAKRKKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGESIR 930
Query: 311 DALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
+ E R G+ + Y+ I ++ IDAT KG +RF+NH C P+C + +V+G+
Sbjct: 931 QQVAEHRERSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVDGK 990
Query: 371 TRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
R+ ++A R I+A E LTYDY+F ++C CGA C+GYL
Sbjct: 991 KRIVIYALRDIEANEELTYDYKFERETNDSERIRCLCGAPGCKGYLN 1037
>gi|448520177|ref|XP_003868242.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis Co
90-125]
gi|380352581|emb|CCG22808.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis]
Length = 1038
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 87/146 (59%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 893 KRKKPVTFARSTIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLRTGIGSSYL 952
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NH C+P+C + +VEG+ R+ ++A R I+A E LTYDY
Sbjct: 953 FRIDENTVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDY 1012
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F + + ++C CGA C+GYL
Sbjct: 1013 KFERETNDEERIRCLCGAPGCKGYLN 1038
>gi|449533627|ref|XP_004173774.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like, partial
[Cucumis sativus]
Length = 137
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/128 (47%), Positives = 82/128 (64%), Gaps = 2/128 (1%)
Query: 297 KGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSC 356
G+FIIEY GEVI ++R + +G+++ Y+ + +IDAT KG+ +RF+NHSC
Sbjct: 1 NGQFIIEYCGEVISWKEAKRRSHTYENQGLKDAYIISLNASESIDATRKGSLARFINHSC 60
Query: 357 DPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYLGT 415
PNC KW V GE RVG+FA + I G L YDY F + G +V+C CGASSC G+LG
Sbjct: 61 FPNCETRKWNVLGEIRVGIFAKQDISIGTELAYDYNFEWYGGAKVRCLCGASSCSGFLGA 120
Query: 416 KRKIGKLE 423
K + G LE
Sbjct: 121 KSR-GFLE 127
>gi|350294046|gb|EGZ75131.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Neurospora
tetrasperma FGSC 2509]
Length = 1313
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 84/145 (57%), Gaps = 3/145 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1168 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYL 1227
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1228 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDY 1287
Query: 392 RF---VQFGPEVKCYCGASSCQGYL 413
+F + + C CG ++C+G+L
Sbjct: 1288 KFEREIGSTDRIPCLCGTAACKGFL 1312
>gi|327306067|ref|XP_003237725.1| hypothetical protein TERG_02440 [Trichophyton rubrum CBS 118892]
gi|326460723|gb|EGD86176.1| hypothetical protein TERG_02440 [Trichophyton rubrum CBS 118892]
Length = 711
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 97/167 (58%), Gaps = 16/167 (9%)
Query: 266 CNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
C NR F + +K ++++KTE G+GV + + I+EY GE++
Sbjct: 407 CRNRNFSELRKRIKTGGKYNIGVEVIKTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEA 466
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
++R+ + Y+ + FY+ + +D IDAT +G+ +RF+NHSC+PNC +EKW V G+ R+
Sbjct: 467 QRRMKTI-YKKNECFYLMDFDQDMIIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIA 524
Query: 375 VFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
+FA + I GE LTYDY F + + +C CG SC+G LG + K
Sbjct: 525 LFAGDNGIMTGEELTYDYNFDPYSNKNVQECRCGTPSCRGVLGPRPK 571
>gi|425778490|gb|EKV16615.1| Histone-lysine N-methyltransferase (Ash1), putative [Penicillium
digitatum PHI26]
gi|425784214|gb|EKV22005.1| Histone-lysine N-methyltransferase (Ash1), putative [Penicillium
digitatum Pd1]
Length = 770
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 99/173 (57%), Gaps = 16/173 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F K ++++KT G+GV + + + I+EY GE+
Sbjct: 415 CRLGPECGNRNFEGLKHRTKAGGKYNIGVEVIKTADRGYGVRSNRSFDPNQIIVEYTGEI 474
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NH+C+PNC +EKW V
Sbjct: 475 LTQLECEKRMRTV-YKNNECYYLMYFDQNMIIDAT-RGSIARFVNHACEPNCRMEKWTVA 532
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
G+ R+ +FA R I GE L+YDY F + + +C CG+++C+G+LG + K
Sbjct: 533 GKPRMALFAGDRGISTGEELSYDYNFDPYSNKNVQQCRCGSANCRGFLGPRLK 585
>gi|302664384|ref|XP_003023822.1| histone-lysine N-methyltransferase (Ash1), putative [Trichophyton
verrucosum HKI 0517]
gi|291187840|gb|EFE43204.1| histone-lysine N-methyltransferase (Ash1), putative [Trichophyton
verrucosum HKI 0517]
Length = 709
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 97/167 (58%), Gaps = 16/167 (9%)
Query: 266 CNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALC 314
C NR F + +K ++++KTE G+GV + + I+EY GE++
Sbjct: 404 CRNRNFSELRKRIKTGGKYNIGVEVIKTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEA 463
Query: 315 EQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVG 374
++R+ + Y+ + FY+ + +D IDAT +G+ +RF+NHSC+PNC +EKW V G+ R+
Sbjct: 464 QRRMKTI-YKKNECFYLMDFDQDMIIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIA 521
Query: 375 VFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
+FA + I GE LTYDY F + + +C CG SC+G LG + K
Sbjct: 522 LFAGDNGIMTGEELTYDYNFDPYSNKNVQECRCGTPSCRGVLGPRPK 568
>gi|164426120|ref|XP_961572.2| hypothetical protein NCU01206 [Neurospora crassa OR74A]
gi|157071206|gb|EAA32336.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1150
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 84/146 (57%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1005 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYL 1064
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1065 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDY 1124
Query: 392 RF---VQFGPEVKCYCGASSCQGYLG 414
+F + + C CG ++C+G+L
Sbjct: 1125 KFEREIGSTDRIPCLCGTAACKGFLN 1150
>gi|357611223|gb|EHJ67375.1| hypothetical protein KGM_13830 [Danaus plexippus]
Length = 1798
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 90/170 (52%), Gaps = 7/170 (4%)
Query: 253 YISCS-KACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y CS + C C + C N+ ++ + ++ TE GWGV + I G+FI+EY+GEV
Sbjct: 932 YSECSPQLCPCGDKCKNQRIQRHEWVPGLEKFMTENKGWGVRTKQMIRSGDFILEYVGEV 991
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ D ++R+ +Y + Y + ID G RF+NHSC PNC ++KW
Sbjct: 992 VSDKEFKERMA-TRYARDTHHYCLHLDGGLVIDGHRVGGDGRFVNHSCRPNCEMQKWTAN 1050
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPEV--KCYCGASSCQGYLGTK 416
G R+ +FA R I+ E LTYDY F F P V C C + C+G +G K
Sbjct: 1051 GTFRMALFALRDIEPDEELTYDYNFSLFNPAVGQPCKCDSEDCRGVIGGK 1100
>gi|402593200|gb|EJW87127.1| SET domain-containing protein, partial [Wuchereria bancrofti]
Length = 602
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 86/149 (57%), Gaps = 1/149 (0%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N FRK K IK ++ GWG+ A E I E I+EYIG+ I + ++R + RG
Sbjct: 455 VNQLKFRK-KLIKFARSRIHGWGLYALEVIAPDEMIVEYIGQKIRPTVADEREKRYERRG 513
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I D IDAT GN +RF+NHSC PNC + V+GE R+ +++ +I G+
Sbjct: 514 MGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGD 573
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA C+G L
Sbjct: 574 EITYDYKFPIEEDKIDCLCGAPGCRGSLN 602
>gi|312091131|ref|XP_003146871.1| histone methyltransferase [Loa loa]
gi|307757965|gb|EFO17199.1| histone methyltransferase [Loa loa]
Length = 278
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 86/149 (57%), Gaps = 1/149 (0%)
Query: 266 CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRG 325
N FRK K IK ++ GWG+ A E I E I+EYIG+ I + ++R + RG
Sbjct: 131 VNQLKFRK-KLIKFARSRIHGWGLYALEAIAPDEMIVEYIGQKIRPTVADEREKRYERRG 189
Query: 326 VQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGE 385
+ + Y+ I D IDAT GN +RF+NHSC PNC + V+GE R+ +++ +I G+
Sbjct: 190 MGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGD 249
Query: 386 PLTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA C+G L
Sbjct: 250 EITYDYKFPIEEDKIDCLCGAPGCRGSLN 278
>gi|123703948|ref|NP_001038599.2| histone-lysine N-methyltransferase SETD1B-A [Danio rerio]
gi|166977691|sp|Q1LY77.2|SE1BA_DANRE RecName: Full=Histone-lysine N-methyltransferase SETD1B-A; AltName:
Full=SET domain-containing protein 1B-A
gi|123293815|emb|CAK10781.2| novel protein [Danio rerio]
Length = 1844
Score = 115 bits (289), Expect = 3e-23, Method: Composition-based stats.
Identities = 56/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KKI+ ++ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1698 NQLKFRK-KKIRFCRSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGI 1756
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1757 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEE 1816
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1817 ITYDYKFPIEDEKIPCLCGAENCRGTL 1843
>gi|407921620|gb|EKG14761.1| hypothetical protein MPH_08036 [Macrophomina phaseolina MS6]
Length = 1167
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 94/173 (54%), Gaps = 9/173 (5%)
Query: 251 VQYISCSKACHCSETCNNRPFR------KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEY 304
V IS K ++ +N R ++K +K ++ WG+ A E I+ + IIEY
Sbjct: 995 VNEISTQKQALAADGSDNIAIRFNQLKKRKKLVKFDRSAIHNWGLYAQEDISANDMIIEY 1054
Query: 305 IGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEK 364
+GE + + + R +GV + Y+ I +D +DAT KG +RF+NHSC PNC +
Sbjct: 1055 VGEKVRQKVADIREIKYDKQGVGSSYLFRIDEDSVVDATKKGGIARFINHSCSPNCTAKI 1114
Query: 365 WQVEGETRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
+V+G R+ ++A R IK E LTYDY+F + + C CG+ +C+G+L
Sbjct: 1115 IRVDGTKRIVIYALRDIKTNEELTYDYKFEREIGSDDRIPCLCGSVNCKGFLN 1167
>gi|345567899|gb|EGX50801.1| hypothetical protein AOL_s00054g887 [Arthrobotrys oligospora ATCC
24927]
Length = 1338
Score = 115 bits (289), Expect = 3e-23, Method: Composition-based stats.
Identities = 54/149 (36%), Positives = 87/149 (58%), Gaps = 3/149 (2%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ +++K ++ ++ WG+ A E I+ E IIEY+GE++ + + R + G+
Sbjct: 1189 NQLKKRKKPVRFARSAIHNWGLYAMENISNNEMIIEYVGEIVRQQVADLREKNYLRSGIG 1248
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E L
Sbjct: 1249 SSYLFRIDETTVIDATKKGGIARFINHSCTPNCTAKIIKVEGTKRIVIYALRDIHKDEEL 1308
Query: 388 TYDYRF---VQFGPEVKCYCGASSCQGYL 413
TYDY+F + + C CG+S C+G+L
Sbjct: 1309 TYDYKFEREIDSEERIPCLCGSSGCKGFL 1337
>gi|340373417|ref|XP_003385238.1| PREDICTED: hypothetical protein LOC100636150 [Amphimedon
queenslandica]
Length = 1053
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 85/142 (59%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++ K+ WG+ A E I E ++EYIG+V+ + ++R + +G+ + Y+
Sbjct: 912 RKKQLTFAKSTIHNWGLFALETIPADEMVVEYIGQVVRHGIADERERRYEAQGIGSSYLF 971
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D IDAT GNF+RF+NH CDPNC + V + ++ +++ R I+AGE +TYDY+
Sbjct: 972 RVDYDHVIDATKSGNFARFINHCCDPNCYAKIITVGNQKKIVIYSKRDIRAGEEITYDYK 1031
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F ++ C CGA C+G L
Sbjct: 1032 FPIEDEKIPCLCGAPQCRGTLN 1053
>gi|327349992|gb|EGE78849.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 775
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 64/173 (36%), Positives = 99/173 (57%), Gaps = 16/173 (9%)
Query: 263 SETCNNRPF---RKEKK--------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+E C NR F R+ K ++++KT G+GV + + I+EY GE++
Sbjct: 406 AELCGNRNFEGLRQRSKLGGKYNIGVEVIKTADRGYGVRSNRTFAPNQIIVEYTGEIVTQ 465
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V G+
Sbjct: 466 EECERRMRTV-YKNNECYYLMYFDRNMIIDAT-RGSIARFVNHSCEPNCKIEKWTVAGKP 523
Query: 372 RVGVFAAR-SIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRKIGK 421
R+ +FA I GE LTYDY F + + +C CGA +C+G LG K K K
Sbjct: 524 RMALFAGEHGIMTGEELTYDYNFDPYSQKNVQECRCGAPTCRGVLGPKPKDSK 576
>gi|118404602|ref|NP_001072649.1| histone-lysine N-methyltransferase SETD1B [Xenopus (Silurana)
tropicalis]
gi|123884540|sp|Q08D57.1|SET1B_XENTR RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|115312893|gb|AAI23933.1| hypothetical protein MGC145850 [Xenopus (Silurana) tropicalis]
Length = 1956
Score = 115 bits (289), Expect = 4e-23, Method: Composition-based stats.
Identities = 56/147 (38%), Positives = 86/147 (58%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK KK++ K+ WG+ A EPI E +IEY+G+ I + + R + G+
Sbjct: 1810 NQLKFRK-KKLRFCKSHIHDWGLFAMEPIIADEMVIEYVGQNIRQVIADMREKRYEDEGI 1868
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ YM + D IDAT GNF+RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1869 GSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEE 1928
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 1929 ITYDYKFPIEDVKIPCLCGAENCRGTL 1955
>gi|241948091|ref|XP_002416768.1| COMPASS complex histone methyltransferase subunit, putative;
histone-lysine n-methyltransferase, h3 lysine-4 specific,
putative [Candida dubliniensis CD36]
gi|223640106|emb|CAX44352.1| COMPASS complex histone methyltransferase subunit, putative [Candida
dubliniensis CD36]
Length = 1032
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 86/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 887 KRKKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGSSYL 946
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NH C P+C + +VEG+ R+ ++A R I+A E LTYDY
Sbjct: 947 FRIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDY 1006
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F + + ++C CGA C+GYL
Sbjct: 1007 KFERETNDEERIRCLCGAPGCKGYLN 1032
>gi|354544237|emb|CCE40960.1| hypothetical protein CPAR2_109980 [Candida parapsilosis]
Length = 1042
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 87/146 (59%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 897 KRKKPVTFARSTIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLRTGIGSSYL 956
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NH C+P+C + +VEG+ R+ ++A R I+A E LTYDY
Sbjct: 957 FRIDENTVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDY 1016
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F + + ++C CGA C+GYL
Sbjct: 1017 KFERETNDEERIRCLCGAPGCKGYLN 1042
>gi|238879404|gb|EEQ43042.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1040
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 895 KRKKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGSSYL 954
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NH C P+C + +VEG+ R+ ++A R I+A E LTYDY
Sbjct: 955 FRIDDNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDY 1014
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F + + ++C CGA C+GYL
Sbjct: 1015 KFERETNDEERIRCLCGAPGCKGYLN 1040
>gi|195377533|ref|XP_002047543.1| GJ11874 [Drosophila virilis]
gi|194154701|gb|EDW69885.1| GJ11874 [Drosophila virilis]
Length = 2343
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 92/170 (54%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y C+ + C + C N+ ++ + ++ T GWGV PI KG +I+EY+GEV
Sbjct: 1468 YTECAPSNCPAGDKCRNQKIQRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEV 1527
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1528 VTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1586
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1587 GLSRMVLFAKRAIEQGEELTYDYNFSLFNPSEGQPCRCNMPQCRGVIGGK 1636
>gi|68473736|ref|XP_718971.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|68473945|ref|XP_718869.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|74586641|sp|Q5ABG1.1|SET1_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46440662|gb|EAK99965.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|46440768|gb|EAL00070.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
Length = 1040
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 895 KRKKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGSSYL 954
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NH C P+C + +VEG+ R+ ++A R I+A E LTYDY
Sbjct: 955 FRIDDNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDY 1014
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F + + ++C CGA C+GYL
Sbjct: 1015 KFERETNDEERIRCLCGAPGCKGYLN 1040
>gi|385305977|gb|EIF49918.1| putative compass histone methyltransferase subunit set1p [Dekkera
bruxellensis AWRI1499]
Length = 1104
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 88/150 (58%), Gaps = 3/150 (2%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ +++K ++ ++ WG+ A EPI E I EY+GE I + E R G+
Sbjct: 955 NQLLKRQKPVQFARSAIHNWGLYAXEPIAAKEMIXEYVGERIRQKVAEVREKRYLKSGIG 1014
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ + + IDA+ KG +RF+NH CDP+C + +V+G+ R+ ++A R I A E L
Sbjct: 1015 SSYLFRVDDNTVIDASKKGGIARFINHCCDPSCTAKIIKVDGKKRIVIYALRDIAANEEL 1074
Query: 388 TYDYRF-VQFGPE--VKCYCGASSCQGYLG 414
TYDY+F + PE + C CGA +C+GYL
Sbjct: 1075 TYDYKFEKETNPEERIPCLCGAPNCKGYLN 1104
>gi|359476736|ref|XP_002269759.2| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Vitis
vinifera]
Length = 1517
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 92/177 (51%), Gaps = 19/177 (10%)
Query: 256 CSKACHCSETCNNRPFRKEKKIK--IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C+ C C+ TC NR + ++K + +TE GW V A E I +G FI EYIGEV+ +
Sbjct: 1340 CNGKCSCNRTCQNRVLQNGVRVKLEVFRTEEKGWAVRAGEAILRGTFICEYIGEVLSEQE 1399
Query: 314 CEQR------------LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCI 361
++R +D+ + E + + IDAT GN SRF+NHSC PN I
Sbjct: 1400 ADKRGNNRHGEEGCSYFYDIDSHINDMSRLVEGQVPYVIDATRYGNVSRFINHSCSPNLI 1459
Query: 362 LEKWQVEGE----TRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYL 413
+ VE +G+FA R I GE LTYDYR+ G C+CGAS C+G L
Sbjct: 1460 NHQVLVESMDCQLAHIGLFANRDISLGEELTYDYRYKPLPGEGYPCHCGASKCRGRL 1516
>gi|328858772|gb|EGG07883.1| hypothetical protein MELLADRAFT_74594 [Melampsora larici-populina
98AG31]
Length = 191
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 87/146 (59%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ ++K++K ++ WG+ A E I GE +IEY+GEVI A+ ++R + G+
Sbjct: 45 NQLRTRKKQLKFARSPIHDWGLYAMEMIPVGEMVIEYVGEVIRQAVADRREKAYERMGIG 104
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ + D +DAT KGN R +NH C PNC + + GE ++ ++A +I+ G+ +
Sbjct: 105 SSYLFRVDDDLVVDATKKGNLGRLINHCCAPNCTAKIITINGEKKIVIYAKATIELGDEV 164
Query: 388 TYDYRFVQFGPEVKCYCGASSCQGYL 413
TYDY F + ++ C CG+S C+G L
Sbjct: 165 TYDYHFPKEDVKIPCLCGSSKCKGTL 190
>gi|367016577|ref|XP_003682787.1| hypothetical protein TDEL_0G02090 [Torulaspora delbrueckii]
gi|359750450|emb|CCE93576.1| hypothetical protein TDEL_0G02090 [Torulaspora delbrueckii]
Length = 687
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 96/158 (60%), Gaps = 3/158 (1%)
Query: 262 CSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW 319
C + C N+ F+K + I + +TE G+GV A I +FI EY+GEVI++ +RL
Sbjct: 96 CGDDCQNQRFQKRQYADIAVFQTEMKGYGVRAETDIEAHQFIYEYMGEVIEEEEFRERLV 155
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ + +++FY ++ IDAT KG+ +RF NHSC+PN + KW V G+ ++G+FA R
Sbjct: 156 EYDQKKLKHFYFMMLQNGEFIDATMKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFANR 215
Query: 380 SIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTK 416
I GE +T+DY ++G KCYC +C G+LG K
Sbjct: 216 KILKGEEITFDYNVDRYGATAQKCYCDEPNCIGFLGGK 253
>gi|260944792|ref|XP_002616694.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
gi|238850343|gb|EEQ39807.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
Length = 469
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 324 KRKKPVSYARSAIHNWGLYALEPIAAKEMIIEYVGESIRQQVAEHREKSYLRTGIGSSYL 383
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NH C+P+C + +V+G+ R+ ++A R I+A E LTYDY
Sbjct: 384 FRIDENTVIDATKKGGIARFINHCCNPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDY 443
Query: 392 RF---VQFGPEVKCYCGASSCQGYLG 414
+F ++C CGA C+GYL
Sbjct: 444 KFERETNDAERIRCLCGAPGCKGYLN 469
>gi|302921594|ref|XP_003053314.1| hypothetical protein NECHADRAFT_123121 [Nectria haematococca mpVI
77-13-4]
gi|256734254|gb|EEU47601.1| hypothetical protein NECHADRAFT_123121 [Nectria haematococca mpVI
77-13-4]
Length = 776
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 89/143 (62%), Gaps = 5/143 (3%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
++++KT G+GV + + I+EY GE+I +A CE+R+ ++ Y+ + +Y+ +
Sbjct: 460 VEVIKTSDRGYGVRSNRCFRPNQIIMEYAGEIITEAECERRMTEV-YKDNECYYLMSFDQ 518
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA-RSIKAGEPLTYDYRFVQ 395
+ IDAT G+ +RF+NHSC+PNC + KW V G+ R+ +FA + I GE LTYDY F
Sbjct: 519 NMIIDATT-GSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDKPITTGEELTYDYNFDP 577
Query: 396 FGPE--VKCYCGASSCQGYLGTK 416
F + KC CG +C+G LG K
Sbjct: 578 FSAKNVQKCLCGEPNCRGVLGPK 600
>gi|255562775|ref|XP_002522393.1| set domain protein, putative [Ricinus communis]
gi|223538471|gb|EEF40077.1| set domain protein, putative [Ricinus communis]
Length = 1516
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 95/176 (53%), Gaps = 18/176 (10%)
Query: 256 CSKACHCSETCNNRPFRKEKKIK--IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C++ C CS+TC NR + ++K + KT+ GW V A EPI G F+ EYIGEV+D+
Sbjct: 1340 CNQMCSCSKTCPNRVLQNGIRVKLEVYKTKNKGWAVRAGEPILSGTFVCEYIGEVLDEVE 1399
Query: 314 CEQR-----------LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCIL 362
QR ++D+ + E + + IDAT GN SRF+NHSC PN +
Sbjct: 1400 ANQRRGRYSEESCSYMYDIDAHTNDMSRLMEGQVKYVIDATKHGNVSRFINHSCLPNLVN 1459
Query: 363 EKWQVEG----ETRVGVFAARSIKAGEPLTYDYRF-VQFGPEVKCYCGASSCQGYL 413
+ + +G++A+R I GE LTY+YR+ + G C+CG S C+G L
Sbjct: 1460 HQVIINSMDAQRAHIGLYASRDIAFGEELTYNYRYNLVPGEGYPCHCGTSKCRGRL 1515
>gi|260791327|ref|XP_002590691.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
gi|229275887|gb|EEN46702.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
Length = 2482
Score = 115 bits (287), Expect = 6e-23, Method: Composition-based stats.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++ K+ WG+ A EPI E +IEY+G+ I + ++R + +G+
Sbjct: 2336 NQLKFRK-KQLSFRKSRIHDWGLFALEPIAAEEMVIEYVGQCIRQTIADERERRYEEQGI 2394
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ + D IDAT GN +RF+NH C+PNC + VEG ++ +++ R I E
Sbjct: 2395 GSSYLFRVDHDMIIDATKNGNLARFINHCCNPNCYAKIITVEGYKKIVIYSRRDIAVNEE 2454
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA +C+G L
Sbjct: 2455 ITYDYKFPIEDEKIPCLCGAENCRGTL 2481
>gi|388852139|emb|CCF54145.1| uncharacterized protein [Ustilago hordei]
Length = 1338
Score = 115 bits (287), Expect = 6e-23, Method: Composition-based stats.
Identities = 75/211 (35%), Positives = 111/211 (52%), Gaps = 28/211 (13%)
Query: 251 VQYISCSKACHCSETCNNRPFRKEKKIKIVKTEFC--GWGVEAAEPINKGEFIIEYIGEV 308
+ +I K C C N + +K + G+G++ EPI K +FI EY GEV
Sbjct: 692 LMFICDPKTCPSGSNCTNISLGRRPTVKTAVHYYGRRGFGLKTLEPIKKDDFIDEYRGEV 751
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCE--IRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQ 366
ID +R+ D +Y+ NFY+ + +D KGN +RF NHSC+PNC +EK+
Sbjct: 752 IDLHEASKRVTD-EYKATGNFYLLDYDTAAGELLDGGRKGNITRFANHSCEPNCRIEKFI 810
Query: 367 VEG-------ETRVGVFAARSIKAGEPLTYDYRFVQFGP---------EV---KCYCGAS 407
+ G E ++G+FA R I+AGE LTY+Y + F P EV +C CGA+
Sbjct: 811 ICGTDEALSAEFQIGLFALRDIEAGEELTYNYGWSAFQPRDITGAPTEEVPPEQCLCGAA 870
Query: 408 SCQGYLGTKR----KIGKLELCWGSKRKRSS 434
+C G LG K+ K GK E+ S++K ++
Sbjct: 871 NCAGILGGKKPPATKAGKDEVAANSRKKTAN 901
>gi|397568484|gb|EJK46160.1| hypothetical protein THAOC_35187 [Thalassiosira oceanica]
Length = 473
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 91/164 (55%), Gaps = 6/164 (3%)
Query: 256 CSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C+ C C NR R+ K ++ + GWG+ + + + G+ +IEY GEVID++
Sbjct: 268 CAANCDLGPNCGNRAMGRRRFAKCRVQREHGKGWGLISVDGVKSGDLVIEYAGEVIDEST 327
Query: 314 CEQRL--WDMKYRGVQNFYMCEI-RKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGE 370
E RL W + NFY+ + + + IDA N +RF+NHSCDPNC L V G
Sbjct: 328 KESRLAAWTRDHPTDPNFYVMALGQAGWYIDARHVANQARFVNHSCDPNCRLVPLNVAGH 387
Query: 371 TRVGVFAARSIKAGEPLTYDYRF-VQFGPEVKCYCGASSCQGYL 413
RV + A R ++ GE L+YDY+F + G C CG+S+C+G +
Sbjct: 388 MRVAIVAVRDVRPGEFLSYDYQFDTRQGDRFTCRCGSSNCRGTM 431
>gi|195020546|ref|XP_001985216.1| GH14625 [Drosophila grimshawi]
gi|193898698|gb|EDV97564.1| GH14625 [Drosophila grimshawi]
Length = 2406
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 92/170 (54%), Gaps = 7/170 (4%)
Query: 253 YISCSKA-CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y C+ + C + C N+ ++ + ++ T GWGV PI KG +I+EY+GEV
Sbjct: 1540 YTECAPSNCPAGDKCRNQKIQRHEVAPGVERFMTTDKGWGVRTKLPIAKGTYILEYVGEV 1599
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + +QR+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V
Sbjct: 1600 VTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVN 1658
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G +R+ +FA R+I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1659 GLSRMVLFAKRAIEDGEELTYDYNFSLFNPSEGQPCRCNMPQCRGVIGGK 1708
>gi|357161607|ref|XP_003579145.1| PREDICTED: uncharacterized protein LOC100843412 [Brachypodium
distachyon]
Length = 1194
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 90/142 (63%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+++ +++ WG+ A E I+ +F+IEY+GE+I + + R + G+ + Y+
Sbjct: 1053 RKKRLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRPVSDIREAQYEKSGIGSSYLF 1112
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D+ +DAT +G +RF+NHSC+PNC + V+G+ ++ +++ R I AGE LTY+Y+
Sbjct: 1113 RLDDDYVVDATKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYSKRRIYAGEELTYNYK 1172
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F ++ C+CG+ C+G +
Sbjct: 1173 FPLEEKKIPCHCGSLRCRGSMN 1194
>gi|358389897|gb|EHK27489.1| hypothetical protein TRIVIDRAFT_34353 [Trichoderma virens Gv29-8]
Length = 1221
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1076 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYL 1135
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1136 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDY 1195
Query: 392 RFV-QFGP--EVKCYCGASSCQGYLG 414
+F + G + C CG ++C+G+L
Sbjct: 1196 KFEREIGSLDRIPCLCGTAACKGFLN 1221
>gi|426331996|ref|XP_004026979.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Gorilla gorilla
gorilla]
Length = 2776
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 86/151 (56%), Gaps = 5/151 (3%)
Query: 253 YISCS-KACHCSETCNNRPFRKEKKIKIV---KTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+ CS C C E C N+ ++ + ++ + + E GWG+ EP+ G+FIIEY+GEV
Sbjct: 2114 FAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEV 2173
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + R+ + +Y + Y + ID+ GN +RF+NHSCDPNC ++KW V
Sbjct: 2174 VSEQEFRNRMIE-QYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVN 2232
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE 399
G R+G++A + + AG LTYDY F F E
Sbjct: 2233 GVYRIGLYALKDMPAGTELTYDYNFHSFNVE 2263
>gi|297735229|emb|CBI17591.3| unnamed protein product [Vitis vinifera]
Length = 1315
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 92/177 (51%), Gaps = 19/177 (10%)
Query: 256 CSKACHCSETCNNRPFRKEKKIK--IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C+ C C+ TC NR + ++K + +TE GW V A E I +G FI EYIGEV+ +
Sbjct: 1138 CNGKCSCNRTCQNRVLQNGVRVKLEVFRTEEKGWAVRAGEAILRGTFICEYIGEVLSEQE 1197
Query: 314 CEQR------------LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCI 361
++R +D+ + E + + IDAT GN SRF+NHSC PN I
Sbjct: 1198 ADKRGNNRHGEEGCSYFYDIDSHINDMSRLVEGQVPYVIDATRYGNVSRFINHSCSPNLI 1257
Query: 362 LEKWQVEGE----TRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQGYL 413
+ VE +G+FA R I GE LTYDYR+ G C+CGAS C+G L
Sbjct: 1258 NHQVLVESMDCQLAHIGLFANRDISLGEELTYDYRYKPLPGEGYPCHCGASKCRGRL 1314
>gi|358401203|gb|EHK50509.1| hypothetical protein TRIATDRAFT_171650, partial [Trichoderma
atroviride IMI 206040]
Length = 1241
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1096 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYL 1155
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1156 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDY 1215
Query: 392 RFV-QFGP--EVKCYCGASSCQGYLG 414
+F + G + C CG ++C+G+L
Sbjct: 1216 KFEREIGSLDRIPCLCGTAACKGFLN 1241
>gi|452837203|gb|EME39145.1| hypothetical protein DOTSEDRAFT_75034 [Dothistroma septosporum NZE10]
Length = 1275
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 87/145 (60%), Gaps = 3/145 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ GWG+ A E I E IIEY+GE + + + R + +GV + Y+
Sbjct: 1130 KRKKLVKFDRSAIHGWGLYAEENIGINELIIEYVGEKVRQKVADMREIKYEKQGVGSSYL 1189
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
+ D +DAT KG +RF+NHSCDPNC + +VEG R+ ++A + I + LTYDY
Sbjct: 1190 FRMMDDEIVDATKKGGIARFINHSCDPNCTAKIIKVEGTPRIVIYALKDIYKNDELTYDY 1249
Query: 392 RF---VQFGPEVKCYCGASSCQGYL 413
+F + + C CG+++C+G+L
Sbjct: 1250 KFEREIGSTDRIPCLCGSANCKGFL 1274
>gi|336264616|ref|XP_003347084.1| hypothetical protein SMAC_05383 [Sordaria macrospora k-hell]
gi|380093778|emb|CCC08742.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1224
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 87/140 (62%), Gaps = 5/140 (3%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
+++ KTE G+GV + + I+EY GE+I D CE+R+ + +Y+ + +Y+ +
Sbjct: 769 VEVFKTEDRGYGVRSNRCFEPHQIIMEYTGEIITDEECERRM-NEEYKNNECYYLMSFDQ 827
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA-RSIKAGEPLTYDYRFVQ 395
+ IDAT G+ +RF+NHSC PNC + KW V G+ R+ +FA R I+ GE LTYDY F
Sbjct: 828 NMIIDATT-GSIARFVNHSCSPNCRMIKWIVSGQPRMALFAGDRPIQTGEELTYDYNFDP 886
Query: 396 FGPE--VKCYCGASSCQGYL 413
F + KC CGA +C+G L
Sbjct: 887 FSAKNVQKCLCGAPNCRGVL 906
>gi|255945051|ref|XP_002563293.1| Pc20g07700 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588028|emb|CAP86099.1| Pc20g07700 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 788
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 98/173 (56%), Gaps = 16/173 (9%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F K+ ++++KT G+GV + + I+EY GE+
Sbjct: 415 CRLGPECGNRNFEGLKQRTKAGGKYNIGVEVIKTADRGYGVRSNRSFEPNQIIVEYTGEI 474
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ CE+R+ + Y+ +Y+ ++ IDAT +G+ +RF+NH+C+PNC +EKW V
Sbjct: 475 LTQLECEKRMRTV-YKNNDCYYLMYFDQNMIIDAT-RGSIARFVNHACEPNCRMEKWTVA 532
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
G+ R+ +FA R + GE L+YDY F + + +C CG+++C+G+LG + K
Sbjct: 533 GKPRMALFAGDRGVSTGEELSYDYNFDPYSNKNVQECRCGSANCRGFLGPRLK 585
>gi|340514680|gb|EGR44940.1| predicted protein [Trichoderma reesei QM6a]
Length = 1236
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E INK + IIEY+GE + + E R G+ + Y+
Sbjct: 1091 KRKKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYL 1150
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1151 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDY 1210
Query: 392 RFV-QFGP--EVKCYCGASSCQGYLG 414
+F + G + C CG ++C+G+L
Sbjct: 1211 KFEREIGSLDRIPCLCGTAACKGFLN 1236
>gi|363756170|ref|XP_003648301.1| hypothetical protein Ecym_8199 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891501|gb|AET41484.1| Hypothetical protein Ecym_8199 [Eremothecium cymbalariae
DBVPG#7215]
Length = 995
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 94/169 (55%), Gaps = 6/169 (3%)
Query: 252 QYISCSKACHCSET---CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Q I KA +E+ N+ +++K++ ++ WG+ A EPI E IIEY+GE+
Sbjct: 827 QDIEAQKAAIGTESELLSLNQLTKRKKQVTFARSPIHNWGLYALEPIAAKEMIIEYVGEM 886
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I + E R G+ + Y+ I + IDAT KG +RF+NH CDP+C + +V
Sbjct: 887 IRQPVAEMRENRYLKSGIGSSYLFRIDEYTVIDATKKGGIARFINHCCDPSCTAKIIKVG 946
Query: 369 GETRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
G R+ ++A R I A E LTYDY+F V + C CGA++C+G+L
Sbjct: 947 GRKRIVIYALRDIAANEELTYDYKFEREVDDEERLPCLCGAATCKGFLN 995
>gi|156099290|ref|XP_001615647.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804521|gb|EDL45920.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 3021
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 89/153 (58%), Gaps = 6/153 (3%)
Query: 266 CNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLW---- 319
C NRPF+ K ++I +TE G+GV I GE I EY+GEV+ E+R+
Sbjct: 2584 CFNRPFKHSAIKDLEIKQTERTGYGVFCKRDIKNGELICEYVGEVLGKKEFEERMEAYQE 2643
Query: 320 DMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ K + N+Y +I +D ID+ KG+ SRF+NHSC PN + +KW V G R+G+FA +
Sbjct: 2644 ESKKTDMYNWYSIQINRDVHIDSRRKGSISRFVNHSCSPNSVSQKWIVRGFYRIGIFAQQ 2703
Query: 380 SIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGY 412
I AGE +TY+Y + +C C +++C +
Sbjct: 2704 DIPAGEEITYNYSYNFVFNNFECLCNSANCMNH 2736
>gi|347966112|ref|XP_321588.5| AGAP001535-PA [Anopheles gambiae str. PEST]
gi|333470205|gb|EAA00844.6| AGAP001535-PA [Anopheles gambiae str. PEST]
Length = 3613
Score = 114 bits (285), Expect = 1e-22, Method: Composition-based stats.
Identities = 62/172 (36%), Positives = 91/172 (52%), Gaps = 7/172 (4%)
Query: 253 YISC-SKACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y C + C C + C N ++ + ++ TE GWG+ + E I+KG FI+EY+GEV
Sbjct: 2733 YTECVPEQCPCGDRCRNTCIQRHEYAPGLERFMTEEKGWGIRSRERISKGTFIMEYLGEV 2792
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + ++R+ M Y + Y + ID G+ RF+NHSC PNC ++KW V
Sbjct: 2793 VTEREFKERMRTM-YLNDTHHYCLNLDGGLVIDGHRMGSDCRFVNHSCAPNCEMQKWSVN 2851
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
G R+ +FA R I E L YDY F F P C CG+ C+G +G K +
Sbjct: 2852 GLFRMALFAMRDIPPNEELCYDYNFSLFNPSEGQPCRCGSEQCRGVIGGKSQ 2903
>gi|331214149|ref|XP_003319756.1| Setd1a protein [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 1014
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 84/142 (59%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++K ++ WG+ A E I GE +IEY+GEVI A+ ++R + G+ + Y+
Sbjct: 873 RKKQLKFSRSPIHDWGLYAMETIPAGEMVIEYVGEVIRQAVADRREKLYERMGIGSSYLF 932
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D +DAT KGN R +NH C PNC + + GE ++ ++A +I+ G+ +TYDY
Sbjct: 933 RVDDDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTIELGDEVTYDYH 992
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F + ++ C CG+ C+G L
Sbjct: 993 FPKEEVKIPCLCGSVKCKGTLN 1014
>gi|356558250|ref|XP_003547420.1| PREDICTED: uncharacterized protein LOC100806034 [Glycine max]
Length = 1300
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 87/141 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K ++ +++ WG+ A EPI +F+IEYIGE+I + + R + G+ + Y+
Sbjct: 1159 RKKHLRFQRSKIHDWGLVALEPIEAEDFVIEYIGELIRPRISDIRERQYEKMGIGSSYLF 1218
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 1219 RLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1278
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C CG+ C+G L
Sbjct: 1279 FPLEEKKIPCNCGSRKCRGSL 1299
>gi|301114847|ref|XP_002999193.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111287|gb|EEY69339.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 295
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 89/156 (57%), Gaps = 7/156 (4%)
Query: 286 GWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFK 345
G G++ E I G F+ EY+GE++ + E + + Y ++ YM + IDAT
Sbjct: 13 GLGMKLLEDIKAGSFVGEYMGEIVTEQ--EYYMRRVLYHNEKHRYMMVLSGGEVIDATRM 70
Query: 346 GNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCY 403
G ++RF+NHSC+PNC +EKW V GE R +FA R I AGE LT+DY+F F +C
Sbjct: 71 GGWARFINHSCNPNCGVEKWDVNGEERCAIFALRDIVAGEELTFDYKFESFSKAEITECL 130
Query: 404 CGASSCQGYLGTKRKIGKLELCWGSKRKRSSTACLA 439
CGA +C+ +G K+ KL +++K + T L
Sbjct: 131 CGAPNCRKVIGMNNKVTKLN---KAQKKTAETTALV 163
>gi|367021898|ref|XP_003660234.1| hypothetical protein MYCTH_2298281 [Myceliophthora thermophila ATCC
42464]
gi|347007501|gb|AEO54989.1| hypothetical protein MYCTH_2298281 [Myceliophthora thermophila ATCC
42464]
Length = 942
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 98/169 (57%), Gaps = 16/169 (9%)
Query: 264 ETCNNRPFR----KEKK-------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NR F+ + KK +++ KT+ G+GV + + I+EY GE+I A
Sbjct: 488 EYCQNRAFQDLQERTKKGGRYRIGVEVFKTKDRGYGVRSNRCFEPNQIIMEYTGEIITVA 547
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+R+ + +Y+ + +Y+ ++ IDAT G+ +RF+NHSC PNC + KW V G+ R
Sbjct: 548 ECERRM-NEEYKDNECYYLMSFDQNMIIDATT-GSIARFVNHSCSPNCRMIKWIVAGQPR 605
Query: 373 VGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
+ +FA R I GE LTYDY F F + KC CG+ +C+G LG K K
Sbjct: 606 MALFAGDRPIMTGEELTYDYNFDPFSAKNVQKCLCGSPNCRGVLGPKPK 654
>gi|241612901|ref|XP_002407306.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215502770|gb|EEC12264.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 208
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 53/148 (35%), Positives = 87/148 (58%), Gaps = 1/148 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+++ + ++R G+
Sbjct: 62 NQLKFRK-KQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQMVRPVMADRREQHYTQIGI 120
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ + + IDAT GN +RF+NHSC+PNC + VEG+ ++ +++ + I E
Sbjct: 121 GSSYLFRVDVETIIDATKCGNLARFINHSCNPNCYAKVITVEGQKKIVIYSKQPINVNEE 180
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA C+G+L
Sbjct: 181 ITYDYKFPLEEEKISCLCGAPQCRGFLN 208
>gi|400595512|gb|EJP63307.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 824
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 98/167 (58%), Gaps = 16/167 (9%)
Query: 264 ETCNNRPF-----RKEKK------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
ETC NR F R+ K ++++KT G+GV + + I+EY GE+I +
Sbjct: 447 ETCTNRAFATLTARRAKGGKYRVGVEVIKTSDRGYGVRSNRCFRPHQIIMEYAGEIITEE 506
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+R+ + +Y+ + +Y+ ++ IDAT G+ +RF+NHSC+PNC + KW V G+ R
Sbjct: 507 ECERRM-NNEYKNNECYYLMSFDQNMIIDATT-GSIARFVNHSCNPNCRMIKWIVSGQPR 564
Query: 373 VGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
+ +FA + I G+ LTYDY F F + KC CGA +C+G LG +
Sbjct: 565 MALFAGDNPIMTGDELTYDYNFDPFSAKNVQKCLCGAHNCRGVLGPR 611
>gi|222617469|gb|EEE53601.1| hypothetical protein OsJ_36855 [Oryza sativa Japonica Group]
Length = 1165
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 86/134 (64%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+++ +++ WG+ A E I+ +F+IEY+GE+I + + R + G+ + Y+
Sbjct: 1030 RKKRLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQVSDIREDQYEKSGIGSSYLF 1089
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D+ +DAT +G +RF+NHSCDPNC + VEG+ ++ ++A R I AGE LTY+Y+
Sbjct: 1090 RLDDDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKKIVIYAKRRIYAGEELTYNYK 1149
Query: 393 FVQFGPEVKCYCGA 406
F ++ C+CG+
Sbjct: 1150 FPLEEKKIPCHCGS 1163
>gi|218187240|gb|EEC69667.1| hypothetical protein OsI_39097 [Oryza sativa Indica Group]
Length = 1167
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 86/134 (64%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+++ +++ WG+ A E I+ +F+IEY+GE+I + + R + G+ + Y+
Sbjct: 1032 RKKRLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQVSDIREDQYEKSGIGSSYLF 1091
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D+ +DAT +G +RF+NHSCDPNC + VEG+ ++ ++A R I AGE LTY+Y+
Sbjct: 1092 RLDDDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKKIVIYAKRRIYAGEELTYNYK 1151
Query: 393 FVQFGPEVKCYCGA 406
F ++ C+CG+
Sbjct: 1152 FPLEEKKIPCHCGS 1165
>gi|356532622|ref|XP_003534870.1| PREDICTED: uncharacterized protein LOC100805708 [Glycine max]
Length = 1213
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 87/141 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K ++ +++ WG+ A EPI +F+IEYIGE+I + + R + G+ + Y+
Sbjct: 1072 RKKHLRFQRSKIHDWGLLALEPIEAEDFVIEYIGELIRPRISDIRERQYEKMGIGSSYLF 1131
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 1132 RLDDGYVVDATKRGGIARFVNHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1191
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F ++ C CG+ C+G L
Sbjct: 1192 FPLEEKKIPCNCGSRKCRGSL 1212
>gi|344304500|gb|EGW34732.1| hypothetical protein SPAPADRAFT_133304 [Spathaspora passalidarum NRRL
Y-27907]
Length = 1060
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 86/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + + R G+ + Y+
Sbjct: 915 KRKKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAQHREESYLKTGIGSSYL 974
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NH C P+C + +VEG+ R+ ++A R I+A E LTYDY
Sbjct: 975 FRIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDY 1034
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F + + ++C CGA C+GYL
Sbjct: 1035 KFERETNDEERIRCLCGAPGCKGYLN 1060
>gi|170029985|ref|XP_001842871.1| histone-lysine N-methyltransferase ash1 [Culex quinquefasciatus]
gi|167865331|gb|EDS28714.1| histone-lysine N-methyltransferase ash1 [Culex quinquefasciatus]
Length = 2119
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 91/170 (53%), Gaps = 7/170 (4%)
Query: 253 YISC-SKACHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Y+ C + C C + C N ++ + ++ TE GWG+ + E + KG FI+EY+GEV
Sbjct: 1773 YVECVPENCPCGDRCQNTKIQRHEYAPGLERFMTELKGWGIRSKEGVKKGSFIMEYLGEV 1832
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + ++R+ + Y + Y + ID G+ RF+NHSC PNC ++KW V
Sbjct: 1833 VTEKEFKERMRTI-YLNDTHHYCLNLTGGLVIDGHRMGSDCRFVNHSCAPNCEMQKWSVN 1891
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
G R+ ++A+R I E L YDY F F P C CGA C+G +G K
Sbjct: 1892 GLFRMALYASRDIPPHEELCYDYNFSLFNPSEGQPCKCGAEQCRGVIGGK 1941
>gi|346320473|gb|EGX90073.1| histone-lysine N-methyltransferase (Ash1), putative [Cordyceps
militaris CM01]
Length = 962
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/166 (36%), Positives = 98/166 (59%), Gaps = 16/166 (9%)
Query: 265 TCNNRPF-----RKEKK------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
TC NR F R+ K ++++KT G+GV + + I+EY GE+I +
Sbjct: 578 TCTNRAFATLTARRAKGGKYRVGVEVIKTSDRGYGVRSNRCFRPHQIIMEYAGEIITEDE 637
Query: 314 CEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRV 373
C++R+ + +Y+ + +Y+ ++ IDAT G+ +RF+NHSC+PNC + KW V G+ R+
Sbjct: 638 CDRRMKN-EYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRM 695
Query: 374 GVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
+FA S I G+ LTYDY F F + +C CGA +C+G+LG +
Sbjct: 696 ALFAGDSPIMTGDELTYDYNFDPFSAKNVQRCLCGADNCRGFLGPR 741
>gi|168037139|ref|XP_001771062.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
gi|162677595|gb|EDQ64063.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
Length = 2607
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 51/133 (38%), Positives = 81/133 (60%)
Query: 274 EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCE 333
+K +K +++ WG+ A E I +F+IEY+GE+I + R + G+ + Y+
Sbjct: 2472 KKHLKFQRSKIHDWGLLALESIEAEDFVIEYVGEIIRRQVSNFRERQYEIMGIGSSYLFR 2531
Query: 334 IRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRF 393
+ + +DAT KG +RF+NHSC+PNC + VEG +V +++ R+I AGE LTYDY+F
Sbjct: 2532 VDDELVVDATQKGGLARFINHSCNPNCYTKIITVEGRKKVVIYSKRAIGAGEELTYDYKF 2591
Query: 394 VQFGPEVKCYCGA 406
++ CYCGA
Sbjct: 2592 SLEDKKIPCYCGA 2604
>gi|432875260|ref|XP_004072753.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Oryzias
latipes]
Length = 1547
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 53/148 (35%), Positives = 86/148 (58%), Gaps = 1/148 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N +RK K+I+ ++ WG+ A EPI E +IEY+GE I + ++R + G+
Sbjct: 1401 NQLKYRK-KRIRFSRSHIHEWGLFAMEPIAADEMVIEYVGETIRQVIADRREQRYEEEGI 1459
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
Y+ I +D IDAT GN +RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 1460 GGSYLFRIDQDTIIDATKCGNLARFINHSCNPNCYAKVISVESQKKIVIYSRQPISINEE 1519
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TY+Y+F ++ C CGA +C+G L
Sbjct: 1520 ITYNYKFPIEDTKIPCLCGAENCRGSLN 1547
>gi|328714763|ref|XP_003245446.1| PREDICTED: hypothetical protein LOC100571907 [Acyrthosiphon pisum]
Length = 2204
Score = 114 bits (284), Expect = 1e-22, Method: Composition-based stats.
Identities = 62/173 (35%), Positives = 98/173 (56%), Gaps = 12/173 (6%)
Query: 247 GPPPVQYISCSKACHCSETCNNRPFRKE--KKIKIVKTEFCGWGVEAAEPINKGEFIIEY 304
PP Y SK S++ + ++E + + +++ G G+ AA + K +IEY
Sbjct: 2037 APPTCPY---SKHFVHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEY 2093
Query: 305 IGEVIDDALCEQR--LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCIL 362
IGE+I LC+ R L++ K RG+ YM + D +DAT G +R++NHSC+PNC+
Sbjct: 2094 IGEIIRSQLCDYREKLYEAKNRGI---YMFRLDDDRVVDATISGGLARYINHSCNPNCVT 2150
Query: 363 EKWQVEGETRVGVFAARSIKAGEPLTYDYRF--VQFGPEVKCYCGASSCQGYL 413
EK +V+ E R+ +FA R I GE L YDY+F ++ C CGA +C+ ++
Sbjct: 2151 EKVEVDRELRIIIFAKRRIARGEELAYDYQFDIEDDQHKIPCNCGAPNCRKWM 2203
>gi|225555248|gb|EEH03540.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 803
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 97/170 (57%), Gaps = 16/170 (9%)
Query: 263 SETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
+E C NR F ++ ++++KT G+GV + + I+EY GE+I
Sbjct: 405 AELCGNRSFEGLRQRIKMGGRYNIGVEVIKTADRGYGVRSNRTFAPNQIIVEYTGEIITQ 464
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGET 371
CE+R+ + Y+ + +Y+ ++ IDAT +G+ +RF+NHSC+PNC +EKW V G+
Sbjct: 465 EECERRMRTV-YKDNECYYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKP 522
Query: 372 RVGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
R+ +FA + I GE LTYDY F + + +C CG +C+G LG K K
Sbjct: 523 RMALFAGENGIMTGEELTYDYNFDPYSQKNVQQCRCGVPTCRGVLGPKPK 572
>gi|358385026|gb|EHK22623.1| hypothetical protein TRIVIDRAFT_212935 [Trichoderma virens Gv29-8]
Length = 738
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 89/145 (61%), Gaps = 5/145 (3%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
++++KT G+GV + + I+EY GE+I + CE+R+ ++ Y+ + +Y+ +
Sbjct: 451 VEVIKTSDRGYGVRSNRCFEPHQIIMEYAGEIITEEECERRMNEI-YKNNECYYLMSFDQ 509
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAA-RSIKAGEPLTYDYRFVQ 395
+ IDAT G+ +RF+NHSC+PNC + KW V G+ R+ +FA R I GE LTYDY F
Sbjct: 510 NMIIDATT-GSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFDP 568
Query: 396 FGPE--VKCYCGASSCQGYLGTKRK 418
F + KC CG +C+G LG K K
Sbjct: 569 FSAKNVQKCLCGQPNCRGVLGPKPK 593
>gi|255539394|ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
gi|223551463|gb|EEF52949.1| set domain protein, putative [Ricinus communis]
Length = 1258
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 88/142 (61%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+++ +++ WG+ A EPI +F+IEY+GE+I + + R + G+ + Y+
Sbjct: 1117 RKKRLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERLYEKMGIGSSYLF 1176
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 1177 RLDDGYVVDATKRGGVARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1236
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F ++ C CG+ C+G L
Sbjct: 1237 FPLEEKKIPCNCGSRKCRGSLN 1258
>gi|195128035|ref|XP_002008472.1| GI13513 [Drosophila mojavensis]
gi|193920081|gb|EDW18948.1| GI13513 [Drosophila mojavensis]
Length = 2416
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 88/162 (54%), Gaps = 6/162 (3%)
Query: 260 CHCSETCNNRPFRKEK---KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQ 316
C + C N+ ++ + ++ T GWGV PI KG +I+EY+GEV+ + +Q
Sbjct: 1530 CPAGDKCRNQKIQRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEYVGEVVTEREFKQ 1589
Query: 317 RLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
R+ + Y + Y + ID G+ RF+NHSC+PNC ++KW V G +R+ +F
Sbjct: 1590 RMATI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVNGLSRMVLF 1648
Query: 377 AARSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
A R+I+ GE LTYDY F F P C C C+G +G K
Sbjct: 1649 AKRAIEEGEELTYDYNFSLFNPSEGQPCRCNMPQCRGVIGGK 1690
>gi|391325531|ref|XP_003737286.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Metaseiulus occidentalis]
Length = 976
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 54/148 (36%), Positives = 85/148 (57%), Gaps = 1/148 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+ + ++ + R G+
Sbjct: 830 NQLKFRK-KQLKFAKSTIHDWGLFALEPIAADEMVIEYVGQSVRCSVADSREVQYTKVGI 888
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ + + IDAT GN +RF+NHSC+PNC VEG+ ++ +++ R I E
Sbjct: 889 GSSYLFRVDHETIIDATKCGNLARFINHSCNPNCYARVITVEGQKKIVIYSKRDISVNEE 948
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F + ++ C CG C+GYL
Sbjct: 949 ITYDYKFPREEVKITCLCGTPQCRGYLN 976
>gi|255711468|ref|XP_002552017.1| KLTH0B05280p [Lachancea thermotolerans]
gi|238933395|emb|CAR21579.1| KLTH0B05280p [Lachancea thermotolerans CBS 6340]
Length = 986
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/169 (35%), Positives = 94/169 (55%), Gaps = 6/169 (3%)
Query: 252 QYISCSKACHCSET---CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Q I KA SE+ N+ +++K + ++ WG+ A EPI E IIEY+GE+
Sbjct: 818 QDIEAQKAMIGSESELLTLNQLTKRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGEI 877
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
+ + E R G+ + Y+ + + IDAT KG +RF+NH CDP+C + +V
Sbjct: 878 LRQPVAEMRERTYLKSGIGSSYLFRVDESTVIDATKKGGIARFINHCCDPSCTAKIIRVG 937
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE---VKCYCGASSCQGYLG 414
G R+ ++A R I A E LTYDY+F + + + C+CGA +C+G+L
Sbjct: 938 GRKRIVIYALRDIAANEELTYDYKFERETDDEERLPCFCGAPTCKGFLN 986
>gi|428176462|gb|EKX45346.1| hypothetical protein GUITHDRAFT_61291, partial [Guillardia theta
CCMP2712]
Length = 215
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 95/170 (55%), Gaps = 6/170 (3%)
Query: 253 YISCSKACHCSETCNNRPFRKEKKIK---IVKTEFCGWGVEAAEPINKGEFIIEYIGEVI 309
YI C C CS C N+ +K + K + GW + E + +G+ ++EYIGEVI
Sbjct: 48 YIECDYRCPCSSHCLNKRLQKRQWAKCSIFLAKNGRGWALRNDEDLRQGQLVMEYIGEVI 107
Query: 310 DDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEG 369
+R+ + Y G ++ YM ++ ++ ID+T K N +RF+NH C+PNC ++KW V
Sbjct: 108 SGEEVSRRMEE--YAGKRHTYMLKLNQEEFIDSTRKANLARFINHCCEPNCEMQKWYVGN 165
Query: 370 ETRVGVFAARSIKAGEPLTYDYRFVQFGPE-VKCYCGASSCQGYLGTKRK 418
+ V +FA I +G LT+DY +G E V C CGA C+G LG ++
Sbjct: 166 KQCVALFAKYFIPSGSELTFDYDMEFYGSENVVCLCGAPKCRGTLGKPKE 215
>gi|84997445|ref|XP_953444.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304440|emb|CAI76819.1| hypothetical protein, conserved [Theileria annulata]
Length = 1083
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/162 (39%), Positives = 92/162 (56%), Gaps = 9/162 (5%)
Query: 264 ETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDM 321
E C NR F K+K+ + G G A E IN+GE + EY+GEVI A ++ L
Sbjct: 755 ENCGNRRFLNFTGPKLKLNYVDGKGVGTVATEDINEGELVCEYVGEVISQADFQRCLASA 814
Query: 322 KYRGVQN-----FYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVF 376
+ + + +Y+ +I++D ID+T GN +RF+NHSCDPNC V G R+GVF
Sbjct: 815 SFAEIDDGNQSHWYVMKIQRDTYIDSTHLGNVARFINHSCDPNCASVPINVRGTYRMGVF 874
Query: 377 AARSIKAGEPLTYDYRFVQ--FGPEVKCYCGASSCQGYLGTK 416
A R IK GE +TY+Y F G +C C A +C+G +G++
Sbjct: 875 AQRKIKQGEEVTYNYGFTSKGVGGGFRCRCRAKNCRGIIGSQ 916
>gi|406607680|emb|CCH40952.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 1071
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 86/150 (57%), Gaps = 3/150 (2%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ +++K + ++ WG+ A EPI + E IIEY+GE I + + R G+
Sbjct: 922 NQLNKRKKPVSFARSAIHNWGLYALEPIAQKEMIIEYVGERIRQQVADFREKAYLKSGIG 981
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ I ++ IDAT KG +RF+NH C P+C + +VEG+ R+ ++A R I A E L
Sbjct: 982 SSYLFRIDENTVIDATKKGGIARFINHCCQPSCTAKIIKVEGQKRIVIYALRDIGANEEL 1041
Query: 388 TYDYRF---VQFGPEVKCYCGASSCQGYLG 414
TYDY+F V+C CGA C+GYL
Sbjct: 1042 TYDYKFERETNDNERVRCLCGAPGCKGYLN 1071
>gi|389646281|ref|XP_003720772.1| hypothetical protein MGG_02937 [Magnaporthe oryzae 70-15]
gi|351638164|gb|EHA46029.1| hypothetical protein MGG_02937 [Magnaporthe oryzae 70-15]
Length = 1015
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 106/185 (57%), Gaps = 16/185 (8%)
Query: 264 ETCNNRPFR----KEKK-------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NR F+ + KK +++V T G+GV A+ G+ I+EY GE+I +
Sbjct: 499 EHCQNRAFQDLQDRNKKGGSYRVGVEVVHTGPRGFGVRASRCFEPGQIIMEYAGEIITEE 558
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+R+ ++ Y+ + +Y+ ++ +DAT G+ +RF+NHSC PNC + KW V G+ R
Sbjct: 559 ECERRMNEV-YKDNEAYYLMSFDQNMILDATT-GSIARFVNHSCSPNCRMIKWIVCGKPR 616
Query: 373 VGVFAARS-IKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRKIGKLELCWGSK 429
+ +FA + I GE LTYDY F F + KC CG+ +C+G LG + + K G+
Sbjct: 617 MALFAGDNPIMTGEELTYDYNFDPFSAKNVQKCLCGSENCRGVLGPRTRNDKAAKANGAG 676
Query: 430 RKRSS 434
+ ++S
Sbjct: 677 KGKNS 681
>gi|322694149|gb|EFY85986.1| histone-lysine N-methyltransferase (Ash1), putative [Metarhizium
acridum CQMa 102]
Length = 760
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 96/167 (57%), Gaps = 16/167 (9%)
Query: 264 ETCNNRPF-----RKEKK------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NR F R+ K ++++KT G+GV + + I+EY GE+I +
Sbjct: 406 ELCTNRSFSDLAARRSKGGKYRVGVEVIKTPDRGYGVRSNRCFKANQIIMEYTGEIITEE 465
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+R+ + +Y+ + +Y+ ++ IDAT G+ +RF+NHSC+PNC + KW V G+ R
Sbjct: 466 ECERRM-NEEYKNNECYYLMSFDQNMIIDATT-GSIARFVNHSCNPNCRMIKWIVSGQPR 523
Query: 373 VGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTK 416
+ +FA R I G+ LTYDY F F + KC CG +C+G LG K
Sbjct: 524 MALFAGDRPIMTGDELTYDYNFDPFSAKNVQKCLCGEHNCRGVLGPK 570
>gi|67537188|ref|XP_662368.1| hypothetical protein AN4764.2 [Aspergillus nidulans FGSC A4]
gi|40741616|gb|EAA60806.1| hypothetical protein AN4764.2 [Aspergillus nidulans FGSC A4]
Length = 870
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/173 (35%), Positives = 95/173 (54%), Gaps = 25/173 (14%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F + K+ ++++KT G+GV + + I+EY GE+
Sbjct: 502 CGVGPECGNRNFEELKQRAKAGGKYNVGVEVIKTPDRGYGVRSNRTFEPNQIIVEYTGEI 561
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I A CE+R+ + Y+ +N IDAT +G+ +RF+NH C+PNC +EKW V
Sbjct: 562 ITQAECEKRMRTI-YKKNENMI---------IDAT-RGSIARFVNHGCEPNCRMEKWTVA 610
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
G+ R+ +FA R I GE LTYDY F + + +C CG+S C+G LG +++
Sbjct: 611 GKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGSSKCRGILGPRKR 663
>gi|389634753|ref|XP_003715029.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
gi|351647362|gb|EHA55222.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
Length = 1278
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E I K + IIEY+GE + ++ + R G+ + Y+
Sbjct: 1133 KRKKPVKFERSAIHNWGLYAMEHIPKDDMIIEYVGEEVRPSVAQVREARYDRSGIGSSYL 1192
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I +D IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1193 FRIDEDAVIDATKKGGIARFINHSCMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDY 1252
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F E V C CG ++C+G+L
Sbjct: 1253 KFELEEKEEDRVPCLCGTTNCKGFLN 1278
>gi|440470515|gb|ELQ39582.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
gi|440488496|gb|ELQ68221.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
Length = 1278
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E I K + IIEY+GE + ++ + R G+ + Y+
Sbjct: 1133 KRKKPVKFERSAIHNWGLYAMEHIPKDDMIIEYVGEEVRPSVAQVREARYDRSGIGSSYL 1192
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I +D IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1193 FRIDEDAVIDATKKGGIARFINHSCMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDY 1252
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F E V C CG ++C+G+L
Sbjct: 1253 KFELEEKEEDRVPCLCGTTNCKGFLN 1278
>gi|427794953|gb|JAA62928.1| hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 1557
Score = 113 bits (282), Expect = 2e-22, Method: Composition-based stats.
Identities = 53/147 (36%), Positives = 87/147 (59%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+++ + ++R G+
Sbjct: 1411 NQLKFRK-KQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQMVRPIMADRREQFYTQIGI 1469
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ + + IDAT GN +RF+NHSC+PNC + VEG+ ++ +++ + I E
Sbjct: 1470 GSSYLFRVDVETIIDATKCGNLARFINHSCNPNCYAKVITVEGQKKIVIYSKQPINVNEE 1529
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G+L
Sbjct: 1530 ITYDYKFPLEDEKIVCLCGAPQCRGFL 1556
>gi|302757968|ref|XP_002962407.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
gi|300169268|gb|EFJ35870.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
Length = 1326
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 82/142 (57%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K +K +++ WGV A E I +FI+EY+GEV+ + + R +G+ + Y
Sbjct: 1185 RKKLLKFQRSKIHAWGVVAMEVIEPEDFIVEYVGEVLRPKVADVREVRYLRQGLGSSYFF 1244
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ F IDAT +G RF+NHSC+PNC + VEG+ RV ++A I G LTYDY+
Sbjct: 1245 RVGDGFVIDATQRGGLGRFINHSCEPNCYPKIITVEGQKRVFIYARTHIAPGTELTYDYK 1304
Query: 393 FVQFGPEVKCYCGASSCQGYLG 414
F ++ C CGA C+G+L
Sbjct: 1305 FPHEDQKIPCLCGAERCRGFLN 1326
>gi|340959767|gb|EGS20948.1| hypothetical protein CTHT_0027870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1295
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 83/145 (57%), Gaps = 3/145 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ A E I K E IIEY+GE + + E R G+ + Y+
Sbjct: 1150 KRKKPVKFARSAIHNWGLYAMENIPKDEMIIEYVGEEVRQQIAELRESRYLKSGIGSSYL 1209
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I + IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1210 FRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAKNEELTYDY 1269
Query: 392 RF---VQFGPEVKCYCGASSCQGYL 413
+F + + C CG ++C+G+L
Sbjct: 1270 KFERELGSADRIPCLCGTAACKGFL 1294
>gi|380012561|ref|XP_003690348.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Apis florea]
Length = 1208
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/203 (35%), Positives = 98/203 (48%), Gaps = 35/203 (17%)
Query: 251 VQYISCSKA-CHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
+ + CS C CNN+ F R+ ++ T GWG+ + E I G+F+IEY+GE
Sbjct: 810 ILLVECSPGICPAGPKCNNQAFVRRQYPAMEPFHTVGRGWGLRSLEHIKAGQFVIEYVGE 869
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
VID+A ++RL K +NFY I + TIDA KGN SRF+NHSC PNC +KW V
Sbjct: 870 VIDEAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPKGNLSRFMNHSCSPNCETQKWTV 929
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYLGTK----------- 416
G+TR+ GE C CGAS+C G++G K
Sbjct: 930 NGDTRIAC-------DGETRK------------PCLCGASNCSGFIGLKVQKPQVTTPSI 970
Query: 417 --RKIGKLELCWGSKRKRSSTAC 437
+KI K + KR R C
Sbjct: 971 QQKKIEKFDKIKRQKRSRKHVLC 993
>gi|388853505|emb|CCF52904.1| related to regulatory protein SET1 [Ustilago hordei]
Length = 1489
Score = 113 bits (282), Expect = 2e-22, Method: Composition-based stats.
Identities = 54/144 (37%), Positives = 83/144 (57%), Gaps = 3/144 (2%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++K K+ WG+ A E I G+ +IEY+GE++ + + R + +G + Y+
Sbjct: 1346 RKKQLKFAKSPIHDWGLYAMEYIPAGDMVIEYVGEMVRQQVADNREKQYERQGNFSTYLF 1405
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D +DAT KGN +R +NH C PNC + V GE R+ +FA IKAGE LTYDY+
Sbjct: 1406 RVDDDLVVDATHKGNIARLMNHCCTPNCNAKILTVNGEKRIVLFAKSPIKAGEELTYDYK 1465
Query: 393 FVQFGPE---VKCYCGASSCQGYL 413
F + + C CG+ C+ YL
Sbjct: 1466 FQSSADDEDAIPCLCGSDGCRRYL 1489
>gi|146422003|ref|XP_001486944.1| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
Length = 1055
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 92/170 (54%), Gaps = 6/170 (3%)
Query: 251 VQYISCSKACHCSETCN---NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V IS K SET N +++K + ++ WG+ A E I E IIEY+GE
Sbjct: 886 VADISAQKQMLGSETDILDLNALTKRKKPVTFARSAIHNWGLYALESIAAKEMIIEYVGE 945
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
I + E R G+ + Y+ I ++ IDAT KG +RF+NH C+P+C + +V
Sbjct: 946 SIRQQVAEHREKSYLKTGIGSSYLFRIDENSVIDATKKGGIARFINHCCNPSCTAKIIKV 1005
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
EG+ R+ ++A R I+A E LTYDY+F ++C CGA C+GYL
Sbjct: 1006 EGKKRIVIYALRDIEANEELTYDYKFERETNDDERIRCLCGAPGCKGYLN 1055
>gi|348516272|ref|XP_003445663.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Oreochromis niloticus]
Length = 595
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 86/148 (58%), Gaps = 1/148 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FR+ K+I+ ++ WG+ A EPI E +IEY+G++I + + R + G+
Sbjct: 449 NQLKFRR-KRIRFSRSYIHEWGLFAMEPIAADEMVIEYVGQIIRQVIADMREQRYEEEGI 507
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ + +D IDAT GN +RF+NHSC+PNC + VE + ++ +++ + I E
Sbjct: 508 GSSYLFRVDQDTIIDATKCGNLARFINHSCNPNCYAKIITVESQKKIVIYSRQPININEE 567
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYLG 414
+TYDY+F ++ C CGA C+G L
Sbjct: 568 ITYDYKFPIEETKIPCLCGADGCRGSLN 595
>gi|427798749|gb|JAA64826.1| Putative histone-lysine n-methyltransferase nsd2, partial
[Rhipicephalus pulchellus]
Length = 755
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/126 (45%), Positives = 82/126 (65%), Gaps = 2/126 (1%)
Query: 260 CHCSETCNNRPFRKEK--KIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQR 317
C C N+ F+K + +V+T GWG+ ++ + G+F++EY+GE+ID+ CE+R
Sbjct: 630 CPAGPDCENQRFQKRTYARTSLVRTPGRGWGLLTSQALAAGDFVMEYVGELIDEEECERR 689
Query: 318 LWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFA 377
L ++ +QNFY + KD IDA KGN SRF+NHSC+PNC +KW V G+TRVG+FA
Sbjct: 690 LEELHMENIQNFYFLTLEKDRIIDAGPKGNMSRFINHSCEPNCETQKWTVNGDTRVGIFA 749
Query: 378 ARSIKA 383
R I A
Sbjct: 750 IRDIPA 755
>gi|190344535|gb|EDK36223.2| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
Length = 1055
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 92/170 (54%), Gaps = 6/170 (3%)
Query: 251 VQYISCSKACHCSETCN---NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGE 307
V IS K SET N +++K + ++ WG+ A E I E IIEY+GE
Sbjct: 886 VADISAQKQMLGSETDILDLNALTKRKKPVTFARSAIHNWGLYALESIAAKEMIIEYVGE 945
Query: 308 VIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQV 367
I + E R G+ + Y+ I ++ IDAT KG +RF+NH C+P+C + +V
Sbjct: 946 SIRQQVAEHREKSYLKTGIGSSYLFRIDENSVIDATKKGGIARFINHCCNPSCTAKIIKV 1005
Query: 368 EGETRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
EG+ R+ ++A R I+A E LTYDY+F ++C CGA C+GYL
Sbjct: 1006 EGKKRIVIYALRDIEANEELTYDYKFERETNDDERIRCLCGAPGCKGYLN 1055
>gi|171676169|ref|XP_001903038.1| hypothetical protein [Podospora anserina S mat+]
gi|170936150|emb|CAP60810.1| unnamed protein product [Podospora anserina S mat+]
Length = 894
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 97/165 (58%), Gaps = 16/165 (9%)
Query: 264 ETCNNRPFR----KEKK-------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NR F+ + KK ++++KT G+GV + + I+EY GE+I +
Sbjct: 471 EFCQNRAFQDLQERTKKGGRFRVGVEVLKTSDRGYGVRSTRCFEPNQIIMEYTGEIITEE 530
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+R+ + KY+ + +Y+ ++ IDAT G+ +RF+NHSC PNC + KW V G+ R
Sbjct: 531 ECERRMNE-KYKDNECYYLMSFDQNMIIDATT-GSMARFVNHSCSPNCRMIKWIVSGQPR 588
Query: 373 VGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG 414
+ +FA + I G+ LTYDY F F + KC CG+++C+G LG
Sbjct: 589 MALFAGDKPIMTGDELTYDYNFDPFSAKNVQKCLCGSANCRGVLG 633
>gi|365983760|ref|XP_003668713.1| hypothetical protein NDAI_0B04360 [Naumovozyma dairenensis CBS 421]
gi|343767480|emb|CCD23470.1| hypothetical protein NDAI_0B04360 [Naumovozyma dairenensis CBS 421]
Length = 1105
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/150 (36%), Positives = 87/150 (58%), Gaps = 3/150 (2%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ +++K + ++ WG+ A EPI E IIEY+GE I A+ E R G+
Sbjct: 956 NQLNKRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQAVAEMRERRYLKNGIG 1015
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ + ++ IDAT KG +RF+NH CDP+C + +V G R+ ++A R I A E L
Sbjct: 1016 SSYLFRVDENNVIDATKKGGIARFINHCCDPSCTAKIIKVGGRRRIVIYALRDIAANEEL 1075
Query: 388 TYDYRF---VQFGPEVKCYCGASSCQGYLG 414
TYDY+F + + C+CG+ +C+G+L
Sbjct: 1076 TYDYKFEREIDAEERLPCHCGSVNCKGFLN 1105
>gi|312384250|gb|EFR29018.1| hypothetical protein AND_02361 [Anopheles darlingi]
Length = 2074
Score = 113 bits (282), Expect = 2e-22, Method: Composition-based stats.
Identities = 53/147 (36%), Positives = 87/147 (59%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+++ ++ + R + G+
Sbjct: 1928 NQLKFRK-KQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMVRPSVADLRETKYEAIGI 1986
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I + IDAT GN +RF+NHSC+PNC + +E E ++ +++ + I E
Sbjct: 1987 GSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKIITIESEKKIVIYSKQPIGVNEE 2046
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F +++C CGA C+G L
Sbjct: 2047 ITYDYKFPLEEEKIRCLCGAPGCRGTL 2073
>gi|366990781|ref|XP_003675158.1| hypothetical protein NCAS_0B07030 [Naumovozyma castellii CBS 4309]
gi|342301022|emb|CCC68787.1| hypothetical protein NCAS_0B07030 [Naumovozyma castellii CBS 4309]
Length = 1082
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 94/169 (55%), Gaps = 6/169 (3%)
Query: 252 QYISCSKACHCSET---CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Q I +A +E+ N+ +++K + ++ WG+ A EPI E IIEY+GE
Sbjct: 914 QNIEAQRAAIGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPIAAREMIIEYVGES 973
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I + E R G+ + Y+ + ++ IDAT KG +RF+NH CDP+C + +V
Sbjct: 974 IRQPVAEMRERRYLKSGIGSSYLFRVDENNVIDATKKGGIARFINHCCDPSCTAKIIKVG 1033
Query: 369 GETRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
G+ R+ ++A R I A E LTYDY+F + + CYCG+ +C+G+L
Sbjct: 1034 GKRRIVIYALRDIAANEELTYDYKFERELDAEERLPCYCGSVNCKGFLN 1082
>gi|350408006|ref|XP_003488269.1| PREDICTED: hypothetical protein LOC100743429 [Bombus impatiens]
Length = 1503
Score = 112 bits (281), Expect = 3e-22, Method: Composition-based stats.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G++I + + R + G+
Sbjct: 1357 NQLKFRK-KQLKFAKSGIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRESQYEATGI 1415
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I D IDAT GN +RF+NHSC+PNC + +E + ++ +++ + I E
Sbjct: 1416 GSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEE 1475
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1476 ITYDYKFPLEDDKIPCLCGAPQCRGTL 1502
>gi|340721798|ref|XP_003399301.1| PREDICTED: hypothetical protein LOC100643294 [Bombus terrestris]
Length = 1502
Score = 112 bits (281), Expect = 3e-22, Method: Composition-based stats.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G++I + + R + G+
Sbjct: 1356 NQLKFRK-KQLKFAKSGIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRESQYEATGI 1414
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I D IDAT GN +RF+NHSC+PNC + +E + ++ +++ + I E
Sbjct: 1415 GSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEE 1474
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1475 ITYDYKFPLEDDKIPCLCGAPQCRGTL 1501
>gi|259482394|tpe|CBF76835.1| TPA: histone-lysine N-methyltransferase (Ash1), putative
(AFU_orthologue; AFUA_3G06480) [Aspergillus nidulans
FGSC A4]
Length = 812
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 62/173 (35%), Positives = 95/173 (54%), Gaps = 25/173 (14%)
Query: 260 CHCSETCNNRPFRKEKK-----------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
C C NR F + K+ ++++KT G+GV + + I+EY GE+
Sbjct: 444 CGVGPECGNRNFEELKQRAKAGGKYNVGVEVIKTPDRGYGVRSNRTFEPNQIIVEYTGEI 503
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I A CE+R+ + Y+ +N IDAT +G+ +RF+NH C+PNC +EKW V
Sbjct: 504 ITQAECEKRMRTI-YKKNENMI---------IDAT-RGSIARFVNHGCEPNCRMEKWTVA 552
Query: 369 GETRVGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLGTKRK 418
G+ R+ +FA R I GE LTYDY F + + +C CG+S C+G LG +++
Sbjct: 553 GKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRCGSSKCRGILGPRKR 605
>gi|302809376|ref|XP_002986381.1| hypothetical protein SELMODRAFT_123930 [Selaginella moellendorffii]
gi|300145917|gb|EFJ12590.1| hypothetical protein SELMODRAFT_123930 [Selaginella moellendorffii]
Length = 206
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/182 (36%), Positives = 95/182 (52%), Gaps = 24/182 (13%)
Query: 256 CSKACHCSETCNNRPFRKEKKIK--IVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C+ +C CSE C NR ++ K+K + KT GW V AA+ I++G F+ EY+GEV++D
Sbjct: 24 CNTSCMCSEECPNRVLQRGVKVKLEVFKTRHKGWAVRAAQNISRGTFVCEYLGEVLNDQE 83
Query: 314 CEQRLWDMKYRGVQNFYMCEIR-----------------KDFTIDATFKGNFSRFLNHSC 356
+R G Y ++ K F IDAT GN +RF+NHSC
Sbjct: 84 ANRRGERYDQVGCSYLYDIDVHLNTGGRSRRGPSRVPRIKPFVIDATKHGNVARFINHSC 143
Query: 357 DPNCILEKWQVEGE----TRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQG 411
PN + + VE +G+FA+R I GE L+YDYR+ G C+CG+S C+G
Sbjct: 144 SPNLVNYQVLVESMDYQLAHIGLFASRDILCGEELSYDYRYKLLPGRGCPCHCGSSGCRG 203
Query: 412 YL 413
L
Sbjct: 204 RL 205
>gi|367041896|ref|XP_003651328.1| hypothetical protein THITE_2111453 [Thielavia terrestris NRRL 8126]
gi|346998590|gb|AEO64992.1| hypothetical protein THITE_2111453 [Thielavia terrestris NRRL 8126]
Length = 950
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 96/165 (58%), Gaps = 16/165 (9%)
Query: 264 ETCNNRPFR----KEKK-------IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDA 312
E C NR F+ + KK +++ KT G+GV + + I+EY GE+I +
Sbjct: 484 EYCTNRAFQDLQERTKKGGRYRVGVEVFKTPDRGYGVRSNRCFEPNQIIMEYTGEIITEE 543
Query: 313 LCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETR 372
CE+R+ + +Y+ + +Y+ ++ IDAT G+ +RF+NHSC PNC + KW V G+ R
Sbjct: 544 ECERRM-NEEYKDNECYYLMSFDQNMIIDATT-GSIARFVNHSCSPNCRMIKWIVSGQPR 601
Query: 373 VGVFAA-RSIKAGEPLTYDYRFVQFGPE--VKCYCGASSCQGYLG 414
+ +FA R I GE LTYDY F F + KC CG+++C+G LG
Sbjct: 602 MALFAGDRPIMTGEELTYDYNFDPFSAKNVQKCLCGSANCRGVLG 646
>gi|401884695|gb|EJT48845.1| hypothetical protein A1Q1_02180 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1192
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/182 (37%), Positives = 97/182 (53%), Gaps = 15/182 (8%)
Query: 251 VQYISCSKACHCSETCNNRPF--RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
+Q I C K+C C N+ R K I + + G+GV AAE I GEF+++Y GEV
Sbjct: 515 LQGILCGKSCPNGPDCGNQALCRRPAKAIIVALSSLHGYGVFAAEDIAAGEFVVDYRGEV 574
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I R+ + + + +D ID++ KGN +RF+NHSCDPN +L K+
Sbjct: 575 ISVDTFIDRIG--SHSEDDSVFAIAYDRDEIIDSSSKGNSARFINHSCDPNLVLRKFDTL 632
Query: 369 G----ETRVGVFAARSIKAGEPLTYDYRFVQF-----GPE--VKCYCGASSCQGYLGTKR 417
G E G+++ R IKAGE LTYDY + GP+ V C CGA +C G L +K+
Sbjct: 633 GDGHEEHEFGLWSRRPIKAGEELTYDYNAETYPVFTDGPDTRVPCNCGAKNCTGGLNSKQ 692
Query: 418 KI 419
+
Sbjct: 693 GV 694
>gi|255730355|ref|XP_002550102.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
gi|240132059|gb|EER31617.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
Length = 1056
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 911 KRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGSSYL 970
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NH C P+C + +VEG R+ ++A R I+A E LTYDY
Sbjct: 971 FRIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVEGIKRIVIYALRDIEANEELTYDY 1030
Query: 392 RFVQFGPE---VKCYCGASSCQGYLG 414
+F + + ++C CGA C+GYL
Sbjct: 1031 KFERETNDEERIRCLCGAPGCKGYLN 1056
>gi|195453659|ref|XP_002073883.1| GK12911 [Drosophila willistoni]
gi|194169968|gb|EDW84869.1| GK12911 [Drosophila willistoni]
Length = 1765
Score = 112 bits (281), Expect = 3e-22, Method: Composition-based stats.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G++I + + R + G+
Sbjct: 1619 NQLKFRK-KQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGI 1677
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I + IDAT GN +RF+NHSC+PNC + +E E ++ +++ + I E
Sbjct: 1678 GSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEE 1737
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1738 ITYDYKFPLEDEKIPCLCGAQGCRGTL 1764
>gi|380013032|ref|XP_003690574.1| PREDICTED: uncharacterized protein LOC100867676 [Apis florea]
Length = 1500
Score = 112 bits (281), Expect = 3e-22, Method: Composition-based stats.
Identities = 53/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+++ + + R + G+
Sbjct: 1354 NQLKFRK-KQLKFAKSGIHDWGLFAMEPIAADEMVIEYVGQMVRPVVADLRESQYEATGI 1412
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I D IDAT GN +RF+NHSC+PNC + +E + ++ +++ + I E
Sbjct: 1413 GSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEE 1472
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1473 ITYDYKFPLEDDKIPCLCGAPQCRGTL 1499
>gi|406868443|gb|EKD21480.1| SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 796
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/145 (38%), Positives = 92/145 (63%), Gaps = 5/145 (3%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
++++KT G+GV + + I+EY GE+I + C++R+ + Y+ + +Y+ +
Sbjct: 475 VEVIKTADRGYGVRSNRCFEANQIIVEYTGEIITEEECDRRM-NEDYKNNECYYLMSFDQ 533
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARS-IKAGEPLTYDYRFVQ 395
+ IDAT KG+ +RF+NHSC+PNC + KW V G+ R+ +FA + I G+ LTYDY F
Sbjct: 534 NMIIDAT-KGSIARFVNHSCNPNCKMVKWIVGGKPRMALFAGDNPIMTGDELTYDYNFDP 592
Query: 396 FGPE--VKCYCGASSCQGYLGTKRK 418
F + +C CG+S+C+G+LG K K
Sbjct: 593 FSAKNVQECRCGSSNCRGFLGPKPK 617
>gi|346322948|gb|EGX92546.1| histone-lysine N-methyltransferase [Cordyceps militaris CM01]
Length = 1151
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ WG+ E I+K + IIEY+GE + + E R G+ + Y+
Sbjct: 1006 KRKKPVKFARSAIHNWGLYTMEDIHKDDMIIEYVGEEVRQQISEIRENRYLKSGIGSSYL 1065
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NHSC PNC + +VEG R+ ++A R I E LTYDY
Sbjct: 1066 FRIDENTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDITTNEELTYDY 1125
Query: 392 RFV-QFGP--EVKCYCGASSCQGYLG 414
+F + G + C CG ++C+G+L
Sbjct: 1126 KFEREIGSLDRIPCLCGTAACKGFLN 1151
>gi|296804338|ref|XP_002843021.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238845623|gb|EEQ35285.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 719
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 90/155 (58%), Gaps = 15/155 (9%)
Query: 277 IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRK 336
++++KTE G+GV + + I+EY GE++ ++R+ + Y+ + FY+ + +
Sbjct: 428 VEVIKTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRRMKTI-YKKNECFYLMDFDQ 486
Query: 337 DFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARS-IKAGEPLTYDYRFVQ 395
+ IDAT +G+ +RF+NHSC+PNC +EKW V G+ R+ +FA + I GE LTYDY F
Sbjct: 487 NMIIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGENGIMTGEELTYDYNFEY 545
Query: 396 FGPEV------------KCYCGASSCQGYLGTKRK 418
+ P +C CG SC+G LG + K
Sbjct: 546 YQPYTGRYNPYSNKNVQECRCGTPSCRGVLGPRPK 580
>gi|391333600|ref|XP_003741200.1| PREDICTED: uncharacterized protein LOC100900769 [Metaseiulus
occidentalis]
Length = 748
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 55/139 (39%), Positives = 86/139 (61%), Gaps = 1/139 (0%)
Query: 275 KKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEI 334
+ + + +++ GV A I+ GE +IEY GE+I L ++R K RG+ + YM +I
Sbjct: 610 QTVGVFRSKIHRRGVFAKRAIDAGEMVIEYSGELIRAVLTDKREQLYKARGI-DCYMFKI 668
Query: 335 RKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFV 394
+D +DAT GN +RF+NHSCDPNC + ++ G+ + +++ + IK GE LTYDY+F
Sbjct: 669 DEDEVVDATMHGNAARFINHSCDPNCYSKCIEIFGKKHIVIYSQKRIKVGEELTYDYKFP 728
Query: 395 QFGPEVKCYCGASSCQGYL 413
+ +V C CGA C+ YL
Sbjct: 729 KEDVKVPCTCGARKCRRYL 747
>gi|328778989|ref|XP_395451.4| PREDICTED: hypothetical protein LOC411985 isoform 1 [Apis mellifera]
Length = 1503
Score = 112 bits (280), Expect = 4e-22, Method: Composition-based stats.
Identities = 53/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+++ + + R + G+
Sbjct: 1357 NQLKFRK-KQLKFAKSGIHDWGLFAMEPIAADEMVIEYVGQMVRPVVADLRESQYEATGI 1415
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I D IDAT GN +RF+NHSC+PNC + +E + ++ +++ + I E
Sbjct: 1416 GSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEE 1475
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1476 ITYDYKFPLEDDKIPCLCGAPQCRGTL 1502
>gi|198454568|ref|XP_002137902.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
gi|198132853|gb|EDY68460.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
Length = 1755
Score = 112 bits (280), Expect = 4e-22, Method: Composition-based stats.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G++I + + R + G+
Sbjct: 1609 NQLKFRK-KQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGI 1667
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I + IDAT GN +RF+NHSC+PNC + +E E ++ +++ + I E
Sbjct: 1668 GSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEE 1727
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1728 ITYDYKFPLEDEKIPCLCGAQGCRGTL 1754
>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
Length = 4387
Score = 112 bits (280), Expect = 4e-22, Method: Composition-based stats.
Identities = 61/154 (39%), Positives = 84/154 (54%), Gaps = 13/154 (8%)
Query: 271 FRKEKK-----IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKY-- 323
FR+ K+ + + ++ G G+ I +G+ +IEY GEVI L +QR + KY
Sbjct: 4235 FRQLKETSKASVGVYRSHIHGRGLFCKRDIEEGDMVIEYAGEVIRAVLADQR--EKKYEA 4292
Query: 324 ----RGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
RGV YM I + +DAT KGN +RF+NHSCDPNC + G + +FA R
Sbjct: 4293 MSGRRGVGGCYMFRIDDNLVVDATLKGNAARFINHSCDPNCYSRVVDIHGHKHILIFALR 4352
Query: 380 SIKAGEPLTYDYRFVQFGPEVKCYCGASSCQGYL 413
I GE LTYDY+F ++ C CGA C+ YL
Sbjct: 4353 RITIGEELTYDYKFPFEEVKIPCTCGAKKCRKYL 4386
>gi|168031561|ref|XP_001768289.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
gi|162680467|gb|EDQ66903.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
Length = 1655
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/148 (36%), Positives = 89/148 (60%), Gaps = 13/148 (8%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++K +++ WG+ AAEPI+ EF+IEY+GEVI + + + R + G+ + YM
Sbjct: 1488 RKKQLKFQRSKIHDWGLVAAEPIDAEEFVIEYVGEVIRNRVTDIREKRYEAIGIGSSYMF 1547
Query: 333 EIRKDFTI-------------DATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAAR 379
+ + T+ DAT +G +RF+NHSCDPNC + VEG+ +V +++ +
Sbjct: 1548 RVDDEHTLNTFIVLDHVPSQLDATRRGGLARFINHSCDPNCYTKIITVEGQKKVVIYSKQ 1607
Query: 380 SIKAGEPLTYDYRFVQFGPEVKCYCGAS 407
I GE LTYDY+F ++ C+CGA+
Sbjct: 1608 RIVPGEELTYDYKFSLEEVKIPCFCGAA 1635
>gi|50293843|ref|XP_449333.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637287|sp|Q6FKB1.1|SET1_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49528646|emb|CAG62307.1| unnamed protein product [Candida glabrata]
Length = 1111
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 92/169 (54%), Gaps = 6/169 (3%)
Query: 252 QYISCSKACHCSET---CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Q I KA +E+ N+ +++K + ++ WG+ A EPIN E +IEY+GE
Sbjct: 943 QNIEAQKAATGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPINAKEMVIEYVGER 1002
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I + E R G+ + Y+ I + IDAT KG +RF+NH C+P+C + +V
Sbjct: 1003 IRQPVAEMRERRYIKNGIGSSYLFRIDEHTVIDATKKGGIARFINHCCEPSCTAKIIKVG 1062
Query: 369 GETRVGVFAARSIKAGEPLTYDYRF---VQFGPEVKCYCGASSCQGYLG 414
G+ R+ ++A R I A E LTYDY+F + C CGA SC+G+L
Sbjct: 1063 GKRRIVIYALRDIAANEELTYDYKFERETDAEERLPCLCGAPSCKGFLN 1111
>gi|301624151|ref|XP_002941371.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Xenopus
(Silurana) tropicalis]
Length = 3106
Score = 112 bits (280), Expect = 4e-22, Method: Composition-based stats.
Identities = 60/176 (34%), Positives = 99/176 (56%), Gaps = 12/176 (6%)
Query: 254 ISCSKACHCSETCNNRPFRKEKK--IKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDD 311
I CS C + C+NR F+K++ ++ + TE GWG+ AA+ + F++EY GEV+D
Sbjct: 2041 IECSSRCPNGDYCSNRSFQKKQHAGVEAILTEKKGWGLRAAKDLKSNTFVLEYCGEVLDH 2100
Query: 312 ALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSC-------DPNCILEK 364
+ R+ + ++Y ++ D + ++ + N S N C +PN + +K
Sbjct: 2101 KEFKSRVKEYARNKNIHYYFMALKND-EVSSSGQRNLSP-TNLPCPRSSSWVNPNTVSQK 2158
Query: 365 WQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEV-KCYCGASSCQGYLGTKRKI 419
W V G+ RVG F R + AG LT+DY+F ++G E KCYCGA +C+G LG + ++
Sbjct: 2159 WTVNGQVRVGFFTTRVVPAGSELTFDYQFQRYGKEAQKCYCGAPNCRGILGGENRV 2214
>gi|321460437|gb|EFX71479.1| Mes-4-like protein [Daphnia pulex]
Length = 706
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 85/140 (60%), Gaps = 10/140 (7%)
Query: 286 GWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFK 345
GWG+++ I KG+F++EY+GE+ID+ +RL + + +Y + + IDA
Sbjct: 416 GWGLKSHADIKKGDFVVEYVGEIIDNNEFRRRLKAKQDAQDEAYYFLTLDNNRMIDAGPS 475
Query: 346 GNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVK---- 401
GNF+RF+NHSC PNC +KW V G+TR+G+FA I A LT++Y+ ++ +V
Sbjct: 476 GNFARFMNHSCQPNCETQKWTVLGDTRIGLFAVVDIPAHAELTFNYQ-LECAQDVANESN 534
Query: 402 -----CYCGASSCQGYLGTK 416
C+CGA +C G++G K
Sbjct: 535 QRQQPCHCGAPNCAGFIGAK 554
>gi|452980621|gb|EME80382.1| hypothetical protein MYCFIDRAFT_204567 [Pseudocercospora fijiensis
CIRAD86]
Length = 1200
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 88/146 (60%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K +K ++ GWG+ A E I + IIEY+GE + + + R +GV + Y+
Sbjct: 1055 KRKKLVKFDRSAIHGWGLYAEENIAVNDLIIEYVGEKVRQKVADMREIKYDKQGVGSSYL 1114
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
+ D +DAT KG +RF+NHSCDPNC + +VEG R+ ++A + I + LTYDY
Sbjct: 1115 FRMIDDEIVDATKKGGIARFINHSCDPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDY 1174
Query: 392 RFV-QFGP--EVKCYCGASSCQGYLG 414
+F ++G + C CG+++C+G+L
Sbjct: 1175 KFEREYGSTDRIPCLCGSANCKGFLN 1200
>gi|403159096|ref|XP_003890756.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
gi|375166585|gb|EHS63201.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
Length = 1502
Score = 112 bits (280), Expect = 4e-22, Method: Composition-based stats.
Identities = 50/141 (35%), Positives = 84/141 (59%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++K ++ WG+ A E I GE +IEY+GEVI A+ ++R + G+ + Y+
Sbjct: 1361 RKKQLKFSRSPIHDWGLYAMETIPAGEMVIEYVGEVIRQAVADRREKLYERMGIGSSYLF 1420
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D +DAT KGN R +NH C PNC + + GE ++ ++A +I+ G+ +TYDY
Sbjct: 1421 RVDDDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTIELGDEVTYDYH 1480
Query: 393 FVQFGPEVKCYCGASSCQGYL 413
F + ++ C CG+ C+G L
Sbjct: 1481 FPKEEVKIPCLCGSVKCKGTL 1501
>gi|332016392|gb|EGI57305.1| Histone-lysine N-methyltransferase SETD1B [Acromyrmex echinatior]
Length = 1513
Score = 112 bits (279), Expect = 4e-22, Method: Composition-based stats.
Identities = 53/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+++ + + R + G+
Sbjct: 1367 NQLKFRK-KQLKFAKSGIHDWGLFAMEPIAADEMVIEYVGQMVRPVVADLREAQYEATGI 1425
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I D IDAT GN +RF+NHSC+PNC + +E + ++ +++ + I E
Sbjct: 1426 GSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEE 1485
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1486 ITYDYKFPLEDDKIPCLCGAPQCRGTL 1512
>gi|297791815|ref|XP_002863792.1| hypothetical protein ARALYDRAFT_494795 [Arabidopsis lyrata subsp.
lyrata]
gi|297309627|gb|EFH40051.1| hypothetical protein ARALYDRAFT_494795 [Arabidopsis lyrata subsp.
lyrata]
Length = 1414
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 85/134 (63%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K+++ +++ WG+ A EPI +F+IEY+GE+I ++ E R + G+ + Y+
Sbjct: 1251 RKKRLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSISEIRERQYEKMGIGSSYLF 1310
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I AGE ++Y+Y+
Sbjct: 1311 RLDDGYVVDATKRGGIARFINHSCEPNCYTKIISVEGKKKIFIYAKRHIDAGEEISYNYK 1370
Query: 393 FVQFGPEVKCYCGA 406
F ++ C CGA
Sbjct: 1371 FPLEDDKIPCNCGA 1384
>gi|71015569|ref|XP_758824.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
gi|74702458|sp|Q4PB36.1|SET1_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46098614|gb|EAK83847.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
Length = 1468
Score = 112 bits (279), Expect = 4e-22, Method: Composition-based stats.
Identities = 52/144 (36%), Positives = 85/144 (59%), Gaps = 3/144 (2%)
Query: 273 KEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYMC 332
++K++K K+ WG+ A E I G+ +IEY+GEV+ + ++R + +G + Y+
Sbjct: 1325 RKKQLKFAKSPIHDWGLYAMELIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLF 1384
Query: 333 EIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYR 392
+ D +DAT KGN +R +NH C PNC + + GE R+ +FA +I+AGE LTYDY+
Sbjct: 1385 RVDDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYK 1444
Query: 393 FVQFGPE---VKCYCGASSCQGYL 413
F + + C CG+ C+ +L
Sbjct: 1445 FQSSADDEDAIPCLCGSPGCRRFL 1468
>gi|449295340|gb|EMC91362.1| hypothetical protein BAUCODRAFT_80239 [Baudoinia compniacensis UAMH
10762]
Length = 1279
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 89/149 (59%), Gaps = 3/149 (2%)
Query: 268 NRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQ 327
N+ +++K +K ++ GWG+ A E I + IIEY+GE + + + R + +GV
Sbjct: 1130 NQLKKRKKLVKFDRSAIHGWGLYAEENITVNDLIIEYVGEKVRQKVADLRELRYEKQGVG 1189
Query: 328 NFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPL 387
+ Y+ + D +DAT KG +RF+NHSC PNC + +VEG R+ ++A + I E L
Sbjct: 1190 SSYLFRMMDDEIVDATKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNEEL 1249
Query: 388 TYDYRFV-QFGP--EVKCYCGASSCQGYL 413
TYDY+F ++G + C CG ++C+G+L
Sbjct: 1250 TYDYKFEREYGSTDRIPCLCGTANCKGFL 1278
>gi|341876597|gb|EGT32532.1| hypothetical protein CAEBREN_21874 [Caenorhabditis brenneri]
Length = 1048
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 100/171 (58%), Gaps = 17/171 (9%)
Query: 256 CSKACHCSETCNNRPFRKEKKIKIVKTEFCGW----GVEAAEPINKGEFIIEYIGEVIDD 311
C +C + C N+ F K KK V+ G G+ A + I G+FIIEYIGEV++
Sbjct: 666 CPSSCRAKK-CKNQRFAK-KKYAAVEPFHTGTAKGCGLRAVKDIKAGKFIIEYIGEVLER 723
Query: 312 ALCEQRLWDMKY---RGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
E+R +KY + ++ Y+C+ +TIDAT GN SRF+NHSCDPN + EKW V
Sbjct: 724 EDYEKR--KVKYAADKKHKHHYLCDTGV-YTIDATEFGNPSRFVNHSCDPNAVCEKWSVP 780
Query: 369 GE----TRVGVFAARSIKAGEPLTYDYRFVQFGPEVK-CYCGASSCQGYLG 414
+R+G FA R IKAGE + +DY+FV +G + + C+CG C ++G
Sbjct: 781 KTPGDISRIGFFAKRFIKAGEEICFDYQFVNYGRDAQPCFCGTPQCNKWIG 831
>gi|157127309|ref|XP_001654916.1| hypothetical protein AaeL_AAEL010807 [Aedes aegypti]
gi|108872954|gb|EAT37179.1| AAEL010807-PA [Aedes aegypti]
Length = 1670
Score = 112 bits (279), Expect = 4e-22, Method: Composition-based stats.
Identities = 53/147 (36%), Positives = 87/147 (59%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+++ ++ + R + G+
Sbjct: 1524 NQLKFRK-KQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMVRPSVADLRETKYEAIGI 1582
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I + IDAT GN +RF+NHSC+PNC + +E E ++ +++ ++I E
Sbjct: 1583 GSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQAIGINEE 1642
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1643 ITYDYKFPLEDEKIPCLCGAQGCRGTL 1669
>gi|149243887|ref|XP_001526541.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146448935|gb|EDK43191.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1156
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 84/146 (57%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 1011 KRKKPVTFARSSIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLRTGIGSSYL 1070
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NH C P+C + +V+G+ R+ ++A R I+A E LTYDY
Sbjct: 1071 FRIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDY 1130
Query: 392 RF---VQFGPEVKCYCGASSCQGYLG 414
+F ++C CGA C+G+L
Sbjct: 1131 KFERETNDDERIRCLCGAPGCKGFLN 1156
>gi|383849246|ref|XP_003700256.1| PREDICTED: uncharacterized protein LOC100875701 [Megachile rotundata]
Length = 1503
Score = 112 bits (279), Expect = 5e-22, Method: Composition-based stats.
Identities = 53/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G+++ + + R + G+
Sbjct: 1357 NQLKFRK-KQLKFAKSGIHDWGLFAMEPIAADEMVIEYVGQMVRPVVADLRESQYEATGI 1415
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I D IDAT GN +RF+NHSC+PNC + +E + ++ +++ + I E
Sbjct: 1416 GSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEE 1475
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1476 ITYDYKFPLEDDKIPCLCGAPQCRGTL 1502
>gi|195156904|ref|XP_002019336.1| GL12290 [Drosophila persimilis]
gi|194115927|gb|EDW37970.1| GL12290 [Drosophila persimilis]
Length = 1548
Score = 112 bits (279), Expect = 5e-22, Method: Composition-based stats.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G++I + + R + G+
Sbjct: 1402 NQLKFRK-KQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGI 1460
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I + IDAT GN +RF+NHSC+PNC + +E E ++ +++ + I E
Sbjct: 1461 GSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEE 1520
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1521 ITYDYKFPLEDEKIPCLCGAQGCRGTL 1547
>gi|150866258|ref|XP_001385792.2| histone methyltransferase involved in gene regulation
[Scheffersomyces stipitis CBS 6054]
gi|149387514|gb|ABN67763.2| histone methyltransferase involved in gene regulation
[Scheffersomyces stipitis CBS 6054]
Length = 1055
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 82/146 (56%), Gaps = 3/146 (2%)
Query: 272 RKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGVQNFYM 331
+++K + ++ WG+ A EPI E IIEY+GE I + E R G+ + Y+
Sbjct: 910 KRKKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQQVAEHREKSYLKTGIGSSYL 969
Query: 332 CEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDY 391
I ++ IDAT KG +RF+NH C P+C + +V+ + R+ ++A R I A E LTYDY
Sbjct: 970 FRIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVDNQKRIVIYALRDIDANEELTYDY 1029
Query: 392 RF---VQFGPEVKCYCGASSCQGYLG 414
+F ++C CGA C+GYL
Sbjct: 1030 KFERETNDAERIRCLCGAPGCKGYLN 1055
>gi|66357648|ref|XP_626002.1| protein with 4 PHD domains plus a SET domain and associated
cysteine cluster at the C-terminus [Cryptosporidium
parvum Iowa II]
gi|46227222|gb|EAK88172.1| protein with 4 PHD domains plus a SET domain and associated
cysteine cluster at the C-terminus [Cryptosporidium
parvum Iowa II]
Length = 1004
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/186 (35%), Positives = 91/186 (48%), Gaps = 35/186 (18%)
Query: 267 NNRPFRKEKKI-----KIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDM 321
NN+ +KKI K++ G+G+ I K FIIEY+GE++ +R+
Sbjct: 530 NNKFLDNQKKIILKNLKVIDAGEKGFGITTNMTIPKDTFIIEYVGEILTRENYLKRVEKY 589
Query: 322 KYRGVQ------------------------------NFYMCEIRKDFTIDATFKGNFSRF 351
K R ++ ++Y EI D+ ID+T KGN SR
Sbjct: 590 KERELESRKKSIIMDYYKEDHEFNEDFVLPKDTRERHWYCMEIGNDYIIDSTNKGNLSRL 649
Query: 352 LNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEPLTYDYRFVQFGPEVKCYCGASSCQG 411
+NHSCDPNCI +KW V E RVG+F+ R I E LTYDY F F KC C + SC+G
Sbjct: 650 INHSCDPNCIAQKWLVGNECRVGIFSKREILPNEELTYDYSFTAFDIGFKCKCNSPSCKG 709
Query: 412 YLGTKR 417
+G +
Sbjct: 710 RIGIEN 715
>gi|367010698|ref|XP_003679850.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
gi|359747508|emb|CCE90639.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
Length = 1019
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 95/169 (56%), Gaps = 6/169 (3%)
Query: 252 QYISCSKACHCSET---CNNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEV 308
Q I +A +E+ N+ +++K + ++ WG+ A EPI E IIEY+GE
Sbjct: 851 QDIEAQRAAIGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGER 910
Query: 309 IDDALCEQRLWDMKYRGVQNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVE 368
I + E R +G+ + Y+ + ++ IDAT KG +RF+NH CDP+C + +V
Sbjct: 911 IRQPVAEMREIRYIKKGIGSSYLFRVDENTVIDATKKGGIARFINHCCDPSCTAKIIKVG 970
Query: 369 GETRVGVFAARSIKAGEPLTYDYRFVQFGPE---VKCYCGASSCQGYLG 414
G+ R+ ++A R I A E LTYDY+F + + + C CGA +C+G+L
Sbjct: 971 GKKRIVIYALRDIAANEELTYDYKFERETDDEERLPCLCGAPTCKGFLN 1019
>gi|195388606|ref|XP_002052970.1| GJ23622 [Drosophila virilis]
gi|194151056|gb|EDW66490.1| GJ23622 [Drosophila virilis]
Length = 1687
Score = 112 bits (279), Expect = 5e-22, Method: Composition-based stats.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 1/147 (0%)
Query: 267 NNRPFRKEKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDALCEQRLWDMKYRGV 326
N FRK K++K K+ WG+ A EPI E +IEY+G++I + + R + G+
Sbjct: 1541 NQLKFRK-KQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGI 1599
Query: 327 QNFYMCEIRKDFTIDATFKGNFSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKAGEP 386
+ Y+ I + IDAT GN +RF+NHSC+PNC + +E E ++ +++ + I E
Sbjct: 1600 GSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEE 1659
Query: 387 LTYDYRFVQFGPEVKCYCGASSCQGYL 413
+TYDY+F ++ C CGA C+G L
Sbjct: 1660 ITYDYKFPLEEEKIPCLCGAQGCRGTL 1686
>gi|302814007|ref|XP_002988688.1| hypothetical protein SELMODRAFT_128419 [Selaginella moellendorffii]
gi|300143509|gb|EFJ10199.1| hypothetical protein SELMODRAFT_128419 [Selaginella moellendorffii]
Length = 206
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/182 (36%), Positives = 95/182 (52%), Gaps = 24/182 (13%)
Query: 256 CSKACHCSETCNNRPFRK--EKKIKIVKTEFCGWGVEAAEPINKGEFIIEYIGEVIDDAL 313
C+ +C CSE C NR ++ K+++ KT GW V AA+ I++G F+ EY+GEV++D
Sbjct: 24 CNTSCMCSEECPNRVLQRGVNVKLEVFKTRHKGWAVRAAQNISRGTFVCEYLGEVLNDQE 83
Query: 314 CEQRLWDMKYRGVQNFYMCEIR-----------------KDFTIDATFKGNFSRFLNHSC 356
+R G Y ++ K F IDAT GN +RF+NHSC
Sbjct: 84 ANRRGERYDQVGCSYLYDIDVHLNTGGRSRRGPSRVPRIKPFVIDATKHGNVARFINHSC 143
Query: 357 DPNCILEKWQVEGE----TRVGVFAARSIKAGEPLTYDYRFVQF-GPEVKCYCGASSCQG 411
PN + + VE +G+FA+R I GE L+YDYR+ G C+CG+S C+G
Sbjct: 144 SPNLVNYQVLVESMDYQLAHIGLFASRDILCGEELSYDYRYKLLPGRGCPCHCGSSGCRG 203
Query: 412 YL 413
L
Sbjct: 204 RL 205
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.137 0.456
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,339,781,991
Number of Sequences: 23463169
Number of extensions: 314762562
Number of successful extensions: 593032
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3330
Number of HSP's successfully gapped in prelim test: 1872
Number of HSP's that attempted gapping in prelim test: 578231
Number of HSP's gapped (non-prelim): 8224
length of query: 443
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 297
effective length of database: 8,933,572,693
effective search space: 2653271089821
effective search space used: 2653271089821
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 78 (34.7 bits)