BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 042571
(936 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255539394|ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
gi|223551463|gb|EEF52949.1| set domain protein, putative [Ricinus communis]
Length = 1258
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/863 (51%), Positives = 555/863 (64%), Gaps = 64/863 (7%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
KS G+I+ FWGSY +VC++LFD+CM+VMWNAVF D +A+YS++WR+RKLWS I PA
Sbjct: 448 KSVGTIDNFWGSYAVVCRILFDYCMEVMWNAVFYDAIADYSNSWRRRKLWSARSNIRLPA 507
Query: 146 SDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQK 205
S KD +E+ S +V ++ +N Q ++LS + V E SK
Sbjct: 508 S-IKDYGGEIEKLSSE-------------LELVCLKKDNHAQSHNLSPFLHVRERASKLN 553
Query: 206 NLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKEDV 265
LS H ++ IL+ V+NEL++STK ++EYVE L++ EV K+V S+ + E+
Sbjct: 554 ALS---HKAYRGIRRILEYVKNELHMSTKPFFSEYVEFLIDKEVGKIVRVSEDDKLNEET 610
Query: 266 VDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLS----KDLLSN 321
V+ S T ++ + DS + + E +S+D++SL QAGKPL +DL SN
Sbjct: 611 VESFSRRCQTTDYSSSEFQDELTTDSVKLNVE--TSDDTQSLVQAGKPLGSLAPEDLFSN 668
Query: 322 ILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGE 381
+A AF +S VD V+ + DEP PPGF D+ R LVPS KF+ + +E K+ E
Sbjct: 669 FVASAFAKSQVD-VDFVMVDQNIDEPPPPGFGDNARTLVPSPIHKFRPTQPEESIPKIRE 727
Query: 382 YVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHNEH 441
YVA+A+CRQKLH V+ EWKS F+D L QFL +++ C+ K G SNA+ +H
Sbjct: 728 YVAMAICRQKLHDDVLSEWKSFFIDGILNQFLRSIHTLRQHCQP--GSKMGGTSNANKDH 785
Query: 442 HGDTSTVVDKLKEGSKRFHSSEA---STMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAF 498
+G T + KLK G++ F+SS++ S++ +KYTY+RKKKL+RKK GS S V+
Sbjct: 786 NGTALTSLYKLK-GTREFNSSDSAGVSSVCDKYTYYRKKKLVRKKLGSSSQSITPVDTGL 844
Query: 499 QTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIG 558
Q VEK +KQ V D+ V +K G+ +L D
Sbjct: 845 QHHPVEKLQKQNVVKDIEVEPVVATLKKKKQKKGQTELSD-------------------- 884
Query: 559 KNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVP----KPSGE 614
A S VKS LPS S AK+ QKV+K AV R + V KP+ +
Sbjct: 885 --------DRRAIKSIVKSSLPSDQSMAKNGTHQKVIKYKHAVPRPSINVTIDTIKPNRK 936
Query: 615 MLSTLSADGNDVGKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLEL-HA 673
S +S D V KV H+ GIE+ D SK K ATK SK KRK + DG + H
Sbjct: 937 NSSDVSKDHAKVKKVSDSNNHDGGIEEVPTHDYSK-KNLATKISKLKRKHSADGRSVSHP 995
Query: 674 TKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASP 733
K LKV +KQAASRQV K K+ KSR SN CPRSDGCARSSI+GWEWHKWS +ASP
Sbjct: 996 MKFLKVTTSGSKQAASRQVTAGKAKSRKSRASNSCPRSDGCARSSITGWEWHKWSHSASP 1055
Query: 734 AERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVK 793
A+RARVRG +H Y E SQ +NGK LSARTNRVK+RNLLAAAEGA+LLKA+Q+K
Sbjct: 1056 ADRARVRGIHCLHANYSVSEAYTSQLSNGKVLSARTNRVKMRNLLAAAEGADLLKATQLK 1115
Query: 794 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
ARKKRLRFQ+SKIHDWGLVALEPIEAEDFVIEYVGELIR +ISDIRE YEKMGIGSSYL
Sbjct: 1116 ARKKRLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERLYEKMGIGSSYL 1175
Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
FRLDDGYVVDATKRGG+ARFINHSC PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY
Sbjct: 1176 FRLDDGYVVDATKRGGVARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 1235
Query: 914 KFPLEEKKIPCYCGSKKCHGSLN 936
KFPLEEKKIPC CGS+KC GSLN
Sbjct: 1236 KFPLEEKKIPCNCGSRKCRGSLN 1258
>gi|224061805|ref|XP_002300607.1| SET domain protein [Populus trichocarpa]
gi|222842333|gb|EEE79880.1| SET domain protein [Populus trichocarpa]
Length = 1390
Score = 730 bits (1885), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 443/917 (48%), Positives = 553/917 (60%), Gaps = 105/917 (11%)
Query: 81 LLLAQKSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPK 140
L + KS G+ + FWGSY ++C+ L D+CM+VMWNAVF D +AEY+ +WRK KLW HP
Sbjct: 419 LPRSTKSVGNADDFWGSYAVICRCLSDYCMEVMWNAVFYDTIAEYTISWRKSKLWFHHPY 478
Query: 141 ITGPASDYKDDRKRMEQAPSR-----HDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSV 195
+ ++E+ PS +S S DCPPGF +++ ++++ V S
Sbjct: 479 LC----------MKIEELPSETYFSGQESPASSVDCPPGFELLKTKSDHTVPSSITSSCA 528
Query: 196 PVGENLSKQKNLS---CNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKV 252
VG+ +Q +LS C D DD+KCIL+ V EL+ STK + EYVEILV+++V+K+
Sbjct: 529 HVGQEPCEQNSLSFKDCPD----DDMKCILESVAYELHKSTKVSLLEYVEILVKEKVKKL 584
Query: 253 VSASKGINMKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGK 312
V+ S+ + E++ D S + G ++ IDSN+ AEI S +S Q K
Sbjct: 585 VNFSEDKRLNEEIFDFSIPSSQASEYGSIEMKDEKMIDSNQIPAEIMFSSKPQSSLQVQK 644
Query: 313 PL----SKDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQ 368
S++ +SN LA AFKR V N +D+ D P PPGF+D+ L PS KF+
Sbjct: 645 SFFPFQSENEISNFLAIAFKRLRPSVV-NAIDDENIDGPPPPGFKDTA--LFPSAINKFR 701
Query: 369 FSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGN 428
S S + T K+G YV IAMC QKLH V+ WKS+FVD+ L + L C+ ++ E N
Sbjct: 702 PSKSLKLTPKVGAYVTIAMCMQKLHDDVLNVWKSIFVDEILHRSPRLCCSSEKHTEPGIN 761
Query: 429 EKAEGASNAHNEHHGDTSTVVDKLKEGSKRFHSSEASTMV---EKYTYHRKKKLLRKKFG 485
E EGA K EGS +FHS ++S + KYTYHRK+KL+ KK G
Sbjct: 762 E--EGAF---------------KFTEGSNKFHSPDSSVLSLVSGKYTYHRKRKLVGKKLG 804
Query: 486 SPSNCSNSVENAFQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKK-- 543
S S+ + +V++ + VEKSRKQ V DV EN VQP KK G+ +DA K
Sbjct: 805 SSSHSTTTVDSGLLKQPVEKSRKQDVLSDVSENVVVQPVKTPKKK-GQASSVDAKPLKAT 863
Query: 544 IGANKFTSVPSKMIGKNKVTAESSASAGSSK--VKSKLPSGYSSAKSTISQKVMKVTSAV 601
I + + P K AESS + G SK VKS L S K+ +KVMK+ AV
Sbjct: 864 IAESSVNARPLKA-----TIAESSVNVGPSKAAVKSTLKRDQSLPKNISRRKVMKIARAV 918
Query: 602 QRDKVPVP--KPSGEMLSTLSADGNDVG-----------KVVRGKAHNVGIEKDSILDSS 648
DK K S +++ + +G D G K + + K ++ S
Sbjct: 919 NDDKDAKDSIKTSRDVVGLIDCNGRDAGIKKSGTTECSKKTLNSTKVSNSKRKSTVDGGS 978
Query: 649 KSKP--------------------------------NATKESKQKRKRTMDGLEL-HATK 675
S P ATK SK KRK T++G + H K
Sbjct: 979 VSHPMKILKVENDVNKQAATGQVMARKTKSDHVFLCTATKVSKLKRKSTVNGGSVSHPMK 1038
Query: 676 ALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAE 735
LKV G KQ A+ Q +KTK+SKSR CPRSDGCARSSI+GWEWH WS+ ASPAE
Sbjct: 1039 ILKVENGANKQTATGQFTARKTKSSKSRMLIPCPRSDGCARSSINGWEWHAWSVKASPAE 1098
Query: 736 RARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKAR 795
RARVRG + +H KY G E ASQ +NGK LSARTNRVKLRNLLAAAEG +LLKA+Q+KAR
Sbjct: 1099 RARVRGVRCIHAKYSGSEAYASQLSNGKVLSARTNRVKLRNLLAAAEGVDLLKATQLKAR 1158
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KKRL FQRSKIHDWGLVALE IEAEDFVIEYVGELIR +ISDIRE YEKMGIGSSYLFR
Sbjct: 1159 KKRLCFQRSKIHDWGLVALESIEAEDFVIEYVGELIRPQISDIRERLYEKMGIGSSYLFR 1218
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
LDDGYVVDATKRGGIARFINHSC PNCYTKVISVEGQKKIFIYAKR+IAAGEEITYNYKF
Sbjct: 1219 LDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRYIAAGEEITYNYKF 1278
Query: 916 PLEEKKIPCYCGSKKCH 932
PLE+KKIPC CGS+K +
Sbjct: 1279 PLEDKKIPCNCGSRKLY 1295
>gi|356532622|ref|XP_003534870.1| PREDICTED: uncharacterized protein LOC100805708 [Glycine max]
Length = 1213
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/862 (46%), Positives = 524/862 (60%), Gaps = 71/862 (8%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
KS GSIE FW SY +V K+L D+ MQVMWNAVF D +AEY S+WRK+KLWS H K A
Sbjct: 412 KSVGSIENFWWSYAVVRKVLLDYSMQVMWNAVFFDTLAEYLSSWRKKKLWS-HRKPQPSA 470
Query: 146 SDYKDDRKRMEQAP--SRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSK 203
++ +D +++E D+S S+ D FG+ + TE + S S G NL +
Sbjct: 471 NECEDHTEKIESEALVINPDTSESNVDGYNQFGV--LATEKNCPRLFSSSSSLKGGNLLE 528
Query: 204 QKNLSCNDHLLLD---DVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGIN 260
+ +SC L D D+ CIL+ VENEL+ S+K + +Y++ +E EV K++
Sbjct: 529 GQKVSC----LYDNSRDLTCILESVENELHFSSKVSLADYIQSFIEKEVNKLI------- 577
Query: 261 MKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIF---SSEDSKSLFQAGKPLSKD 317
P + + D ++ + EI S + +K+ G+ S +
Sbjct: 578 -------PFPEENKFNEVAVGDTRFSEKLADKTSVKEILNDKSVDPAKAGNSFGESASGN 630
Query: 318 LLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTT 377
+S++ + AFK G+VD+VV+E E D+ PPG E S + + N KF+ S S E
Sbjct: 631 HMSDVFSKAFKE-LCGYVDDVVEE-EIDD-LPPGLEKS-QTVALHYNSKFRPSRSAECNL 686
Query: 378 KMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNA 437
K+ EYVA A+CRQKLH V+ +W+SLF+D +Q +K+ ++DG++K + NA
Sbjct: 687 KITEYVATALCRQKLHDEVLEKWRSLFLDSVPKQVFISSSTIKKHFKSDGHKKRKTV-NA 745
Query: 438 HNEHHGDTSTVVDKLKEGSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENA 497
EH ++ + ++KEG+K SSE ++ KYTY RKK ++ S S N
Sbjct: 746 SKEHLNSATSGLGRVKEGAKS--SSEVPPVIGKYTYCRKKLSRKELISSKSVAENDSRPG 803
Query: 498 FQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMI 557
Q V K RK +GDV E A+V+ ++V GK K+I K + +
Sbjct: 804 KQP--VAKLRKH-FSGDVGEAAEVKIASVIH---GKTKMIKGKKDTTSKGKSSVSVNSSS 857
Query: 558 GKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVPKPSGEMLS 617
+++ S K+ QKV+K + VQ D K + + LS
Sbjct: 858 HNDQL----------------------SLKNKAGQKVLKFSGEVQNDVKDFVKSNVKKLS 895
Query: 618 TLSADGNDVGKVVRGKAHNVGIEKDSILDS-SKSKPNAT-KESKQKRKRTMDGL-ELHAT 674
+ + + K+V+ G K+ + S+ NAT K SK KRK MDG H T
Sbjct: 896 ASTDNSVVMKKIVKSD----GTVKEKVTSHCSREIQNATMKVSKSKRKHQMDGTASSHPT 951
Query: 675 KALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPA 734
K LK++ G A AS+QV + K++KS+ NLCPRSDGCAR+SI GWEWHKWS +ASPA
Sbjct: 952 KVLKISNGGAYLGASKQVTVASRKSAKSKPLNLCPRSDGCARTSIDGWEWHKWSRSASPA 1011
Query: 735 ERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKA 794
+ARVRG V K + E N SQ +NGKGLSARTNRVKLRNLLAAAEGA+LLK Q+KA
Sbjct: 1012 YKARVRGLPCVQNKCIDSENNLSQLSNGKGLSARTNRVKLRNLLAAAEGADLLKVPQLKA 1071
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
RKK LRFQRSKIHDWGL+ALEPIEAEDFVIEY+GELIR +ISDIRE +YEKMGIGSSYLF
Sbjct: 1072 RKKHLRFQRSKIHDWGLLALEPIEAEDFVIEYIGELIRPRISDIRERQYEKMGIGSSYLF 1131
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
RLDDGYVVDATKRGGIARF+NHSC PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK
Sbjct: 1132 RLDDGYVVDATKRGGIARFVNHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1191
Query: 915 FPLEEKKIPCYCGSKKCHGSLN 936
FPLEEKKIPC CGS+KC GSLN
Sbjct: 1192 FPLEEKKIPCNCGSRKCRGSLN 1213
>gi|356558250|ref|XP_003547420.1| PREDICTED: uncharacterized protein LOC100806034 [Glycine max]
Length = 1300
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/858 (46%), Positives = 523/858 (60%), Gaps = 60/858 (6%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
KS GSIE FW S+ +V K+L D+ MQVMWNAVF D +AEY S+WRK+ LWS HPK A
Sbjct: 496 KSVGSIENFWWSFAVVRKVLLDYSMQVMWNAVFFDTLAEYLSSWRKKMLWS-HPKPQPSA 554
Query: 146 SDYKDDRKRMEQAP--SRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSK 203
+ +D +++E DSS S+ D FG+ + TE + S S G NL +
Sbjct: 555 NGCEDHTEKIESEALVFNPDSSESNVDGDNQFGV--LTTEKNCPLLFSSPSSLKGGNLLE 612
Query: 204 QKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKE 263
+ +SC ++ D+ CIL+ VENEL+ S+K + +Y+ VE EV K++
Sbjct: 613 GQKVSC-PYVNSRDLTCILESVENELHFSSKVSLADYIRSFVEKEVNKLIPF-------- 663
Query: 264 DVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPL----SKDLL 319
P + + G +G I +++TS + ++ S +AG S +
Sbjct: 664 ----PEENKFNEVAVGGTHFSG---ILADKTSMKEILNDKSVDPVKAGNSFGESASGNHK 716
Query: 320 SNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKM 379
S+I + AFK G+VD+VV+E E D+ PPG E S + +VP N KF+ S S E K+
Sbjct: 717 SDIFSKAFKE-LCGYVDDVVEE-EIDD-LPPGLEKS-QTVVPHYNSKFRPSRSAESNPKI 772
Query: 380 GEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHN 439
EYVA A+CRQKLH V+ +W+ LF++ +Q +K+ ++DG++K + A +A
Sbjct: 773 TEYVATALCRQKLHDEVLEKWRLLFLNSVPKQVFISSSTVKKHFKSDGHKKRKMA-DASK 831
Query: 440 EHHGDTSTVVDKLKEGSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAFQ 499
EH ++ + ++KEG+K SSE ++ KYTY RKK ++ S S N Q
Sbjct: 832 EHLNSATSGLGRVKEGAKS--SSEVPPVIGKYTYCRKKLSQKELIFSKSVAENDSRTGKQ 889
Query: 500 TEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGK 559
V K RK V+GDV E A+V+ ++S K GK K+I K + +
Sbjct: 890 L--VTKLRKH-VSGDVGEAAEVK---IASAKHGKTKMIKGKKDTSSKGKSSVSVNSSSHN 943
Query: 560 NKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVPKPSGEMLSTL 619
++++ ++ A K + K + V K++ + S M
Sbjct: 944 DQLSLKNKAGRKVLKFSDDV-------KDFVKSNVKKLSVSTNN--------SVGMKKVA 988
Query: 620 SADGNDVGKVVRGKA-HNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALK 678
+DG D V+ K + E ++ ++K K +K KRK MDG H TK LK
Sbjct: 989 KSDGCDRDDTVKEKTTSHCSREIQNVFTATK------KVTKSKRKHQMDGTSSHPTKVLK 1042
Query: 679 VAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERAR 738
++ G A AS+QV + K++KS+ NLCPRSDGCAR+SI GWEWHKWS +ASPA +AR
Sbjct: 1043 ISNGGAYLGASKQVPVASRKSAKSKPLNLCPRSDGCARTSIDGWEWHKWSRSASPAYKAR 1102
Query: 739 VRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKR 798
VRG V K + E N Q +NGKGLSARTNRVKLRNLLAAAEGA+LLK Q+KARKK
Sbjct: 1103 VRGLPCVRNKCIDSENNLFQLSNGKGLSARTNRVKLRNLLAAAEGADLLKVPQLKARKKH 1162
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
LRFQRSKIHDWGLVALEPIEAEDFVIEY+GELIR +ISDIRE +YEKMGIGSSYLFRLDD
Sbjct: 1163 LRFQRSKIHDWGLVALEPIEAEDFVIEYIGELIRPRISDIRERQYEKMGIGSSYLFRLDD 1222
Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE 918
GYVVDATKRGGIARFINHSC PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE
Sbjct: 1223 GYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE 1282
Query: 919 EKKIPCYCGSKKCHGSLN 936
EKKIPC CGS+KC GSLN
Sbjct: 1283 EKKIPCNCGSRKCRGSLN 1300
>gi|449458127|ref|XP_004146799.1| PREDICTED: uncharacterized protein LOC101220062 [Cucumis sativus]
Length = 1289
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 391/885 (44%), Positives = 531/885 (60%), Gaps = 81/885 (9%)
Query: 89 GSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPASDY 148
GSI+ F + ++C+MLFD+ +QV+WNAV D VAEYSSAWR+++ WS P + +S Y
Sbjct: 449 GSIDNFREVHAVICQMLFDYSLQVVWNAVSYDTVAEYSSAWRRKRFWSYRPHYSLASSGY 508
Query: 149 KDDRKRMEQAPS-----RHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSK 203
+D K++E+ P+ R +SS+ +TEN + +SLSVPVG S+
Sbjct: 509 RDRVKKIEKTPAEASLPRKESSLHGVSSLSVSKFKGAQTENCARSAVISLSVPVGHKSSR 568
Query: 204 QKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKE 263
+ SC + +D+K +++ +E EL+ S K + EY++ ++E+EV +AS + + +
Sbjct: 569 PTSHSCCERPK-EDLKWMVEYLEKELHSSAKVSMAEYIQDILEEEVISSCNASTDVKLDK 627
Query: 264 DVVDPSSHDLHTCQC----GFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLSKDLL 319
+D S QC +++ G ++ DSN+T + S E +L LS D
Sbjct: 628 VALDVS------IQCSSTDNYSNSFGELQCDSNDTHGDRNSGELKLALLPEVN-LSNDTA 680
Query: 320 SNILACAFKRSF--------SGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSW 371
N +A + F F ++ + +E PG E+ +PS KF+ S
Sbjct: 681 LNSVANSLYEVFKEICTNEGCAFNEDCAFNEDCNELLAPGLEEHPTFQIPSPACKFRPSS 740
Query: 372 SDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKA 431
S++ +K+ Y+ +A+CRQKLH V+ EW S + DD L+QF++ W K+ C + N
Sbjct: 741 SNKCYSKIEGYIMLAICRQKLHDAVLKEWTSSYKDDLLRQFVSSWIASKKHC--NSNRIV 798
Query: 432 EGASNAHNEHHGDTSTVVDKLKEGSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCS 491
EGA + G+ S V DKL+EGS+RF E+S + YTY+RKK ++K GS S+C+
Sbjct: 799 EGACDG-----GEASKVPDKLREGSERFL--ESSLVTGNYTYYRKKSS-KRKLGS-SDCA 849
Query: 492 NSVENAFQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTS 551
+ + EKSRK+ ++ V E + ++++ K I KNK
Sbjct: 850 TEGSPVVRNQPSEKSRKENISVGVCETTDSEIASLTLKSIAKNKR--------------- 894
Query: 552 VPSKMIGKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRD---KVPV 608
K ++ +++ ++V LPS +SS K+ K +K + V+ D K V
Sbjct: 895 -------KKDLSIKATCKRTCAEVT--LPSSHSSGKTICGTKKLKFSPPVKDDNAKKDSV 945
Query: 609 PKPSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDSILDSSKSKPN--------------- 653
G M+ + N V +V+ VG ++ S
Sbjct: 946 KHGKGRMIGSPLMIKN-VDQVMNKCDRGVGAQEKLCSTSLLLSSLIFSLSLSSFFPGLYL 1004
Query: 654 --ATKESKQKRKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRS 711
A SK KRK+ +D L K L VA +KQAAS++V +K K+ KSR N+ S
Sbjct: 1005 CAAVNVSKIKRKQKVDEASLLGNKVLTVADDFSKQAASKRVVAQKKKSDKSRKLNISIIS 1064
Query: 712 DGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNR 771
DGCARSSI+GWEW +W+L ASPAERAR RG QY ++ +GP+V+ S NGKGLSARTNR
Sbjct: 1065 DGCARSSINGWEWRRWTLKASPAERARNRGFQYFYSDPIGPDVSTSHLLNGKGLSARTNR 1124
Query: 772 VKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELI 831
VKLRNLLAAA+GA+LLKASQ+KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELI
Sbjct: 1125 VKLRNLLAAADGADLLKASQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELI 1184
Query: 832 RLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG 891
R +ISDIRE +YEKMGIGSSYLFRLDDGYVVDATKRGG+ARFINHSC PNCYTKVI+VEG
Sbjct: 1185 RPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGVARFINHSCEPNCYTKVITVEG 1244
Query: 892 QKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
QKKIFIYAKRHI+AGEEITYNYKFPLEEKKIPC C S++C GSLN
Sbjct: 1245 QKKIFIYAKRHISAGEEITYNYKFPLEEKKIPCNCRSRRCRGSLN 1289
>gi|357449257|ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
gi|355483953|gb|AES65156.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
Length = 1232
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 396/864 (45%), Positives = 519/864 (60%), Gaps = 75/864 (8%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
KS GSIE FW SY +V K+LFDH +QVMWNAVF D V E +WRK+K WS HPK
Sbjct: 403 KSVGSIENFWWSYAVVRKVLFDHSLQVMWNAVFFDTVTEVLFSWRKKKYWS-HPKPQSSV 461
Query: 146 SDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLS---VPVGENLS 202
++ KD ++++ + S + + TE D P L LS + +G N++
Sbjct: 462 NESKDSVEKLKSEALALGTGSSVCNVEADIQSGAMATERDCHP-ELLLSPNNIKIG-NIA 519
Query: 203 KQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVV-SASKGINM 261
+ + +SC+ + +D+ IL+ VENEL+ S KA+ +YV +VE EV +++ S K I
Sbjct: 520 EGQRVSCS-YGNSEDLTRILESVENELHCSAKASLADYVRSVVEKEVNQLIPSPEKDIFS 578
Query: 262 KEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGK----PLSKD 317
+ DV D C + + G +TS + S+ S + G P S++
Sbjct: 579 EVDVSD----------CRISKMVAG------KTSVKETLSDKSIDPVKNGDSICVPSSEN 622
Query: 318 LLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTT 377
+S++ + AF+ G +++VVD+ E + PPGFE + + P CN KF+ S + E
Sbjct: 623 HMSDVFSKAFQE-LCGHLNDVVDDEEIGD-LPPGFEKN--SIFPHCNSKFRPSRTVECNP 678
Query: 378 KMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNA 437
K+ EYV A+CRQKLH V+ +WK +D ++ ++ C +K+ ++ G+ K + S A
Sbjct: 679 KITEYVTAALCRQKLHDEVLKDWKLSILDSTFKKVMSS-CTIKKNLQSRGHGKGKSFS-A 736
Query: 438 HNEHHGDTSTVVDKLKEGSKR--------FHSSEASTMVEKYTYHRKKKLLRKKFGSPSN 489
+ EH D + + K+KEG+K SS +EKYTYHRK L RK+ S
Sbjct: 737 NKEHLNDATLGLGKVKEGTKLGLGKVKEGAKSSGVPLAIEKYTYHRKN-LSRKELCSSKP 795
Query: 490 CSNSVENAFQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKF 549
+ +N+ + ++ V+GDV E+A+V+ +A+ K GK K+I
Sbjct: 796 VVD--DNSGPGKKPLAKLRKDVSGDVKESAEVKVTAI---KRGKAKMIKGK--------- 841
Query: 550 TSVPSKMIGKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVP 609
SS + V + PS S K+ QKV K VQ V
Sbjct: 842 -------------KDTSSKKSSPVNVDNSSPSVQLSLKNKTCQKVSKFAHTVQNGVTDVL 888
Query: 610 KPSGEMLSTLSADGNDVG-KVVRGKAHNVGIEKDSILDSSKSKPNATKE-SKQKRKRTMD 667
K + + L L + N VG KVV+ +V I++ + SK K NAT SK KRK D
Sbjct: 889 KSNKKRL--LVSSDNSVGMKVVKRNNTDVTIQRKTTGHISKEKLNATNTVSKSKRKHQPD 946
Query: 668 GL-ELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHK 726
G+ H K LK++ A AS+QV + ++KS++ +LCPRS GCAR+SI GWEWHK
Sbjct: 947 GVTSSHPAKVLKISNSGASLEASKQVTEARRNSAKSKSLDLCPRSIGCARTSIDGWEWHK 1006
Query: 727 WSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAEL 786
WS +ASP RARVRG + K++ E N SQ +N KGLSARTNRVKLRNLLAAAEGA+L
Sbjct: 1007 WSQSASPTSRARVRGLPRLQNKFINSEKNPSQLSNSKGLSARTNRVKLRNLLAAAEGADL 1066
Query: 787 LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
LK Q+KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY+GELIR +ISDIRE +YEKM
Sbjct: 1067 LKVPQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYIGELIRPRISDIREVQYEKM 1126
Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
GIGSSYLFRLDDGYVVDATKRGGIARFINHSC PNCY KVIS EGQKKIFIYAKRHI AG
Sbjct: 1127 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYPKVISFEGQKKIFIYAKRHINAG 1186
Query: 907 EEITYNYKFPLEEKKIPCYCGSKK 930
EEITYNYKFPLEEKKIPC CGSKK
Sbjct: 1187 EEITYNYKFPLEEKKIPCNCGSKK 1210
>gi|115489550|ref|NP_001067262.1| Os12g0613200 [Oryza sativa Japonica Group]
gi|108862955|gb|ABA99391.2| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113649769|dbj|BAF30281.1| Os12g0613200 [Oryza sativa Japonica Group]
Length = 1212
Score = 407 bits (1047), Expect = e-110, Method: Compositional matrix adjust.
Identities = 319/892 (35%), Positives = 437/892 (48%), Gaps = 139/892 (15%)
Query: 101 VCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGP-ASDYKDDRKRMEQAP 159
V + +++ M+ +W+ V D V +Y AW KRK S P ASD + + E +P
Sbjct: 404 VWETIYNKSMKSIWDEVLYDPVMDYCDAWLKRKNESNLLSTVVPGASDNQKMQDTDEMSP 463
Query: 160 SR---HDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQKNLSCNDHLLLD 216
D+ SD D PPGFG + E+ H + V E + S L
Sbjct: 464 KAICDSDAPESDMDFPPGFGPNQESAEHS----HSACVEYVTEKTDGRSGSSIT--LFSG 517
Query: 217 DVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKEDVVDPSSHDLHTC 276
+ + + NELY++ K ++ E ++ +E+ + ++ ++ + H
Sbjct: 518 PLGRVQVMLANELYVAAKEALFQHFEEVISEEITNCLCIGFEDDINQERIRTPVHAPEPS 577
Query: 277 QCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLSKDLLSNILACAFKRSFSGFVD 336
G+ + + AE+ E S A +S D+ + + C + SG
Sbjct: 578 S------PPGISVHETPSPAEMPRDEISDMAEMARDEIS-DMAVDTIPCPADMAASG--T 628
Query: 337 NVVDELETDE----------PS-------------------------PPGFEDSVRKLVP 361
+ V E+ TD+ PS PPG E + LV
Sbjct: 629 STVPEVTTDKLIIPYVEHQSPSASHASIFEKLDAHEEAELDDSFDEVPPGTEAGLASLVI 688
Query: 362 SCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKE 421
K+Q S S + + Y + A RQ LH V+ EW SLF AL W K
Sbjct: 689 MEKNKYQPSKSVDSVLDIYRYTSWAFFRQILHESVMKEWASLF-SGALSNCFDSWYARKN 747
Query: 422 CCEADGNEKAEGASNAHNEHHGDTSTVVDKLKEGSKRFHSSEASTMVEKYTYHRKKKLLR 481
T+ D L+ ++YTY+RK+KL +
Sbjct: 748 IV---------------------AKTMDDTLRP--------------KEYTYYRKRKLRK 772
Query: 482 KKFGSPSNCSNSVENAFQTEHVEKSRKQGVAGDV-FENAKVQPSAVSSKKIGKNKLIDAS 540
NC S E + + + V V +N A S+++ +++
Sbjct: 773 -------NCEASSSEKPMDEQLSRPLRDLVECKVNMKNIHRSSKAGISQRVS---VVEKP 822
Query: 541 SKKIGANKFTSVPSKMIGKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVM---KV 597
SKK + PS N + + S KV + S + ++K +S KV +
Sbjct: 823 SKK------RAKPSHNDNIN-LNIQQDLKLLSDKVPKRNRSSHPTSKPLVSSKVPTEDRT 875
Query: 598 TSAVQRDKVPVPKPSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDS------ILDSSKSK 651
TSA+ P K + L+T + N K V + G E + I+ +K K
Sbjct: 876 TSAM-----PAKKRKQKNLAT---ESNLKTKAVILSPESHGCEAPTEKRTTAIMPVNKRK 927
Query: 652 PNATKESKQKRKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNL---- 707
N + ESK K K L + ++ + S A K+ + + S SNL
Sbjct: 928 KNLSGESKLKAK------PLTSPESYVCEAPIDNRTTSTMPAKKRKQKNLSNESNLKKKP 981
Query: 708 ---CPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKG 764
CP S GCAR+S+SGWEW W+ NA+P+ERA+VRG + V + PE N + + KG
Sbjct: 982 LVLCPESYGCARASVSGWEWRDWARNATPSERAQVRGYR-VRSILSAPENNVLKSSQVKG 1040
Query: 765 LSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
SARTNRVKLRNLLAAAEG +LLK Q K+RKKRLRFQRSKIH+WGLVALE I+AEDFVI
Sbjct: 1041 SSARTNRVKLRNLLAAAEGTDLLKIMQSKSRKKRLRFQRSKIHEWGLVALESIDAEDFVI 1100
Query: 825 EYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
EYVGELIR ++SDIRE +YEK GIGSSYLFRLDD YVVDATKRGG+ARFINHSC+PNCYT
Sbjct: 1101 EYVGELIRRQVSDIREDQYEKSGIGSSYLFRLDDDYVVDATKRGGLARFINHSCDPNCYT 1160
Query: 885 KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
KVI+VEGQKKI IYAKR I AGEE+TYNYKFPLEEKKIPC+CGS++C GS+N
Sbjct: 1161 KVITVEGQKKIVIYAKRRIYAGEELTYNYKFPLEEKKIPCHCGSQRCRGSMN 1212
>gi|357161607|ref|XP_003579145.1| PREDICTED: uncharacterized protein LOC100843412 [Brachypodium
distachyon]
Length = 1194
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 311/902 (34%), Positives = 433/902 (48%), Gaps = 178/902 (19%)
Query: 101 VCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPASDYKDDRKRMEQAPS 160
V + ++ M+ +W+ V D V Y W KR P P + + + M++
Sbjct: 405 VWQTIYYESMKNIWDEVMYDPVMNYCGGWLKRNHQLSLPCTIVPGAP---ENRNMQETDG 461
Query: 161 RHDSSVSDD-------DCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQKNLSCNDHL 213
+ D D PPGFG E+ + P L + V + +S + S +
Sbjct: 462 LSLKVICDPEAIECDMDFPPGFGPSIECPESSLPPLSLDV-VSCADQISGKPKSSST--I 518
Query: 214 LLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKEDVVD------ 267
+ + + + NELY++ K T + ++ +E+ + + ++ +
Sbjct: 519 FIGPLAGVQIMLANELYVAAKETLFHHFREVIAEEITNCLCIGLEESTDQERIRTPVHAP 578
Query: 268 --PSSHDLHTCQCGFA-DVN-----GGMRIDSNETSAEIFSSEDSKSLFQAGK------P 313
PSS +H D++ M ID+ ++SAEI E +F A + P
Sbjct: 579 ELPSSASVHETHISPELDMDKLLNVAKMDIDTMDSSAEIALDE----VFVAAELATDTIP 634
Query: 314 LSKDLLSNILACAFK-------------RSFSGFVDNVVDEL---ETDE------PSPPG 351
D+ + +C K S S ++ ++L E+DE PPG
Sbjct: 635 SPADMATEETSCVAKVATDKMLTSHAEHHSLSASYASIFEKLDVCESDELDESFDEVPPG 694
Query: 352 FEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQ 411
E + +V K+Q S S + Y+ +A+CRQ LH + EW S F+ D + +
Sbjct: 695 METGLASVVLMDKSKYQPSKSVNSVPVISRYITLAVCRQTLHQDAMKEWAS-FLSDTISE 753
Query: 412 FLALWCNMKECCEADGNEKAEGASNAHNEHHGDTSTVVDKLKEGSKRFHSSEASTMVEKY 471
W M+ VV K +GS R +++
Sbjct: 754 CFDSWYTME--------------------------NVVPKNIDGSLR---------PKEF 778
Query: 472 TYHRKKKLLR--------KKFGSPSN------CSNSVENAFQTEHVEKSRKQGVAGDVFE 517
TY+RK+KL + K P + S VE ++V+ SRK
Sbjct: 779 TYYRKRKLRKSSEVVSSKKPMEVPMDEQLSKPLSELVERKAHLKNVQGSRK--------- 829
Query: 518 NAKVQ-PSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSSKVK 576
AK++ PS + +K + D ++++ + PS + K+K + E A ++
Sbjct: 830 GAKLKKPSKIQAKTL------DKEARRLNSKLDLKRPSHDVPKSKSSKEVHALNIEQDLE 883
Query: 577 SKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVPKPSGEMLSTLSADGNDVGKVVRGKAHN 636
+L + KS + + + V+R VPK + L + V K
Sbjct: 884 -RLSNDVPKRKSLKEVRTLNIEQDVKRLSNDVPK---RQRTPLLIKKHLVDK-------K 932
Query: 637 VGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTAKQAASRQVAMKK 696
V +E DS+ ATK K+++ + + K L
Sbjct: 933 VSMENDSV---------ATKLVKKRKHKNISSDSSQKVKPLV------------------ 965
Query: 697 TKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNA 756
LCP SDGCAR+S SGWEW W+ NA+P+ER RVRG Y L N
Sbjct: 966 ----------LCPESDGCARASTSGWEWRDWARNATPSERTRVRG--YRVRSILSTPAN- 1012
Query: 757 SQWANG--KGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
S W N KG SARTNRVKLRNLLAA EGA+LLK +Q KARKKRLRFQRSKIH+WGLVAL
Sbjct: 1013 SLWKNSQVKGTSARTNRVKLRNLLAAYEGADLLKIAQSKARKKRLRFQRSKIHEWGLVAL 1072
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I+AEDFVIEYVGELIR +SDIRE +YEK GIGSSYLFRLDD YVVDATKRGG+ARFI
Sbjct: 1073 ESIDAEDFVIEYVGELIRRPVSDIREAQYEKSGIGSSYLFRLDDDYVVDATKRGGLARFI 1132
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGS 934
NHSC PNCYTKVI+V+GQKKIFIY+KR I AGEE+TYNYKFPLEEKKIPC+CGS +C GS
Sbjct: 1133 NHSCEPNCYTKVITVDGQKKIFIYSKRRIYAGEELTYNYKFPLEEKKIPCHCGSLRCRGS 1192
Query: 935 LN 936
+N
Sbjct: 1193 MN 1194
>gi|157734198|gb|ABV68922.1| SDG25 [Arabidopsis thaliana]
Length = 1388
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 176/228 (77%), Positives = 198/228 (86%), Gaps = 4/228 (1%)
Query: 709 PRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSAR 768
P+SDGCAR+SI+GW WH WSL AS ERARVRG+ VH ++ G + + +Q LSAR
Sbjct: 1165 PKSDGCARTSINGWHWHAWSLKASAEERARVRGSSCVHMQHFGSKSSLTQNV----LSAR 1220
Query: 769 TNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG 828
TNR KLRNLLAAA+GA++LK SQ+KARKK LRFQ+SKIHDWGLVALEPIEAEDFVIEYVG
Sbjct: 1221 TNRAKLRNLLAAADGADVLKMSQLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVG 1280
Query: 829 ELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
ELIR IS+IRE +YEKMGIGSSYLFRLDDGYV+DATKRGGIARFINHSC PNCYTK+IS
Sbjct: 1281 ELIRSSISEIRERQYEKMGIGSSYLFRLDDGYVLDATKRGGIARFINHSCEPNCYTKIIS 1340
Query: 889 VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VEG+KKIFIYAKRHI AGEEI+YNYKFPLE+ KIPC CG+ KC GSLN
Sbjct: 1341 VEGKKKIFIYAKRHIDAGEEISYNYKFPLEDDKIPCNCGAPKCRGSLN 1388
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 166/342 (48%), Gaps = 38/342 (11%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKIT--- 142
KS GSIE F S + VC+ L HCMQ+MWNAVF D VA +SS WRK K+W I+
Sbjct: 435 KSVGSIENFQTSCSAVCRTLHHHCMQIMWNAVFYDTVATHSSCWRKNKIWFRSSDISTVN 494
Query: 143 ---GPASDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGE 199
G + Y D + E R DSS S F +
Sbjct: 495 YCKGSHTKYSDKPESFESFTCRVDSSSSKTAYSDEFDL--------------------AT 534
Query: 200 NLSKQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGI 259
N ++ + LS + + + I + VENEL+LS K T+Y IL++D S+++
Sbjct: 535 NGARVRGLSSDTYGTESVIASISEHVENELFLSLKTHLTDYTSILIKDGANNTTSSARDG 594
Query: 260 NMKED---VVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKP--- 313
M E+ + S + + + NG + + + FS DS+ L Q G+
Sbjct: 595 KMHEENPPSLQGSFREQYNLEGSSKKKNGLNVVPAKLRFSNDFS--DSQRLLQEGESSEQ 652
Query: 314 -LSKDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWS 372
S+D+++NI + A + S V++ +D L EP PPG E ++ +P K+Q S
Sbjct: 653 ITSEDIIANIFSTALETS-DIPVNDELDALAIHEPPPPGCESNIN--MPCLRYKYQPVRS 709
Query: 373 DEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLA 414
E ++ YV++A+CRQKLH V+ +WKSLF+ L +FLA
Sbjct: 710 KESIPEIKAYVSMALCRQKLHNDVMRDWKSLFLKCYLNEFLA 751
>gi|222617469|gb|EEE53601.1| hypothetical protein OsJ_36855 [Oryza sativa Japonica Group]
Length = 1165
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 259/613 (42%), Positives = 334/613 (54%), Gaps = 91/613 (14%)
Query: 335 VDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHA 394
+D+ DE+ PPG E + LV K+Q S S + + Y + A RQ LH
Sbjct: 627 LDDSFDEV------PPGTEAGLASLVIMEKNKYQPSKSVDSVLDIYRYTSWAFFRQILHE 680
Query: 395 IVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHNEHHGDTSTVVDKLKE 454
V+ EW SLF AL W K T+ D L+
Sbjct: 681 SVMKEWASLF-SGALSNCFDSWYARKNIV---------------------AKTMDDTLRP 718
Query: 455 GSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAFQTEHVEKSRKQGVAGD 514
++YTY+RK+KL + NC S E + + + V
Sbjct: 719 --------------KEYTYYRKRKLRK-------NCEASSSEKPMDEQLSRPLRDLVECK 757
Query: 515 V-FENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSS 573
V +N A S+++ +++ SKK + PS N + + S
Sbjct: 758 VNMKNIHRSSKAGISQRVS---VVEKPSKK------RAKPSHNDNIN-LNIQQDLKLLSD 807
Query: 574 KVKSKLPSGYSSAKSTISQKVM---KVTSAVQRDKVPVPKPSGEMLSTLSADGNDVGKVV 630
KV + S + ++K +S KV + TSA+ P K + L+T + N K V
Sbjct: 808 KVPKRNRSSHPTSKPLVSSKVPTEDRTTSAM-----PAKKRKQKNLAT---ESNLKTKAV 859
Query: 631 RGKAHNVGIEKDS------ILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTA 684
+ G E + I+ +K K N + ESK K K L + ++
Sbjct: 860 ILSPESHGCEAPTEKRTTAIMPVNKRKKNLSGESKLKAK------PLTSPESYVCEAPID 913
Query: 685 KQAASRQVAMKKTKASKSRTSNL-------CPRSDGCARSSISGWEWHKWSLNASPAERA 737
+ S A K+ + + S SNL CP S GCAR+S+SGWEW W+ NA+P+ERA
Sbjct: 914 NRTTSTMPAKKRKQKNLSNESNLKKKPLVLCPESYGCARASVSGWEWRDWARNATPSERA 973
Query: 738 RVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
+VRG + V + PE N + + KG SARTNRVKLRNLLAAAEG +LLK Q K+RKK
Sbjct: 974 QVRGYR-VRSILSAPENNVLKSSQVKGSSARTNRVKLRNLLAAAEGTDLLKIMQSKSRKK 1032
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RLRFQRSKIH+WGLVALE I+AEDFVIEYVGELIR ++SDIRE +YEK GIGSSYLFRLD
Sbjct: 1033 RLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQVSDIREDQYEKSGIGSSYLFRLD 1092
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
D YVVDATKRGG+ARFINHSC+PNCYTKVI+VEGQKKI IYAKR I AGEE+TYNYKFPL
Sbjct: 1093 DDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKKIVIYAKRRIYAGEELTYNYKFPL 1152
Query: 918 EEKKIPCYCGSKK 930
EEKKIPC+CGS++
Sbjct: 1153 EEKKIPCHCGSQR 1165
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 4/80 (5%)
Query: 101 VCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGP-ASDYKDDRKRMEQAP 159
V + +++ M+ +W+ V D V +Y AW KRK S P ASD + + E +P
Sbjct: 404 VWETIYNKSMKSIWDEVLYDPVMDYCDAWLKRKNESNLLSTVVPGASDNQKMQDTDEMSP 463
Query: 160 S---RHDSSVSDDDCPPGFG 176
D+ SD D PPGFG
Sbjct: 464 KAICDSDAPESDMDFPPGFG 483
>gi|218187240|gb|EEC69667.1| hypothetical protein OsI_39097 [Oryza sativa Indica Group]
Length = 1167
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 259/613 (42%), Positives = 334/613 (54%), Gaps = 91/613 (14%)
Query: 335 VDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHA 394
+D+ DE+ PPG E + LV K+Q S S + + Y + A RQ LH
Sbjct: 629 LDDSFDEV------PPGTEAGLASLVIMEKNKYQPSKSVDSVLDIYRYTSWAFFRQILHE 682
Query: 395 IVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHNEHHGDTSTVVDKLKE 454
V+ EW SLF AL W K T+ D L+
Sbjct: 683 SVMKEWASLF-SGALSNCFDSWYARKNIV---------------------AKTMDDTLRP 720
Query: 455 GSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAFQTEHVEKSRKQGVAGD 514
++YTY+RK+KL + NC S E + + + V
Sbjct: 721 --------------KEYTYYRKRKLRK-------NCEASSSEKPMDEQLSRPLRDLVECK 759
Query: 515 V-FENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSS 573
V +N A S+++ +++ SKK + PS N + + S
Sbjct: 760 VNMKNIHRSSKAGISQRVS---VVEKPSKK------RAKPSHNDNIN-LNIQQDLKLLSD 809
Query: 574 KVKSKLPSGYSSAKSTISQKVM---KVTSAVQRDKVPVPKPSGEMLSTLSADGNDVGKVV 630
KV + S + ++K +S KV + TSA+ P K + L+T + N K V
Sbjct: 810 KVPKRNRSSHPTSKPLVSSKVPTEDRTTSAM-----PAKKRKQKNLAT---ESNLKTKAV 861
Query: 631 RGKAHNVGIEKDS------ILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTA 684
+ G E + I+ +K K N + ESK K K L + ++
Sbjct: 862 ILSPESHGCEAPTEKRTTAIMPVNKRKKNLSGESKLKAK------PLTSPESYVCEAPID 915
Query: 685 KQAASRQVAMKKTKASKSRTSNL-------CPRSDGCARSSISGWEWHKWSLNASPAERA 737
+ S A K+ + + S SNL CP S GCAR+S+SGWEW W+ NA+P+ERA
Sbjct: 916 NRTTSTMPAKKRKQKNLSNESNLKKKPLVLCPESYGCARASVSGWEWRDWARNATPSERA 975
Query: 738 RVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
+VRG + V + PE N + + KG SARTNRVKLRNLLAAAEG +LLK Q K+RKK
Sbjct: 976 QVRGYR-VRSILSAPENNVLKSSQVKGSSARTNRVKLRNLLAAAEGTDLLKIMQSKSRKK 1034
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RLRFQRSKIH+WGLVALE I+AEDFVIEYVGELIR ++SDIRE +YEK GIGSSYLFRLD
Sbjct: 1035 RLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQVSDIREDQYEKSGIGSSYLFRLD 1094
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
D YVVDATKRGG+ARFINHSC+PNCYTKVI+VEGQKKI IYAKR I AGEE+TYNYKFPL
Sbjct: 1095 DDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKKIVIYAKRRIYAGEELTYNYKFPL 1154
Query: 918 EEKKIPCYCGSKK 930
EEKKIPC+CGS++
Sbjct: 1155 EEKKIPCHCGSQR 1167
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 4/80 (5%)
Query: 101 VCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGP-ASDYKDDRKRMEQAP 159
V + +++ M+ +W+ V D V +Y AW KRK S P ASD + + E +P
Sbjct: 406 VWETIYNKSMKSIWDEVLYDPVMDYCDAWLKRKNESNLLSTVVPGASDNQKMQDTDEMSP 465
Query: 160 S---RHDSSVSDDDCPPGFG 176
D+ SD D PPGFG
Sbjct: 466 KAICDSDAPESDMDFPPGFG 485
>gi|296085637|emb|CBI29431.3| unnamed protein product [Vitis vinifera]
Length = 1127
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 270/701 (38%), Positives = 379/701 (54%), Gaps = 62/701 (8%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
KS G IE F + +VC+ +FD CMQVMWNAVF VAEY S WRKRK WSGHP+I PA
Sbjct: 356 KSVGGIENFQHTCMVVCRTIFDSCMQVMWNAVFYAPVAEYCSTWRKRKRWSGHPRIMHPA 415
Query: 146 SD----YKDDRKRMEQA-----PSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVP 196
+ ++D+ ++ E+ H+ SV + DCPPGFG+V + +Q + LS
Sbjct: 416 VEQAMLFRDNVEKSEKLIDEPLQEEHEYSVCEVDCPPGFGLVMTDQDIHIQS-SVGLSSS 474
Query: 197 VGENLSKQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSAS 256
E + ++ ++ DD++CI++ V+NEL LS K E VE +E+EV ++ +
Sbjct: 475 TVEGIPFKEKRPSDNVQPYDDMQCIVETVQNELQLSAKMMLVECVEAFIEEEVMNLIDSF 534
Query: 257 KGINMKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLSK 316
K +KE D S H + +D+ G+RI+S T AE+ S DS + Q+
Sbjct: 535 KDKKLKEGTSDFSIQCPHANEDASSDMVSGLRIES--TVAEMILSVDSCTPQQSPTDFH- 591
Query: 317 DLLSNILACAFKRSFSGFVDNVVD------ELETDEPSPPGFEDSVRKLVPSCNGKFQFS 370
L N + + F ++ + + + DEP PPGFE + R VPS +F+ S
Sbjct: 592 --LPNNASVSVSEHFMSKLNKLCTTDDVVDDQDIDEPPPPGFEYNSRTFVPSQICRFRPS 649
Query: 371 WSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEK 430
SDE T +GEYVA+A+CRQ+LH V+ EWK L V+ L QF A W K+ C++ G E
Sbjct: 650 SSDECTPIIGEYVALALCRQRLHEDVLQEWKDLLVEGTLDQFFASWWTSKQRCDSTGCE- 708
Query: 431 AEGASNAHNEHHGDTSTVVDKLKEGSKRFHS---SEASTMVEKYTYHRKKKLLRKKFGSP 487
EG SN++ E D+S D+ +E +K HS E S ++ KYTY+RKKKL+RKK GS
Sbjct: 709 -EGVSNSNKEKPCDSSAASDQRRERTKDRHSLGSPELSLVIGKYTYYRKKKLVRKKIGSL 767
Query: 488 SNCSNSVENAFQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGAN 547
S+ + SV++ Q + +EKSRKQ V GDV E +V+ + +KIG N
Sbjct: 768 SHAAASVDSGSQDQLMEKSRKQDVPGDVSEITEVEMGILKRRKIGLN------------- 814
Query: 548 KFTSVPSKMIGKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVP 607
T + ++ + V+S LP SS + +++ K V+ +V
Sbjct: 815 ---------------TCHAEDNSLQAIVQSTLPGDSSSVRIKPNRRSTKCAHVVRNGEVI 859
Query: 608 VPKPSG--EMLSTLSADGNDVGKVVR--GKAHNVGIEKDSILDSSKSKPNATKESKQKRK 663
+ E S + D + V KVV G H+VG K+ D SK +TK SK+KRK
Sbjct: 860 EDDLACGREEASPFAEDCDFVDKVVNSNGNGHDVGNLKELAGDCSKKT-KSTKVSKKKRK 918
Query: 664 RTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWE 723
D + K LK A G AKQ RQVA+ K+K SK +T N C RS GCARSSI+GW+
Sbjct: 919 DLKDVPSSRSAKVLKPANGAAKQDTGRQVAVHKSKFSKFKTLNPCLRSVGCARSSINGWD 978
Query: 724 WHKWSLNASPAERARVRG---AQYVHTKYLGPEVNASQWAN 761
W WSLNASP ERA VRG AQ+ +Y EV +SQ +N
Sbjct: 979 WRNWSLNASPTERAHVRGIHKAQFACDQYFRSEVVSSQLSN 1019
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/75 (86%), Positives = 66/75 (88%)
Query: 862 VDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 921
VDATKRGGIARFINHSC PNCYTKVISVEG+KKIFIYAKR I AGEEITYNYKFPLEEKK
Sbjct: 1020 VDATKRGGIARFINHSCEPNCYTKVISVEGEKKIFIYAKRQITAGEEITYNYKFPLEEKK 1079
Query: 922 IPCYCGSKKCHGSLN 936
IPC CGSK LN
Sbjct: 1080 IPCNCGSKSFRMPLN 1094
>gi|413916020|gb|AFW55952.1| putative SET-domain containing protein family [Zea mays]
Length = 710
Score = 370 bits (949), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 191/313 (61%), Positives = 227/313 (72%), Gaps = 19/313 (6%)
Query: 627 GKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTAKQ 686
K V A+++ I++D L SS+ P + S +K + A + V A
Sbjct: 408 AKTVANDAYDLNIQQDLTLLSSEV-PKRARSSHPTKKHMV------ANRTPTVNDNVANN 460
Query: 687 AA-SRQVAMKKTKASKSRTSN------LCPRSDGCARSSISGWEWHKWSLNASPAERARV 739
+ ++ V KK + S TS CP SDGCAR+SI+GWEW W+ NA+P+ERARV
Sbjct: 461 SMLTKHVKKKKGRNISSETSQKVKPVISCPESDGCARASINGWEWRNWARNATPSERARV 520
Query: 740 RGAQYVHTKYLGPEVNASQWAN--GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
RG H + + N + W N K SARTNRVKLRNLLAAAEG+ELLK +Q+KARKK
Sbjct: 521 RG---YHVRTILSASNDNVWKNSQAKVSSARTNRVKLRNLLAAAEGSELLKITQMKARKK 577
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RLRFQRSKIHDWGLVALE IEAEDFVIEYVG+LI ++SDIRE +YEK GIGSSYLFRLD
Sbjct: 578 RLRFQRSKIHDWGLVALESIEAEDFVIEYVGQLIHRRVSDIRESQYEKSGIGSSYLFRLD 637
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
D +VVDATKRGG+ARFINHSC PNCYTKVI+V+GQKKIFIYAKR I AGEEITYNYKFPL
Sbjct: 638 DDFVVDATKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYAKRRIYAGEEITYNYKFPL 697
Query: 918 EEKKIPCYCGSKK 930
EEKKIPC+CGS++
Sbjct: 698 EEKKIPCHCGSRR 710
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 72/316 (22%), Positives = 115/316 (36%), Gaps = 49/316 (15%)
Query: 156 EQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQKNLSCNDHLLL 215
E +P D++ D D PPGFG + L V L + S L
Sbjct: 28 ELSPKDLDATECDMDFPPGFG--PCWKSPESSLSPSLLEVNGSAKLDGKSESSTT--LFS 83
Query: 216 DDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKEDVVDPSSH---- 271
+ + + NELY+S+K + Y E ++ D++ + N+ ++ + H
Sbjct: 84 GPLAVVQRMLANELYISSKQSLFHYFEEVIADDITNCLCFGLESNIDQEQIGTPIHAPES 143
Query: 272 ------DLHTCQCGFADVNGG------------------MRID--------SNETSAEIF 299
+H ++ GG M +D + + E
Sbjct: 144 PISAETSMHETANSI-EMTGGDVLNIVEIATSTRTSPLEMTVDEKLNTVEATRTSRTEPT 202
Query: 300 SSEDSKSLFQAGKPLSKDLLSNILACAFKRSFSGFVDNVVDELETDE---PSPPGFEDSV 356
S E S A ++ + L+ ++ R F N+ E DE PPG E +
Sbjct: 203 SVETSTVAEMATDKMTTYHVEEHLSMSYARVFEKM--NICKTAELDEKFDEVPPGMETGL 260
Query: 357 RKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALW 416
L +Q S S + Y+ +A+CRQKLH VV EW LF D + + L W
Sbjct: 261 VPLPLMDKNIYQPSKSVNSIPLISRYITLALCRQKLHENVVREWTYLF-SDTISKCLGSW 319
Query: 417 CNMKECC--EADGNEK 430
+ ADG+ K
Sbjct: 320 YTRRNAVPKSADGSSK 335
>gi|297791815|ref|XP_002863792.1| hypothetical protein ARALYDRAFT_494795 [Arabidopsis lyrata subsp.
lyrata]
gi|297309627|gb|EFH40051.1| hypothetical protein ARALYDRAFT_494795 [Arabidopsis lyrata subsp.
lyrata]
Length = 1414
Score = 369 bits (946), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 180/260 (69%), Positives = 207/260 (79%), Gaps = 4/260 (1%)
Query: 670 ELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSL 729
E K L V G K A + + +K SK + CP+ DGCAR+SI+GW W WSL
Sbjct: 1130 ERSPIKDLSVDDGRPKSIALKPLEKLSSKPSKKKVLLSCPKCDGCARTSINGWHWRAWSL 1189
Query: 730 NASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKA 789
AS ER RVRG+ VH ++ G + N++Q LSARTNR K+RNLLAAA+GA++LK
Sbjct: 1190 KASAEERTRVRGSSSVHIQHFGSKSNSTQ----NVLSARTNRAKMRNLLAAADGADVLKI 1245
Query: 790 SQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
SQ+KARKKRLRFQ+SKIHDWGLVALEPIEAEDFVIEYVGELIR IS+IRE +YEKMGIG
Sbjct: 1246 SQLKARKKRLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSISEIRERQYEKMGIG 1305
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
SSYLFRLDDGYVVDATKRGGIARFINHSC PNCYTK+ISVEG+KKIFIYAKRHI AGEEI
Sbjct: 1306 SSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKIISVEGKKKIFIYAKRHIDAGEEI 1365
Query: 910 TYNYKFPLEEKKIPCYCGSK 929
+YNYKFPLE+ KIPC CG++
Sbjct: 1366 SYNYKFPLEDDKIPCNCGAQ 1385
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 167/343 (48%), Gaps = 41/343 (11%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
KS GSIE F S + VC+ L +HCMQ+MWNAVF D VA +S +WRK+KLW
Sbjct: 438 KSVGSIENFQTSCSAVCRTLHNHCMQIMWNAVFYDTVATHSLSWRKKKLWF--------- 488
Query: 146 SDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQK 205
D + H + D P F R ++ S + + N + +
Sbjct: 489 --RSSDNSTLNYCKGSHRNY---SDKPEAFKSFTCRVDSFSSKTAHSNELDLPTNGASVR 543
Query: 206 NLSCNDHLLLDD------VKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGI 259
+S L D + I + VE+EL+LS + T+Y +L++D + ++
Sbjct: 544 GVSSRKVTLPDTDETESLIASISEHVESELFLSLETHLTDYTSVLIKDGANTTTTTARDG 603
Query: 260 NMKEDVVDPSSHDLHTCQCGF-ADVNGGMRIDSNETSAEI-FSSE--DSKSLFQAGKP-- 313
M E +C+ + + +G + D N SA++ FS++ DS+ L Q G+
Sbjct: 604 KMHEG----------SCREQYNLEGSGEKQNDLNVVSAKLRFSNDFSDSQRLLQEGESSE 653
Query: 314 --LSKDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSW 371
S+D+++NI + + S D +D L+ EP PPG E S++ +PS KF+
Sbjct: 654 QITSEDIIANIFSTTLETSDIPVSDE-LDALDIHEPPPPGCESSIK--MPSLRCKFRPVR 710
Query: 372 SDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLA 414
S E ++ YVA A+CRQKLH V+ EW SLF+ L +FLA
Sbjct: 711 SKESIPEIKAYVATALCRQKLHTDVMREWISLFMKCHLNEFLA 753
>gi|9759476|dbj|BAB10481.1| unnamed protein product [Arabidopsis thaliana]
Length = 1421
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 200/344 (58%), Positives = 238/344 (69%), Gaps = 10/344 (2%)
Query: 589 TISQKVMKVTSAVQR----DKVPVPKPSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDSI 644
+ISQK +S ++R D+ PS LS S D D V++ EK
Sbjct: 1058 SISQKGPSQSSILKRKHQLDEKISNVPSRRRLSLSSTDSEDA--VIKEDYDVRNEEKLPC 1115
Query: 645 LDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRT 704
S K + K ++++ E K L V G K A + + +K SK +
Sbjct: 1116 HTSDKLQKGPNKLIRRRKPLAKHTTERSPIKDLSVDDGRPKPIALKPLEKLSSKPSKKKL 1175
Query: 705 SNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKG 764
P+SDGCAR+SI+GW WH WSL AS ERARVRG+ VH ++ G + + +Q
Sbjct: 1176 FLSIPKSDGCARTSINGWHWHAWSLKASAEERARVRGSSCVHMQHFGSKSSLTQ----NV 1231
Query: 765 LSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
LSARTNR KLRNLLAAA+GA++LK SQ+KARKK LRFQ+SKIHDWGLVALEPIEAEDFVI
Sbjct: 1232 LSARTNRAKLRNLLAAADGADVLKMSQLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVI 1291
Query: 825 EYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
EYVGELIR IS+IRE +YEKMGIGSSYLFRLDDGYV+DATKRGGIARFINHSC PNCYT
Sbjct: 1292 EYVGELIRSSISEIRERQYEKMGIGSSYLFRLDDGYVLDATKRGGIARFINHSCEPNCYT 1351
Query: 885 KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGS 928
K+ISVEG+KKIFIYAKRHI AGEEI+YNYKFPLE+ KIPC CG+
Sbjct: 1352 KIISVEGKKKIFIYAKRHIDAGEEISYNYKFPLEDDKIPCNCGA 1395
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 111/339 (32%), Positives = 164/339 (48%), Gaps = 39/339 (11%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKIT--- 142
KS GSIE F S + VC+ L HCMQ+MWNAVF D VA +SS WRK K+W I+
Sbjct: 435 KSVGSIENFQTSCSAVCRTLHHHCMQIMWNAVFYDTVATHSSCWRKNKIWFRSSDISTVN 494
Query: 143 ---GPASDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGE 199
G + Y D + E R DSS S F +
Sbjct: 495 YCKGSHTKYSDKPESFESFTCRVDSSSSKTAYSDEFDL--------------------AT 534
Query: 200 NLSKQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGI 259
N ++ + LS + + + I + VENEL+LS K T+Y IL++D S+++
Sbjct: 535 NGARVRGLSSDTYGTESVIASISEHVENELFLSLKTHLTDYTSILIKDGANNTTSSARDG 594
Query: 260 NMKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKP----LS 315
M E S + + + NG + + + FS DS+ L Q G+ S
Sbjct: 595 KMHEG----SFREQYNLEGSSKKKNGLNVVPAKLRFSNDFS--DSQRLLQEGESSEQITS 648
Query: 316 KDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEF 375
+D+++NI + A + S V++ +D L EP PPG E ++ +P K+Q S E
Sbjct: 649 EDIIANIFSTALETS-DIPVNDELDALAIHEPPPPGCESNIN--MPCLRYKYQPVRSKES 705
Query: 376 TTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLA 414
++ YV++A+CRQKLH V+ +WKSLF+ L +FLA
Sbjct: 706 IPEIKAYVSMALCRQKLHNDVMRDWKSLFLKCYLNEFLA 744
>gi|30694058|ref|NP_199055.2| histone-lysine N-methyltransferase SETD1 [Arabidopsis thaliana]
gi|332007422|gb|AED94805.1| histone-lysine N-methyltransferase SETD1 [Arabidopsis thaliana]
Length = 1423
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 193/318 (60%), Positives = 226/318 (71%), Gaps = 6/318 (1%)
Query: 611 PSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLE 670
PS LS S D D V++ EK S K + K ++++ E
Sbjct: 1086 PSRRRLSLSSTDSEDA--VIKEDYDVRNEEKLPCHTSDKLQKGPNKLIRRRKPLAKHTTE 1143
Query: 671 LHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLN 730
K L V G K A + + +K SK + P+SDGCAR+SI+GW WH WSL
Sbjct: 1144 RSPIKDLSVDDGRPKPIALKPLEKLSSKPSKKKLFLSIPKSDGCARTSINGWHWHAWSLK 1203
Query: 731 ASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKAS 790
AS ERARVRG+ VH ++ G + + +Q LSARTNR KLRNLLAAA+GA++LK S
Sbjct: 1204 ASAEERARVRGSSCVHMQHFGSKSSLTQ----NVLSARTNRAKLRNLLAAADGADVLKMS 1259
Query: 791 QVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS 850
Q+KARKK LRFQ+SKIHDWGLVALEPIEAEDFVIEYVGELIR IS+IRE +YEKMGIGS
Sbjct: 1260 QLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSISEIRERQYEKMGIGS 1319
Query: 851 SYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEIT 910
SYLFRLDDGYV+DATKRGGIARFINHSC PNCYTK+ISVEG+KKIFIYAKRHI AGEEI+
Sbjct: 1320 SYLFRLDDGYVLDATKRGGIARFINHSCEPNCYTKIISVEGKKKIFIYAKRHIDAGEEIS 1379
Query: 911 YNYKFPLEEKKIPCYCGS 928
YNYKFPLE+ KIPC CG+
Sbjct: 1380 YNYKFPLEDDKIPCNCGA 1397
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 111/339 (32%), Positives = 164/339 (48%), Gaps = 39/339 (11%)
Query: 86 KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKIT--- 142
KS GSIE F S + VC+ L HCMQ+MWNAVF D VA +SS WRK K+W I+
Sbjct: 435 KSVGSIENFQTSCSAVCRTLHHHCMQIMWNAVFYDTVATHSSCWRKNKIWFRSSDISTVN 494
Query: 143 ---GPASDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGE 199
G + Y D + E R DSS S F +
Sbjct: 495 YCKGSHTKYSDKPESFESFTCRVDSSSSKTAYSDEFDL--------------------AT 534
Query: 200 NLSKQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGI 259
N ++ + LS + + + I + VENEL+LS K T+Y IL++D S+++
Sbjct: 535 NGARVRGLSSDTYGTESVIASISEHVENELFLSLKTHLTDYTSILIKDGANNTTSSARDG 594
Query: 260 NMKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKP----LS 315
M E S + + + NG + + + FS DS+ L Q G+ S
Sbjct: 595 KMHEG----SFREQYNLEGSSKKKNGLNVVPAKLRFSNDFS--DSQRLLQEGESSEQITS 648
Query: 316 KDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEF 375
+D+++NI + A + S V++ +D L EP PPG E ++ +P K+Q S E
Sbjct: 649 EDIIANIFSTALETS-DIPVNDELDALAIHEPPPPGCESNIN--MPCLRYKYQPVRSKES 705
Query: 376 TTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLA 414
++ YV++A+CRQKLH V+ +WKSLF+ L +FLA
Sbjct: 706 IPEIKAYVSMALCRQKLHNDVMRDWKSLFLKCYLNEFLA 744
>gi|242084688|ref|XP_002442769.1| hypothetical protein SORBIDRAFT_08g002530 [Sorghum bicolor]
gi|241943462|gb|EES16607.1| hypothetical protein SORBIDRAFT_08g002530 [Sorghum bicolor]
Length = 1033
Score = 365 bits (937), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 190/311 (61%), Positives = 226/311 (72%), Gaps = 15/311 (4%)
Query: 627 GKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTAKQ 686
K V AH++ I++D L SS+ P + S +K + A + V A
Sbjct: 731 AKPVANDAHDLNIQQDLTLLSSEV-PKRARSSHPTKKHMV------ANRTPTVNDNVANN 783
Query: 687 AA-SRQVAMKKTKASKSRTSN------LCPRSDGCARSSISGWEWHKWSLNASPAERARV 739
+ ++ V KK + S TS CP SDGCAR+SI+GWEW W+ NA+P+ERARV
Sbjct: 784 SMLTKHVKKKKGRDISSETSQKVKPMISCPESDGCARASINGWEWRNWARNATPSERARV 843
Query: 740 RGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRL 799
RG + V T N + + K SARTNRVKLRNLLAAAEGAELLK +Q+KARKKRL
Sbjct: 844 RGYR-VRTILSASNNNVWKNSQAKVSSARTNRVKLRNLLAAAEGAELLKITQMKARKKRL 902
Query: 800 RFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDG 859
RFQRSKIH+WGLVALE IEAEDFVIEYVG+LI ++SDIRE +YEK GIGSSYLFRLDD
Sbjct: 903 RFQRSKIHEWGLVALELIEAEDFVIEYVGQLIHRRVSDIRESQYEKSGIGSSYLFRLDDD 962
Query: 860 YVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE 919
+VVDATKRGG+ARFINHSC PNCYTKVI+V+GQKKIFIYAKR I AGEEITYNYKFPLEE
Sbjct: 963 FVVDATKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYAKRRIYAGEEITYNYKFPLEE 1022
Query: 920 KKIPCYCGSKK 930
+KIPC+CGS++
Sbjct: 1023 EKIPCHCGSRR 1033
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 65/148 (43%), Gaps = 9/148 (6%)
Query: 289 IDSNETS-AEIFSSEDSKSLFQAGKPLSKDLLSNILACAFKRSFSGFVDNVVDELETDE- 346
+++ TS E S+E S + A + + L+ ++ R F ++ E DE
Sbjct: 514 VEATRTSLTEPTSAETSTAAEMATDKMPTSHVEEHLSMSYARIFEKM--DICKTAELDEK 571
Query: 347 --PSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLF 404
PPG E + + +Q S S + Y+ +A+CRQKLH VV EW SLF
Sbjct: 572 FDEVPPGMETGLVPVPLMDKNIYQPSKSMNSIPLISRYITLALCRQKLHENVVREWTSLF 631
Query: 405 VDDALQQFLALWCNMKECC--EADGNEK 430
D + + L W + ADG+ K
Sbjct: 632 -SDTISKCLGSWYTRRNAVPKSADGSSK 658
>gi|295913201|gb|ADG57859.1| transcription factor [Lycoris longituba]
Length = 164
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 139/161 (86%), Positives = 152/161 (94%)
Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
NLLAAAEGA++LK +Q+ ARKKRLRFQRS IHDWGLVALEPIE+EDFVIEYVGEL+R +I
Sbjct: 4 NLLAAAEGADILKFNQLTARKKRLRFQRSNIHDWGLVALEPIESEDFVIEYVGELVRRQI 63
Query: 836 SDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKI 895
SDIREC+YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC PNCYTKVI+VEGQKKI
Sbjct: 64 SDIRECQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVEGQKKI 123
Query: 896 FIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
FIYAKRHI AGEE+TYNYKFPLEE+KI C CGSK+C GS+N
Sbjct: 124 FIYAKRHIHAGEELTYNYKFPLEEQKILCNCGSKRCRGSMN 164
>gi|449525052|ref|XP_004169534.1| PREDICTED: uncharacterized LOC101220062, partial [Cucumis sativus]
Length = 509
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 202/540 (37%), Positives = 286/540 (52%), Gaps = 70/540 (12%)
Query: 280 FADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLSKDLLSNILACAFKRSFS------- 332
+++ G ++ DSN+T + S E +L LS D N +A + F
Sbjct: 15 YSNSFGELQCDSNDTHGDRNSGELKLALLPEVN-LSNDTALNSVANSLYEVFKEICTNEG 73
Query: 333 -GFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQK 391
F ++ + +E PG E+ +PS KF+ S S++ +K+ Y+ +A+CRQK
Sbjct: 74 CAFNEDCAFNEDCNELLAPGLEEHPTFQIPSPACKFRPSSSNKCYSKIEGYIMLAICRQK 133
Query: 392 LHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHNEHHGDTSTVVDK 451
LH V+ EW S + DD L+QF++ W K+ C + N EGA + G+ S V DK
Sbjct: 134 LHDAVLKEWTSSYKDDLLRQFVSSWIASKKHC--NSNRIVEGACDG-----GEASKVPDK 186
Query: 452 LKEGSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAFQTEHVEKSRKQGV 511
L+EGS+RF E+S + YTY+RKK ++K GS S+C+ + + EKSRK+ +
Sbjct: 187 LREGSERFL--ESSLVTGNYTYYRKKSS-KRKLGS-SDCATEGSPVVRNQPSEKSRKENI 242
Query: 512 AGDVFENAKVQPSAVSSKKIGKNKL-----IDASSKKIGANKFTSVPSKMIGKNKVTAES 566
+ V E + ++++ K I KNK I A+ K+ A +VT
Sbjct: 243 SVGVCETTDSEIASLTLKSIAKNKRKKDLSIKATCKRTCA--------------EVT--- 285
Query: 567 SASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRD---KVPVPKPSGEMLSTLSADG 623
LPS +SS K+ K +K + V+ D K V G M+ +
Sbjct: 286 ------------LPSSHSSGKTICGTKKLKFSPPVKDDNAKKDSVKHGKGRMIGSPLMIK 333
Query: 624 NDVGKVVRGKAHNVGI-EKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKG 682
N V +V+ VG EK S+ SK KRK+ +D L K L VA
Sbjct: 334 N-VDQVMNKCDRGVGAQEKLSV-----------NVSKIKRKQKVDEASLLGNKVLTVADD 381
Query: 683 TAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGA 742
+KQAAS++V +K K+ KSR N+ SDGCARSSI+GWEW +W+L ASPAERAR RG
Sbjct: 382 FSKQAASKRVVAQKKKSDKSRKLNISIISDGCARSSINGWEWRRWTLKASPAERARNRGF 441
Query: 743 QYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ 802
QY ++ +GP+V+ S NGKGLSARTNRVKLRNLLAAA+GA+LLKASQ+KARK F
Sbjct: 442 QYFYSDPIGPDVSTSHLLNGKGLSARTNRVKLRNLLAAADGADLLKASQLKARKNVYAFN 501
>gi|168031561|ref|XP_001768289.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
gi|162680467|gb|EDQ66903.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
Length = 1655
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 131/251 (52%), Positives = 173/251 (68%), Gaps = 31/251 (12%)
Query: 709 PRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK----- 763
P S GCAR SI+GW+W W+ + A+R R + K + ++ + A K
Sbjct: 1388 PTSVGCARCSITGWDWRTWARDR--AKRRLQRRVKAAIRKEVKQDLKKLKRATKKNVTNV 1445
Query: 764 ----------GL-SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLV 812
GL +AR NR +R L AEG++LL+ + +KARKK+L+FQRSKIHDWGLV
Sbjct: 1446 NNSTTTTVIAGLQAARKNRADMRKLAVGAEGSDLLRFNMLKARKKQLKFQRSKIHDWGLV 1505
Query: 813 ALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYV----------- 861
A EPI+AE+FVIEYVGE+IR +++DIRE RYE +GIGSSY+FR+DD +
Sbjct: 1506 AAEPIDAEEFVIEYVGEVIRNRVTDIREKRYEAIGIGSSYMFRVDDEHTLNTFIVLDHVP 1565
Query: 862 --VDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE 919
+DAT+RGG+ARFINHSC+PNCYTK+I+VEGQKK+ IY+K+ I GEE+TY+YKF LEE
Sbjct: 1566 SQLDATRRGGLARFINHSCDPNCYTKIITVEGQKKVVIYSKQRIVPGEELTYDYKFSLEE 1625
Query: 920 KKIPCYCGSKK 930
KIPC+CG+ K
Sbjct: 1626 VKIPCFCGAAK 1636
>gi|168037139|ref|XP_001771062.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
gi|162677595|gb|EDQ64063.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
Length = 2607
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 132/277 (47%), Positives = 172/277 (62%), Gaps = 61/277 (22%)
Query: 710 RSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLG---------PEVNASQWA 760
+S GCAR SISGW+W +W+ + + R R + + T + P +++ +
Sbjct: 2335 KSFGCARCSISGWKWRRWAHHGA-KHRLRQKVQDSISTSSISRGMIDSVVRPNPSSATYL 2393
Query: 761 NGKGLSARTNRVKLRNLLAAAEGAELLKASQV---------------------------- 792
SARTNR LR L++ EG++ LK SQ+
Sbjct: 2394 Q----SARTNRAALRKLISVVEGSDKLKLSQLTVIPAIPLPSYQTKLSIKTSLRIPGINR 2449
Query: 793 -------------------KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRL 833
+ KK L+FQRSKIHDWGL+ALE IEAEDFVIEYVGE+IR
Sbjct: 2450 SRRSVINAGPSQSSIAVLCSSMKKHLKFQRSKIHDWGLLALESIEAEDFVIEYVGEIIRR 2509
Query: 834 KISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK 893
++S+ RE +YE MGIGSSYLFR+DD VVDAT++GG+ARFINHSCNPNCYTK+I+VEG+K
Sbjct: 2510 QVSNFRERQYEIMGIGSSYLFRVDDELVVDATQKGGLARFINHSCNPNCYTKIITVEGRK 2569
Query: 894 KIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
K+ IY+KR I AGEE+TY+YKF LE+KKIPCYCG+ +
Sbjct: 2570 KVVIYSKRAIGAGEELTYDYKFSLEDKKIPCYCGAPR 2606
>gi|302757968|ref|XP_002962407.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
gi|300169268|gb|EFJ35870.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
Length = 1326
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 115/228 (50%), Positives = 160/228 (70%), Gaps = 19/228 (8%)
Query: 709 PRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSAR 768
P+ GCARSS+ G EW W + Y + + + ++ G SAR
Sbjct: 1118 PQLPGCARSSLPGSEWKLWKTS-------------YESYCCIDRKQSRAKERAGTLPSAR 1164
Query: 769 TNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG 828
+NR +R L G+ ++ +++ARKK L+FQRSKIH WG+VA+E IE EDF++EYVG
Sbjct: 1165 SNRAGVRKL----GGSRAME--KMRARKKLLKFQRSKIHAWGVVAMEVIEPEDFIVEYVG 1218
Query: 829 ELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
E++R K++D+RE RY + G+GSSY FR+ DG+V+DAT+RGG+ RFINHSC PNCY K+I+
Sbjct: 1219 EVLRPKVADVREVRYLRQGLGSSYFFRVGDGFVIDATQRGGLGRFINHSCEPNCYPKIIT 1278
Query: 889 VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VEGQK++FIYA+ HIA G E+TY+YKFP E++KIPC CG+++C G LN
Sbjct: 1279 VEGQKRVFIYARTHIAPGTELTYDYKFPHEDQKIPCLCGAERCRGFLN 1326
>gi|347967460|ref|XP_307938.5| AGAP002246-PA [Anopheles gambiae str. PEST]
gi|333466285|gb|EAA03662.5| AGAP002246-PA [Anopheles gambiae str. PEST]
Length = 2003
Score = 236 bits (603), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 103/176 (58%), Positives = 140/176 (79%), Gaps = 2/176 (1%)
Query: 763 KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
+G+S AR+N+ +L N A+ +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1828 QGISREARSNQRRLLNAFGASTESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1887
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ VIEYVG+++R ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNP
Sbjct: 1888 EMVIEYVGQMVRPSVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNP 1947
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
NCY KVI++E +KKI IY+K+ I EEITY+YKFPLE++KIPC CG+ C G+LN
Sbjct: 1948 NCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAPGCRGTLN 2003
>gi|260791327|ref|XP_002590691.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
gi|229275887|gb|EEN46702.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
Length = 2482
Score = 234 bits (598), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 119/230 (51%), Positives = 155/230 (67%), Gaps = 12/230 (5%)
Query: 713 GCARSS----ISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSAR 768
GCARS IS E K+ RAR+ L N ++ A AR
Sbjct: 2259 GCARSEGFYKISFKEKTKY------LHRARLASVVEKPEDDLMESQNKAKQATQSSREAR 2312
Query: 769 TNRVKLRNLLAAAEG--AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
+N+ +L A+ G ++LL+ +Q+K RKK+L F++S+IHDWGL ALEPI AE+ VIEY
Sbjct: 2313 SNQRRLLASFGASCGDISDLLRFNQLKFRKKQLSFRKSRIHDWGLFALEPIAAEEMVIEY 2372
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VG+ IR I+D RE RYE+ GIGSSYLFR+D ++DATK G +ARFINH CNPNCY K+
Sbjct: 2373 VGQCIRQTIADERERRYEEQGIGSSYLFRVDHDMIIDATKNGNLARFINHCCNPNCYAKI 2432
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I+VEG KKI IY++R IA EEITY+YKFP+E++KIPC CG++ C G+LN
Sbjct: 2433 ITVEGYKKIVIYSRRDIAVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 2482
>gi|157127309|ref|XP_001654916.1| hypothetical protein AaeL_AAEL010807 [Aedes aegypti]
gi|108872954|gb|EAT37179.1| AAEL010807-PA [Aedes aegypti]
Length = 1670
Score = 233 bits (595), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 103/176 (58%), Positives = 140/176 (79%), Gaps = 2/176 (1%)
Query: 763 KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
+GLS AR+N+ +L A+ +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1495 QGLSREARSNQRRLLTAFGASTESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1554
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ VIEYVG+++R ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNP
Sbjct: 1555 EMVIEYVGQMVRPSVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNP 1614
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
NCY KVI++E +KKI IY+K+ I EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1615 NCYAKVITIESEKKIVIYSKQAIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1670
>gi|312384250|gb|EFR29018.1| hypothetical protein AND_02361 [Anopheles darlingi]
Length = 2074
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 102/176 (57%), Positives = 139/176 (78%), Gaps = 2/176 (1%)
Query: 763 KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
+G+S AR+N+ +L N A+ +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1899 QGISREARSNQRRLLNAFGASTESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1958
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ VIEYVG+++R ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNP
Sbjct: 1959 EMVIEYVGQMVRPSVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNP 2018
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
NCY K+I++E +KKI IY+K+ I EEITY+YKFPLEE+KI C CG+ C G+LN
Sbjct: 2019 NCYAKIITIESEKKIVIYSKQPIGVNEEITYDYKFPLEEEKIRCLCGAPGCRGTLN 2074
>gi|242010887|ref|XP_002426189.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
gi|212510240|gb|EEB13451.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
Length = 574
Score = 232 bits (591), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 102/177 (57%), Positives = 134/177 (75%)
Query: 760 ANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
N AR+N+ +L A ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A
Sbjct: 398 VNAASREARSNQRRLLTAFGTATDSDLLKFNQLKFRKKQLKFSKSDIHDWGLFAMEPIAA 457
Query: 820 EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
++ VIEYVG+++R ++D RE YEK GIGSSYLFR+D ++DATK G +ARFINHSCN
Sbjct: 458 DEMVIEYVGQMVRPFLADFREKEYEKRGIGSSYLFRIDLETIIDATKCGNLARFINHSCN 517
Query: 880 PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
PNCY K+I++EGQKKI IY+K+ I EEITY+YKFP+EE+KIPC CG+ +C G LN
Sbjct: 518 PNCYAKIITIEGQKKIVIYSKKDIKVDEEITYDYKFPIEEEKIPCLCGAAQCKGYLN 574
>gi|195388606|ref|XP_002052970.1| GJ23622 [Drosophila virilis]
gi|194151056|gb|EDW66490.1| GJ23622 [Drosophila virilis]
Length = 1687
Score = 231 bits (590), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 102/175 (58%), Positives = 136/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1513 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1572
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 1573 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1632
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1633 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1687
>gi|195453659|ref|XP_002073883.1| GK12911 [Drosophila willistoni]
gi|194169968|gb|EDW84869.1| GK12911 [Drosophila willistoni]
Length = 1765
Score = 231 bits (589), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 102/176 (57%), Positives = 139/176 (78%), Gaps = 2/176 (1%)
Query: 763 KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
+G+S AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1590 QGISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1649
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNP
Sbjct: 1650 EMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNP 1709
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
NCY KVI++E +KKI IY+K+ I EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1710 NCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1765
>gi|195062427|ref|XP_001996188.1| GH22347 [Drosophila grimshawi]
gi|193899683|gb|EDV98549.1| GH22347 [Drosophila grimshawi]
Length = 1714
Score = 231 bits (589), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 102/175 (58%), Positives = 136/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1540 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1599
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 1600 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1659
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1660 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1714
>gi|198454568|ref|XP_002137902.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
gi|198132853|gb|EDY68460.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
Length = 1755
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 101/175 (57%), Positives = 136/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1581 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1640
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 1641 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1700
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1701 CYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1755
>gi|195156904|ref|XP_002019336.1| GL12290 [Drosophila persimilis]
gi|194115927|gb|EDW37970.1| GL12290 [Drosophila persimilis]
Length = 1548
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 101/175 (57%), Positives = 136/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1374 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1433
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 1434 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1493
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1494 CYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1548
>gi|195496958|ref|XP_002095897.1| GE25383 [Drosophila yakuba]
gi|194181998|gb|EDW95609.1| GE25383 [Drosophila yakuba]
Length = 1628
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 102/175 (58%), Positives = 136/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1454 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1513
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 1514 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1573
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1574 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1628
>gi|194898301|ref|XP_001978769.1| GG11901 [Drosophila erecta]
gi|190650472|gb|EDV47727.1| GG11901 [Drosophila erecta]
Length = 1626
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 102/175 (58%), Positives = 136/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1452 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1511
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 1512 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1571
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1572 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1626
>gi|195356446|ref|XP_002044683.1| GM18767 [Drosophila sechellia]
gi|194133849|gb|EDW55365.1| GM18767 [Drosophila sechellia]
Length = 1637
Score = 230 bits (587), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 102/175 (58%), Positives = 136/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1463 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1522
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 1523 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1582
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1583 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1637
>gi|332016392|gb|EGI57305.1| Histone-lysine N-methyltransferase SETD1B [Acromyrmex echinatior]
Length = 1513
Score = 230 bits (586), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 107/190 (56%), Positives = 142/190 (74%), Gaps = 5/190 (2%)
Query: 751 GPEVNASQWANGK--GLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
GP+ N S+ GK LS AR+N+ +L ++LLK +Q+K RKK+L+F +S I
Sbjct: 1325 GPK-NNSKALTGKMQALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGI 1383
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
HDWGL A+EPI A++ VIEYVG+++R ++D+RE +YE GIGSSYLFR+D ++DATK
Sbjct: 1384 HDWGLFAMEPIAADEMVIEYVGQMVRPVVADLREAQYEATGIGSSYLFRIDLDTIIDATK 1443
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
G +ARFINHSCNPNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC C
Sbjct: 1444 CGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLC 1503
Query: 927 GSKKCHGSLN 936
G+ +C G+LN
Sbjct: 1504 GAPQCRGTLN 1513
>gi|307182871|gb|EFN69932.1| Histone-lysine N-methyltransferase SETD1B [Camponotus floridanus]
Length = 1422
Score = 230 bits (586), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 107/190 (56%), Positives = 142/190 (74%), Gaps = 5/190 (2%)
Query: 751 GPEVNASQWANGK--GLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
GP+ N S+ GK LS AR+N+ +L ++LLK +Q+K RKK+L+F +S I
Sbjct: 1234 GPK-NNSKALTGKMQALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGI 1292
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
HDWGL A+EPI A++ VIEYVG+++R ++D+RE +YE GIGSSYLFR+D ++DATK
Sbjct: 1293 HDWGLFAMEPIAADEMVIEYVGQMVRPVVADLREAQYEATGIGSSYLFRIDLDTIIDATK 1352
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
G +ARFINHSCNPNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC C
Sbjct: 1353 CGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLC 1412
Query: 927 GSKKCHGSLN 936
G+ +C G+LN
Sbjct: 1413 GAPQCRGTLN 1422
>gi|340721798|ref|XP_003399301.1| PREDICTED: hypothetical protein LOC100643294 [Bombus terrestris]
Length = 1502
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 100/170 (58%), Positives = 132/170 (77%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
AR+N+ +L ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1333 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1392
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VG++IR ++D+RE +YE GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1393 VGQMIRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1452
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1453 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1502
>gi|350408006|ref|XP_003488269.1| PREDICTED: hypothetical protein LOC100743429 [Bombus impatiens]
Length = 1503
Score = 229 bits (584), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 100/170 (58%), Positives = 132/170 (77%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
AR+N+ +L ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1334 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1393
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VG++IR ++D+RE +YE GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1394 VGQMIRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1453
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1454 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1503
>gi|307195485|gb|EFN77371.1| Histone-lysine N-methyltransferase SETD1B [Harpegnathos saltator]
Length = 1490
Score = 229 bits (583), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 107/190 (56%), Positives = 142/190 (74%), Gaps = 5/190 (2%)
Query: 751 GPEVNASQWANGK--GLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
GP+ N S+ GK LS AR+N+ +L ++LLK +Q+K RKK+L+F +S I
Sbjct: 1302 GPKSN-SKALTGKMQALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGI 1360
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
HDWGL A+EPI A++ VIEYVG+++R ++D+RE +YE GIGSSYLFR+D ++DATK
Sbjct: 1361 HDWGLFAMEPIAADEMVIEYVGQMVRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATK 1420
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
G +ARFINHSCNPNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC C
Sbjct: 1421 CGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLC 1480
Query: 927 GSKKCHGSLN 936
G+ +C G+LN
Sbjct: 1481 GAPQCRGTLN 1490
>gi|383849246|ref|XP_003700256.1| PREDICTED: uncharacterized protein LOC100875701 [Megachile rotundata]
Length = 1503
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 99/170 (58%), Positives = 132/170 (77%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
AR+N+ +L ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1334 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1393
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VG+++R ++D+RE +YE GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1394 VGQMVRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1453
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1454 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1503
>gi|328778989|ref|XP_395451.4| PREDICTED: hypothetical protein LOC411985 isoform 1 [Apis mellifera]
Length = 1503
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 99/170 (58%), Positives = 132/170 (77%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
AR+N+ +L ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1334 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1393
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VG+++R ++D+RE +YE GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1394 VGQMVRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1453
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1454 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1503
>gi|384484496|gb|EIE76676.1| hypothetical protein RO3G_01380 [Rhizopus delemar RA 99-880]
Length = 565
Score = 229 bits (583), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 105/178 (58%), Positives = 136/178 (76%), Gaps = 7/178 (3%)
Query: 766 SARTNRVKLRNLLA-------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIE 818
++RT RV R L++ + ++LLK +Q+K RKKRL F +S IHDWGL A E I+
Sbjct: 388 NSRTARVNNRRLVSGIMVQNKSMAESDLLKFNQLKNRKKRLIFDKSPIHDWGLYAGESID 447
Query: 819 AEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
A D VIEY+GE+IR ++++IRE YE++GIGSSYLFR+DD V+DATK+GG+ARFINH C
Sbjct: 448 AHDIVIEYIGEVIRQQVAEIREKHYERIGIGSSYLFRVDDDMVIDATKKGGMARFINHCC 507
Query: 879 NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
PNC K+I+V+ QKK+ IYA R I GEEITY+YKFP+E +KIPC+CGSK C GSLN
Sbjct: 508 TPNCSAKIITVDKQKKVVIYANRDIEPGEEITYDYKFPIEAEKIPCFCGSKFCKGSLN 565
>gi|380013032|ref|XP_003690574.1| PREDICTED: uncharacterized protein LOC100867676 [Apis florea]
Length = 1500
Score = 229 bits (583), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 99/170 (58%), Positives = 132/170 (77%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
AR+N+ +L ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1331 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1390
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VG+++R ++D+RE +YE GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1391 VGQMVRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1450
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1451 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1500
>gi|62862148|ref|NP_001015221.1| Set1, isoform A [Drosophila melanogaster]
gi|62862150|ref|NP_001015222.1| Set1, isoform B [Drosophila melanogaster]
gi|161076059|ref|NP_001104406.1| Set1, isoform C [Drosophila melanogaster]
gi|281366745|ref|NP_001163846.1| Set1, isoform D [Drosophila melanogaster]
gi|281366747|ref|NP_001163847.1| Set1, isoform E [Drosophila melanogaster]
gi|281366749|ref|NP_001163848.1| Set1, isoform F [Drosophila melanogaster]
gi|281366751|ref|NP_001163849.1| Set1, isoform G [Drosophila melanogaster]
gi|281366753|ref|NP_001163850.1| Set1, isoform H [Drosophila melanogaster]
gi|281366755|ref|NP_001163851.1| Set1, isoform I [Drosophila melanogaster]
gi|51951109|gb|EAL24598.1| Set1, isoform A [Drosophila melanogaster]
gi|51951110|gb|EAL24599.1| Set1, isoform B [Drosophila melanogaster]
gi|158529717|gb|EDP28071.1| Set1, isoform C [Drosophila melanogaster]
gi|281309231|gb|EFA98694.1| Set1, isoform D [Drosophila melanogaster]
gi|281309232|gb|EFA98695.1| Set1, isoform E [Drosophila melanogaster]
gi|281309233|gb|EFA98696.1| Set1, isoform F [Drosophila melanogaster]
gi|281309234|gb|EFA98697.1| Set1, isoform G [Drosophila melanogaster]
gi|281309235|gb|EFA98698.1| Set1, isoform H [Drosophila melanogaster]
gi|281309236|gb|EFA98699.1| Set1, isoform I [Drosophila melanogaster]
Length = 1641
Score = 229 bits (583), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 101/175 (57%), Positives = 136/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1467 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1526
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 1527 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1586
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1587 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1641
>gi|196014878|ref|XP_002117297.1| hypothetical protein TRIADDRAFT_32357 [Trichoplax adhaerens]
gi|190580050|gb|EDV20136.1| hypothetical protein TRIADDRAFT_32357 [Trichoplax adhaerens]
Length = 217
Score = 228 bits (581), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 104/173 (60%), Positives = 137/173 (79%), Gaps = 3/173 (1%)
Query: 767 ARTNRVKLRNLLA---AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
+R NR R +++ A++ +LLK +Q+K RKK+LRF RS+IH+WGL A EPI A++ V
Sbjct: 45 SRENRASNRRIISFFQASDAGDLLKLNQLKTRKKQLRFGRSQIHEWGLFAREPIAADEMV 104
Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
IEYVG+ IR ++D RE RYEK+GIGSSYLFR+DD Y++DATK G +ARFINHSC+PNCY
Sbjct: 105 IEYVGQTIRQTVADEREHRYEKIGIGSSYLFRIDDNYIIDATKCGNLARFINHSCSPNCY 164
Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
K+IS+E QKKI IY+K I EEITY+YKFP+E+ KIPC+CG+ +C G+LN
Sbjct: 165 AKIISLESQKKIVIYSKYDIQVNEEITYDYKFPIEDVKIPCHCGALQCRGALN 217
>gi|432875260|ref|XP_004072753.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Oryzias
latipes]
Length = 1547
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 103/167 (61%), Positives = 131/167 (78%), Gaps = 1/167 (0%)
Query: 771 RVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
R + R LL++ ++L++ +Q+K RKKR+RF RS IH+WGL A+EPI A++ VIEYVGE
Sbjct: 1381 RSEQRRLLSSFSCDSDLVRFNQLKYRKKRIRFSRSHIHEWGLFAMEPIAADEMVIEYVGE 1440
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
IR I+D RE RYE+ GIG SYLFR+D ++DATK G +ARFINHSCNPNCY KVISV
Sbjct: 1441 TIRQVIADRREQRYEEEGIGGSYLFRIDQDTIIDATKCGNLARFINHSCNPNCYAKVISV 1500
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
E QKKI IY+++ I+ EEITYNYKFP+E+ KIPC CG++ C GSLN
Sbjct: 1501 ESQKKIVIYSRQPISINEEITYNYKFPIEDTKIPCLCGAENCRGSLN 1547
>gi|118404602|ref|NP_001072649.1| histone-lysine N-methyltransferase SETD1B [Xenopus (Silurana)
tropicalis]
gi|123884540|sp|Q08D57.1|SET1B_XENTR RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|115312893|gb|AAI23933.1| hypothetical protein MGC145850 [Xenopus (Silurana) tropicalis]
Length = 1956
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 108/189 (57%), Positives = 140/189 (74%), Gaps = 5/189 (2%)
Query: 751 GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+LRF +S IH
Sbjct: 1770 GKSIPAQPQASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLRFCKSHIH 1827
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
DWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK
Sbjct: 1828 DWGLFAMEPIIADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 1887
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G ARFINHSCNPNCY KVI+VE QKKI IY+K++I EEITY+YKFP+E+ KIPC CG
Sbjct: 1888 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEEITYDYKFPIEDVKIPCLCG 1947
Query: 928 SKKCHGSLN 936
++ C G+LN
Sbjct: 1948 AENCRGTLN 1956
>gi|189238620|ref|XP_969339.2| PREDICTED: similar to CG40351 CG40351-PC [Tribolium castaneum]
gi|270009170|gb|EFA05618.1| hypothetical protein TcasGA2_TC015826 [Tribolium castaneum]
Length = 1268
Score = 226 bits (576), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 105/177 (59%), Positives = 136/177 (76%), Gaps = 4/177 (2%)
Query: 763 KGLSARTNRVKLRNLLAAAEG---AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
+GLS R R R LLAA G ++LLK +Q+K RKK L+F +S IHDWGL A+EPI A
Sbjct: 1093 QGLS-REARSNQRRLLAAFGGDTDSDLLKFNQLKFRKKHLKFAKSAIHDWGLFAMEPIAA 1151
Query: 820 EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
++ VIEYVG+++R ++D+RE +YE GIGSSYLFR+D ++DATK G +ARFINHSCN
Sbjct: 1152 DEMVIEYVGQMVRHSVADLRERKYEATGIGSSYLFRIDLENIIDATKCGNLARFINHSCN 1211
Query: 880 PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
PNCY KVI++E QKKI IY+K+ I EEITY+YKFP+E++KIPC CG+ C G+LN
Sbjct: 1212 PNCYAKVITIESQKKIVIYSKQSIGVNEEITYDYKFPIEDEKIPCLCGAATCRGTLN 1268
>gi|391325531|ref|XP_003737286.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Metaseiulus occidentalis]
Length = 976
Score = 226 bits (576), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 107/186 (57%), Positives = 138/186 (74%), Gaps = 5/186 (2%)
Query: 752 PEVNASQWANGKGLSARTNRVKLRNLLAAAEG-AELLKASQVKARKKRLRFQRSKIHDWG 810
P V ASQ + AR+N+ +L + E ++LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 795 PVVAASQMSR----EARSNQRRLLTIFQETEVVSDLLKFNQLKFRKKQLKFAKSTIHDWG 850
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L ALEPI A++ VIEYVG+ +R ++D RE +Y K+GIGSSYLFR+D ++DATK G +
Sbjct: 851 LFALEPIAADEMVIEYVGQSVRCSVADSREVQYTKVGIGSSYLFRVDHETIIDATKCGNL 910
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY +VI+VEGQKKI IY+KR I+ EEITY+YKFP EE KI C CG+ +
Sbjct: 911 ARFINHSCNPNCYARVITVEGQKKIVIYSKRDISVNEEITYDYKFPREEVKITCLCGTPQ 970
Query: 931 CHGSLN 936
C G LN
Sbjct: 971 CRGYLN 976
>gi|328711160|ref|XP_001945277.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
[Acyrthosiphon pisum]
Length = 1322
Score = 226 bits (576), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 114/254 (44%), Positives = 159/254 (62%), Gaps = 16/254 (6%)
Query: 687 AASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVH 746
A S Q+ KK K C R++G + HK+ + A+ R+
Sbjct: 1081 AVSNQIPRKKKKDEWKVHDTGCGRTEGYYKVDSKQKANHKYHFGHTIAQLNRIT------ 1134
Query: 747 TKYLGPEVNASQWANGKGLS----ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ 802
E+ + + GK L+ AR+N+ +L A ++LLK + +K RKK+L+F
Sbjct: 1135 ------ELKRVKESTGKMLALSREARSNQRRLLTAFGATSDSDLLKFNVLKFRKKQLKFG 1188
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
+S IHDWGL A+E I A++ VIEYVG+++R ++D+RE +YE GIGSSYLFR+D ++
Sbjct: 1189 KSAIHDWGLFAMESIAADEMVIEYVGQMVRPVVADLRERQYEATGIGSSYLFRIDLDTII 1248
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DATK G +ARFINHSCNPNCY K+I ++GQKKI IY+K+ I EEITY+YKFPLE+ KI
Sbjct: 1249 DATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYKFPLEDNKI 1308
Query: 923 PCYCGSKKCHGSLN 936
PC CG+ C G+LN
Sbjct: 1309 PCLCGTHCCRGTLN 1322
>gi|147899914|ref|NP_001087630.1| histone-lysine N-methyltransferase SETD1B [Xenopus laevis]
gi|82234463|sp|Q66J90.1|SET1B_XENLA RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|51703454|gb|AAH81016.1| MGC81602 protein [Xenopus laevis]
Length = 1938
Score = 225 bits (574), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 106/189 (56%), Positives = 140/189 (74%), Gaps = 5/189 (2%)
Query: 751 GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK++RF +S IH
Sbjct: 1752 GKSIPAQPQASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKIRFCKSHIH 1809
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
DWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK
Sbjct: 1810 DWGLFAMEPIVADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 1869
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G ARFINHSCNPNCY KV++VE QKKI IY+K++I EEITY+YKFP+E+ KIPC CG
Sbjct: 1870 GNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEEITYDYKFPIEDVKIPCLCG 1929
Query: 928 SKKCHGSLN 936
++ C G+LN
Sbjct: 1930 AENCRGTLN 1938
>gi|430813239|emb|CCJ29409.1| unnamed protein product [Pneumocystis jirovecii]
Length = 375
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 103/180 (57%), Positives = 134/180 (74%), Gaps = 8/180 (4%)
Query: 765 LSARTNRVKLRNLLAAAE--------GAELLKASQVKARKKRLRFQRSKIHDWGLVALEP 816
+S+R NR+ R L E ++L+ + +KARKK+L+F +S IH+WGL A+E
Sbjct: 196 ISSRMNRINNRRLAVGIEMQKKISSSETDILRFNALKARKKQLKFSKSPIHNWGLYAMEH 255
Query: 817 IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
I+ D VIEYVGE++R ++DIRE +YE+ GIGSSYLFR+DD VVDATK+G IARFINH
Sbjct: 256 IDMGDMVIEYVGEIVRQTVADIRERQYERQGIGSSYLFRIDDDTVVDATKKGNIARFINH 315
Query: 877 SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
SC+P+C K+I VEG+KKI IYA R I GEEITY+YKFP+E+ KIPC CG+K C G+LN
Sbjct: 316 SCDPSCTAKIIRVEGEKKIVIYAHRDIEKGEEITYDYKFPIEDVKIPCLCGAKACRGTLN 375
>gi|326670188|ref|XP_699940.5| PREDICTED: histone-lysine N-methyltransferase SETD1B [Danio rerio]
Length = 753
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 115/229 (50%), Positives = 149/229 (65%), Gaps = 21/229 (9%)
Query: 713 GCARSS----ISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSAR 768
GCARS IS E ++ N +Q H+ G E+ A Q
Sbjct: 541 GCARSEGYYFISKREKLQYLCNERTVSEEFTADSQVSHSSRSGSELRAEQ---------- 590
Query: 769 TNRVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R LL++ ++LLK +Q+K RKKRLRF +S+IHDWGL A EPI A++ +IEYV
Sbjct: 591 ------RRLLSSFSCDSDLLKFNQLKFRKKRLRFGKSRIHDWGLFAEEPIAADEMIIEYV 644
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G+ IR I+D+RE RYE GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY KVI
Sbjct: 645 GQSIRQVIADMRERRYETEGIGSSYLFRVDHDTIIDATKCGNLARFINHSCNPNCYAKVI 704
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+VE QKKI IY+++ I EEITY+YKFP+E++KIPC C ++ C G+LN
Sbjct: 705 TVEAQKKIVIYSRQPITVNEEITYDYKFPIEDEKIPCLCAAENCRGTLN 753
>gi|123703948|ref|NP_001038599.2| histone-lysine N-methyltransferase SETD1B-A [Danio rerio]
gi|166977691|sp|Q1LY77.2|SE1BA_DANRE RecName: Full=Histone-lysine N-methyltransferase SETD1B-A; AltName:
Full=SET domain-containing protein 1B-A
gi|123293815|emb|CAK10781.2| novel protein [Danio rerio]
Length = 1844
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 101/167 (60%), Positives = 131/167 (78%), Gaps = 1/167 (0%)
Query: 771 RVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
R + R LL++ ++LLK +Q+K RKK++RF RS IHDWGL A+EPI A++ VIEYVG+
Sbjct: 1678 RSEQRRLLSSFSCDSDLLKFNQLKFRKKKIRFCRSHIHDWGLFAMEPIAADEMVIEYVGQ 1737
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
IR I+D+RE RYE GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI+V
Sbjct: 1738 NIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITV 1797
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
E QKKI IY+++ I EEITY+YKFP+E++KIPC CG++ C G+LN
Sbjct: 1798 ESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1844
>gi|348516272|ref|XP_003445663.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Oreochromis niloticus]
Length = 595
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 108/204 (52%), Positives = 146/204 (71%), Gaps = 11/204 (5%)
Query: 743 QYVHTKYLGPEVNASQWANGKGLSARTN---------RVKLRNLLAAAE-GAELLKASQV 792
+Y+H L E+ ++ A G + A+ R + R LL++ ++L+K +Q+
Sbjct: 393 RYLHNTRLAAELPSTS-AQGTCIPAQQPTSLRAGSDFRCEQRRLLSSFSCDSDLVKFNQL 451
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+KR+RF RS IH+WGL A+EPI A++ VIEYVG++IR I+D+RE RYE+ GIGSSY
Sbjct: 452 KFRRKRIRFSRSYIHEWGLFAMEPIAADEMVIEYVGQIIRQVIADMREQRYEEEGIGSSY 511
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
LFR+D ++DATK G +ARFINHSCNPNCY K+I+VE QKKI IY+++ I EEITY+
Sbjct: 512 LFRVDQDTIIDATKCGNLARFINHSCNPNCYAKIITVESQKKIVIYSRQPININEEITYD 571
Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
YKFP+EE KIPC CG+ C GSLN
Sbjct: 572 YKFPIEETKIPCLCGADGCRGSLN 595
>gi|47217812|emb|CAG07226.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1884
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 102/171 (59%), Positives = 129/171 (75%), Gaps = 5/171 (2%)
Query: 771 RVKLRNLLA-----AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
R + R LL A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A++ VIE
Sbjct: 1714 RSEQRRLLTVIGTTAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIE 1773
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
YVG+ IR ++D RE RY + GIGSSYLFR+D ++DATK G +ARFINH C PNCY K
Sbjct: 1774 YVGQNIRQMVADNREKRYAQQGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAK 1833
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VI++E QKKI IY+K+ IA EEITY+YKFPLEE KIPC CG++ C G+LN
Sbjct: 1834 VITIESQKKIVIYSKQAIAVNEEITYDYKFPLEENKIPCLCGTENCRGTLN 1884
>gi|432887915|ref|XP_004074975.1| PREDICTED: uncharacterized protein LOC101162384 [Oryzias latipes]
Length = 1787
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 101/167 (60%), Positives = 133/167 (79%), Gaps = 1/167 (0%)
Query: 771 RVKLRNLLAA-AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
R + R LL++ A ++LLK +Q+K RKK++RF +S IHDWGL A+EPI A++ VIEYVG+
Sbjct: 1621 RSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSNIHDWGLFAMEPIAADEMVIEYVGQ 1680
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
IR I+D+RE RYE+ GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI+V
Sbjct: 1681 NIRQVIADMREKRYEEEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITV 1740
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
E QKKI IY+++ I EEITY+YKFP+E++KIPC CG++ C G+LN
Sbjct: 1741 ESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1787
>gi|348510108|ref|XP_003442588.1| PREDICTED: hypothetical protein LOC100694318 [Oreochromis niloticus]
Length = 2205
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 105/177 (59%), Positives = 132/177 (74%), Gaps = 7/177 (3%)
Query: 765 LSARTNRVKLRNLLA-----AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
LS R R + R LL A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A
Sbjct: 2031 LSER--RSEQRRLLTVIGTTAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAA 2088
Query: 820 EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
++ VIEYVG+ IR ++D RE RY + GIGSSYLFR+D ++DATK G +ARFINH C
Sbjct: 2089 DEMVIEYVGQNIRQMVADNREKRYAQQGIGSSYLFRVDHDTIIDATKCGNLARFINHCCT 2148
Query: 880 PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
PNCY KVI++E QKKI IY+K+ IA EEITY+YKFPLEE KIPC CG++ C G+LN
Sbjct: 2149 PNCYAKVITIESQKKIVIYSKQPIAVNEEITYDYKFPLEENKIPCLCGTENCRGTLN 2205
>gi|328701191|ref|XP_003241521.1| PREDICTED: hypothetical protein LOC100573227 [Acyrthosiphon pisum]
Length = 1315
Score = 223 bits (567), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 113/254 (44%), Positives = 158/254 (62%), Gaps = 16/254 (6%)
Query: 687 AASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVH 746
A Q+ KK K C R++G + HK+ + A+ R+
Sbjct: 1074 ALPNQLPRKKKKDEWKVHDTGCGRTEGYYKVDSKQKANHKYHFGHTIAQLNRIT------ 1127
Query: 747 TKYLGPEVNASQWANGKGLS----ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ 802
E+ + + GK L+ AR+N+ +L A ++LLK + +K RKK+L+F
Sbjct: 1128 ------ELKRVKESTGKMLALSREARSNQRRLLTAYGATSDSDLLKFNVLKFRKKQLKFG 1181
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
+S IHDWGL A+E I A++ VIEYVG+++R ++D+RE +YE GIGSSYLFR+D ++
Sbjct: 1182 KSAIHDWGLFAMESIAADEMVIEYVGQMVRPVVADLRERQYEATGIGSSYLFRIDLDTII 1241
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DATK G +ARFINHSCNPNCY K+I ++GQKKI IY+K+ I EEITY+YKFPLE+ KI
Sbjct: 1242 DATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYKFPLEDNKI 1301
Query: 923 PCYCGSKKCHGSLN 936
PC CG+ C G+LN
Sbjct: 1302 PCLCGTHCCRGTLN 1315
>gi|47223666|emb|CAF99275.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1830
Score = 223 bits (567), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 102/167 (61%), Positives = 132/167 (79%), Gaps = 1/167 (0%)
Query: 771 RVKLRNLLAA-AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
R + R LL++ A ++LLK +Q+K RKK++RF +S IHDWGL ALEPI A++ VIEYVG+
Sbjct: 1664 RSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSHIHDWGLFALEPIAADEMVIEYVGQ 1723
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
IR I+D+RE RYE+ GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI+V
Sbjct: 1724 NIRQVIADMREKRYEEEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITV 1783
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
E QKKI IY+++ I EEITY+YKFP+E+ KIPC CG++ C G+LN
Sbjct: 1784 ESQKKIVIYSRQPINVNEEITYDYKFPIEDVKIPCLCGAENCRGTLN 1830
>gi|348532887|ref|XP_003453937.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Oreochromis niloticus]
Length = 1846
Score = 223 bits (567), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 101/167 (60%), Positives = 133/167 (79%), Gaps = 1/167 (0%)
Query: 771 RVKLRNLLAA-AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
R + R LL++ A ++LLK +Q+K RKK++RF +S IHDWGL A+EPI A++ VIEYVG+
Sbjct: 1680 RSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSHIHDWGLFAMEPIAADEMVIEYVGQ 1739
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
IR I+D+RE RYE+ GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI+V
Sbjct: 1740 NIRQVIADMREKRYEEEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITV 1799
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
E QKKI IY+++ I EEITY+YKFP+E++KIPC CG++ C G+LN
Sbjct: 1800 ESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1846
>gi|410903041|ref|XP_003965002.1| PREDICTED: uncharacterized protein LOC101071878 [Takifugu rubripes]
Length = 1776
Score = 223 bits (567), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 102/171 (59%), Positives = 129/171 (75%), Gaps = 5/171 (2%)
Query: 771 RVKLRNLLA-----AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
R + R LL A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A++ VIE
Sbjct: 1606 RSEQRRLLTVIGTTAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIE 1665
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
YVG+ IR ++D RE RY + GIGSSYLFR+D ++DATK G +ARFINH C PNCY K
Sbjct: 1666 YVGQNIRQMVADNREKRYAQQGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAK 1725
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VI++E QKKI IY+K+ IA EEITY+YKFPLEE KIPC CG++ C G+LN
Sbjct: 1726 VITIESQKKIVIYSKQAIAVNEEITYDYKFPLEENKIPCLCGTENCRGTLN 1776
>gi|194764460|ref|XP_001964347.1| GF23123 [Drosophila ananassae]
gi|190614619|gb|EDV30143.1| GF23123 [Drosophila ananassae]
Length = 236
Score = 223 bits (567), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 100/175 (57%), Positives = 135/175 (77%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 62 GISREARSNQRRLLTAFGSIGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 121
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 122 MVIEYVGQMIRPVVADLRESKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 181
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI++E +KKI IY+K+ I EEITY+YKFPLE++KIPC C ++ C G+LN
Sbjct: 182 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCAAQGCRGTLN 236
>gi|302764394|ref|XP_002965618.1| hypothetical protein SELMODRAFT_84518 [Selaginella moellendorffii]
gi|300166432|gb|EFJ33038.1| hypothetical protein SELMODRAFT_84518 [Selaginella moellendorffii]
Length = 139
Score = 222 bits (566), Expect = 6e-55, Method: Composition-based stats.
Identities = 90/139 (64%), Positives = 121/139 (87%)
Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
++ARKK L+FQRSKIH WG+VA+E IE EDF++EYVGE++R K++D+RE RY + G+GSS
Sbjct: 1 MRARKKLLKFQRSKIHAWGVVAMEFIEPEDFIVEYVGEVLRPKVADVREVRYLRQGLGSS 60
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
Y FR+ DG+V+DAT+RGG+ RFINHSC PNCY K+I+VEGQK++FIYA+ HIA G E+TY
Sbjct: 61 YFFRVGDGFVIDATQRGGLGRFINHSCEPNCYAKIITVEGQKRVFIYARTHIAPGTELTY 120
Query: 912 NYKFPLEEKKIPCYCGSKK 930
+YKFP E++KIPC CG+++
Sbjct: 121 DYKFPHEDQKIPCLCGAER 139
>gi|384499027|gb|EIE89518.1| hypothetical protein RO3G_14229 [Rhizopus delemar RA 99-880]
Length = 1674
Score = 222 bits (566), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 104/178 (58%), Positives = 133/178 (74%), Gaps = 6/178 (3%)
Query: 765 LSARTNRVKLRNLLAAAE------GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIE 818
+S+R RV R L E +++LK +Q+K+RKK+L+F +S IHDWGL A E I+
Sbjct: 1497 ISSRATRVNNRRLAIGMEMQKKTIDSDILKFNQLKSRKKQLKFAKSPIHDWGLFAEEHID 1556
Query: 819 AEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
D VIEYVGE+IR ++++ RE +YE+ GIGSSYLFR+DD V+DATKRG IARFINH C
Sbjct: 1557 VNDMVIEYVGEMIRQQVAEEREKQYERCGIGSSYLFRVDDDTVIDATKRGSIARFINHCC 1616
Query: 879 NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+PNC K+I+V+ QKKI IYA R I GEEITY+YKFP+E +KIPC CGSK C G+LN
Sbjct: 1617 SPNCSAKIITVDKQKKIVIYANRDIEPGEEITYDYKFPIEAEKIPCLCGSKFCKGTLN 1674
>gi|427794953|gb|JAA62928.1| hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 1557
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 102/171 (59%), Positives = 134/171 (78%), Gaps = 1/171 (0%)
Query: 767 ARTNRVKLRNLLAAAEG-AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
AR+N+ +L AE ++LLK +Q+K RKK+L+F +S+IHDWGL ALEPI A++ VIE
Sbjct: 1387 ARSNQRRLLTSFGEAEVISDLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIE 1446
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
YVG+++R ++D RE Y ++GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY K
Sbjct: 1447 YVGQMVRPIMADRREQFYTQIGIGSSYLFRVDVETIIDATKCGNLARFINHSCNPNCYAK 1506
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VI+VEGQKKI IY+K+ I EEITY+YKFPLE++KI C CG+ +C G LN
Sbjct: 1507 VITVEGQKKIVIYSKQPINVNEEITYDYKFPLEDEKIVCLCGAPQCRGFLN 1557
>gi|330797279|ref|XP_003286689.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
gi|325083363|gb|EGC36818.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
Length = 1340
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 98/171 (57%), Positives = 131/171 (76%), Gaps = 5/171 (2%)
Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
SAR++R R +G++ + + +K+R+K+++FQRS IHDWGL A+E I A+D VIE
Sbjct: 1175 SARSSRFDSR-----FQGSDSVTLASLKSRRKKIKFQRSDIHDWGLFAMETINAKDMVIE 1229
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
Y+GE+IR K++D RE RY K GIGSSYLFR+DD ++DAT +G +ARFINH C+PNC K
Sbjct: 1230 YIGEVIRQKVADEREKRYIKKGIGSSYLFRVDDDTIIDATLKGNLARFINHCCDPNCIAK 1289
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
V+++ QKKI IYAKR I GEEITY+YKFP+E++KIPC C S KC G+LN
Sbjct: 1290 VLTINNQKKIIIYAKRDINIGEEITYDYKFPIEDEKIPCLCKSPKCRGTLN 1340
>gi|301622880|ref|XP_002940755.1| PREDICTED: hypothetical protein LOC100487270 [Xenopus (Silurana)
tropicalis]
Length = 1817
Score = 221 bits (563), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 105/189 (55%), Positives = 136/189 (71%), Gaps = 7/189 (3%)
Query: 753 EVNASQWANGKGLSARTNRVKLRNLLAAAEG-----AELLKASQVKARKKRLRFQRSKIH 807
E A + LS R R + R LL+A ++LLK +Q+K RKK+LRF RS IH
Sbjct: 1631 EQEADTQGTNRVLSER--RSEQRRLLSAMGSTALLDSDLLKLNQLKFRKKKLRFGRSHIH 1688
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
+WGL A+EPI A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK
Sbjct: 1689 EWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYSQQGIGSSYLFRMDQDTIIDATKC 1748
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G +ARFINH C+PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG
Sbjct: 1749 GNLARFINHCCSPNCYAKVITIESQKKIVIYSKQPIGINEEITYDYKFPLEDNKIPCLCG 1808
Query: 928 SKKCHGSLN 936
++ C G+LN
Sbjct: 1809 TENCRGTLN 1817
>gi|326666050|ref|XP_001920852.3| PREDICTED: hypothetical protein LOC556535 [Danio rerio]
Length = 2253
Score = 221 bits (563), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 100/171 (58%), Positives = 129/171 (75%), Gaps = 5/171 (2%)
Query: 771 RVKLRNLLAA-----AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
R + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A++ VIE
Sbjct: 2083 RSEQRRLLSAIGTPAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIE 2142
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
YVG+ IR ++D RE RY + GIGSSYLFR+D ++DATK G +ARFINH C PNCY K
Sbjct: 2143 YVGQSIRQMVADNREKRYAQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAK 2202
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VI++E QKKI IY+K+ I EEITY+YKFP+EE KIPC CG++ C G+LN
Sbjct: 2203 VITIESQKKIVIYSKQPIGVNEEITYDYKFPIEENKIPCLCGTESCRGTLN 2253
>gi|344294411|ref|XP_003418911.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A-like [Loxodonta africana]
Length = 1703
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAA-----AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1525 TNRVLSERRSEQRRLLSAIGTSAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1584
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1585 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1644
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1645 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1703
>gi|29437232|gb|AAH49883.1| Setd1a protein [Mus musculus]
Length = 849
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)
Query: 754 VNASQWANGKGLSARTNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQ 802
V+A Q G + TNRV + R LL+A ++LLK +Q+K RKK+LRF
Sbjct: 658 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 715
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS+IH+WGL A+EPI A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++
Sbjct: 716 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 775
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KI
Sbjct: 776 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 835
Query: 923 PCYCGSKKCHGSLN 936
PC CG++ C GSLN
Sbjct: 836 PCLCGTESCRGSLN 849
>gi|357631650|gb|EHJ79119.1| hypothetical protein KGM_15585 [Danaus plexippus]
Length = 1491
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 97/170 (57%), Positives = 129/170 (75%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
AR+N+ +L ++LLK +Q+K RKK+L+F +S IHDWGL A E I A++ VIEY
Sbjct: 1322 ARSNQRRLLTAFGTDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAQEAIAADEMVIEY 1381
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VG+++R ++D+RE YE GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY K+
Sbjct: 1382 VGQMVRPIVADVREAHYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKI 1441
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I++E QKKI IY+K+ I EEITY+YKFPLE++KIPC CG+ +C G LN
Sbjct: 1442 ITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDEKIPCLCGAPQCRGYLN 1491
>gi|119572565|gb|EAW52180.1| hCG1998636, isoform CRA_b [Homo sapiens]
Length = 1189
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1011 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1070
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1071 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1130
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1131 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1189
>gi|329664196|ref|NP_001192362.1| histone-lysine N-methyltransferase SETD1A [Bos taurus]
Length = 1708
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1530 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1589
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1590 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1649
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1650 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1708
>gi|296473196|tpg|DAA15311.1| TPA: SET domain containing 1A [Bos taurus]
Length = 1708
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1530 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1589
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1590 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1649
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1650 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1708
>gi|6683126|dbj|BAA20797.2| KIAA0339 protein [Homo sapiens]
Length = 1709
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1531 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1590
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1591 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1650
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1651 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1709
>gi|55741677|ref|NP_055527.1| histone-lysine N-methyltransferase SETD1A [Homo sapiens]
gi|68052990|sp|O15047.3|SET1A_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD1A; AltName:
Full=Lysine N-methyltransferase 2F; AltName: Full=SET
domain-containing protein 1A; Short=hSET1A; AltName:
Full=Set1/Ash2 histone methyltransferase complex subunit
SET1
gi|168272962|dbj|BAG10320.1| SET domain-containing protein 1A [synthetic construct]
Length = 1707
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707
>gi|296220008|ref|XP_002807469.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Callithrix jacchus]
Length = 1708
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1530 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1589
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1590 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1649
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1650 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1708
>gi|348584344|ref|XP_003477932.1| PREDICTED: histone-lysine N-methyltransferase SETD1A-like [Cavia
porcellus]
Length = 1702
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1524 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1583
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1584 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1643
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1644 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1702
>gi|119508422|ref|NP_821172.2| SET domain containing 1A [Mus musculus]
Length = 1716
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)
Query: 754 VNASQWANGKGLSARTNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQ 802
V+A Q G + TNRV + R LL+A ++LLK +Q+K RKK+LRF
Sbjct: 1525 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 1582
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS+IH+WGL A+EPI A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++
Sbjct: 1583 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 1642
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KI
Sbjct: 1643 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 1702
Query: 923 PCYCGSKKCHGSLN 936
PC CG++ C GSLN
Sbjct: 1703 PCLCGTESCRGSLN 1716
>gi|431906860|gb|ELK10981.1| Histone-lysine N-methyltransferase SETD1A [Pteropus alecto]
Length = 1665
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1487 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1546
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1547 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1606
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1607 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1665
>gi|114662089|ref|XP_001155055.1| PREDICTED: histone-lysine N-methyltransferase SETD1A isoform 2 [Pan
troglodytes]
gi|410289610|gb|JAA23405.1| SET domain containing 1A [Pan troglodytes]
Length = 1707
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707
>gi|410221312|gb|JAA07875.1| SET domain containing 1A [Pan troglodytes]
Length = 1707
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707
>gi|403276868|ref|XP_003930105.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Saimiri
boliviensis boliviensis]
Length = 1713
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1535 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1594
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1595 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1654
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1655 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1713
>gi|354497843|ref|XP_003511027.1| PREDICTED: histone-lysine N-methyltransferase SETD1A-like [Cricetulus
griseus]
gi|344247133|gb|EGW03237.1| Histone-lysine N-methyltransferase SETD1A [Cricetulus griseus]
Length = 1723
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1545 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1604
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1605 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1664
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1665 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1723
>gi|358333784|dbj|GAA31138.2| histone-lysine N-methyltransferase SETD1B [Clonorchis sinensis]
Length = 1685
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 101/172 (58%), Positives = 127/172 (73%), Gaps = 3/172 (1%)
Query: 768 RTNRVKLRNLLAA---AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
R R R LLA E +LLK +Q+K RKK+L F +S IH WGL+ALEPI AE+ VI
Sbjct: 1514 REARSAQRRLLAQFQDIETGDLLKFNQLKFRKKQLIFAKSPIHAWGLIALEPIAAEEMVI 1573
Query: 825 EYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
EYVG+++R ++++RE +YE GIG SYLFR+DD +V+DAT G RFINHSC PNCY
Sbjct: 1574 EYVGQVVRKSVAELRERQYEAKGIGGSYLFRIDDDFVIDATMCGNNGRFINHSCQPNCYA 1633
Query: 885 KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
K+I+VEG+KKI IY+KR I EEITY+YKFP EE+KIPC CG+ C G+LN
Sbjct: 1634 KIITVEGKKKIVIYSKRDINVMEEITYDYKFPYEEEKIPCQCGASTCRGTLN 1685
>gi|194219010|ref|XP_001915080.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Equus caballus]
Length = 1707
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707
>gi|410984758|ref|XP_003998693.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Felis catus]
Length = 1708
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1530 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1589
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1590 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1649
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1650 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1708
>gi|402908170|ref|XP_003916826.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Papio anubis]
Length = 1707
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707
>gi|148685638|gb|EDL17585.1| mCG141846, isoform CRA_b [Mus musculus]
Length = 1006
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)
Query: 754 VNASQWANGKGLSARTNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQ 802
V+A Q G + TNRV + R LL+A ++LLK +Q+K RKK+LRF
Sbjct: 815 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 872
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS+IH+WGL A+EPI A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++
Sbjct: 873 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 932
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KI
Sbjct: 933 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 992
Query: 923 PCYCGSKKCHGSLN 936
PC CG++ C GSLN
Sbjct: 993 PCLCGTESCRGSLN 1006
>gi|332262972|ref|XP_003280531.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Nomascus
leucogenys]
Length = 1675
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1497 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1556
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1557 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1616
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1617 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1675
>gi|297283866|ref|XP_002808342.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A-like [Macaca mulatta]
Length = 1704
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1526 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1585
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1586 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1645
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1646 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1704
>gi|351711375|gb|EHB14294.1| Histone-lysine N-methyltransferase SETD1A [Heterocephalus glaber]
Length = 1502
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1324 TNRVLSERRSEQRRLLSAIGTSAIVDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1383
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1384 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1443
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1444 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDSKIPCLCGAESCRGSLN 1502
>gi|345801544|ref|XP_848999.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Canis lupus familiaris]
Length = 1713
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1535 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1594
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1595 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1654
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1655 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1713
>gi|350581585|ref|XP_003481071.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Sus scrofa]
Length = 1550
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1372 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1431
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1432 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1491
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1492 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1550
>gi|395846353|ref|XP_003795872.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Otolemur
garnettii]
Length = 1706
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1528 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1587
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1588 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1647
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1648 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1706
>gi|126334524|ref|XP_001364653.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Monodelphis
domestica]
Length = 1706
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1528 TNRVLSERRSEQRRLLSAIGTSAIVDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1587
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1588 AADEMVIEYVGQNIRQVVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1647
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1648 CTPNCYAKVITIEAQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1706
>gi|241612901|ref|XP_002407306.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215502770|gb|EEC12264.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 208
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 103/171 (60%), Positives = 134/171 (78%), Gaps = 1/171 (0%)
Query: 767 ARTNRVKLRNLLAAAEG-AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
AR+N+ +L AE ++LLK +Q+K RKK+L+F +S+IHDWGL ALEPI A++ VIE
Sbjct: 38 ARSNQRRLLTSFGEAEVVSDLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIE 97
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
YVG+++R ++D RE Y ++GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY K
Sbjct: 98 YVGQMVRPVMADRREQHYTQIGIGSSYLFRVDVETIIDATKCGNLARFINHSCNPNCYAK 157
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VI+VEGQKKI IY+K+ I EEITY+YKFPLEE+KI C CG+ +C G LN
Sbjct: 158 VITVEGQKKIVIYSKQPINVNEEITYDYKFPLEEEKISCLCGAPQCRGFLN 208
>gi|328768995|gb|EGF79040.1| hypothetical protein BATDEDRAFT_35515 [Batrachochytrium dendrobatidis
JAM81]
Length = 1367
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 95/152 (62%), Positives = 119/152 (78%)
Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
+ L +Q+KARKKR++F +S IHDWGL A+EPI+A D VIEY+GE+IR K++D RE YE
Sbjct: 1216 DALSFNQLKARKKRIKFDKSIIHDWGLFAMEPIDANDMVIEYIGEIIRQKVADHREKLYE 1275
Query: 845 KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
GIGSSYLFR+D+ ++DATK G +ARFINH C PNC KVISV+G K+I IYA R I
Sbjct: 1276 ASGIGSSYLFRVDEDTIIDATKTGNLARFINHCCEPNCNAKVISVDGTKRIVIYANRDIK 1335
Query: 905 AGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
GEE+TY+YKFP+EE KIPC CG+ C G+LN
Sbjct: 1336 EGEELTYDYKFPIEEDKIPCLCGAVNCRGTLN 1367
>gi|397472033|ref|XP_003807565.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Pan paniscus]
Length = 1479
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1301 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1360
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1361 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1420
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1421 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1479
>gi|444725782|gb|ELW66336.1| Histone-lysine N-methyltransferase SETD1A [Tupaia chinensis]
Length = 1344
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1166 TNRVLSERRSEQRRLLSAIGTSAILDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1225
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1226 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1285
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1286 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1344
>gi|119572564|gb|EAW52179.1| hCG1998636, isoform CRA_a [Homo sapiens]
gi|119572567|gb|EAW52182.1| hCG1998636, isoform CRA_a [Homo sapiens]
Length = 1010
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 832 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 891
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 892 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 951
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 952 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1010
>gi|426255271|ref|XP_004021280.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Ovis aries]
Length = 1394
Score = 219 bits (559), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1216 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1275
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1276 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1335
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1336 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1394
>gi|395514884|ref|XP_003761641.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Sarcophilus
harrisii]
Length = 1691
Score = 219 bits (559), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1513 TNRVLSERRSEQRRLLSAIGTSAIVDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1572
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1573 AADEMVIEYVGQNIRQVVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1632
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1633 CTPNCYAKVITIEAQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1691
>gi|321472797|gb|EFX83766.1| hypothetical protein DAPPUDRAFT_301653 [Daphnia pulex]
Length = 303
Score = 219 bits (558), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 100/187 (53%), Positives = 134/187 (71%)
Query: 750 LGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDW 809
LGP AR+N+ +L ++LLK +Q+K R +L+F +S IHDW
Sbjct: 117 LGPTDTGINKGQALSREARSNQRRLLTAFGLENDSDLLKFNQLKFRGAKLKFGKSSIHDW 176
Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
GL A+E I A++ VIEYVG+++R ++D+RE +YE +GIGSSYLFR+D Y++DATK G
Sbjct: 177 GLFAVESIAADEMVIEYVGQVVRPVLADLRETQYEAVGIGSSYLFRVDLEYIIDATKCGN 236
Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSK 929
+ARFINHSCNPNCY +VI++E QKKI IY+K+ I GEEITY+YKFP+EE KI C CGS
Sbjct: 237 LARFINHSCNPNCYARVITIESQKKIVIYSKQPIGVGEEITYDYKFPIEEDKIICLCGSS 296
Query: 930 KCHGSLN 936
+C G+LN
Sbjct: 297 QCRGTLN 303
>gi|27552784|gb|AAH42890.1| Setd1a protein, partial [Mus musculus]
Length = 458
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)
Query: 754 VNASQWANGKGLSARTNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQ 802
V+A Q G + TNRV + R LL+A ++LLK +Q+K RKK+LRF
Sbjct: 267 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 324
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS+IH+WGL A+EPI A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++
Sbjct: 325 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 384
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KI
Sbjct: 385 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 444
Query: 923 PCYCGSKKCHGSLN 936
PC CG++ C GSLN
Sbjct: 445 PCLCGTESCRGSLN 458
>gi|302760998|ref|XP_002963921.1| hypothetical protein SELMODRAFT_81880 [Selaginella moellendorffii]
gi|302769161|ref|XP_002968000.1| hypothetical protein SELMODRAFT_88044 [Selaginella moellendorffii]
gi|300164738|gb|EFJ31347.1| hypothetical protein SELMODRAFT_88044 [Selaginella moellendorffii]
gi|300167650|gb|EFJ34254.1| hypothetical protein SELMODRAFT_81880 [Selaginella moellendorffii]
Length = 143
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 97/143 (67%), Positives = 123/143 (86%), Gaps = 4/143 (2%)
Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
++AR K L+FQRSKIH WG++AL+ IE EDFVIEY+GEL+R K++D+RE RYEKMGI SS
Sbjct: 1 MQARTKSLKFQRSKIHSWGVIALQSIEPEDFVIEYIGELVRSKVADLRERRYEKMGIDSS 60
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCN----PNCYTKVISVEGQKKIFIYAKRHIAAGE 907
YLFR+D VVDATK GG+ARFINHSC+ PNCYTK+++VEGQK+IFIY+K+HI GE
Sbjct: 61 YLFRVDAENVVDATKHGGLARFINHSCDARILPNCYTKILTVEGQKRIFIYSKKHIKVGE 120
Query: 908 EITYNYKFPLEEKKIPCYCGSKK 930
E+TY+YKFP EE+KIPC CG+++
Sbjct: 121 ELTYDYKFPFEEQKIPCLCGAER 143
>gi|113206662|gb|ABI34482.1| SET domain containing 1Bb [Danio rerio]
Length = 175
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 101/167 (60%), Positives = 131/167 (78%), Gaps = 1/167 (0%)
Query: 771 RVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
R + R LL++ ++LLK +Q+K RKKRLRF +S+IHDWGL A EPI A++ +IEYVG+
Sbjct: 9 RAEQRRLLSSFSCDSDLLKFNQLKFRKKRLRFGKSRIHDWGLFAEEPIAADEMIIEYVGQ 68
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
IR I+D+RE RYE GIGSSYLFR+D ++DATK G +ARFINHSCNPNCY KVI+V
Sbjct: 69 SIRQVIADMRERRYETEGIGSSYLFRVDHDTIIDATKCGNLARFINHSCNPNCYAKVITV 128
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
E QKKI IY+++ I EEITY+YKFP+E++KIPC C ++ C G+LN
Sbjct: 129 EAQKKIVIYSRQPITVNEEITYDYKFPIEDEKIPCLCAAENCRGTLN 175
>gi|340373417|ref|XP_003385238.1| PREDICTED: hypothetical protein LOC100636150 [Amphimedon
queenslandica]
Length = 1053
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 110/231 (47%), Positives = 154/231 (66%), Gaps = 17/231 (7%)
Query: 711 SDGCARSSISGWEWHKWSLNASPAERAR----VRGAQYVHTKYLGPEVNASQWANGKGLS 766
S GCARS G+ P++++R +R ++ K + N++ G
Sbjct: 835 SSGCARSE--GY------YKIDPSDKSRYLHHLRRSKASEDK----DKNSAPPPTSSGRG 882
Query: 767 ARTNRVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
R N ++ +L + + EL + +Q+KARKK+L F +S IH+WGL ALE I A++ V+E
Sbjct: 883 NRLNHRRVASLFTSGDVSRELQQYNQLKARKKQLTFAKSTIHNWGLFALETIPADEMVVE 942
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
Y+G+++R I+D RE RYE GIGSSYLFR+D +V+DATK G ARFINH C+PNCY K
Sbjct: 943 YIGQVVRHGIADERERRYEAQGIGSSYLFRVDYDHVIDATKSGNFARFINHCCDPNCYAK 1002
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+I+V QKKI IY+KR I AGEEITY+YKFP+E++KIPC CG+ +C G+LN
Sbjct: 1003 IITVGNQKKIVIYSKRDIRAGEEITYDYKFPIEDEKIPCLCGAPQCRGTLN 1053
>gi|20071601|gb|AAH27450.1| SETD1A protein [Homo sapiens]
Length = 469
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 291 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 350
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 351 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 410
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 411 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 469
>gi|16307411|gb|AAH10250.1| Setd1a protein, partial [Mus musculus]
Length = 316
Score = 216 bits (551), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)
Query: 754 VNASQWANGKGLSARTNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQ 802
V+A Q G + TNRV + R LL+A ++LLK +Q+K RKK+LRF
Sbjct: 125 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 182
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS+IH+WGL A+EPI A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++
Sbjct: 183 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 242
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KI
Sbjct: 243 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 302
Query: 923 PCYCGSKKCHGSLN 936
PC CG++ C GSLN
Sbjct: 303 PCLCGTESCRGSLN 316
>gi|301778787|ref|XP_002924825.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A-like [Ailuropoda melanoleuca]
Length = 1594
Score = 216 bits (550), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 104/180 (57%), Positives = 133/180 (73%), Gaps = 12/180 (6%)
Query: 769 TNRV------KLRNLLAA-----AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1415 TNRVLSERRSEQRRLLSAIGTSAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1474
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 1475 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1534
Query: 878 C-NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1535 CMQPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1594
>gi|66805051|ref|XP_636258.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74852293|sp|Q54HS3.1|SET1_DICDI RecName: Full=Histone-lysine N-methyltransferase set1; AltName:
Full=Histone H3 lysine 4 methyltransferase; AltName:
Full=SET domain-containing protein 1
gi|60464689|gb|EAL62816.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 1486
Score = 216 bits (550), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 97/171 (56%), Positives = 128/171 (74%), Gaps = 5/171 (2%)
Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
SAR++R R G++ + + +K+R+KR++F+RS IHDWGL A+E I A+D VIE
Sbjct: 1321 SARSSRFDNRGF-----GSDPITLASLKSRRKRIKFERSDIHDWGLFAMETISAKDMVIE 1375
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
Y+GE+IR K++D RE RY K GIGSSYLFR+DD ++DAT +G +ARFINH C+PNC K
Sbjct: 1376 YIGEVIRQKVADEREKRYVKKGIGSSYLFRVDDDTIIDATFKGNLARFINHCCDPNCIAK 1435
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
V+++ QKKI IYAKR I GEEITY+YKFP+E+ KIPC C S KC +LN
Sbjct: 1436 VLTIGNQKKIIIYAKRDINIGEEITYDYKFPIEDVKIPCLCKSPKCRQTLN 1486
>gi|71051743|gb|AAH98812.1| RGD1311624 protein, partial [Rattus norvegicus]
Length = 216
Score = 216 bits (550), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 108/190 (56%), Positives = 139/190 (73%), Gaps = 8/190 (4%)
Query: 752 PEVNASQWANGKGLSARTNRVKLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKI 806
PE +Q N + LS R R + R LL+A ++LLK +Q+K RKK+LRF RS+I
Sbjct: 30 PEGVDTQGTN-RVLSER--RSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRI 86
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
H+WGL A+EPI A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK
Sbjct: 87 HEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATK 146
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
G +ARFINH C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC C
Sbjct: 147 CGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLC 206
Query: 927 GSKKCHGSLN 936
G++ C GSLN
Sbjct: 207 GTESCRGSLN 216
>gi|195553639|ref|XP_002076709.1| GD11928 [Drosophila simulans]
gi|194202088|gb|EDX15664.1| GD11928 [Drosophila simulans]
Length = 374
Score = 216 bits (549), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 102/194 (52%), Positives = 136/194 (70%), Gaps = 19/194 (9%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ +L + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 181 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 240
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN-- 879
VIEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCN
Sbjct: 241 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNVS 300
Query: 880 -----------------PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
PNCY KVI++E +KKI IY+K+ I EEITY+YKFPLEE+KI
Sbjct: 301 ITGYFDIMLPTYNFVFQPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKI 360
Query: 923 PCYCGSKKCHGSLN 936
PC CG++ C G+LN
Sbjct: 361 PCLCGAQGCRGTLN 374
>gi|380801021|gb|AFE72386.1| histone-lysine N-methyltransferase SETD1A, partial [Macaca mulatta]
Length = 310
Score = 216 bits (549), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 132 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 191
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 192 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 251
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 252 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 310
>gi|355756723|gb|EHH60331.1| hypothetical protein EGM_11666, partial [Macaca fascicularis]
Length = 256
Score = 216 bits (549), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)
Query: 769 TNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 78 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 137
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 138 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 197
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 198 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 256
>gi|409047697|gb|EKM57176.1| hypothetical protein PHACADRAFT_142398 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1389
Score = 215 bits (548), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 98/169 (57%), Positives = 125/169 (73%), Gaps = 2/169 (1%)
Query: 770 NRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
N+V+L L+ E A EL+K +Q++ RKK LRF RS IHDWGL A+E I + VIEYV
Sbjct: 1221 NQVQLAMALSKGENAATELVKFNQLQTRKKHLRFARSPIHDWGLYAMEKISRGEMVIEYV 1280
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
GE+IR +++D RE YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC K+I
Sbjct: 1281 GEIIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKII 1340
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
++ +KKI IYAK+ I G EITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 1341 TINSEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGYLN 1389
>gi|328875054|gb|EGG23419.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1359
Score = 215 bits (547), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 94/145 (64%), Positives = 116/145 (80%)
Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
+ +RKK+++F+RS IHDWGL ALEPI A D VIEY+GE+IR K++D RE RY K GIGSS
Sbjct: 1215 MTSRKKKIKFERSLIHDWGLFALEPIYARDMVIEYIGEVIRQKVADEREKRYTKQGIGSS 1274
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
YLFR+DD ++DAT +G ARFINH C+PNC KVI++ GQKKI IYAKR I GEE+TY
Sbjct: 1275 YLFRIDDDTIIDATFKGNQARFINHCCDPNCMAKVITMGGQKKIIIYAKRDINVGEELTY 1334
Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
+YKFP+E+ KIPC C S KC G+LN
Sbjct: 1335 DYKFPIEDVKIPCLCKSAKCRGTLN 1359
>gi|119572569|gb|EAW52184.1| hCG1998636, isoform CRA_e [Homo sapiens]
Length = 1175
Score = 214 bits (546), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 102/193 (52%), Positives = 133/193 (68%), Gaps = 2/193 (1%)
Query: 746 HTKYL--GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQR 803
HT L GP + + A +G + + K + L A L+ + RKK+LRF R
Sbjct: 983 HTNILSDGPREHQTGSARSEGYYPISKKEKDKYLDVCPVSARQLEGVDTQFRKKKLRFGR 1042
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S+IH+WGL A+EPI A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++D
Sbjct: 1043 SRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIID 1102
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
ATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIP
Sbjct: 1103 ATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIP 1162
Query: 924 CYCGSKKCHGSLN 936
C CG++ C GSLN
Sbjct: 1163 CLCGTESCRGSLN 1175
>gi|410922130|ref|XP_003974536.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Takifugu
rubripes]
Length = 1302
Score = 214 bits (546), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 104/203 (51%), Positives = 143/203 (70%), Gaps = 13/203 (6%)
Query: 744 YVHTKYLGPEVNASQWANGKGLSARTN---------RVKLRNLLAAAE-GAELLKASQVK 793
Y++ L E+ ++Q G G++ + + R++ R LL++ ++LLK +Q+K
Sbjct: 1103 YMNNTELSAELPSTQ---GTGVAGQQSTSLRWGSDFRLEQRRLLSSFNCDSDLLKFNQLK 1159
Query: 794 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
RKK RF S IH+WGL A+EPI AE+ V+EYVGE+IR I+D+RE RYE+ GI SSY+
Sbjct: 1160 FRKKMTRFSPSHIHEWGLFAMEPIAAEEMVMEYVGEIIRQVIADMREQRYEESGIRSSYM 1219
Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
FR+D ++DATK G +ARFINHSCNPNCY K+I+VE QKKI IY+++ I+ EEITY+Y
Sbjct: 1220 FRIDQETIIDATKCGNVARFINHSCNPNCYAKIITVESQKKIVIYSRQPISINEEITYDY 1279
Query: 914 KFPLEEKKIPCYCGSKKCHGSLN 936
KFP E+ KIPC C + C GSLN
Sbjct: 1280 KFPSEDTKIPCLCRATGCRGSLN 1302
>gi|281201147|gb|EFA75361.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 1458
Score = 214 bits (546), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 99/171 (57%), Positives = 126/171 (73%), Gaps = 10/171 (5%)
Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
SAR++R + R G +++ + +RKK+++F RS IHDWGL ALEPI A+D VIE
Sbjct: 1298 SARSSRYESRM------GGDIV----LTSRKKKIKFSRSAIHDWGLFALEPITAKDMVIE 1347
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
Y+GE+IR K++D RE RY K GIGSSYLFR+DD ++DAT +G +ARFINH C+PNC K
Sbjct: 1348 YIGEVIRQKVADEREKRYTKKGIGSSYLFRIDDDTIIDATFKGNLARFINHCCDPNCIAK 1407
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+I QKKI IYAKR I GEEITY+YKFP+E+ KIPC C S KC G+LN
Sbjct: 1408 IIQTGNQKKIVIYAKRDIMIGEEITYDYKFPIEDVKIPCLCKSAKCRGTLN 1458
>gi|403159096|ref|XP_003890756.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
gi|375166585|gb|EHS63201.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
Length = 1502
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 94/153 (61%), Positives = 119/153 (77%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
A++LK +Q++ RKK+L+F RS IHDWGL A+E I A + VIEYVGE+IR ++D RE Y
Sbjct: 1350 ADVLKFNQLRTRKKQLKFSRSPIHDWGLYAMETIPAGEMVIEYVGEVIRQAVADRREKLY 1409
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E+MGIGSSYLFR+DD VVDATK+G + R INH C+PNC K+I++ G+KKI IYAK I
Sbjct: 1410 ERMGIGSSYLFRVDDDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTI 1469
Query: 904 AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
G+E+TY+Y FP EE KIPC CGS KC G+LN
Sbjct: 1470 ELGDEVTYDYHFPKEEVKIPCLCGSVKCKGTLN 1502
>gi|348554403|ref|XP_003463015.1| PREDICTED: hypothetical protein LOC100714908 [Cavia porcellus]
Length = 1931
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 104/169 (61%), Positives = 133/169 (78%), Gaps = 3/169 (1%)
Query: 771 RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1763 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1822
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI
Sbjct: 1823 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1882
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+VE QKKI IY+K+HI+ EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1883 TVESQKKIVIYSKQHISVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1931
>gi|326436327|gb|EGD81897.1| hypothetical protein PTSG_11893 [Salpingoeca sp. ATCC 50818]
Length = 296
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 96/155 (61%), Positives = 127/155 (81%), Gaps = 2/155 (1%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
A+L+KA+Q K R+K LRF RS IH+WGL A EPI+ ++ VIEYVGE++R +++ RE RY
Sbjct: 142 ADLVKANQFKRRRKLLRFARSMIHEWGLFAQEPIDKDELVIEYVGEIVRQTVAEDRERRY 201
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
++GIGSSYLFR+D+ YV+DAT+ G IARFINHSC+ NCY +V+SV+G+K+I IY+KR I
Sbjct: 202 ARIGIGSSYLFRIDEDYVIDATRMGSIARFINHSCDANCYAQVVSVDGKKRIGIYSKRPI 261
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
AA EEITY+YKFP EE KIPC+CG++ C G+LN
Sbjct: 262 AANEEITYDYKFPREEGPNKIPCFCGARTCRGTLN 296
>gi|355718717|gb|AES06362.1| SET domain containing 1A [Mustela putorius furo]
Length = 296
Score = 213 bits (542), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 103/178 (57%), Positives = 132/178 (74%), Gaps = 11/178 (6%)
Query: 769 TNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
TNRV + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 119 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 178
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVG+ IR ++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH
Sbjct: 179 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 238
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSL 935
C PNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSL
Sbjct: 239 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSL 296
>gi|334327124|ref|XP_003340832.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like, partial [Monodelphis domestica]
Length = 1723
Score = 213 bits (542), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 140/186 (75%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1540 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1597
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1598 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1657
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI+ EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1658 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIEDVKIPCLCGSEN 1717
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1718 CRGTLN 1723
>gi|331214149|ref|XP_003319756.1| Setd1a protein [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 1014
Score = 213 bits (542), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 94/153 (61%), Positives = 119/153 (77%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
A++LK +Q++ RKK+L+F RS IHDWGL A+E I A + VIEYVGE+IR ++D RE Y
Sbjct: 862 ADVLKFNQLRTRKKQLKFSRSPIHDWGLYAMETIPAGEMVIEYVGEVIRQAVADRREKLY 921
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E+MGIGSSYLFR+DD VVDATK+G + R INH C+PNC K+I++ G+KKI IYAK I
Sbjct: 922 ERMGIGSSYLFRVDDDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTI 981
Query: 904 AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
G+E+TY+Y FP EE KIPC CGS KC G+LN
Sbjct: 982 ELGDEVTYDYHFPKEEVKIPCLCGSVKCKGTLN 1014
>gi|395329295|gb|EJF61682.1| hypothetical protein DICSQDRAFT_85722 [Dichomitus squalens LYAD-421
SS1]
Length = 1095
Score = 213 bits (541), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 97/169 (57%), Positives = 125/169 (73%), Gaps = 2/169 (1%)
Query: 770 NRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
N+V+ L+ E A + +K +Q++ RKK LRF RS IHDWGL A+E I D VIEYV
Sbjct: 927 NQVQRAMALSKGEAAAPDSVKYNQLQTRKKHLRFARSPIHDWGLYAMEKINRGDLVIEYV 986
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
GE+IR +++D RE YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC K+I
Sbjct: 987 GEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKII 1046
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
++ G+KKI IYAK+ I G EITY+Y FP+E+ KIPC CGS KC G+LN
Sbjct: 1047 TISGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGTLN 1095
>gi|71897211|ref|NP_001025832.1| histone-lysine N-methyltransferase SETD1B [Gallus gallus]
gi|82231199|sp|Q5F3P8.1|SET1B_CHICK RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|60098811|emb|CAH65236.1| hypothetical protein RCJMB04_10j6 [Gallus gallus]
Length = 2008
Score = 213 bits (541), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 109/189 (57%), Positives = 140/189 (74%), Gaps = 5/189 (2%)
Query: 751 GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IH
Sbjct: 1822 GMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIH 1879
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
DWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK
Sbjct: 1880 DWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 1939
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CG
Sbjct: 1940 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCG 1999
Query: 928 SKKCHGSLN 936
S+ C G+LN
Sbjct: 2000 SENCRGTLN 2008
>gi|224071200|ref|XP_002193972.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Taeniopygia
guttata]
Length = 2004
Score = 212 bits (540), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1821 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1878
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1879 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1938
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1939 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1998
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1999 CRGTLN 2004
>gi|198414837|ref|XP_002125245.1| PREDICTED: similar to AGAP002246-PA, partial [Ciona intestinalis]
Length = 754
Score = 212 bits (539), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 109/247 (44%), Positives = 159/247 (64%), Gaps = 9/247 (3%)
Query: 695 KKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV 754
K+ +A K S++ P GCARS ++K + N +R + + G
Sbjct: 512 KRRQARKDDISDIKPHKTGCARSE----GFYKVNDNTKLIQRKLMYRQADNTPQPKGRSK 567
Query: 755 NASQWANGKGLS---ARTNRVKLRNLLA--AAEGAELLKASQVKARKKRLRFQRSKIHDW 809
A++ A+ G++ +R R +R + + ++ ++LLK +Q+ RKK ++F+RS IH W
Sbjct: 568 YANESASTSGVATEKSRETRHMMRRIASEFGSDASDLLKYNQLMYRKKSVKFKRSHIHGW 627
Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
GL A E I A++ VIEYVG+L+R I+D RE Y + GIGSSYLFR+D +++DATK G
Sbjct: 628 GLFAEETIGADEMVIEYVGQLVRSLIADRREVDYTRRGIGSSYLFRIDSDHIIDATKCGN 687
Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSK 929
ARF+NHSCNP+CY KVI+V+G KKI IY+K I +EITY+YKFP+E+ KIPC+CG+
Sbjct: 688 FARFMNHSCNPSCYAKVIAVDGAKKIVIYSKDTIKPTDEITYDYKFPIEDVKIPCFCGAP 747
Query: 930 KCHGSLN 936
C G+LN
Sbjct: 748 NCRGTLN 754
>gi|86129848|gb|ABC86576.1| KIAA0339 protein [Danio rerio]
Length = 406
Score = 212 bits (539), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 102/177 (57%), Positives = 131/177 (74%), Gaps = 7/177 (3%)
Query: 765 LSARTNRVKLRNLLAA-----AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
LS R R + R LL+A ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A
Sbjct: 232 LSER--RSEQRRLLSAIGTPAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAA 289
Query: 820 EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
++ VIEYVG+ IR ++D E RY + GIGSSYLFR+D ++DATK G +ARFINH C
Sbjct: 290 DEMVIEYVGQSIRQMVADNWEKRYAQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCT 349
Query: 880 PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
PNCY KVI++E QKKI IY+K+ I EEITY+YKFP+EE KIPC CG++ C G+LN
Sbjct: 350 PNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPIEENKIPCLCGTESCRGTLN 406
>gi|443726566|gb|ELU13685.1| hypothetical protein CAPTEDRAFT_150651 [Capitella teleta]
Length = 292
Score = 212 bits (539), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 115/251 (45%), Positives = 151/251 (60%), Gaps = 19/251 (7%)
Query: 692 VAMKKTKASKSRTSNLCPRSDGCARSS----ISGWEWHKWSLNASPAERARVRGAQYVHT 747
V KK K + + GCAR+ IS E H + + R A
Sbjct: 55 VPPKKRKKYSQQDEVMVAHKTGCARTEGYYKISHQEKHGYLKDFRGVNERDARDAA---- 110
Query: 748 KYLGPEVNASQWANGKGLSARTNRVKLRNLL--AAAEGAELLKASQVKARKKRLRFQRSK 805
L +V S+ AR+ + +N+L + +LLK +Q+K RKK L+F +S
Sbjct: 111 --LKKKVQISR-------EARSESRRWQNVLQLMGEDIGDLLKFNQLKFRKKNLKFAKSG 161
Query: 806 IHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDAT 865
IHDWGL ALEPI + VIEYVG ++R +D+RE +YE MGIGSSYLFR+D ++DAT
Sbjct: 162 IHDWGLFALEPIAEGEMVIEYVGAVVRQSTADLREKKYEAMGIGSSYLFRIDHDLIIDAT 221
Query: 866 KRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCY 925
K G +ARFINHSCNPNC K+I+VE KKI IY++R I EEITY+YKFPLE++KIPC
Sbjct: 222 KCGNLARFINHSCNPNCVAKIITVESHKKIVIYSRRDIGVNEEITYDYKFPLEDEKIPCL 281
Query: 926 CGSKKCHGSLN 936
CG+ C G+LN
Sbjct: 282 CGTSACRGTLN 292
>gi|441630858|ref|XP_003280765.2| PREDICTED: uncharacterized protein LOC100584028 [Nomascus leucogenys]
Length = 1863
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1680 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1737
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1738 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1797
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1798 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1857
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1858 CRGTLN 1863
>gi|119618696|gb|EAW98290.1| hCG1812756 [Homo sapiens]
Length = 1048
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 865 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 922
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 923 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 982
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 983 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1042
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1043 CRGTLN 1048
>gi|210032580|ref|NP_055863.1| histone-lysine N-methyltransferase SETD1B [Homo sapiens]
gi|166977692|sp|Q9UPS6.2|SET1B_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=Lysine N-methyltransferase 2G; AltName: Full=SET
domain-containing protein 1B; Short=hSET1B
Length = 1923
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1740 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1797
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1798 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1857
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1858 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1917
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1918 CRGTLN 1923
>gi|392352531|ref|XP_003751234.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Rattus
norvegicus]
Length = 1900
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/190 (57%), Positives = 141/190 (74%), Gaps = 5/190 (2%)
Query: 750 LGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKI 806
+G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S I
Sbjct: 1713 MGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHI 1770
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
HDWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK
Sbjct: 1771 HDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATK 1830
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC C
Sbjct: 1831 CGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLC 1890
Query: 927 GSKKCHGSLN 936
GS+ C G+LN
Sbjct: 1891 GSENCRGTLN 1900
>gi|334192482|gb|AEG67286.1| histone-lysine N-methyltransferase [Homo sapiens]
Length = 1966
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1783 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1840
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1841 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1900
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1901 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1960
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1961 CRGTLN 1966
>gi|402887949|ref|XP_003907341.1| PREDICTED: uncharacterized protein LOC101023789 [Papio anubis]
Length = 1927
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1744 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1801
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1802 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1861
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1862 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1921
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1922 CRGTLN 1927
>gi|410047437|ref|XP_003314036.2| PREDICTED: uncharacterized protein LOC473295, partial [Pan
troglodytes]
Length = 1955
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1772 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1829
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1830 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1889
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1890 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1949
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1950 CRGTLN 1955
>gi|403281795|ref|XP_003932362.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Saimiri
boliviensis boliviensis]
Length = 1823
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1640 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1697
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1698 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1757
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1758 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1817
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1818 CRGTLN 1823
>gi|344297409|ref|XP_003420391.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Loxodonta africana]
Length = 1750
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)
Query: 771 RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1582 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1641
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI
Sbjct: 1642 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1701
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1702 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1750
>gi|320168697|gb|EFW45596.1| Setd1a protein [Capsaspora owczarzaki ATCC 30864]
Length = 1312
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 98/172 (56%), Positives = 121/172 (70%), Gaps = 3/172 (1%)
Query: 768 RTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
R NR++LR A E A + +K+RKK L+FQRS IH +GL + E I A D VI
Sbjct: 1141 RANRLQLRQFAAVIEQATNSDTFSLDMLKSRKKLLKFQRSGIHAFGLFSQENISANDLVI 1200
Query: 825 EYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
EYVGE+IR ISDIRE YE+ GIGSSY FR+D+ +VVDAT +G +ARF+NH C PNCY
Sbjct: 1201 EYVGEVIRQSISDIREHHYERRGIGSSYFFRIDEDHVVDATYKGNLARFMNHCCEPNCYA 1260
Query: 885 KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
K+I V+G ++I IY+KR I GEEITY+YKFP EE KIPC CG+ C LN
Sbjct: 1261 KIIMVDGHQRIVIYSKRDIKKGEEITYDYKFPYEENKIPCLCGAVNCKKFLN 1312
>gi|405967140|gb|EKC32340.1| Histone-lysine N-methyltransferase SETD1B-A [Crassostrea gigas]
Length = 1401
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 91/142 (64%), Positives = 113/142 (79%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
RKK+L+F RS IHDWGL ALEPI A++ VIEYVGE +R ++D+RE +YE G GSSYLF
Sbjct: 1260 RKKQLKFARSGIHDWGLFALEPIAADEMVIEYVGETLRQSLADLREKKYEAEGCGSSYLF 1319
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
R+D ++DATK G +ARFINH CNPNCY K+I+VE QKKI IY+KR I EEITY+YK
Sbjct: 1320 RVDTETIIDATKCGNLARFINHCCNPNCYAKIITVESQKKIVIYSKRDIDVNEEITYDYK 1379
Query: 915 FPLEEKKIPCYCGSKKCHGSLN 936
FP+E++KIPC CG+ C G+LN
Sbjct: 1380 FPIEDEKIPCLCGAPNCRGTLN 1401
>gi|301754587|ref|XP_002913168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Ailuropoda melanoleuca]
Length = 1805
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 110/193 (56%), Positives = 141/193 (73%), Gaps = 5/193 (2%)
Query: 747 TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
T G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +
Sbjct: 1615 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1672
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IHDWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++D
Sbjct: 1673 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1732
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
ATK G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIP
Sbjct: 1733 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIP 1792
Query: 924 CYCGSKKCHGSLN 936
C CGS+ C G+LN
Sbjct: 1793 CLCGSENCRGTLN 1805
>gi|392560212|gb|EIW53395.1| hypothetical protein TRAVEDRAFT_154887 [Trametes versicolor FP-101664
SS1]
Length = 1014
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 96/169 (56%), Positives = 125/169 (73%), Gaps = 2/169 (1%)
Query: 770 NRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
N+V+ L+ E A + +K +Q++ RKK LRF RS IHDWGL A+E I + VIEYV
Sbjct: 846 NQVQRAMALSKGEAAAPDSVKYNQLQTRKKHLRFARSPIHDWGLYAMEKISRGEMVIEYV 905
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
GE+IR +++D RE YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC K+I
Sbjct: 906 GEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKII 965
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
++ G+KKI IYAK+ I G EITY+Y FP+E+ KIPC CGS KC G+LN
Sbjct: 966 TISGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGTLN 1014
>gi|5689489|dbj|BAA83028.1| KIAA1076 protein [Homo sapiens]
Length = 804
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 621 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 678
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 679 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 738
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 739 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 798
Query: 931 CHGSLN 936
C G+LN
Sbjct: 799 CRGTLN 804
>gi|149063329|gb|EDM13652.1| rCG21620 [Rattus norvegicus]
Length = 1091
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 908 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 965
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 966 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1025
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1026 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1085
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1086 CRGTLN 1091
>gi|396578140|ref|NP_001035488.2| histone-lysine N-methyltransferase SETD1B [Mus musculus]
gi|166977693|sp|Q8CFT2.2|SET1B_MOUSE RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
Length = 1985
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1802 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1859
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1860 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1919
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1920 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1979
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1980 CRGTLN 1985
>gi|426374487|ref|XP_004054104.1| PREDICTED: uncharacterized protein LOC101124677 [Gorilla gorilla
gorilla]
Length = 1922
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)
Query: 771 RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1754 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1813
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI
Sbjct: 1814 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1873
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1874 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1922
>gi|410976579|ref|XP_003994695.1| PREDICTED: uncharacterized protein LOC101096419 [Felis catus]
Length = 1919
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 110/193 (56%), Positives = 141/193 (73%), Gaps = 5/193 (2%)
Query: 747 TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
T G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +
Sbjct: 1729 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1786
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IHDWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++D
Sbjct: 1787 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1846
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
ATK G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIP
Sbjct: 1847 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIP 1906
Query: 924 CYCGSKKCHGSLN 936
C CGS+ C G+LN
Sbjct: 1907 CLCGSENCRGTLN 1919
>gi|50546869|ref|XP_500904.1| YALI0B14883p [Yarrowia lipolytica]
gi|74689791|sp|Q6CEK8.1|SET1_YARLI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49646770|emb|CAG83155.1| YALI0B14883p [Yarrowia lipolytica CLIB122]
Length = 1170
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 99/181 (54%), Positives = 130/181 (71%), Gaps = 10/181 (5%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R L+A + ++L +Q++ RKK ++F RS IH+WGL A+EPI
Sbjct: 990 NSRVNRANNRRLVADINMQKQLLSTETDVLNFNQLRKRKKPVKFARSAIHNWGLYAIEPI 1049
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A + +IEYVGE++R +I+D+RE RY + GIGSSYLFR+D+ VVDATKRGGIARFINH
Sbjct: 1050 AANEMIIEYVGEVVRQEIADLREARYMRSGIGSSYLFRVDESTVVDATKRGGIARFINHC 1109
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSL 935
C P+C K+I VEGQK+I IYA R IAA EE+TY+YKF E E++IPC CG+ C G L
Sbjct: 1110 CTPSCTAKIIKVEGQKRIVIYASRDIAANEELTYDYKFEKEIGEERIPCLCGAPGCKGYL 1169
Query: 936 N 936
N
Sbjct: 1170 N 1170
>gi|390351134|ref|XP_003727587.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Strongylocentrotus purpuratus]
Length = 282
Score = 210 bits (534), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 96/175 (54%), Positives = 128/175 (73%), Gaps = 5/175 (2%)
Query: 767 ARTNRVKLRNLLAAAEGAE-----LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
+R R R +A + E L K +Q++ RKK ++F +S IH WGL A+EPI A++
Sbjct: 108 SRDARSDQRRFIAVLQQQEEQVSNLFKFNQLQFRKKDIKFCKSSIHGWGLYAMEPIAADE 167
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
VIEYVGE +R I+D RE YE+MGIGSSYLFR+D ++DATK G +ARFINHSCNPN
Sbjct: 168 MVIEYVGESVRQSIADSREKAYERMGIGSSYLFRIDAVTIIDATKSGNLARFINHSCNPN 227
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY K+I+VE +KKI IY+K+ I G+EITY+YKFP+E++KI C CG+ +C G+LN
Sbjct: 228 CYAKIITVESEKKIVIYSKQTINVGDEITYDYKFPIEDEKISCLCGAAQCRGTLN 282
>gi|355786615|gb|EHH66798.1| hypothetical protein EGM_03852, partial [Macaca fascicularis]
Length = 673
Score = 210 bits (534), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 490 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 547
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 548 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 607
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 608 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 667
Query: 931 CHGSLN 936
C G+LN
Sbjct: 668 CRGTLN 673
>gi|392332670|ref|XP_003752655.1| PREDICTED: uncharacterized protein LOC100359816 [Rattus norvegicus]
Length = 2265
Score = 210 bits (534), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)
Query: 771 RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 2097 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 2156
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI
Sbjct: 2157 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 2216
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 2217 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 2265
>gi|351698529|gb|EHB01448.1| Histone-lysine N-methyltransferase SETD1B [Heterocephalus glaber]
Length = 1486
Score = 210 bits (534), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/189 (57%), Positives = 140/189 (74%), Gaps = 5/189 (2%)
Query: 751 GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IH
Sbjct: 1300 GMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIH 1357
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
DWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK
Sbjct: 1358 DWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 1417
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CG
Sbjct: 1418 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCG 1477
Query: 928 SKKCHGSLN 936
S+ C G+LN
Sbjct: 1478 SENCRGTLN 1486
>gi|297263735|ref|XP_002808043.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Macaca mulatta]
Length = 2216
Score = 210 bits (534), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)
Query: 771 RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 2048 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 2107
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI
Sbjct: 2108 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 2167
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 2168 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 2216
>gi|440470515|gb|ELQ39582.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
gi|440488496|gb|ELQ68221.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
Length = 1278
Score = 210 bits (534), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/242 (46%), Positives = 154/242 (63%), Gaps = 15/242 (6%)
Query: 708 CPRSDGCARSS----ISGWEWHKWSLNASPAERARV-RGAQYVHTKYLGPEVNASQWANG 762
P + GCAR+ I E K+ + ++AR R AQ G + A + A+
Sbjct: 1039 VPNATGCARTEGVKKILNSEKSKYLPHHIKVKKAREERQAQAKAAP--GKSLTAVKVADD 1096
Query: 763 KGLS---ARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
+S +R NR R +A + ++ L+ +Q+K RKK ++F+RS IH+WGL A+E I
Sbjct: 1097 NSISRGSSRANRANHRRAMAEMNDQKSDALRFNQLKKRKKPVKFERSAIHNWGLYAMEHI 1156
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R ++ +RE RY++ GIGSSYLFR+D+ V+DATK+GGIARFINHS
Sbjct: 1157 PKDDMIIEYVGEEVRPSVAQVREARYDRSGIGSSYLFRIDEDAVIDATKKGGIARFINHS 1216
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF LEEK ++PC CG+ C G
Sbjct: 1217 CMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKEEDRVPCLCGTTNCKGF 1276
Query: 935 LN 936
LN
Sbjct: 1277 LN 1278
>gi|389634753|ref|XP_003715029.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
gi|351647362|gb|EHA55222.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
Length = 1278
Score = 209 bits (533), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/242 (46%), Positives = 154/242 (63%), Gaps = 15/242 (6%)
Query: 708 CPRSDGCARSS----ISGWEWHKWSLNASPAERARV-RGAQYVHTKYLGPEVNASQWANG 762
P + GCAR+ I E K+ + ++AR R AQ G + A + A+
Sbjct: 1039 VPNATGCARTEGVKKILNSEKSKYLPHHIKVKKAREERQAQAKAAP--GKSLTAVKVADD 1096
Query: 763 KGLS---ARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
+S +R NR R +A + ++ L+ +Q+K RKK ++F+RS IH+WGL A+E I
Sbjct: 1097 NSISRGSSRANRANHRRAMAEMNDQKSDALRFNQLKKRKKPVKFERSAIHNWGLYAMEHI 1156
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R ++ +RE RY++ GIGSSYLFR+D+ V+DATK+GGIARFINHS
Sbjct: 1157 PKDDMIIEYVGEEVRPSVAQVREARYDRSGIGSSYLFRIDEDAVIDATKKGGIARFINHS 1216
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF LEEK ++PC CG+ C G
Sbjct: 1217 CMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKEEDRVPCLCGTTNCKGF 1276
Query: 935 LN 936
LN
Sbjct: 1277 LN 1278
>gi|345321023|ref|XP_001506028.2| PREDICTED: hypothetical protein LOC100074411 [Ornithorhynchus
anatinus]
Length = 1258
Score = 209 bits (533), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)
Query: 771 RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1090 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1149
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI
Sbjct: 1150 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1209
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1210 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1258
>gi|355564772|gb|EHH21272.1| hypothetical protein EGK_04290, partial [Macaca mulatta]
Length = 663
Score = 209 bits (533), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 480 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 537
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 538 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 597
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 598 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 657
Query: 931 CHGSLN 936
C G+LN
Sbjct: 658 CRGTLN 663
>gi|211830050|gb|AAH38367.2| Setd1b protein [Mus musculus]
Length = 1103
Score = 209 bits (533), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 920 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 977
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 978 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1037
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1038 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1097
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1098 CRGTLN 1103
>gi|28972602|dbj|BAC65717.1| mKIAA1076 protein [Mus musculus]
Length = 855
Score = 209 bits (533), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 672 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 729
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 730 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 789
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 790 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 849
Query: 931 CHGSLN 936
C G+LN
Sbjct: 850 CRGTLN 855
>gi|27371314|gb|AAH41681.1| Setd1b protein, partial [Mus musculus]
Length = 917
Score = 209 bits (533), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 734 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 791
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 792 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 851
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 852 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 911
Query: 931 CHGSLN 936
C G+LN
Sbjct: 912 CRGTLN 917
>gi|26251880|gb|AAH40775.1| Setd1b protein, partial [Mus musculus]
Length = 911
Score = 209 bits (533), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 728 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 785
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 786 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 845
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 846 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 905
Query: 931 CHGSLN 936
C G+LN
Sbjct: 906 CRGTLN 911
>gi|431912177|gb|ELK14315.1| Histone-lysine N-methyltransferase SETD1B [Pteropus alecto]
Length = 245
Score = 209 bits (532), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 110/203 (54%), Positives = 146/203 (71%), Gaps = 3/203 (1%)
Query: 737 ARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVK 793
AR G + K +N+S+ + + + R + R LL++ G+ +LLK +Q+K
Sbjct: 43 ARSEGFYTIDKKDKLRYLNSSRASTDEPPADTQRRSEQRRLLSSFTGSCDSDLLKFNQLK 102
Query: 794 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
RKK+L+F +S IHDWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+
Sbjct: 103 FRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYM 162
Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
FR+D ++DATK G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI+ EEITY+Y
Sbjct: 163 FRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDY 222
Query: 914 KFPLEEKKIPCYCGSKKCHGSLN 936
KFP+E+ KIPC CGS+ C G+LN
Sbjct: 223 KFPIEDVKIPCLCGSENCRGTLN 245
>gi|296422581|ref|XP_002840838.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637063|emb|CAZ85029.1| unnamed protein product [Tuber melanosporum]
Length = 1200
Score = 209 bits (532), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 104/199 (52%), Positives = 134/199 (67%), Gaps = 14/199 (7%)
Query: 752 PEVNASQWAN---GKGLSARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLR 800
PEV A+ N ++R NRV R L+ + A++++ +Q+K RKK ++
Sbjct: 1002 PEVTAAAVTNPTKNPSSTSRMNRVNNRRLVVGLNNQKQMLSSDADVMRFNQLKKRKKPVK 1061
Query: 801 FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGY 860
F RS IH+WGL A+E I A D +IEYVGE+IR +++D+RE +Y K GIGSSYLFR+DD
Sbjct: 1062 FARSAIHNWGLYAMENISAGDMIIEYVGEIIRQQVADMREKKYLKSGIGSSYLFRIDDTT 1121
Query: 861 VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE-- 918
V+DATK GGIARFINHSC PNC K+I VEG K+I IYA R I EE+TY+YKF E
Sbjct: 1122 VIDATKAGGIARFINHSCTPNCTAKIIKVEGSKRIVIYALRDIRENEELTYDYKFERELE 1181
Query: 919 -EKKIPCYCGSKKCHGSLN 936
E++IPC CGS C G LN
Sbjct: 1182 SEERIPCLCGSSGCKGFLN 1200
>gi|395513793|ref|XP_003761107.1| PREDICTED: uncharacterized protein LOC100928096 [Sarcophilus
harrisii]
Length = 1224
Score = 209 bits (531), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)
Query: 771 RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1056 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1115
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI
Sbjct: 1116 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1175
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1176 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1224
>gi|345791349|ref|XP_543382.3| PREDICTED: histone-lysine N-methyltransferase SETD1B [Canis lupus
familiaris]
Length = 1920
Score = 209 bits (531), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 107/186 (57%), Positives = 138/186 (74%), Gaps = 5/186 (2%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
+ A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1737 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1794
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK G
Sbjct: 1795 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1854
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
ARFINHSCNPNCY KVI+VE QKKI IY+ +HI EEITY+YKFP+E+ KIPC CGS+
Sbjct: 1855 ARFINHSCNPNCYAKVITVESQKKIVIYSNQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1914
Query: 931 CHGSLN 936
C G+LN
Sbjct: 1915 CRGTLN 1920
>gi|156393989|ref|XP_001636609.1| predicted protein [Nematostella vectensis]
gi|156223714|gb|EDO44546.1| predicted protein [Nematostella vectensis]
Length = 213
Score = 209 bits (531), Expect = 7e-51, Method: Composition-based stats.
Identities = 99/173 (57%), Positives = 129/173 (74%), Gaps = 3/173 (1%)
Query: 767 ARTNRVKLRNLLAAA---EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
+R NR R L A + ++LK ++++ RKK+L+F +S IHDWGL ALEPI A++ V
Sbjct: 41 SRENRAMQRRLQLAVCNEDFGDILKFNKLRVRKKQLKFAKSSIHDWGLFALEPIAADEMV 100
Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
IEYVGE+IR I+D RE YE+ GIGSSY+FRLD+ ++DAT G ARFINH C+PNCY
Sbjct: 101 IEYVGEVIRQAIADYRERCYEERGIGSSYMFRLDETTIIDATTMGNFARFINHCCDPNCY 160
Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
KVI+VE KKI IY+KR I EEITY+YKFP+E++KIPC CG+ +C G+LN
Sbjct: 161 AKVIAVENMKKIVIYSKRDIQVDEEITYDYKFPIEDEKIPCLCGAPQCRGTLN 213
>gi|336372757|gb|EGO01096.1| hypothetical protein SERLA73DRAFT_50848 [Serpula lacrymans var.
lacrymans S7.3]
Length = 260
Score = 209 bits (531), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 96/155 (61%), Positives = 119/155 (76%), Gaps = 1/155 (0%)
Query: 783 GAEL-LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIREC 841
AEL LK +Q++ RKK LRF RS IHDWGL A+E I + VIEYVGE+IR +++D RE
Sbjct: 106 AAELNLKFNQLQTRKKHLRFARSPIHDWGLYAMERISRGEMVIEYVGEVIRAQVADKREK 165
Query: 842 RYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKR 901
YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC K+I++ G+KKI IYAK+
Sbjct: 166 VYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQ 225
Query: 902 HIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I GEEITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 226 DIELGEEITYDYHFPIEQDKIPCLCGSAKCRGYLN 260
>gi|449666506|ref|XP_002161122.2| PREDICTED: uncharacterized protein LOC100198749 [Hydra
magnipapillata]
Length = 1403
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 98/175 (56%), Positives = 127/175 (72%), Gaps = 1/175 (0%)
Query: 763 KGLSARTNRVKLRNLLAAAEGAELLK-ASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
K R + +L + +A E + L+ +++ RKK LRF +S IH+WGL A EPI A++
Sbjct: 1229 KSRQKRAVQRRLASSIAQEEFSSLINYNNKLMRRKKALRFSKSSIHNWGLFACEPINADE 1288
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
V EYVG+++R +++IRE RYEK GIGSSYLFRLD V+DATK G ARFINH C+PN
Sbjct: 1289 MVCEYVGQMVRSIVAEIRERRYEKQGIGSSYLFRLDSDSVIDATKDGCNARFINHCCDPN 1348
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY KVI VEG KKI IY++R I GEEITY+YKFP+E++KIPC CG+ C G+LN
Sbjct: 1349 CYAKVILVEGAKKIVIYSRRAIKLGEEITYDYKFPIEDEKIPCLCGAALCRGTLN 1403
>gi|291001085|ref|XP_002683109.1| predicted protein [Naegleria gruberi]
gi|284096738|gb|EFC50365.1| predicted protein [Naegleria gruberi]
Length = 147
Score = 208 bits (529), Expect = 1e-50, Method: Composition-based stats.
Identities = 96/147 (65%), Positives = 117/147 (79%)
Query: 790 SQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
+Q+ RKK L+F +S IHDWGL ALE IE + VIEYVGE+IR +SDIRE RYE++GIG
Sbjct: 1 NQLDTRKKNLKFAKSPIHDWGLFALEDIEQGEMVIEYVGEVIREALSDIREKRYEQIGIG 60
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
SSYLFRLD+ ++DATKRG +ARFINHSC+PNC ++I V+ QKK+ IYA R I GEEI
Sbjct: 61 SSYLFRLDNDLIIDATKRGNLARFINHSCDPNCCARIIEVDKQKKVCIYALRKILVGEEI 120
Query: 910 TYNYKFPLEEKKIPCYCGSKKCHGSLN 936
TY+YKFP+EE KIPC CGS+KC LN
Sbjct: 121 TYDYKFPIEESKIPCKCGSQKCKKWLN 147
>gi|432094921|gb|ELK26329.1| Histone-lysine N-methyltransferase SETD1B [Myotis davidii]
Length = 1462
Score = 208 bits (529), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 109/193 (56%), Positives = 140/193 (72%), Gaps = 5/193 (2%)
Query: 747 TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
T G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +
Sbjct: 1272 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1329
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IHDWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++D
Sbjct: 1330 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1389
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
ATK G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIP
Sbjct: 1390 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIP 1449
Query: 924 CYCGSKKCHGSLN 936
C C S+ C G+LN
Sbjct: 1450 CLCNSENCRGTLN 1462
>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 977
Score = 207 bits (528), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 142/393 (36%), Positives = 201/393 (51%), Gaps = 62/393 (15%)
Query: 563 TAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVPKPSGEMLSTLSAD 622
T S ++ + +V++K+ + + KSTIS+ ++ +PV + +T S D
Sbjct: 628 TEVSESALKAERVETKIEATTVTIKSTISESIVPA--------IPVKQ------NTWSED 673
Query: 623 GNDVGKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKG 682
G V G N E + +LD + + D +L + + VA
Sbjct: 674 GF----VFHGTLPNTYPEDEVLLDLDGFQ-----------QIVKDDEDLQYLREV-VAAE 717
Query: 683 TAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGA 742
+K+ + K SK R + P + + E+HK + S AR G
Sbjct: 718 ESKELGDTNYWAYRRKESKIRNGDAEPSNLVLP---FNKNEYHKPNSTGS----ARTEGY 770
Query: 743 QYVHTKYLGPEVNASQW-----------ANGKG--LSARTNRVKLRNLLAAAEGA----- 784
Y+ P SQ+ AN +++R NRV R L A E +
Sbjct: 771 ------YMVPAAEKSQYLPLRNRVSLEKANTSSSRITSRMNRVNNRRLAAGVEKSNLNSE 824
Query: 785 -ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
+LL+ + +KARKK+LRF S+IH GL A+E I+ D VIEYVGE++R +++D RE +Y
Sbjct: 825 TDLLRFNALKARKKQLRFGPSRIHTLGLFAMENIDKNDMVIEYVGEIVRQRVADTRERKY 884
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
+ GIG SYLFR+D +VDATK+G IARFINHSC PNC K+I VEG +KI IYA R I
Sbjct: 885 VREGIGDSYLFRIDKDAIVDATKKGNIARFINHSCAPNCIAKIIRVEGHQKIVIYADRDI 944
Query: 904 AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
GEE+TY+YKFP E KIPC CG+ C G LN
Sbjct: 945 EEGEELTYDYKFPEEVDKIPCLCGAPTCRGYLN 977
>gi|402221447|gb|EJU01516.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 164
Score = 207 bits (528), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 96/158 (60%), Positives = 119/158 (75%)
Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
AA +EL +Q++ARKK L+F RS IHDWGL ALE I A D VIEYVGE++R +++D
Sbjct: 7 GAAPSSELFAFNQLRARKKELKFARSPIHDWGLYALEYIPAGDMVIEYVGEVVRQQVADK 66
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
RE YE+ GIGSSYLFR+DD VVDAT +G I R INHSC+PNC ++I++ KKI IY
Sbjct: 67 REKVYERQGIGSSYLFRIDDDLVVDATMKGNIGRLINHSCSPNCTARIITINSSKKIVIY 126
Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
AK I GEEITY+Y FP+E++KIPC CGS+KC G LN
Sbjct: 127 AKTPIEPGEEITYDYHFPIEQEKIPCLCGSEKCRGFLN 164
>gi|444724926|gb|ELW65512.1| Histone-lysine N-methyltransferase SETD1B [Tupaia chinensis]
Length = 1554
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 99/153 (64%), Positives = 125/153 (81%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYVG+ IR I+D+RE RY
Sbjct: 1402 SDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRY 1461
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI
Sbjct: 1462 EDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHI 1521
Query: 904 AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+ EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1522 SVNEEITYDYKFPIEDIKIPCLCGSENCRGTLN 1554
>gi|449477606|ref|XP_002188016.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
[Taeniopygia guttata]
Length = 228
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/189 (57%), Positives = 140/189 (74%), Gaps = 5/189 (2%)
Query: 751 GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IH
Sbjct: 42 GMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIH 99
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
DWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK
Sbjct: 100 DWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 159
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CG
Sbjct: 160 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCG 219
Query: 928 SKKCHGSLN 936
S+ C G+LN
Sbjct: 220 SENCRGTLN 228
>gi|299746032|ref|XP_002910994.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
gi|298406870|gb|EFI27500.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
Length = 1614
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 99/171 (57%), Positives = 125/171 (73%), Gaps = 4/171 (2%)
Query: 770 NRVKLRNLLAAAEGA--EL-LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
N+V+ L+ E A EL K +Q++ RKK LRF RS IHDWGL A+E I + VIEY
Sbjct: 1444 NQVQRAVALSKGETAASELTFKFNQLQTRKKHLRFSRSPIHDWGLYAMEKISKGEMVIEY 1503
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VGE+IR +++D RE YEK GIGSSYLFR+D+ +VVDATK+G + R INHSC+PNC K+
Sbjct: 1504 VGEVIRAQVADKREKTYEKQGIGSSYLFRIDEEFVVDATKKGNLGRLINHSCDPNCTAKI 1563
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE-KKIPCYCGSKKCHGSLN 936
I++ G KKI IYAK+ I GEEITY+Y FP+E+ KIPC CGS +C G LN
Sbjct: 1564 ITISGVKKIVIYAKQDIELGEEITYDYHFPIEQDNKIPCLCGSARCRGYLN 1614
>gi|19075312|ref|NP_587812.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe
972h-]
gi|74698592|sp|Q9Y7R4.1|SET1_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component set1; AltName:
Full=Lysine N-methyltransferase 2; AltName: Full=SET
domain-containing protein 1; AltName: Full=Set1 complex
component set1; Short=Set1C component set1; AltName:
Full=Spset1
gi|4704279|emb|CAB41652.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe]
Length = 920
Score = 206 bits (523), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 97/178 (54%), Positives = 125/178 (70%), Gaps = 6/178 (3%)
Query: 765 LSARTNRVKLRNLLAAAEG------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIE 818
+++R NRV R L A E A+LL+ + +KARKK+L F S+IH GL A+E I+
Sbjct: 743 ITSRMNRVNNRRLAAGVEKSQLPAEADLLRFNALKARKKQLHFGPSRIHTLGLFAMENID 802
Query: 819 AEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
D VIEY+GE+IR +++D RE Y + GIG SYLFR+D+ +VDATK+G IARFINHSC
Sbjct: 803 KNDMVIEYIGEIIRQRVADNREKNYVREGIGDSYLFRIDEDVIVDATKKGNIARFINHSC 862
Query: 879 NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
PNC ++I VEG++KI IYA R I GEE+TY+YKFP E KIPC CG+ C G LN
Sbjct: 863 APNCIARIIRVEGKRKIVIYADRDIMHGEELTYDYKFPEEADKIPCLCGAPTCRGYLN 920
>gi|328858772|gb|EGG07883.1| hypothetical protein MELLADRAFT_74594 [Melampsora larici-populina
98AG31]
Length = 191
Score = 206 bits (523), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 92/153 (60%), Positives = 117/153 (76%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
A++LK +Q++ RKK+L+F RS IHDWGL A+E I + VIEYVGE+IR ++D RE Y
Sbjct: 39 ADVLKFNQLRTRKKQLKFARSPIHDWGLYAMEMIPVGEMVIEYVGEVIRQAVADRREKAY 98
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E+MGIGSSYLFR+DD VVDATK+G + R INH C PNC K+I++ G+KKI IYAK I
Sbjct: 99 ERMGIGSSYLFRVDDDLVVDATKKGNLGRLINHCCAPNCTAKIITINGEKKIVIYAKATI 158
Query: 904 AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
G+E+TY+Y FP E+ KIPC CGS KC G+LN
Sbjct: 159 ELGDEVTYDYHFPKEDVKIPCLCGSSKCKGTLN 191
>gi|355718741|gb|AES06369.1| SET domain containing 1B [Mustela putorius furo]
Length = 359
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 109/192 (56%), Positives = 140/192 (72%), Gaps = 5/192 (2%)
Query: 747 TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
T G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +
Sbjct: 170 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 227
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IHDWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++D
Sbjct: 228 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 287
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
ATK G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIP
Sbjct: 288 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIP 347
Query: 924 CYCGSKKCHGSL 935
C CGS+ C G+L
Sbjct: 348 CLCGSENCRGTL 359
>gi|320593249|gb|EFX05658.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 1450
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 99/188 (52%), Positives = 127/188 (67%), Gaps = 12/188 (6%)
Query: 760 ANGKGLSARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGL 811
A G+G S R NRV R +A + A+ L+ +Q+K RKK ++F RS IH+WGL
Sbjct: 1264 ATGRGTS-RANRVNQRRYVADLNDQKRTLGQDADALRFNQLKKRKKTVKFARSAIHNWGL 1322
Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
A+E I +D +IEYVG+ +R ++ IRE Y + GIGSSYLFR+DDG V+DATK+GGIA
Sbjct: 1323 YAMESIPKDDMIIEYVGQEVRQSVATIREKAYIRAGIGSSYLFRIDDGTVIDATKKGGIA 1382
Query: 872 RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGS 928
RFINHSC PNC K+I VEG K+I IYA R I EE+TY+YKF E E ++PC CG+
Sbjct: 1383 RFINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEPEDNPEDRVPCLCGT 1442
Query: 929 KKCHGSLN 936
C G LN
Sbjct: 1443 TACKGFLN 1450
>gi|296805347|ref|XP_002843498.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238844800|gb|EEQ34462.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 1344
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 102/189 (53%), Positives = 129/189 (68%), Gaps = 16/189 (8%)
Query: 763 KGLSARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWG 810
K S+R+NRV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WG
Sbjct: 1157 KSTSSRSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWG 1215
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK GGI
Sbjct: 1216 LYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGI 1275
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCG 927
ARFINHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CG
Sbjct: 1276 ARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCG 1335
Query: 928 SKKCHGSLN 936
S C G LN
Sbjct: 1336 STGCKGFLN 1344
>gi|302682536|ref|XP_003030949.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
gi|300104641|gb|EFI96046.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
Length = 171
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 93/156 (59%), Positives = 116/156 (74%)
Query: 781 AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE 840
A G K +Q++ RKK LRF RS IHDWGL A+E I + VIEYVGE+IR +++D RE
Sbjct: 16 AAGELSFKFNQLQTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQVADKRE 75
Query: 841 CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC K+I++ G+KKI IYAK
Sbjct: 76 ATYERQGIGSSYLFRIDEEIVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAK 135
Query: 901 RHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
R I G+EITY+Y FP E+ KIPC CG+ KC G LN
Sbjct: 136 RDIELGDEITYDYHFPFEQDKIPCLCGTAKCRGFLN 171
>gi|350294046|gb|EGZ75131.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Neurospora
tetrasperma FGSC 2509]
Length = 1313
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NRV R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1132 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1191
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1192 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1251
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1252 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1311
Query: 935 LN 936
LN
Sbjct: 1312 LN 1313
>gi|336472713|gb|EGO60873.1| hypothetical protein NEUTE1DRAFT_144212 [Neurospora tetrasperma FGSC
2508]
Length = 1282
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NRV R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1101 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1160
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1161 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1220
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1221 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1280
Query: 935 LN 936
LN
Sbjct: 1281 LN 1282
>gi|74697791|sp|Q8X0S9.1|SET1_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|18376303|emb|CAD21415.1| related to regulatory protein SET1 [Neurospora crassa]
Length = 1313
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NRV R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1132 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1191
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1192 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1251
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1252 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1311
Query: 935 LN 936
LN
Sbjct: 1312 LN 1313
>gi|336259450|ref|XP_003344526.1| hypothetical protein SMAC_07534 [Sordaria macrospora k-hell]
gi|380093240|emb|CCC08898.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1314
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NRV R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1133 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1192
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1193 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1252
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1253 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1312
Query: 935 LN 936
LN
Sbjct: 1313 LN 1314
>gi|353243391|emb|CCA74938.1| related to regulatory protein SET1 [Piriformospora indica DSM 11827]
Length = 1224
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 94/161 (58%), Positives = 121/161 (75%), Gaps = 1/161 (0%)
Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
+ LAAA +K +Q+++RKK+L+F RS IHDWGL ALE I + VIEYVGE+IR ++
Sbjct: 1065 DTLAAAPEV-TIKFNQLQSRKKQLQFARSPIHDWGLYALERIPKGEMVIEYVGEVIRQQV 1123
Query: 836 SDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKI 895
++ RE YE+ GIGSSYLFR+DD VVDATK G + R INHSC+PNC K+I++ GQKKI
Sbjct: 1124 AEKRERAYERSGIGSSYLFRIDDDLVVDATKIGNLGRLINHSCDPNCTAKIITIGGQKKI 1183
Query: 896 FIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
IYAK I G+E+TY+Y FP+E +KIPC CG+ KC G LN
Sbjct: 1184 VIYAKVDIHPGDEVTYDYHFPIENEKIPCLCGAAKCRGFLN 1224
>gi|389746109|gb|EIM87289.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 191
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 102/185 (55%), Positives = 131/185 (70%), Gaps = 6/185 (3%)
Query: 755 NASQWANGKGLSARTNRVKLRNLLAAAE--GAEL-LKASQVKARKKRLRFQRSKIHDWGL 811
NA + A G N+V+ L+ E AEL +K +Q++ RKK LRF RS IHDWGL
Sbjct: 10 NARRQAQG---LEEINQVQRAMALSKGETAAAELTIKFNQLQTRKKHLRFARSPIHDWGL 66
Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
A+E I + VIEYVGE+IR +I+D RE YE+ GIGSSYLFR+D+ VVDATK+G +
Sbjct: 67 YAMERIARGEMVIEYVGEVIRAQIADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLG 126
Query: 872 RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKC 931
R INHSC+PNC K+I++ G+KKI IYAK+ I G+EITY+Y FP+E+ KIPC CGS +C
Sbjct: 127 RLINHSCDPNCTAKIITILGEKKIVIYAKQDIELGDEITYDYHFPIEQDKIPCLCGSARC 186
Query: 932 HGSLN 936
G LN
Sbjct: 187 RGYLN 191
>gi|345567899|gb|EGX50801.1| hypothetical protein AOL_s00054g887 [Arthrobotrys oligospora ATCC
24927]
Length = 1338
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 98/187 (52%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 753 EVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLV 812
+ + S AN + L+A N K LAA + +++ +Q+K RKK +RF RS IH+WGL
Sbjct: 1154 DTSRSNRANNRRLAAEVNLQK--QTLAAVDVHDVINFNQLKKRKKPVRFARSAIHNWGLY 1211
Query: 813 ALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIAR 872
A+E I + +IEYVGE++R +++D+RE Y + GIGSSYLFR+D+ V+DATK+GGIAR
Sbjct: 1212 AMENISNNEMIIEYVGEIVRQQVADLREKNYLRSGIGSSYLFRIDETTVIDATKKGGIAR 1271
Query: 873 FINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSK 929
FINHSC PNC K+I VEG K+I IYA R I EE+TY+YKF E E++IPC CGS
Sbjct: 1272 FINHSCTPNCTAKIIKVEGTKRIVIYALRDIHKDEELTYDYKFEREIDSEERIPCLCGSS 1331
Query: 930 KCHGSLN 936
C G LN
Sbjct: 1332 GCKGFLN 1338
>gi|408391029|gb|EKJ70413.1| hypothetical protein FPSE_09407 [Fusarium pseudograminearum CS3096]
Length = 1263
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 98/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1082 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1141
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1142 AKDDMIIEYVGEQVRQQISEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1201
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1202 CMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1261
Query: 935 LN 936
LN
Sbjct: 1262 LN 1263
>gi|410516926|sp|Q4I5R3.2|SET1_GIBZE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
Length = 1263
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 98/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1082 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1141
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1142 AKDDMIIEYVGEQVRQQISEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1201
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1202 CMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1261
Query: 935 LN 936
LN
Sbjct: 1262 LN 1263
>gi|164426120|ref|XP_961572.2| hypothetical protein NCU01206 [Neurospora crassa OR74A]
gi|157071206|gb|EAA32336.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1150
Score = 204 bits (518), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NRV R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 969 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1028
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1029 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1088
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1089 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1148
Query: 935 LN 936
LN
Sbjct: 1149 LN 1150
>gi|170591502|ref|XP_001900509.1| SET domain containing protein [Brugia malayi]
gi|158592121|gb|EDP30723.1| SET domain containing protein [Brugia malayi]
Length = 1056
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 146/234 (62%), Gaps = 16/234 (6%)
Query: 707 LCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGL- 765
+ P S GCAR+ ++K S +R +R T E+N + L
Sbjct: 835 VVPHSSGCARTE----GYYKLSYKE---KRGVLRRPDVFLT-----EINERDDEKARYLV 882
Query: 766 -SARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
S R R R LL + +++ K +Q+K RKK ++F RS+IH WGL ALE I ++
Sbjct: 883 QSTREARSMNRRLLTSMGDTSSDIFKVNQLKFRKKLIKFARSRIHGWGLYALEAIAPDEM 942
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
++EY+G+ IR ++D RE RYE+ G+GSSYLFR+D V+DAT+ G +ARFINHSC PNC
Sbjct: 943 IVEYIGQKIRPTVADEREKRYERRGMGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNC 1002
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y K++ V+G+K+I IY+K I G+EITY+YKFP+EE KI C CG+ C GSLN
Sbjct: 1003 YAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 1056
>gi|390605099|gb|EIN14490.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 164
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 94/155 (60%), Positives = 119/155 (76%), Gaps = 1/155 (0%)
Query: 783 GAEL-LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIREC 841
AEL +K +Q++ RKK LRF RS IHDWGL A+E I + VIEYVGE+IR +++D RE
Sbjct: 10 AAELTIKFNQLQTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQVADKREK 69
Query: 842 RYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKR 901
YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC K+I++ G+KKI IYAK+
Sbjct: 70 AYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQ 129
Query: 902 HIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I G+EITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 130 DIELGDEITYDYHFPIEQDKIPCLCGSAKCRGYLN 164
>gi|17552320|ref|NP_498039.1| Protein SET-2, isoform c [Caenorhabditis elegans]
gi|351058302|emb|CCD65736.1| Protein SET-2, isoform c [Caenorhabditis elegans]
Length = 1510
Score = 203 bits (517), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 91/174 (52%), Positives = 127/174 (72%), Gaps = 2/174 (1%)
Query: 765 LSARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
L+++ R+ R LL + A + K +Q+K RKK ++F RS+IH WGL A+E I ++
Sbjct: 1337 LASKDMRLLQRRLLTSLGDANNDFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEM 1396
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
++EY+G+ IR +++ RE YE+ GIGSSYLFR+D +V+DATKRG ARFINHSC PNC
Sbjct: 1397 IVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNC 1456
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y KV+++EG+K+I IY++ I GEEITY+YKFP+E+ KI C CG+K C G LN
Sbjct: 1457 YAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1510
>gi|17552318|ref|NP_498040.1| Protein SET-2, isoform a [Caenorhabditis elegans]
gi|30173238|sp|Q18221.2|SET2_CAEEL RecName: Full=Probable histone-lysine N-methyltransferase set-2;
AltName: Full=SET domain-containing protein 2
gi|351058300|emb|CCD65734.1| Protein SET-2, isoform a [Caenorhabditis elegans]
Length = 1507
Score = 203 bits (517), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 91/174 (52%), Positives = 127/174 (72%), Gaps = 2/174 (1%)
Query: 765 LSARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
L+++ R+ R LL + A + K +Q+K RKK ++F RS+IH WGL A+E I ++
Sbjct: 1334 LASKDMRLLQRRLLTSLGDANNDFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEM 1393
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
++EY+G+ IR +++ RE YE+ GIGSSYLFR+D +V+DATKRG ARFINHSC PNC
Sbjct: 1394 IVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNC 1453
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y KV+++EG+K+I IY++ I GEEITY+YKFP+E+ KI C CG+K C G LN
Sbjct: 1454 YAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1507
>gi|327304525|ref|XP_003236954.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
gi|326459952|gb|EGD85405.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
Length = 1337
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 101/186 (54%), Positives = 128/186 (68%), Gaps = 16/186 (8%)
Query: 766 SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
S+R+NRV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1153 SSRSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1211
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK GGIARF
Sbjct: 1212 EENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARF 1271
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
INHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS
Sbjct: 1272 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1331
Query: 931 CHGSLN 936
C G LN
Sbjct: 1332 CKGFLN 1337
>gi|302666919|ref|XP_003025054.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
gi|291189136|gb|EFE44443.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
Length = 1376
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 101/186 (54%), Positives = 128/186 (68%), Gaps = 16/186 (8%)
Query: 766 SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
S+R+NRV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1192 SSRSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1250
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK GGIARF
Sbjct: 1251 EENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARF 1310
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
INHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS
Sbjct: 1311 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1370
Query: 931 CHGSLN 936
C G LN
Sbjct: 1371 CKGFLN 1376
>gi|315045626|ref|XP_003172188.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
gi|311342574|gb|EFR01777.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
Length = 1334
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 101/186 (54%), Positives = 128/186 (68%), Gaps = 16/186 (8%)
Query: 766 SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
S+R+NRV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1150 SSRSNRVNNRRLVADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1208
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK GGIARF
Sbjct: 1209 EENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARF 1268
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
INHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS
Sbjct: 1269 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1328
Query: 931 CHGSLN 936
C G LN
Sbjct: 1329 CKGFLN 1334
>gi|400596097|gb|EJP63881.1| histone H3 methyltransferase complex protein [Beauveria bassiana
ARSEF 2860]
Length = 1220
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 117/254 (46%), Positives = 151/254 (59%), Gaps = 30/254 (11%)
Query: 708 CPRSDGCAR---------SSISGWEWHKWSLNASPAER-ARVR-GAQYVHTKYLGPEVNA 756
P GCAR S S + H + ER ARV+ G V T V+A
Sbjct: 972 VPNPTGCARTEGVKKILNSEKSKYLPHHIKVQKVREEREARVKNGKDSVAT-----SVDA 1026
Query: 757 SQWANGKGLS---ARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSK 805
S+ A K ++ +R NR R +A + +++ K +Q+K RKK ++F RS
Sbjct: 1027 SKIAADKLVAKGNSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSA 1086
Query: 806 IHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDAT 865
IH+WGL +E I +D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+D+ V+DAT
Sbjct: 1087 IHNWGLYTMEDIHKDDMIIEYVGEEVRQQISEIRENRYLKSGIGSSYLFRIDENTVIDAT 1146
Query: 866 KRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKI 922
K+GGIARFINHSC PNC K+I VEG K+I IYA R IA EE+TY+YKF E +I
Sbjct: 1147 KKGGIARFINHSCLPNCTAKIIKVEGSKRIVIYALREIAMNEELTYDYKFEREIGSLDRI 1206
Query: 923 PCYCGSKKCHGSLN 936
PC CG+ C G LN
Sbjct: 1207 PCLCGTAACKGFLN 1220
>gi|367037743|ref|XP_003649252.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
NRRL 8126]
gi|346996513|gb|AEO62916.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
NRRL 8126]
Length = 1286
Score = 202 bits (515), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 126/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1105 NSRANRANNRRFVADLNDQRKTLGQDSDVLRFNQLKKRKKPVKFDRSAIHNWGLYAMENI 1164
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1165 PKDDMIIEYVGEEVRQQIAEIREHRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1224
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1225 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLCGTAACKGF 1284
Query: 935 LN 936
LN
Sbjct: 1285 LN 1286
>gi|407921620|gb|EKG14761.1| hypothetical protein MPH_08036 [Macrophomina phaseolina MS6]
Length = 1167
Score = 202 bits (515), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 98/185 (52%), Positives = 130/185 (70%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLL---------AAAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+NR R L+ AA+G++ ++ +Q+K RKK ++F RS IH+WGL A
Sbjct: 983 TSRSNRANNRRLVNEISTQKQALAADGSDNIAIRFNQLKKRKKLVKFDRSAIHNWGLYAQ 1042
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R K++DIRE +Y+K G+GSSYLFR+D+ VVDATK+GGIARFI
Sbjct: 1043 EDISANDMIIEYVGEKVRQKVADIREIKYDKQGVGSSYLFRIDEDSVVDATKKGGIARFI 1102
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC+PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1103 NHSCSPNCTAKIIRVDGTKRIVIYALRDIKTNEELTYDYKFEREIGSDDRIPCLCGSVNC 1162
Query: 932 HGSLN 936
G LN
Sbjct: 1163 KGFLN 1167
>gi|444318419|ref|XP_004179867.1| hypothetical protein TBLA_0C05500 [Tetrapisispora blattae CBS 6284]
gi|387512908|emb|CCH60348.1| hypothetical protein TBLA_0C05500 [Tetrapisispora blattae CBS 6284]
Length = 1216
Score = 202 bits (515), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 95/161 (59%), Positives = 117/161 (72%), Gaps = 3/161 (1%)
Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
A +ELL +Q+ RKK + F RS IH+WGL ALEPI A++ VIEYVGE IR +S++
Sbjct: 1056 AIGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPITAKEMVIEYVGERIRQPVSEM 1115
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
RE RY K GIGSSYLFR+D+ V+DATKRGGIARFINHSC P+C K+I V G K+I IY
Sbjct: 1116 REKRYIKGGIGSSYLFRIDENTVIDATKRGGIARFINHSCEPSCTAKIIKVGGMKRIVIY 1175
Query: 899 AKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
A R I EE+TY+YKF E E+++PCYCG+ C G LN
Sbjct: 1176 ALRDIGLNEELTYDYKFEREIDAEERLPCYCGAPSCKGFLN 1216
>gi|25395700|pir||H88444 protein C26E6.12 [imported] - Caenorhabditis elegans
Length = 1802
Score = 202 bits (515), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 91/174 (52%), Positives = 127/174 (72%), Gaps = 2/174 (1%)
Query: 765 LSARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
L+++ R+ R LL + A + K +Q+K RKK ++F RS+IH WGL A+E I ++
Sbjct: 1629 LASKDMRLLQRRLLTSLGDANNDFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEM 1688
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
++EY+G+ IR +++ RE YE+ GIGSSYLFR+D +V+DATKRG ARFINHSC PNC
Sbjct: 1689 IVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNC 1748
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y KV+++EG+K+I IY++ I GEEITY+YKFP+E+ KI C CG+K C G LN
Sbjct: 1749 YAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1802
>gi|317037780|ref|XP_001399137.2| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
niger CBS 513.88]
Length = 1239
Score = 202 bits (514), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 100/186 (53%), Positives = 128/186 (68%), Gaps = 16/186 (8%)
Query: 766 SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
++R+ RV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1055 TSRSTRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1113
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARF
Sbjct: 1114 EENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARF 1173
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
INHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS
Sbjct: 1174 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1233
Query: 931 CHGSLN 936
C G LN
Sbjct: 1234 CKGFLN 1239
>gi|261201264|ref|XP_002627032.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239592091|gb|EEQ74672.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239611745|gb|EEQ88732.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
Length = 1259
Score = 202 bits (514), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 100/185 (54%), Positives = 127/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLA---------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A +G E +L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1075 TSRSTRVNNRRLIADINAQKQALPMQGGEGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1134
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1135 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1194
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1195 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1254
Query: 932 HGSLN 936
G LN
Sbjct: 1255 KGFLN 1259
>gi|350630881|gb|EHA19253.1| hypothetical protein ASPNIDRAFT_56859 [Aspergillus niger ATCC 1015]
Length = 1101
Score = 202 bits (514), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 100/186 (53%), Positives = 128/186 (68%), Gaps = 16/186 (8%)
Query: 766 SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
++R+ RV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 917 TSRSTRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 975
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARF
Sbjct: 976 EENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARF 1035
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
INHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS
Sbjct: 1036 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1095
Query: 931 CHGSLN 936
C G LN
Sbjct: 1096 CKGFLN 1101
>gi|134084734|emb|CAK43391.1| unnamed protein product [Aspergillus niger]
Length = 1079
Score = 202 bits (514), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 111/261 (42%), Positives = 148/261 (56%), Gaps = 43/261 (16%)
Query: 718 SISGWEWHKWSLNA--SPAERARVRGAQYVHTKYLGPEVNASQWANGKGL---------- 765
++S W W + + A P E VR + Y+ A++ K +
Sbjct: 820 NLSAWAWRQKEIKALNRPEESGPVREETIIPGYYVSNTTGAARTEGRKRILESEKSKYLP 879
Query: 766 ---------------SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKR 798
++R+ RV R L+A + EG ++L+ +Q+K RKK
Sbjct: 880 HRIKVQKAPKTISKSTSRSTRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKP 938
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
+RF RS IH+WGL A E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+
Sbjct: 939 VRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDE 998
Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE 918
V+DATKRGGIARFINHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E
Sbjct: 999 NTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFERE 1058
Query: 919 ---EKKIPCYCGSKKCHGSLN 936
+ +IPC CGS C G LN
Sbjct: 1059 WDSDDRIPCLCGSTGCKGFLN 1079
>gi|358401203|gb|EHK50509.1| hypothetical protein TRIATDRAFT_171650, partial [Trichoderma
atroviride IMI 206040]
Length = 1241
Score = 202 bits (513), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1060 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1119
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1120 NKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1179
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1180 CMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLDRIPCLCGTAACKGF 1239
Query: 935 LN 936
LN
Sbjct: 1240 LN 1241
>gi|367024877|ref|XP_003661723.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
gi|347008991|gb|AEO56478.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
Length = 1260
Score = 202 bits (513), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 96/182 (52%), Positives = 126/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1079 NSRANRANNRRFVADLNDQRKTLGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1138
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1139 PKDDMIIEYVGEEVRQQIAELREHRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1198
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1199 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLCGTAACKGF 1258
Query: 935 LN 936
LN
Sbjct: 1259 LN 1260
>gi|255938628|ref|XP_002560084.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584705|emb|CAP74231.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1202
Score = 202 bits (513), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 100/185 (54%), Positives = 127/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLA----------AAEG-AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A A G ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1018 TSRSTRVNNRRLIADINAQKQALPTANGEGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1077
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1078 ENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1137
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1138 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1197
Query: 932 HGSLN 936
G LN
Sbjct: 1198 KGFLN 1202
>gi|212543321|ref|XP_002151815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066722|gb|EEA20815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 1188
Score = 202 bits (513), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 100/185 (54%), Positives = 127/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLA---------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV+ R L A +G + L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1004 TSRSTRVQNRRLAADINAQKQVLPPQGGDGDALRFNQLKKRKKPVRFARSAIHNWGLYAE 1063
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1064 ENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENAVIDATKRGGIARFI 1123
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I+ EE+TY+YKF E E +IPC CGS C
Sbjct: 1124 NHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDSEDRIPCLCGSAGC 1183
Query: 932 HGSLN 936
G LN
Sbjct: 1184 KGFLN 1188
>gi|358389897|gb|EHK27489.1| hypothetical protein TRIVIDRAFT_34353 [Trichoderma virens Gv29-8]
Length = 1221
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1040 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1099
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1100 NKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1159
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1160 CMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLDRIPCLCGTAACKGF 1219
Query: 935 LN 936
LN
Sbjct: 1220 LN 1221
>gi|17552316|ref|NP_498041.1| Protein SET-2, isoform b [Caenorhabditis elegans]
gi|351058301|emb|CCD65735.1| Protein SET-2, isoform b [Caenorhabditis elegans]
Length = 739
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 91/174 (52%), Positives = 127/174 (72%), Gaps = 2/174 (1%)
Query: 765 LSARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
L+++ R+ R LL + A + K +Q+K RKK ++F RS+IH WGL A+E I ++
Sbjct: 566 LASKDMRLLQRRLLTSLGDANNDFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEM 625
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
++EY+G+ IR +++ RE YE+ GIGSSYLFR+D +V+DATKRG ARFINHSC PNC
Sbjct: 626 IVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNC 685
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y KV+++EG+K+I IY++ I GEEITY+YKFP+E+ KI C CG+K C G LN
Sbjct: 686 YAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 739
>gi|346322948|gb|EGX92546.1| histone-lysine N-methyltransferase [Cordyceps militaris CM01]
Length = 1151
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/251 (45%), Positives = 149/251 (59%), Gaps = 24/251 (9%)
Query: 708 CPRSDGCAR---------SSISGWEWHKWSLNASPAERA-RVR-GAQYVHTKYLGPEVNA 756
P GCAR S S + H + ERA RV+ G V+T ++ A
Sbjct: 903 VPNPTGCARTEGVKKILNSEKSKYLPHHIKVQKVREERAARVKNGKDGVNTTMDASKIAA 962
Query: 757 SQWANGKGLSARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHD 808
+ KG ++R NR R +A + +++ K +Q+K RKK ++F RS IH+
Sbjct: 963 EKMI-AKG-NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHN 1020
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL +E I +D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+D+ V+DATK+G
Sbjct: 1021 WGLYTMEDIHKDDMIIEYVGEEVRQQISEIRENRYLKSGIGSSYLFRIDENTVIDATKKG 1080
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCY 925
GIARFINHSC PNC K+I VEG K+I IYA R I EE+TY+YKF E +IPC
Sbjct: 1081 GIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDITTNEELTYDYKFEREIGSLDRIPCL 1140
Query: 926 CGSKKCHGSLN 936
CG+ C G LN
Sbjct: 1141 CGTAACKGFLN 1151
>gi|340514680|gb|EGR44940.1| predicted protein [Trichoderma reesei QM6a]
Length = 1236
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1055 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1114
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1115 NKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1174
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1175 CMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLDRIPCLCGTAACKGF 1234
Query: 935 LN 936
LN
Sbjct: 1235 LN 1236
>gi|358373521|dbj|GAA90119.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 1239
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1055 TSRSTRVNNRRLIADINAQKQALPTQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1114
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1115 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1174
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1175 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1234
Query: 932 HGSLN 936
G LN
Sbjct: 1235 KGFLN 1239
>gi|326472906|gb|EGD96915.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 1330
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 100/184 (54%), Positives = 126/184 (68%), Gaps = 16/184 (8%)
Query: 768 RTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALE 815
R+NRV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WGL A E
Sbjct: 1148 RSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYAEE 1206
Query: 816 PIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFIN 875
I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK GGIARFIN
Sbjct: 1207 NITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARFIN 1266
Query: 876 HSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCH 932
HSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1267 HSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1326
Query: 933 GSLN 936
G LN
Sbjct: 1327 GFLN 1330
>gi|342887802|gb|EGU87231.1| hypothetical protein FOXB_02213 [Fusarium oxysporum Fo5176]
Length = 1258
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/187 (52%), Positives = 128/187 (68%), Gaps = 8/187 (4%)
Query: 758 QWANGKGLSAR-TNRVKLRNL----LAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLV 812
Q AN R TNR +++L + +++ K +Q+K RKK ++F RS IH+WGL
Sbjct: 1072 QAANDSERQKRATNRRYVQDLNDQKKTLGQDSDIFKFNQLKKRKKPVKFARSAIHNWGLY 1131
Query: 813 ALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIAR 872
A+E I +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD V+DATK+GGIAR
Sbjct: 1132 AMENIAKDDMIIEYVGEQVRQQIAEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIAR 1191
Query: 873 FINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSK 929
FINHSC PNC K+I VEG K+I IYA + IA EE+TY+YKF E +IPC CG+
Sbjct: 1192 FINHSCEPNCTAKIIKVEGSKRIVIYALQDIAMSEELTYDYKFEREIGSLDRIPCLCGTA 1251
Query: 930 KCHGSLN 936
C G LN
Sbjct: 1252 ACKGFLN 1258
>gi|326477398|gb|EGE01408.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS 127.97]
Length = 1331
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 100/184 (54%), Positives = 126/184 (68%), Gaps = 16/184 (8%)
Query: 768 RTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALE 815
R+NRV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WGL A E
Sbjct: 1149 RSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYAEE 1207
Query: 816 PIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFIN 875
I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK GGIARFIN
Sbjct: 1208 NITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARFIN 1267
Query: 876 HSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCH 932
HSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1268 HSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1327
Query: 933 GSLN 936
G LN
Sbjct: 1328 GFLN 1331
>gi|341896007|gb|EGT51942.1| hypothetical protein CAEBREN_26218 [Caenorhabditis brenneri]
Length = 1670
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 91/168 (54%), Positives = 123/168 (73%), Gaps = 2/168 (1%)
Query: 771 RVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG 828
R+ R LL + A + K + +K+RKK ++F RS+IH +GL A+E I ++ +IEY+G
Sbjct: 1503 RLLQRRLLTSLGDANSDFFKINMLKSRKKMIKFARSRIHGYGLYAMETIAQDEMIIEYIG 1562
Query: 829 ELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
+ IR ++D RE YE+ GIGSSYLFR+D+ V+DATKRG ARFINHSC PNCY KV++
Sbjct: 1563 QKIRSLVADEREKAYERRGIGSSYLFRIDEHTVIDATKRGNFARFINHSCQPNCYAKVLT 1622
Query: 889 VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+EG+K+I IY++ I GEEITY+YKFP+EE KI C CG+K C G LN
Sbjct: 1623 IEGEKRIVIYSRSTINKGEEITYDYKFPIEEDKIDCLCGAKTCRGYLN 1670
>gi|225554361|gb|EEH02660.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
Length = 1267
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1083 TSRSTRVNNRRLIADINAQKQALPMQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1142
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1143 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1202
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1203 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1262
Query: 932 HGSLN 936
G LN
Sbjct: 1263 KGFLN 1267
>gi|154278862|ref|XP_001540244.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
gi|150412187|gb|EDN07574.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
Length = 1266
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1082 TSRSTRVNNRRLIADINAQKQALPMQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1141
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1142 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1201
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1202 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1261
Query: 932 HGSLN 936
G LN
Sbjct: 1262 KGFLN 1266
>gi|325089235|gb|EGC42545.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 1267
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1083 TSRSTRVNNRRLIADINAQKQALPMQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1142
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1143 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1202
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1203 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1262
Query: 932 HGSLN 936
G LN
Sbjct: 1263 KGFLN 1267
>gi|358058803|dbj|GAA95766.1| hypothetical protein E5Q_02423 [Mixia osmundae IAM 14324]
Length = 2083
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 100/183 (54%), Positives = 124/183 (67%), Gaps = 7/183 (3%)
Query: 753 EVNASQWANGKGLSARTNRVKL-----RNLLAAAEGAELLKASQVKARKKRLRFQRSKIH 807
+ AS A + A T R+ L R A G++LLK +Q+K+RKK+L+F RS IH
Sbjct: 1339 QAPASSTAISRLARANTRRLVLDMEQQRKSAATEAGSDLLKFNQLKSRKKQLKFARSPIH 1398
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
DWGL A+E I A D +IEYVGELIR +++D RE YEKMGIGSSYLFR+DD VVDATK+
Sbjct: 1399 DWGLYAMEHIPARDMIIEYVGELIRQQVADKREKAYEKMGIGSSYLFRVDDDLVVDATKK 1458
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCY 925
G AR INH C PNC ++I++ G KKI IYA I G+EITY+Y F E + KIPC
Sbjct: 1459 GTYARLINHCCAPNCTARIITIGGHKKIVIYALTDIEPGDEITYDYHFATESDDLKIPCL 1518
Query: 926 CGS 928
CGS
Sbjct: 1519 CGS 1521
>gi|242786320|ref|XP_002480782.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218720929|gb|EED20348.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 1155
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 100/185 (54%), Positives = 127/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLA---------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV+ R L A +G + L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 971 TSRSTRVQNRRLAADINAQKQVLPPQGGDGDALRFNQLKKRKKPVRFARSAIHNWGLYAE 1030
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1031 ENIAANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENAVIDATKRGGIARFI 1090
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I+ EE+TY+YKF E E +IPC CGS C
Sbjct: 1091 NHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDSEDRIPCLCGSAGC 1150
Query: 932 HGSLN 936
G LN
Sbjct: 1151 KGFLN 1155
>gi|366990781|ref|XP_003675158.1| hypothetical protein NCAS_0B07030 [Naumovozyma castellii CBS 4309]
gi|342301022|emb|CCC68787.1| hypothetical protein NCAS_0B07030 [Naumovozyma castellii CBS 4309]
Length = 1082
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL ALEPI
Sbjct: 901 SSRDNRASNRRFQQNIEAQRAAIGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPI 960
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A + +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 961 AAREMIIEYVGESIRQPVAEMRERRYLKSGIGSSYLFRVDENNVIDATKKGGIARFINHC 1020
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+P+C K+I V G+++I IYA R IAA EE+TY+YKF E E+++PCYCGS C G
Sbjct: 1021 CDPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERELDAEERLPCYCGSVNCKGF 1080
Query: 935 LN 936
LN
Sbjct: 1081 LN 1082
>gi|402081815|gb|EJT76960.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1319
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 92/182 (50%), Positives = 125/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R ++A + +++L+ +Q++ RKK ++F RS IH+WGL A+E I
Sbjct: 1138 TSRANRANHRRVVADLNDQKKTLGQDSDVLRFNQLRKRKKPVKFARSAIHNWGLYAMENI 1197
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R ++ +RE RY K GIGS+YLFR+DD V+DATK+GGIARFINHS
Sbjct: 1198 PKDDMIIEYVGEEVRPSVAKVREARYLKSGIGSTYLFRIDDEAVIDATKKGGIARFINHS 1257
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R I EE+TY+YKF EE ++PC CG+ C G
Sbjct: 1258 CMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEPEEDQKDRVPCLCGTTACKGF 1317
Query: 935 LN 936
LN
Sbjct: 1318 LN 1319
>gi|303313714|ref|XP_003066866.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106533|gb|EER24721.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 1271
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/185 (52%), Positives = 126/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A +++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1087 TSRSTRVNNRRLIADINAQKQALPMQNGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1146
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1147 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1206
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1207 NHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDDRIPCLCGSAGC 1266
Query: 932 HGSLN 936
G LN
Sbjct: 1267 KGFLN 1271
>gi|320032561|gb|EFW14513.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 1271
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/185 (52%), Positives = 126/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A +++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1087 TSRSTRVNNRRLIADINAQKQALPMQNGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1146
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1147 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1206
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1207 NHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDDRIPCLCGSAGC 1266
Query: 932 HGSLN 936
G LN
Sbjct: 1267 KGFLN 1271
>gi|392590566|gb|EIW79895.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 160
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 88/150 (58%), Positives = 116/150 (77%)
Query: 787 LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
+K +Q++ RKK LRF RS IHDWGL A+E + + VIEYVGE++R +++D RE YE+
Sbjct: 11 VKFNQLQTRKKHLRFARSPIHDWGLYAMERVSRGEMVIEYVGEVVRAQVADKREKAYERQ 70
Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC ++I++ G+KKI IYAK+ I G
Sbjct: 71 GIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTARIITISGEKKIVIYAKQDIELG 130
Query: 907 EEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+EITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 131 DEITYDYHFPIEQDKIPCLCGSAKCRGYLN 160
>gi|119185079|ref|XP_001243361.1| hypothetical protein CIMG_07257 [Coccidioides immitis RS]
gi|121936913|sp|Q1DR06.1|SET1_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|392866240|gb|EAS28850.2| histone-lysine N-methyltransferase, H3 lysine-4 specific
[Coccidioides immitis RS]
Length = 1271
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 98/185 (52%), Positives = 126/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A +++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1087 TSRSTRVNNRRLIADINAQKQALPMQNGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1146
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1147 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1206
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1207 NHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDDRIPCLCGSAGC 1266
Query: 932 HGSLN 936
G LN
Sbjct: 1267 KGFLN 1271
>gi|363756170|ref|XP_003648301.1| hypothetical protein Ecym_8199 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891501|gb|AET41484.1| Hypothetical protein Ecym_8199 [Eremothecium cymbalariae
DBVPG#7215]
Length = 995
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK++ F RS IH+WGL ALEPI
Sbjct: 814 SSRVNRALQRRFQQDIEAQKAAIGTESELLSLNQLTKRKKQVTFARSPIHNWGLYALEPI 873
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE+IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 874 AAKEMIIEYVGEMIRQPVAEMRENRYLKSGIGSSYLFRIDEYTVIDATKKGGIARFINHC 933
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+P+C K+I V G+K+I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 934 CDPSCTAKIIKVGGRKRIVIYALRDIAANEELTYDYKFEREVDDEERLPCLCGAATCKGF 993
Query: 935 LN 936
LN
Sbjct: 994 LN 995
>gi|116199091|ref|XP_001225357.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
gi|121922631|sp|Q2GWF3.1|SET1_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|88178980|gb|EAQ86448.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
Length = 1076
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 96/182 (52%), Positives = 126/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 895 NSRANRANNRRFVADLNDQRKTLGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 954
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 955 PKDDMIIEYVGEEVRQQIAELRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1014
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1015 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLCGTAACKGF 1074
Query: 935 LN 936
LN
Sbjct: 1075 LN 1076
>gi|327348240|gb|EGE77097.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 1280
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/185 (53%), Positives = 127/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLA---------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A +G E +L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1096 TSRSTRVNNRRLIADINAQKQALPMQGGEGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1155
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++++RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1156 ENISANDMIIEYVGEKVRQQVANMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1215
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1216 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1275
Query: 932 HGSLN 936
G LN
Sbjct: 1276 KGFLN 1280
>gi|402593200|gb|EJW87127.1| SET domain-containing protein, partial [Wuchereria bancrofti]
Length = 602
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 146/234 (62%), Gaps = 16/234 (6%)
Query: 707 LCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGL- 765
+ P S GCAR+ ++K S +R +R T E+N + L
Sbjct: 381 VVPHSSGCARTE----GYYKLSYKE---KRGVLRRPDVFLT-----EINERDDEKARYLV 428
Query: 766 -SARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
S R R R LL + +++ K +Q+K RKK ++F RS+IH WGL ALE I ++
Sbjct: 429 QSTREARSMNRRLLTSMGDTSSDIFKVNQLKFRKKLIKFARSRIHGWGLYALEVIAPDEM 488
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
++EY+G+ IR ++D RE RYE+ G+GSSYLFR+D V+DAT+ G +ARFINHSC PNC
Sbjct: 489 IVEYIGQKIRPTVADEREKRYERRGMGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNC 548
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y K++ V+G+K+I IY+K I G+EITY+YKFP+EE KI C CG+ C GSLN
Sbjct: 549 YAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 602
>gi|393244480|gb|EJD51992.1| histone methyltransferase [Auricularia delicata TFB-10046 SS5]
Length = 153
Score = 199 bits (507), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 93/151 (61%), Positives = 115/151 (76%)
Query: 786 LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK 845
L+K +Q++ RKK+LRF RS IHDWGL A+E I + VIEYVGE++R I+D RE YE+
Sbjct: 3 LIKFNQLQTRKKQLRFARSPIHDWGLYAMERIARGEMVIEYVGEIVRAPIADRREKLYER 62
Query: 846 MGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAA 905
GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC K+ISV G KKI IYAK+ I
Sbjct: 63 QGIGSSYLFRIDEELVVDATKKGNLGRLINHSCDPNCTAKIISVNGVKKIVIYAKQDIEL 122
Query: 906 GEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
G+E+TY+Y FP EE KIPC CG+ KC G LN
Sbjct: 123 GDELTYDYHFPREEAKIPCLCGAAKCRGFLN 153
>gi|115400872|ref|XP_001216024.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
gi|114189965|gb|EAU31665.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
Length = 1230
Score = 199 bits (506), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 97/185 (52%), Positives = 126/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A +++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1046 TSRSTRVNNRRLIADINAQKQALPTQSGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1105
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1106 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1165
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1166 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1225
Query: 932 HGSLN 936
G LN
Sbjct: 1226 KGFLN 1230
>gi|340959767|gb|EGS20948.1| hypothetical protein CTHT_0027870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1295
Score = 199 bits (506), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 126/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1114 NSRANRAHNRRFVADLNDQRKTLGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1173
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
++ +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1174 PKDEMIIEYVGEEVRQQIAELRESRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1233
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1234 CMPNCTAKIIKVEGSKRIVIYALRDIAKNEELTYDYKFERELGSADRIPCLCGTAACKGF 1293
Query: 935 LN 936
LN
Sbjct: 1294 LN 1295
>gi|302910631|ref|XP_003050330.1| histone H3 methyltransferase complex protein [Nectria haematococca
mpVI 77-13-4]
gi|256731267|gb|EEU44617.1| histone H3 methyltransferase complex protein [Nectria haematococca
mpVI 77-13-4]
Length = 1281
Score = 199 bits (506), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 96/182 (52%), Positives = 125/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1100 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1159
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I++IRE RY K GIGSSYLFR+D+ V+DATK+GGIARFINHS
Sbjct: 1160 AKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHS 1219
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1220 CMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLDRIPCLCGTAACKGF 1279
Query: 935 LN 936
LN
Sbjct: 1280 LN 1281
>gi|409078063|gb|EKM78427.1| hypothetical protein AGABI1DRAFT_41599 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426194069|gb|EKV44001.1| histone methyltransferase [Agaricus bisporus var. bisporus H97]
Length = 163
Score = 199 bits (506), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 92/158 (58%), Positives = 115/158 (72%)
Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
A A +K +Q++ RKK LRF +S IHDWGL A+E I + VIEYVGE+IR ++D
Sbjct: 6 AIANELTFIKFNQLQTRKKHLRFAKSPIHDWGLYAMERIARGEMVIEYVGEVIRAAVADK 65
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
RE YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC K+I++ G KKI IY
Sbjct: 66 REKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIY 125
Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
AK+ I G+EITY+Y FP E+ KIPC CGS KC G LN
Sbjct: 126 AKQDIELGDEITYDYHFPFEQDKIPCLCGSAKCRGFLN 163
>gi|171692915|ref|XP_001911382.1| hypothetical protein [Podospora anserina S mat+]
gi|170946406|emb|CAP73207.1| unnamed protein product [Podospora anserina S mat+]
Length = 1083
Score = 199 bits (506), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 98/191 (51%), Positives = 126/191 (65%), Gaps = 14/191 (7%)
Query: 757 SQWANGKGLSARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHD 808
SQ A G ++R NR R +A + ++ + +Q+K RKK ++F RS IH+
Sbjct: 896 SQVARG---TSRANRANNRRFVAGLHDQIRGLGQDSDAFRFNQLKKRKKPVKFARSAIHN 952
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL +E I +D +IEYVGE +R I+++RE RY K GIGSSYLFR+DD V+DATK+G
Sbjct: 953 WGLYTMENIPKDDMIIEYVGEEVRQVIAELREARYLKSGIGSSYLFRIDDNTVIDATKKG 1012
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCY 925
GIARFINHSC PNC K+I VEG K+I IYA R IA EE+TY+YKF E +IPC
Sbjct: 1013 GIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGATDRIPCL 1072
Query: 926 CGSKKCHGSLN 936
CG+ C G LN
Sbjct: 1073 CGTAACKGFLN 1083
>gi|169769549|ref|XP_001819244.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
oryzae RIB40]
gi|121933328|sp|Q2UMH3.1|SET1_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|83767103|dbj|BAE57242.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1229
Score = 199 bits (505), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1045 TSRSTRVNNRRLIADINAQKQALPTQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1104
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1105 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1164
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1165 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1224
Query: 932 HGSLN 936
G LN
Sbjct: 1225 KGFLN 1229
>gi|367010698|ref|XP_003679850.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
gi|359747508|emb|CCE90639.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
Length = 1019
Score = 199 bits (505), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 99/193 (51%), Positives = 130/193 (67%), Gaps = 11/193 (5%)
Query: 755 NASQWANGKGLSARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKI 806
+ S+ A + S+R NR R E +ELL +Q+ RKK + F RS I
Sbjct: 827 DPSEPAPQELSSSRDNRASNRRFQQDIEAQRAAIGTESELLSLNQLNKRKKPVTFARSAI 886
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
H+WGL ALEPI A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK
Sbjct: 887 HNWGLYALEPIAAKEMIIEYVGERIRQPVAEMREIRYIKKGIGSSYLFRVDENTVIDATK 946
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIP 923
+GGIARFINH C+P+C K+I V G+K+I IYA R IAA EE+TY+YKF E E+++P
Sbjct: 947 KGGIARFINHCCDPSCTAKIIKVGGKKRIVIYALRDIAANEELTYDYKFERETDDEERLP 1006
Query: 924 CYCGSKKCHGSLN 936
C CG+ C G LN
Sbjct: 1007 CLCGAPTCKGFLN 1019
>gi|391863483|gb|EIT72791.1| histone H3 (Lys4) methyltransferase complex, subunit SET1
[Aspergillus oryzae 3.042]
Length = 1223
Score = 199 bits (505), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1039 TSRSTRVNNRRLIADINAQKQALPTQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1098
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1099 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1158
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1159 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1218
Query: 932 HGSLN 936
G LN
Sbjct: 1219 KGFLN 1223
>gi|378725927|gb|EHY52386.1| histone-lysine N-methyltransferase SETD1 [Exophiala dermatitidis
NIH/UT8656]
Length = 1277
Score = 199 bits (505), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 94/170 (55%), Positives = 124/170 (72%), Gaps = 6/170 (3%)
Query: 770 NRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
R LR+L + A++L+ +Q++ RKK ++F RS IH+WGL +LE I+ + +IEYVGE
Sbjct: 1111 QREMLRDL---GDDADVLRFNQLQKRKKPVKFARSAIHNWGLYSLERIQTSEMIIEYVGE 1167
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
IR +I+D+RE +Y + GIGSSYLFR+D+G VVDATK+GGIARFINHSC+PNC K+I V
Sbjct: 1168 KIRQEIADLREIKYTESGIGSSYLFRIDEGTVVDATKKGGIARFINHSCSPNCTAKIIRV 1227
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
G K+I IYA R I EE+TY+YKF E + +IPC CGS C G LN
Sbjct: 1228 GGTKRIVIYALRDIEKDEELTYDYKFEREIDSDDRIPCLCGSAVCKGFLN 1277
>gi|365983760|ref|XP_003668713.1| hypothetical protein NDAI_0B04360 [Naumovozyma dairenensis CBS 421]
gi|343767480|emb|CCD23470.1| hypothetical protein NDAI_0B04360 [Naumovozyma dairenensis CBS 421]
Length = 1105
Score = 198 bits (504), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 102/182 (56%), Positives = 131/182 (71%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLR----NLLA--AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R N+ A AA GAE LL +Q+ RKK + F RS IH+WGL ALEPI
Sbjct: 924 SSRDNRASNRRFQQNIEAQRAAIGAESDLLSLNQLNKRKKPVTFARSAIHNWGLYALEPI 983
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 984 AAKEMIIEYVGESIRQAVAEMRERRYLKNGIGSSYLFRVDENNVIDATKKGGIARFINHC 1043
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+P+C K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC+CGS C G
Sbjct: 1044 CDPSCTAKIIKVGGRRRIVIYALRDIAANEELTYDYKFEREIDAEERLPCHCGSVNCKGF 1103
Query: 935 LN 936
LN
Sbjct: 1104 LN 1105
>gi|156061849|ref|XP_001596847.1| hypothetical protein SS1G_03070 [Sclerotinia sclerotiorum 1980]
gi|154700471|gb|EDO00210.1| hypothetical protein SS1G_03070 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1264
Score = 198 bits (504), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 96/182 (52%), Positives = 123/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S R NRV R +A ++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1083 SGRANRVNNRRYVADLNDQKKTLGGDTDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1142
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINHS
Sbjct: 1143 AMNDMIIEYVGEKVRQQVADLRENRYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHS 1202
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I+VE K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1203 CMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTPACKGF 1262
Query: 935 LN 936
LN
Sbjct: 1263 LN 1264
>gi|119467882|ref|XP_001257747.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405899|gb|EAW15850.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 1241
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1057 TSRSTRVNNRRLVADINAQKQALPMQGGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1116
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1117 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1176
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1177 NHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREWDSDDRIPCLCGSTGC 1236
Query: 932 HGSLN 936
G LN
Sbjct: 1237 KGFLN 1241
>gi|312091131|ref|XP_003146871.1| histone methyltransferase [Loa loa]
gi|307757965|gb|EFO17199.1| histone methyltransferase [Loa loa]
Length = 278
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 146/234 (62%), Gaps = 16/234 (6%)
Query: 707 LCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGL- 765
+ P S GCAR+ ++K S +R +R T E+N + L
Sbjct: 57 VIPHSSGCARTE----GYYKLS---HKEKRGILRRPDVFLT-----EINEKDDEKARYLV 104
Query: 766 -SARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
S R R R LL + +++ K +Q+K RKK ++F RS+IH WGL ALE I ++
Sbjct: 105 QSTREARSMNRRLLTSMGDTSSDIFKVNQLKFRKKLIKFARSRIHGWGLYALEAIAPDEM 164
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
++EY+G+ IR ++D RE RYE+ G+GSSYLFR+D V+DAT+ G +ARFINHSC PNC
Sbjct: 165 IVEYIGQKIRPTVADEREKRYERRGMGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNC 224
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y K++ V+G+K+I IY+K I G+EITY+YKFP+EE KI C CG+ C GSLN
Sbjct: 225 YAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 278
>gi|159124080|gb|EDP49198.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 1241
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1057 TSRSTRVNNRRLVADINAQKQALPMQGGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1116
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1117 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1176
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1177 NHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREWDSDDRIPCLCGSTGC 1236
Query: 932 HGSLN 936
G LN
Sbjct: 1237 KGFLN 1241
>gi|70991351|ref|XP_750524.1| SET domain protein [Aspergillus fumigatus Af293]
gi|74671075|sp|Q4WNH8.1|SET1_ASPFU RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component set1; AltName:
Full=SET domain-containing protein 1
gi|66848157|gb|EAL88486.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 1241
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1057 TSRSTRVNNRRLVADINAQKQALPMQGGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1116
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1117 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1176
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1177 NHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREWDSDDRIPCLCGSTGC 1236
Query: 932 HGSLN 936
G LN
Sbjct: 1237 KGFLN 1241
>gi|254569422|ref|XP_002491821.1| hypothetical protein [Komagataella pastoris GS115]
gi|238031618|emb|CAY69541.1| hypothetical protein PAS_chr2-2_0494 [Komagataella pastoris GS115]
gi|328351679|emb|CCA38078.1| histone-lysine N-methyltransferase SETD1 [Komagataella pastoris CBS
7435]
Length = 1020
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 97/187 (51%), Positives = 126/187 (67%), Gaps = 11/187 (5%)
Query: 761 NGKGLSARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLV 812
N K S+R NR R A ++L + + RKK + F RS IH+WGL
Sbjct: 834 NPKLQSSRVNRANNRRFAADINAQKQMLNSETDILDLNHLTKRKKPVSFARSAIHNWGLY 893
Query: 813 ALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIAR 872
ALEPI A++ +IEYVGE++R K+S++RE +Y K GIGSSYLFR+D+ V+DATK+GGIAR
Sbjct: 894 ALEPIAAKEMIIEYVGEILRQKVSEVREKKYLKSGIGSSYLFRVDEDTVIDATKKGGIAR 953
Query: 873 FINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSK 929
FINH C P+C K+I VEG+K+I IYA + IAA EE+TY+YKF E E++IPC CG
Sbjct: 954 FINHCCQPSCTAKIIKVEGKKRIVIYALKDIAANEELTYDYKFEREDNNEERIPCLCGVP 1013
Query: 930 KCHGSLN 936
C G LN
Sbjct: 1014 GCKGYLN 1020
>gi|121709862|ref|XP_001272547.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400697|gb|EAW11121.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 1232
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 98/183 (53%), Positives = 125/183 (68%), Gaps = 4/183 (2%)
Query: 757 SQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEP 816
S N + L A N K + + +G + L+ +Q+K RKK +RF RS IH+WGL A E
Sbjct: 1051 STRVNNRRLVADINAQKQALPMQSGDG-DALRFNQLKKRKKPVRFARSAIHNWGLYAEEN 1109
Query: 817 IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+ V+DATKRGGIARFINH
Sbjct: 1110 ISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFINH 1169
Query: 877 SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHG 933
SC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C G
Sbjct: 1170 SCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCKG 1229
Query: 934 SLN 936
LN
Sbjct: 1230 FLN 1232
>gi|451848788|gb|EMD62093.1| hypothetical protein COCSADRAFT_162605 [Cochliobolus sativus ND90Pr]
Length = 1180
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 95/190 (50%), Positives = 126/190 (66%), Gaps = 8/190 (4%)
Query: 754 VNASQWANGKGLSARTNR----VKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDW 809
N + AN + A R + + L+ AEG + L+ +Q+K RKK ++F RS IH+W
Sbjct: 992 ANKASTANSRANRANNRRMVNDINISKNLSGAEG-DALRFNQLKKRKKLVKFDRSAIHNW 1050
Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
GL A EPI D +IEYVGE +R +++D+RE +Y+ G+GSSYLFR+D+ V+DATK GG
Sbjct: 1051 GLYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGSSYLFRIDEDTVIDATKMGG 1110
Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYC 926
IARFINHSC PNC K+I V+ K+I IYA R I + EE+TY+YKF E +IPC C
Sbjct: 1111 IARFINHSCTPNCTAKIIRVDNTKRIVIYALRDIQSDEELTYDYKFEREIDATDRIPCLC 1170
Query: 927 GSKKCHGSLN 936
GS C G LN
Sbjct: 1171 GSIGCKGFLN 1180
>gi|50293843|ref|XP_449333.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637287|sp|Q6FKB1.1|SET1_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49528646|emb|CAG62307.1| unnamed protein product [Candida glabrata]
Length = 1111
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 101/182 (55%), Positives = 129/182 (70%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLR----NLLA--AAEG--AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R N+ A AA G +ELL +Q+ RKK + F RS IH+WGL ALEPI
Sbjct: 930 SSRDNRASNRRFQQNIEAQKAATGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPI 989
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ VIEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 990 NAKEMVIEYVGERIRQPVAEMRERRYIKNGIGSSYLFRIDEHTVIDATKKGGIARFINHC 1049
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C P+C K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1050 CEPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERETDAEERLPCLCGAPSCKGF 1109
Query: 935 LN 936
LN
Sbjct: 1110 LN 1111
>gi|255711468|ref|XP_002552017.1| KLTH0B05280p [Lachancea thermotolerans]
gi|238933395|emb|CAR21579.1| KLTH0B05280p [Lachancea thermotolerans CBS 6340]
Length = 986
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 126/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL ALEPI
Sbjct: 805 SSRVNRAINRRFQQDIEAQKAMIGSESELLTLNQLTKRKKPVTFARSAIHNWGLYALEPI 864
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE++R ++++RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 865 AAKEMIIEYVGEILRQPVAEMRERTYLKSGIGSSYLFRVDESTVIDATKKGGIARFINHC 924
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+P+C K+I V G+K+I IYA R IAA EE+TY+YKF E E+++PC+CG+ C G
Sbjct: 925 CDPSCTAKIIRVGGRKRIVIYALRDIAANEELTYDYKFERETDDEERLPCFCGAPTCKGF 984
Query: 935 LN 936
LN
Sbjct: 985 LN 986
>gi|324507672|gb|ADY43247.1| Histone-lysine N-methyltransferase set-2 [Ascaris suum]
Length = 539
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 91/174 (52%), Positives = 122/174 (70%), Gaps = 3/174 (1%)
Query: 766 SARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
+ R R R LL + ++ K +Q+K RKK ++F RS+IH WGL ALEPI ++ +
Sbjct: 366 ATREARSDNRRLLTSMGETSSDFFKVNQLKYRKKMIKFARSRIHGWGLYALEPITPDEMI 425
Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
+EYVG+ IR ++D RE RY + G+GSSYLFR+D V+DAT G ARFINHSC PNCY
Sbjct: 426 VEYVGQKIRPTVADEREKRYIRKGMGSSYLFRIDSDNVIDATNMGNFARFINHSCQPNCY 485
Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK-KIPCYCGSKKCHGSLN 936
KV+ V+G+K+I IY+K I G+EITY+YKFP+EE+ KI C CG+ C G+LN
Sbjct: 486 AKVVVVDGEKRIVIYSKTPINKGDEITYDYKFPIEEEDKIDCLCGAPSCRGTLN 539
>gi|385305977|gb|EIF49918.1| putative compass histone methyltransferase subunit set1p [Dekkera
bruxellensis AWRI1499]
Length = 1104
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/183 (53%), Positives = 128/183 (69%), Gaps = 11/183 (6%)
Query: 765 LSARTNRVKLRNLLAAAE------GAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEP 816
S+R NR R A G+E LL+ +Q+ R+K ++F RS IH+WGL A EP
Sbjct: 922 FSSRVNRAASRRFAADVSMQKQMLGSESDLLELNQLLKRQKPVQFARSAIHNWGLYAXEP 981
Query: 817 IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
I A++ + EYVGE IR K++++RE RY K GIGSSYLFR+DD V+DA+K+GGIARFINH
Sbjct: 982 IAAKEMIXEYVGERIRQKVAEVREKRYLKSGIGSSYLFRVDDNTVIDASKKGGIARFINH 1041
Query: 877 SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHG 933
C+P+C K+I V+G+K+I IYA R IAA EE+TY+YKF E E++IPC CG+ C G
Sbjct: 1042 CCDPSCTAKIIKVDGKKRIVIYALRDIAANEELTYDYKFEKETNPEERIPCLCGAPNCKG 1101
Query: 934 SLN 936
LN
Sbjct: 1102 YLN 1104
>gi|440632035|gb|ELR01954.1| hypothetical protein GMDG_05127 [Geomyces destructans 20631-21]
Length = 1301
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 122/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NRV R +A A+ L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1120 SSRANRVNNRRFVADLNDQKKTLGGEADALRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1179
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
D +IEYVGE +R ++D+RE Y K GIGSSYLFR+D+ VVDATKRGGIARFINHS
Sbjct: 1180 AMNDMIIEYVGEKLRQSVADLRERIYLKSGIGSSYLFRIDENTVVDATKRGGIARFINHS 1239
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I VEG ++I IYA R I EE+TY+YKF E + +IPC CG+ C G
Sbjct: 1240 CMPNCTAKIIKVEGTRRIVIYALRDIKLNEELTYDYKFEREIGSDDRIPCLCGTVACKGF 1299
Query: 935 LN 936
LN
Sbjct: 1300 LN 1301
>gi|67539250|ref|XP_663399.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
gi|74680884|sp|Q5B0Y5.1|SET1_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|40743698|gb|EAA62888.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
gi|259484715|tpe|CBF81174.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC
2.1.1.43)(COMPASS component SET1)(SET domain-containing
protein 1) [Source:UniProtKB/Swiss-Prot;Acc:Q5B0Y5]
[Aspergillus nidulans FGSC A4]
Length = 1220
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 97/185 (52%), Positives = 128/185 (69%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNLLA---------AAEG--AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++G +++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1036 TSRSTRVNNRRLIADINAQKQALPSQGGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1095
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
I A + +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1096 VNISANEMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1155
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E + +IPC CGS C
Sbjct: 1156 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSAGC 1215
Query: 932 HGSLN 936
G LN
Sbjct: 1216 KGFLN 1220
>gi|451998610|gb|EMD91074.1| hypothetical protein COCHEDRAFT_1156416 [Cochliobolus heterostrophus
C5]
Length = 1251
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 91/170 (53%), Positives = 120/170 (70%), Gaps = 4/170 (2%)
Query: 770 NRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
N + + L+ AEG + L+ +Q+K RKK ++F RS IH+WGL A EPI D +IEYVGE
Sbjct: 1083 NDINISKNLSGAEG-DALRFNQLKKRKKLVKFDRSAIHNWGLYAQEPIAVNDMIIEYVGE 1141
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
+R +++D+RE +Y+ G+GSSYLFR+D+ V+DATK GGIARFINHSC PNC K+I V
Sbjct: 1142 KVRQRVADLREAKYDMQGVGSSYLFRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRV 1201
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+ K+I IYA R I + EE+TY+YKF E +IPC CGS C G LN
Sbjct: 1202 DNTKRIVIYALRDIQSDEELTYDYKFEREIDATDRIPCLCGSIGCKGFLN 1251
>gi|189205485|ref|XP_001939077.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187975170|gb|EDU41796.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1274
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 94/189 (49%), Positives = 126/189 (66%), Gaps = 8/189 (4%)
Query: 755 NASQWANGKGLSARTNR----VKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWG 810
N + AN + A R + + ++ AEG + L+ +Q+K RKK ++F RS IH+WG
Sbjct: 1087 NKTSTANSRANRANNRRMVNDINISKNMSGAEG-DALRFNQLKKRKKLVKFDRSAIHNWG 1145
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A EPI D +IEYVGE +R +++D+RE +Y+ G+GSSYLFR+D+ V+DATK GGI
Sbjct: 1146 LYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGSSYLFRIDEDTVIDATKMGGI 1205
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCG 927
ARFINHSC PNC K+I V+ K+I IYA R I + EE+TY+YKF E +IPC CG
Sbjct: 1206 ARFINHSCTPNCTAKIIRVDNTKRIVIYALRDINSDEELTYDYKFEREMDATDRIPCLCG 1265
Query: 928 SKKCHGSLN 936
S C G LN
Sbjct: 1266 SIGCKGFLN 1274
>gi|330906701|ref|XP_003295568.1| hypothetical protein PTT_01686 [Pyrenophora teres f. teres 0-1]
gi|311333048|gb|EFQ96340.1| hypothetical protein PTT_01686 [Pyrenophora teres f. teres 0-1]
Length = 1272
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 94/189 (49%), Positives = 126/189 (66%), Gaps = 8/189 (4%)
Query: 755 NASQWANGKGLSARTNR----VKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWG 810
N + AN + A R + + ++ AEG + L+ +Q+K RKK ++F RS IH+WG
Sbjct: 1085 NKTSTANSRANRANNRRMVNDINISKNMSGAEG-DALRFNQLKKRKKLVKFDRSAIHNWG 1143
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A EPI D +IEYVGE +R +++D+RE +Y+ G+GSSYLFR+D+ V+DATK GGI
Sbjct: 1144 LYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGSSYLFRIDEDTVIDATKMGGI 1203
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCG 927
ARFINHSC PNC K+I V+ K+I IYA R I + EE+TY+YKF E +IPC CG
Sbjct: 1204 ARFINHSCTPNCTAKIIRVDNTKRIVIYALRDINSDEELTYDYKFEREMDATDRIPCLCG 1263
Query: 928 SKKCHGSLN 936
S C G LN
Sbjct: 1264 SIGCKGFLN 1272
>gi|320580861|gb|EFW95083.1| histone-lysine n-methyltransferase, h3 lysine-4 specific, putative
[Ogataea parapolymorpha DL-1]
Length = 658
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 94/182 (51%), Positives = 126/182 (69%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R A ++L +Q+ RKK ++F RS IH+WGL ALEPI
Sbjct: 477 SSRVNRANNRRFAADISAQKQMLGSETDILDLNQLTKRKKPVQFARSAIHNWGLYALEPI 536
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR +++++RE +Y + GIGSSYLFR+D+ V+DA+K+GGIARFINH
Sbjct: 537 AAKEMIIEYVGERIRQQVAEVREKKYLRSGIGSSYLFRIDENTVIDASKKGGIARFINHC 596
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C P+C K+I VEG+K+I IYA R IAA EE+TY+YKF E E++IPC CG+ C G
Sbjct: 597 CVPSCTAKIIKVEGKKRIVIYALRDIAANEELTYDYKFERETNDEERIPCLCGAPGCKGY 656
Query: 935 LN 936
LN
Sbjct: 657 LN 658
>gi|302306708|ref|NP_983083.2| ABR136Wp [Ashbya gossypii ATCC 10895]
gi|442570023|sp|Q75D88.2|SET1_ASHGO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|299788647|gb|AAS50907.2| ABR136Wp [Ashbya gossypii ATCC 10895]
Length = 975
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL ALEPI
Sbjct: 794 SSRVNRALQRRFQQDIEAQKAAIGTESELLSLNQLTKRKKPVTFARSAIHNWGLYALEPI 853
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 854 SAKEMIIEYVGERIRQPVAEMREKRYLKSGIGSSYLFRVDESTVIDATKKGGIARFINHC 913
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+P+C K+I V G K+I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 914 CDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERETDDEERLPCLCGAPNCKGF 973
Query: 935 LN 936
LN
Sbjct: 974 LN 975
>gi|374106286|gb|AEY95196.1| FABR136Wp [Ashbya gossypii FDAG1]
Length = 975
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL ALEPI
Sbjct: 794 SSRVNRALQRRFQQDIEAQKAAIGTESELLSLNQLTKRKKPVTFARSAIHNWGLYALEPI 853
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 854 SAKEMIIEYVGERIRQPVAEMREKRYLKSGIGSSYLFRVDESTVIDATKKGGIARFINHC 913
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+P+C K+I V G K+I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 914 CDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERETDDEERLPCLCGAPNCKGF 973
Query: 935 LN 936
LN
Sbjct: 974 LN 975
>gi|254585415|ref|XP_002498275.1| ZYRO0G06446p [Zygosaccharomyces rouxii]
gi|238941169|emb|CAR29342.1| ZYRO0G06446p [Zygosaccharomyces rouxii]
Length = 1015
Score = 196 bits (498), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 96/182 (52%), Positives = 122/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL ALEPI
Sbjct: 834 SSRDNRASNRRFQQDIEAQRAAIGTESELLSLNQLNKRKKPVTFARSTIHNWGLYALEPI 893
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 894 AAKEMIIEYVGERIRQPVAEMREIRYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHC 953
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C P+C K+I V G K+I IYA R I A EE+TY+YKF E E+++PC CG+ C G
Sbjct: 954 CEPSCTAKIIKVGGMKRIVIYALRDIGANEELTYDYKFEREIDAEERLPCLCGAPSCKGF 1013
Query: 935 LN 936
LN
Sbjct: 1014 LN 1015
>gi|347827280|emb|CCD42977.1| similar to histone-lysine N-methyltransferase [Botryotinia
fuckeliana]
Length = 1265
Score = 196 bits (498), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 89/155 (57%), Positives = 115/155 (74%), Gaps = 3/155 (1%)
Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
++L+ +Q+K RKK ++F RS IH+WGL A+E I D +IEYVGE +R +++D+RE RY
Sbjct: 1111 DVLRFNQLKKRKKPVKFARSAIHNWGLYAMENIAMNDMIIEYVGEKVRQQVADLRENRYL 1170
Query: 845 KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
K GIGSSYLFR+D+ V+DATK+GGIARFINHSC PNC K+I+VE K+I IYA R IA
Sbjct: 1171 KSGIGSSYLFRIDENTVIDATKKGGIARFINHSCMPNCTAKIITVEKSKRIVIYALRDIA 1230
Query: 905 AGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
EE+TY+YKF E +IPC CG+ C G LN
Sbjct: 1231 QNEELTYDYKFEREIGSTDRIPCLCGTPACKGFLN 1265
>gi|367004711|ref|XP_003687088.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
gi|357525391|emb|CCE64654.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
Length = 1030
Score = 196 bits (497), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 123/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+PI
Sbjct: 849 SSRVNRAMNRRFQQDIEAQKAVIGSESELLTLNQLNKRKKPVTFARSAIHNWGLYALQPI 908
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR +S++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 909 AAKEMIIEYVGERIRQPVSEMREIRYIKNGIGSSYLFRIDENTVIDATKKGGIARFINHC 968
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGS 934
C+P+C K+I V G+K+I IYA R I EE+TY+YKF EE +++PC CG+ C G
Sbjct: 969 CDPSCTAKIIKVGGKKRIVIYALRDIDVNEELTYDYKFEREEDDQERLPCLCGAPNCKGF 1028
Query: 935 LN 936
LN
Sbjct: 1029 LN 1030
>gi|259146872|emb|CAY80128.1| Set1p [Saccharomyces cerevisiae EC1118]
Length = 1080
Score = 196 bits (497), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 96/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 899 SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 959 AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
CNPNC K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1019 CNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078
Query: 935 LN 936
LN
Sbjct: 1079 LN 1080
>gi|323348281|gb|EGA82530.1| Set1p [Saccharomyces cerevisiae Lalvin QA23]
Length = 980
Score = 196 bits (497), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 96/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 799 SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 858
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 859 AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 918
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
CNPNC K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 919 CNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 978
Query: 935 LN 936
LN
Sbjct: 979 LN 980
>gi|380806421|gb|AFE75086.1| histone-lysine N-methyltransferase SETD1B, partial [Macaca mulatta]
Length = 367
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/180 (57%), Positives = 133/180 (73%), Gaps = 5/180 (2%)
Query: 751 GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +S IH
Sbjct: 190 GMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIH 247
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
DWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++DATK
Sbjct: 248 DWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 307
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G ARFINHSCNPNCY KVI+VE QKKI IY+K+HI EEITY+YKFP+E+ KIPC CG
Sbjct: 308 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCG 367
>gi|169601854|ref|XP_001794349.1| hypothetical protein SNOG_03803 [Phaeosphaeria nodorum SN15]
gi|111067888|gb|EAT89008.1| hypothetical protein SNOG_03803 [Phaeosphaeria nodorum SN15]
Length = 1168
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 94/185 (50%), Positives = 123/185 (66%), Gaps = 4/185 (2%)
Query: 755 NASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
A+ AN N + + + AEG + L+ +Q+K RKK ++F RS IH+WGL A
Sbjct: 985 TANSRANRANNRRMVNDINISKNMTGAEG-DALRFNQLKKRKKLVKFDRSAIHNWGLYAQ 1043
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY++ G+GSSYLFR+D+ V+DATK GGIARFI
Sbjct: 1044 ENIVANDMIIEYVGEKVRQRVADLREVRYDQQGVGSSYLFRIDEDTVIDATKMGGIARFI 1103
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC PNC K+I V+ K+I IYA R I EE+TY+YKF E +IPC CGS C
Sbjct: 1104 NHSCTPNCTAKIIRVDNTKRIVIYALRDIGQDEELTYDYKFEREMDATDRIPCLCGSVGC 1163
Query: 932 HGSLN 936
G LN
Sbjct: 1164 KGFLN 1168
>gi|346974289|gb|EGY17741.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
Length = 1148
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 91/172 (52%), Positives = 121/172 (70%), Gaps = 3/172 (1%)
Query: 768 RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + L++ + ++ ++ K +Q+K RKK ++F RS IH+WGL E I +D +IEYV
Sbjct: 977 RRHVTALQDEMKSSGDPDVFKFNQLKKRKKPVKFARSAIHNWGLYTEENINKDDMIIEYV 1036
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
GE +R +IS+IRE RY K G+GSSYLFR+D+ V+DATK+GGIARFINHSC PNC K+I
Sbjct: 1037 GEQVRQQISEIREVRYLKQGMGSSYLFRIDENTVIDATKKGGIARFINHSCMPNCTAKII 1096
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
V+G K+I IYA R IA EE+TY+YKF E +IPC CG+ C G LN
Sbjct: 1097 KVDGSKRIVIYALRDIARTEELTYDYKFEREIGSLDRIPCLCGTALCKGFLN 1148
>gi|429862241|gb|ELA36898.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
Nara gc5]
Length = 1270
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 95/181 (52%), Positives = 122/181 (67%), Gaps = 10/181 (5%)
Query: 766 SARTNRVKLRNLLAAAEG-------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIE 818
++R NR R ++A + ++ K +Q+K RKK ++F RS IH+WGL E I
Sbjct: 1090 NSRANRANNRRHVSALQDEKKMSGDSDAFKFNQLKKRKKPVKFARSAIHNWGLYTEENIN 1149
Query: 819 AEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
+D +IEYVGE +R IS+IRE RY K G+GSSYLFR+DD V+DATK+GGIARFINHSC
Sbjct: 1150 KDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATKKGGIARFINHSC 1209
Query: 879 NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSL 935
PNC K+I V+G K+I IYA R IA EE+TY+YKF E +IPC CG+ C G L
Sbjct: 1210 MPNCTAKIIKVDGSKRIVIYALRDIAQHEELTYDYKFEREIGSLDRIPCLCGTAACKGFL 1269
Query: 936 N 936
N
Sbjct: 1270 N 1270
>gi|398394325|ref|XP_003850621.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
gi|339470500|gb|EGP85597.1| histone methyltransferase [Zymoseptoria tritici IPO323]
Length = 1163
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 95/186 (51%), Positives = 123/186 (66%), Gaps = 15/186 (8%)
Query: 766 SARTNRVKLRNLL------------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
++R+NRV R L+ A ++ ++ +Q+K RKK ++F RS IH WGL A
Sbjct: 978 TSRSNRVNNRRLVNDINLQKQTLATANSDADVAIRFNQLKKRKKLVKFDRSAIHGWGLYA 1037
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I D +IEYVGE +R KI+D+RE RYEK G+GSSYLFR+ D +VDATK+GGIARF
Sbjct: 1038 EENIAVNDLIIEYVGEKVRQKIADLREIRYEKQGVGSSYLFRMIDDEIVDATKKGGIARF 1097
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
INHSC+PNC K+I VEG +I IYA + I +E+TY+YKF E +IPC CGS
Sbjct: 1098 INHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDYKFEREMDSTDRIPCLCGSAN 1157
Query: 931 CHGSLN 936
C G LN
Sbjct: 1158 CKGFLN 1163
>gi|403217274|emb|CCK71769.1| hypothetical protein KNAG_0H03550 [Kazachstania naganishii CBS 8797]
Length = 1111
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 96/180 (53%), Positives = 125/180 (69%), Gaps = 11/180 (6%)
Query: 765 LSARTNRVKLRNLLA------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEP 816
LS+R NR + R AA G E LL +Q+ RKK + F RS IH+WGL ALEP
Sbjct: 931 LSSRDNRAQNRRFQQDIEAQKAAIGTESDLLSLNQLNKRKKPVTFARSAIHNWGLYALEP 990
Query: 817 IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
I A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATKRGGIARFINH
Sbjct: 991 IAAKEMIIEYVGERIRQPVAEMREMRYLKSGIGSSYLFRVDENNVIDATKRGGIARFINH 1050
Query: 877 SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHG 933
C+P+C K+I V G+++I IYA R I EE+TY+YKF E E+++PC+CG+ C G
Sbjct: 1051 CCDPSCTAKIIKVGGKRRIVIYALRDIGKNEELTYDYKFEREQDDEERLPCHCGAPTCKG 1110
>gi|302416827|ref|XP_003006245.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
gi|261355661|gb|EEY18089.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
Length = 1135
Score = 194 bits (494), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 91/172 (52%), Positives = 121/172 (70%), Gaps = 3/172 (1%)
Query: 768 RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R + L++ + ++ ++ K +Q+K RKK ++F RS IH+WGL E I +D +IEYV
Sbjct: 964 RRHVTALQDEMKSSGDPDVFKFNQLKKRKKPVKFARSAIHNWGLYTEENINKDDMIIEYV 1023
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
GE +R +IS+IRE RY K G+GSSYLFR+D+ V+DATK+GGIARFINHSC PNC K+I
Sbjct: 1024 GEQVRQQISEIREVRYLKQGMGSSYLFRIDENTVIDATKKGGIARFINHSCMPNCTAKII 1083
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
V+G K+I IYA R IA EE+TY+YKF E +IPC CG+ C G LN
Sbjct: 1084 KVDGSKRIVIYALRDIARTEELTYDYKFEREIGSLDRIPCLCGTALCKGFLN 1135
>gi|50312247|ref|XP_456155.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74636430|sp|Q6CIT4.1|SET1_KLULA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49645291|emb|CAG98863.1| KLLA0F24134p [Kluyveromyces lactis]
Length = 1000
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 123/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NRV R E ++LL +Q+ RKK + F RS IH+WGL ALEPI
Sbjct: 819 SSRLNRVFQRRFQQDIEAQRAAIGFESDLLSLNQLTKRKKPVTFARSAIHNWGLYALEPI 878
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATKRGGIARFINH
Sbjct: 879 AAKEMIIEYVGESIRQPVAEMREKRYIKSGIGSSYLFRIDENTVIDATKRGGIARFINHC 938
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGS 934
C P+C K+I V+G+K+I IYA R I EE+TY+YKF E +++PC CG+ C G
Sbjct: 939 CEPSCTAKIIKVDGRKRIVIYALRDIGTNEELTYDYKFERETDEGERLPCLCGAPSCKGF 998
Query: 935 LN 936
LN
Sbjct: 999 LN 1000
>gi|308487582|ref|XP_003105986.1| CRE-SET-2 protein [Caenorhabditis remanei]
gi|308254560|gb|EFO98512.1| CRE-SET-2 protein [Caenorhabditis remanei]
Length = 1505
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 102/247 (41%), Positives = 145/247 (58%), Gaps = 23/247 (9%)
Query: 707 LCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLS 766
+ P ++GCAR+ + K S+ + R V T + + L+
Sbjct: 1265 VIPVAEGCARAR----PYKKMSMKQKRSLVRRPENESSVSTAIFSERDETA--MRHQHLA 1318
Query: 767 ARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
+ R+ R LL + A + K +Q+K RKK ++F RS+IH WGL A+E I ++ ++
Sbjct: 1319 NKDMRLLQRRLLTSLGDASNDFFKINQLKFRKKMIKFARSRIHGWGLYAMETIAQDEMIV 1378
Query: 825 EYVGEL---------------IRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
EY+G+ IR ++D RE YE+ GIGSSYLFR+D+ V+DATKRG
Sbjct: 1379 EYIGQTVIVFQNFSSILFHLQIRSLVADEREKAYERRGIGSSYLFRIDENSVIDATKRGN 1438
Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSK 929
ARFINHSC PNCY KV+++EG+K+I IY++ I GEEITY+YKFP+E+ KI C CG+K
Sbjct: 1439 FARFINHSCQPNCYAKVLTIEGEKRIVIYSRSVINKGEEITYDYKFPIEDDKIDCLCGAK 1498
Query: 930 KCHGSLN 936
C G LN
Sbjct: 1499 ACRGYLN 1505
>gi|154311696|ref|XP_001555177.1| hypothetical protein BC1G_06307 [Botryotinia fuckeliana B05.10]
Length = 451
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 122/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S R RV R +A ++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 270 SGRATRVNNRRYVADLNDQKKTLGGDTDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 329
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINHS
Sbjct: 330 AMNDMIIEYVGEKVRQQVADLRENRYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHS 389
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C PNC K+I+VE K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 390 CMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTPACKGF 449
Query: 935 LN 936
LN
Sbjct: 450 LN 451
>gi|256271664|gb|EEU06704.1| Set1p [Saccharomyces cerevisiae JAY291]
Length = 1080
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 899 SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 959 AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+PNC K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078
Query: 935 LN 936
LN
Sbjct: 1079 LN 1080
>gi|151944065|gb|EDN62358.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 1080
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 899 SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 959 AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+PNC K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078
Query: 935 LN 936
LN
Sbjct: 1079 LN 1080
>gi|6321911|ref|NP_011987.1| Set1p [Saccharomyces cerevisiae S288c]
gi|731707|sp|P38827.1|SET1_YEAST RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=Lysine N-methyltransferase 2; AltName: Full=SET
domain-containing protein 1
gi|529135|gb|AAB68867.1| Set1p [Saccharomyces cerevisiae]
gi|190405898|gb|EDV09165.1| histone-lysine N-methyltransferase [Saccharomyces cerevisiae RM11-1a]
gi|285810026|tpg|DAA06813.1| TPA: Set1p [Saccharomyces cerevisiae S288c]
gi|392298926|gb|EIW10021.1| Set1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1080
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 899 SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 959 AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+PNC K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078
Query: 935 LN 936
LN
Sbjct: 1079 LN 1080
>gi|349578671|dbj|GAA23836.1| K7_Set1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1080
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 899 SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 959 AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+PNC K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078
Query: 935 LN 936
LN
Sbjct: 1079 LN 1080
>gi|156847122|ref|XP_001646446.1| hypothetical protein Kpol_1048p18 [Vanderwaltozyma polyspora DSM
70294]
gi|156117123|gb|EDO18588.1| hypothetical protein Kpol_1048p18 [Vanderwaltozyma polyspora DSM
70294]
Length = 1074
Score = 194 bits (492), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 91/161 (56%), Positives = 118/161 (73%), Gaps = 3/161 (1%)
Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
A +ELL +Q+ RKK + F RS IH+WGL ALEPI A++ +IEYVGE IR ++++
Sbjct: 914 AIGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQPVAEM 973
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
RE RY K GIGSSYLFR+D+ V+DATKRGGIARFINH C+P+C K+I V G K+I IY
Sbjct: 974 RERRYIKNGIGSSYLFRVDENTVIDATKRGGIARFINHCCDPSCTAKIIKVGGMKRIVIY 1033
Query: 899 AKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
A R IA+ EE+TY+YKF E ++++PC CG+ C G LN
Sbjct: 1034 ALRDIASNEELTYDYKFEREMDDKERLPCLCGAATCKGFLN 1074
>gi|410080444|ref|XP_003957802.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
gi|372464389|emb|CCF58667.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
Length = 1133
Score = 194 bits (492), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 90/161 (55%), Positives = 118/161 (73%), Gaps = 3/161 (1%)
Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
A +ELL +Q+ RKK + F RS IH+WGL ALEPI A++ +IEYVGE IR ++++
Sbjct: 973 AIGSESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQPVAEM 1032
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH C+P+C K+I V G+++I IY
Sbjct: 1033 REKRYLKNGIGSSYLFRVDENNVIDATKKGGIARFINHCCDPSCTAKIIKVGGKRRIVIY 1092
Query: 899 AKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
A R IA EE+TY+YKF E E+++PC CG+ C G LN
Sbjct: 1093 ALRDIAKNEELTYDYKFEREQDDEERLPCLCGAPNCKGFLN 1133
>gi|380494835|emb|CCF32851.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 1257
Score = 193 bits (491), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 97/193 (50%), Positives = 127/193 (65%), Gaps = 11/193 (5%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEG-------AELLKASQVKARKKRLRFQRSKI 806
+ ++ A KG ++R NR R ++A + ++ K +Q+K RKK ++F RS I
Sbjct: 1066 LTVAEKAVAKG-NSRANRANNRRHVSALQDEKKISGDSDAFKFNQLKKRKKPVKFARSAI 1124
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
H+WGL E I +D +IEYVGE +R IS+IRE RY K G+GSSYLFR+DD V+DATK
Sbjct: 1125 HNWGLYTEENINKDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATK 1184
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIP 923
+GGIARFINHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E +IP
Sbjct: 1185 KGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKFEREIGSLDRIP 1244
Query: 924 CYCGSKKCHGSLN 936
C CG+ C G LN
Sbjct: 1245 CLCGTAACKGFLN 1257
>gi|310792530|gb|EFQ28057.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 1262
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 97/193 (50%), Positives = 127/193 (65%), Gaps = 11/193 (5%)
Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEG-------AELLKASQVKARKKRLRFQRSKI 806
+ A++ KG ++R NR R ++A + ++ K +Q+K RKK ++F RS I
Sbjct: 1071 LTAAEKVVAKG-NSRANRANNRRHVSALQDEKKISGDSDAFKFNQLKKRKKPVKFARSAI 1129
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
H+WGL E I +D +IEYVGE +R IS+IRE RY K G+GSSYLFR+DD V+DATK
Sbjct: 1130 HNWGLYTEENINKDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATK 1189
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIP 923
+GGIARFINHSC PNC K+I V+G K+I IYA R I EE+TY+YKF E +IP
Sbjct: 1190 KGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKFEREIGSLDRIP 1249
Query: 924 CYCGSKKCHGSLN 936
C CG+ C G LN
Sbjct: 1250 CLCGTAACKGFLN 1262
>gi|302501458|ref|XP_003012721.1| hypothetical protein ARB_00972 [Arthroderma benhamiae CBS 112371]
gi|291176281|gb|EFE32081.1| hypothetical protein ARB_00972 [Arthroderma benhamiae CBS 112371]
Length = 1363
Score = 193 bits (490), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 102/207 (49%), Positives = 129/207 (62%), Gaps = 37/207 (17%)
Query: 766 SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
S+R+NRV R L+A + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1158 SSRSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1216
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATK GGIARF
Sbjct: 1217 EENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARF 1276
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG---------------------EEITYN 912
INHSC PNC K+I V+G K+I IYA R I G EE+TY+
Sbjct: 1277 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERGGLIHSKLLLSIYDRLLTFCVDEELTYD 1336
Query: 913 YKFPLE---EKKIPCYCGSKKCHGSLN 936
YKF E + +IPC CGS C G LN
Sbjct: 1337 YKFEREWDSDDRIPCLCGSTGCKGFLN 1363
>gi|207344594|gb|EDZ71692.1| YHR119Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 1080
Score = 193 bits (490), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 899 SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDFI 958
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 959 AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+PNC K+I V G+++I IYA R IAA EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078
Query: 935 LN 936
LN
Sbjct: 1079 LN 1080
>gi|396457972|ref|XP_003833599.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
gi|312210147|emb|CBX90234.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
Length = 1372
Score = 192 bits (489), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 89/170 (52%), Positives = 121/170 (71%), Gaps = 4/170 (2%)
Query: 770 NRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
N + + ++ A+G + L+ +Q+K RKK ++F RS IH+WGL A E I A D +IEYVGE
Sbjct: 1204 NDINISKNMSGADG-DALRFNQLKKRKKLVKFDRSAIHNWGLYAEENIAANDMIIEYVGE 1262
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
+R +++D+RE +Y++ G+GSSYLFR+D+ V+DATK GGIARFINHSC PNC K+I V
Sbjct: 1263 KVRQRVADLREAKYDQQGVGSSYLFRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRV 1322
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+ K+I IYA R I+ EE+TY+YKF E +IPC CGS C G LN
Sbjct: 1323 DTSKRIVIYALRDISQDEELTYDYKFEREMDATDRIPCLCGSIGCKGFLN 1372
>gi|299117155|emb|CBN75119.1| histone methyltransferase [Ectocarpus siliculosus]
Length = 2067
Score = 192 bits (488), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 86/156 (55%), Positives = 117/156 (75%), Gaps = 3/156 (1%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
A LL +++R+K+LRF RS +H WG+ A EPI A D VIEY GE+I ++D RE +Y
Sbjct: 1912 AGLLNLDVLQSRRKKLRFGRSSVHAWGVFADEPIAAGDLVIEYRGEIIGNAVADKREKQY 1971
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E M IGS Y+FR+D+ VVDAT +G +ARFINHSC+P+C T++I+VEG KKI IYA+R +
Sbjct: 1972 EDMQIGSDYMFRVDEDTVVDATFKGSLARFINHSCDPSCTTRIITVEGSKKIVIYAERDV 2031
Query: 904 AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
A GEE++Y+YKFP E ++PC+CGS+KC G +N
Sbjct: 2032 AMGEELSYDYKFPPEPDEAARVPCHCGSEKCRGFIN 2067
>gi|406863054|gb|EKD16102.1| SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1280
Score = 192 bits (487), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 87/155 (56%), Positives = 113/155 (72%), Gaps = 3/155 (1%)
Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
++L+ +Q++ R+K + F+RS IH WGL A+E I D +IEYVGE +R ++D+RE RY
Sbjct: 1126 DVLRFNQLQKRQKLVNFERSAIHKWGLFAMENINQNDMIIEYVGEKVRQTVADLREHRYL 1185
Query: 845 KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
K GIGSSYLFR+D+ V+DATK+GGIARFINHSC PNC K+I+VE K+I IYA R IA
Sbjct: 1186 KSGIGSSYLFRIDESTVIDATKKGGIARFINHSCMPNCTAKIITVEKGKRIVIYALRDIA 1245
Query: 905 AGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
EE+TY+YKF E +IPC CG+ C G LN
Sbjct: 1246 QNEELTYDYKFEREIGSTDRIPCLCGTAACKGFLN 1280
>gi|453082196|gb|EMF10244.1| hypothetical protein SEPMUDRAFT_151237 [Mycosphaerella populorum
SO2202]
Length = 1254
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 94/185 (50%), Positives = 126/185 (68%), Gaps = 14/185 (7%)
Query: 766 SARTNRVKLRNL----------LAAAEGAEL-LKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+NRV R L LA+ A++ ++ +Q+K RKK ++F RS IH WGL A
Sbjct: 1070 TSRSNRVNNRRLVNDINLQKQSLASTGDADVAIRFNQLKKRKKLVKFDRSAIHGWGLYAE 1129
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I D +IEYVGE +R K++D+RE +Y+K G+GSSYLFR+ D +VDATK+GGIARFI
Sbjct: 1130 ENIAINDLIIEYVGEKVRQKVADMREIKYDKQGVGSSYLFRMLDDEIVDATKKGGIARFI 1189
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
NHSC+PNC K+I VEG +I IYA + I+ +E+TY+YKF E +IPC CGS C
Sbjct: 1190 NHSCSPNCTAKIIKVEGTPRIVIYALKDISKNDELTYDYKFEREIGATDRIPCLCGSANC 1249
Query: 932 HGSLN 936
G LN
Sbjct: 1250 KGFLN 1254
>gi|68473736|ref|XP_718971.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|68473945|ref|XP_718869.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|74586641|sp|Q5ABG1.1|SET1_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46440662|gb|EAK99965.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|46440768|gb|EAL00070.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
Length = 1040
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/179 (53%), Positives = 124/179 (69%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A A G+E +L + + RKK + F RS IH+WGL A+EPI A+
Sbjct: 862 SSRVNRANNRRFAADITAQIGSESDVLSLNALTKRKKPVTFARSAIHNWGLYAMEPIAAK 921
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR ++++ RE Y K GIGSSYLFR+DD V+DATK+GGIARFINH C+P
Sbjct: 922 EMIIEYVGERIRQQVAEHREKSYLKTGIGSSYLFRIDDNTVIDATKKGGIARFINHCCSP 981
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+C K+I VEG+K+I IYA R I A EE+TY+YKF E E++I C CG+ C G LN
Sbjct: 982 SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1040
>gi|406607680|emb|CCH40952.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 1071
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 93/178 (52%), Positives = 118/178 (66%), Gaps = 7/178 (3%)
Query: 766 SARTNRVKLRNLLAAA----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
S+R NR R E+L +Q+ RKK + F RS IH+WGL ALEPI ++
Sbjct: 894 SSRVNRANNRRFAYEVSTFNNQNEILTLNQLNKRKKPVSFARSAIHNWGLYALEPIAQKE 953
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
+IEYVGE IR +++D RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH C P+
Sbjct: 954 MIIEYVGERIRQQVADFREKAYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHCCQPS 1013
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
C K+I VEGQK+I IYA R I A EE+TY+YKF E +++ C CG+ C G LN
Sbjct: 1014 CTAKIIKVEGQKRIVIYALRDIGANEELTYDYKFERETNDNERVRCLCGAPGCKGYLN 1071
>gi|238879404|gb|EEQ43042.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1040
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/179 (53%), Positives = 124/179 (69%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A A G+E +L + + RKK + F RS IH+WGL A+EPI A+
Sbjct: 862 SSRVNRANNRRFAADITAQIGSESDVLSLNALTKRKKPVTFARSAIHNWGLYAMEPIAAK 921
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR ++++ RE Y K GIGSSYLFR+DD V+DATK+GGIARFINH C+P
Sbjct: 922 EMIIEYVGERIRQQVAEHREKSYLKTGIGSSYLFRIDDNTVIDATKKGGIARFINHCCSP 981
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+C K+I VEG+K+I IYA R I A EE+TY+YKF E E++I C CG+ C G LN
Sbjct: 982 SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1040
>gi|449295340|gb|EMC91362.1| hypothetical protein BAUCODRAFT_80239 [Baudoinia compniacensis UAMH
10762]
Length = 1279
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/153 (56%), Positives = 110/153 (71%), Gaps = 3/153 (1%)
Query: 787 LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
++ +Q+K RKK ++F RS IH WGL A E I D +IEYVGE +R K++D+RE RYEK
Sbjct: 1127 VRFNQLKKRKKLVKFDRSAIHGWGLYAEENITVNDLIIEYVGEKVRQKVADLRELRYEKQ 1186
Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
G+GSSYLFR+ D +VDATK+GGIARFINHSC+PNC K+I VEG +I IYA + I
Sbjct: 1187 GVGSSYLFRMMDDEIVDATKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKN 1246
Query: 907 EEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
EE+TY+YKF E +IPC CG+ C G LN
Sbjct: 1247 EELTYDYKFEREYGSTDRIPCLCGTANCKGFLN 1279
>gi|170095481|ref|XP_001878961.1| histone methyltransferase [Laccaria bicolor S238N-H82]
gi|164646265|gb|EDR10511.1| histone methyltransferase [Laccaria bicolor S238N-H82]
Length = 144
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 87/144 (60%), Positives = 109/144 (75%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
+ RKK LRF RS IHDWGL A+E I + VIEYVGE+IR ++++ RE YE+ GIGSSY
Sbjct: 1 QTRKKHLRFARSPIHDWGLYAMEKISKGEMVIEYVGEVIRAQVAEKREKTYERQGIGSSY 60
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
LFR+D+ VVDATK+G + R INHSC+PNC K+I++ G+KKI IYAK+ I G+EITY+
Sbjct: 61 LFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGDEITYD 120
Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
Y FP E+ KI C CGS KC G LN
Sbjct: 121 YHFPFEQDKILCLCGSVKCRGFLN 144
>gi|401625463|gb|EJS43472.1| set1p [Saccharomyces arboricola H-6]
Length = 1089
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 93/182 (51%), Positives = 122/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 908 SSRDNRASNRRFQQNIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 967
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
++ +IEYVGE IR ++++RE RY K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 968 ATKEMIIEYVGERIRQPVAEMRERRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1027
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+PNC K+I V G+++I IYA R I A EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1028 CDPNCTAKIIKVGGRRRIVIYALRDIGANEELTYDYKFEREQDDEERLPCLCGAPNCKGF 1087
Query: 935 LN 936
LN
Sbjct: 1088 LN 1089
>gi|448520177|ref|XP_003868242.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis Co
90-125]
gi|380352581|emb|CCG22808.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis]
Length = 1038
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 92/179 (51%), Positives = 122/179 (68%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLAAAEG-----AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A +E++ + + RKK + F RS IH+WGL A+EPI A+
Sbjct: 860 SSRVNRANNRRFAADITAQIGSESEVMSLNALTKRKKPVTFARSTIHNWGLYAMEPIAAK 919
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR ++++ RE Y + GIGSSYLFR+D+ V+DATK+GGIARFINH CNP
Sbjct: 920 EMIIEYVGERIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGIARFINHCCNP 979
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+C K+I VEG+K+I IYA R I A EE+TY+YKF E E++I C CG+ C G LN
Sbjct: 980 SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1038
>gi|401842102|gb|EJT44375.1| SET1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 1087
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 93/182 (51%), Positives = 122/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R E +ELL +Q+ RKK + F RS IH+WGL AL+ I
Sbjct: 906 SSRDNRASNRRFQQNIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 965
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++RE RY GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 966 AAKEMIIEYVGERIRQPVAEMRERRYLMNGIGSSYLFRVDESTVIDATKKGGIARFINHC 1025
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+PNC K+I V G+++I IYA R I A EE+TY+YKF E E+++PC CG+ C G
Sbjct: 1026 CDPNCTAKIIKVGGRRRIVIYALRDIGANEELTYDYKFEREQDDEERLPCLCGASNCKGF 1085
Query: 935 LN 936
LN
Sbjct: 1086 LN 1087
>gi|354544237|emb|CCE40960.1| hypothetical protein CPAR2_109980 [Candida parapsilosis]
Length = 1042
Score = 189 bits (481), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 92/179 (51%), Positives = 122/179 (68%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLAAAEG-----AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A +E++ + + RKK + F RS IH+WGL A+EPI A+
Sbjct: 864 SSRVNRANNRRFAADITAQIGSESEVMSLNALTKRKKPVTFARSTIHNWGLYAMEPIAAK 923
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR ++++ RE Y + GIGSSYLFR+D+ V+DATK+GGIARFINH CNP
Sbjct: 924 EMIIEYVGERIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGIARFINHCCNP 983
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+C K+I VEG+K+I IYA R I A EE+TY+YKF E E++I C CG+ C G LN
Sbjct: 984 SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1042
>gi|452980621|gb|EME80382.1| hypothetical protein MYCFIDRAFT_204567 [Pseudocercospora fijiensis
CIRAD86]
Length = 1200
Score = 189 bits (481), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 91/186 (48%), Positives = 123/186 (66%), Gaps = 15/186 (8%)
Query: 766 SARTNRVKLRNLL------------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
++R+NRV R L+ + ++ ++ +Q+K RKK ++F RS IH WGL A
Sbjct: 1015 TSRSNRVNNRRLVNDINLQKQAFATSNSDADVAIRFNQLKKRKKLVKFDRSAIHGWGLYA 1074
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I D +IEYVGE +R K++D+RE +Y+K G+GSSYLFR+ D +VDATK+GGIARF
Sbjct: 1075 EENIAVNDLIIEYVGEKVRQKVADMREIKYDKQGVGSSYLFRMIDDEIVDATKKGGIARF 1134
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
INHSC+PNC K+I VEG +I IYA + I +E+TY+YKF E +IPC CGS
Sbjct: 1135 INHSCDPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDYKFEREYGSTDRIPCLCGSAN 1194
Query: 931 CHGSLN 936
C G LN
Sbjct: 1195 CKGFLN 1200
>gi|452837203|gb|EME39145.1| hypothetical protein DOTSEDRAFT_75034 [Dothistroma septosporum NZE10]
Length = 1275
Score = 189 bits (481), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 92/186 (49%), Positives = 123/186 (66%), Gaps = 15/186 (8%)
Query: 766 SARTNRVKLRNLL------------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
++R+NRV R L+ A ++ ++ +Q+K RKK ++F RS IH WGL A
Sbjct: 1090 TSRSNRVNNRRLVNDINLQKQSFATANSDADVAIRFNQLKKRKKLVKFDRSAIHGWGLYA 1149
Query: 814 LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
E I + +IEYVGE +R K++D+RE +YEK G+GSSYLFR+ D +VDATK+GGIARF
Sbjct: 1150 EENIGINELIIEYVGEKVRQKVADMREIKYEKQGVGSSYLFRMMDDEIVDATKKGGIARF 1209
Query: 874 INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
INHSC+PNC K+I VEG +I IYA + I +E+TY+YKF E +IPC CGS
Sbjct: 1210 INHSCDPNCTAKIIKVEGTPRIVIYALKDIYKNDELTYDYKFEREIGSTDRIPCLCGSAN 1269
Query: 931 CHGSLN 936
C G LN
Sbjct: 1270 CKGFLN 1275
>gi|241948091|ref|XP_002416768.1| COMPASS complex histone methyltransferase subunit, putative;
histone-lysine n-methyltransferase, h3 lysine-4 specific,
putative [Candida dubliniensis CD36]
gi|223640106|emb|CAX44352.1| COMPASS complex histone methyltransferase subunit, putative [Candida
dubliniensis CD36]
Length = 1032
Score = 189 bits (479), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 95/179 (53%), Positives = 124/179 (69%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A A G+E +L + + RKK + F RS IH+WGL A+EPI A+
Sbjct: 854 SSRVNRANNRRFAADITAQIGSESDVLSLNALTKRKKPVTFARSAIHNWGLYAMEPIAAK 913
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR ++++ RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH C+P
Sbjct: 914 EMIIEYVGERIRQQVAEHREKSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 973
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+C K+I VEG+K+I IYA R I A EE+TY+YKF E E++I C CG+ C G LN
Sbjct: 974 SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1032
>gi|294658913|ref|XP_461254.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
gi|218511781|sp|Q6BKL7.2|SET1_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|202953480|emb|CAG89643.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
Length = 1088
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/182 (51%), Positives = 120/182 (65%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R A ++L + + RKK + F RS IH+WGL ALEPI
Sbjct: 907 SSRVNRANNRRFAADISAQKQMLGSETDILNLNALTKRKKPVSFARSAIHNWGLYALEPI 966
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++ RE Y K GIGSSYLFR+D+ VVDATK+GGIARFINH
Sbjct: 967 AAKEMIIEYVGESIRQQVAEHRERSYLKTGIGSSYLFRIDENTVVDATKKGGIARFINHC 1026
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGS 934
CNP+C K+I VEG+K+I IYA R I A EE+TY+YKF E ++I C CG+ C G
Sbjct: 1027 CNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFEKETNDAERIRCLCGAPGCKGY 1086
Query: 935 LN 936
LN
Sbjct: 1087 LN 1088
>gi|388853505|emb|CCF52904.1| related to regulatory protein SET1 [Ustilago hordei]
Length = 1489
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 88/177 (49%), Positives = 119/177 (67%), Gaps = 11/177 (6%)
Query: 766 SARTNRVKLRNLL--------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
SAR NR R L+ A ++ K +Q++ RKK+L+F +S IHDWGL A+E I
Sbjct: 1309 SARNNRADSRRLVLGIEQHKRETATDTDIFKFNQLRTRKKQLKFAKSPIHDWGLYAMEYI 1368
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A D VIEYVGE++R +++D RE +YE+ G S+YLFR+DD VVDAT +G IAR +NH
Sbjct: 1369 PAGDMVIEYVGEMVRQQVADNREKQYERQGNFSTYLFRVDDDLVVDATHKGNIARLMNHC 1428
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYCGSKKC 931
C PNC K+++V G+K+I ++AK I AGEE+TY+YKF +E IPC CGS C
Sbjct: 1429 CTPNCNAKILTVNGEKRIVLFAKSPIKAGEELTYDYKFQSSADDEDAIPCLCGSDGC 1485
>gi|452820772|gb|EME27810.1| histone-lysine N-methyltransferase isoform 2 [Galdieria
sulphuraria]
Length = 797
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/178 (51%), Positives = 122/178 (68%), Gaps = 7/178 (3%)
Query: 766 SARTNRVKLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
SAR +R + R L ++ L SQ+K+RKKR+RF RS IH WGL A++ IE
Sbjct: 620 SARESRQEYRKLRQGVGELGIRHSDSLIISQLKSRKKRVRFGRSIIHGWGLYAMQDIEPN 679
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+F+IEYVGE+IR KISD RE RY + GIG SY+FRLD+ ++DAT++G +ARF+NHSC
Sbjct: 680 EFIIEYVGEIIRQKISDEREKRYFRQGIGDSYMFRLDEDQIIDATRKGSVARFVNHSCES 739
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
N K+I+++ KKI Y+KR I AGEEITY+YKF E++ KI C CG+ C LN
Sbjct: 740 NAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYKFNTEDENNKILCLCGAPTCRKFLN 797
>gi|452820773|gb|EME27811.1| histone-lysine N-methyltransferase isoform 1 [Galdieria
sulphuraria]
Length = 769
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/178 (51%), Positives = 122/178 (68%), Gaps = 7/178 (3%)
Query: 766 SARTNRVKLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
SAR +R + R L ++ L SQ+K+RKKR+RF RS IH WGL A++ IE
Sbjct: 592 SARESRQEYRKLRQGVGELGIRHSDSLIISQLKSRKKRVRFGRSIIHGWGLYAMQDIEPN 651
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+F+IEYVGE+IR KISD RE RY + GIG SY+FRLD+ ++DAT++G +ARF+NHSC
Sbjct: 652 EFIIEYVGEIIRQKISDEREKRYFRQGIGDSYMFRLDEDQIIDATRKGSVARFVNHSCES 711
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
N K+I+++ KKI Y+KR I AGEEITY+YKF E++ KI C CG+ C LN
Sbjct: 712 NAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYKFNTEDENNKILCLCGAPTCRKFLN 769
>gi|313228693|emb|CBY17844.1| unnamed protein product [Oikopleura dioica]
Length = 473
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 140/232 (60%), Gaps = 12/232 (5%)
Query: 711 SDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV-----NASQWANGKGL 765
S GCAR+ G+ + W + R++ + L PE +A+ G
Sbjct: 248 SSGCARTE--GYFKYTWEEKL----QMRLKLHRLGQLTDLKPEATSQGNDAAGRKEGVQA 301
Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
+ R R R + + +++ +Q+ RKK+++F +S IH WGL ALE I A+DFVIE
Sbjct: 302 TDRETRNYHRRVADEFSDSSIMRYNQLTMRKKKVKFMKSAIHGWGLFALEDIPADDFVIE 361
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
YVG+ IR+ I+D RE Y K GIGSSYLFRLD +V+DATK G +RFINH C PNC K
Sbjct: 362 YVGQKIRIGIADNREEEYTKCGIGSSYLFRLDTTHVIDATKHGSSSRFINHCCVPNCIAK 421
Query: 886 VIS-VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VI+ +G+KKI IY+K IA E+TY+YKFP+E++KI C C + C G LN
Sbjct: 422 VINGADGEKKIIIYSKAPIAKNAEVTYDYKFPIEDEKIRCLCFHENCRGYLN 473
>gi|344304500|gb|EGW34732.1| hypothetical protein SPAPADRAFT_133304 [Spathaspora passalidarum NRRL
Y-27907]
Length = 1060
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 92/179 (51%), Positives = 121/179 (67%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLA-----AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A +++L + + RKK + F RS IH+WGL A+EPI A+
Sbjct: 882 SSRVNRANNRRFAADITAQIGNESDILSLNALTKRKKPVTFARSAIHNWGLYAMEPIAAK 941
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR +++ RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH C+P
Sbjct: 942 EMIIEYVGERIRQQVAQHREESYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 1001
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+C K+I VEG+K+I IYA R I A EE+TY+YKF E E++I C CG+ C G LN
Sbjct: 1002 SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1060
>gi|255730355|ref|XP_002550102.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
gi|240132059|gb|EER31617.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
Length = 1056
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 96/179 (53%), Positives = 122/179 (68%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A A G E +L + + RKK + F RS IH+WGL ALEPI A+
Sbjct: 878 SSRVNRANNRRFAADITAQIGTESDVLSLNALTKRKKPVTFARSAIHNWGLYALEPIAAK 937
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR ++++ RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH C+P
Sbjct: 938 EMIIEYVGERIRQQVAEHREKSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 997
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+C K+I VEG K+I IYA R I A EE+TY+YKF E E++I C CG+ C G LN
Sbjct: 998 SCTAKIIKVEGIKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1056
>gi|443894422|dbj|GAC71770.1| histone H3 (Lys4) methyltransferase complex, subunit SET1 [Pseudozyma
antarctica T-34]
Length = 1366
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 89/184 (48%), Positives = 122/184 (66%), Gaps = 12/184 (6%)
Query: 760 ANGKGL-SARTNRVKLRNLLAAAE--------GAELLKASQVKARKKRLRFQRSKIHDWG 810
AN + L SAR NR R L+ E ++ K +Q++ RKK+L+F +S IHDWG
Sbjct: 1179 ANAQVLQSARNNRADSRRLVLGIEQHKRETATDTDMFKFNQLRTRKKQLKFAKSPIHDWG 1238
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L A+E I A D VIEYVGE++R +++D RE +YE+ G S+YLFR+DD VVDAT +G I
Sbjct: 1239 LYAMELIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRVDDDLVVDATHKGNI 1298
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYCG 927
AR +NH C PNC K++++ G+K+I ++AK I GEE+TY+YKF +E IPC CG
Sbjct: 1299 ARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRPGEELTYDYKFQSSADDEDAIPCLCG 1358
Query: 928 SKKC 931
S C
Sbjct: 1359 SPGC 1362
>gi|343429488|emb|CBQ73061.1| related to regulatory protein SET1 [Sporisorium reilianum SRZ2]
Length = 1453
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 87/177 (49%), Positives = 119/177 (67%), Gaps = 11/177 (6%)
Query: 766 SARTNRVKLRNLL--------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
SAR NR R L+ A ++ K +Q++ RKK+L+F +S IHDWGL A+E I
Sbjct: 1273 SARNNRADSRRLVLGIEQHKRETATDTDMFKFNQLRTRKKQLKFAKSPIHDWGLYAMEFI 1332
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A D VIEYVGE++R +++D RE +YE+ G S+YLFR+DD VVDAT +G IAR +NH
Sbjct: 1333 PAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRVDDDLVVDATHKGNIARLMNHC 1392
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYCGSKKC 931
C PNC K++++ G+K+I ++AK I AGEE+TY+YKF +E IPC CGS C
Sbjct: 1393 CTPNCNAKILTLNGEKRIVLFAKSPIRAGEELTYDYKFQSSADDEDAIPCLCGSPGC 1449
>gi|190344535|gb|EDK36223.2| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
Length = 1055
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 93/182 (51%), Positives = 122/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R L+A ++L + + RKK + F RS IH+WGL ALE I
Sbjct: 874 SSRVNRANNRRLVADISAQKQMLGSETDILDLNALTKRKKPVTFARSAIHNWGLYALESI 933
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++ RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 934 AAKEMIIEYVGESIRQQVAEHREKSYLKTGIGSSYLFRIDENSVIDATKKGGIARFINHC 993
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
CNP+C K+I VEG+K+I IYA R I A EE+TY+YKF E +++I C CG+ C G
Sbjct: 994 CNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDDERIRCLCGAPGCKGY 1053
Query: 935 LN 936
LN
Sbjct: 1054 LN 1055
>gi|146422003|ref|XP_001486944.1| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
Length = 1055
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 93/182 (51%), Positives = 122/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R L+A ++L + + RKK + F RS IH+WGL ALE I
Sbjct: 874 SSRVNRANNRRLVADISAQKQMLGSETDILDLNALTKRKKPVTFARSAIHNWGLYALESI 933
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++ RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 934 AAKEMIIEYVGESIRQQVAEHREKSYLKTGIGSSYLFRIDENSVIDATKKGGIARFINHC 993
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
CNP+C K+I VEG+K+I IYA R I A EE+TY+YKF E +++I C CG+ C G
Sbjct: 994 CNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDDERIRCLCGAPGCKGY 1053
Query: 935 LN 936
LN
Sbjct: 1054 LN 1055
>gi|313228694|emb|CBY17845.1| unnamed protein product [Oikopleura dioica]
Length = 1034
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 140/232 (60%), Gaps = 12/232 (5%)
Query: 711 SDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV-----NASQWANGKGL 765
S GCAR+ G+ + W + R++ + L PE +A+ G
Sbjct: 809 SSGCARTE--GYFKYTWEEKL----QMRLKLHRLGQLTDLKPEATSQGNDAAGRKEGVQA 862
Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
+ R R R + + +++ +Q+ RKK+++F +S IH WGL ALE I A+DFVIE
Sbjct: 863 TDRETRNYHRRVADEFSDSSIMRYNQLTMRKKKVKFMKSAIHGWGLFALEDIPADDFVIE 922
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
YVG+ IR+ I+D RE Y K GIGSSYLFRLD +V+DATK G +RFINH C PNC K
Sbjct: 923 YVGQKIRIGIADNREEEYTKCGIGSSYLFRLDTTHVIDATKHGSSSRFINHCCVPNCIAK 982
Query: 886 VIS-VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
VI+ +G+KKI IY+K IA E+TY+YKFP+E++KI C C + C G LN
Sbjct: 983 VINGADGEKKIIIYSKAPIAKNAEVTYDYKFPIEDEKIRCLCFHENCRGYLN 1034
>gi|71015569|ref|XP_758824.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
gi|74702458|sp|Q4PB36.1|SET1_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46098614|gb|EAK83847.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
Length = 1468
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 87/177 (49%), Positives = 119/177 (67%), Gaps = 11/177 (6%)
Query: 766 SARTNRVKLRNLL--------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
SAR NR R L+ A ++ K +Q++ RKK+L+F +S IHDWGL A+E I
Sbjct: 1288 SARNNRADSRRLVLGIEQHKRETATDTDIFKFNQLRTRKKQLKFAKSPIHDWGLYAMELI 1347
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A D VIEYVGE++R +++D RE +YE+ G S+YLFR+DD VVDAT +G IAR +NH
Sbjct: 1348 PAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRVDDDLVVDATHKGNIARLMNHC 1407
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYCGSKKC 931
C PNC K++++ G+K+I ++AK I AGEE+TY+YKF +E IPC CGS C
Sbjct: 1408 CTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQSSADDEDAIPCLCGSPGC 1464
>gi|150866258|ref|XP_001385792.2| histone methyltransferase involved in gene regulation
[Scheffersomyces stipitis CBS 6054]
gi|149387514|gb|ABN67763.2| histone methyltransferase involved in gene regulation
[Scheffersomyces stipitis CBS 6054]
Length = 1055
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 92/179 (51%), Positives = 119/179 (66%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLAAAEG-----AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A E+L + + RKK + F RS IH+WGL ALEPI A+
Sbjct: 877 SSRVNRANNRRFAADITAQLGNETEVLSLNTLTKRKKPVSFARSAIHNWGLYALEPIAAK 936
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR ++++ RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH C+P
Sbjct: 937 EMIIEYVGESIRQQVAEHREKSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 996
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGSLN 936
+C K+I V+ QK+I IYA R I A EE+TY+YKF E ++I C CG+ C G LN
Sbjct: 997 SCTAKIIKVDNQKRIVIYALRDIDANEELTYDYKFERETNDAERIRCLCGAPGCKGYLN 1055
>gi|344228738|gb|EGV60624.1| histone H3-K4 methyltransferase Set1 [Candida tenuis ATCC 10573]
Length = 1037
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/182 (51%), Positives = 123/182 (67%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLA------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R A A G+E +L + + RKK + F RS IH+WGL ALEPI
Sbjct: 856 SSRVNRANNRRFAADISAQKQAIGSETDILSLNTLAKRKKPVSFARSAIHNWGLYALEPI 915
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++ RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 916 AAKEMIIEYVGESIRQQVAEHRERSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHC 975
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+P+C K+I V+G+K+I IYA R I A EE+TY+YKF E ++I C CG+ C G
Sbjct: 976 CSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERETNDSERIRCLCGAPGCKGY 1035
Query: 935 LN 936
LN
Sbjct: 1036 LN 1037
>gi|388580816|gb|EIM21128.1| histone methyltransferase, partial [Wallemia sebi CBS 633.66]
Length = 144
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 82/144 (56%), Positives = 108/144 (75%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
+ RKK LRF +S IH WGL + + I + VIEYVGE++R +++D RE YEK GIGSSY
Sbjct: 1 RTRKKELRFAKSSIHSWGLYSCQVIPKGEMVIEYVGEVVRQQVADRREKAYEKQGIGSSY 60
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
LF++DD +VDAT +G +AR INH C PNC K+I++ G+KKI IYAK I+ G+EITY+
Sbjct: 61 LFKIDDDNIVDATMKGSVARLINHCCQPNCTAKIITILGEKKIIIYAKTEISPGDEITYD 120
Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
Y FP+E++KIPC CG + C GSLN
Sbjct: 121 YHFPIEDEKIPCLCGVEGCRGSLN 144
>gi|260944792|ref|XP_002616694.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
gi|238850343|gb|EEQ39807.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
Length = 469
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 124/189 (65%), Gaps = 12/189 (6%)
Query: 760 ANGKGL-SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWG 810
A G L S+R NR R A +++L + + RKK + + RS IH+WG
Sbjct: 281 AGGNSLQSSRVNRANNRRFAADISAQKQILSTESDILNLNALTKRKKPVSYARSAIHNWG 340
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
L ALEPI A++ +IEYVGE IR ++++ RE Y + GIGSSYLFR+D+ V+DATK+GGI
Sbjct: 341 LYALEPIAAKEMIIEYVGESIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGI 400
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCG 927
ARFINH CNP+C K+I V+G+K+I IYA R I A EE+TY+YKF E ++I C CG
Sbjct: 401 ARFINHCCNPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERETNDAERIRCLCG 460
Query: 928 SKKCHGSLN 936
+ C G LN
Sbjct: 461 APGCKGYLN 469
>gi|149243887|ref|XP_001526541.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146448935|gb|EDK43191.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1156
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 92/179 (51%), Positives = 123/179 (68%), Gaps = 8/179 (4%)
Query: 766 SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
S+R NR R A A G E +L + + RKK + F RS IH+WGL A+EPI A+
Sbjct: 978 SSRVNRANNRRFAADITAQIGTESDVLSLNALTKRKKPVTFARSSIHNWGLYAMEPIAAK 1037
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
+ +IEYVGE IR ++++ RE Y + GIGSSYLFR+D+ V+DATK+GGIARFINH C+P
Sbjct: 1038 EMIIEYVGERIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 1097
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
+C K+I V+G+K+I IYA R I A EE+TY+YKF E +++I C CG+ C G LN
Sbjct: 1098 SCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERETNDDERIRCLCGAPGCKGFLN 1156
>gi|224086152|ref|XP_002307834.1| SET domain protein [Populus trichocarpa]
gi|222857283|gb|EEE94830.1| SET domain protein [Populus trichocarpa]
Length = 90
Score = 184 bits (466), Expect = 2e-43, Method: Composition-based stats.
Identities = 85/90 (94%), Positives = 86/90 (95%)
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
RE YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC PNCYTKVISVEGQKKIFIY
Sbjct: 1 RERLYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIY 60
Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGS 928
AKRHIAAGEEITYNYKFPLE+KKIPC CGS
Sbjct: 61 AKRHIAAGEEITYNYKFPLEDKKIPCNCGS 90
>gi|448106516|ref|XP_004200765.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|448109616|ref|XP_004201396.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|359382187|emb|CCE81024.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|359382952|emb|CCE80259.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
Length = 1062
Score = 183 bits (465), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 91/182 (50%), Positives = 120/182 (65%), Gaps = 11/182 (6%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
S+R NR R A ++L + + RKK + F RS IH+WGL ALEPI
Sbjct: 881 SSRVNRANNRRFAADISAQKQMIGSETDILNLNALTKRKKPVSFARSAIHNWGLYALEPI 940
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
A++ +IEYVGE IR ++++ RE Y K GIGSSYLFR+D+ V+DATK+GGIARFINH
Sbjct: 941 AAKEMIIEYVGESIRQQVAEHRERSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHC 1000
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
C+P+C K+I V+G+K+I IYA R I EE+TY+YKF E E++I C CG+ C G
Sbjct: 1001 CSPSCTAKIIKVDGKKRIVIYALRDIDKNEELTYDYKFERETNDEERIRCLCGAPGCKGY 1060
Query: 935 LN 936
LN
Sbjct: 1061 LN 1062
>gi|321257952|ref|XP_003193759.1| hypothetical protein CGB_D6310C [Cryptococcus gattii WM276]
gi|317460229|gb|ADV21972.1| conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1477
Score = 180 bits (456), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 82/156 (52%), Positives = 110/156 (70%), Gaps = 3/156 (1%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
++L +Q++ RKK+LRF RS I +GL A+E I A + V EYVG+L+R ++D+RE RY
Sbjct: 1322 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1381
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
K GIGSSYLFR+D+ V DAT +G ++R INHSC+P+ K+I V GQ KI IYA+R +
Sbjct: 1382 MKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTL 1441
Query: 904 AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
GEEI Y+YKFPLE ++PC CG+ C G LN
Sbjct: 1442 YPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 1477
>gi|357139465|ref|XP_003571302.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-4
specific-like [Brachypodium distachyon]
Length = 319
Score = 179 bits (455), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 80/145 (55%), Positives = 108/145 (74%), Gaps = 1/145 (0%)
Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
+ AR+K+LR +RSKIH+ G++A+ I + ++EY+GE + ++D+R RYEK G G
Sbjct: 176 LSARRKKLRVERSKIHELGVMAVTDIMKGELIVEYIGERMPKWVADLRGLRYEKAGKGD- 234
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
Y F++D G V+DAT RGGIAR+INHSC PNC T+VI GQ++IFIYA + I AG E+TY
Sbjct: 235 YFFKIDAGLVIDATLRGGIARYINHSCEPNCETRVILSNGQRRIFIYANQKIKAGTELTY 294
Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
+YKFP EE KIPC CGSK+C S+N
Sbjct: 295 DYKFPFEENKIPCSCGSKRCRKSMN 319
>gi|338819402|sp|P0CO27.1|SET1_CRYNB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
Length = 1469
Score = 179 bits (455), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 82/156 (52%), Positives = 110/156 (70%), Gaps = 3/156 (1%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
++L +Q++ RKK+LRF RS I +GL A+E I A + V EYVG+L+R ++D+RE RY
Sbjct: 1314 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1373
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
K GIGSSYLFR+D+ V DAT +G ++R INHSC+P+ K+I V GQ KI IYA+R +
Sbjct: 1374 LKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTL 1433
Query: 904 AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
GEEI Y+YKFPLE ++PC CG+ C G LN
Sbjct: 1434 YPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 1469
>gi|58266804|ref|XP_570558.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
gi|338819403|sp|P0CO26.1|SET1_CRYNJ RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|57226791|gb|AAW43251.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1469
Score = 179 bits (455), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 82/156 (52%), Positives = 110/156 (70%), Gaps = 3/156 (1%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
++L +Q++ RKK+LRF RS I +GL A+E I A + V EYVG+L+R ++D+RE RY
Sbjct: 1314 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1373
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
K GIGSSYLFR+D+ V DAT +G ++R INHSC+P+ K+I V GQ KI IYA+R +
Sbjct: 1374 LKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTL 1433
Query: 904 AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
GEEI Y+YKFPLE ++PC CG+ C G LN
Sbjct: 1434 YPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 1469
>gi|134110798|ref|XP_775863.1| hypothetical protein CNBD2720 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258529|gb|EAL21216.1| hypothetical protein CNBD2720 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1483
Score = 179 bits (454), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 82/156 (52%), Positives = 110/156 (70%), Gaps = 3/156 (1%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
++L +Q++ RKK+LRF RS I +GL A+E I A + V EYVG+L+R ++D+RE RY
Sbjct: 1328 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1387
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
K GIGSSYLFR+D+ V DAT +G ++R INHSC+P+ K+I V GQ KI IYA+R +
Sbjct: 1388 LKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTL 1447
Query: 904 AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
GEEI Y+YKFPLE ++PC CG+ C G LN
Sbjct: 1448 YPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 1483
>gi|345490501|ref|XP_001604610.2| PREDICTED: hypothetical protein LOC100121025 [Nasonia vitripennis]
Length = 1677
Score = 179 bits (453), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 92/190 (48%), Positives = 120/190 (63%), Gaps = 31/190 (16%)
Query: 751 GPEVNASQWANGK--GLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
GP+VN S+ GK LS AR+N+ +L ++LLK +Q+K
Sbjct: 1515 GPKVN-SKALTGKMQALSREARSNQRRLLTAFGIDTDSDLLKFNQLKXXX---------- 1563
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
VIEYVG++IR ++D+RE +YE GIGSSYLFR+D ++DATK
Sbjct: 1564 ----------------VIEYVGQMIRPIVADLRETQYEATGIGSSYLFRIDLDTIIDATK 1607
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
G +ARFINHSCNPNCY KVI++E QKKI IY+K+ I EEITY+YKFPLE+ KIPC C
Sbjct: 1608 CGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLC 1667
Query: 927 GSKKCHGSLN 936
G+ +C G+LN
Sbjct: 1668 GAPQCRGTLN 1677
>gi|46126135|ref|XP_387621.1| hypothetical protein FG07445.1 [Gibberella zeae PH-1]
Length = 1252
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/182 (50%), Positives = 117/182 (64%), Gaps = 22/182 (12%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1082 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1141
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+DD V+DATK+GGIARFINH
Sbjct: 1142 AKDDMIIEYVGEQVRQQISEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINH- 1200
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
S EG K+I IYA R IA EE+TY+YKF E +IPC CG+ C G
Sbjct: 1201 ----------SFEGSKRIVIYALRDIALNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1250
Query: 935 LN 936
LN
Sbjct: 1251 LN 1252
>gi|224003381|ref|XP_002291362.1| set domain-containing protein [Thalassiosira pseudonana CCMP1335]
gi|220973138|gb|EED91469.1| set domain-containing protein, partial [Thalassiosira pseudonana
CCMP1335]
Length = 189
Score = 176 bits (445), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 85/178 (47%), Positives = 121/178 (67%), Gaps = 4/178 (2%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G+ + A R+ L++L + + ++ + + R++RLRF +S IH WG+ A E I A D
Sbjct: 7 GRAVRASQRRM-LKSLASLGDASKTVDRLVSRDREERLRFDKSLIHGWGVFAEEAINAGD 65
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
+IEY GELI ++D RE YE+ IGS Y+FR+D+ +V DATK G +ARFIN SC PN
Sbjct: 66 MIIEYRGELIGNAVADKRELEYERAKIGSDYMFRIDEFWVCDATKLGNVARFINASCCPN 125
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
CYTK+I+ K+I I+AK++I GEE+ Y+YKF +E EK+IPCYCGS +C G +N
Sbjct: 126 CYTKIITANETKRIVIFAKKNIQRGEELCYDYKFQMEHEKEKRIPCYCGSSECRGFMN 183
>gi|443918070|gb|ELU38649.1| Setd1a protein [Rhizoctonia solani AG-1 IA]
Length = 425
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 81/132 (61%), Positives = 101/132 (76%)
Query: 780 AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIR 839
AA+ ++LK +Q++ RKK+L F RS IHDWGL A EPI A D VIEYVGE+IR +++D R
Sbjct: 198 AADAGDVLKFNQLRTRKKQLTFARSPIHDWGLYAAEPIPAGDMVIEYVGEVIRQQVADKR 257
Query: 840 ECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYA 899
E YEK GIGSSYLFR+DD VVDATK+G + R INH C PNC K+I++ G+KKI IYA
Sbjct: 258 EKYYEKTGIGSSYLFRVDDDSVVDATKKGNLGRLINHCCAPNCTAKIITINGEKKIVIYA 317
Query: 900 KRHIAAGEEITY 911
K +I G+EITY
Sbjct: 318 KTNIDVGDEITY 329
>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
Length = 4387
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 78/138 (56%), Positives = 95/138 (68%), Gaps = 4/138 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM----GIGSSYLFRLDD 858
RS IH GL IE D VIEY GE+IR ++D RE +YE M G+G Y+FR+DD
Sbjct: 4250 RSHIHGRGLFCKRDIEEGDMVIEYAGEVIRAVLADQREKKYEAMSGRRGVGGCYMFRIDD 4309
Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE 918
VVDAT +G ARFINHSC+PNCY++V+ + G K I I+A R I GEE+TY+YKFP E
Sbjct: 4310 NLVVDATLKGNAARFINHSCDPNCYSRVVDIHGHKHILIFALRRITIGEELTYDYKFPFE 4369
Query: 919 EKKIPCYCGSKKCHGSLN 936
E KIPC CG+KKC LN
Sbjct: 4370 EVKIPCTCGAKKCRKYLN 4387
>gi|406694364|gb|EKC97692.1| hypothetical protein A1Q2_08004 [Trichosporon asahii var. asahii CBS
8904]
Length = 1218
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 85/171 (49%), Positives = 112/171 (65%), Gaps = 4/171 (2%)
Query: 769 TNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG 828
TNR AAE ++L +Q++ RKK+LRF RS I +GL A+E I+ + V EYVG
Sbjct: 1049 TNRKAGGGTATAAE-SDLFMFNQLRIRKKQLRFARSAIEGYGLYAMETIQPGEMVCEYVG 1107
Query: 829 ELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
EL R ++D+RE RY K GIGSSYLFR+D V DAT +G ++R INHSCNP K+I+
Sbjct: 1108 ELCRSSVADVREQRYTKQGIGSSYLFRIDGDIVCDATFKGSVSRLINHSCNPTANAKIIN 1167
Query: 889 VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
+ G KI IYAKR + G+E+TY+Y FPLE+ ++ C CG C G LN
Sbjct: 1168 INGHNKIVIYAKRTLYPGDEVTYSYNFPLEQDESLRVRCLCGEPTCLGFLN 1218
>gi|392580378|gb|EIW73505.1| hypothetical protein TREMEDRAFT_24920 [Tremella mesenterica DSM
1558]
Length = 180
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 82/161 (50%), Positives = 112/161 (69%), Gaps = 3/161 (1%)
Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
+A ++L +Q++ RKK+LRF RS I +GL A+E I+ + V EYVGE+ R +++I
Sbjct: 20 SATAESDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIQPGEMVCEYVGEICRSAVAEI 79
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
RE RY + GIGSSYLFR+D+ V DAT +G ++R INHSC+P+ K+IS+ GQ KI IY
Sbjct: 80 REQRYLRQGIGSSYLFRIDNDLVCDATFKGSVSRLINHSCDPSASAKIISINGQSKIVIY 139
Query: 899 AKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
AKR + GEEI Y+YKFPLE ++PC CG+ C G LN
Sbjct: 140 AKRTLHPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 180
>gi|194764639|ref|XP_001964436.1| GF23177 [Drosophila ananassae]
gi|190614708|gb|EDV30232.1| GF23177 [Drosophila ananassae]
Length = 3708
Score = 173 bits (439), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 79/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3576 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3634
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP EE+KI
Sbjct: 3635 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKI 3694
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3695 PCSCGSKRCRKYLN 3708
>gi|195064789|ref|XP_001996640.1| GH19675 [Drosophila grimshawi]
gi|193892772|gb|EDV91638.1| GH19675 [Drosophila grimshawi]
Length = 3837
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3705 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3763
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3764 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3823
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3824 PCSCGSKRCRKYLN 3837
>gi|195109821|ref|XP_001999480.1| GI24532 [Drosophila mojavensis]
gi|193916074|gb|EDW14941.1| GI24532 [Drosophila mojavensis]
Length = 3756
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3624 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3682
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3683 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3742
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3743 PCSCGSKRCRKYLN 3756
>gi|195392284|ref|XP_002054789.1| trx [Drosophila virilis]
gi|194152875|gb|EDW68309.1| trx [Drosophila virilis]
Length = 3822
Score = 172 bits (436), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3690 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3748
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3749 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3808
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3809 PCSCGSKRCRKYLN 3822
>gi|10720313|sp|Q24742.1|TRX_DROVI RecName: Full=Histone-lysine N-methyltransferase trithorax
gi|899254|emb|CAA90349.1| predicted trithorax protein [Drosophila virilis]
Length = 3828
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3696 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3754
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3755 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3814
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3815 PCSCGSKRCRKYLN 3828
>gi|194900731|ref|XP_001979909.1| GG21380 [Drosophila erecta]
gi|190651612|gb|EDV48867.1| GG21380 [Drosophila erecta]
Length = 3741
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3609 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3667
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP EE+KI
Sbjct: 3668 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKI 3727
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3728 PCSCGSKRCRKYLN 3741
>gi|198452207|ref|XP_002137435.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198131831|gb|EDY67993.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 3779
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3647 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3705
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3706 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3765
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3766 PCSCGSKRCRKYLN 3779
>gi|347968475|ref|XP_563394.4| AGAP002741-PA [Anopheles gambiae str. PEST]
gi|333467986|gb|EAL40845.4| AGAP002741-PA [Anopheles gambiae str. PEST]
Length = 4925
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 79/144 (54%), Positives = 99/144 (68%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
++ K+ + RS IH GL IEA + VIEY GELIR ++D RE Y+ GIG Y
Sbjct: 4783 ESSKESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-Y 4841
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+F++D+ +VVDAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+
Sbjct: 4842 MFKIDENFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYD 4901
Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
YKFP E+ KIPC CGSKKC LN
Sbjct: 4902 YKFPFEDVKIPCSCGSKKCRKYLN 4925
>gi|390178053|ref|XP_003736554.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859306|gb|EIM52627.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 3474
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3342 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3400
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3401 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3460
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3461 PCSCGSKRCRKYLN 3474
>gi|158818|gb|AAA29025.1| zinc-binding protein [Drosophila melanogaster]
Length = 3759
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3627 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3685
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3686 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKI 3745
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3746 PCSCGSKRCRKYLN 3759
>gi|17136556|ref|NP_476769.1| trithorax, isoform D [Drosophila melanogaster]
gi|19550184|ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster]
gi|290457684|sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName:
Full=Lysine N-methyltransferase 2A
gi|10726522|gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster]
gi|23171244|gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster]
Length = 3726
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3594 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3652
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3653 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3712
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3713 PCSCGSKRCRKYLN 3726
>gi|469800|emb|CAA83516.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052593|emb|CAA90513.1| trithorax protein trxII [Drosophila melanogaster]
gi|1311653|gb|AAB35873.1| large trx isoform=trithorax gene product large isoform {alternatively
spliced, exon II-containing isoform} [Drosophila,
embryos, Peptide, 3726 aa]
Length = 3726
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3594 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3652
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3653 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKI 3712
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3713 PCSCGSKRCRKYLN 3726
>gi|17136558|ref|NP_476770.1| trithorax, isoform B [Drosophila melanogaster]
gi|19550181|ref|NP_599108.1| trithorax, isoform C [Drosophila melanogaster]
gi|62472551|ref|NP_001014621.1| trithorax, isoform E [Drosophila melanogaster]
gi|23171245|gb|AAN13600.1| trithorax, isoform B [Drosophila melanogaster]
gi|23171246|gb|AAN13601.1| trithorax, isoform C [Drosophila melanogaster]
gi|61679333|gb|AAX52951.1| trithorax, isoform E [Drosophila melanogaster]
Length = 3358
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3226 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3284
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3285 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3344
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3345 PCSCGSKRCRKYLN 3358
>gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052594|emb|CAA90514.1| trithorax protein trxI [Drosophila melanogaster]
Length = 3358
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 3226 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3284
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 3285 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKI 3344
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 3345 PCSCGSKRCRKYLN 3358
>gi|122937787|gb|ABM68621.1| AAEL000054-PA [Aedes aegypti]
Length = 3489
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 79/141 (56%), Positives = 97/141 (68%), Gaps = 1/141 (0%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K+ + RS IH GL IEA + VIEY GELIR ++D RE Y+ GIG Y+F+
Sbjct: 3350 KESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFK 3408
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +VVDAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKF
Sbjct: 3409 IDEHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKF 3468
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
P E+ KIPC CGSKKC LN
Sbjct: 3469 PFEDVKIPCSCGSKKCRKYLN 3489
>gi|449016155|dbj|BAM79557.1| unknown RNA binding protein [Cyanidioschyzon merolae strain 10D]
Length = 1151
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 117/188 (62%), Gaps = 6/188 (3%)
Query: 752 PEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGL 811
P A +W+ L R RV+ R LL+A E ++ R++ LRF RS IH +GL
Sbjct: 967 PADEARRWSATGSL--RGTRVEQRQLLSAVSDREQ-TVLYLQQRERFLRFYRSGIHGYGL 1023
Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
ALE I A D++IEY GEL+R ++D+RE Y + G+G S++FR+D VVDAT G +A
Sbjct: 1024 WALEDIPAGDYIIEYRGELVRSAVADLRERAYRQQGMGDSFMFRIDADTVVDATHIGSVA 1083
Query: 872 RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGS 928
RF+NHSC+PN +++ + G I Y+KR I GEEITY+Y F +E+ +K+PC CG+
Sbjct: 1084 RFVNHSCDPNAIARIVQLGGASHILFYSKRSICVGEEITYDYNFDIEDDASEKVPCLCGA 1143
Query: 929 KKCHGSLN 936
C LN
Sbjct: 1144 PNCRQYLN 1151
>gi|157103255|ref|XP_001647894.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108884726|gb|EAT48951.1| AAEL000054-PA, partial [Aedes aegypti]
Length = 3069
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 79/141 (56%), Positives = 97/141 (68%), Gaps = 1/141 (0%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K+ + RS IH GL IEA + VIEY GELIR ++D RE Y+ GIG Y+F+
Sbjct: 2930 KESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFK 2988
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +VVDAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKF
Sbjct: 2989 IDEHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKF 3048
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
P E+ KIPC CGSKKC LN
Sbjct: 3049 PFEDVKIPCSCGSKKCRKYLN 3069
>gi|322699770|gb|EFY91529.1| Histone-lysine N-methyltransferase, H3 lysine-4 specific [Metarhizium
acridum CQMa 102]
Length = 1231
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 81/157 (51%), Positives = 109/157 (69%), Gaps = 8/157 (5%)
Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R NR R +A + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1073 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1132
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
+D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD V+DATK+GGIARFINHS
Sbjct: 1133 NKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDDTVIDATKKGGIARFINHS 1192
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
C PNC K+I VEG K+I IYA R IA ++++ K
Sbjct: 1193 CMPNCTAKIIKVEGSKRIVIYALRDIAMSKQLSLESK 1229
>gi|322706019|gb|EFY97601.1| Histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF 23]
Length = 1222
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 130/220 (59%), Gaps = 23/220 (10%)
Query: 708 CPRSDGCAR---------SSISGWEWHKWSLNASPAER---ARVRGAQYVHTKYLGPEVN 755
P + GCAR S S + H + + ER ++ G VH ++
Sbjct: 1006 VPNATGCARTEGVKKILNSEKSKYLPHHIKVQKAREERQARSKRNGKDSVHAASEAAKIA 1065
Query: 756 ASQWANGKGLSARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIH 807
A + A G ++R NR R +A + +++ K +Q+K RKK ++F RS IH
Sbjct: 1066 AEKIAKG---NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIH 1122
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
+WGL A+E I +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD V+DATK+
Sbjct: 1123 NWGLYAMENINKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDDTVIDATKK 1182
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGE 907
GGIARFINHSC PNC K+I VEG K+I IYA R IA E
Sbjct: 1183 GGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMSE 1222
>gi|425770041|gb|EKV08516.1| Histone-lysine N-methyltransferase, H3 lysine-4 specific [Penicillium
digitatum Pd1]
gi|425771732|gb|EKV10169.1| Histone-lysine N-methyltransferase, H3 lysine-4 specific [Penicillium
digitatum PHI26]
Length = 1178
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 83/152 (54%), Positives = 106/152 (69%), Gaps = 11/152 (7%)
Query: 766 SARTNRVKLRNLLA----------AAEG-AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A A G ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1026 TSRSTRVNNRRLIADINAQKQALPTANGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1085
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1086 ENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1145
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
NHSC PNC K+I V+G K+I IYA R I G
Sbjct: 1146 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERG 1177
>gi|270015132|gb|EFA11580.1| trithorax [Tribolium castaneum]
Length = 2343
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RSKIH GL L EA + VIEY GE+IR ++D RE Y GIG Y+FR+DD VV
Sbjct: 2211 RSKIHRRGLFCLRDFEAGEMVIEYSGEVIRSVLTDKREKYYNSKGIGC-YMFRIDDNLVV 2269
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC+PNCY+KV+ + G K I I+A R I GEE+TY+YKFP+EE KI
Sbjct: 2270 DATMTGNAARFINHSCDPNCYSKVVEILGHKHIIIFALRRIICGEELTYDYKFPIEEDKI 2329
Query: 923 PCYCGSKKCHGSLN 936
PC CG+++C LN
Sbjct: 2330 PCTCGTRRCRKFLN 2343
>gi|91076142|ref|XP_970289.1| PREDICTED: similar to mixed-lineage leukemia protein, mll [Tribolium
castaneum]
Length = 1824
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RSKIH GL L EA + VIEY GE+IR ++D RE Y GIG Y+FR+DD VV
Sbjct: 1692 RSKIHRRGLFCLRDFEAGEMVIEYSGEVIRSVLTDKREKYYNSKGIGC-YMFRIDDNLVV 1750
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC+PNCY+KV+ + G K I I+A R I GEE+TY+YKFP+EE KI
Sbjct: 1751 DATMTGNAARFINHSCDPNCYSKVVEILGHKHIIIFALRRIICGEELTYDYKFPIEEDKI 1810
Query: 923 PCYCGSKKCHGSLN 936
PC CG+++C LN
Sbjct: 1811 PCTCGTRRCRKFLN 1824
>gi|258564606|ref|XP_002583048.1| hypothetical protein UREG_07821 [Uncinocarpus reesii 1704]
gi|237908555|gb|EEP82956.1| hypothetical protein UREG_07821 [Uncinocarpus reesii 1704]
Length = 1311
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/159 (51%), Positives = 106/159 (66%), Gaps = 11/159 (6%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A +++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1092 TSRSTRVNNRRLIADINAQKQALPMQNGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1151
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 1152 ENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1211
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
NHSC PNC K+I V+G K+I IYA R I G Y
Sbjct: 1212 NHSCTPNCTAKIIKVDGSKRIVIYALRDIDRGNSARQKY 1250
>gi|340381930|ref|XP_003389474.1| PREDICTED: histone-lysine N-methyltransferase trithorax-like
[Amphimedon queenslandica]
Length = 192
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 78/133 (58%), Positives = 97/133 (72%), Gaps = 1/133 (0%)
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IH GL L+ I++ D VIEY G +IR ++D RE YE GIG Y+FR+D VVD
Sbjct: 61 SHIHGLGLFCLQEIDSGDMVIEYAGTVIRSTLTDYRERFYESRGIGC-YMFRIDSDEVVD 119
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
AT G +ARFINHSC PNCY+KV++V+GQKKI I+A R I GEE+TY+YKFP+EE KIP
Sbjct: 120 ATMSGNMARFINHSCEPNCYSKVVAVDGQKKIMIFALRRIVPGEELTYDYKFPIEEAKIP 179
Query: 924 CYCGSKKCHGSLN 936
C CGS +C +LN
Sbjct: 180 CKCGSARCRKTLN 192
>gi|195446231|ref|XP_002070688.1| GK10891 [Drosophila willistoni]
gi|194166773|gb|EDW81674.1| GK10891 [Drosophila willistoni]
Length = 447
Score = 167 bits (423), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 79/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 315 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 373
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP EE+KI
Sbjct: 374 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKI 433
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 434 PCSCGSKRCRKYLN 447
>gi|15292119|gb|AAK93328.1| LD39445p [Drosophila melanogaster]
Length = 751
Score = 167 bits (423), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 619 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 677
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 678 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 737
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 738 PCSCGSKRCRKYLN 751
>gi|238488134|ref|XP_002375305.1| SET domain protein [Aspergillus flavus NRRL3357]
gi|220700184|gb|EED56523.1| SET domain protein [Aspergillus flavus NRRL3357]
Length = 1024
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 81/153 (52%), Positives = 105/153 (68%), Gaps = 11/153 (7%)
Query: 766 SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
++R+ RV R L+A ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 867 TSRSTRVNNRRLIADINAQKQALPTQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 926
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+ V+DATKRGGIARFI
Sbjct: 927 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 986
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGE 907
NHSC PNC K+I V+G K+I IYA R I GE
Sbjct: 987 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERGE 1019
>gi|336385606|gb|EGO26753.1| hypothetical protein SERLADRAFT_385814 [Serpula lacrymans var.
lacrymans S7.9]
Length = 115
Score = 166 bits (421), Expect = 5e-38, Method: Composition-based stats.
Identities = 73/114 (64%), Positives = 91/114 (79%)
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
VIEYVGE+IR +++D RE YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC
Sbjct: 2 VIEYVGEVIRAQVADKREKVYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNC 61
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
K+I++ G+KKI IYAK+ I GEEITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 62 TAKIITINGEKKIVIYAKQDIELGEEITYDYHFPIEQDKIPCLCGSAKCRGYLN 115
>gi|291240989|ref|XP_002740398.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like,
partial [Saccoglossus kowalevskii]
Length = 357
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 80/144 (55%), Positives = 99/144 (68%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL PI+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 215 KTSKEAVGVYRSGIHGRGLFCKRPIDAAEMVIEYAGMVIRSSLTDKREKYYDSKGIGC-Y 273
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI V+G+K I I+A R I GEE+TY+
Sbjct: 274 MFRIDDYDVVDATMHGNAARFINHSCEPNCFSRVIQVDGKKHIVIFASRKIMPGEELTYD 333
Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
YKFP EE+KIPC CGSKKC LN
Sbjct: 334 YKFPFEEEKIPCTCGSKKCRKHLN 357
>gi|195570949|ref|XP_002103466.1| GD20433 [Drosophila simulans]
gi|194199393|gb|EDX12969.1| GD20433 [Drosophila simulans]
Length = 152
Score = 165 bits (417), Expect = 1e-37, Method: Composition-based stats.
Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 20 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 78
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 79 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 138
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 139 PCSCGSKRCRKYLN 152
>gi|405951732|gb|EKC19620.1| Histone-lysine N-methyltransferase MLL4 [Crassostrea gigas]
Length = 4493
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/185 (48%), Positives = 115/185 (62%), Gaps = 6/185 (3%)
Query: 757 SQWANGKGLSARTNRVKLRNLLAA-AEGAELLKASQVKARKKRLR----FQRSKIHDWGL 811
SQ+ G R N V++ + A A +L A + + K+ R RS IH GL
Sbjct: 4310 SQYREMPGSKERKNDVEMVHKSARRATSMDLPMAMRFRKLKEHAREAVGVYRSHIHGRGL 4369
Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
I+ + VIEY GE+IR ++D RE YE GIG Y+FR+DD V+DAT G A
Sbjct: 4370 YCKRNIDEGEMVIEYSGEVIRGSLTDKREKYYEGKGIGC-YMFRIDDYDVIDATLHGNAA 4428
Query: 872 RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKC 931
RFINHSC PNCY+KVI+V+G+K I I+A + I GEE+TY+YKFP+EE KIPC CG+KKC
Sbjct: 4429 RFINHSCEPNCYSKVINVDGKKHIVIFAMKSIKRGEELTYDYKFPIEEVKIPCTCGAKKC 4488
Query: 932 HGSLN 936
LN
Sbjct: 4489 RRYLN 4493
>gi|242221772|ref|XP_002476627.1| predicted protein [Postia placenta Mad-698-R]
gi|220724099|gb|EED78169.1| predicted protein [Postia placenta Mad-698-R]
Length = 115
Score = 164 bits (416), Expect = 2e-37, Method: Composition-based stats.
Identities = 72/114 (63%), Positives = 90/114 (78%)
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
VIEYVGE+IR +++D RE YE+ GIGSSYLFR+D+ VVDATK+G + R INHSC+PNC
Sbjct: 2 VIEYVGEIIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNC 61
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
K+I++ G+KKI IYAK+ I G EITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 62 TAKIITINGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGFLN 115
>gi|195145308|ref|XP_002013638.1| GL23289 [Drosophila persimilis]
gi|194102581|gb|EDW24624.1| GL23289 [Drosophila persimilis]
Length = 293
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 161 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 219
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+YKFP E++KI
Sbjct: 220 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 279
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 280 PCSCGSKRCRKYLN 293
>gi|195501654|ref|XP_002097885.1| GE26460 [Drosophila yakuba]
gi|194183986|gb|EDW97597.1| GE26460 [Drosophila yakuba]
Length = 343
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY GELIR ++D RE Y+ GIG Y+F++DD VV
Sbjct: 211 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 269
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT RG ARFINH C PNCY+KV+ + G K I I+A R I GEE+TY+YKFP EE+KI
Sbjct: 270 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKI 329
Query: 923 PCYCGSKKCHGSLN 936
PC CGSK+C LN
Sbjct: 330 PCSCGSKRCRKYLN 343
>gi|340710026|ref|XP_003393599.1| PREDICTED: hypothetical protein LOC100646252 [Bombus terrestris]
Length = 3530
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 82/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
A + RV NL A L + S K+ + S IH GL L IEA + VIEY
Sbjct: 3367 AESRRVASTNLPMAMRFRILKETS-----KESVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3421
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
GE+IR ++D RE Y+ IG Y+F++DD VVDAT +G ARFINHSC PNCY++V
Sbjct: 3422 AGEVIRASLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3480
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+ + G+K I I+A R I GEE+TY+YKFP E+ KIPC CGS++C LN
Sbjct: 3481 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3530
>gi|383861703|ref|XP_003706324.1| PREDICTED: uncharacterized protein LOC100882965 [Megachile rotundata]
Length = 3434
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 82/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
A + RV NL A L + S K+ + S IH GL L IEA + VIEY
Sbjct: 3271 AESRRVASTNLPMAMRFRILKETS-----KESVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3325
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
GE+IR ++D RE Y+ IG Y+F++DD VVDAT +G ARFINHSC PNCY++V
Sbjct: 3326 AGEVIRASLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3384
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+ + G+K I I+A R I GEE+TY+YKFP E+ KIPC CGS++C LN
Sbjct: 3385 VDILGKKHILIFALRRINQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3434
>gi|157126650|ref|XP_001654691.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108873214|gb|EAT37439.1| AAEL010578-PA [Aedes aegypti]
Length = 172
Score = 163 bits (413), Expect = 4e-37, Method: Composition-based stats.
Identities = 84/167 (50%), Positives = 106/167 (63%), Gaps = 5/167 (2%)
Query: 774 LRNLLAAAEGAELLKASQVKA----RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
LR + G+ L A + + K+ + RS IH GL IEA + VIEY GE
Sbjct: 7 LRKIKRRGSGSNLPMAMRYRTLKETSKESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGE 66
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
LIR ++D RE Y+ GIG Y+F++D+ +VVDAT RG ARFINHSC PNCY+KV+ +
Sbjct: 67 LIRSTLTDKRERYYDSRGIGC-YMFKIDEHFVVDATMRGNAARFINHSCEPNCYSKVVDI 125
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
G K I I+A R I GEE+TY+YKFP E+ KIPC CGSKKC LN
Sbjct: 126 LGHKHIIIFALRRIVQGEELTYDYKFPFEDVKIPCSCGSKKCRKYLN 172
>gi|350413847|ref|XP_003490133.1| PREDICTED: hypothetical protein LOC100748492 [Bombus impatiens]
Length = 3522
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 82/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
A + RV NL A L + S K+ + S IH GL L IEA + VIEY
Sbjct: 3359 AESRRVASTNLPMAMRFRILKETS-----KESVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3413
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
GE+IR ++D RE Y+ IG Y+F++DD VVDAT +G ARFINHSC PNCY++V
Sbjct: 3414 AGEVIRASLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3472
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+ + G+K I I+A R I GEE+TY+YKFP E+ KIPC CGS++C LN
Sbjct: 3473 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3522
>gi|307180358|gb|EFN68384.1| Histone-lysine N-methyltransferase trithorax [Camponotus floridanus]
Length = 3218
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 83/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
A + RV NL A L + S KK + S IH GL L IEA + VIEY
Sbjct: 3055 AESRRVASTNLPMAMRFRILKETS-----KKSVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3109
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
GE+IR ++D RE Y+ IG Y+F++DD VVDAT +G ARFINHSC PNCY++V
Sbjct: 3110 AGEVIRSSLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3168
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+ + G+K I I+A R I GEE+TY+YKFP E+ KIPC CGS++C LN
Sbjct: 3169 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3218
>gi|241687915|ref|XP_002401626.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215504523|gb|EEC14017.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 1036
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 77/141 (54%), Positives = 97/141 (68%), Gaps = 1/141 (0%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K+ + RS IH GL I+ + +IEY GE+IR ++D RE YE GIG Y+FR
Sbjct: 897 KEAVGVYRSSIHGRGLYCKRNIDGGEMIIEYAGEVIRAALTDKREKYYESKGIGC-YMFR 955
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+DD VVDAT G ARFINHSC PNCY+KVI+V+ +K I I+A R I GEE+TY+YKF
Sbjct: 956 IDDHEVVDATMHGNAARFINHSCEPNCYSKVITVDNKKHIVIFALRSILKGEELTYDYKF 1015
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
P+EE KIPC CGS++C LN
Sbjct: 1016 PIEEVKIPCSCGSRRCRKFLN 1036
>gi|403360488|gb|EJY79922.1| Histone-lysine N-methyltransferase [Oxytricha trifallax]
Length = 2438
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 5/199 (2%)
Query: 741 GAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAEL---LKASQVKARKK 797
G + ++K E+N ++ N + + +++L + + + +L +K KA+ K
Sbjct: 2242 GKKMFYSKSHQDEINENK--NQQKSTVLAKKIRLTDTIDTNKEIDLPIAMKYRLSKAQPK 2299
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ SKIH GL LE + D VIEYVGE IR K++D RE YE+ GIG YLFRLD
Sbjct: 2300 RVEVGPSKIHRNGLFTLEDLMPGDIVIEYVGEKIRNKVADKREIYYEQKGIGDCYLFRLD 2359
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
Y++DAT G AR++NHSC+ NC K+I+V+ QK I I R I +GEE+TYNY F
Sbjct: 2360 KEYIIDATFFGNKARYLNHSCDANCSAKIINVQQQKHIIISTNRQIKSGEELTYNYNFDY 2419
Query: 918 EEKKIPCYCGSKKCHGSLN 936
E KI C+CG+ C G LN
Sbjct: 2420 ETDKIACFCGAPTCSGRLN 2438
>gi|328778088|ref|XP_392252.4| PREDICTED: histone-lysine N-methyltransferase trithorax [Apis
mellifera]
Length = 3195
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 82/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
A + RV NL A L + S K+ + S IH GL L IEA + VIEY
Sbjct: 3032 AESRRVASTNLPMAMRFRILKETS-----KESVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3086
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
GE+IR ++D RE Y+ IG Y+F++DD VVDAT +G ARFINHSC PNCY++V
Sbjct: 3087 AGEVIRASLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3145
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+ + G+K I I+A R I GEE+TY+YKFP E+ KIPC CGS++C LN
Sbjct: 3146 VDILGKKHILIFALRRINQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3195
>gi|242005679|ref|XP_002423690.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
gi|212506866|gb|EEB10952.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
Length = 3311
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 96/134 (71%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS+IH GL L IEA + VIEY GE+IR +SD RE Y + GIG Y+FR+DD +VV
Sbjct: 3179 RSQIHGRGLFCLREIEAGEMVIEYAGEVIRANLSDKREKYYTEKGIGC-YMFRIDDHFVV 3237
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT +G ARFINHSC PNCY++V+ + G+K I I+A R I EE+TY+YKFP E++KI
Sbjct: 3238 DATMKGNAARFINHSCEPNCYSRVVDILGKKHIVIFALRKINIMEELTYDYKFPFEDEKI 3297
Query: 923 PCYCGSKKCHGSLN 936
C+C SKKC LN
Sbjct: 3298 SCHCLSKKCKKYLN 3311
>gi|432892259|ref|XP_004075732.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oryzias
latipes]
Length = 4536
Score = 162 bits (411), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 80/149 (53%), Positives = 103/149 (69%), Gaps = 4/149 (2%)
Query: 791 QVKA-RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
Q+KA K+ + RS IH GL + IEA + VIEY G +IR ++D RE Y+ GIG
Sbjct: 4389 QLKAMSKETVGVYRSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREKYYDAKGIG 4448
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y+FR+DD VVDAT G ARFINHSC PNCY++V++V+GQK I I+A R I GEE+
Sbjct: 4449 C-YMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVLTVDGQKHIVIFASRRICCGEEL 4507
Query: 910 TYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
TY+YKFP+E+ K+PC CG+KKC LN
Sbjct: 4508 TYDYKFPIEDASNKLPCNCGTKKCRKFLN 4536
>gi|4127850|emb|CAA09454.1| MLL protein [Gallus gallus]
Length = 945
Score = 162 bits (410), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 801 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 859
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 860 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 919
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 920 YKFPIEDASNKLPCNCGAKKCRKFLN 945
>gi|322792358|gb|EFZ16342.1| hypothetical protein SINV_07789 [Solenopsis invicta]
Length = 3272
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 93/133 (69%), Gaps = 1/133 (0%)
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S+IH GL L IE + VIEY GE+IR ++D RE Y+ IG Y+F++DD VVD
Sbjct: 3141 SRIHGRGLFCLRDIEPGEMVIEYAGEVIRSSLTDKREKYYDSKNIGC-YMFKIDDHLVVD 3199
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
AT +G ARFINHSC PNCY++V+ + G+K I I+A R I GEE+TY+YKFP E+ KIP
Sbjct: 3200 ATMKGNAARFINHSCEPNCYSRVVDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIP 3259
Query: 924 CYCGSKKCHGSLN 936
C CGS+KC LN
Sbjct: 3260 CTCGSRKCRKYLN 3272
>gi|15488422|gb|AAL01112.1|AF408061_1 trithorax-related protein 7 [Arabidopsis thaliana]
Length = 259
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/211 (48%), Positives = 126/211 (59%), Gaps = 6/211 (2%)
Query: 611 PSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLE 670
PS LS S D D V++ EK S K K K ++++ E
Sbjct: 55 PSRRRLSLSSTDSEDA--VIKEDYDVRNEEKLPCHTSDKLKKGPNKLIRRRKPLAKHTTE 112
Query: 671 LHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLN 730
K L V G K A + + +K SK + P+SDGCAR+SI+GW WH WSL
Sbjct: 113 RSPIKDLSVDDGRPKPIALKPLEKLSSKPSKKKLFLSIPKSDGCARTSINGWHWHAWSLK 172
Query: 731 ASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKAS 790
AS ERARVRG+ VH ++ G + + +Q LSARTNR KLRNLLAAA+GA++LK S
Sbjct: 173 ASAEERARVRGSSCVHMQHFGSKSSLTQ----NVLSARTNRAKLRNLLAAADGADVLKMS 228
Query: 791 QVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
Q+KARKK LRFQ+SKIHDWGLVALEPIEAED
Sbjct: 229 QLKARKKHLRFQQSKIHDWGLVALEPIEAED 259
>gi|363742545|ref|XP_417896.3| PREDICTED: histone-lysine N-methyltransferase MLL [Gallus gallus]
Length = 3871
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 3727 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 3785
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3786 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3845
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3846 YKFPIEDASNKLPCNCGAKKCRKFLN 3871
>gi|334330381|ref|XP_001380704.2| PREDICTED: histone-lysine N-methyltransferase MLL [Monodelphis
domestica]
Length = 3960
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 78/146 (53%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE YE GIG Y
Sbjct: 3816 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYESKGIGC-Y 3874
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3875 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3934
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3935 YKFPIEDASNKLPCNCGAKKCRKFLN 3960
>gi|449267369|gb|EMC78314.1| Histone-lysine N-methyltransferase HRX, partial [Columba livia]
Length = 3786
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 3642 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 3700
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3701 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3760
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3761 YKFPIEDASNKLPCNCGAKKCRKFLN 3786
>gi|32394672|gb|AAN39003.1| SET1 protein [Griffithsia japonica]
Length = 201
Score = 161 bits (407), Expect = 2e-36, Method: Composition-based stats.
Identities = 90/191 (47%), Positives = 122/191 (63%), Gaps = 8/191 (4%)
Query: 754 VNASQWANGKG--LSARTNRVKLRNLLAAAEGA----ELLKASQVKARKKRLRFQRSKIH 807
VNA++ +G S+R +R + R L + EL+ S ++ R+K + +RS IH
Sbjct: 11 VNATEVVKPEGGVESSRESRRQQRWLRKGMQQVKARHELVTGSMLQERRKAVFCRRSGIH 70
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
+GL A E IEA +FVIEYVG +IR ++D+RE YE+ G+G SYLFRL+ VVDAT+R
Sbjct: 71 GFGLYAQEEIEAREFVIEYVGVVIRQSVADVREREYEEGGVGDSYLFRLNGEMVVDATRR 130
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCY 925
GGIARFINHSC+PN V G ++I Y++RHI +E+TY+YKF LE +KKI C
Sbjct: 131 GGIARFINHSCDPNLTATTQRVGGTERIVFYSRRHIGKYDELTYDYKFALEGDDKKIRCL 190
Query: 926 CGSKKCHGSLN 936
C S C LN
Sbjct: 191 CKSLNCRKFLN 201
>gi|395520196|ref|XP_003764223.1| PREDICTED: histone-lysine N-methyltransferase MLL [Sarcophilus
harrisii]
Length = 3995
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 78/146 (53%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE YE GIG Y
Sbjct: 3851 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYESKGIGC-Y 3909
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3910 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3969
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3970 YKFPIEDASNKLPCNCGAKKCRKFLN 3995
>gi|326933334|ref|XP_003212761.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Meleagris gallopavo]
Length = 3851
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 3707 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 3765
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3766 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3825
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3826 YKFPIEDASNKLPCNCGAKKCRKFLN 3851
>gi|241554585|ref|XP_002399516.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215501703|gb|EEC11197.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 544
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 96/134 (71%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + I+A + VIEY G+++R ++D RE YE GIG Y+FR+D+ VV
Sbjct: 412 RSLIHGRGLYSKRNIDAGEMVIEYAGQVVRSVLTDKRERLYESRGIGC-YMFRMDEDQVV 470
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC+PNCY+KVI+V GQK I IYA R I GEE+TY+YKFP EE KI
Sbjct: 471 DATTHGNAARFINHSCDPNCYSKVIAVFGQKHIIIYALRKIYKGEELTYDYKFPKEEVKI 530
Query: 923 PCYCGSKKCHGSLN 936
PC CG+++C LN
Sbjct: 531 PCSCGARRCRKFLN 544
>gi|224083075|ref|XP_002188579.1| PREDICTED: histone-lysine N-methyltransferase MLL [Taeniopygia
guttata]
Length = 3849
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 3705 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 3763
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3764 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3823
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3824 YKFPIEDASNKLPCNCGAKKCRKFLN 3849
>gi|320166419|gb|EFW43318.1| mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864]
Length = 1858
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 105/161 (65%), Gaps = 5/161 (3%)
Query: 780 AAEGAELLKASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
A + ++ A Q + + R +RS IH GL A +E + V+EY+GE IR +
Sbjct: 1699 ATQSIQMPMAMQYRVLRDNYRTYSVVRRSPIHGCGLYAARRLEKDSMVVEYMGERIRDIL 1758
Query: 836 SDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKI 895
+D RE Y+ GIG Y+FR+DD Y++DAT + ARF+NHSC PNCYT++++ +G K+I
Sbjct: 1759 TDYRERMYDARGIGC-YMFRIDDDYIIDATMKANQARFMNHSCEPNCYTRIVNPDGVKRI 1817
Query: 896 FIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+A R + GEE+T +YK P+E+ KIPCYCG++ C GS+N
Sbjct: 1818 IYFASRVVLEGEELTVDYKMPIEDVKIPCYCGTRSCRGSMN 1858
>gi|156353190|ref|XP_001622957.1| predicted protein [Nematostella vectensis]
gi|156209595|gb|EDO30857.1| predicted protein [Nematostella vectensis]
Length = 288
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 75/134 (55%), Positives = 96/134 (71%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL I A + VIEY G LIR ++D RE YE GIG Y+FR+D YVV
Sbjct: 156 RSGIHGRGLYCTRHIAAGEMVIEYSGMLIRSTLTDKREAYYESKGIGC-YMFRIDGTYVV 214
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC PNCY++V++++G KKI I+A + I+ GEE+TY+YKFPLE++K+
Sbjct: 215 DATTSGNAARFINHSCEPNCYSRVVTIDGNKKILIFASKSISRGEELTYDYKFPLEDEKL 274
Query: 923 PCYCGSKKCHGSLN 936
PC+C SK+C LN
Sbjct: 275 PCHCKSKRCRKYLN 288
>gi|156354350|ref|XP_001623359.1| predicted protein [Nematostella vectensis]
gi|156210050|gb|EDO31259.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 160 bits (406), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 75/134 (55%), Positives = 96/134 (71%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL I A + VIEY G LIR ++D RE YE GIG Y+FR+D YVV
Sbjct: 179 RSGIHGRGLYCTRHIAAGEMVIEYSGMLIRSTLTDKREAYYESKGIGC-YMFRIDGTYVV 237
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC PNCY++V++++G KKI I+A + I+ GEE+TY+YKFPLE++K+
Sbjct: 238 DATTSGNAARFINHSCEPNCYSRVVTIDGNKKILIFASKSISRGEELTYDYKFPLEDEKL 297
Query: 923 PCYCGSKKCHGSLN 936
PC+C SK+C LN
Sbjct: 298 PCHCKSKRCRKYLN 311
>gi|156353192|ref|XP_001622958.1| predicted protein [Nematostella vectensis]
gi|156209596|gb|EDO30858.1| predicted protein [Nematostella vectensis]
Length = 314
Score = 160 bits (406), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 75/134 (55%), Positives = 96/134 (71%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL I A + VIEY G LIR ++D RE YE GIG Y+FR+D YVV
Sbjct: 182 RSGIHGRGLYCTRHIAAGEMVIEYSGMLIRSTLTDKREAYYESKGIGC-YMFRIDGTYVV 240
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC PNCY++V++++G KKI I+A + I+ GEE+TY+YKFPLE++K+
Sbjct: 241 DATTSGNAARFINHSCEPNCYSRVVTIDGNKKILIFASKSISRGEELTYDYKFPLEDEKL 300
Query: 923 PCYCGSKKCHGSLN 936
PC+C SK+C LN
Sbjct: 301 PCHCKSKRCRKYLN 314
>gi|332025910|gb|EGI66066.1| Histone-lysine N-methyltransferase trithorax [Acromyrmex echinatior]
Length = 3452
Score = 160 bits (406), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 1/133 (0%)
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IH GL L IE + VIEY GE+IR ++D RE Y+ IG Y+F++DD VVD
Sbjct: 3321 SHIHGRGLFCLRDIEPGEMVIEYAGEVIRSSLTDKREKYYDSKNIGC-YMFKIDDHLVVD 3379
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
AT +G ARFINHSC PNCY++V+ + G+K I I+A R I GEE+TY+YKFP E+ KIP
Sbjct: 3380 ATMKGNAARFINHSCEPNCYSRVVDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIP 3439
Query: 924 CYCGSKKCHGSLN 936
C CGS+KC LN
Sbjct: 3440 CTCGSRKCRKYLN 3452
>gi|348526824|ref|XP_003450919.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oreochromis
niloticus]
Length = 4517
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/171 (49%), Positives = 112/171 (65%), Gaps = 5/171 (2%)
Query: 770 NRVKLRNLLAAAEGAEL-LKASQVKA-RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
+ V+LR A+ L ++ Q+KA K+ + RS IH GL + IEA + VIEY
Sbjct: 4348 DEVQLRTARRASMELPLAVRFKQLKATSKETVGVYRSPIHGRGLFCKKTIEAGEMVIEYS 4407
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
G +IR ++D RE Y+ GIG Y+FR+DD VVDAT G ARFINHSC PNCY++VI
Sbjct: 4408 GNVIRSVLTDKREKYYDGKGIGC-YMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVI 4466
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
+V+GQK I I+A R I GEE+TY+YKFP+E+ K+PC C +KKC LN
Sbjct: 4467 TVDGQKHIVIFASRRIYCGEELTYDYKFPIEDASNKLPCNCSAKKCRKFLN 4517
>gi|225380774|gb|ACN88688.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4219
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 78/146 (53%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
KA + + RS IH GL + IE + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 4075 KASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDKGIGC-Y 4133
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++V++V+GQK I I+A R I GEE+TY+
Sbjct: 4134 MFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVVNVDGQKHIVIFATRKIYKGEELTYD 4193
Query: 913 YKFPLEEK--KIPCYCGSKKCHGSLN 936
YKFP+EE K+PC CG+KKC LN
Sbjct: 4194 YKFPIEEPGNKLPCNCGAKKCRKFLN 4219
>gi|160333334|ref|NP_001103749.1| histone-lysine N-methyltransferase MLL [Danio rerio]
gi|158714185|gb|ABW79914.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4218
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 78/146 (53%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
KA + + RS IH GL + IE + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 4074 KASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDKGIGC-Y 4132
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++V++V+GQK I I+A R I GEE+TY+
Sbjct: 4133 MFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVVNVDGQKHIVIFATRKIYKGEELTYD 4192
Query: 913 YKFPLEEK--KIPCYCGSKKCHGSLN 936
YKFP+EE K+PC CG+KKC LN
Sbjct: 4193 YKFPIEEPGNKLPCNCGAKKCRKFLN 4218
>gi|444725290|gb|ELW65863.1| Histone-lysine N-methyltransferase MLL [Tupaia chinensis]
Length = 3806
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3662 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3720
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3721 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3780
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3781 YKFPIEDASNKLPCNCGAKKCRKFLN 3806
>gi|426370676|ref|XP_004052287.1| PREDICTED: histone-lysine N-methyltransferase MLL [Gorilla gorilla
gorilla]
Length = 3837
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3693 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3751
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3752 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3811
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3812 YKFPIEDASNKLPCNCGAKKCRKFLN 3837
>gi|426244626|ref|XP_004016122.1| PREDICTED: histone-lysine N-methyltransferase MLL [Ovis aries]
Length = 3710
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3566 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3624
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3625 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3684
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3685 YKFPIEDASNKLPCNCGAKKCRKFLN 3710
>gi|297458806|ref|XP_585092.4| PREDICTED: histone-lysine N-methyltransferase MLL [Bos taurus]
Length = 3826
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3682 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3740
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3741 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3800
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3801 YKFPIEDASNKLPCNCGAKKCRKFLN 3826
>gi|297482744|ref|XP_002693122.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Bos
taurus]
gi|296480196|tpg|DAA22311.1| TPA: myeloid/lymphoid or mixed-lineage leukemia-like [Bos taurus]
Length = 3821
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3677 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3735
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3736 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3795
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3796 YKFPIEDASNKLPCNCGAKKCRKFLN 3821
>gi|440904942|gb|ELR55394.1| Histone-lysine N-methyltransferase MLL, partial [Bos grunniens mutus]
Length = 3846
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3702 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3760
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3761 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3820
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3821 YKFPIEDASNKLPCNCGAKKCRKFLN 3846
>gi|351705860|gb|EHB08779.1| Histone-lysine N-methyltransferase HRX [Heterocephalus glaber]
Length = 3899
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3755 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3813
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3814 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3873
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3874 YKFPIEDASNKLPCNCGAKKCRKFLN 3899
>gi|76161881|gb|AAX30110.2| KIAA1076 protein [Schistosoma japonicum]
Length = 123
Score = 160 bits (404), Expect = 4e-36, Method: Composition-based stats.
Identities = 72/114 (63%), Positives = 88/114 (77%)
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
VIEYVG ++R ++++RE +YE GIG SYLFR+DD +V+DAT G ARFINHSC PNC
Sbjct: 10 VIEYVGHVVRKGVAELREHQYEAKGIGGSYLFRIDDDFVIDATMCGNNARFINHSCQPNC 69
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y K+I VE +KKI IY+KR I EEITY+YKFP EE+KIPC CGS C G+LN
Sbjct: 70 YAKIIMVESKKKIVIYSKRDINVMEEITYDYKFPYEEEKIPCQCGSSSCRGTLN 123
>gi|325192337|emb|CCA26782.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 2128
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 72/140 (51%), Positives = 99/140 (70%), Gaps = 1/140 (0%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RL+ +S IH +GL E + ++EY GELI +++D+RE YE+MG+GS YLFRLD
Sbjct: 1989 RLKVCKSAIHGYGLFTKEALSEGQAIVEYQGELISQQVADVREALYEEMGVGSCYLFRLD 2048
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEG-QKKIFIYAKRHIAAGEEITYNYKFP 916
++DAT RG +ARFINHSC+P + + + VE +KKI I+AKR I AGEEITY+YKFP
Sbjct: 2049 ATTIIDATTRGNLARFINHSCDPKAFARSVIVENDKKKILIFAKRAIMAGEEITYDYKFP 2108
Query: 917 LEEKKIPCYCGSKKCHGSLN 936
+EE+ + C CG+ C G +N
Sbjct: 2109 IEEEALRCDCGAPNCVGRMN 2128
>gi|344293012|ref|XP_003418218.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Loxodonta africana]
Length = 3962
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3818 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3876
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3877 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3936
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3937 YKFPIEDASNKLPCNCGAKKCRKFLN 3962
>gi|410972021|ref|XP_003992459.1| PREDICTED: histone-lysine N-methyltransferase MLL [Felis catus]
Length = 3554
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3410 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3468
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3469 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3528
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3529 YKFPIEDASNKLPCNCGAKKCRKFLN 3554
>gi|431908264|gb|ELK11862.1| Histone-lysine N-methyltransferase HRX [Pteropus alecto]
Length = 3459
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3315 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3373
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3374 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3433
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3434 YKFPIEDASNKLPCNCGAKKCRKFLN 3459
>gi|355752689|gb|EHH56809.1| hypothetical protein EGM_06289 [Macaca fascicularis]
Length = 3844
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3700 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3758
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3759 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3818
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3819 YKFPIEDASNKLPCNCGAKKCRKFLN 3844
>gi|355567103|gb|EHH23482.1| hypothetical protein EGK_06957, partial [Macaca mulatta]
Length = 3824
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3680 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3738
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3739 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3798
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3799 YKFPIEDASNKLPCNCGAKKCRKFLN 3824
>gi|395848655|ref|XP_003796965.1| PREDICTED: histone-lysine N-methyltransferase MLL [Otolemur
garnettii]
Length = 4062
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3918 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3976
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3977 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 4036
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 4037 YKFPIEDASNKLPCNCGAKKCRKFLN 4062
>gi|403263194|ref|XP_003923935.1| PREDICTED: histone-lysine N-methyltransferase MLL [Saimiri
boliviensis boliviensis]
Length = 3985
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3841 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3899
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3900 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3959
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3960 YKFPIEDASNKLPCNCGAKKCRKFLN 3985
>gi|350588548|ref|XP_003357368.2| PREDICTED: histone-lysine N-methyltransferase MLL [Sus scrofa]
Length = 2525
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 2381 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 2439
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 2440 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 2499
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 2500 YKFPIEDASNKLPCNCGAKKCRKFLN 2525
>gi|417414196|gb|JAA53397.1| Putative histone-lysine n-methyltransferase mll, partial [Desmodus
rotundus]
Length = 3966
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3822 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3880
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3881 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3940
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3941 YKFPIEDASNKLPCNCGAKKCRKFLN 3966
>gi|301785015|ref|XP_002927929.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Ailuropoda
melanoleuca]
Length = 3981
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3837 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3895
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3896 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3955
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3956 YKFPIEDASNKLPCNCGAKKCRKFLN 3981
>gi|390469747|ref|XP_002754504.2| PREDICTED: histone-lysine N-methyltransferase MLL [Callithrix
jacchus]
Length = 3994
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3850 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3908
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3909 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3968
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3969 YKFPIEDASNKLPCNCGAKKCRKFLN 3994
>gi|354496911|ref|XP_003510567.1| PREDICTED: histone-lysine N-methyltransferase MLL [Cricetulus
griseus]
Length = 3907
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3763 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3821
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3822 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3881
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3882 YKFPIEDASNKLPCNCGAKKCRKFLN 3907
>gi|345799715|ref|XP_536554.3| PREDICTED: histone-lysine N-methyltransferase MLL [Canis lupus
familiaris]
Length = 3829
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3685 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3743
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3744 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3803
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3804 YKFPIEDASNKLPCNCGAKKCRKFLN 3829
>gi|402895434|ref|XP_003910832.1| PREDICTED: histone-lysine N-methyltransferase MLL [Papio anubis]
Length = 3968
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3824 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3882
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3883 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3942
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3943 YKFPIEDASNKLPCNCGAKKCRKFLN 3968
>gi|297269329|ref|XP_001093874.2| PREDICTED: histone-lysine N-methyltransferase MLL [Macaca mulatta]
Length = 3986
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3842 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3900
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3901 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3960
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3961 YKFPIEDASNKLPCNCGAKKCRKFLN 3986
>gi|114640631|ref|XP_508792.2| PREDICTED: histone-lysine N-methyltransferase MLL [Pan troglodytes]
Length = 3969
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3825 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3883
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3884 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3943
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3944 YKFPIEDASNKLPCNCGAKKCRKFLN 3969
>gi|332208875|ref|XP_003253537.1| PREDICTED: histone-lysine N-methyltransferase MLL [Nomascus
leucogenys]
Length = 3968
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3824 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3882
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3883 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3942
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3943 YKFPIEDASNKLPCNCGAKKCRKFLN 3968
>gi|308199413|ref|NP_001184033.1| histone-lysine N-methyltransferase MLL isoform 1 precursor [Homo
sapiens]
Length = 3972
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3828 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3886
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3887 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3946
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3947 YKFPIEDASNKLPCNCGAKKCRKFLN 3972
>gi|119587788|gb|EAW67384.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_e [Homo sapiens]
Length = 3972
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3828 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3886
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3887 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3946
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3947 YKFPIEDASNKLPCNCGAKKCRKFLN 3972
>gi|119587787|gb|EAW67383.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_d [Homo sapiens]
Length = 4002
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3858 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3916
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3917 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3976
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3977 YKFPIEDASNKLPCNCGAKKCRKFLN 4002
>gi|56550039|ref|NP_005924.2| histone-lysine N-methyltransferase MLL isoform 2 precursor [Homo
sapiens]
gi|146345435|sp|Q03164.5|MLL1_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=CXXC-type zinc finger protein
7; AltName: Full=Lysine N-methyltransferase 2A;
Short=KMT2A; AltName: Full=Trithorax-like protein;
AltName: Full=Zinc finger protein HRX; Contains: RecName:
Full=MLL cleavage product N320; AltName: Full=N-terminal
cleavage product of 320 kDa; Short=p320; Contains:
RecName: Full=MLL cleavage product C180; AltName:
Full=C-terminal cleavage product of 180 kDa; Short=p180
gi|34305635|gb|AAQ63624.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) [Homo sapiens]
Length = 3969
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3825 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3883
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3884 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3943
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3944 YKFPIEDASNKLPCNCGAKKCRKFLN 3969
>gi|119587784|gb|EAW67380.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_a [Homo sapiens]
Length = 3969
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3825 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3883
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3884 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3943
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3944 YKFPIEDASNKLPCNCGAKKCRKFLN 3969
>gi|1490271|emb|CAA93625.1| ALL-1 protein [Homo sapiens]
Length = 4005
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3861 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3919
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3920 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3979
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3980 YKFPIEDASNKLPCNCGAKKCRKFLN 4005
>gi|299115811|emb|CBN74374.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 3157
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 71/140 (50%), Positives = 104/140 (74%), Gaps = 2/140 (1%)
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
L+ +RS IH WGL ALE + ++EY+GE++R ++D+RE +YE+MG+GS YLFR D
Sbjct: 3018 LKVRRSHIHGWGLFALEEFPKDAMLVEYMGEVVRQCVADLREVKYEEMGVGSCYLFRADA 3077
Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEG--QKKIFIYAKRHIAAGEEITYNYKFP 916
+VDAT++G +ARFINH C+PN +++++EG +KKI I AKR I GEEITY+YKF
Sbjct: 3078 DAIVDATRKGNLARFINHCCDPNAIARIVNLEGTKKKKIIIVAKRQIKPGEEITYDYKFE 3137
Query: 917 LEEKKIPCYCGSKKCHGSLN 936
E+ ++ C+CG+ C+GS+N
Sbjct: 3138 REDGQLSCHCGADVCYGSMN 3157
>gi|688443|gb|AAA62593.1| All-1 protein, partial [Mus musculus]
Length = 3866
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3722 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3780
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3781 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3840
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3841 YKFPIEDASNKLPCNCGAKKCRKFLN 3866
>gi|395743560|ref|XP_002822597.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL [Pongo abelii]
Length = 4012
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3868 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3926
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3927 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3986
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3987 YKFPIEDASNKLPCNCGAKKCRKFLN 4012
>gi|627837|pir||A48205 All-1 protein +GTE form - mouse (fragment)
Length = 3869
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3725 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3783
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3784 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3843
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3844 YKFPIEDASNKLPCNCGAKKCRKFLN 3869
>gi|397498815|ref|XP_003820170.1| PREDICTED: histone-lysine N-methyltransferase MLL [Pan paniscus]
Length = 4202
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 4058 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 4116
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 4117 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 4176
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 4177 YKFPIEDASNKLPCNCGAKKCRKFLN 4202
>gi|361127849|gb|EHK99806.1| putative Histone-lysine N-methyltransferase, H3 lysine-4 specific
[Glarea lozoyensis 74030]
Length = 1072
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 78/153 (50%), Positives = 105/153 (68%), Gaps = 8/153 (5%)
Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
++R +RV R +A E ++ L+ +Q++ RKK ++F RS IH+WGL A+E I
Sbjct: 909 NSRASRVNNRRFVAGLEDQKKALGGESDALRFNQLQKRKKPVKFARSAIHNWGLYAMENI 968
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
D +IEYVGE IR +++D+RE Y + GIGSSYLFR+D+ VVDATK+GGIARFINHS
Sbjct: 969 AINDMIIEYVGEKIRQQVADLRENNYLRSGIGSSYLFRIDENTVVDATKKGGIARFINHS 1028
Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEIT 910
C PNC K+I+VE K+I IYA R IA + T
Sbjct: 1029 CMPNCTAKIITVEKSKRIVIYALRDIAMSKFFT 1061
>gi|357150782|ref|XP_003575574.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Brachypodium
distachyon]
Length = 1055
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 72/142 (50%), Positives = 101/142 (71%), Gaps = 1/142 (0%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLF 854
++RL F +S+IH +G+ A +A D +IEY+GEL+R +SDIRE R Y + +Y+F
Sbjct: 892 RRRLAFGKSRIHGFGVFAKVAHKAGDMMIEYIGELVRPPVSDIREGRIYNSLVGAGTYMF 951
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
R+DD V+DAT+ G IA INHSC PNCY++VISV G + I I+AKR IA EE+TY+Y+
Sbjct: 952 RIDDERVIDATRSGSIAHLINHSCEPNCYSRVISVLGDEHIIIFAKRDIAPWEELTYDYR 1011
Query: 915 FPLEEKKIPCYCGSKKCHGSLN 936
F ++++PCYCG KC G +N
Sbjct: 1012 FVSNDQRLPCYCGFPKCRGVVN 1033
>gi|432105765|gb|ELK31956.1| Histone-lysine N-methyltransferase MLL, partial [Myotis davidii]
Length = 3463
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3319 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3377
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3378 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3437
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3438 YKFPIEDASNKLPCNCGAKKCRKFLN 3463
>gi|119587786|gb|EAW67382.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_c [Homo sapiens]
Length = 3130
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 2986 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3044
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3045 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3104
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3105 YKFPIEDASNKLPCNCGAKKCRKFLN 3130
>gi|392350034|ref|XP_003750554.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Rattus
norvegicus]
Length = 3894
Score = 159 bits (402), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3750 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3808
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3809 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3868
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3869 YKFPIEDASNKLPCNCGAKKCRKFLN 3894
>gi|62088596|dbj|BAD92745.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) variant [Homo sapiens]
Length = 2880
Score = 159 bits (402), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 2736 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 2794
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 2795 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 2854
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 2855 YKFPIEDASNKLPCNCGAKKCRKFLN 2880
>gi|348573849|ref|XP_003472703.1| PREDICTED: histone-lysine N-methyltransferase MLL-like, partial
[Cavia porcellus]
Length = 2799
Score = 159 bits (402), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 2655 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 2713
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 2714 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 2773
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 2774 YKFPIEDASNKLPCNCGAKKCRKFLN 2799
>gi|124486682|ref|NP_001074518.1| histone-lysine N-methyltransferase MLL [Mus musculus]
Length = 3963
Score = 159 bits (402), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3819 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3877
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3878 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3937
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3938 YKFPIEDASNKLPCNCGAKKCRKFLN 3963
>gi|148693675|gb|EDL25622.1| mCG1547 [Mus musculus]
Length = 3706
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3562 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3620
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3621 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3680
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3681 YKFPIEDASNKLPCNCGAKKCRKFLN 3706
>gi|326489957|dbj|BAJ94052.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1081
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 92/238 (38%), Positives = 137/238 (57%), Gaps = 19/238 (7%)
Query: 711 SDGCARSSISGWEWHKWSLNASP-------AERARVRGAQYVHTKY----LGPEVNASQW 759
S GCAR+ + +H+ P +R V Y+ + Y +G + +
Sbjct: 828 SSGCARTE--PYNFHRRRGQQQPQVTATASVKRLYVENMPYIVSGYCQNKVGCDTSCEPI 885
Query: 760 ANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
+ L A + + N+ + AE + +KA+ +KRL F +S+IH +G+ A +A
Sbjct: 886 QSVALLDAASQEASV-NVSSMAEKYKSMKAT----FRKRLAFGKSRIHGFGVFAKVGHKA 940
Query: 820 EDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
D +IEY+GEL+R ISD+RE R Y + +Y+FR+DD V+DAT+ G IA INHSC
Sbjct: 941 GDMMIEYIGELVRPPISDLRERRIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 1000
Query: 879 NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
PNCY++VISV G + I I+AKR I EE+TY+Y+F E+++PCYCG KC G +N
Sbjct: 1001 EPNCYSRVISVLGDEHIIIFAKRDIDPWEELTYDYRFVSNEQRLPCYCGFPKCRGVVN 1058
>gi|341940997|sp|P55200.3|MLL1_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=Zinc finger protein HRX;
Contains: RecName: Full=MLL cleavage product N320;
AltName: Full=N-terminal cleavage product of 320 kDa;
Short=p320; Contains: RecName: Full=MLL cleavage product
C180; AltName: Full=C-terminal cleavage product of 180
kDa; Short=p180
Length = 3966
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3822 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3880
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3881 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3940
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3941 YKFPIEDASNKLPCNCGAKKCRKFLN 3966
>gi|332077887|gb|AED99887.1| Trx1 [Hordeum vulgare]
Length = 1029
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 92/238 (38%), Positives = 137/238 (57%), Gaps = 19/238 (7%)
Query: 711 SDGCARSSISGWEWHKWSLNASP-------AERARVRGAQYVHTKY----LGPEVNASQW 759
S GCAR+ + +H+ P +R V Y+ + Y +G + +
Sbjct: 776 SSGCARTE--PYNFHRRRGQQQPQVTATASVKRLYVENMPYIVSGYCQNKVGCDTSCEPI 833
Query: 760 ANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
+ L A + + N+ + AE + +KA+ +KRL F +S+IH +G+ A +A
Sbjct: 834 QSVALLDAASQEASV-NVSSMAEKYKSMKAT----FRKRLAFGKSRIHGFGVFAKVGHKA 888
Query: 820 EDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
D +IEY+GEL+R ISD+RE R Y + +Y+FR+DD V+DAT+ G IA INHSC
Sbjct: 889 GDMMIEYIGELVRPPISDLRERRIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 948
Query: 879 NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
PNCY++VISV G + I I+AKR I EE+TY+Y+F E+++PCYCG KC G +N
Sbjct: 949 EPNCYSRVISVLGDEHIIIFAKRDIDPWEELTYDYRFVSNEQRLPCYCGFPKCRGVVN 1006
>gi|149041498|gb|EDL95339.1| myeloid/lymphoid or mixed-lineage leukemia (mapped) [Rattus
norvegicus]
Length = 3725
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3581 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3639
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3640 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3699
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3700 YKFPIEDASNKLPCNCGAKKCRKFLN 3725
>gi|184394|gb|AAA58669.1| HRX [Homo sapiens]
Length = 3969
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3825 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3883
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3884 MFRIDDSEVVDATMHGNRARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3943
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3944 YKFPIEDASNKLPCNCGAKKCRKFLN 3969
>gi|392341954|ref|XP_003754471.1| PREDICTED: histone-lysine N-methyltransferase MLL [Rattus norvegicus]
Length = 3987
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 3843 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3901
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3902 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3961
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3962 YKFPIEDASNKLPCNCGAKKCRKFLN 3987
>gi|3309543|gb|AAC41377.1| MLL [Takifugu rubripes]
Length = 4498
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 76/136 (55%), Positives = 95/136 (69%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY G +IR ++D RE Y+ GIG Y+FR+DD VV
Sbjct: 4364 RSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREKYYDGKGIGC-YMFRIDDYEVV 4422
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
DAT G ARFINHSC PNCY++VI+V+G+K I I+A R I GEE+TY+YKFP+E+
Sbjct: 4423 DATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFASRRIYRGEELTYDYKFPIEDASS 4482
Query: 921 KIPCYCGSKKCHGSLN 936
K+PC C SKKC LN
Sbjct: 4483 KLPCNCNSKKCRKFLN 4498
>gi|301090958|ref|XP_002895674.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
gi|262097084|gb|EEY55136.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
Length = 2943
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 67/141 (47%), Positives = 102/141 (72%), Gaps = 1/141 (0%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
+R+ ++S IH +GL EP+ ++EY G++I I+D RE RYE+ G+GS Y+FRL
Sbjct: 2803 ERMLVRKSSIHGYGLFLKEPVSDGQMIVEYQGQMINQTIADERERRYEEQGVGSCYMFRL 2862
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYKF 915
D+ ++DAT+ G +ARFINHSC+P + ++++VE G+KKI I+AKR IA G+E+TY+YKF
Sbjct: 2863 DEKTIIDATRCGNLARFINHSCDPKAFARIVAVEGGEKKIVIFAKRAIAVGDEVTYDYKF 2922
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
P+E++ I C C + C G +N
Sbjct: 2923 PIEDEAIRCDCNAPNCIGRMN 2943
>gi|449679772|ref|XP_002161520.2| PREDICTED: histone-lysine N-methyltransferase MLL-like [Hydra
magnipapillata]
Length = 281
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 98/134 (73%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RSKI GL + IEA + +IEY GE IR ++D RE YEK G G Y+FR+DD VV
Sbjct: 149 RSKIQGRGLFCKKTIEAGEMIIEYSGEKIRSSLTDRREKYYEKKGYGC-YMFRIDDTDVV 207
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT +G ARFINHSC PNC++++IS++G KKI IYA++ + GEE+TY+YKF +E+ K+
Sbjct: 208 DATTKGNAARFINHSCEPNCFSRIISIDGCKKIIIYAQKRVTVGEELTYDYKFAIEDDKL 267
Query: 923 PCYCGSKKCHGSLN 936
PC+CG+KKC LN
Sbjct: 268 PCFCGAKKCRKYLN 281
>gi|410910074|ref|XP_003968515.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Takifugu rubripes]
Length = 4478
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 76/136 (55%), Positives = 95/136 (69%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY G +IR ++D RE Y+ GIG Y+FR+DD VV
Sbjct: 4344 RSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREKYYDGKGIGC-YMFRIDDYEVV 4402
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
DAT G ARFINHSC PNCY++VI+V+G+K I I+A R I GEE+TY+YKFP+E+
Sbjct: 4403 DATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFASRRIYRGEELTYDYKFPIEDASS 4462
Query: 921 KIPCYCGSKKCHGSLN 936
K+PC C SKKC LN
Sbjct: 4463 KLPCNCNSKKCRKFLN 4478
>gi|356526623|ref|XP_003531916.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Glycine max]
Length = 1088
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 140/256 (54%), Gaps = 26/256 (10%)
Query: 703 RTSNLC------PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLG 751
R S LC P GCARS + + +L A+ +R V Y+ Y
Sbjct: 813 RVSGLCSDYEPPPNPSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 872
Query: 752 PEVNASQWANGKGLSA---------RTNRVKLRN-LLAAAEGAELLKASQVKARKKRLRF 801
+ +G+G+ + RT+R+ N +L+ +E ++ + +KRL F
Sbjct: 873 HGSLNNLEPSGRGVCSKFFCSQQRLRTSRIDTSNSILSISEKYTYMR----ETFRKRLAF 928
Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGY 860
+S+IH +G+ A P + D VIEY GEL+R I+D RE Y + +Y+FR+DD
Sbjct: 929 GKSRIHGFGIFAKHPYKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDER 988
Query: 861 VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK 920
V+DAT+ G IA INHSC PNCY++VISV G + I I+AKR I EE+TY+Y+F ++
Sbjct: 989 VIDATRAGSIAHLINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE 1048
Query: 921 KIPCYCGSKKCHGSLN 936
++PCYCG KC G +N
Sbjct: 1049 RLPCYCGFPKCRGIVN 1064
>gi|256078227|ref|XP_002575398.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
gi|353230389|emb|CCD76560.1| putative mixed-lineage leukemia protein, mll [Schistosoma mansoni]
Length = 3002
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 76/157 (48%), Positives = 99/157 (63%), Gaps = 3/157 (1%)
Query: 780 AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIR 839
A + + LL S AR R+ S+IH GL AL ++ VIEY GELIR I D R
Sbjct: 2849 ATQFSRLLSGSL--ARFYRVSVHPSRIHGRGLFALREFREDEMVIEYTGELIRSIICDAR 2906
Query: 840 ECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYA 899
E +Y G+ Y+FR+D +V+DAT G ARFINH+C+PNCY KV+S++ +K I I A
Sbjct: 2907 ELKYRATGVDC-YMFRIDPDWVIDATYAGNAARFINHACDPNCYAKVVSIDDKKHIVILA 2965
Query: 900 KRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+R I GEE+TY+Y+FP E K+PC CGS C LN
Sbjct: 2966 QRKIYPGEELTYDYRFPKESDKLPCNCGSYSCRKYLN 3002
>gi|410911878|ref|XP_003969417.1| PREDICTED: uncharacterized protein LOC101064190 [Takifugu rubripes]
Length = 2720
Score = 157 bits (396), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 102/146 (69%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS+IH GL IEA + VIEY G +IR ++D R+ Y+ GIG Y
Sbjct: 2576 KISKEAVGVYRSEIHGRGLFCKRNIEAGEMVIEYAGTVIRAVLTDKRQKYYDGKGIGC-Y 2634
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT +G ARFINHSC PNCY++VI+V+G+K I I+A R I GEE+TY+
Sbjct: 2635 MFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYRGEELTYD 2694
Query: 913 YKFPLE--EKKIPCYCGSKKCHGSLN 936
YKFP+E E K+ C CG+++C GSLN
Sbjct: 2695 YKFPIEDDESKLHCNCGTRRCRGSLN 2720
>gi|219127798|ref|XP_002184115.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404346|gb|EEC44293.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 154
Score = 156 bits (395), Expect = 4e-35, Method: Composition-based stats.
Identities = 75/145 (51%), Positives = 100/145 (68%), Gaps = 4/145 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
++RL +RS IH WGL I+ +D +IEY+GE+IR I+D RE YE G GS Y+FR
Sbjct: 10 EQRLVARRSHIHGWGLFTKSDIQKDDPIIEYMGEVIRQPIADQRERAYEISGEGSCYMFR 69
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ----KKIFIYAKRHIAAGEEITY 911
LD +VDATK G +ARF+NH C PN Y K+ISV+ + +KI ++A R I+AGEEITY
Sbjct: 70 LDMQRIVDATKIGCMARFMNHCCGPNAYAKIISVDAELGQDRKIVVFANRDISAGEEITY 129
Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
+YKFP+E+ + C CG+ C G LN
Sbjct: 130 DYKFPVEDGSLRCTCGAPNCIGRLN 154
>gi|391333600|ref|XP_003741200.1| PREDICTED: uncharacterized protein LOC100900769 [Metaseiulus
occidentalis]
Length = 748
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 72/134 (53%), Positives = 95/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RSKIH G+ A I+A + VIEY GELIR ++D RE Y+ GI Y+F++D+ VV
Sbjct: 616 RSKIHRRGVFAKRAIDAGEMVIEYSGELIRAVLTDKREQLYKARGI-DCYMFKIDEDEVV 674
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC+PNCY+K I + G+K I IY+++ I GEE+TY+YKFP E+ K+
Sbjct: 675 DATMHGNAARFINHSCDPNCYSKCIEIFGKKHIVIYSQKRIKVGEELTYDYKFPKEDVKV 734
Query: 923 PCYCGSKKCHGSLN 936
PC CG++KC LN
Sbjct: 735 PCTCGARKCRRYLN 748
>gi|428181744|gb|EKX50607.1| hypothetical protein GUITHDRAFT_66907 [Guillardia theta CCMP2712]
Length = 182
Score = 156 bits (395), Expect = 5e-35, Method: Composition-based stats.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 2/158 (1%)
Query: 780 AAEGAELLKASQVKARKK-RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
A E +KA ++ R + R +RS IH+ GL A I+ D VIEYVGELIR + DI
Sbjct: 26 AKEEERSIKAKYLQLRSEPRTVVRRSPIHNLGLFATRRIDRNDMVIEYVGELIRPIVGDI 85
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
R+ + G S+Y+FRLDD Y+VDA G +RFINH C+PNCY +V++V+G K I ++
Sbjct: 86 RDDLALEKG-KSTYMFRLDDNYIVDAMFAGNASRFINHCCDPNCYCQVVTVDGVKHIVLF 144
Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
A R I A EEITY+YK P+EE K+ C+CGS KC G +N
Sbjct: 145 AMRDIEADEEITYDYKLPIEEVKVICHCGSAKCRGYMN 182
>gi|323450935|gb|EGB06814.1| hypothetical protein AURANDRAFT_15476, partial [Aureococcus
anophagefferens]
Length = 136
Score = 156 bits (395), Expect = 5e-35, Method: Composition-based stats.
Identities = 66/136 (48%), Positives = 95/136 (69%), Gaps = 1/136 (0%)
Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYV 861
+RS IH WGL + + F+IEYVG+++R + D RE Y+ G+GS YLFRLD+ +
Sbjct: 1 RRSHIHGWGLYCKTDVPKDAFIIEYVGQVVRQAVGDKREKYYDDAGVGSCYLFRLDEDAI 60
Query: 862 VDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFPLEEK 920
VDAT+RG I RFINH C PN Y K+++++ KKI I A + + AG+E+ Y+YKFP+E+
Sbjct: 61 VDATRRGNIGRFINHCCRPNAYAKIVALDSNTKKIVIIALQDLKAGDEVMYDYKFPIEDD 120
Query: 921 KIPCYCGSKKCHGSLN 936
K+ CYCG+ C G++N
Sbjct: 121 KVKCYCGAPNCRGTMN 136
>gi|301606679|ref|XP_002932944.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 1 [Xenopus
(Silurana) tropicalis]
Length = 3855
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 76/146 (52%), Positives = 97/146 (66%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 3711 KTSKEAVGVYRSPIHGRGLFCRRNIDAGEMVIEYSGNVIRSILTDKREKYYDGKGIGC-Y 3769
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI ++GQK I I+A R I GEE+TY+
Sbjct: 3770 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIYRGEELTYD 3829
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+ C CG+KKC LN
Sbjct: 3830 YKFPIEDANNKLACNCGTKKCRKFLN 3855
>gi|301606681|ref|XP_002932945.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 3840
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 76/146 (52%), Positives = 97/146 (66%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 3696 KTSKEAVGVYRSPIHGRGLFCRRNIDAGEMVIEYSGNVIRSILTDKREKYYDGKGIGC-Y 3754
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI ++GQK I I+A R I GEE+TY+
Sbjct: 3755 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIYRGEELTYD 3814
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+ C CG+KKC LN
Sbjct: 3815 YKFPIEDANNKLACNCGTKKCRKFLN 3840
>gi|301622725|ref|XP_002940678.1| PREDICTED: hypothetical protein LOC100144721 [Xenopus (Silurana)
tropicalis]
Length = 2771
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/146 (54%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2627 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDSKGIGC-Y 2685
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2686 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVIHVEGQKHIVIFALRSIYRGEELTYD 2745
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 2746 YKFPIEDASNKLPCNCGAKKCRRFLN 2771
>gi|326676474|ref|XP_003200588.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Danio
rerio]
Length = 820
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL IEA + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 676 KTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGNVIRAVLTDKREKYYDSKGIGC-Y 734
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC+PNCY++VI+VEGQK I I+A R I GEE+TY+
Sbjct: 735 MFRIDDFDVVDATMHGNAARFINHSCDPNCYSRVINVEGQKHIVIFALRKIYRGEELTYD 794
Query: 913 YKFPLE--EKKIPCYCGSKKCHGSLN 936
YKFP+E + K+ C CG+++C LN
Sbjct: 795 YKFPIEDADNKLHCNCGARRCRRFLN 820
>gi|6850313|gb|AAF29390.1|AC009999_10 Contains similarity to MLL protein from Fugu rubripes gb|AF036382,
and contains a PWWP PF|00855 and a SET PF|00856 domain
[Arabidopsis thaliana]
Length = 1193
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 130/234 (55%), Gaps = 13/234 (5%)
Query: 709 PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
P GCAR+ + + +L + ++R V Y+ Y E + + G
Sbjct: 810 PNPSGCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIVGGYSRHEFSTYERIYGS 869
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
+S T N+L+ AE +K + +KRL F +S IH +G+ A P A D V
Sbjct: 870 KMSQITTP---SNILSMAEKYTFMKETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMV 922
Query: 824 IEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
IEY GEL+R I+D RE Y M +Y+FR+D+ V+DAT+ G IA INHSC PNC
Sbjct: 923 IEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNC 982
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y++VISV G + I I+AKR +A EE+TY+Y+F ++++ CYCG +C G +N
Sbjct: 983 YSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVN 1036
>gi|240254018|ref|NP_172074.6| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
gi|240254020|ref|NP_001077464.4| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
gi|257096236|sp|P0CB22.1|ATX2_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX2; AltName:
Full=Protein SET DOMAIN GROUP 30; AltName:
Full=Trithorax-homolog protein 2; Short=TRX-homolog
protein 2
gi|332189779|gb|AEE27900.1| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
gi|332189780|gb|AEE27901.1| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
Length = 1083
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 130/234 (55%), Gaps = 13/234 (5%)
Query: 709 PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
P GCAR+ + + +L + ++R V Y+ Y E + + G
Sbjct: 833 PNPSGCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIVGGYSRHEFSTYERIYGS 892
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
+S T N+L+ AE +K + +KRL F +S IH +G+ A P A D V
Sbjct: 893 KMSQITTP---SNILSMAEKYTFMKETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMV 945
Query: 824 IEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
IEY GEL+R I+D RE Y M +Y+FR+D+ V+DAT+ G IA INHSC PNC
Sbjct: 946 IEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNC 1005
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y++VISV G + I I+AKR +A EE+TY+Y+F ++++ CYCG +C G +N
Sbjct: 1006 YSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVN 1059
>gi|327288610|ref|XP_003229019.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Anolis carolinensis]
Length = 3817
Score = 155 bits (393), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 75/146 (51%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ IG Y
Sbjct: 3673 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSILTDKREKYYDSKDIGC-Y 3731
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI+++GQK I I+A R I GEE+TY+
Sbjct: 3732 MFRIDDAEVVDATMHGNAARFINHSCEPNCFSRVINIDGQKHIVIFAMRKIFRGEELTYD 3791
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 3792 YKFPIEDASNKLPCNCGAKKCRRFLN 3817
>gi|115478080|ref|NP_001062635.1| Os09g0134500 [Oryza sativa Japonica Group]
gi|47848462|dbj|BAD22318.1| trithorax-like [Oryza sativa Japonica Group]
gi|113630868|dbj|BAF24549.1| Os09g0134500 [Oryza sativa Japonica Group]
gi|215704333|dbj|BAG93767.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 1022
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 72/142 (50%), Positives = 100/142 (70%), Gaps = 1/142 (0%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLF 854
++RL F +S+IH +G+ A +A D +IEY+GEL+R ISDIRE R Y + +Y+F
Sbjct: 860 RRRLAFGKSRIHGFGVFAKVSHKAGDMMIEYIGELVRPPISDIRERRIYNSLVGAGTYMF 919
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
R+DD V+DAT+ G IA INHSC PNCY++VISV G + I I+AKR I EE+TY+Y+
Sbjct: 920 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVLGDEHIIIFAKRDINPWEELTYDYR 979
Query: 915 FPLEEKKIPCYCGSKKCHGSLN 936
F ++++PCYCG KC G +N
Sbjct: 980 FVSSDQRLPCYCGFPKCRGVVN 1001
>gi|256077052|ref|XP_002574822.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
Length = 1720
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/180 (47%), Positives = 111/180 (61%), Gaps = 15/180 (8%)
Query: 768 RTNRVKLRNLLAA---AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
R R R LLA E +LLK +Q+K RKK+L F +S IH WGL+ALEPI AE+ VI
Sbjct: 1545 REARSVQRRLLAEFQDIETGDLLKFNQLKFRKKQLIFAKSPIHAWGLIALEPIAAEEMVI 1604
Query: 825 EYVGELIRLKISDIRECRYE-----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHS-- 877
EYVG ++R ++++RE +YE K+ S +L + + + + F HS
Sbjct: 1605 EYVGHVVRKGVAELREHQYEAXXXXKLFTESIFLVLMSLRFYLYPRQNT----FYYHSEF 1660
Query: 878 -CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+PNCY K+I VE +KKI IY+KR I EEITY+YKFP EE KIPC CGS C G+LN
Sbjct: 1661 YLHPNCYAKIIMVESKKKIVIYSKRDINVMEEITYDYKFPYEEDKIPCQCGSSSCRGTLN 1720
>gi|154417121|ref|XP_001581581.1| SET domain containing protein [Trichomonas vaginalis G3]
gi|121915810|gb|EAY20595.1| SET domain containing protein [Trichomonas vaginalis G3]
Length = 762
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 116/187 (62%), Gaps = 21/187 (11%)
Query: 771 RVKLRNLLAAAEGAELLKASQVKARK-------------------KRLRFQRSKIHDWGL 811
R +L+ L+ A + ++ +Q A K KR+ F++S I +GL
Sbjct: 545 RARLQPLVVARDSGLKIRQTQFPAPKLMMQRRGAANNQMRRGINGKRVYFEKSTIQGYGL 604
Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
ALEPI ++ + EY GELIR +I+D+RE +YE++G +LFR+D+ VVDAT RGG +
Sbjct: 605 FALEPISSDSLICEYNGELIRSRIADLREKQYEQLGFPHMFLFRIDNDTVVDATMRGGKS 664
Query: 872 RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSK 929
RF+NHSC+PNC +K+I+V + I YA R+I +EIT+NY+ E+ K+ CYCG+K
Sbjct: 665 RFLNHSCHPNCRSKIINVGKTQTISFYAIRNIKPHDEITFNYQMEFEDRSKRERCYCGAK 724
Query: 930 KCHGSLN 936
+C G LN
Sbjct: 725 QCLGYLN 731
>gi|397614973|gb|EJK63132.1| hypothetical protein THAOC_16229, partial [Thalassiosira oceanica]
Length = 962
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 109/179 (60%), Gaps = 5/179 (2%)
Query: 761 NGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
NG+ A R+ L++L + + + R++ LRF +S IH WG+ A E I +
Sbjct: 786 NGRAARASQRRM-LKSLATLGSASSSVDRLAGRDREQELRFDKSSIHGWGVFAEEHINSG 844
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
D +IEY GELIR ++D RE YE + Y+FR+D V DAT RG +AR+IN SC+P
Sbjct: 845 DLIIEYRGELIRNAVADKREKEYEAARM-DDYMFRIDANTVCDATMRGNVARYINASCDP 903
Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGSLN 936
NCYT++I+ K+I IYAKR I G E+ Y+YKF LE K++ C CG+K C G +N
Sbjct: 904 NCYTQIITAGDNKRIVIYAKRDIPKGGELVYDYKFSLESDPGKRLICNCGAKMCKGFMN 962
>gi|327280514|ref|XP_003224997.1| PREDICTED: hypothetical protein LOC100556600 [Anolis carolinensis]
Length = 2812
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/146 (53%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2668 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKYYDSKGIGC-Y 2726
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2727 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVIHVEGQKHIVIFALRRIFRGEELTYD 2786
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2787 YKFPIEDAGSKLPCNCGAKRCRRFLN 2812
>gi|256077050|ref|XP_002574821.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
Length = 1822
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/180 (47%), Positives = 111/180 (61%), Gaps = 15/180 (8%)
Query: 768 RTNRVKLRNLLAA---AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
R R R LLA E +LLK +Q+K RKK+L F +S IH WGL+ALEPI AE+ VI
Sbjct: 1647 REARSVQRRLLAEFQDIETGDLLKFNQLKFRKKQLIFAKSPIHAWGLIALEPIAAEEMVI 1706
Query: 825 EYVGELIRLKISDIRECRYE-----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHS-- 877
EYVG ++R ++++RE +YE K+ S +L + + + + F HS
Sbjct: 1707 EYVGHVVRKGVAELREHQYEAXXXXKLFTESIFLVLMSLRFYLYPRQNT----FYYHSEF 1762
Query: 878 -CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
+PNCY K+I VE +KKI IY+KR I EEITY+YKFP EE KIPC CGS C G+LN
Sbjct: 1763 YLHPNCYAKIIMVESKKKIVIYSKRDINVMEEITYDYKFPYEEDKIPCQCGSSSCRGTLN 1822
>gi|414884804|tpg|DAA60818.1| TPA: putative trithorax-like family protein isoform 1 [Zea mays]
gi|414884805|tpg|DAA60819.1| TPA: putative trithorax-like family protein isoform 2 [Zea mays]
Length = 976
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 113/174 (64%), Gaps = 5/174 (2%)
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
GLS + + N+ + E ++KA+ +KRL F +SKIH +G+ A +A D +
Sbjct: 783 GLSDAVQQEAIGNVSSMVEKYTIVKAT----FRKRLTFGKSKIHGYGVFAKVAHKAGDMM 838
Query: 824 IEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
+EY+GE++R ISD RE R Y + +Y+FR+DD V+DAT+ G IAR INHSC PNC
Sbjct: 839 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSCEPNC 898
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y++ I++ G + I I+AKR I EE+TY+Y+F ++++PC+CG KC G +N
Sbjct: 899 YSRAITILGDEHIIIFAKRDIDPWEELTYDYRFFSSDQRLPCFCGFPKCRGVVN 952
>gi|166796317|gb|AAI59185.1| mll4 protein [Xenopus (Silurana) tropicalis]
Length = 1622
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 79/146 (54%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 1478 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDSKGIGC-Y 1536
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI VEGQK I I+A R I GEE+TY+
Sbjct: 1537 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVIHVEGQKHIVIFALRSIYRGEELTYD 1596
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 1597 YKFPIEDASNKLPCNCGAKKCRRFLN 1622
>gi|195059316|ref|XP_001995609.1| GH17848 [Drosophila grimshawi]
gi|193896395|gb|EDV95261.1| GH17848 [Drosophila grimshawi]
Length = 2535
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2382 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2441
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2442 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2500
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 2501 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2535
>gi|348667410|gb|EGZ07235.1| hypothetical protein PHYSODRAFT_306469 [Phytophthora sojae]
Length = 3106
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 101/141 (71%), Gaps = 1/141 (0%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
+R+ ++S IH +GL E + ++EY G++I ++D RE RYE+ G+GS Y+FRL
Sbjct: 2966 ERMLVRKSSIHGYGLFMKEAVSEGQMIVEYQGQMINQAVADERERRYEEQGVGSCYMFRL 3025
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYKF 915
D+ ++DAT+ G +ARFINHSC+P + ++++VE G+KKI I+AKR IA G+E+TY+YKF
Sbjct: 3026 DEKTIIDATRCGNLARFINHSCDPKAFARIVAVEGGEKKIVIFAKRAIAVGDEVTYDYKF 3085
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
P+E++ I C C + C G +N
Sbjct: 3086 PIEDEAIRCDCNAPNCIGRMN 3106
>gi|24639197|ref|NP_525040.2| trithorax-related, isoform C [Drosophila melanogaster]
gi|22831528|gb|AAF45684.2| trithorax-related, isoform C [Drosophila melanogaster]
Length = 2410
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2257 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2316
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2317 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2375
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 2376 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2410
>gi|28571451|ref|NP_726773.2| trithorax-related, isoform D [Drosophila melanogaster]
gi|74865454|sp|Q8IRW8.2|TRR_DROME RecName: Full=Histone-lysine N-methyltransferase trr; AltName:
Full=Lysine N-methyltransferase 2C; AltName:
Full=Trithorax-related protein
gi|28381556|gb|AAN09063.2| trithorax-related, isoform D [Drosophila melanogaster]
Length = 2431
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2278 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2337
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2338 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2396
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 2397 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2431
>gi|412986144|emb|CCO17344.1| predicted protein [Bathycoccus prasinos]
Length = 1990
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 71/140 (50%), Positives = 91/140 (65%), Gaps = 1/140 (0%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KKRL F +S IH WGL++ PI+A VI Y GE +R I+D+RE RYE+ G YL R
Sbjct: 1851 KKRLTFGKSAIHGWGLISKVPIKAGAMVIIYRGEAVRTPIADLREARYERDGT-DCYLLR 1909
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
DD VVD T G IARF NHSC+PN YTK+I G+ + + + + AG E+TYNY+F
Sbjct: 1910 ADDHTVVDCTNMGNIARFTNHSCDPNMYTKIIKSGGEHHVCFFTRVDVPAGTEMTYNYRF 1969
Query: 916 PLEEKKIPCYCGSKKCHGSL 935
+E+ K+PCYC S C G L
Sbjct: 1970 EIEDGKVPCYCASHNCRGYL 1989
>gi|3256105|emb|CAA15944.1| EG:63B12.3 [Drosophila melanogaster]
Length = 2422
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2269 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2328
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2329 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2387
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 2388 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2422
>gi|198467813|ref|XP_001354516.2| GA17728 [Drosophila pseudoobscura pseudoobscura]
gi|198146119|gb|EAL31569.2| GA17728 [Drosophila pseudoobscura pseudoobscura]
Length = 2566
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2413 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2472
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2473 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2531
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KIPC CG+ C +N
Sbjct: 2532 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2566
>gi|426381958|ref|XP_004057596.1| PREDICTED: uncharacterized protein LOC101126745 [Gorilla gorilla
gorilla]
Length = 1635
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 65/102 (63%), Positives = 81/102 (79%)
Query: 835 ISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH C PNCY KVI++E QKK
Sbjct: 1534 VADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKK 1593
Query: 895 IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1594 IVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1635
>gi|405120132|gb|AFR94903.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
grubii H99]
Length = 1471
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 100/156 (64%), Gaps = 17/156 (10%)
Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
++L +Q++ RKK+LRF RS I +GL A+E I A + V EYVG+L+R ++D+RE RY
Sbjct: 1330 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1389
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
K GIGSSYLFR+D+ V DAT +G ++R INHSC+P+ K+I V GQ KI
Sbjct: 1390 LKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIL------- 1442
Query: 904 AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
Y+YKFPLE ++PC CG+ C G LN
Sbjct: 1443 -------YDYKFPLESDPSLRVPCLCGAATCRGWLN 1471
>gi|297698601|ref|XP_002826411.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Pongo abelii]
Length = 1576
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 65/102 (63%), Positives = 81/102 (79%)
Query: 835 ISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
++D+RE RY + GIGSSYLFR+D ++DATK G +ARFINH C PNCY KVI++E QKK
Sbjct: 1475 VADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKK 1534
Query: 895 IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I IY+K+ I EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1535 IVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1576
>gi|194767856|ref|XP_001966030.1| GF19475 [Drosophila ananassae]
gi|190622915|gb|EDV38439.1| GF19475 [Drosophila ananassae]
Length = 2490
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2337 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2396
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2397 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2455
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KIPC CG+ C +N
Sbjct: 2456 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2490
>gi|297843378|ref|XP_002889570.1| hypothetical protein ARALYDRAFT_311665 [Arabidopsis lyrata subsp.
lyrata]
gi|297335412|gb|EFH65829.1| hypothetical protein ARALYDRAFT_311665 [Arabidopsis lyrata subsp.
lyrata]
Length = 1206
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/234 (37%), Positives = 128/234 (54%), Gaps = 13/234 (5%)
Query: 709 PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
P GCAR+ + + +L ++R V Y+ Y E + G
Sbjct: 823 PNPSGCARTEPYNYLGRRGRKEPEALAGPSSKRLFVENQPYIVGGYSRHEFATYERIYGS 882
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
+S + N+L+ AE +K + +KRL F +S IH +G+ A P A D V
Sbjct: 883 KMS---QIITPSNILSMAEKYTFMKETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMV 935
Query: 824 IEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
IEY GEL+R I+D RE Y M +Y+FR+D+ V+DAT+ G IA INHSC PNC
Sbjct: 936 IEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNC 995
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y++VISV G + I I+AKR +A EE+TY+Y+F ++++ CYCG +C G +N
Sbjct: 996 YSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVN 1049
>gi|449496792|ref|XP_004160228.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ATX2-like [Cucumis sativus]
Length = 1095
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 75/170 (44%), Positives = 112/170 (65%), Gaps = 5/170 (2%)
Query: 768 RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
+T ++ RN+L+ AE + ++ + +KRL F +S IH +G+ A P A D VIEY
Sbjct: 906 KTCQLDPRNILSVAEKYKFMR----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYT 961
Query: 828 GELIRLKISDIRECRYEKMGIGS-SYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
GE++R I+D RE + +G+ +Y+FR+DD V+DAT+ G IA INHSC PNCY++V
Sbjct: 962 GEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRV 1021
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
ISV G + I I+AKR I EE+TY+Y+F ++++ CYCG +C G +N
Sbjct: 1022 ISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGVVN 1071
>gi|194912727|ref|XP_001982563.1| GG12672 [Drosophila erecta]
gi|190648239|gb|EDV45532.1| GG12672 [Drosophila erecta]
Length = 2406
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2253 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2312
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2313 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2371
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 2372 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2406
>gi|195438882|ref|XP_002067361.1| GK16377 [Drosophila willistoni]
gi|194163446|gb|EDW78347.1| GK16377 [Drosophila willistoni]
Length = 2510
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2357 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2416
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2417 EAKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2475
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 2476 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2510
>gi|384246087|gb|EIE19578.1| Mll1 protein, partial [Coccomyxa subellipsoidea C-169]
Length = 132
Score = 154 bits (388), Expect = 3e-34, Method: Composition-based stats.
Identities = 75/136 (55%), Positives = 94/136 (69%), Gaps = 6/136 (4%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS +H WGL A +PI A +FVIEYVGELIR ++D+RE ++ S Y FR+D +VV
Sbjct: 1 RSGVHAWGLFAHKPIPANEFVIEYVGELIRRTVNDVRE----RVDADSDYRFRVDADWVV 56
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EK 920
DAT+RGG ARFINH C+PNC TK +SV GQ I I +KR IA GEE++Y+YKF E
Sbjct: 57 DATRRGGKARFINHCCDPNCVTKTVSVGGQLHIMICSKRAIAEGEELSYDYKFQPTPGEA 116
Query: 921 KIPCYCGSKKCHGSLN 936
IPC CG++ C LN
Sbjct: 117 PIPCTCGARNCRKRLN 132
>gi|302786940|ref|XP_002975241.1| hypothetical protein SELMODRAFT_415365 [Selaginella moellendorffii]
gi|300157400|gb|EFJ24026.1| hypothetical protein SELMODRAFT_415365 [Selaginella moellendorffii]
Length = 1184
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 68/141 (48%), Positives = 98/141 (69%), Gaps = 1/141 (0%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFR 855
+RL F +S IH WGL A EP A D VIEY GE+IR ++D+RE R Y + +Y+F
Sbjct: 1022 RRLTFGKSAIHGWGLFAKEPHGAGDMVIEYAGEIIRPTVADVREKRCYNSLVGAGTYMFC 1081
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ VVDAT+ G IA INHSC PNCY++V++ G+++I I+AK+ IA G+E+TY+Y+F
Sbjct: 1082 IDNERVVDATRAGSIAHLINHSCEPNCYSRVVTTNGKERIVIFAKQDIAGGDEVTYDYRF 1141
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
++PC+CG+ C G +N
Sbjct: 1142 TSIGDQLPCHCGTAGCRGIVN 1162
>gi|167533965|ref|XP_001748661.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772902|gb|EDQ86548.1| predicted protein [Monosiga brevicollis MX1]
Length = 706
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/137 (53%), Positives = 98/137 (71%), Gaps = 10/137 (7%)
Query: 810 GLVA-LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
GL+ L+ I A + VIEYVGE+IR ++D RE RY ++ IGSSYLFR+D+ V+DAT+RG
Sbjct: 570 GLIQQLKDIPANEVVIEYVGEVIRQSVADERERRYTQVKIGSSYLFRIDELNVIDATRRG 629
Query: 869 GIARFINHSCN-------PNC--YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE 919
IARF+NHSC+ PN + KV +K I IY+KR I G+EITY+Y+FP+E+
Sbjct: 630 HIARFMNHSCDVGLPRSIPNTPFHPKVFESHDEKHIVIYSKRDIREGQEITYDYQFPMED 689
Query: 920 KKIPCYCGSKKCHGSLN 936
+KIPCYCG++ C G LN
Sbjct: 690 EKIPCYCGAENCKGFLN 706
>gi|168012124|ref|XP_001758752.1| histone-lysine N-methyltransferase-like protein [Physcomitrella
patens subsp. patens]
gi|162689889|gb|EDQ76258.1| histone-lysine N-methyltransferase-like protein [Physcomitrella
patens subsp. patens]
Length = 2373
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 116/192 (60%), Gaps = 9/192 (4%)
Query: 752 PEVNASQWANGKGLSARTNRV-KLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWG 810
PE A+ W G T R+ K +L + E L+ Q K KK L +S IH G
Sbjct: 2184 PEQVAA-WLRISGRKLSTRRLHKSASLAMKFDHKEYLRFKQKKGWKK-LAVYKSIIHALG 2241
Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK----MGIGSSYLFRLDDGYVVDATK 866
L + I + V+EYVGE++ +++D RE Y G+ YLFR+D ++DAT+
Sbjct: 2242 LYTTDFIAEREVVVEYVGEIVGHRVADKREVEYHSRKRLQYQGACYLFRIDTEQIIDATR 2301
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPC 924
GGIARF+NHSC+PNC KVI VE KK+ +AKR+I AGEE+TY+YKF +E KIPC
Sbjct: 2302 NGGIARFVNHSCSPNCVAKVICVENLKKVIFFAKRNIDAGEEVTYDYKFNYDEVGDKIPC 2361
Query: 925 YCGSKKCHGSLN 936
+CG+ +C G+LN
Sbjct: 2362 FCGTPECRGTLN 2373
>gi|124111218|gb|ABM91999.1| MLL [Pan troglodytes]
Length = 338
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 194 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 252
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 253 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 312
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 313 YKFPIEDASNKLPCNCGAKKCRKFLN 338
>gi|449441169|ref|XP_004138356.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Cucumis
sativus]
Length = 1036
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 75/170 (44%), Positives = 112/170 (65%), Gaps = 5/170 (2%)
Query: 768 RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
+T ++ RN+L+ AE + ++ + +KRL F +S IH +G+ A P A D VIEY
Sbjct: 847 KTCQLDPRNILSVAEKYKFMR----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYT 902
Query: 828 GELIRLKISDIRECRYEKMGIGS-SYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
GE++R I+D RE + +G+ +Y+FR+DD V+DAT+ G IA INHSC PNCY++V
Sbjct: 903 GEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRV 962
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
ISV G + I I+AKR I EE+TY+Y+F ++++ CYCG +C G +N
Sbjct: 963 ISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGVVN 1012
>gi|326430870|gb|EGD76440.1| mixed-lineage leukemia protein [Salpingoeca sp. ATCC 50818]
Length = 2027
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 107/155 (69%), Gaps = 3/155 (1%)
Query: 784 AELLKASQVKAR-KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR 842
++ ++ Q K R ++ +R SKIH +GL A I + ++EYVGE+IR +++D RE
Sbjct: 1874 SDAMQYRQCKERMRETVRVGYSKIHGFGLYAQRDISGGEMIVEYVGEVIRPELTDKREQF 1933
Query: 843 YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRH 902
Y+ +GS Y+FR+DD VVDAT GG ARF+NHSC+PNC +++IS + KKI I AK+H
Sbjct: 1934 YDARNMGS-YMFRIDDKQVVDATLTGGQARFVNHSCDPNCISRIISTDRGKKIVIVAKQH 1992
Query: 903 IAAGEEITYNYKFPL-EEKKIPCYCGSKKCHGSLN 936
I G+E+TY+Y+FPL ++ K+ C CG+ C G +N
Sbjct: 1993 ICKGDELTYDYQFPLDDDNKVRCSCGAANCRGFMN 2027
>gi|40225368|gb|AAH09337.2| MLL4 protein [Homo sapiens]
Length = 795
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 651 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 709
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 710 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 769
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 770 YKFPIEDASNKLPCNCGAKRCRRFLN 795
>gi|120974668|gb|ABM46716.1| MLL [Gorilla gorilla]
Length = 338
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 194 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 252
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 253 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 312
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 313 YKFPIEDASNKLPCNCGAKKCRKFLN 338
>gi|224128834|ref|XP_002320433.1| SET domain protein [Populus trichocarpa]
gi|222861206|gb|EEE98748.1| SET domain protein [Populus trichocarpa]
Length = 1050
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 100/283 (35%), Positives = 146/283 (51%), Gaps = 22/283 (7%)
Query: 672 HATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSD--GCARSSISGWEWHKWS- 728
+ L K + + R V ++ R S+ P + GCAR+ + +
Sbjct: 751 QCIRLLSFCKKHRQPSNERVVTDERVGQIPRRCSDYIPPCNLSGCARTEPYNYFGRRGRK 810
Query: 729 ----LNASPAERARVRGAQYVHTKYLGPEVNASQWA-NG---KGLSARTNRVKLR----- 775
L A+ +R V Y+ Y E + A NG G S+ R++
Sbjct: 811 EPEVLAAASLKRLFVENQPYLVGGYSQHESSGCTLASNGLINSGFSSSLQRLRASQLDAP 870
Query: 776 -NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLK 834
N+L+ AE + ++ + +KRL F +S IH +G+ A P A D VIEY GEL+R
Sbjct: 871 SNILSMAEKYQHMRHTF----RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPP 926
Query: 835 ISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK 893
I+D RE Y + +Y+FR+DD V+DAT+ G IA INHSC PNCY++VISV G +
Sbjct: 927 IADRREHFIYNSLVGAGTYMFRIDDKRVIDATRAGSIAHLINHSCEPNCYSRVISVNGDE 986
Query: 894 KIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I I+AKR I EE+TY+Y+F E+K+ CYCG +C G +N
Sbjct: 987 HIIIFAKRDIKRWEELTYDYRFFSIEEKLACYCGFSRCRGVVN 1029
>gi|195392728|ref|XP_002055009.1| GJ19139 [Drosophila virilis]
gi|194149519|gb|EDW65210.1| GJ19139 [Drosophila virilis]
Length = 2531
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2378 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2437
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2438 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2496
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KIPC CG+ C +N
Sbjct: 2497 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2531
>gi|308044237|ref|NP_001182990.1| uncharacterized protein LOC100501309 [Zea mays]
gi|238008654|gb|ACR35362.1| unknown [Zea mays]
Length = 531
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 113/174 (64%), Gaps = 5/174 (2%)
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
GLS + + N+ + E ++KA+ +KRL F +SKIH +G+ A +A D +
Sbjct: 338 GLSDAVQQEAIGNVSSMVEKYTIVKAT----FRKRLTFGKSKIHGYGVFAKVAHKAGDMM 393
Query: 824 IEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
+EY+GE++R ISD RE R Y + +Y+FR+DD V+DAT+ G IAR INHSC PNC
Sbjct: 394 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSCEPNC 453
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y++ I++ G + I I+AKR I EE+TY+Y+F ++++PC+CG KC G +N
Sbjct: 454 YSRAITILGDEHIIIFAKRDIDPWEELTYDYRFFSSDQRLPCFCGFPKCRGVVN 507
>gi|195132807|ref|XP_002010834.1| GI21761 [Drosophila mojavensis]
gi|193907622|gb|EDW06489.1| GI21761 [Drosophila mojavensis]
Length = 2748
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 2595 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2654
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 2655 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2713
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KIPC CG+ C +N
Sbjct: 2714 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2748
>gi|255075355|ref|XP_002501352.1| set domain protein [Micromonas sp. RCC299]
gi|226516616|gb|ACO62610.1| set domain protein [Micromonas sp. RCC299]
Length = 2166
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 67/147 (45%), Positives = 98/147 (66%), Gaps = 1/147 (0%)
Query: 789 ASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGI 848
A ++ R++RL F +S IH WGL+A + ++A V+EYVGE +R ++D+RE YE+ G
Sbjct: 2020 AECLRLRRERLTFGKSNIHGWGLIAKQFLKAGSMVVEYVGERLRPSVADLREKVYERTGH 2079
Query: 849 GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEE 908
YL DD V+D T +G IARF NHSC PN YTK+++++G +IF + + + G+E
Sbjct: 2080 -DVYLLAADDKTVIDTTVKGSIARFTNHSCTPNMYTKLVALDGDSRIFFFTRIDVHPGQE 2138
Query: 909 ITYNYKFPLEEKKIPCYCGSKKCHGSL 935
+TYNY+F E K+PCYCG+ C G L
Sbjct: 2139 LTYNYRFDAESGKVPCYCGANNCRGFL 2165
>gi|194380712|dbj|BAG58509.1| unnamed protein product [Homo sapiens]
Length = 323
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 179 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 237
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 238 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 297
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 298 YKFPIEDASNKLPCNCGAKKCRKFLN 323
>gi|349604316|gb|AEP99904.1| Histone-lysine N-methyltransferase HRX-like protein, partial [Equus
caballus]
Length = 297
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 153 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 211
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 212 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 271
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 272 YKFPIEDASNKLPCNCGAKKCRKFLN 297
>gi|357502739|ref|XP_003621658.1| Histone-lysine N-methyltransferase ATX2, partial [Medicago
truncatula]
gi|355496673|gb|AES77876.1| Histone-lysine N-methyltransferase ATX2, partial [Medicago
truncatula]
Length = 791
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 90/240 (37%), Positives = 133/240 (55%), Gaps = 12/240 (5%)
Query: 709 PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
P GCARS + + +L AS +R V Y+ Y + +G+
Sbjct: 528 PNPSGCARSEPYDYFGRRGRKEPEALAASSLKRLFVENQPYLVGGYCQHGLLNDSEPSGR 587
Query: 764 GLSAR--TNRVKLRNLLAAAEGAELLKASQVK----ARKKRLRFQRSKIHDWGLVALEPI 817
G+ ++ + +LR + A + L A + K +K+L F +S+IH +G+ A P
Sbjct: 588 GVCSKFFCSEQRLRTSMVDAADSILTVAEKYKYMSETFRKQLAFGKSRIHGFGIFAKHPY 647
Query: 818 EAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
+ D VIEY GEL+R I+D RE Y + +Y+FR+DD V+DAT+ G IA INH
Sbjct: 648 KGGDMVIEYTGELVRPPIADRRERFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINH 707
Query: 877 SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
SC PNCY++VISV G + I I+AKR I EE+TY+Y+F ++++ CYCG KC G +N
Sbjct: 708 SCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDERLSCYCGFPKCRGVVN 767
>gi|115495457|ref|NP_083550.2| histone-lysine N-methyltransferase MLL4 [Mus musculus]
gi|341940998|sp|O08550.3|MLL4_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 4
homolog; AltName: Full=Trithorax homolog 2; AltName:
Full=WW domain-binding protein 7; Short=WBP-7
Length = 2713
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2569 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2627
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2628 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2687
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2688 YKFPIEDASNKLPCNCGAKRCRRFLN 2713
>gi|56744180|dbj|BAD81031.1| mixed lineage leukemia 2 [Mus musculus]
Length = 2713
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2569 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2627
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2628 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2687
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2688 YKFPIEDASNKLPCNCGAKRCRRFLN 2713
>gi|452822785|gb|EME29801.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 [Galdieria
sulphuraria]
Length = 969
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 73/138 (52%), Positives = 100/138 (72%), Gaps = 2/138 (1%)
Query: 800 RFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDG 859
+ ++S I GL LE + E+FVIEY GELIR I+DIRE Y++ IG Y+FRL+D
Sbjct: 833 KVKQSGIQGLGLYTLENLPDEEFVIEYAGELIRPVIADIREKFYDRRKIGC-YMFRLNDD 891
Query: 860 YVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIF-IYAKRHIAAGEEITYNYKFPLE 918
++VDAT +G ARFINHSC PNC +K+I+V+G K++ I+AKR+IAAGEE+TY+Y+F
Sbjct: 892 FIVDATMKGNYARFINHSCEPNCRSKIITVDGDKQVIGIFAKRNIAAGEELTYDYQFEEF 951
Query: 919 EKKIPCYCGSKKCHGSLN 936
+ IPC CG+ C G +N
Sbjct: 952 GETIPCNCGAPNCRGKMN 969
>gi|225380776|gb|ACN88689.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 148
Score = 152 bits (385), Expect = 6e-34, Method: Composition-based stats.
Identities = 78/146 (53%), Positives = 99/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
KA + + RS IH GL + IE + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 4 KASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDKGIGC-Y 62
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+V+G+K I I+A R I GEE+TY+
Sbjct: 63 MFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVINVDGRKHIVIFATRKIYKGEELTYD 122
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+EE K+PC CG+KKC LN
Sbjct: 123 YKFPIEEPGNKLPCNCGAKKCRKFLN 148
>gi|292621658|ref|XP_002664717.1| PREDICTED: hypothetical protein LOC566825 [Danio rerio]
Length = 3750
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 76/136 (55%), Positives = 94/136 (69%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL IEA + VIEY G +IR ++D RE Y+ GIG Y+FR+DD VV
Sbjct: 3616 RSAIHGRGLFCKRNIEAGEMVIEYSGIVIRSVLTDKREKYYDGKGIGC-YMFRIDDFDVV 3674
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
DAT G ARFINHSC PNCY++VI+VEGQK I I+A R I GEE+TY+YKFP+E+
Sbjct: 3675 DATMHGNAARFINHSCEPNCYSRVINVEGQKHIVIFALRKIYRGEELTYDYKFPIEDASN 3734
Query: 921 KIPCYCGSKKCHGSLN 936
K+ C CG+K+C LN
Sbjct: 3735 KLGCNCGAKRCRRFLN 3750
>gi|395846912|ref|XP_003796132.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Otolemur
garnettii]
Length = 2714
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2570 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKYYDGKGIGC-Y 2628
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2629 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2688
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2689 YKFPIEDASNKLPCNCGAKRCRRFLN 2714
>gi|342209882|gb|AEL16988.1| trithorax-like protein [Phaseolus vulgaris]
Length = 1089
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 92/256 (35%), Positives = 140/256 (54%), Gaps = 26/256 (10%)
Query: 703 RTSNLC------PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLG 751
R + LC P GCARS + + +L A+ +R V Y+ Y
Sbjct: 814 RVAGLCSDYEPPPNPSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYIVGGYCQ 873
Query: 752 PEVNASQWANGKGLSAR--TNRVKLR--------NLLAAAEGAELLKASQVKARKKRLRF 801
+G+G+ ++ ++ +LR N+L+ +E + ++ + +KRL F
Sbjct: 874 HGALNDPEPSGRGVCSKFFCSQQRLRTSLIDTSNNILSISEKYKYMR----ETFRKRLAF 929
Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGY 860
+S+IH +G+ A P + D VIEY GEL+R I+D RE Y + +Y+FR+DD
Sbjct: 930 GKSRIHGFGIFAKHPHKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDER 989
Query: 861 VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK 920
V+DAT+ G IA INHSC PNCY++VISV G + I I+AKR I EE+TY+Y+F ++
Sbjct: 990 VIDATRAGSIAHLINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE 1049
Query: 921 KIPCYCGSKKCHGSLN 936
++ CYCG KC G +N
Sbjct: 1050 RLSCYCGFPKCRGIVN 1065
>gi|170058057|ref|XP_001864756.1| Mll1 protein [Culex quinquefasciatus]
gi|167877297|gb|EDS40680.1| Mll1 protein [Culex quinquefasciatus]
Length = 114
Score = 152 bits (385), Expect = 7e-34, Method: Composition-based stats.
Identities = 69/114 (60%), Positives = 84/114 (73%), Gaps = 1/114 (0%)
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
VIEY GELIR ++D RE Y+ GIG Y+F++D+ +VVDAT RG ARFINHSC PNC
Sbjct: 2 VIEYAGELIRSTLTDKRERYYDGRGIGC-YMFKIDENFVVDATMRGNAARFINHSCEPNC 60
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y+KV+ + G K I I+A R I GEE+TY+YKFP E+ KIPC CGSKKC LN
Sbjct: 61 YSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDVKIPCSCGSKKCRKYLN 114
>gi|121483959|gb|ABM54292.1| MLL [Pan paniscus]
Length = 162
Score = 152 bits (385), Expect = 7e-34, Method: Composition-based stats.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 18 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 76
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 77 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 136
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 137 YKFPIEDASNKLPCNCGAKKCRKFLN 162
>gi|410904194|ref|XP_003965577.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like,
partial [Takifugu rubripes]
Length = 109
Score = 152 bits (385), Expect = 7e-34, Method: Composition-based stats.
Identities = 66/102 (64%), Positives = 83/102 (81%)
Query: 835 ISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
I+D+RE RYE+ GIGSSY+FR+D ++DATK G ARFINHSCNPNCY KVI+VE QKK
Sbjct: 8 IADMREKRYEEEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKK 67
Query: 895 IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I IY+++ I EEITY+YKFP+E+ KIPC CG++ C G+LN
Sbjct: 68 IVIYSRQPINVNEEITYDYKFPIEDVKIPCLCGAENCRGTLN 109
>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
occidentalis]
Length = 2686
Score = 152 bits (385), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 70/134 (52%), Positives = 94/134 (70%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + I + +IEY GE+IR + D RE YE G+G Y+FR+D+ VV
Sbjct: 2554 RSGIHGRGLYCKKDIAKGEMIIEYAGEVIRASLCDRREKYYEGRGLGC-YMFRMDNDEVV 2612
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT +G ARFINHSC+PNCY+K+I+V+ +K I IYA R I GEE+TY+YKFP+E+ K+
Sbjct: 2613 DATVKGNAARFINHSCDPNCYSKMITVDNKKHIVIYALREIRTGEELTYDYKFPIEDDKL 2672
Query: 923 PCYCGSKKCHGSLN 936
C CGS++C +N
Sbjct: 2673 HCTCGSRRCRKFMN 2686
>gi|302785193|ref|XP_002974368.1| hypothetical protein SELMODRAFT_101022 [Selaginella moellendorffii]
gi|300157966|gb|EFJ24590.1| hypothetical protein SELMODRAFT_101022 [Selaginella moellendorffii]
Length = 508
Score = 152 bits (385), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 68/141 (48%), Positives = 98/141 (69%), Gaps = 1/141 (0%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFR 855
+RL F +S IH WGL A EP A D VIEY GE+IR ++D+RE R Y + +Y+F
Sbjct: 346 RRLTFGKSAIHGWGLFAKEPHGAGDMVIEYAGEIIRPTVADVREKRCYNSLVGAGTYMFC 405
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ VVDAT+ G IA INHSC PNCY++V++ G+++I I+AK+ IA G+E+TY+Y+F
Sbjct: 406 IDNERVVDATRAGSIAHLINHSCEPNCYSRVVTTNGKERIVIFAKQDIAGGDEVTYDYRF 465
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
++PC+CG+ C G +N
Sbjct: 466 TSIGDQLPCHCGTAGCRGIVN 486
>gi|149056302|gb|EDM07733.1| rCG63528 [Rattus norvegicus]
Length = 2270
Score = 152 bits (385), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2126 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2184
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2185 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2244
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2245 YKFPIEDASNKLPCNCGAKRCRRFLN 2270
>gi|47225089|emb|CAF97504.1| unnamed protein product [Tetraodon nigroviridis]
Length = 352
Score = 152 bits (385), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 77/136 (56%), Positives = 95/136 (69%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL + IEA + VIEY G +IR ++D RE Y+ GIG Y+FR+DD VV
Sbjct: 218 RSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREKYYDGKGIGC-YMFRIDDYEVV 276
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
DAT G ARFINHSC PNCY++VI+V+G+K I I+A R I GEE+TY+YKFP+EE
Sbjct: 277 DATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFASRRIYQGEELTYDYKFPIEEASS 336
Query: 921 KIPCYCGSKKCHGSLN 936
K+PC C SKKC LN
Sbjct: 337 KLPCNCNSKKCRKFLN 352
>gi|348563138|ref|XP_003467365.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Cavia
porcellus]
Length = 2692
Score = 152 bits (385), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2548 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2606
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2607 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2666
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2667 YKFPIEDASNKLPCNCGAKRCRRFLN 2692
>gi|431918577|gb|ELK17795.1| Histone-lysine N-methyltransferase MLL4 [Pteropus alecto]
Length = 3017
Score = 152 bits (385), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2873 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2931
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2932 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2991
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2992 YKFPIEDASNKLPCNCGAKRCRRFLN 3017
>gi|296233585|ref|XP_002807874.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4 [Callithrix jacchus]
Length = 2660
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2516 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2574
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2575 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2634
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2635 YKFPIEDASNKLPCNCGAKRCRRFLN 2660
>gi|432100936|gb|ELK29286.1| Histone-lysine N-methyltransferase MLL4 [Myotis davidii]
Length = 2566
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2422 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2480
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2481 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2540
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2541 YKFPIEDASNKLPCNCGAKRCRRFLN 2566
>gi|5923931|gb|AAD56420.1|AF186605_1 MLL2 protein [Homo sapiens]
Length = 2605
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2461 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2519
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2520 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2579
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2580 YKFPIEDASNKLPCNCGAKRCRRFLN 2605
>gi|7662046|ref|NP_055542.1| histone-lysine N-methyltransferase MLL4 [Homo sapiens]
gi|12643900|sp|Q9UMN6.1|MLL4_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein
4; AltName: Full=Trithorax homolog 2; AltName: Full=WW
domain-binding protein 7; Short=WBP-7
gi|5123787|emb|CAB45385.1| trithorax homologue 2 [Homo sapiens]
Length = 2715
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2571 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2629
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2630 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2689
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2690 YKFPIEDASNKLPCNCGAKRCRRFLN 2715
>gi|403293026|ref|XP_003937525.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
boliviensis boliviensis]
Length = 2665
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2521 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2579
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2580 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2639
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2640 YKFPIEDASNKLPCNCGAKRCRRFLN 2665
>gi|402905199|ref|XP_003915410.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Papio
anubis]
Length = 2716
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2572 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2630
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2631 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2690
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2691 YKFPIEDASNKLPCNCGAKRCRRFLN 2716
>gi|332855019|ref|XP_512597.3| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Pan
troglodytes]
Length = 2526
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2382 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2440
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2441 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2500
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2501 YKFPIEDASNKLPCNCGAKRCRRFLN 2526
>gi|297276803|ref|XP_001112093.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Macaca
mulatta]
Length = 2789
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2645 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2703
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2704 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2763
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2764 YKFPIEDASNKLPCNCGAKRCRRFLN 2789
>gi|255573673|ref|XP_002527758.1| phd finger protein, putative [Ricinus communis]
gi|223532845|gb|EEF34619.1| phd finger protein, putative [Ricinus communis]
Length = 1103
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 147/288 (51%), Gaps = 22/288 (7%)
Query: 667 DGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSD--GCARSSISGWEW 724
D +E + L K + + R V ++ R S+ P + GCARS +
Sbjct: 796 DDVEDQCIRLLSFCKRHKQPSNERPVTEERIGRITHRYSDYIPPCNPSGCARSEPYNYFG 855
Query: 725 HKW-----SLNASPAERARVRGAQYVHTKYLGPEVNA----SQWANGKGLSARTNRVKL- 774
+ +L A+ +R V Y+ Y E + S G S+ +K
Sbjct: 856 RRGRKEPEALAAASLKRLFVENQPYLVGGYCQHESSGITLPSNGVEGSRFSSNLQWLKTS 915
Query: 775 -----RNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
N+++ AE E ++ + +KRL F +S IH +G+ A P A D VIEY GE
Sbjct: 916 QLDAPNNIISMAEKYEYMR----QTFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGE 971
Query: 830 LIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
L+R I+D RE Y + +Y+FR++D V+DAT+ G IA INHSC PNCY++VIS
Sbjct: 972 LVRPPIADRREHFIYNSLVGAGTYMFRINDERVIDATRAGSIAHLINHSCEPNCYSRVIS 1031
Query: 889 VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
V G + I I+AKR I EE+TY+Y+F ++++ CYCG +C G +N
Sbjct: 1032 VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 1079
>gi|301771069|ref|XP_002920938.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Ailuropoda melanoleuca]
Length = 2611
Score = 152 bits (384), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2467 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2525
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2526 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2585
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2586 YKFPIEDASNKLPCNCGAKRCRRFLN 2611
>gi|426388428|ref|XP_004060643.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Gorilla
gorilla gorilla]
Length = 2536
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2392 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2450
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2451 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2510
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2511 YKFPIEDASNKLPCNCGAKRCRRFLN 2536
>gi|351711122|gb|EHB14041.1| Histone-lysine N-methyltransferase MLL4, partial [Heterocephalus
glaber]
Length = 2592
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2448 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2506
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2507 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2566
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2567 YKFPIEDASNKLPCNCGAKRCRRFLN 2592
>gi|335289510|ref|XP_003127115.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Sus scrofa]
Length = 2721
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2577 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2635
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2636 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2695
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2696 YKFPIEDASNKLPCNCGAKRCRRFLN 2721
>gi|71891784|dbj|BAA20763.3| KIAA0304 protein [Homo sapiens]
Length = 2415
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2271 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2329
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2330 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2389
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2390 YKFPIEDASNKLPCNCGAKRCRRFLN 2415
>gi|397490588|ref|XP_003816282.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Pan paniscus]
Length = 2776
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2632 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2690
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2691 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2750
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2751 YKFPIEDASNKLPCNCGAKRCRRFLN 2776
>gi|33990004|gb|AAH56344.1| Wbp7 protein, partial [Mus musculus]
Length = 2013
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 1869 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 1927
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 1928 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 1987
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 1988 YKFPIEDASNKLPCNCGAKRCRRFLN 2013
>gi|359318839|ref|XP_003432729.2| PREDICTED: histone-lysine N-methyltransferase MLL4, partial [Canis
lupus familiaris]
Length = 2713
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2569 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2627
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2628 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2687
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2688 YKFPIEDASNKLPCNCGAKRCRRFLN 2713
>gi|358416718|ref|XP_003583467.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
Length = 2688
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2544 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2602
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2603 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2662
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2663 YKFPIEDASNKLPCNCGAKRCRRFLN 2688
>gi|344298323|ref|XP_003420843.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Loxodonta africana]
Length = 2200
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2056 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2114
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2115 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2174
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2175 YKFPIEDASNKLPCNCGAKRCRRFLN 2200
>gi|359075420|ref|XP_003587289.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
Length = 2711
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2567 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2625
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2626 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2685
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2686 YKFPIEDASNKLPCNCGAKRCRRFLN 2711
>gi|224065044|ref|XP_002301643.1| SET domain protein [Populus trichocarpa]
gi|222843369|gb|EEE80916.1| SET domain protein [Populus trichocarpa]
Length = 1014
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 76/162 (46%), Positives = 105/162 (64%), Gaps = 5/162 (3%)
Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
N+L+ AE + ++ + +KRL F +S IH +G+ A P A D VIEY GEL+R I
Sbjct: 833 NILSMAEKYQHMR----QTFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPPI 888
Query: 836 SDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
+D RE Y + +Y+FR+DD V+DAT+ G IA INHSC PNCY++VISV G +
Sbjct: 889 ADRRERFIYNSLVGAGTYMFRIDDKRVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 948
Query: 895 IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I I+AKR I EE+TY+Y+F E+K+ CYCG +C G +N
Sbjct: 949 IIIFAKRDIKRWEELTYDYRFFSIEEKLACYCGFPRCRGVVN 990
>gi|441627688|ref|XP_003280142.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Nomascus leucogenys]
Length = 2433
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2289 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2347
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2348 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2407
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2408 YKFPIEDASNKLPCNCGAKRCRRFLN 2433
>gi|312371947|gb|EFR20005.1| hypothetical protein AND_20789 [Anopheles darlingi]
Length = 4717
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 70/131 (53%), Positives = 90/131 (68%), Gaps = 1/131 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
++ K+ + RS IH GL IEA + VIEY GELIR ++D RE Y+ GIG Y
Sbjct: 4568 ESSKESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTLTDKRERYYDGRGIGC-Y 4626
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+F++D+ +VVDAT RG ARFINHSC PNCY+KV+ + G K I I+A R I GEE+TY+
Sbjct: 4627 MFKIDENFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYD 4686
Query: 913 YKFPLEEKKIP 923
YKFP E+ KIP
Sbjct: 4687 YKFPFEDVKIP 4697
>gi|223365716|pdb|2W5Y|A Chain A, Binary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
Domain With The Cofactor Product S-Adenosylhomocysteine.
gi|223365717|pdb|2W5Z|A Chain A, Ternary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
Domain With The Cofactor Product S-Adenosylhomocysteine
And Histone Peptide
Length = 192
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y
Sbjct: 48 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 106
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+
Sbjct: 107 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 166
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+KKC LN
Sbjct: 167 YKFPIEDASNKLPCNCGAKKCRKFLN 192
>gi|410905295|ref|XP_003966127.1| PREDICTED: uncharacterized protein LOC101073293 [Takifugu rubripes]
Length = 3463
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 76/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL IEA + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 3319 KTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDSKGIGC-Y 3377
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+VEG+K I I+A R I GEE+TY+
Sbjct: 3378 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYD 3437
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+ C CG+++C LN
Sbjct: 3438 YKFPIEDASNKLNCNCGARRCRRFLN 3463
>gi|28277052|gb|AAH44818.1| Mll1 protein, partial [Mus musculus]
Length = 142
Score = 151 bits (382), Expect = 1e-33, Method: Composition-based stats.
Identities = 76/143 (53%), Positives = 97/143 (67%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K+ + RS IH GL I+A + VIEY G +IR +D RE Y+ GIG Y+FR
Sbjct: 1 KEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-YMFR 59
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I GEE+TY+YKF
Sbjct: 60 IDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKF 119
Query: 916 PLEE--KKIPCYCGSKKCHGSLN 936
P+E+ K+PC CG+KKC LN
Sbjct: 120 PIEDASNKLPCNCGAKKCRKFLN 142
>gi|348533938|ref|XP_003454461.1| PREDICTED: hypothetical protein LOC100700132 [Oreochromis niloticus]
Length = 2924
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 76/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL IEA + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2780 KTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDGKGIGC-Y 2838
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+VEG+K I I+A R I GEE+TY+
Sbjct: 2839 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYD 2898
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+ C CG+++C LN
Sbjct: 2899 YKFPIEDASNKLNCNCGARRCRRFLN 2924
>gi|443723097|gb|ELU11678.1| hypothetical protein CAPTEDRAFT_179499 [Capitella teleta]
Length = 323
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 69/134 (51%), Positives = 93/134 (69%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL ++ + + +IEY G +IR ++D RE YE GIG Y+FR+DD +V+
Sbjct: 191 RSLIHGRGLFSIRDFDVGEMIIEYAGMVIRSSLTDSREKYYETKGIGC-YMFRIDDHFVI 249
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC+PNC+++VI V G K I I+A + + AGEE+TY+YKFP E+ KI
Sbjct: 250 DATVNGNAARFINHSCDPNCFSRVIDVSGVKHIIIFALKAVKAGEELTYDYKFPYEDVKI 309
Query: 923 PCYCGSKKCHGSLN 936
PC C +K+C LN
Sbjct: 310 PCTCQAKRCRKYLN 323
>gi|440894918|gb|ELR47236.1| Histone-lysine N-methyltransferase MLL4, partial [Bos grunniens
mutus]
Length = 2524
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2380 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2438
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2439 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2498
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2499 YKFPIEDASNKLPCNCGAKRCRRFLN 2524
>gi|444509617|gb|ELV09373.1| Histone-lysine N-methyltransferase MLL4, partial [Tupaia chinensis]
Length = 2209
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2065 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2123
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 2124 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2183
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 2184 YKFPIEDASNKLPCNCGAKRCRRFLN 2209
>gi|26006129|dbj|BAC41407.1| mKIAA0304 protein [Mus musculus]
Length = 1744
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 1600 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 1658
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 1659 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 1718
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 1719 YKFPIEDASNKLPCNCGAKRCRRFLN 1744
>gi|195404288|ref|XP_002060445.1| GJ19853 [Drosophila virilis]
gi|194156330|gb|EDW71514.1| GJ19853 [Drosophila virilis]
Length = 1720
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 1567 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 1626
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 1627 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 1685
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KIPC CG+ C +N
Sbjct: 1686 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 1720
>gi|168007105|ref|XP_001756249.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
patens]
gi|162692759|gb|EDQ79115.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
patens]
Length = 1900
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 97/145 (66%), Gaps = 6/145 (4%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK----MGIGSSYL 853
+L +S IH GL + I + V+EYVGE++ +++D RE Y G+ YL
Sbjct: 1756 KLGVYKSGIHALGLYTTDFIAEGEVVVEYVGEIVGSRVADKREAEYHSGKRLQYQGACYL 1815
Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
FR+D ++DAT++GGIARF+NHSC+PNC KVI VE KK+ +AKR I AGEE+TY+Y
Sbjct: 1816 FRIDTEQIIDATRKGGIARFVNHSCSPNCVAKVICVENLKKVVFFAKRDIYAGEEVTYDY 1875
Query: 914 KFPLEE--KKIPCYCGSKKCHGSLN 936
KF +E KIPC+CG+ +C G+LN
Sbjct: 1876 KFNCDEVGDKIPCFCGTPECRGTLN 1900
>gi|170050212|ref|XP_001859676.1| mixed-lineage leukemia protein [Culex quinquefasciatus]
gi|167871728|gb|EDS35111.1| mixed-lineage leukemia protein [Culex quinquefasciatus]
Length = 2977
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 102/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K R RSKI GL A +E VIEY+GE+IR+++S++RE +Y
Sbjct: 2824 KSSQYKKMKLEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRVEVSELREKQY 2883
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ VE + +I I+AKR I
Sbjct: 2884 EARNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVEREVRIIIFAKRRI 2942
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KI C CG+ C +N
Sbjct: 2943 NRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 2977
>gi|154422490|ref|XP_001584257.1| SET domain containing protein [Trichomonas vaginalis G3]
gi|121918503|gb|EAY23271.1| SET domain containing protein [Trichomonas vaginalis G3]
Length = 259
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 71/147 (48%), Positives = 98/147 (66%), Gaps = 2/147 (1%)
Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
++++ K LRF++S IH WG+ + + ++EY GEL+RL +++ R+ YE G S
Sbjct: 92 IRSQLKYLRFEKSGIHLWGVFSACYFAPGEPIVEYTGELVRLSVTEARQKYYETEGNHGS 151
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
Y+FRLDD +DAT +GGIARF+NHSC+PNC T V+ GQ+ I I+AK+ I EE+TY
Sbjct: 152 YIFRLDDDLYIDATHKGGIARFLNHSCDPNCKTCVVEAGGQRHIVIFAKKKIEPFEELTY 211
Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
+Y P E EK I C CGS KC G LN
Sbjct: 212 DYNLPYESKEKAIVCLCGSPKCRGYLN 238
>gi|432909101|ref|XP_004078112.1| PREDICTED: uncharacterized protein LOC101174945 [Oryzias latipes]
Length = 3692
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 74/136 (54%), Positives = 94/136 (69%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL IEA + VIEY G +IR ++D RE Y+ GIG Y+FR+DD VV
Sbjct: 3558 RSAIHGRGLFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDGKGIGC-YMFRIDDFDVV 3616
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
DAT G ARFINHSC PNCY++VI+VEG+K I I+A R I GEE+TY+YKFP+E+
Sbjct: 3617 DATMHGNAARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYDYKFPIEDASN 3676
Query: 921 KIPCYCGSKKCHGSLN 936
K+ C CG+++C LN
Sbjct: 3677 KLNCNCGARRCRRFLN 3692
>gi|432880997|ref|XP_004073754.1| PREDICTED: uncharacterized protein LOC101157226 [Oryzias latipes]
Length = 2812
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/146 (51%), Positives = 100/146 (68%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL IEA + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 2668 KTSKEAVGVYRSLIHGRGLFCKRNIEAGEMVIEYAGTVIRSVLTDKREKFYDGKGIGC-Y 2726
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT +G ARFINHSC PNCY++VI+V+G+K I I+A R I GEE+TY+
Sbjct: 2727 MFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYRGEELTYD 2786
Query: 913 YKFPLEEK--KIPCYCGSKKCHGSLN 936
YKFP+E++ K+ C CG+++C LN
Sbjct: 2787 YKFPIEDEDNKLHCNCGTRRCRRFLN 2812
>gi|38565948|gb|AAH62210.1| Wbp7 protein, partial [Mus musculus]
Length = 442
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 298 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 356
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 357 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 416
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 417 YKFPIEDASNKLPCNCGAKRCRRFLN 442
>gi|195477920|ref|XP_002100342.1| GE16999 [Drosophila yakuba]
gi|194187866|gb|EDX01450.1| GE16999 [Drosophila yakuba]
Length = 1421
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 1268 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 1327
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 1328 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 1386
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 1387 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1421
>gi|221508321|gb|EEE33908.1| SET domain-containing protein / bromodomain-containing protein,
putative [Toxoplasma gondii VEG]
Length = 7555
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 73/145 (50%), Positives = 97/145 (66%), Gaps = 6/145 (4%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG---IGSSYLF 854
RL + S IH GL A P+ + VIEYVG+++R +SD+RE YEK G G+ Y+F
Sbjct: 7411 RLSVRSSTIHGQGLFANVPLAEGEPVIEYVGDMVRNCVSDLREALYEKQGGGGDGACYMF 7470
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
RLDD +VVDAT+ G ++RFINHSC PNC +++ E GQK I I AK I AGEEITY+Y
Sbjct: 7471 RLDDNFVVDATRAGNVSRFINHSCEPNCTCRILVCEAGQKHIVIIAKTAIRAGEEITYDY 7530
Query: 914 KFPL--EEKKIPCYCGSKKCHGSLN 936
+F + E K+ C CG++ C G +N
Sbjct: 7531 QFGIGNETDKLACLCGARSCLGRMN 7555
>gi|237834073|ref|XP_002366334.1| SET domain-containing protein / Bromodomain-containing protein
[Toxoplasma gondii ME49]
gi|211963998|gb|EEA99193.1| SET domain-containing protein / Bromodomain-containing protein
[Toxoplasma gondii ME49]
Length = 7551
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 73/145 (50%), Positives = 97/145 (66%), Gaps = 6/145 (4%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG---IGSSYLF 854
RL + S IH GL A P+ + VIEYVG+++R +SD+RE YEK G G+ Y+F
Sbjct: 7407 RLSVRSSTIHGQGLFANVPLAEGEPVIEYVGDMVRNCVSDLREALYEKQGGGGDGACYMF 7466
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
RLDD +VVDAT+ G ++RFINHSC PNC +++ E GQK I I AK I AGEEITY+Y
Sbjct: 7467 RLDDNFVVDATRAGNVSRFINHSCEPNCTCRILVCEAGQKHIVIIAKTAIRAGEEITYDY 7526
Query: 914 KFPL--EEKKIPCYCGSKKCHGSLN 936
+F + E K+ C CG++ C G +N
Sbjct: 7527 QFGIGNETDKLACLCGARSCLGRMN 7551
>gi|221486558|gb|EEE24819.1| SET domain-containing protein / bromodomain-containing protein,
putative [Toxoplasma gondii GT1]
Length = 7565
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 73/145 (50%), Positives = 97/145 (66%), Gaps = 6/145 (4%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG---IGSSYLF 854
RL + S IH GL A P+ + VIEYVG+++R +SD+RE YEK G G+ Y+F
Sbjct: 7421 RLSVRSSTIHGQGLFANVPLAEGEPVIEYVGDMVRNCVSDLREALYEKQGGGGDGACYMF 7480
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
RLDD +VVDAT+ G ++RFINHSC PNC +++ E GQK I I AK I AGEEITY+Y
Sbjct: 7481 RLDDNFVVDATRAGNVSRFINHSCEPNCTCRILVCEAGQKHIVIIAKTAIRAGEEITYDY 7540
Query: 914 KFPL--EEKKIPCYCGSKKCHGSLN 936
+F + E K+ C CG++ C G +N
Sbjct: 7541 QFGIGNETDKLACLCGARSCLGRMN 7565
>gi|148692059|gb|EDL24006.1| WW domain binding protein 7 [Mus musculus]
Length = 209
Score = 150 bits (379), Expect = 3e-33, Method: Composition-based stats.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 65 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 123
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 124 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 183
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 184 YKFPIEDASNKLPCNCGAKRCRRFLN 209
>gi|211828317|gb|AAH07353.3| MLL4 protein [Homo sapiens]
Length = 172
Score = 150 bits (379), Expect = 4e-33, Method: Composition-based stats.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 28 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 86
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 87 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 146
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 147 YKFPIEDASNKLPCNCGAKRCRRFLN 172
>gi|380796799|gb|AFE70275.1| histone-lysine N-methyltransferase MLL4, partial [Macaca mulatta]
Length = 396
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 252 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 310
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 311 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 370
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C LN
Sbjct: 371 YKFPIEDASNKLPCNCGAKRCRRFLN 396
>gi|260836403|ref|XP_002613195.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
gi|229298580|gb|EEN69204.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
Length = 313
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/136 (55%), Positives = 94/136 (69%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL I++ + VIEY G +IR ++D RE Y GIG Y+FR+DD VV
Sbjct: 179 RSPIHGRGLFCKRNIDSGEMVIEYAGMVIRSVLTDKRENYYNSKGIGC-YMFRIDDYEVV 237
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK-- 920
DAT G ARFINHSC+PNCY++VI VEG+K I I+A R I GEE+TY+YKFP+E++
Sbjct: 238 DATMHGNAARFINHSCDPNCYSRVIQVEGKKHIVIFAMRKIYKGEELTYDYKFPIEDQNS 297
Query: 921 KIPCYCGSKKCHGSLN 936
KI C CGSK+C LN
Sbjct: 298 KIDCTCGSKRCRKYLN 313
>gi|158287424|ref|XP_309452.4| AGAP011192-PA [Anopheles gambiae str. PEST]
gi|157019644|gb|EAA05242.4| AGAP011192-PA [Anopheles gambiae str. PEST]
Length = 2808
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 102/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++S++RE +Y
Sbjct: 2655 KSSQYKKMKQEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSEMREKQY 2714
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ + +I I+AKR I
Sbjct: 2715 EARNRGI-YMFRLDEERVVDATLSGGLARYINHSCNPNCVTEIVEVDRELRIIIFAKRRI 2773
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KI C CG+ C +N
Sbjct: 2774 NRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 2808
>gi|328714763|ref|XP_003245446.1| PREDICTED: hypothetical protein LOC100571907 [Acyrthosiphon pisum]
Length = 2204
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++ D RE Y
Sbjct: 2051 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSQLCDYREKLY 2110
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLDD VVDAT GG+AR+INHSCNPNC T+ + V+ + +I I+AKR I
Sbjct: 2111 EAKNRGI-YMFRLDDDRVVDATISGGLARYINHSCNPNCVTEKVEVDRELRIIIFAKRRI 2169
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
A GEE+ Y+Y+F +E + KIPC CG+ C +N
Sbjct: 2170 ARGEELAYDYQFDIEDDQHKIPCNCGAPNCRKWMN 2204
>gi|170055483|ref|XP_001863602.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167875425|gb|EDS38808.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1357
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 67/119 (56%), Positives = 93/119 (78%), Gaps = 2/119 (1%)
Query: 763 KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
+GLS AR+N+ +L A+ +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1216 QGLSREARSNQRRLLTAFGASTESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1275
Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
+ VIEYVG+++R ++D+RE +YE +GIGSSYLFR+D ++DATK G +ARFINHSCN
Sbjct: 1276 EMVIEYVGQMVRPSVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCN 1334
>gi|195170125|ref|XP_002025864.1| GL18348 [Drosophila persimilis]
gi|194110717|gb|EDW32760.1| GL18348 [Drosophila persimilis]
Length = 1046
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 893 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 952
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 953 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 1011
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KIPC CG+ C +N
Sbjct: 1012 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 1046
>gi|21064853|gb|AAM29656.1| SD13650p [Drosophila melanogaster]
Length = 1019
Score = 149 bits (376), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 866 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 925
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 926 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 984
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 985 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1019
>gi|195564658|ref|XP_002105931.1| GD16408 [Drosophila simulans]
gi|194203296|gb|EDX16872.1| GD16408 [Drosophila simulans]
Length = 1076
Score = 149 bits (376), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 923 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 982
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 983 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 1041
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 1042 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1076
>gi|195347745|ref|XP_002040412.1| GM18944 [Drosophila sechellia]
gi|194121840|gb|EDW43883.1| GM18944 [Drosophila sechellia]
Length = 898
Score = 149 bits (376), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 745 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 804
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 805 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 863
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 864 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 898
>gi|358338843|dbj|GAA57433.1| histone-lysine N-methyltransferase trithorax, partial [Clonorchis
sinensis]
Length = 328
Score = 149 bits (376), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 71/143 (49%), Positives = 91/143 (63%), Gaps = 1/143 (0%)
Query: 794 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
AR R+ S IH GL AL ++ VIEY+GELIR + + RE RY G+ Y+
Sbjct: 187 ARYLRVAVHPSHIHGRGLFALRGFREDEMVIEYMGELIRNFVCETREIRYRSAGV-DCYM 245
Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
FR+D V+DAT G ARFINHSC+PNCY KV++V+ +K I I A+R I GEE+TY+Y
Sbjct: 246 FRIDSDLVIDATYAGNAARFINHSCDPNCYAKVVTVDDKKHIVILAQRRIYPGEELTYDY 305
Query: 914 KFPLEEKKIPCYCGSKKCHGSLN 936
+FP E K+ C CGS C LN
Sbjct: 306 RFPKESDKLLCNCGSYNCRKYLN 328
>gi|12847486|dbj|BAB27589.1| unnamed protein product [Mus musculus]
Length = 209
Score = 149 bits (375), Expect = 9e-33, Method: Composition-based stats.
Identities = 76/146 (52%), Positives = 97/146 (66%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 65 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 123
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+
Sbjct: 124 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 183
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+PC CG+K+C N
Sbjct: 184 YKFPIEDASNKLPCNCGAKRCRRFFN 209
>gi|399218649|emb|CCF75536.1| unnamed protein product [Babesia microti strain RI]
Length = 2180
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 80/156 (51%), Positives = 107/156 (68%), Gaps = 5/156 (3%)
Query: 786 LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY-E 844
L + Q ++ RLR + S IH GL A + I++ + VIEY+GELIR ISD+RE Y +
Sbjct: 2025 LYRYIQSLSQPSRLRVRGSPIHGIGLFANQLIQSGEPVIEYIGELIRDVISDVREKFYND 2084
Query: 845 KMGI-GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRH 902
K GI GS Y+F+L++ VVDAT+ G +ARFINHSC PNC ++I+ E G K I I+AKR
Sbjct: 2085 KWGIDGSCYMFKLNEQLVVDATRMGNMARFINHSCEPNCICRIITCESGLKHIVIFAKRD 2144
Query: 903 IAAGEEITYNYKFPL--EEKKIPCYCGSKKCHGSLN 936
I GEEITYNY+F + E KK+ C+CG+ C G +N
Sbjct: 2145 INPGEEITYNYQFGIEAEAKKLSCHCGAPSCLGRMN 2180
>gi|303286289|ref|XP_003062434.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226455951|gb|EEH53253.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 491
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 65/139 (46%), Positives = 93/139 (66%), Gaps = 1/139 (0%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
+R F +S IH WGL+A +P++A VIE+ GE+++ ++D+RE Y+ I YL +
Sbjct: 353 ERFTFGKSNIHGWGLIAKKPVKAGSMVIEFRGEIVKPHVADLREKAYDDANI-DCYLLKA 411
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP 916
D+ V+D T RG IARF NHSCNPN YTK++SV+G I +A+ + GEE+TY+Y+F
Sbjct: 412 DEKTVIDTTMRGNIARFTNHSCNPNMYTKIVSVDGSNHIIFFARVDVQPGEEMTYDYRFD 471
Query: 917 LEEKKIPCYCGSKKCHGSL 935
E K+PCYCG+ C G L
Sbjct: 472 AESGKVPCYCGAHNCRGFL 490
>gi|30685011|ref|NP_850170.1| histone-lysine N-methyltransferase ATX1 [Arabidopsis thaliana]
gi|95147534|sp|Q9C5X4.2|ATX1_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX1; AltName:
Full=Protein SET DOMAIN GROUP 27; AltName:
Full=Trithorax-homolog protein 1; Short=TRX-homolog
protein 1
gi|330253475|gb|AEC08569.1| histone-lysine N-methyltransferase ATX1 [Arabidopsis thaliana]
Length = 1062
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 93/249 (37%), Positives = 137/249 (55%), Gaps = 15/249 (6%)
Query: 696 KTKASKSRTSNLCPRSD--GCARS---SISGWEWHKW--SLNASPAERARVRGAQYVHTK 748
+ K++ +TS P + GCAR+ + G K +L A+ ++R V YV
Sbjct: 797 RIKSATHKTSEYLPPPNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGG 856
Query: 749 YLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHD 808
Y E + + +G +S N+L+ AE ++ + +KRL F +S IH
Sbjct: 857 YSRLEFSTYKSIHGSKVSQMNTP---SNILSMAEKYRYMRETY----RKRLAFGKSGIHG 909
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKR 867
+G+ A P A D +IEY GEL+R I+D RE Y M +Y+FR+DD V+DAT+
Sbjct: 910 FGIFAKLPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRT 969
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G IA INHSC PNCY++VI+V G + I I+AKRHI EE+TY+Y+F +++ C CG
Sbjct: 970 GSIAHLINHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCG 1029
Query: 928 SKKCHGSLN 936
C G +N
Sbjct: 1030 FPGCRGVVN 1038
>gi|12659210|gb|AAK01237.1|AF329273_1 trithorax-like protein 1 [Arabidopsis thaliana]
Length = 1062
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 93/249 (37%), Positives = 137/249 (55%), Gaps = 15/249 (6%)
Query: 696 KTKASKSRTSNLCPRSD--GCARS---SISGWEWHKW--SLNASPAERARVRGAQYVHTK 748
+ K++ +TS P + GCAR+ + G K +L A+ ++R V YV
Sbjct: 797 RIKSATHKTSEYLPPPNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGG 856
Query: 749 YLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHD 808
Y E + + +G +S N+L+ AE ++ + +KRL F +S IH
Sbjct: 857 YSRLEFSTYKSIHGSKVSQMNTP---SNILSMAEKYRYMRETY----RKRLAFGKSGIHG 909
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKR 867
+G+ A P A D +IEY GEL+R I+D RE Y M +Y+FR+DD V+DAT+
Sbjct: 910 FGIFAKLPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRT 969
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
G IA INHSC PNCY++VI+V G + I I+AKRHI EE+TY+Y+F +++ C CG
Sbjct: 970 GSIAHLINHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCG 1029
Query: 928 SKKCHGSLN 936
C G +N
Sbjct: 1030 FPGCRGVVN 1038
>gi|225433774|ref|XP_002268621.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Vitis
vinifera]
Length = 1084
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/196 (40%), Positives = 115/196 (58%), Gaps = 4/196 (2%)
Query: 744 YVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQV--KARKKRLRF 801
Y + LG +++S + G S R ++K L A +++ + +KRL F
Sbjct: 866 YCQHESLGNPLSSSALS-GSKFSFRNQKIKASQLDAPKSILSMVEKYNYMRETFRKRLAF 924
Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGY 860
+S IH +G+ A +P A D VIEY GEL+R I+D RE Y + +Y+FR+DD
Sbjct: 925 GKSGIHGFGIFAKQPHRAGDMVIEYTGELVRPSIADRRERLIYNSLVGAGTYMFRIDDER 984
Query: 861 VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK 920
V+DAT+ G IA INHSC PNCY++VIS G I I+AKR I EE+TY+Y+F ++
Sbjct: 985 VIDATRAGSIAHLINHSCEPNCYSRVISFNGDDHIIIFAKRDIKRWEELTYDYRFFSIDE 1044
Query: 921 KIPCYCGSKKCHGSLN 936
++ CYCG +C G +N
Sbjct: 1045 QLACYCGFPRCRGVVN 1060
>gi|157109807|ref|XP_001650833.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108878935|gb|EAT43160.1| AAEL005380-PA [Aedes aegypti]
Length = 2874
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K R RSKI GL A +E VIEY+GE+IR ++S++RE +Y
Sbjct: 2721 KSSQYKKMKLEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSELREKQY 2780
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+ + VE +I I+AKR I
Sbjct: 2781 EARNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTETVEVERDLRIIIFAKRRI 2839
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KI C CG+ C +N
Sbjct: 2840 NRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 2874
>gi|47225575|emb|CAG12058.1| unnamed protein product [Tetraodon nigroviridis]
Length = 480
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL IEA + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 336 KTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDSKGIGC-Y 394
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY++VI+VEG+K I I+A R I GEE+TY+
Sbjct: 395 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYD 454
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
YKFP+E+ K+ C CG+++C LN
Sbjct: 455 YKFPIEDASNKLNCNCGARRCRRFLN 480
>gi|4582456|gb|AAD24840.1| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
Length = 186
Score = 148 bits (374), Expect = 1e-32, Method: Composition-based stats.
Identities = 75/162 (46%), Positives = 104/162 (64%), Gaps = 6/162 (3%)
Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
N+L+ AE ++ + +KRL F +S IH +G+ A P A D +IEY GEL+R I
Sbjct: 6 NILSMAEKYRYMRETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMMIEYTGELVRPSI 61
Query: 836 SDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
+D RE Y MG G+ Y+FR+DD V+DAT+ G IA INHSC PNCY++VI+V G +
Sbjct: 62 ADKREQLIYNSMGAGT-YMFRIDDERVIDATRTGSIAHLINHSCVPNCYSRVITVNGDEH 120
Query: 895 IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I I+AKRHI EE+TY+Y+F +++ C CG C G +N
Sbjct: 121 IIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGCRGVVN 162
>gi|297826607|ref|XP_002881186.1| hypothetical protein ARALYDRAFT_902198 [Arabidopsis lyrata subsp.
lyrata]
gi|297327025|gb|EFH57445.1| hypothetical protein ARALYDRAFT_902198 [Arabidopsis lyrata subsp.
lyrata]
Length = 1066
Score = 148 bits (373), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 122/210 (58%), Gaps = 8/210 (3%)
Query: 728 SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELL 787
+L A+ ++R V YV Y E + + +G +S N+L+ AE +
Sbjct: 841 ALAAASSKRLFVENQPYVIGGYSKIEFSTYECIHGSKVSQMNTP---SNILSMAEKYRYM 897
Query: 788 KASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKM 846
K + +KRL F +S IH +G+ A P +A D +IEY GEL+R I+D RE Y M
Sbjct: 898 KETY----RKRLAFGKSGIHGFGIFAKLPHKAGDMMIEYTGELVRPSIADKRERLIYNSM 953
Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
+Y+FR+DD V+DAT+ G IA INHSC PNCY++VI+V G++ I I+AKR I
Sbjct: 954 VGAGTYMFRIDDKRVIDATRAGSIAHLINHSCVPNCYSRVITVNGEEHIIIFAKRDIPIW 1013
Query: 907 EEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
EE+TY+Y+F +++ C CG + C G +N
Sbjct: 1014 EELTYDYRFFSIGERLSCSCGFQGCRGVVN 1043
>gi|17861882|gb|AAL39418.1| GM10003p [Drosophila melanogaster]
Length = 421
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE +IEY+GE+IR ++S+IRE +Y
Sbjct: 268 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 327
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+++ V+ +I I+AKR I
Sbjct: 328 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 386
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E++ KIPC CG+ C +N
Sbjct: 387 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 421
>gi|302755933|ref|XP_002961390.1| hypothetical protein SELMODRAFT_164635 [Selaginella moellendorffii]
gi|300170049|gb|EFJ36650.1| hypothetical protein SELMODRAFT_164635 [Selaginella moellendorffii]
Length = 749
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 100/145 (68%), Gaps = 2/145 (1%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K R+ F +S IH WGL A IE + V+EY GE +R ++D+RE RY G Y
Sbjct: 606 KTEKSRVCFGKSGIHGWGLFARRTIEEGEIVVEYRGEQVRRSVADLREKRYRDQG-KDCY 664
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
LF++ + V+DAT++G I R INHSC+P+CY +++ V+G++ +I + AKR++AAGEE+TY
Sbjct: 665 LFKISEEIVIDATEKGNIGRLINHSCSPSCYARILCVDGEESRIVLIAKRNVAAGEELTY 724
Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
+Y+F E+KK+PC CGS C +N
Sbjct: 725 DYQFEEEDKKVPCLCGSDSCRQYMN 749
>gi|302798240|ref|XP_002980880.1| hypothetical protein SELMODRAFT_178454 [Selaginella moellendorffii]
gi|300151419|gb|EFJ18065.1| hypothetical protein SELMODRAFT_178454 [Selaginella moellendorffii]
Length = 749
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 100/145 (68%), Gaps = 2/145 (1%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K R+ F +S IH WGL A IE + V+EY GE +R ++D+RE RY G Y
Sbjct: 606 KTEKSRVCFGKSGIHGWGLFARRTIEEGEIVVEYRGEQVRRSVADLREKRYRDQG-KDCY 664
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
LF++ + V+DAT++G I R INHSC+P+CY +++ V+G++ +I + AKR++AAGEE+TY
Sbjct: 665 LFKISEEIVIDATEKGNIGRLINHSCSPSCYARILCVDGEESRIVLIAKRNVAAGEELTY 724
Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
+Y+F E+KK+PC CGS C +N
Sbjct: 725 DYQFEEEDKKVPCLCGSDSCRQYMN 749
>gi|321466655|gb|EFX77649.1| hypothetical protein DAPPUDRAFT_105888 [Daphnia pulex]
Length = 208
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 71/134 (52%), Positives = 91/134 (67%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL IE + V+EY G++IR + D RE YE G+G Y+FR+D+ VV
Sbjct: 76 RSHIHGRGLFCKRDIECGEMVVEYAGQVIRSVLCDKREKEYEAKGMGC-YMFRIDEQTVV 134
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
DAT G ARFINHSC PNCY++++ + G+K I I+A R I GEE+TY+YKFPLE+ KI
Sbjct: 135 DATVHGNAARFINHSCEPNCYSRIVDIFGKKHIIIFALRRILRGEELTYDYKFPLEDVKI 194
Query: 923 PCYCGSKKCHGSLN 936
C CGS+KC LN
Sbjct: 195 RCTCGSRKCRKYLN 208
>gi|322792928|gb|EFZ16758.1| hypothetical protein SINV_09282 [Solenopsis invicta]
Length = 3429
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++++ RE +Y
Sbjct: 3276 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 3335
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC +++ VE +I I+AKR I
Sbjct: 3336 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 3394
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 3395 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 3429
>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
finger protein HRX) (ALL-1) (Trithorax-like protein)
(Lysine N-methyltransferase 2A) (CXXC-type zinc finger
protein 7) [Ciona intestinalis]
Length = 3406
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 94/136 (69%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH GL ++ + ++EY G++IR +++D RE YE IG Y+FR+DD YVV
Sbjct: 3272 RSTIHGRGLYCKRDFDSGEMIMEYTGQIIRQELTDKREKYYESKSIGC-YMFRMDDFYVV 3330
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK-- 920
DAT G ARFINHSC+PNCY++++ EG+K I I+A R I GEE+TY+YKFP+E++
Sbjct: 3331 DATVLGSGARFINHSCDPNCYSRIVQFEGKKHIVIFALREIYKGEELTYDYKFPIEDENH 3390
Query: 921 KIPCYCGSKKCHGSLN 936
KI C CG++ C +N
Sbjct: 3391 KIACTCGARLCRKFMN 3406
>gi|356568903|ref|XP_003552647.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Glycine max]
Length = 1088
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 90/252 (35%), Positives = 136/252 (53%), Gaps = 18/252 (7%)
Query: 703 RTSNLC------PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLG 751
R + LC P GCARS + + +L A+ +R V Y+ Y
Sbjct: 813 RVAGLCSDYEPPPNLSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 872
Query: 752 PEVNASQWANGKGLSAR--TNRVKLRNLLAAAEGAELLKASQV----KARKKRLRFQRSK 805
+ +G+G+ ++ ++ +LR L + L + + + +KRL F +S+
Sbjct: 873 HGSLNNLEPSGRGVCSKFFCSQQRLRTSLVDTSNSILSISEKYTYMRETFRKRLAFGKSR 932
Query: 806 IHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDA 864
IH +G+ A + D VIEY GEL+R I+D RE Y + +Y+FR+DD V+DA
Sbjct: 933 IHGFGIFAKHAYKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDERVIDA 992
Query: 865 TKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPC 924
T+ G IA INHSC PNCY++VISV G + I I+AKR I EE+TY+Y+F ++++ C
Sbjct: 993 TRAGSIAHLINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDERLAC 1052
Query: 925 YCGSKKCHGSLN 936
YCG KC G +N
Sbjct: 1053 YCGFPKCRGIVN 1064
>gi|348530102|ref|XP_003452550.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like
[Oreochromis niloticus]
Length = 399
Score = 146 bits (369), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/146 (50%), Positives = 100/146 (68%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 255 KISKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYAGTVIRAVLTDKREKYYDSKGIGC-Y 313
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT +G ARFINHSC PNCY++VI+V+G+K I I+A R I GEE+TY+
Sbjct: 314 MFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYRGEELTYD 373
Query: 913 YKFPLEEK--KIPCYCGSKKCHGSLN 936
YKFP+E++ K+ C CG+++C LN
Sbjct: 374 YKFPIEDESNKLLCNCGARRCRRYLN 399
>gi|297745169|emb|CBI39161.3| unnamed protein product [Vitis vinifera]
Length = 1068
Score = 146 bits (368), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 69/142 (48%), Positives = 94/142 (66%), Gaps = 1/142 (0%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLF 854
+KRL F +S IH +G+ A +P A D VIEY GEL+R I+D RE Y + +Y+F
Sbjct: 903 RKRLAFGKSGIHGFGIFAKQPHRAGDMVIEYTGELVRPSIADRRERLIYNSLVGAGTYMF 962
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
R+DD V+DAT+ G IA INHSC PNCY++VIS G I I+AKR I EE+TY+Y+
Sbjct: 963 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISFNGDDHIIIFAKRDIKRWEELTYDYR 1022
Query: 915 FPLEEKKIPCYCGSKKCHGSLN 936
F ++++ CYCG +C G +N
Sbjct: 1023 FFSIDEQLACYCGFPRCRGVVN 1044
>gi|168029907|ref|XP_001767466.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
gi|162681362|gb|EDQ67790.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
Length = 924
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 97/141 (68%), Gaps = 1/141 (0%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS-SYLFR 855
+RL F +S IH G+ + A D +IEY GE++R I+DIRE R+ +G+ +Y+FR
Sbjct: 754 QRLVFGKSAIHGMGVFTKQVHYANDMIIEYAGEVVRPVIADIRERRFYDSLVGAGTYMFR 813
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+DD VVDAT+ G IA INHSC PNCY++ ++ G+ +I I+AKR+I GEE+TY+Y+F
Sbjct: 814 IDDERVVDATRAGSIAHLINHSCEPNCYSRTVTASGEDRIIIFAKRNIEIGEELTYDYRF 873
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
+++ + CYCG C GS+N
Sbjct: 874 MSKDEVLTCYCGCAGCRGSVN 894
>gi|27754617|gb|AAO22754.1| putative trithorax protein 1 [Arabidopsis thaliana]
Length = 479
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 90/234 (38%), Positives = 129/234 (55%), Gaps = 13/234 (5%)
Query: 709 PRSDGCARS---SISGWEWHKW--SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
P GCAR+ + G K +L A+ ++R V YV Y E + + +G
Sbjct: 229 PNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGGYSRLEFSTYKSIHGS 288
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
+S N+L+ AE ++ + +KRL F +S IH +G+ A P A D +
Sbjct: 289 KVSQMNTP---SNILSMAEKYRYMRETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMM 341
Query: 824 IEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
IEY GEL+R I+D RE Y M +Y+FR+DD V+DAT+ G IA INHSC PNC
Sbjct: 342 IEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRTGSIAHLINHSCVPNC 401
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y++VI+V G + I I+AKRHI EE+TY+Y+F +++ C CG C G +N
Sbjct: 402 YSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGCRGVVN 455
>gi|429329891|gb|AFZ81650.1| hypothetical protein BEWA_010670 [Babesia equi]
Length = 3609
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 109/175 (62%), Gaps = 5/175 (2%)
Query: 767 ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
+RTNR + R + A + +KRL + S IH GL E I A + V+EY
Sbjct: 3435 SRTNRTRSRKQYDMSNNAMQYRYLISLPPEKRLDVKPSAIHGLGLFTTEDITAGEPVVEY 3494
Query: 827 VGELIRLKISDIRECRYEKM--GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
VGELIR ISD RE Y + G GS Y+FRLDD +VDAT++G ++RFINHSC+PNC
Sbjct: 3495 VGELIRDIISDKREEIYSESQGGDGSCYMFRLDDELIVDATRKGNMSRFINHSCDPNCLC 3554
Query: 885 KVISVE-GQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
++I+ E G K I ++AK + AG+E+TY+Y+F +E +K+ C CG+ C G +N
Sbjct: 3555 RIITCEYGLKHIVVFAKSDLKAGDEVTYDYQFGVESETRKLQCLCGAPNCLGRMN 3609
>gi|383848022|ref|XP_003699651.1| PREDICTED: uncharacterized protein LOC100881339 [Megachile rotundata]
Length = 4805
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++++ RE +Y
Sbjct: 4652 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 4711
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC +++ VE +I I+AKR I
Sbjct: 4712 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 4770
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 4771 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 4805
>gi|328776663|ref|XP_394941.4| PREDICTED: hypothetical protein LOC411466 [Apis mellifera]
Length = 4678
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++++ RE +Y
Sbjct: 4525 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 4584
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC +++ VE +I I+AKR I
Sbjct: 4585 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 4643
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 4644 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 4678
>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
Length = 5713
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++++ RE +Y
Sbjct: 5560 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 5619
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC +++ VE +I I+AKR I
Sbjct: 5620 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 5678
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 5679 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5713
>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
Length = 5622
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++++ RE +Y
Sbjct: 5469 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 5528
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC +++ VE +I I+AKR I
Sbjct: 5529 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 5587
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 5588 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5622
>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
Length = 5619
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++++ RE +Y
Sbjct: 5466 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 5525
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC +++ VE +I I+AKR I
Sbjct: 5526 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 5584
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 5585 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5619
>gi|168050289|ref|XP_001777592.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671077|gb|EDQ57635.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 907
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/158 (46%), Positives = 103/158 (65%), Gaps = 8/158 (5%)
Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
E LK Q K KK + F +S IH WGL + I+ + VIEY GE +R ++D+RE RY
Sbjct: 752 ERLKFLQ-KTEKKTVCFGKSAIHGWGLFSRRAIQEGEMVIEYRGERVRGSVADLREIRYH 810
Query: 845 KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-----EGQKKIFIYA 899
K G YLF++++ V+DAT +G IAR INHSC+P+CY K++ EG +I + A
Sbjct: 811 KEG-KDCYLFKINEEIVIDATDKGNIARLINHSCDPSCYAKILDFQRDDGEGDSRIVLIA 869
Query: 900 KRHIAAGEEITYNYKFPLEE-KKIPCYCGSKKCHGSLN 936
+++IAAGEE+TYNY+F E+ +K+PC CGS C +N
Sbjct: 870 RKYIAAGEELTYNYRFDEEDTQKVPCLCGSSNCRKYMN 907
>gi|168057166|ref|XP_001780587.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
gi|162667953|gb|EDQ54570.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
Length = 902
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 95/141 (67%), Gaps = 1/141 (0%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFR 855
+RL F +S IH G+ + A D +IEY GE++R I+DIRE R Y+ + +Y+FR
Sbjct: 732 QRLAFGKSAIHGMGVFTKQVHYANDMIIEYAGEVVRPVIADIRERRCYDSLVGAGTYMFR 791
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+DD VVDAT G IA INHSC PNCY++ ++ G+ +I I+AKR+I GEE+TY+Y+F
Sbjct: 792 IDDERVVDATHAGTIAHLINHSCEPNCYSRTVTASGEDRIIIFAKRNIEVGEELTYDYRF 851
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
+++ + CYCG C GS+N
Sbjct: 852 MSKDEVLTCYCGCAGCRGSVN 872
>gi|345490044|ref|XP_001603865.2| PREDICTED: LOW QUALITY PROTEIN: hypothetical protein LOC100120205
[Nasonia vitripennis]
Length = 5138
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE++R +++DIRE +Y
Sbjct: 4985 KSSQYKKMKQDWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEIVRNELADIREKQY 5044
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC + + VE + ++ I+AKR I
Sbjct: 5045 EAKNRGI-YMFRLDENRVVDATLCGGLARYINHSCNPNCVVENVEVERKLRLIIFAKRRI 5103
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 5104 LRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5138
>gi|242016925|ref|XP_002428945.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
gi|212513774|gb|EEB16207.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
Length = 6073
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 98/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K R RSKI GL A +E VIEY+GE+IR ++S+ RE +Y
Sbjct: 5920 KSSQYKKMKLEWRNNVYLARSKIQGLGLYAARDLERHTMVIEYIGEIIRHELSESREKQY 5979
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ V+DAT GG+AR+INHSCNPNC +++ V+ +I I+AKR I
Sbjct: 5980 EARNRGI-YMFRLDEERVIDATICGGLARYINHSCNPNCVAEIVEVDRDLRIIIFAKRRI 6038
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
GEE+ Y+YKF +E + KIPC CG+ C +N
Sbjct: 6039 TRGEELAYDYKFDIEDDQHKIPCLCGAPNCRKWMN 6073
>gi|355702679|gb|AES02012.1| myeloid/lymphoid or mixed-lineage leukemia [Mustela putorius furo]
Length = 374
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/130 (54%), Positives = 90/130 (69%), Gaps = 3/130 (2%)
Query: 801 FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGY 860
+ RS IH GL I+A + VIEY G +IR ++D RE Y+ GIG Y+FR+DD
Sbjct: 246 YSRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-YMFRMDDFD 304
Query: 861 VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE- 919
VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I GEE+TY+YKFP+E+
Sbjct: 305 VVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDA 364
Query: 920 -KKIPCYCGS 928
K+PC CG+
Sbjct: 365 SNKLPCNCGA 374
>gi|198413251|ref|XP_002126315.1| PREDICTED: transcription factor protein, partial [Ciona intestinalis]
Length = 1130
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ + K+ R RS+I GL A IE VIEY+G++IR ++++ RE Y
Sbjct: 977 KSSQYRRMKQEWRSNVYLARSRIQGLGLFASRDIEPGTMVIEYIGDIIRSEVAEKREKNY 1036
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD Y+VDAT GG AR+INHSCNPNC +V++ E +KKI I + RHI
Sbjct: 1037 EAANRGV-YMFRLDSDYIVDATVTGGPARYINHSCNPNCVAEVVNFEKEKKIMIISNRHI 1095
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
+GEE+ Y+YKF E++ KIPC CG+ C +N
Sbjct: 1096 LSGEELNYDYKFDFEDEGNKIPCLCGAINCRKWMN 1130
>gi|302800676|ref|XP_002982095.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
gi|300150111|gb|EFJ16763.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
Length = 1045
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 65/139 (46%), Positives = 95/139 (68%), Gaps = 1/139 (0%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS-SYLFRL 856
RL F +S IH WG+ A + +A D + EY GE++R I+DIRE R+ +G+ +Y+FR+
Sbjct: 882 RLTFGKSGIHGWGVFAKQIHKAGDMMAEYAGEIVRSNIADIREQRHYDSLVGAGTYMFRI 941
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP 916
DD VVDAT G +A INHSC PNCY+++I+V+ + I I+AKR I EE+TY+Y+F
Sbjct: 942 DDERVVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFA 1001
Query: 917 LEEKKIPCYCGSKKCHGSL 935
+ ++ C CG+ KC GS+
Sbjct: 1002 SKGAELVCNCGALKCRGSV 1020
>gi|219121873|ref|XP_002181282.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407268|gb|EEC47205.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 141
Score = 143 bits (360), Expect = 6e-31, Method: Composition-based stats.
Identities = 65/117 (55%), Positives = 88/117 (75%), Gaps = 3/117 (2%)
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
VIEY GELI +++ R+ +Y+ IGS Y+FR+D V DATK+G +ARF+N SCNPNC
Sbjct: 2 VIEYRGELIGNAVAEKRDKQYDASKIGSDYMFRIDGSGVCDATKQGNVARFVNASCNPNC 61
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGSLN 936
YTK+I+++G K+I IYAKR I GEE++Y+YKFPLE K++ CYCG+K C G +N
Sbjct: 62 YTKIITLDGIKRIVIYAKRDILPGEELSYDYKFPLERIEAKRVSCYCGAKDCRGYMN 118
>gi|328875267|gb|EGG23632.1| hypothetical protein DFA_05766 [Dictyostelium fasciculatum]
Length = 1603
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 87/250 (34%), Positives = 130/250 (52%), Gaps = 30/250 (12%)
Query: 712 DGCARSSISGWEWHKWSLNASPA----------ERARVRG---------------AQYVH 746
DGCAR+++ + + + +PA ER R++ + +
Sbjct: 1359 DGCARTTVFNRKTPIQTTSGNPADTERERKIMIERERLKSEIISFKTRERTIALLTKKIQ 1418
Query: 747 TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
+ Y ++N K + + + N L + LL +K R+ + S I
Sbjct: 1419 SVYSMEKINQIGIPGLKDVLGFSTNSGIHNHLLSDNAQYLL----LKGEDSRVVVKPSSI 1474
Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
H G A + I A +F+IEYVGE+IR KI+D RE +Y+ G+ ++F ++ +VDAT
Sbjct: 1475 HSLGAFAKKSIVAGEFIIEYVGEIIRQKIADERERKYQNDGV-DCFMFAIEKDVIVDATF 1533
Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
+G ARF NHSC PN TK+IS++G KKI I A + IA EEITY+Y+FP E+ KI C C
Sbjct: 1534 KGNRARFANHSCEPNAKTKIISIDGVKKIIIVASKDIAKNEEITYDYQFPREKMKIKCLC 1593
Query: 927 GSKKCHGSLN 936
S +C G LN
Sbjct: 1594 QSTRCKGYLN 1603
>gi|321469515|gb|EFX80495.1| hypothetical protein DAPPUDRAFT_51465 [Daphnia pulex]
Length = 927
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 74/154 (48%), Positives = 98/154 (63%), Gaps = 6/154 (3%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A IE VIEY+GE+IR ++++ RE RY
Sbjct: 775 KSSQYKKMKQEWRHNIYLARSKIQGLGLYAARDIEKHTMVIEYIGEMIRAELAECREKRY 834
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ V+DAT GG+AR+INHSC PNC + + VE +I I+A R I
Sbjct: 835 EAANRGI-YMFRLDEQRVIDATLCGGLARYINHSCGPNCVAEAVEVERDLRIIIFASRRI 893
Query: 904 AAGEEITYNYKFPLE-EKKIPCYCGSKKCHGSLN 936
A GEE++Y+YKF +E + KIPC CG+ C +N
Sbjct: 894 ARGEELSYDYKFDIEDDHKIPCLCGAASCRKWMN 927
>gi|302821061|ref|XP_002992195.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
gi|300139962|gb|EFJ06692.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
Length = 1052
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 65/139 (46%), Positives = 95/139 (68%), Gaps = 1/139 (0%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS-SYLFRL 856
RL F +S IH WG+ A + +A D + EY GE++R I+DIRE R+ +G+ +Y+FR+
Sbjct: 889 RLTFGKSGIHGWGVFAKQIHKAGDMMAEYAGEIVRSNIADIRERRHYDSLVGAGTYMFRI 948
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP 916
DD VVDAT G +A INHSC PNCY+++I+V+ + I I+AKR I EE+TY+Y+F
Sbjct: 949 DDERVVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFA 1008
Query: 917 LEEKKIPCYCGSKKCHGSL 935
+ ++ C CG+ KC GS+
Sbjct: 1009 SKGAELVCNCGALKCRGSV 1027
>gi|70568862|dbj|BAE06306.1| transcription factor protein [Ciona intestinalis]
Length = 589
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ + K+ R RS+I GL A IE VIEY+G++IR ++++ RE Y
Sbjct: 436 KSSQYRRMKQEWRSNVYLARSRIQGLGLFASRDIEPGTMVIEYIGDIIRSEVAEKREKNY 495
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD Y+VDAT GG AR+INHSCNPNC +V++ E +KKI I + RHI
Sbjct: 496 EAANRGV-YMFRLDSDYIVDATVTGGPARYINHSCNPNCVAEVVNFEKEKKIMIISNRHI 554
Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
+GEE+ Y+YKF E++ KIPC CG+ C +N
Sbjct: 555 LSGEELNYDYKFDFEDEGNKIPCLCGAINCRKWMN 589
>gi|428171302|gb|EKX40220.1| hypothetical protein GUITHDRAFT_75734 [Guillardia theta CCMP2712]
Length = 156
Score = 142 bits (358), Expect = 8e-31, Method: Composition-based stats.
Identities = 67/161 (41%), Positives = 100/161 (62%), Gaps = 16/161 (9%)
Query: 787 LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
+KA+Q +RL RS I +G+ A +PI + V+EY GE IR ++D+RE +YE
Sbjct: 1 MKANQ----NRRLVLGRSGIEGYGIFARQPIFKGEMVVEYTGEAIRPVVADVREEQYEAA 56
Query: 847 GIGSSYLFRLDD-----------GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKI 895
G+G+ Y +RL++ +VDAT R I +INH C+PNC K++ + GQ++I
Sbjct: 57 GLGT-YFWRLEEYLGAGEPPEGRAAIVDATIRHNIGHYINHCCDPNCEAKILKINGQRRI 115
Query: 896 FIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
I A + GEE+TY+YK P E+KKIPC+CG+ C G++N
Sbjct: 116 IISAIHDVQFGEELTYDYKLPFEDKKIPCHCGAPTCRGTMN 156
>gi|218201707|gb|EEC84134.1| hypothetical protein OsI_30479 [Oryza sativa Indica Group]
Length = 1057
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 73/175 (41%), Positives = 101/175 (57%), Gaps = 34/175 (19%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK---------- 845
++RL F +S+IH +G+ A +A D +IEY+GEL+R ISDIRE R
Sbjct: 862 RRRLAFGKSRIHGFGVFAKVSHKAGDMMIEYIGELVRPPISDIRERRIYNSLVPEGYMWK 921
Query: 846 -----------------MGI-------GSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
MG+ +Y+FR+DD V+DAT+ G IA INHSC PN
Sbjct: 922 TLVKLIGLFMEFNLKALMGVIITTFLGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 981
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY++VISV G + I I+AKR I EE+TY+Y+F ++++PCYCG KC G +N
Sbjct: 982 CYSRVISVLGDEHIIIFAKRDINPWEELTYDYRFVSSDQRLPCYCGFPKCRGVVN 1036
>gi|222641096|gb|EEE69228.1| hypothetical protein OsJ_28463 [Oryza sativa Japonica Group]
Length = 1057
Score = 142 bits (358), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 73/175 (41%), Positives = 101/175 (57%), Gaps = 34/175 (19%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK---------- 845
++RL F +S+IH +G+ A +A D +IEY+GEL+R ISDIRE R
Sbjct: 862 RRRLAFGKSRIHGFGVFAKVSHKAGDMMIEYIGELVRPPISDIRERRIYNSLVPEGYMWK 921
Query: 846 -----------------MGI-------GSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
MG+ +Y+FR+DD V+DAT+ G IA INHSC PN
Sbjct: 922 TLVKLIGLFMEFNLKALMGVIITTFLGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 981
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
CY++VISV G + I I+AKR I EE+TY+Y+F ++++PCYCG KC G +N
Sbjct: 982 CYSRVISVLGDEHIIIFAKRDINPWEELTYDYRFVSSDQRLPCYCGFPKCRGVVN 1036
>gi|209877148|ref|XP_002140016.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209555622|gb|EEA05667.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 2678
Score = 142 bits (358), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 74/145 (51%), Positives = 98/145 (67%), Gaps = 5/145 (3%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGI--GSSYLF 854
KRL ++SKIH +GL A E I + ++EYVGE+IR ++D RE YE+ G GS Y+F
Sbjct: 2534 KRLDVKKSKIHGYGLFAKECIHPNEPIVEYVGEVIRNSVADKREKIYEQEGQRDGSCYMF 2593
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVI-SVEGQKKIFIYAKRHIAAGEEITYNY 913
RLD+ V+DAT G +ARF+NH C PNC KVI S +G K I I++K I EE+TY+Y
Sbjct: 2594 RLDEHRVIDATNCGNLARFMNHCCQPNCVCKVITSDDGNKHIVIFSKTEIKPDEEVTYDY 2653
Query: 914 KFPLEE--KKIPCYCGSKKCHGSLN 936
+F LEE +KI CYCG+ C G +N
Sbjct: 2654 QFNLEEESEKIYCYCGAPNCLGRMN 2678
>gi|312384477|gb|EFR29200.1| hypothetical protein AND_02075 [Anopheles darlingi]
Length = 5086
Score = 142 bits (358), Expect = 1e-30, Method: Composition-based stats.
Identities = 74/155 (47%), Positives = 101/155 (65%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++S++RE +Y
Sbjct: 4933 KSSQYKKMKQEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSELREKQY 4992
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC T+ + V+ + +I I+AKR I
Sbjct: 4993 EAKNRGI-YMFRLDEERVVDATLSGGLARYINHSCNPNCVTETVEVDRELRIIIFAKRRI 5051
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE++Y+YKF +E+ KI C CG+ C +N
Sbjct: 5052 NRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 5086
>gi|449512913|ref|XP_004175517.1| PREDICTED: histone-lysine N-methyltransferase SETD1A-like
[Taeniopygia guttata]
Length = 99
Score = 142 bits (357), Expect = 1e-30, Method: Composition-based stats.
Identities = 61/99 (61%), Positives = 77/99 (77%)
Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFI 897
+RE RY + GIGSSYLFR+D ++DATK G +ARFINH C PNCY KVI++E QKKI I
Sbjct: 1 MREKRYAQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIEAQKKIVI 60
Query: 898 YAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y+K+ I EEITY+YKFP+E+ KIPC C ++ C G+LN
Sbjct: 61 YSKQPIGVNEEITYDYKFPIEDTKIPCLCRTESCRGTLN 99
>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
Length = 5215
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 97/155 (62%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++++ RE +Y
Sbjct: 5062 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKKY 5121
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ VVDAT GG+AR+INHSCNPNC + + V+ +I I+AKR I
Sbjct: 5122 EAKNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAETVEVDRDYRIIIFAKRRI 5180
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 5181 QRGEELAYDYKFDIEDDQHKISCMCGAPNCRKWMN 5215
>gi|159488034|ref|XP_001702027.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158271484|gb|EDO97302.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 2487
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/132 (49%), Positives = 87/132 (65%), Gaps = 5/132 (3%)
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
D GL A+EPIEA +FVIEY G +R + D+R+ +Y+ G Y+F +D +VVDAT
Sbjct: 2358 DPGLFAVEPIEAGEFVIEYCGVRLRKPLDDVRQRQYDAAGY-MDYMFAVDGAWVVDATLA 2416
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF---PLEEKKIPC 924
G ARFINH C+PNC +++ V G+KKI IY R + AGEE+ Y+YK P E +IPC
Sbjct: 2417 GNAARFINHCCDPNCVARIVEVGGEKKIAIYTLRRVEAGEELAYDYKLSCVPGAE-RIPC 2475
Query: 925 YCGSKKCHGSLN 936
CG+K C G +N
Sbjct: 2476 GCGAKNCRGFMN 2487
>gi|294948371|ref|XP_002785717.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus ATCC
50983]
gi|239899765|gb|EER17513.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus ATCC
50983]
Length = 1340
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 97/158 (61%), Gaps = 20/158 (12%)
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK-MGIGSS-YLFRL 856
L +RS+IH++GL A D V+EY GE++R ++D RE YE+ M +G Y+FRL
Sbjct: 1183 LEVKRSRIHNYGLFAKVRFAKGDMVVEYAGEIVRHSVADCRERYYEEEMLMGQCCYMFRL 1242
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKV------------ISVEG------QKKIFIY 898
D+ YVVDAT RG ARFINHSCNPNC KV IS G +K I I
Sbjct: 1243 DEHYVVDATLRGNTARFINHSCNPNCVCKVVEDPVPLASEDDISFRGVVRASHKKHIMIV 1302
Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
AK I+AGEEITY+Y+F +E +K+ C CG+ C G LN
Sbjct: 1303 AKNDISAGEEITYDYQFAVESEKLACKCGAPNCLGRLN 1340
>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
Length = 3474
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ K K+ R RSKI GL A +E VIEY+GE+IR ++++ RE +Y
Sbjct: 3321 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSELAETREKQY 3380
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ V+DAT GG+AR+INHSCNPNC +++ VE +I I+AKR I
Sbjct: 3381 EARNRGI-YMFRLDEERVIDATLCGGLARYINHSCNPNCVAEIVEVERDFRIIIFAKRRI 3439
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ GEE+ Y+YKF +E + KI C CG+ C +N
Sbjct: 3440 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 3474
>gi|145344711|ref|XP_001416870.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577096|gb|ABO95163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1782
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 112/203 (55%), Gaps = 13/203 (6%)
Query: 740 RGA---QYVHTKYLGPE----VNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQV 792
RGA + + +K PE V A + A G A ++V ++ L E E + AS
Sbjct: 1585 RGAFKRELIASKKYTPEEFAAVQAERKARGVN-GASDDKVVEQHGLTMREKLEQMTASY- 1642
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
RL F +S IH WGLVA +A V ++ GE R ++D+RE YE G+ Y
Sbjct: 1643 ---SDRLTFAKSNIHGWGLVAKVFHKAGSIVTQFKGETCRSTVADLRETFYEDNGV-DCY 1698
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
L + DD VVD T +G ARF NHSCNPN Y+K++ V+ I +A+ + GEE+TYN
Sbjct: 1699 LLKQDDDTVVDCTFQGNFARFTNHSCNPNMYSKIVKVDDANHIIFFARTDVRPGEELTYN 1758
Query: 913 YKFPLEEKKIPCYCGSKKCHGSL 935
Y+F E+ K+PCYCG+ C G L
Sbjct: 1759 YRFESEDGKVPCYCGADNCRGYL 1781
>gi|311788758|gb|ADQ12920.1| trithorax [Solanum lycopersicum]
Length = 873
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/170 (41%), Positives = 106/170 (62%), Gaps = 5/170 (2%)
Query: 768 RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
R +++ R++++ E +K + +RL F +S IH +G+ A P +A D VIEY
Sbjct: 684 RCSQLTSRSIVSMVEKYNYMK----ETLGQRLAFGKSGIHGFGIFAKLPQKAGDMVIEYT 739
Query: 828 GELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
GEL+R I+D RE Y + +Y+FR+DD V+DAT+ G IA INHSC PNCY++V
Sbjct: 740 GELVRPPIADRREHLIYNSLVGAGTYMFRIDDQRVIDATRAGSIAHLINHSCEPNCYSRV 799
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
ISV I I++KR I EE+TY+Y+F ++++ CYCG +C G +N
Sbjct: 800 ISVNSIDHIIIFSKRDIEQWEELTYDYRFLSIDEQLACYCGFPRCRGVVN 849
>gi|294868764|ref|XP_002765683.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
ATCC 50983]
gi|239865762|gb|EEQ98400.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
ATCC 50983]
Length = 643
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 97/158 (61%), Gaps = 20/158 (12%)
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK-MGIGSS-YLFRL 856
L +RS+IH++GL A D V+EY GE++R I+D RE YE+ M +G Y+FRL
Sbjct: 486 LEVKRSRIHNYGLFAKVRFAKGDMVVEYAGEIVRHSIADCRERYYEEEMLMGQCCYMFRL 545
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKV------------ISVEG------QKKIFIY 898
D+ YVVDAT RG ARFINHSCNPNC KV IS G +K I I
Sbjct: 546 DEHYVVDATLRGNTARFINHSCNPNCVCKVVEDPVPLASEDDISFRGVVRASHKKHIMIA 605
Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
AK I+AGEEITY+Y+F +E +K+ C CG+ C G LN
Sbjct: 606 AKNDISAGEEITYDYQFAVESEKLACKCGAPNCLGRLN 643
>gi|133902336|gb|ABO41859.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4137
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 68/127 (53%), Positives = 86/127 (67%), Gaps = 1/127 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
KA + + RS IH GL + IE + VIEY G +IR ++D RE Y+ GIG Y
Sbjct: 4007 KASRDAVGAYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDKGIGC-Y 4065
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT G ARFINHSC PNCY+ V++V+GQK I I+A R I GEE+TY+
Sbjct: 4066 MFRIDDYEVVDATIHGNSARFINHSCEPNCYSHVVNVDGQKHIVIFATRRIYKGEELTYD 4125
Query: 913 YKFPLEE 919
YKFP+EE
Sbjct: 4126 YKFPIEE 4132
>gi|340501484|gb|EGR28266.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 956
Score = 140 bits (353), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/150 (46%), Positives = 93/150 (62%), Gaps = 2/150 (1%)
Query: 789 ASQVKARKKRLRFQR--SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
A+ + +KK L SKIH +GL A + +D VIEY+GE IR ++D RE Y +
Sbjct: 807 ANYIADKKKHLNLYAGPSKIHKYGLFAKKHFLQDDIVIEYLGETIRQVLADYREKIYNQK 866
Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
G G Y+F+ ++DAT +G ARF+NHSC PNC ++VI E KI IYAK I G
Sbjct: 867 GFGDCYMFKAGPNKIIDATFKGNEARFLNHSCQPNCASQVIEYEKDTKIIIYAKTEIQPG 926
Query: 907 EEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
EE+TY+Y F +EE+K+ C CG KC G LN
Sbjct: 927 EELTYDYCFDIEEEKLVCNCGDPKCTGKLN 956
>gi|123454343|ref|XP_001314927.1| SET domain containing protein [Trichomonas vaginalis G3]
gi|121897588|gb|EAY02704.1| SET domain containing protein [Trichomonas vaginalis G3]
Length = 486
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 66/140 (47%), Positives = 94/140 (67%), Gaps = 2/140 (1%)
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
+RF++SKI +G+ PI + VIEY+GE+IR I+D R+ YEKMG +Y+F+ D
Sbjct: 293 IRFEKSKIEGFGVKTTIPIRKGEKVIEYIGEVIRPIIADKRQINYEKMGNHGTYVFKADS 352
Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL- 917
+ +DAT RGGIAR+INHSC+PNC +++I + G+ + + A + I EE+TY+YK P
Sbjct: 353 DHYLDATFRGGIARWINHSCDPNCESRIIKLNGRFAVVLVAIKDINPCEELTYDYKLPYE 412
Query: 918 -EEKKIPCYCGSKKCHGSLN 936
E+K I C CGS C G LN
Sbjct: 413 PEDKAIKCLCGSPNCRGWLN 432
>gi|221057732|ref|XP_002261374.1| SET-domain protein [Plasmodium knowlesi strain H]
gi|194247379|emb|CAQ40779.1| SET-domain protein, putative [Plasmodium knowlesi strain H]
Length = 6442
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 71/142 (50%), Positives = 95/142 (66%), Gaps = 4/142 (2%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RL ++S IH +GL E I + VIEY+GE IR ISD RE Y+K+ S Y+FRL+
Sbjct: 6302 RLYVKKSSIHGYGLYTCEFINEGEPVIEYIGEYIRNIISDKREKYYDKIE-SSCYMFRLN 6360
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
+ ++DATK G ++RFINHSC PNC+ K++S + K I I+AKR I A EEITY+Y+F
Sbjct: 6361 ENIIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIVAHEEITYDYQFG 6420
Query: 917 LEE--KKIPCYCGSKKCHGSLN 936
+E KK+ C CGS C G +N
Sbjct: 6421 VESEGKKLICLCGSSTCLGRMN 6442
>gi|168027193|ref|XP_001766115.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
gi|162682758|gb|EDQ69174.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
Length = 514
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 71/186 (38%), Positives = 109/186 (58%), Gaps = 5/186 (2%)
Query: 752 PEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGL 811
P + + NG L + ++L+ ++ +K S +RL F +S IH G+
Sbjct: 303 PPPKVAGYTNGGSLWSLHWEASKGSILSMSDKFRRMKGSL----SQRLVFGKSAIHGMGV 358
Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS-SYLFRLDDGYVVDATKRGGI 870
A D +IEY GE++R ++D RE R+ +G+ +Y+FR+DD VVDAT G I
Sbjct: 359 FTKRVHYANDMIIEYAGEVVRPVVADSRERRHYDSLVGAGTYMFRIDDERVVDATTTGSI 418
Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
A INHSC PNCY++ ++ G+ +I I+AKR + GEE+TY+Y+F + + + CYCGS
Sbjct: 419 AHLINHSCEPNCYSRTVTASGEDRIIIFAKRDLEIGEELTYDYRFMSKTEVLTCYCGSAG 478
Query: 931 CHGSLN 936
C GS+N
Sbjct: 479 CRGSVN 484
>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3691
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 68/143 (47%), Positives = 94/143 (65%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 3550 KSNVYLARSRIQGLGLYAARDIEKCTMVIEYIGTIIRSEVANRKERLYESQNRGV-YMFR 3608
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ YV+DAT GG AR+INHSC+PNC T+V+SVE + KI I + R I GEE++Y+YKF
Sbjct: 3609 IDNDYVIDATITGGPARYINHSCSPNCITEVVSVEKENKIIISSCRRIQRGEELSYDYKF 3668
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
LE + KIPC+CG+ C +N
Sbjct: 3669 DLEDDQHKIPCHCGAVNCRKWMN 3691
>gi|401409540|ref|XP_003884218.1| Multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus, related [Neospora caninum
Liverpool]
gi|325118636|emb|CBZ54187.1| Multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus, related [Neospora caninum
Liverpool]
Length = 6755
Score = 139 bits (350), Expect = 6e-30, Method: Composition-based stats.
Identities = 73/145 (50%), Positives = 97/145 (66%), Gaps = 6/145 (4%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG---IGSSYLF 854
RL + S IH GL A P+ + VIEYVG+++R +SD+RE YEK G G+ Y+F
Sbjct: 6611 RLSVRSSTIHGQGLFANVPLAEGEPVIEYVGDMVRNCVSDLREALYEKQGGGGDGACYMF 6670
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
RLDD +VVDAT+ G ++RFINHSC PNC +++ E GQK I I AK I AGEEITY+Y
Sbjct: 6671 RLDDNFVVDATRAGNVSRFINHSCEPNCTCRILVCEAGQKHIVIIAKTAIRAGEEITYDY 6730
Query: 914 KFPL--EEKKIPCYCGSKKCHGSLN 936
+F + E K+ C CG++ C G +N
Sbjct: 6731 QFGIGNETDKLACLCGARTCLGRMN 6755
>gi|156101223|ref|XP_001616305.1| SET domain containing protein [Plasmodium vivax Sal-1]
gi|148805179|gb|EDL46578.1| SET domain containing protein [Plasmodium vivax]
Length = 6587
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 71/142 (50%), Positives = 95/142 (66%), Gaps = 4/142 (2%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RL ++S IH +GL E I + VIEY+GE IR ISD RE Y+K+ S Y+FRL+
Sbjct: 6447 RLYVKKSSIHGYGLYTCEFINEGEPVIEYIGEYIRNIISDKREKYYDKIE-SSCYMFRLN 6505
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
+ ++DATK G ++RFINHSC PNC+ K++S + K I I+AKR I A EEITY+Y+F
Sbjct: 6506 ENIIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIVAHEEITYDYQFG 6565
Query: 917 LEE--KKIPCYCGSKKCHGSLN 936
+E KK+ C CGS C G +N
Sbjct: 6566 VESEGKKLICLCGSSTCLGRMN 6587
>gi|449015700|dbj|BAM79102.1| similar to Trithorax protein [Cyanidioschyzon merolae strain 10D]
Length = 1910
Score = 139 bits (350), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 71/170 (41%), Positives = 104/170 (61%), Gaps = 6/170 (3%)
Query: 771 RVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGEL 830
R R + A A L + Q++ RK + RS+IH +GL A ++A+ ++EY GE
Sbjct: 1743 RDAWRVVSATATDRRLYRQMQLEHRKYCVPL-RSRIHGFGLFARCDLQADQMIVEYAGER 1801
Query: 831 IRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE 890
I ++D+RE YE+ GIG Y+FR+D +V+DAT G +AR+INHSC PNCY++++ +
Sbjct: 1802 IASPVADVRERHYERRGIGC-YMFRIDADWVIDATMIGSVARYINHSCRPNCYSEILRIS 1860
Query: 891 GQKK---IFIYAKRHIAAGEEITYNYKFPLEEK-KIPCYCGSKKCHGSLN 936
+ K I I + R I GEE+TYNY F ++ KIPC CG C G +N
Sbjct: 1861 DEPKQDVIVIRSARAIKRGEELTYNYMFDVDNAMKIPCLCGEPGCVGFMN 1910
>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4455
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 75/165 (45%), Positives = 104/165 (63%), Gaps = 7/165 (4%)
Query: 778 LAAAEGAELLKASQ---VKAR-KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRL 833
+AAA KA+Q +KA K + RS+I GL A IE VIEY+G +IR
Sbjct: 4292 VAAAHQGRHSKAAQYRRMKAEWKSNVYLARSRIQGLGLYAARDIEKCTMVIEYIGAIIRS 4351
Query: 834 KISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK 893
++++ +E YE G Y+FR+D+ YV+DAT GG AR+INHSC PNC T+V++VE +
Sbjct: 4352 EVANRKERLYESQNRGV-YMFRIDNDYVIDATITGGPARYINHSCAPNCITEVVNVEKEN 4410
Query: 894 KIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
KI I + R I GEE++Y+YKF LE + KIPC+CG+ C +N
Sbjct: 4411 KIIISSCRRIQRGEELSYDYKFDLEDDQHKIPCHCGAVNCRKWMN 4455
>gi|302846845|ref|XP_002954958.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
gi|300259721|gb|EFJ43946.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
Length = 128
Score = 139 bits (349), Expect = 1e-29, Method: Composition-based stats.
Identities = 64/130 (49%), Positives = 89/130 (68%), Gaps = 5/130 (3%)
Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
GL A+EPIE +FVIEY G+LIR + D+R+ +Y+ G + Y+F +DD +VVDAT G
Sbjct: 1 GLFAVEPIEPSEFVIEYCGQLIRKPLDDVRQRQYDAAGY-ADYMFAVDDNWVVDATLAGN 59
Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF---PLEEKKIPCYC 926
ARFINH C+PNC +++ V G+K+I IY R + AGEE+ Y+YK P E +IPC C
Sbjct: 60 AARFINHCCDPNCVARIVEVGGEKRIAIYTLRRVEAGEELHYDYKLSSVPGAE-RIPCGC 118
Query: 927 GSKKCHGSLN 936
G++ C G +N
Sbjct: 119 GARNCKGFMN 128
>gi|67586322|ref|XP_665182.1| SET-domain protein [Cryptosporidium hominis TU502]
gi|54655690|gb|EAL34953.1| SET-domain protein [Cryptosporidium hominis]
Length = 209
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/176 (43%), Positives = 112/176 (63%), Gaps = 9/176 (5%)
Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
S++ N+++L ++ +L + + KRL ++S IH +GL A E I+ + +IE
Sbjct: 38 SSKRNKIQLEDMGPT----KLYRYLDLLPYDKRLNIKKSSIHGFGLFAKELIKTGEPIIE 93
Query: 826 YVGELIRLKISDIRECRYEKMG--IGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
YVGELIR ++D RE Y+ G GS Y+FRLD+ V+DAT G ARF+NH C+PN
Sbjct: 94 YVGELIRNSVADKRESLYKSNGNRDGSCYMFRLDESSVIDATNIGNHARFMNHCCDPNSI 153
Query: 884 TKVISVEGQKK-IFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
KVIS++ Q K I I++K+ I EEITY+Y+F +EE +KI C+CG+ C G +N
Sbjct: 154 CKVISIDSQNKHIVIFSKKTINKDEEITYDYQFNVEEASEKIICHCGASNCLGRMN 209
>gi|356554223|ref|XP_003545448.1| PREDICTED: uncharacterized protein LOC100812602 [Glycine max]
Length = 1985
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 95/145 (65%), Gaps = 5/145 (3%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
K L +S+IH GL I + V+EY+GE++ L+++D RE Y+ K+ ++ Y
Sbjct: 1841 KHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKTACY 1900
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
FR+D +++DAT++GGIARF+NHSC PNC KVI+V +KK+ A+R I GEEITY+
Sbjct: 1901 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYD 1960
Query: 913 YKFPLE-EKKIPCYCGSKKCHGSLN 936
Y F E E KIPCYC SK C +N
Sbjct: 1961 YHFNHEDEGKIPCYCNSKNCRRYMN 1985
>gi|296082099|emb|CBI21104.3| unnamed protein product [Vitis vinifera]
Length = 1111
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 95/146 (65%), Gaps = 6/146 (4%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
K L +S IH GL I V+EYVGE++ L+++D RE Y+ K+ ++ Y
Sbjct: 966 KHLVVYKSGIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQSGRKLQYKTACY 1025
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
FR+D +++DAT++GGIARF+NHSC PNC KVISV +KK+ +A+R I GEEITY+
Sbjct: 1026 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEITYD 1085
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
Y F E+ KKIPC+C S+ C LN
Sbjct: 1086 YHFNHEDEGKKIPCFCNSRNCRRYLN 1111
>gi|196004646|ref|XP_002112190.1| hypothetical protein TRIADDRAFT_50258 [Trichoplax adhaerens]
gi|190586089|gb|EDV26157.1| hypothetical protein TRIADDRAFT_50258 [Trichoplax adhaerens]
Length = 152
Score = 137 bits (345), Expect = 3e-29, Method: Composition-based stats.
Identities = 68/141 (48%), Positives = 90/141 (63%), Gaps = 1/141 (0%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K+ + RS IH GL IEA + ++EY G IR ++D +E YE G+G Y+FR
Sbjct: 13 KETVGVYRSDIHGRGLFCKRIIEAGEMIMEYAGTKIRSILTDKQEKYYEGKGLGC-YMFR 71
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D V+DAT G ARFINHSC+PNCY K++ ++ KKI I+A R I GEE+TY+Y F
Sbjct: 72 IDRDEVIDATLSGNAARFINHSCDPNCYCKIVPIDNIKKIIIFALRRIYPGEELTYDYNF 131
Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
E+ K+PC C SKKC LN
Sbjct: 132 AKEDIKLPCNCSSKKCRKFLN 152
>gi|384244737|gb|EIE18235.1| Mll1 protein, partial [Coccomyxa subellipsoidea C-169]
Length = 137
Score = 137 bits (345), Expect = 3e-29, Method: Composition-based stats.
Identities = 61/136 (44%), Positives = 91/136 (66%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
+S IH WGL A + ++A+ VIEY G+++R +D RE RY+ G YLF + D +V+
Sbjct: 3 QSGIHGWGLFARKAMKADSMVIEYRGDVLRRTAADAREKRYQAAG-KDCYLFTVQDDFVI 61
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EK 920
DAT RGGI RF NH C+P+ YTKV+ +EG ++ +A+ + G+E+TY+Y+F E +
Sbjct: 62 DATMRGGIGRFTNHCCSPSMYTKVLEIEGTPRLVFFARFDLVPGQELTYDYRFKEEDGDN 121
Query: 921 KIPCYCGSKKCHGSLN 936
K+PC CG+ C G+LN
Sbjct: 122 KLPCGCGAPNCRGTLN 137
>gi|403271117|ref|XP_003927487.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
boliviensis boliviensis]
Length = 206
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 81/179 (45%), Positives = 108/179 (60%), Gaps = 8/179 (4%)
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQV----KARKKRLRFQRSKIHDWGLVALEPIEA 819
G S + +V+LR+ A EL A Q K K+ + RS IH L I+A
Sbjct: 30 GKSLPSAQVQLRSTRRATS-LELPMAMQFRHLKKTSKESVGVYRSTIHRRALFCKGDIDA 88
Query: 820 EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
+ +I+Y G +IR ++D RE Y+ GI Y+FR+DD V+DAT G ARFINHSC
Sbjct: 89 GEMIIDYSGIVIRSVLTDKREKFYDGKGI-RCYMFRMDDFDVLDATMHGHAARFINHSCE 147
Query: 880 PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
PNC+++VI VEGQK I I+A R I GEE+TY+YKFP+E+ K+PC G K+C LN
Sbjct: 148 PNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNYGGKRCRRFLN 206
>gi|302787360|ref|XP_002975450.1| hypothetical protein SELMODRAFT_174869 [Selaginella moellendorffii]
gi|300157024|gb|EFJ23651.1| hypothetical protein SELMODRAFT_174869 [Selaginella moellendorffii]
Length = 582
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 66/146 (45%), Positives = 96/146 (65%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K R+ F +S IH WGL A I+ + V+EY GE IR ++D+RE RY G Y
Sbjct: 438 KTEKTRVCFGKSGIHGWGLFARRYIKEGEMVVEYRGERIRRSVADLREKRYCLEG-KHCY 496
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK--IFIYAKRHIAAGEEIT 910
LF++ + V+DAT+ G I R INHSC PNCY +++SVEG+ + I + A++ ++ GEE+T
Sbjct: 497 LFKISEEVVIDATENGNIGRLINHSCEPNCYARIVSVEGEGESHIVLIARKDVSVGEELT 556
Query: 911 YNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y+Y+F E+KK+ C CGS +C +N
Sbjct: 557 YDYQFDKEDKKVLCLCGSSRCRRFMN 582
>gi|313239304|emb|CBY14252.1| unnamed protein product [Oikopleura dioica]
Length = 2763
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 69/147 (46%), Positives = 93/147 (63%), Gaps = 3/147 (2%)
Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
V+ + L+ S IH GL A P E + VIEY G +IR ++ D+RE Y+ GIG+
Sbjct: 2618 VETSRNSLQVLPSAIHGRGLFARRPYEPGELVIEYSGTVIRGELCDMREKYYDDRGIGT- 2676
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
Y+FR+DD VVDAT GG ARFINHSC PNC +K+I+V+ +K I I A R I GEE+TY
Sbjct: 2677 YMFRVDDDLVVDATMEGGRARFINHSCGPNCLSKIITVDSRKHICIIAGRFIDFGEELTY 2736
Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
+YKF + ++I C C + C +N
Sbjct: 2737 DYKFDRDFGSERIECGCQAPNCRRFMN 2763
>gi|302761298|ref|XP_002964071.1| hypothetical protein SELMODRAFT_142359 [Selaginella moellendorffii]
gi|300167800|gb|EFJ34404.1| hypothetical protein SELMODRAFT_142359 [Selaginella moellendorffii]
Length = 564
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 66/146 (45%), Positives = 96/146 (65%), Gaps = 3/146 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K R+ F +S IH WGL A I+ + V+EY GE IR ++D+RE RY G Y
Sbjct: 420 KTEKTRVCFGKSGIHGWGLFARRYIKEGEMVVEYRGERIRRSVADLREKRYCLEG-KHCY 478
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK--IFIYAKRHIAAGEEIT 910
LF++ + V+DAT+ G I R INHSC PNCY +++SVEG+ + I + A++ ++ GEE+T
Sbjct: 479 LFKISEEVVIDATENGNIGRLINHSCEPNCYARIVSVEGEGESHIVLIARKDVSVGEELT 538
Query: 911 YNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y+Y+F E+KK+ C CGS +C +N
Sbjct: 539 YDYQFDKEDKKVLCLCGSSRCRRFMN 564
>gi|449445828|ref|XP_004140674.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
gi|449487413|ref|XP_004157614.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
Length = 1055
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/167 (41%), Positives = 104/167 (62%), Gaps = 13/167 (7%)
Query: 782 EGAELLKASQVKAR--------KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRL 833
EGA+ S K R K R+ F +S IH WGL A ++ + V+EY GE +R
Sbjct: 890 EGADPKSFSSFKERLQYLQRTEKDRVCFGKSGIHGWGLFARRNVQEGEMVVEYRGEQVRR 949
Query: 834 KISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQ 892
++D+RE RY+ G YLF++ + V+DAT++G IAR INHSC PNCY +++SV + +
Sbjct: 950 SVADLREARYQLEG-KDCYLFKISEEVVIDATEKGNIARLINHSCMPNCYARIMSVGDNE 1008
Query: 893 KKIFIYAKRHIAAGEEITYNYKF---PLEEKKIPCYCGSKKCHGSLN 936
+I + AK ++AAGEE+TY+Y F L+E K+PC+C + C +N
Sbjct: 1009 SRIVLIAKTNVAAGEELTYDYLFDPDELDELKVPCHCNAPNCRKFMN 1055
>gi|356562088|ref|XP_003549306.1| PREDICTED: uncharacterized protein LOC100816713 [Glycine max]
Length = 992
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 69/145 (47%), Positives = 95/145 (65%), Gaps = 5/145 (3%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
K L +S+IH GL I + V+EY+GE++ L+++D RE Y+ K+ S+ Y
Sbjct: 848 KHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKSACY 907
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
FR+D +++DAT++GGIARF+NHSC PNC KVI+V +KK+ A+R I GEEITY+
Sbjct: 908 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYD 967
Query: 913 YKFPLE-EKKIPCYCGSKKCHGSLN 936
Y F E E KIPCYC SK C +N
Sbjct: 968 YHFNHEDEGKIPCYCYSKNCRRYMN 992
>gi|225430418|ref|XP_002283013.1| PREDICTED: histone-lysine N-methyltransferase ATX1-like [Vitis
vinifera]
Length = 496
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 72/158 (45%), Positives = 99/158 (62%), Gaps = 7/158 (4%)
Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
E + Q K K L +S IH GL I V+EYVGE++ L+++D RE Y+
Sbjct: 340 EFARYKQAKGWK-HLVVYKSGIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQ 398
Query: 845 ---KMGIGSS-YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
K+ ++ Y FR+D +++DAT++GGIARF+NHSC PNC KVISV +KK+ +A+
Sbjct: 399 SGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAE 458
Query: 901 RHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
R I GEEITY+Y F E+ KKIPC+C S+ C LN
Sbjct: 459 RDINPGEEITYDYHFNHEDEGKKIPCFCNSRNCRRYLN 496
>gi|195445276|ref|XP_002070254.1| GK11134 [Drosophila willistoni]
gi|194166339|gb|EDW81240.1| GK11134 [Drosophila willistoni]
Length = 290
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 62/118 (52%), Positives = 86/118 (72%)
Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
G AR+N+ + + +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 169 GISREARSNQRRQLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 228
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
IEYVG++IR ++D+RE +YE +GIGSSYLFR+D ++DATK G + RFINHSCN
Sbjct: 229 MAIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLERFINHSCN 286
>gi|255635862|gb|ACU18278.1| unknown [Glycine max]
Length = 245
Score = 136 bits (343), Expect = 5e-29, Method: Composition-based stats.
Identities = 69/145 (47%), Positives = 95/145 (65%), Gaps = 5/145 (3%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
K L +S+IH GL I + V+EY+GE++ L+++D RE Y+ K+ S+ Y
Sbjct: 101 KHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKSACY 160
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
FR+D +++DAT++GGIARF+NHSC PNC KVI+V +KK+ A+R I GEEITY+
Sbjct: 161 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYD 220
Query: 913 YKFPLE-EKKIPCYCGSKKCHGSLN 936
Y F E E KIPCYC SK C +N
Sbjct: 221 YHFNHEDEGKIPCYCYSKNCRRYMN 245
>gi|357127667|ref|XP_003565500.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like
[Brachypodium distachyon]
Length = 972
Score = 136 bits (342), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 95/148 (64%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K KR+ RS +H WGL A I+ VIEY G+ +R ++D+RE RY + Y
Sbjct: 826 KTENKRVSCGRSGVHGWGLFAARKIQEGQMVIEYRGDQVRQSVADLREARYHREN-KDCY 884
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + V+DAT+RG IAR INHSC PNCY +++SV + + +I + AKR ++AGEE+TY
Sbjct: 885 LFKISEDVVIDATERGNIARLINHSCMPNCYARIVSVGDDKSQIILIAKRDVSAGEELTY 944
Query: 912 NYKFPL---EEKKIPCYCGSKKCHGSLN 936
+YKF E++K+ C C + C G +N
Sbjct: 945 DYKFDRDESEDRKVLCLCKAPNCRGYMN 972
>gi|123478084|ref|XP_001322206.1| SET domain containing protein [Trichomonas vaginalis G3]
gi|121905048|gb|EAY09983.1| SET domain containing protein [Trichomonas vaginalis G3]
Length = 351
Score = 136 bits (342), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 64/141 (45%), Positives = 93/141 (65%), Gaps = 3/141 (2%)
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
L F++S+I WG+ + I+ V EYVGE+IR ++D R+ EK G +Y+F+LD
Sbjct: 166 LMFEKSEIEGWGVRSTCSIDKNQIVAEYVGEIIRPVVADKRQVYNEKHGNHGTYIFKLDS 225
Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFPL 917
+DAT+RGGIARFINHSC+PNC ++++++ G+K + I + ++I EITY+YK P
Sbjct: 226 QNYLDATQRGGIARFINHSCDPNCRSELVTMSNGRKAVVIISNQYIPPNTEITYDYKLPY 285
Query: 918 E--EKKIPCYCGSKKCHGSLN 936
E +K I C CGS KC LN
Sbjct: 286 ESPDKAIKCLCGSDKCRHYLN 306
>gi|218202612|gb|EEC85039.1| hypothetical protein OsI_32352 [Oryza sativa Indica Group]
Length = 1741
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 65/146 (44%), Positives = 92/146 (63%), Gaps = 6/146 (4%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSY 852
K L +S IH GL E I V++YVGE++ ++D RE Y+ + + Y
Sbjct: 1596 KHLVVYKSSIHGLGLYTSEFIPRGSMVVQYVGEIVGQCVADKREIEYQSGKRQQYKSACY 1655
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
F++ +++DAT++GGIARFINHSC PNC KVISV +KK+ +A+RHI GEEITY+
Sbjct: 1656 FFKIGKEHIIDATRKGGIARFINHSCQPNCVAKVISVRNEKKVVFFAERHINPGEEITYD 1715
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
Y F E+ ++IPC+C S+ C LN
Sbjct: 1716 YHFNREDEGQRIPCFCRSRGCRRYLN 1741
>gi|170067214|ref|XP_001868394.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863427|gb|EDS26810.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 284
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 57/99 (57%), Positives = 79/99 (79%)
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
VG+++R ++D+RE +YE +GI S+YLFR+D ++DATK G +ARF+NHSCNPNCYTKV
Sbjct: 186 VGQVVRPSVTDLRETKYEAIGISSTYLFRIDMETIIDATKCGNLARFLNHSCNPNCYTKV 245
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCY 925
I++E +KKI IY+K+ I EEITY+YK PLE++ I Y
Sbjct: 246 ITIESEKKIVIYSKQPIGVNEEITYDYKVPLEDENISFY 284
>gi|357617692|gb|EHJ70932.1| hypothetical protein KGM_14792 [Danaus plexippus]
Length = 147
Score = 135 bits (341), Expect = 8e-29, Method: Composition-based stats.
Identities = 68/143 (47%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
+ + RSKI GL A +E VIEY+GE+IR ++S+IRE +YE G Y+FR
Sbjct: 6 RNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSELSEIREKKYEAKNRGI-YMFR 64
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
LD+ VVDAT GG+AR+INHSC PNC + + V+ +I I+AKR IA GEE+ Y+YKF
Sbjct: 65 LDERRVVDATLCGGLARYINHSCQPNCVAETVEVDRHLRIIIFAKRRIARGEELAYDYKF 124
Query: 916 PLEEK--KIPCYCGSKKCHGSLN 936
+E+ KI C CG+ C +N
Sbjct: 125 DIEDDAHKIMCMCGAPNCRKWMN 147
>gi|356507582|ref|XP_003522543.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
Length = 1035
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 94/258 (36%), Positives = 132/258 (51%), Gaps = 20/258 (7%)
Query: 683 TAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGA 742
T K+ SR ++ + K S N AR I H A A RVRG
Sbjct: 794 TKKKTGSRLISSSRKKQDDSPVDNTEHEPFSAARCRIFQRTNHTKKRAADEAVSHRVRGP 853
Query: 743 QYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ 802
Y H ++A + S T+R + A + E L Q + +R+ F
Sbjct: 854 -YHHP------LDAIE-------SLNTHRQVVHEPQAFSSFRERLYHLQ-RTENERVCFG 898
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS IH+WGL A I+ D V+EY GE +R I+D+RE RY G YLF++ + VV
Sbjct: 899 RSGIHEWGLFARRNIQEGDMVLEYRGEQVRRSIADLREARYRLEG-KDCYLFKISEEVVV 957
Query: 863 DATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF---PLE 918
DAT +G IAR INHSC PNCY +++SV + + +I + AK ++AAG+E+TY+Y F E
Sbjct: 958 DATDKGNIARLINHSCMPNCYARIMSVGDEESRIVLIAKTNVAAGDELTYDYLFDPDEPE 1017
Query: 919 EKKIPCYCGSKKCHGSLN 936
E K+PC C + C +N
Sbjct: 1018 ENKVPCLCKAPNCRKFMN 1035
>gi|66359990|ref|XP_627173.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
gi|46228588|gb|EAK89458.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
Length = 2244
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 72/145 (49%), Positives = 98/145 (67%), Gaps = 5/145 (3%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG--IGSSYLF 854
KRL ++S IH +GL A E I+ + +IEYVGELIR ++D RE Y+ G GS Y+F
Sbjct: 2100 KRLNIKKSSIHGFGLFAKELIKTGEPIIEYVGELIRNSVADKRESLYKSNGNRDGSCYMF 2159
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK-IFIYAKRHIAAGEEITYNY 913
RLD+ V+DAT G ARF+NH C+PN KVIS++ Q K I I++K+ I EEITY+Y
Sbjct: 2160 RLDESSVIDATNIGNHARFMNHCCDPNSICKVISIDSQNKHIVIFSKKTINKDEEITYDY 2219
Query: 914 KFPLEE--KKIPCYCGSKKCHGSLN 936
+F +EE +KI C+CG+ C G +N
Sbjct: 2220 QFNVEEASEKIICHCGASNCLGRMN 2244
>gi|222642073|gb|EEE70205.1| hypothetical protein OsJ_30300 [Oryza sativa Japonica Group]
Length = 1792
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 92/146 (63%), Gaps = 6/146 (4%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSY 852
K L +S IH GL E I V++YVGE++ ++D RE Y+ + + Y
Sbjct: 1647 KHLVVYKSSIHGLGLYTSEFIPRGSMVVQYVGEIVGQCVADKREIEYQSGKRQQYKSACY 1706
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
F++ +++DAT++GGIARFINHSC PNC K+ISV +KK+ +A+RHI GEEITY+
Sbjct: 1707 FFKIGKEHIIDATRKGGIARFINHSCQPNCVAKIISVRNEKKVVFFAERHINPGEEITYD 1766
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
Y F E+ ++IPC+C S+ C LN
Sbjct: 1767 YHFNREDEGQRIPCFCRSRGCRRYLN 1792
>gi|260794338|ref|XP_002592166.1| hypothetical protein BRAFLDRAFT_125138 [Branchiostoma floridae]
gi|229277381|gb|EEN48177.1| hypothetical protein BRAFLDRAFT_125138 [Branchiostoma floridae]
Length = 1891
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K++Q + K+ R RS+I GL A + I+ VIEY+G +IR ++ + RE Y
Sbjct: 1738 KSAQYRKLKQEWRNNVVLGRSRIQGLGLFAAKDIDKHVMVIEYIGVIIRNEVCNKREHIY 1797
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E+ G Y+FR+D V+DAT GG AR+INHSCNPNC +V++ E ++KI I + R +
Sbjct: 1798 EEQNRGV-YMFRIDSDLVIDATLAGGPARYINHSCNPNCVAEVVNFEKEQKIIIISSRRL 1856
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ GEE+TY+YKF +E E+KIPC CG+ C +N
Sbjct: 1857 SKGEELTYDYKFDIEDDEQKIPCCCGAPNCRKWMN 1891
>gi|427785301|gb|JAA58102.1| Putative histone-lysine n-methyltransferase mll3 [Rhipicephalus
pulchellus]
Length = 2867
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 93/155 (60%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ + K R RS I GL A IE +IEY+G+LIR +I++ E Y
Sbjct: 2714 KSSQYRRMKTEWRNNVYLARSGIQGLGLYAARDIECHTMIIEYIGQLIRNEIAERNEAIY 2773
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ V+DAT GG+AR+INHSC+PNC +++ ++ + KI I A R I
Sbjct: 2774 EAQNRGV-YMFRLDENRVIDATLSGGLARYINHSCSPNCVAELVQIDRENKILIIANRRI 2832
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE+TY+YKF E+ KIPC C + C +N
Sbjct: 2833 TRGEELTYDYKFDYEDDGHKIPCLCSASNCRKWMN 2867
>gi|145518147|ref|XP_001444951.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412384|emb|CAK77554.1| unnamed protein product [Paramecium tetraurelia]
Length = 841
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 96/153 (62%), Gaps = 6/153 (3%)
Query: 786 LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK 845
L + S+ + +K +L S IH +GL + + DFVIEY GE+IR ++D RE Y +
Sbjct: 693 LKQESKNQKQKNKLYVAPSHIHKYGLFTKQDFKKGDFVIEYTGEVIRNALADYRELTYNE 752
Query: 846 MGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAA 905
G G Y+FR V+DAT +G ARF+NHSC PNC + ++ +KI IYA++ I+
Sbjct: 753 QGFGDCYMFRASKTKVIDATFKGSEARFLNHSCQPNCDSLLLD----EKILIYARKDISV 808
Query: 906 GEEITYNYKFPL--EEKKIPCYCGSKKCHGSLN 936
GEE+TY+Y+F + E +KI C CG+K C G LN
Sbjct: 809 GEELTYDYQFEIEAESQKIQCSCGAKNCIGRLN 841
>gi|242045554|ref|XP_002460648.1| hypothetical protein SORBIDRAFT_02g032470 [Sorghum bicolor]
gi|241924025|gb|EER97169.1| hypothetical protein SORBIDRAFT_02g032470 [Sorghum bicolor]
Length = 1658
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/144 (43%), Positives = 92/144 (63%), Gaps = 6/144 (4%)
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSYLF 854
L +S IH GL I V+EYVGE++ +++D RE Y+ + + Y F
Sbjct: 1515 LVVYKSGIHGLGLYTSVFIPRGSMVVEYVGEIVGQRVADKREIEYQSGKRQQYKSACYFF 1574
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
++D ++VDAT++GGIARF+NHSC PNC K+IS+ +KK+ +A+RHI GEEITY+Y
Sbjct: 1575 KIDREHIVDATRKGGIARFVNHSCQPNCVAKIISIRNEKKVMFFAERHINPGEEITYDYH 1634
Query: 915 FPLEE--KKIPCYCGSKKCHGSLN 936
F E+ ++IPC+C S+ C LN
Sbjct: 1635 FNREDEGQRIPCFCRSRYCRRYLN 1658
>gi|68071179|ref|XP_677503.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497646|emb|CAH98690.1| conserved hypothetical protein [Plasmodium berghei]
Length = 1325
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/142 (49%), Positives = 94/142 (66%), Gaps = 4/142 (2%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RL ++S IH +GL A E I + VIEY+GE IR ISD RE YEK+ S Y+FRL+
Sbjct: 1185 RLYVKKSSIHGYGLYAREFINEGEPVIEYIGEYIRNIISDKRETYYEKIE-SSCYMFRLN 1243
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
+ ++DATK G +RFINHSC PNC+ K++S + K I I+AK+ I EEITY+Y+F
Sbjct: 1244 ENIIIDATKWGNASRFINHSCEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFG 1303
Query: 917 LEE--KKIPCYCGSKKCHGSLN 936
+E +K+ C CGS C G +N
Sbjct: 1304 VESEGEKLICLCGSNTCLGRMN 1325
>gi|297803296|ref|XP_002869532.1| hypothetical protein ARALYDRAFT_913734 [Arabidopsis lyrata subsp.
lyrata]
gi|297315368|gb|EFH45791.1| hypothetical protein ARALYDRAFT_913734 [Arabidopsis lyrata subsp.
lyrata]
Length = 1024
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 93/143 (65%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ + V+EY GE +R I+D+RE RY ++G YLF++
Sbjct: 883 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREARYRRVG-KDCYLFKIS 941
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF- 915
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++A GEE+TY+Y F
Sbjct: 942 EEVVVDATDKGNIARLINHSCTPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDYLFD 1001
Query: 916 --PLEEKKIPCYCGSKKCHGSLN 936
EE K+PC C + C +N
Sbjct: 1002 PDEAEELKVPCLCKAPNCRKFMN 1024
>gi|392897209|ref|NP_499819.3| Protein SET-16 [Caenorhabditis elegans]
gi|316891988|emb|CAB03348.3| Protein SET-16 [Caenorhabditis elegans]
Length = 2475
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 68/141 (48%), Positives = 92/141 (65%), Gaps = 6/141 (4%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K R+ RS+I GL A I DF+IEY GE+IR ++ ++RE RY G Y+FR
Sbjct: 2331 KDRVYLARSRIAGLGLYAKVDISMGDFIIEYKGEIIRSEVCEVREIRYVAQNRGV-YMFR 2389
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVI---SVEGQKKIFIYAKRHIAAGEEITYN 912
+D+ +V+DAT GG AR+INHSC+PNC T+++ S +KKI I A R I+A EE+TY+
Sbjct: 2390 IDEEWVIDATMAGGPARYINHSCDPNCSTQILDAGSGAREKKIIITANRPISANEELTYD 2449
Query: 913 YKFPLE--EKKIPCYCGSKKC 931
Y+F LE KIPC CG+ C
Sbjct: 2450 YQFELEGTTDKIPCLCGAPNC 2470
>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 6753
Score = 134 bits (338), Expect = 2e-28, Method: Composition-based stats.
Identities = 72/142 (50%), Positives = 96/142 (67%), Gaps = 4/142 (2%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RL ++S IH +GL E I + VIEY+GE IR ISD RE Y+K+ S Y+FRL+
Sbjct: 6613 RLYVKKSSIHGYGLYTCEFINEGEPVIEYIGEYIRNIISDKREKYYDKIE-SSCYMFRLN 6671
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
+ ++DATK G ++RFINHSC PNC+ K++S + K I I+AKR IAA EEITY+Y+F
Sbjct: 6672 ENIIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIAAHEEITYDYQFG 6731
Query: 917 LEE--KKIPCYCGSKKCHGSLN 936
+E KK+ C CGS C G +N
Sbjct: 6732 VESEGKKLICLCGSSTCLGRMN 6753
>gi|42567196|ref|NP_194520.3| histone-lysine N-methyltransferase ATX4 [Arabidopsis thaliana]
gi|229488104|sp|Q9SUE7.3|ATX4_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX4; AltName:
Full=Protein SET DOMAIN GROUP 16; AltName:
Full=Trithorax-homolog protein 4; Short=TRX-homolog
protein 4; Short=Trithorax 4
gi|332660008|gb|AEE85408.1| histone-lysine N-methyltransferase ATX4 [Arabidopsis thaliana]
Length = 1027
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 93/143 (65%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ + V+EY GE +R I+D+RE RY ++G YLF++
Sbjct: 886 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREARYRRVG-KDCYLFKIS 944
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF- 915
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++A GEE+TY+Y F
Sbjct: 945 EEVVVDATDKGNIARLINHSCTPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDYLFD 1004
Query: 916 --PLEEKKIPCYCGSKKCHGSLN 936
EE K+PC C + C +N
Sbjct: 1005 PDEAEELKVPCLCKAPNCRKFMN 1027
>gi|218187758|gb|EEC70185.1| hypothetical protein OsI_00917 [Oryza sativa Indica Group]
Length = 991
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/148 (44%), Positives = 95/148 (64%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K KR+ RS IH WGL A + I+ VIEY G+ +R ++D+RE RY + Y
Sbjct: 845 KIENKRVSCGRSGIHGWGLFAAKKIQEGQMVIEYRGDQVRRSVADLREARYHREK-KDCY 903
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
LF++ + VVDAT++G IAR INHSC PNCY +++SV +K +I + AKR ++AGEE+TY
Sbjct: 904 LFKISEDVVVDATEKGNIARLINHSCMPNCYARIMSVGDEKSQIILIAKRDVSAGEELTY 963
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E ++PC C + C G +N
Sbjct: 964 DYLFDPDESEDCRVPCLCKAPNCRGYMN 991
>gi|256076153|ref|XP_002574378.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
gi|350644878|emb|CCD60414.1| mixed-lineage leukemia protein, mll, putative [Schistosoma mansoni]
Length = 1560
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/191 (38%), Positives = 104/191 (54%), Gaps = 18/191 (9%)
Query: 752 PEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLR----FQRSKIH 807
P N + +NG + N+ A++ + ++ Q + K + RS+I
Sbjct: 1382 PSFNGTTCSNG-----------VYNIDTASKQHQSSRSQQYRKLKTEVNSNVILGRSRIQ 1430
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
GL A +E + VIEYVGELIRL+I++ RE YE G Y+FRLDD V+DAT
Sbjct: 1431 GLGLFAARDLEQQTMVIEYVGELIRLEIANKREKNYEAHNRGI-YMFRLDDDTVIDATVC 1489
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK--KIPCY 925
GG+AR+INHSC PNC + ++ G I I R I GEE+ Y+Y F LE++ KIPC
Sbjct: 1490 GGLARYINHSCQPNCLAEFVNFGGHSHIVIITNRRIKKGEELCYDYNFDLEDRSDKIPCL 1549
Query: 926 CGSKKCHGSLN 936
C + C +N
Sbjct: 1550 CRAPNCRKWMN 1560
>gi|357154561|ref|XP_003576824.1| PREDICTED: uncharacterized protein LOC100834435 [Brachypodium
distachyon]
Length = 973
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/147 (43%), Positives = 92/147 (62%), Gaps = 6/147 (4%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSS 851
+K L +S IH GL E I VIEYVGE++ +++D RE Y +
Sbjct: 827 QKHLVVYKSGIHGLGLYTSEFIPRGSMVIEYVGEIVGQRVADKREIEYHSGKRQQYKSVC 886
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
Y F++D +++DAT++GG+ARFINHSC PNC K+ISV +KK+ +++R I GEEITY
Sbjct: 887 YFFKIDREHIIDATQKGGVARFINHSCQPNCVAKIISVRNEKKVVFFSERQINPGEEITY 946
Query: 912 NYKFPLEE--KKIPCYCGSKKCHGSLN 936
+Y F E+ ++IPC+C S+ C LN
Sbjct: 947 DYHFTQEDEGQRIPCFCRSRSCRRYLN 973
>gi|118394814|ref|XP_001029767.1| SET domain containing protein [Tetrahymena thermophila]
gi|89284034|gb|EAR82104.1| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 2437
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 85/133 (63%)
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
SKIH +GL A + +D V+EY+GE IR ++D RE Y++ G G Y+F+ ++D
Sbjct: 2305 SKIHKYGLFAKTYFKQDDIVVEYLGETIRQVLADYREKIYKQRGFGDCYMFKACPDKIID 2364
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
AT +G AR++NHSCNPNC + VI E KI IYAKR I GEE+TY+Y F +EE+KI
Sbjct: 2365 ATFKGNEARYLNHSCNPNCSSLVIEYEKDSKIIIYAKRDIKPGEELTYDYCFDIEEEKIN 2424
Query: 924 CYCGSKKCHGSLN 936
C C C +N
Sbjct: 2425 CNCNDPNCTRIMN 2437
>gi|70950103|ref|XP_744403.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56524343|emb|CAH80265.1| SET-domain protein, putative [Plasmodium chabaudi chabaudi]
Length = 870
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/142 (48%), Positives = 94/142 (66%), Gaps = 4/142 (2%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RL ++S IH +GL E I + VIEY+GE IR ISD RE YEK+ S Y+FRL+
Sbjct: 730 RLYVKKSSIHGYGLYTREFINEGEPVIEYIGEYIRNIISDKREKYYEKIE-SSCYMFRLN 788
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
+ ++DATK G ++RFINHSC PNC+ K++S + K I I+AK+ I EEITY+Y+F
Sbjct: 789 ENIIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFG 848
Query: 917 LEE--KKIPCYCGSKKCHGSLN 936
+E +K+ C CGS C G +N
Sbjct: 849 VESEGEKLICLCGSSTCLGRMN 870
>gi|302144034|emb|CBI23139.3| unnamed protein product [Vitis vinifera]
Length = 1018
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 93/143 (65%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A + I+ D V+EY GE +R I+D+RE RY G YLF++
Sbjct: 877 RVCFGRSGIHGWGLFARQAIQEGDMVLEYRGEQVRRSIADMREVRYRLEG-KDCYLFKIS 935
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++AAG+E+TY+Y F
Sbjct: 936 EEVVVDATDKGNIARLINHSCAPNCYARIMSVGDDESRIVLIAKTNVAAGDELTYDYLFD 995
Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C + C +N
Sbjct: 996 PDEPDECKVPCLCKAPNCRKFMN 1018
>gi|224005356|ref|XP_002296329.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586361|gb|ACI65046.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 161
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/140 (47%), Positives = 90/140 (64%), Gaps = 8/140 (5%)
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IH WGL A P +D V EY+GE I ++D+RE Y K I Y F++D V+D
Sbjct: 8 SGIHGWGLFADYPFRKDDIVAEYIGEYISDAVADVREKYYRKHRI-QDYQFKVDGSLVID 66
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQ----KKIFIYAKRHIAAGEEITYNYKFPLE- 918
AT +GG AR+INHSCNPNC K++S + K++ I A+R I A EE++Y+Y+FPLE
Sbjct: 67 ATLKGGYARYINHSCNPNCIAKIVSGKAPNQHLKRVIIVAQRDIQATEELSYDYQFPLET 126
Query: 919 --EKKIPCYCGSKKCHGSLN 936
+ +IPC CGS+ C G +N
Sbjct: 127 DLDLRIPCNCGSRHCRGFMN 146
>gi|308497100|ref|XP_003110737.1| CRE-SET-16 protein [Caenorhabditis remanei]
gi|308242617|gb|EFO86569.1| CRE-SET-16 protein [Caenorhabditis remanei]
Length = 2509
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 92/141 (65%), Gaps = 6/141 (4%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K R+ RS+I GL A I +F+IEY GE+IR ++ ++RE RY G Y+FR
Sbjct: 2365 KDRVYLARSRIAGLGLYAKNDISMGEFIIEYKGEIIRAEVCEVREIRYTAQNRGV-YMFR 2423
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVI---SVEGQKKIFIYAKRHIAAGEEITYN 912
+D+ +V+DAT GG AR+INHSC+PNC T+++ S +KKI I A R I++ EE+TY+
Sbjct: 2424 IDEEWVIDATMAGGPARYINHSCDPNCSTQILDAGSSGREKKIIITANRPISSNEELTYD 2483
Query: 913 YKFPLE--EKKIPCYCGSKKC 931
Y+F LE KIPC CG+ C
Sbjct: 2484 YQFELEGTTDKIPCLCGAPNC 2504
>gi|359490859|ref|XP_002268525.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
vinifera]
Length = 1094
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/148 (44%), Positives = 94/148 (63%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
+ R+ F RS IH WGL A + I+ D V+EY GE +R I+D+RE RY G Y
Sbjct: 948 RTENDRVCFGRSGIHGWGLFARQAIQEGDMVLEYRGEQVRRSIADMREVRYRLEG-KDCY 1006
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++AAG+E+TY
Sbjct: 1007 LFKISEEVVVDATDKGNIARLINHSCAPNCYARIMSVGDDESRIVLIAKTNVAAGDELTY 1066
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E K+PC C + C +N
Sbjct: 1067 DYLFDPDEPDECKVPCLCKAPNCRKFMN 1094
>gi|115435312|ref|NP_001042414.1| Os01g0218800 [Oryza sativa Japonica Group]
gi|56784088|dbj|BAD81417.1| putative trithorax 3 [Oryza sativa Japonica Group]
gi|113531945|dbj|BAF04328.1| Os01g0218800 [Oryza sativa Japonica Group]
Length = 991
Score = 134 bits (336), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 66/148 (44%), Positives = 95/148 (64%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K KR+ RS IH WGL A + I+ VIEY G+ +R ++D+RE RY + Y
Sbjct: 845 KIENKRVSCGRSGIHGWGLFAAKKIQEGQMVIEYRGDQVRRSVADLREARYHREK-KDCY 903
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
LF++ + VVDAT++G IAR INHSC PNCY +++SV +K +I + AKR ++AGEE+TY
Sbjct: 904 LFKISEDVVVDATEKGNIARLINHSCMPNCYARIMSVGDEKSQIILIAKRDVSAGEELTY 963
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E ++PC C + C G +N
Sbjct: 964 DYLFDPDESEDCRVPCLCKALNCRGYMN 991
>gi|313247613|emb|CBY15785.1| unnamed protein product [Oikopleura dioica]
Length = 386
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 69/147 (46%), Positives = 93/147 (63%), Gaps = 3/147 (2%)
Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
V+ + L+ S IH GL A P E + VIEY G +IR ++ D+RE Y+ GIG+
Sbjct: 241 VETSRNSLQVLPSAIHGRGLFARRPYEPGELVIEYSGTVIRGELCDMREKYYDDRGIGT- 299
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
Y+FR+DD VVDAT GG ARFINHSC PNC +K+I+V+ +K I I A R I GEE+TY
Sbjct: 300 YMFRVDDDLVVDATMEGGRARFINHSCGPNCLSKIITVDSRKHICIIAGRFIDFGEELTY 359
Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
+YKF + ++I C C + C +N
Sbjct: 360 DYKFDRDFGSERIECGCQAPNCRRFMN 386
>gi|353232161|emb|CCD79516.1| hypothetical protein Smp_070170 [Schistosoma mansoni]
Length = 1430
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 64/144 (44%), Positives = 91/144 (63%), Gaps = 4/144 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A I F+IEY+GE+IR ++++ RE YE G Y+FR
Sbjct: 1288 KTNVVLARSRIQGLGLYAARDISKSTFIIEYLGEVIRNEVANRRERLYESQNRGI-YMFR 1346
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+DD ++VDAT GG+AR+INHSC+PNC +++ + I I A ++I G+E+TY+YKF
Sbjct: 1347 VDDDWIVDATMSGGLARYINHSCDPNCTAEILHCDNSNHIVIIASKNIEKGDELTYDYKF 1406
Query: 916 PLEE---KKIPCYCGSKKCHGSLN 936
LEE +IPC CGS C +N
Sbjct: 1407 DLEEDRWDRIPCLCGSINCRKWMN 1430
>gi|16118405|gb|AAL12215.1| trithorax 4 [Arabidopsis thaliana]
Length = 285
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 92/143 (64%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL I+ + V+EY GE +R I+D+RE RY ++G YLF++
Sbjct: 144 RVCFGRSGIHGWGLFGRRNIQEGEMVLEYRGEQVRGSIADLREARYRRVG-KDCYLFKIS 202
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF- 915
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++A GEE+TY+Y F
Sbjct: 203 EELVVDATDKGNIARLINHSCTPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDYLFD 262
Query: 916 --PLEEKKIPCYCGSKKCHGSLN 936
EE K+PC C + C +N
Sbjct: 263 PDEAEELKVPCLCKAPNCRKFMN 285
>gi|301116291|ref|XP_002905874.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
gi|262109174|gb|EEY67226.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
Length = 1659
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 57/118 (48%), Positives = 87/118 (73%), Gaps = 1/118 (0%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
+R+ ++S IH +GL EP+ ++EY G++I I+D RE RYE+ G+GS Y+FRL
Sbjct: 1539 ERMLVRKSSIHGYGLFLKEPVSDGQMIVEYQGQMINQTIADERERRYEEQGVGSCYMFRL 1598
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
D+ ++DAT+ G +ARFINHSC+P + ++++VE G+KKI I+AKR IA G+E+TY+Y
Sbjct: 1599 DEKTIIDATRCGNLARFINHSCDPKAFARIVAVEGGEKKIVIFAKRAIAVGDEVTYDY 1656
>gi|356518627|ref|XP_003527980.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
Length = 1067
Score = 133 bits (334), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 91/143 (63%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ D V+EY GE +R I+D+RE RY G YLF++
Sbjct: 926 RVCFGRSGIHGWGLFARRNIQEGDMVLEYRGEQVRRSIADLREARYRLEG-KDCYLFKIS 984
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF- 915
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ AG+E+TY+Y F
Sbjct: 985 EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTNVVAGDELTYDYLFD 1044
Query: 916 --PLEEKKIPCYCGSKKCHGSLN 936
EE K+PC C + C +N
Sbjct: 1045 PDEPEENKVPCLCKAPNCRKYMN 1067
>gi|405970535|gb|EKC35431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
Length = 1927
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 64/136 (47%), Positives = 86/136 (63%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS+I GL A +E VIEY+G LIR ++++ E YE+ G Y+FR+D+ VV
Sbjct: 1793 RSRIQGLGLFAARDLEKHTMVIEYIGYLIRNEVANRTEVVYEEQNRGV-YMFRIDNETVV 1851
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK-- 920
DAT GG AR+INHSCNPNC +V+ + + KI I R I GEE+TY+YKF E++
Sbjct: 1852 DATMAGGPARYINHSCNPNCVAEVVPFDKESKIIIITNRRIPRGEELTYDYKFDFEDEQH 1911
Query: 921 KIPCYCGSKKCHGSLN 936
KIPC CG+ C +N
Sbjct: 1912 KIPCCCGAPGCRKWMN 1927
>gi|341878859|gb|EGT34794.1| CBN-SET-16 protein [Caenorhabditis brenneri]
Length = 2498
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 91/141 (64%), Gaps = 6/141 (4%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K R+ RS+I GL A + I +++IEY GE+IR ++ ++RE RY G Y FR
Sbjct: 2354 KDRVLLARSQIAGLGLYAKKAISMGEYIIEYKGEVIRSEVCEVREKRYIAQNRGV-YQFR 2412
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG---QKKIFIYAKRHIAAGEEITYN 912
+D+ +VVDAT GG+AR+INHSC+PNC T+++ G +KI I A R I+ EE+TYN
Sbjct: 2413 IDENWVVDATMSGGVARYINHSCDPNCSTEILPTGGGPSNQKIIIMANRPISEMEELTYN 2472
Query: 913 YKFPLEEK--KIPCYCGSKKC 931
Y F LE+ KIPC CG+ C
Sbjct: 2473 YNFDLEDPTDKIPCLCGAPNC 2493
>gi|326679526|ref|XP_001919281.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Danio rerio]
Length = 3915
Score = 132 bits (333), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 90/143 (62%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 3774 KSNVYLARSRIQGLGLYAARDIEKYTMVIEYIGTIIRSEVANRKEKMYEAQNRGV-YMFR 3832
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D +V+DAT GG AR+INHSC PNC T+V+++E KI I + R I GEE+ Y+YKF
Sbjct: 3833 IDSEHVIDATITGGPARYINHSCAPNCITEVVALERGHKIIISSNRRIQRGEELCYDYKF 3892
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
LE + KIPC+CG+ C +N
Sbjct: 3893 DLEDDQHKIPCHCGAVNCRKWMN 3915
>gi|414590164|tpg|DAA40735.1| TPA: putative trithorax-like family protein [Zea mays]
Length = 1591
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 92/146 (63%), Gaps = 6/146 (4%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSY 852
K L +S IH GL I V+EYVGE++ +++D RE Y+ + + Y
Sbjct: 1446 KHLVVYKSGIHGLGLYTSVFIPRGSMVVEYVGEIVGQRVADRREIEYQSGKRQQYKSACY 1505
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
F++D +++DAT++GGIARF+NHSC PNC K+ISV +KK+ +A+RHI GEEITY+
Sbjct: 1506 FFKIDREHIIDATRKGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYD 1565
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
Y F E+ ++I C+C S+ C LN
Sbjct: 1566 YHFNREDEGQRILCFCRSRYCRRYLN 1591
>gi|414590165|tpg|DAA40736.1| TPA: putative trithorax-like family protein [Zea mays]
Length = 1566
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 92/146 (63%), Gaps = 6/146 (4%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSY 852
K L +S IH GL I V+EYVGE++ +++D RE Y+ + + Y
Sbjct: 1421 KHLVVYKSGIHGLGLYTSVFIPRGSMVVEYVGEIVGQRVADRREIEYQSGKRQQYKSACY 1480
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
F++D +++DAT++GGIARF+NHSC PNC K+ISV +KK+ +A+RHI GEEITY+
Sbjct: 1481 FFKIDREHIIDATRKGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYD 1540
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
Y F E+ ++I C+C S+ C LN
Sbjct: 1541 YHFNREDEGQRILCFCRSRYCRRYLN 1566
>gi|393908177|gb|EJD74941.1| F/Y-rich family protein [Loa loa]
Length = 2288
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/146 (45%), Positives = 92/146 (63%), Gaps = 6/146 (4%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE +IEY GE+IR ++ ++RE +YE G Y+FR
Sbjct: 2144 KNTVYLARSRIQGLGLYASRDIEMNSMIIEYKGEVIRSEVGEMREKKYEAQNRGV-YMFR 2202
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVIS---VEGQKKIFIYAKRHIAAGEEITYN 912
+DD ++DAT GG AR+INHSC+PNC T+++ KKI I A R I+AGEE+TY+
Sbjct: 2203 IDDEKLIDATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEELTYD 2262
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
Y+F +E+ KIPC CG+ C +N
Sbjct: 2263 YQFDIEDAADKIPCLCGAPNCQKWMN 2288
>gi|115480613|ref|NP_001063900.1| Os09g0556700 [Oryza sativa Japonica Group]
gi|113632133|dbj|BAF25814.1| Os09g0556700 [Oryza sativa Japonica Group]
Length = 424
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 97/158 (61%), Gaps = 7/158 (4%)
Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
E + Q+K K L +S IH GL E I V++YVGE++ ++D RE Y+
Sbjct: 268 EYVHYKQLKG-WKHLVVYKSSIHGLGLYTSEFIPRGSMVVQYVGEIVGQCVADKREIEYQ 326
Query: 845 ----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
+ + Y F++ +++DAT++GGIARFINHSC PNC K+ISV +KK+ +A+
Sbjct: 327 SGKRQQYKSACYFFKIGKEHIIDATRKGGIARFINHSCQPNCVAKIISVRNEKKVVFFAE 386
Query: 901 RHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
RHI GEEITY+Y F E+ ++IPC+C S+ C LN
Sbjct: 387 RHINPGEEITYDYHFNREDEGQRIPCFCRSRGCRRYLN 424
>gi|242051571|ref|XP_002454931.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
gi|241926906|gb|EES00051.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
Length = 993
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 64/148 (43%), Positives = 92/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ RS IH WGL A I+ VIEY GE +R ++D+RE +Y + Y
Sbjct: 847 KTENLRVSCGRSGIHGWGLFAARNIQEGQMVIEYRGEQVRRCVADLREAQYHREK-KDCY 905
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
LF++ + V+DAT +G IAR INHSC PNCY ++++V G + +I + AKR ++AGEE+TY
Sbjct: 906 LFKISEDVVIDATDKGNIARLINHSCMPNCYARIMTVSGDRNQIILIAKRDVSAGEELTY 965
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E K+PC C + C G +N
Sbjct: 966 DYLFDPDESEDCKVPCLCKAPNCRGYMN 993
>gi|297792715|ref|XP_002864242.1| hypothetical protein ARALYDRAFT_918421 [Arabidopsis lyrata subsp.
lyrata]
gi|297310077|gb|EFH40501.1| hypothetical protein ARALYDRAFT_918421 [Arabidopsis lyrata subsp.
lyrata]
Length = 1049
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 93/143 (65%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ + V+EY GE +R I+D+RE RY + G YLF++
Sbjct: 908 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRREG-KDCYLFKIS 966
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
+ VVDAT++G IAR INHSC PNCY +++SV + + +I + AK +A+GEE+TY+Y F
Sbjct: 967 EEVVVDATEKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASGEELTYDYLFD 1026
Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C S C +N
Sbjct: 1027 PDEPDEFKVPCLCKSPNCRKFMN 1049
>gi|170581736|ref|XP_001895813.1| F/Y-rich N-terminus family protein [Brugia malayi]
gi|158597106|gb|EDP35332.1| F/Y-rich N-terminus family protein [Brugia malayi]
Length = 2144
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/146 (45%), Positives = 92/146 (63%), Gaps = 6/146 (4%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE +IEY GE+IR ++ ++RE +YE G Y+FR
Sbjct: 2000 KNTVYLARSRIQGLGLYASRDIEMNSMIIEYKGEVIRSEVGEMREKKYEAQNRGV-YMFR 2058
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVIS---VEGQKKIFIYAKRHIAAGEEITYN 912
+DD ++DAT GG AR+INHSC+PNC T+++ KKI I A R I+AGEE+TY+
Sbjct: 2059 IDDEKLIDATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEELTYD 2118
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
Y+F +E+ KIPC CG+ C +N
Sbjct: 2119 YQFDIEDVADKIPCLCGAPNCQKWMN 2144
>gi|402865476|ref|XP_003919660.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3, partial [Papio anubis]
Length = 3229
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 3088 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3146
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 3147 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 3206
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 3207 DFEDDQHKIPCHCGAVNCRKWMN 3229
>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
porcellus]
Length = 4878
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4737 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4795
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4796 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQTGEELCYDYKF 4855
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4856 DFEDDQHKIPCHCGAVNCRKWMN 4878
>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
Length = 4499
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4358 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4416
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4417 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4476
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4477 DFEDDQHKIPCHCGAVNCRKWMN 4499
>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
Length = 4906
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4765 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4823
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4824 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4883
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4884 DFEDDQHKIPCHCGAVNCRKWMN 4906
>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 4029
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 3888 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3946
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 3947 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4006
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4007 DFEDDQHKIPCHCGAVNCRKWMN 4029
>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
garnettii]
Length = 4945
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4804 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4862
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4863 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4922
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4923 DFEDDQHKIPCHCGAVNCRKWMN 4945
>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
[Oryctolagus cuniculus]
Length = 4865
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4724 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4782
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4783 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4842
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4843 DFEDDQHKIPCHCGAVNCRKWMN 4865
>gi|351695440|gb|EHA98358.1| Histone-lysine N-methyltransferase MLL3 [Heterocephalus glaber]
Length = 4724
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4583 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4641
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4642 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4701
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4702 DFEDDQHKIPCHCGAVNCRKWMN 4724
>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
guttata]
Length = 4871
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4730 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4788
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4789 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4848
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4849 DFEDDQHKIPCHCGAVNCRKWMN 4871
>gi|219109809|ref|XP_002176658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411193|gb|EEC51121.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
Length = 139
Score = 131 bits (329), Expect = 2e-27, Method: Composition-based stats.
Identities = 64/140 (45%), Positives = 90/140 (64%), Gaps = 8/140 (5%)
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
SK+H WGL A +P D V EY+GE I ++D RE Y++ I Y FRLD+ V+D
Sbjct: 1 SKVHGWGLFADQPFIKGDVVAEYLGEYISYAVADNREKIYQEQRI-QDYQFRLDEKLVID 59
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQ----KKIFIYAKRHIAAGEEITYNYKFPLE- 918
AT +GG AR+INH+C PNC+ K++S E K++ I A++ I+ EEI+Y+Y+FPLE
Sbjct: 60 ATLKGGPARYINHNCTPNCFAKIVSGEPPSPCLKRVIIIAQQEISINEEISYDYQFPLEL 119
Query: 919 --EKKIPCYCGSKKCHGSLN 936
++IPC C S C G +N
Sbjct: 120 DLSERIPCNCQSDACRGFMN 139
>gi|167526642|ref|XP_001747654.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773758|gb|EDQ87394.1| predicted protein [Monosiga brevicollis MX1]
Length = 1547
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 81/116 (69%), Gaps = 1/116 (0%)
Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
+R SKIH GL AL P++ + +IEY GE IR +++D RE Y+ GIG Y+FR+D
Sbjct: 1432 VRVSHSKIHGIGLFALRPLKPGEMIIEYAGEQIRPELTDKREAYYDSRGIGC-YMFRVDA 1490
Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
VVDAT G ARF+NHSC+PNC +++I + K I I+A+R+IA GEE+TY+YK
Sbjct: 1491 NLVVDATLTGNPARFVNHSCDPNCASRIIQTDVGKHIVIFAERNIAVGEELTYDYK 1546
>gi|399217787|emb|CCF74674.1| unnamed protein product [Babesia microti strain RI]
Length = 602
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 99/183 (54%), Gaps = 20/183 (10%)
Query: 771 RVKLRNLLAAAEGAELLKASQVKARK--------------KRLRFQRSKIHDWGLVALEP 816
R K+ NL A LL + R K LR +S +H +GL A E
Sbjct: 422 RNKMNNLYAKHRRHNLLHNICSRIRNITPKNGFLSPFPVNKVLRLAKSGVHGFGLFAGED 481
Query: 817 IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
I+ +F+IEY G L+ ++D+RE YEK S+YLFR DD V+DATK G +ARF+NH
Sbjct: 482 IKKGEFIIEYAGVLVTDAMTDVREDAYEKCMGSSTYLFRADDNRVIDATKTGNLARFVNH 541
Query: 877 SCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCH 932
SC+PN YT + E + IYA R IA G+EI YNY+ L E K CYCGS C
Sbjct: 542 SCDPNAYTNTVHDEYNGPHVGIYASRDIAPGQEIFYNYR--LSESVYNKEYCYCGSSNCS 599
Query: 933 GSL 935
G +
Sbjct: 600 GRM 602
>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
Length = 4532
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4391 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4449
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4450 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4509
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4510 DFEDDQHKIPCHCGAVNCRKWMN 4532
>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Equus caballus]
Length = 4910
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4769 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4827
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4828 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4887
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4888 DFEDDQHKIPCHCGAVNCRKWMN 4910
>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan troglodytes]
Length = 4026
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 3885 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3943
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 3944 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4003
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4004 DFEDDQHKIPCHCGAVNCRKWMN 4026
>gi|224120768|ref|XP_002318412.1| SET domain protein [Populus trichocarpa]
gi|222859085|gb|EEE96632.1| SET domain protein [Populus trichocarpa]
Length = 1078
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/148 (45%), Positives = 94/148 (63%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F RS IH WGL A I+ + V+EY GE +R I+D+RE RY G Y
Sbjct: 932 KTENDRVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREARYRLEG-KDCY 990
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY
Sbjct: 991 LFKISEEVVVDATDKGNIARLINHSCMPNCYARIMSVGDNESRIVLIAKTNVSAGDELTY 1050
Query: 912 NYKF-PLE--EKKIPCYCGSKKCHGSLN 936
+Y F P E E K+PC C + C +N
Sbjct: 1051 DYLFDPNEPDEFKVPCLCKAPNCRKYMN 1078
>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
Length = 4575
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4434 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4492
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4493 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4552
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4553 DFEDDQHKIPCHCGAVNCRKWMN 4575
>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Loxodonta africana]
Length = 4785
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4644 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4702
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4703 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4762
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4763 DFEDDQHKIPCHCGAVNCRKWMN 4785
>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
Length = 4025
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 3884 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3942
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 3943 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4002
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4003 DFEDDQHKIPCHCGAVNCRKWMN 4025
>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
Length = 4912
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4771 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4829
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4830 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4889
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4890 DFEDDQHKIPCHCGAVNCRKWMN 4912
>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
jacchus]
Length = 4909
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4768 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4826
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4827 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4886
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4887 DFEDDQHKIPCHCGAVNCRKWMN 4909
>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
leucogenys]
Length = 4856
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4715 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4773
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4774 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4833
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4834 DFEDDQHKIPCHCGAVNCRKWMN 4856
>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
gorilla]
Length = 4782
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4641 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4699
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4700 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4759
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4760 DFEDDQHKIPCHCGAVNCRKWMN 4782
>gi|358342811|dbj|GAA39455.2| histone-lysine N-methyltransferase MLL3 [Clonorchis sinensis]
Length = 1443
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/167 (41%), Positives = 99/167 (59%), Gaps = 13/167 (7%)
Query: 781 AEGAELLKASQVKARKKRLRFQ--------RSKIHDWGLVALEPIEAEDFVIEYVGELIR 832
+ A L S+ + + +RLR + RS+I GL A + F+IEY+GELIR
Sbjct: 1279 STAASFLSVSKSQ-QYRRLRLEWRYNVILARSQIQGLGLYAARNLSKHAFIIEYLGELIR 1337
Query: 833 LKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ 892
++ + RE YE G Y+FR+D+ +VDAT GG+AR+INHSC PNC +V++ E
Sbjct: 1338 NEVGNRRERLYELQNRGI-YMFRVDEDSIVDATMCGGLARYINHSCEPNCTAEVLNCENG 1396
Query: 893 KKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
I I A R I GEE+TY+YKF +EE+ +IPC CG+ C +N
Sbjct: 1397 SHIIIIASRDIEKGEELTYDYKFDIEEERGDRIPCLCGAPNCRRWMN 1443
>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3, partial [Pan paniscus]
Length = 4810
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4669 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4727
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4728 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4787
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4788 DFEDDQHKIPCHCGAVNCRKWMN 4810
>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
harrisii]
Length = 4951
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4810 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4868
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4869 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4928
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4929 DFEDDQHKIPCHCGAVNCRKWMN 4951
>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4930
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4789 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4847
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4848 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4907
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4908 DFEDDQHKIPCHCGAVNCRKWMN 4930
>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ailuropoda melanoleuca]
Length = 4927
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4786 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4844
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4845 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4904
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4905 DFEDDQHKIPCHCGAVNCRKWMN 4927
>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
Length = 4916
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4775 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4833
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4834 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4893
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4894 DFEDDQHKIPCHCGAVNCRKWMN 4916
>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4931
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4790 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4848
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4849 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4908
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4909 DFEDDQHKIPCHCGAVNCRKWMN 4931
>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Canis lupus familiaris]
Length = 4874
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4733 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4791
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4792 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4851
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4852 DFEDDQHKIPCHCGAVNCRKWMN 4874
>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
homolog
Length = 4903
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4762 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4820
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4821 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4880
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4881 DFEDDQHKIPCHCGAVNCRKWMN 4903
>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
Length = 4911
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4770 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4828
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4829 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4888
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4889 DFEDDQHKIPCHCGAVNCRKWMN 4911
>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Homologous to ALR protein; AltName: Full=Lysine
N-methyltransferase 2C; Short=KMT2C; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
Length = 4911
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4770 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4828
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4829 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4888
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4889 DFEDDQHKIPCHCGAVNCRKWMN 4911
>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
Length = 4904
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4763 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4821
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4822 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4881
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4882 DFEDDQHKIPCHCGAVNCRKWMN 4904
>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
Length = 2526
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/221 (37%), Positives = 121/221 (54%), Gaps = 19/221 (8%)
Query: 719 ISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLL 778
I + + S + + A R RG + +T+ E A + LSA ++RV+
Sbjct: 2312 IETLQSRQASTSLTSANSGRTRGVRSYYTE----EAAARARGFPQDLSAISHRVE----- 2362
Query: 779 AAAEGAELLKASQVKAR---KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
A+ A + K R K+ + RS+I GL A I +++IEY GE+IR ++
Sbjct: 2363 -PAQSATAQYTAYQKMRREWKELVYLARSRIAGLGLYAKTDIPMGEYIIEYKGEIIRSEL 2421
Query: 836 SDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI---SVEGQ 892
++RE RY G Y+FRLD+ +V+DAT GG AR++NHSC+PNC T + S
Sbjct: 2422 CEVREKRYNAQNRGV-YMFRLDEEWVIDATMSGGPARYVNHSCDPNCSTMLFDSNSGARD 2480
Query: 893 KKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKC 931
KKI I A R I+A EE+TY+Y+F LE+ K+PC CG+ C
Sbjct: 2481 KKILITANRPISANEELTYDYQFELEDATDKVPCLCGAPNC 2521
>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
mulatta]
Length = 4785
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4644 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4702
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4703 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4762
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4763 DFEDDQHKIPCHCGAVNCRKWMN 4785
>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
Length = 4911
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4770 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4828
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4829 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4888
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4889 DFEDDQHKIPCHCGAVNCRKWMN 4911
>gi|312089370|ref|XP_003146221.1| histone methyltransferase [Loa loa]
Length = 689
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 93/149 (62%), Gaps = 6/149 (4%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K + RS+I GL A IE +IEY GE+IR ++ ++RE +YE G Y
Sbjct: 542 KEWKNTVYLARSRIQGLGLYASRDIEMNSMIIEYKGEVIRSEVGEMREKKYEAQNRGV-Y 600
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS---VEGQKKIFIYAKRHIAAGEEI 909
+FR+DD ++DAT GG AR+INHSC+PNC T+++ KKI I A R I+AGEE+
Sbjct: 601 MFRIDDEKLIDATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEEL 660
Query: 910 TYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
TY+Y+F +E+ KIPC CG+ C +N
Sbjct: 661 TYDYQFDIEDAADKIPCLCGAPNCQKWMN 689
>gi|414884806|tpg|DAA60820.1| TPA: putative trithorax-like family protein isoform 1 [Zea mays]
gi|414884807|tpg|DAA60821.1| TPA: putative trithorax-like family protein isoform 2 [Zea mays]
Length = 982
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 98/152 (64%), Gaps = 5/152 (3%)
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
GLS + + N+ + E ++KA+ +KRL F +SKIH +G+ A +A D +
Sbjct: 832 GLSDAVQQEAIGNVSSMVEKYTIVKAT----FRKRLTFGKSKIHGYGVFAKVAHKAGDMM 887
Query: 824 IEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
+EY+GE++R ISD RE R Y + +Y+FR+DD V+DAT+ G IAR INHSC PNC
Sbjct: 888 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSCEPNC 947
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
Y++ I++ G + I I+AKR I EE+TY+Y+
Sbjct: 948 YSRAITILGDEHIIIFAKRDIDPWEELTYDYR 979
>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ornithorhynchus anatinus]
Length = 4910
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4769 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4827
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4828 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4887
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4888 DFEDDQHKIPCHCGAVNCRKWMN 4910
>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
(Silurana) tropicalis]
Length = 5215
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 89/143 (62%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 5074 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 5132
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E +I I + R I GEE++Y+YKF
Sbjct: 5133 IDNEHVIDATLTGGPARYINHSCAPNCVAEVVTFEKGHRIIISSNRRIQKGEELSYDYKF 5192
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5193 DFEDDQHKIPCHCGAVNCRKWMN 5215
>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
domestica]
Length = 4862
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4721 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4779
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4780 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4839
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4840 DFEDDQHKIPCHCGAVNCRKWMN 4862
>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
Length = 4780
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4639 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4697
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4698 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4757
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4758 DFEDDQHKIPCHCGAVNCRKWMN 4780
>gi|308801407|ref|XP_003078017.1| trithorax-like (ISS) [Ostreococcus tauri]
gi|116056468|emb|CAL52757.1| trithorax-like (ISS) [Ostreococcus tauri]
Length = 2007
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 60/126 (47%), Positives = 83/126 (65%), Gaps = 1/126 (0%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
RL F +S IH WGL+A +A V E+ GE R ++D+RE YE+ G+ YL + D
Sbjct: 1603 RLTFCKSNIHGWGLLAKTAHKAGSIVTEFKGETCRSTVADMRETAYEEEGV-DCYLLKQD 1661
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
D VVD T +G +ARF NHSCNPN Y+K++ V+G+ I +A+ I AGEE+TYNY+F
Sbjct: 1662 DDTVVDCTFQGNLARFTNHSCNPNMYSKIVKVDGENHIIFFARNDIKAGEELTYNYRFES 1721
Query: 918 EEKKIP 923
E+ K+P
Sbjct: 1722 EDGKVP 1727
>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
griseus]
Length = 4871
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4730 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4788
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4789 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4848
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4849 DFEDDQHKIPCHCGAVNCRKWMN 4871
>gi|326431279|gb|EGD76849.1| hypothetical protein PTSG_12693 [Salpingoeca sp. ATCC 50818]
Length = 3557
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 102/170 (60%), Gaps = 9/170 (5%)
Query: 770 NRVKLRNLLAAAEGAELLKASQVKAR------KKRLRFQRSKIHDWGLVALEPIEAEDFV 823
+R+K R A E L + +++AR K +LR RS+I GL A + +A + V
Sbjct: 3377 HRIKPRRRPTRAYTVERLPSERIEARALAKMRKSKLRVGRSRISGHGLYAEQDFKAGECV 3436
Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRL-DDGYVVDATKRGGIARFINHSCNPNC 882
EY G R ++D+RE Y G+G Y+F + ++ V+DAT +G RFINHSC+PNC
Sbjct: 3437 AEYAGVRFRTTLADVREGEYRDRGLGC-YMFSVGNNTVVIDATLQGNWTRFINHSCDPNC 3495
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL-EEKKIPCYCGSKKC 931
++ I +EGQ + I+A+R I AGEE+TY+YK + ++ K PC CG+ C
Sbjct: 3496 CSEEIDIEGQPYVVIFARRDIRAGEELTYDYKHQMDDDDKAPCLCGAIDC 3545
>gi|427785297|gb|JAA58100.1| Putative histone-lysine n-methyltransferase mll3 [Rhipicephalus
pulchellus]
Length = 3936
Score = 130 bits (327), Expect = 3e-27, Method: Composition-based stats.
Identities = 68/155 (43%), Positives = 93/155 (60%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ + K R RS I GL A IE +IEY+G+LIR +I++ E Y
Sbjct: 3783 KSSQYRRMKTEWRNNVYLARSGIQGLGLYAARDIECHTMIIEYIGQLIRNEIAERNEAIY 3842
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FRLD+ V+DAT GG+AR+INHSC+PNC +++ ++ + KI I A R I
Sbjct: 3843 EAQNRGV-YMFRLDENRVIDATLSGGLARYINHSCSPNCVAELVQIDRENKILIIANRRI 3901
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
GEE+TY+YKF E+ KIPC C + C +N
Sbjct: 3902 TRGEELTYDYKFDYEDDGHKIPCLCSASNCRKWMN 3936
>gi|256084014|ref|XP_002578229.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
Length = 399
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 64/144 (44%), Positives = 91/144 (63%), Gaps = 4/144 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A I F+IEY+GE+IR ++++ RE YE G Y+FR
Sbjct: 257 KTNVVLARSRIQGLGLYAARDISKSTFIIEYLGEVIRNEVANRRERLYESQNRGI-YMFR 315
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+DD ++VDAT GG+AR+INHSC+PNC +++ + I I A ++I G+E+TY+YKF
Sbjct: 316 VDDDWIVDATMSGGLARYINHSCDPNCTAEILHCDNSNHIVIIASKNIEKGDELTYDYKF 375
Query: 916 PLEE---KKIPCYCGSKKCHGSLN 936
LEE +IPC CGS C +N
Sbjct: 376 DLEEDRWDRIPCLCGSINCRKWMN 399
>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
[Strongylocentrotus purpuratus]
Length = 4860
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/144 (45%), Positives = 89/144 (61%), Gaps = 4/144 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G LIR ++++ E YE G Y+FR
Sbjct: 4718 KTNVYLARSQIQGLGLYAAHDIEKHTMVIEYIGTLIRNEVANKWERDYEAANRGV-YMFR 4776
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYK 914
+DD VVDAT+ G AR+INHSCNPNC +V++ + QKKI I + R + GEE+TY+YK
Sbjct: 4777 IDDYTVVDATRSGNPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEELTYDYK 4836
Query: 915 FPLE--EKKIPCYCGSKKCHGSLN 936
F +E + KIPC C + C +N
Sbjct: 4837 FEIENDQNKIPCLCKAPNCRKWMN 4860
>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
[Strongylocentrotus purpuratus]
Length = 4873
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/144 (45%), Positives = 89/144 (61%), Gaps = 4/144 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G LIR ++++ E YE G Y+FR
Sbjct: 4731 KTNVYLARSQIQGLGLYAAHDIEKHTMVIEYIGTLIRNEVANKWERDYEAANRGV-YMFR 4789
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYK 914
+DD VVDAT+ G AR+INHSCNPNC +V++ + QKKI I + R + GEE+TY+YK
Sbjct: 4790 IDDYTVVDATRSGNPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEELTYDYK 4849
Query: 915 FPLE--EKKIPCYCGSKKCHGSLN 936
F +E + KIPC C + C +N
Sbjct: 4850 FEIENDQNKIPCLCKAPNCRKWMN 4873
>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
[Strongylocentrotus purpuratus]
Length = 4856
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/144 (45%), Positives = 89/144 (61%), Gaps = 4/144 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G LIR ++++ E YE G Y+FR
Sbjct: 4714 KTNVYLARSQIQGLGLYAAHDIEKHTMVIEYIGTLIRNEVANKWERDYEAANRGV-YMFR 4772
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYK 914
+DD VVDAT+ G AR+INHSCNPNC +V++ + QKKI I + R + GEE+TY+YK
Sbjct: 4773 IDDYTVVDATRSGNPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEELTYDYK 4832
Query: 915 FPLE--EKKIPCYCGSKKCHGSLN 936
F +E + KIPC C + C +N
Sbjct: 4833 FEIENDQNKIPCLCKAPNCRKWMN 4856
>gi|255544976|ref|XP_002513549.1| trithorax, putative [Ricinus communis]
gi|223547457|gb|EEF48952.1| trithorax, putative [Ricinus communis]
Length = 1018
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 92/143 (64%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ + V+EY GE +R I+D+RE RY G YLF++
Sbjct: 877 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRSIADLREARYRLEG-KDCYLFKIS 935
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY+Y F
Sbjct: 936 EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTNVSAGDELTYDYLFD 995
Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C + C +N
Sbjct: 996 PDEPDEFKVPCLCKAPNCRQFMN 1018
>gi|47219426|emb|CAG10790.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1776
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 69/146 (47%), Positives = 90/146 (61%), Gaps = 17/146 (11%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K K+ + RS IH GL IEA + VIEY G +IR ++D R+ Y+ GIG Y
Sbjct: 1646 KISKEAVGVYRSDIHGRGLFCKRNIEAGEMVIEYAGTVIRAVLTDKRQKFYDSKGIGC-Y 1704
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+FR+DD VVDAT +G ARFINHSC PNCY++VI+ I GEE+TY+
Sbjct: 1705 MFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVIN--------------IYRGEELTYD 1750
Query: 913 YKFPLE--EKKIPCYCGSKKCHGSLN 936
YKFP+E E K+ C CG+++C G LN
Sbjct: 1751 YKFPIEDDESKLHCNCGTRRCRGFLN 1776
>gi|356570970|ref|XP_003553655.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 93/147 (63%), Gaps = 4/147 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K +++ +S IH WGL A +E + V+EY GE +R I+D+RE +Y G Y
Sbjct: 840 KTENQKVCLGKSGIHGWGLFARRDLEEGEMVVEYRGEQLRRSITDLREAQYRSEG-KDCY 898
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
F++++ V+DAT +G IAR INHSC PNC+ +++ QK +I + AK +++AGEE+TY
Sbjct: 899 FFKINEEVVIDATDKGNIARLINHSCMPNCFARIVPSGDQKNRIVLIAKTNVSAGEELTY 958
Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
NY F E E+K+PC C + C G +N
Sbjct: 959 NYSFDDERDEEKVPCRCKAPNCSGFMN 985
>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
mutus]
Length = 4905
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 87/143 (60%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4764 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4822
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4823 MDSDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4882
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4883 DFEDDQHKIPCHCGAVNCRKWMN 4905
>gi|449455758|ref|XP_004145618.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Cucumis
sativus]
Length = 1073
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 91/143 (63%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE RY G YLF++
Sbjct: 932 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRTVADLREARYRLAG-KDCYLFKIS 990
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ AGEE+TY+Y F
Sbjct: 991 EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKANVPAGEELTYDYLFD 1050
Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C + C +N
Sbjct: 1051 PDEPDEFKVPCLCKAPNCRKFMN 1073
>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
Length = 4884
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 87/143 (60%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4743 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4801
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 4802 MDSDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4861
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4862 DFEDDQHKIPCHCGAVNCRKWMN 4884
>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
Length = 3396
Score = 130 bits (326), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 3255 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3313
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 3314 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 3373
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 3374 DFEDDQHKIPCHCGAVNCRKWMN 3396
>gi|226500304|ref|NP_001141695.1| uncharacterized protein LOC100273824 [Zea mays]
gi|194705578|gb|ACF86873.1| unknown [Zea mays]
Length = 418
Score = 130 bits (326), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 97/158 (61%), Gaps = 7/158 (4%)
Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
E + Q+K K L +S IH GL I V+EYVGE++ +++D RE Y+
Sbjct: 262 EYIHYKQLKGWK-HLVVYKSGIHGLGLYTSVFIPRGSMVVEYVGEIVGQRVADRREIEYQ 320
Query: 845 ----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
+ + Y F++D +++DAT++GGIARF+NHSC PNC K+ISV +KK+ +A+
Sbjct: 321 SGKRQQYKSACYFFKIDREHIIDATRKGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAE 380
Query: 901 RHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
RHI GEEITY+Y F E+ ++I C+C S+ C LN
Sbjct: 381 RHINPGEEITYDYHFNREDEGQRILCFCRSRYCRRYLN 418
>gi|5630077|gb|AAD45822.1|AC006017_2 similar to ALR; similar to AAC51735 (PID:g2358287) [Homo sapiens]
Length = 1813
Score = 130 bits (326), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 1672 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 1730
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 1731 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 1790
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 1791 DFEDDQHKIPCHCGAVNCRKWMN 1813
>gi|359489550|ref|XP_002278335.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
vinifera]
Length = 1073
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 62/148 (41%), Positives = 95/148 (64%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
+ R+ F +S IH WGL A I+ + VIEY GE +R ++D+RE +Y G Y
Sbjct: 927 RTENHRVCFGKSGIHGWGLFARRSIQEGEMVIEYRGEQVRRSVADLREAKYRLEG-KDCY 985
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + V+DAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY
Sbjct: 986 LFKISEEVVIDATNKGNIARLINHSCFPNCYARIMSVGDEESRIVLIAKINVSAGDELTY 1045
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E+ K+PC CG+ C +N
Sbjct: 1046 DYLFDPDERDESKVPCLCGAPNCRKFMN 1073
>gi|307105682|gb|EFN53930.1| hypothetical protein CHLNCDRAFT_25365, partial [Chlorella
variabilis]
Length = 188
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 64/149 (42%), Positives = 91/149 (61%), Gaps = 4/149 (2%)
Query: 789 ASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMG 847
A V +R+ +S IH WG A D VIEYVGEL+R +SD+RE R Y+ M
Sbjct: 10 AECVATLGQRVTIGKSGIHGWGAFAKRRHAEHDMVIEYVGELVRPSVSDLREARCYDDMV 69
Query: 848 IGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGE 907
+Y+FRL+ VDAT+ G +A +NHSC+PNC+++ I V + I+AK+ I E
Sbjct: 70 GAGTYVFRLNKALCVDATRAGNLAHMLNHSCDPNCFSRTIRV--VDHVIIFAKKDIEVAE 127
Query: 908 EITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
E+TY+Y+F EE ++PC CG+ C G +N
Sbjct: 128 ELTYDYRFCGEE-QLPCNCGAANCRGRVN 155
>gi|357463899|ref|XP_003602231.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355491279|gb|AES72482.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 1053
Score = 129 bits (325), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 92/143 (64%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE RY G YLF++
Sbjct: 912 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRSVADLREARYRAEG-KDCYLFKIS 970
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY+Y F
Sbjct: 971 EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTNVSAGDELTYDYLFD 1030
Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C + C +N
Sbjct: 1031 PDEPDEFKVPCMCKAPNCRKFMN 1053
>gi|449528994|ref|XP_004171486.1| PREDICTED: histone-lysine N-methyltransferase ATX5-like, partial
[Cucumis sativus]
Length = 588
Score = 129 bits (325), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 91/143 (63%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE RY G YLF++
Sbjct: 447 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRTVADLREARYRLAG-KDCYLFKIS 505
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ AGEE+TY+Y F
Sbjct: 506 EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKANVPAGEELTYDYLFD 565
Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C + C +N
Sbjct: 566 PDEPDEFKVPCLCKAPNCRKFMN 588
>gi|37360418|dbj|BAC98187.1| mKIAA1506 protein [Mus musculus]
Length = 1520
Score = 129 bits (325), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 1379 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 1437
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 1438 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 1497
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 1498 DFEDDQHKIPCHCGAVNCRKWMN 1520
>gi|218188776|gb|EEC71203.1| hypothetical protein OsI_03117 [Oryza sativa Indica Group]
Length = 1012
Score = 129 bits (325), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 63/148 (42%), Positives = 94/148 (63%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE +Y G Y
Sbjct: 866 KTEHTRVCFGRSGIHRWGLFARRGIQEGEMVLEYRGEQVRRSVADLREEQYRVQG-KDCY 924
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G +AR INHSC PNCY +++SV + +I + AK++++AGEE+TY
Sbjct: 925 LFKISEEVVVDATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKNVSAGEELTY 984
Query: 912 NYKFPLEE---KKIPCYCGSKKCHGSLN 936
+Y F +E +K+PC C + C +N
Sbjct: 985 DYLFDPDEADDRKVPCLCQTANCRKFMN 1012
>gi|297745446|emb|CBI40526.3| unnamed protein product [Vitis vinifera]
Length = 1003
Score = 129 bits (325), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 94/143 (65%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F +S IH WGL A I+ + VIEY GE +R ++D+RE +Y G YLF++
Sbjct: 862 RVCFGKSGIHGWGLFARRSIQEGEMVIEYRGEQVRRSVADLREAKYRLEG-KDCYLFKIS 920
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
+ V+DAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY+Y F
Sbjct: 921 EEVVIDATNKGNIARLINHSCFPNCYARIMSVGDEESRIVLIAKINVSAGDELTYDYLFD 980
Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
+E+ K+PC CG+ C +N
Sbjct: 981 PDERDESKVPCLCGAPNCRKFMN 1003
>gi|401396171|ref|XP_003879769.1| putative SET domain-containing protein [Neospora caninum Liverpool]
gi|325114176|emb|CBZ49734.1| putative SET domain-containing protein [Neospora caninum Liverpool]
Length = 5146
Score = 129 bits (324), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 116/230 (50%), Gaps = 18/230 (7%)
Query: 722 WEWHKWSLNASPAERARVRGAQYVHTKYLG-----PEVNASQWANGK---GLSARTNRVK 773
W+ +L A AER + R H LG PE Q + L ++ +
Sbjct: 4919 WQTADPALLAQAAERMQERNLSAFHGLLLGEDGRLPESAELQLKRNELFFALESQDRLTR 4978
Query: 774 LRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRL 833
NLL A + R K L+ +S++H WG+ A EPI ++FVIEY ++
Sbjct: 4979 TANLLEATAARSGARTPPFVVRTKLLKLGKSRVHGWGVFAAEPIYKDEFVIEYSAVVVSE 5038
Query: 834 KISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK 893
+++ RE +Y + GS+YLF+L + +VDAT+ G + RFINHSC PNC T+ +S
Sbjct: 5039 AMANFREWQYMQSMGGSTYLFKLRNSTIVDATQSGAVTRFINHSCRPNCQTRDLSGGSDD 5098
Query: 894 K-----IFIYAKRHIAAGEEITYNYKFPLEEKKI---PCYCGSKKCHGSL 935
+ I+A R IA GEE+ YNY L E + CYCG++ C G +
Sbjct: 5099 DNRHCHVGIFALRDIAIGEELFYNYS--LSEGALGHEACYCGAEGCKGVM 5146
>gi|222618974|gb|EEE55106.1| hypothetical protein OsJ_02868 [Oryza sativa Japonica Group]
Length = 1032
Score = 129 bits (324), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 63/148 (42%), Positives = 94/148 (63%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE +Y G Y
Sbjct: 886 KTEHTRVCFGRSGIHRWGLFARRGIQEGEMVLEYRGEQVRRSVADLREEQYRVQG-KDCY 944
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G +AR INHSC PNCY +++SV + +I + AK++++AGEE+TY
Sbjct: 945 LFKISEEVVVDATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKNVSAGEELTY 1004
Query: 912 NYKFPLEE---KKIPCYCGSKKCHGSLN 936
+Y F +E +K+PC C + C +N
Sbjct: 1005 DYLFDPDEADDRKVPCLCQTANCRKFMN 1032
>gi|30696333|ref|NP_200155.2| histone-lysine N-methyltransferase ATX5 [Arabidopsis thaliana]
gi|75244456|sp|Q8GZ42.1|ATX5_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX5; AltName:
Full=Protein SET DOMAIN GROUP 29; AltName:
Full=Trithorax-homolog protein 5; Short=TRX-homolog
protein 5
gi|26449542|dbj|BAC41897.1| putative trithorax 5 TX5 [Arabidopsis thaliana]
gi|29029074|gb|AAO64916.1| At5g53430 [Arabidopsis thaliana]
gi|332008970|gb|AED96353.1| histone-lysine N-methyltransferase ATX5 [Arabidopsis thaliana]
Length = 1043
Score = 129 bits (324), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 65/144 (45%), Positives = 93/144 (64%), Gaps = 5/144 (3%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
+R+ F RS IH WGL A I+ + V+EY GE +R I+D+RE RY + G YLF++
Sbjct: 901 ERVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRREG-KDCYLFKI 959
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF 915
+ VVDAT++G IAR INHSC PNCY +++SV + + +I + AK +A+ EE+TY+Y F
Sbjct: 960 SEEVVVDATEKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDYLF 1019
Query: 916 PLEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C S C +N
Sbjct: 1020 DPDEPDEFKVPCLCKSPNCRKFMN 1043
>gi|224132822|ref|XP_002321418.1| SET domain protein [Populus trichocarpa]
gi|222868414|gb|EEF05545.1| SET domain protein [Populus trichocarpa]
Length = 1070
Score = 129 bits (324), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 91/143 (63%), Gaps = 5/143 (3%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F RS IH WGL A I+ + V+EY GE +R I+D+RE RY G YLF++
Sbjct: 929 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREVRYRLEG-KDCYLFKIS 987
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
+ VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ AG+E+TY+Y F
Sbjct: 988 EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDNESRIVLIAKTNVPAGDELTYDYLFD 1047
Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C + C +N
Sbjct: 1048 PDEPDEFKVPCLCKAPNCRKFMN 1070
>gi|8843772|dbj|BAA97320.1| unnamed protein product [Arabidopsis thaliana]
Length = 1040
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/144 (45%), Positives = 93/144 (64%), Gaps = 5/144 (3%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
+R+ F RS IH WGL A I+ + V+EY GE +R I+D+RE RY + G YLF++
Sbjct: 898 ERVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRREG-KDCYLFKI 956
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF 915
+ VVDAT++G IAR INHSC PNCY +++SV + + +I + AK +A+ EE+TY+Y F
Sbjct: 957 SEEVVVDATEKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDYLF 1016
Query: 916 PLEEK---KIPCYCGSKKCHGSLN 936
+E K+PC C S C +N
Sbjct: 1017 DPDEPDEFKVPCLCKSPNCRKFMN 1040
>gi|358333116|dbj|GAA32467.2| histone-lysine N-methyltransferase MLL3, partial [Clonorchis
sinensis]
Length = 1763
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/136 (46%), Positives = 84/136 (61%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS+I GL A +E + VIEY+GELIRL++++ RE YE G Y+FRL+D V+
Sbjct: 1629 RSRIQGLGLFAARNLEPQTMVIEYIGELIRLELANKREKDYEAHNRGI-YMFRLNDDTVI 1687
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
DAT GG+AR+INHSC PNC+ + ++ I I R I GEE+ Y+Y F LE+
Sbjct: 1688 DATVCGGLARYINHSCQPNCFAEFLNFGDHSHIVIITNRLIEKGEELCYDYNFDLEDGGS 1747
Query: 921 KIPCYCGSKKCHGSLN 936
KIPC C S C +N
Sbjct: 1748 KIPCLCRSTNCRKWMN 1763
>gi|326519903|dbj|BAK03876.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1025
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 64/148 (43%), Positives = 92/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE +Y G Y
Sbjct: 879 KTEHSRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREAQYRVQG-KDCY 937
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G +AR INHSC PN Y +++SV Q +I + AKR+++AGEE+TY
Sbjct: 938 LFKISEEVVVDATDKGNVARLINHSCTPNFYARIMSVGHDQSRIVLIAKRNVSAGEELTY 997
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E K+PC C + C +N
Sbjct: 998 DYLFDPDEADDCKVPCLCQTADCRKFMN 1025
>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Ovis aries]
Length = 4922
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 86/143 (60%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4781 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4839
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D +V+DAT GG AR+INHSC PNC +V++ KI I + R I GEE+ Y+YKF
Sbjct: 4840 MDSDHVIDATLTGGPARYINHSCAPNCVAEVVAFRAGHKIIISSSRRIQKGEELCYDYKF 4899
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4900 DFEDDQHKIPCHCGAVNCRKWMN 4922
>gi|356573885|ref|XP_003555086.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
Length = 1003
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/148 (42%), Positives = 93/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
+ +R+ F RS IH WGL A + I+ + V+EY GE +R I+D+RE RY G Y
Sbjct: 857 RTENERVCFGRSGIHGWGLFARQNIQEGEMVLEYRGEQVRRSIADLREARYRLEG-KDCY 915
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ G+E+TY
Sbjct: 916 LFKISEEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTDVSTGDELTY 975
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E K+PC C + C +N
Sbjct: 976 DYLFDPDEPDEFKVPCLCKASNCRKFMN 1003
>gi|414875701|tpg|DAA52832.1| TPA: hypothetical protein ZEAMMB73_838843 [Zea mays]
Length = 971
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 111/193 (57%), Gaps = 11/193 (5%)
Query: 748 KYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIH 807
+ +GP ++ + +G L+A +R ++ E L+ K R+ RS IH
Sbjct: 786 RIMGPRHHSQDFIDG--LNACMDRKDDQSFSTFKERLRYLQ----KTENLRVSCGRSGIH 839
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
WGL A I+ VIEY GE +R ++D+RE +Y + YLF++ + V+DAT +
Sbjct: 840 GWGLFAARNIQEGQMVIEYRGEQVRRCVADLREAQYHREK-KDCYLFKISEDVVIDATDK 898
Query: 868 GGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIP 923
G IAR INHSC PNCY ++++V + + +I + AKR ++AGEE+TY+Y F +E ++P
Sbjct: 899 GNIARLINHSCMPNCYARIMTVSDDRSEIILIAKRDVSAGEELTYDYLFDPDESEDCRVP 958
Query: 924 CYCGSKKCHGSLN 936
C C + C G +N
Sbjct: 959 CLCKAPNCRGYMN 971
>gi|95007060|emb|CAJ20276.1| SET-domain protein, putative [Toxoplasma gondii RH]
Length = 4382
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/264 (32%), Positives = 130/264 (49%), Gaps = 20/264 (7%)
Query: 688 ASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKW--SLNASPAERARVRGAQYV 745
A+ + K+ +S S P G + S + +W +L + AER + R
Sbjct: 4119 ATLNLGKKRDTSSPGENSRFAPPLVGSSSSRLLAAQWQAADPALLSLAAERMQERNLSAF 4178
Query: 746 HTKYLG-----PEVNASQWANGK---GLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
H LG PE Q + L +++ + NLL A +A R K
Sbjct: 4179 HGLLLGEDGRLPENAELQLKRNELFFALESQSRLSRTANLLEATVLRSGARAPSFVVRTK 4238
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
L+ +S++H WG+ A EPI ++FVIEY ++ +++ RE +Y + GS+YLF+L
Sbjct: 4239 LLKLGKSRVHGWGVFAAEPIYKDEFVIEYSAVVVSEAMANFREWQYMQSMGGSTYLFKLK 4298
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK-----IFIYAKRHIAAGEEITYN 912
+ +VDAT+ G + RFINHSC PNC T+ +S + I+A R IA GEE+ YN
Sbjct: 4299 NSAIVDATQSGAVTRFINHSCRPNCQTRDLSGGSDDDSRHCHVGIFALRDIAIGEELFYN 4358
Query: 913 YKFPLEEKKI---PCYCGSKKCHG 933
Y L E + CYCG++ C G
Sbjct: 4359 YS--LSEGALGHEACYCGAEGCKG 4380
>gi|297484769|ref|XP_002694621.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Bos taurus]
gi|296478627|tpg|DAA20742.1| TPA: SET domain containing 1B [Bos taurus]
Length = 1980
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/208 (42%), Positives = 118/208 (56%), Gaps = 24/208 (11%)
Query: 747 TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
T G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +
Sbjct: 1669 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1726
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IHDWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++D
Sbjct: 1727 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1786
Query: 864 ATKRGGIARFINHSCN---------PNCYTKVISVEG---QKKIFIYAKRHIAAGE---- 907
ATK G ARFINHSCN P + + G F R A E
Sbjct: 1787 ATKCGNFARFINHSCNVSVRLGATPPGACREDLEAAGTGASMASFPSMGRGRGAQEGGRS 1846
Query: 908 --EITYNYKFPLEEKKIPCYCGSKKCHG 933
+YKFP+E+ KIP GS+ C G
Sbjct: 1847 XXXXXXDYKFPIEDVKIPA-LGSENCRG 1873
>gi|324499811|gb|ADY39929.1| Histone-lysine N-methyltransferase trr [Ascaris suum]
Length = 2347
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/146 (44%), Positives = 92/146 (63%), Gaps = 6/146 (4%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE +IEY GE+IR ++ ++RE +YE G Y+FR
Sbjct: 2203 KNNVYLARSRIQGLGLYAKRDIEMNAMIIEYKGEVIRSEVGEMREKKYEAQNRGV-YMFR 2261
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISV---EGQKKIFIYAKRHIAAGEEITYN 912
+D+ +VDAT GG AR+INHSC+PNC T++++ KKI I A R I+AGEE+TY+
Sbjct: 2262 IDEERLVDATMAGGPARYINHSCDPNCSTRIVASGPNGDDKKIIIIANRPISAGEELTYD 2321
Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
Y+F +E+ KI C CG+ C +N
Sbjct: 2322 YQFDIEDTADKIACLCGAPNCQKWMN 2347
>gi|358349267|ref|XP_003638660.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
gi|355504595|gb|AES85798.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
Length = 1149
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 62/148 (41%), Positives = 92/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F +S IH WGL A ++ D V+EY GE +R ++D+RE +Y G Y
Sbjct: 1003 KTENLRICFGKSGIHGWGLFARRDLQEGDMVVEYRGEQVRRSVADLREAKYRSEG-KDCY 1061
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITY 911
LF++ + V+DAT +G IAR INHSC PNCY +++S+ Q +I + AK +++AG+E+TY
Sbjct: 1062 LFKISEEVVIDATHKGNIARLINHSCMPNCYARILSLGNQGNRIVLIAKTNVSAGQELTY 1121
Query: 912 NYKF---PLEEKKIPCYCGSKKCHGSLN 936
+Y F EE K+PC C + C +N
Sbjct: 1122 DYLFDPDEPEELKVPCLCRASNCRKFMN 1149
>gi|237844405|ref|XP_002371500.1| SET domain-containing protein [Toxoplasma gondii ME49]
gi|211969164|gb|EEB04360.1| SET domain-containing protein [Toxoplasma gondii ME49]
gi|221481217|gb|EEE19618.1| SET domain-containing protein, putative [Toxoplasma gondii GT1]
gi|221501765|gb|EEE27525.1| SET domain-containing protein, putative [Toxoplasma gondii VEG]
Length = 4827
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/264 (32%), Positives = 130/264 (49%), Gaps = 20/264 (7%)
Query: 688 ASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKW--SLNASPAERARVRGAQYV 745
A+ + K+ +S S P G + S + +W +L + AER + R
Sbjct: 4564 ATLNLGKKRDTSSPGENSRFAPPLVGSSSSRLLAAQWQAADPALLSLAAERMQERNLSAF 4623
Query: 746 HTKYLG-----PEVNASQWANGK---GLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
H LG PE Q + L +++ + NLL A +A R K
Sbjct: 4624 HGLLLGEDGRLPENAELQLKRNELFFALESQSRLSRTANLLEATVLRSGARAPSFVVRTK 4683
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
L+ +S++H WG+ A EPI ++FVIEY ++ +++ RE +Y + GS+YLF+L
Sbjct: 4684 LLKLGKSRVHGWGVFAAEPIYKDEFVIEYSAVVVSEAMANFREWQYMQSMGGSTYLFKLK 4743
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK-----IFIYAKRHIAAGEEITYN 912
+ +VDAT+ G + RFINHSC PNC T+ +S + I+A R IA GEE+ YN
Sbjct: 4744 NSAIVDATQSGAVTRFINHSCRPNCQTRDLSGGSDDDSRHCHVGIFALRDIAIGEELFYN 4803
Query: 913 YKFPLEEKKI---PCYCGSKKCHG 933
Y L E + CYCG++ C G
Sbjct: 4804 YS--LSEGALGHEACYCGAEGCKG 4825
>gi|357135761|ref|XP_003569477.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Brachypodium
distachyon]
Length = 1037
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/148 (43%), Positives = 91/148 (61%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE +Y G Y
Sbjct: 891 KTEHSRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREAQYRVQG-KDCY 949
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G +AR INHSC PN Y +++SV Q +I + AKR++ AGEE+TY
Sbjct: 950 LFKISEEVVVDATDKGNVARLINHSCTPNFYARIMSVGHDQSRIVLIAKRNVRAGEELTY 1009
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E K+PC C + C +N
Sbjct: 1010 DYLFDPDEADDCKVPCLCQTADCRKFMN 1037
>gi|21739477|emb|CAD38780.1| hypothetical protein [Homo sapiens]
Length = 1033
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 892 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 950
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 951 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 1010
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 1011 DFEDDQHKIPCHCGAVNCRKWMN 1033
>gi|242053849|ref|XP_002456070.1| hypothetical protein SORBIDRAFT_03g029850 [Sorghum bicolor]
gi|241928045|gb|EES01190.1| hypothetical protein SORBIDRAFT_03g029850 [Sorghum bicolor]
Length = 1051
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 61/148 (41%), Positives = 93/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE +Y G Y
Sbjct: 905 KTEHGRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREEKYRVQG-KDCY 963
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G +AR INHSC PNCY +++SV + +I + A++++ AG+E+TY
Sbjct: 964 LFKISEEVVVDATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIARKNVCAGDELTY 1023
Query: 912 NYKF---PLEEKKIPCYCGSKKCHGSLN 936
+Y F +E+K+PC C + C +N
Sbjct: 1024 DYLFDPDEADERKVPCLCQTANCRKFMN 1051
>gi|293333544|ref|NP_001169542.1| uncharacterized protein LOC100383419 [Zea mays]
gi|193735372|gb|ACF20188.1| histone methylase [Zea mays]
gi|224030013|gb|ACN34082.1| unknown [Zea mays]
Length = 641
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 62/148 (41%), Positives = 92/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F RS IH WGL A I+ + V+EY GE +R ++D+RE +Y G Y
Sbjct: 495 KTEHGRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREEKYRVQG-KDCY 553
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT +G +AR INHSC PNCY +++SV + +I + AK+ + AG+E+TY
Sbjct: 554 LFKISEEVVVDATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKKVCAGDELTY 613
Query: 912 NYKF---PLEEKKIPCYCGSKKCHGSLN 936
+Y F +E+K+PC C + C +N
Sbjct: 614 DYLFDPDEADERKVPCLCQTANCRKFMN 641
>gi|300176194|emb|CBK23505.2| unnamed protein product [Blastocystis hominis]
Length = 400
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 96/171 (56%), Gaps = 10/171 (5%)
Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
+ L E +A Q +A ++ + +S IH WGL A VIE+ GE++ +
Sbjct: 230 DTLGTTELLRQYRALQCRAESEQPKVIKSPIHGWGLFAKTAFRRGMMVIEFAGEVVGQDV 289
Query: 836 SDIRECRYEK----------MGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
++ RE + + +G Y FR+D ++VDAT +G +RFINH C+PNC +
Sbjct: 290 ANAREAVWAQGAAVRRRYNRRNVGGCYFFRMDAEHIVDATVKGNESRFINHCCDPNCDAR 349
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
V+++ + KI +A R I GEEITY+YKF E++++ C CG+K C G +N
Sbjct: 350 VVNINNENKIIFFANRDINPGEEITYDYKFQYEDRELVCLCGAKNCVGRMN 400
>gi|74193661|dbj|BAE22782.1| unnamed protein product [Mus musculus]
Length = 865
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 724 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 782
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 783 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 842
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 843 DFEDDQHKIPCHCGAVNCRKWMN 865
>gi|113470963|gb|ABI34883.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 96
Score = 127 bits (318), Expect = 4e-26, Method: Composition-based stats.
Identities = 59/96 (61%), Positives = 72/96 (75%), Gaps = 3/96 (3%)
Query: 843 YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRH 902
Y+ GIG Y+FR+DD VVDAT G ARFINHSC PNCY++VI+V+GQK I I+A R
Sbjct: 2 YDDKGIGC-YMFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVINVDGQKHIVIFATRK 60
Query: 903 IAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
I GEE+TY+YKFP+EE K+PC CG+KKC LN
Sbjct: 61 IYKGEELTYDYKFPIEEPGNKLPCNCGAKKCRKFLN 96
>gi|84997431|ref|XP_953437.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304433|emb|CAI76812.1| hypothetical protein, conserved [Theileria annulata]
Length = 3595
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 67/159 (42%), Positives = 97/159 (61%), Gaps = 3/159 (1%)
Query: 765 LSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
L R NR + R + + + + RL + S IH GL A E I A + V+
Sbjct: 3434 LETRNNRSRSRKQYDLSNTSTQYRYLLSIPPENRLDVKPSVIHGLGLFATENIPAGEPVV 3493
Query: 825 EYVGELIRLKISDIRECRYE--KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
EYVGELIR + D RE Y + G GS Y+FRLDD Y+VDAT++G ++RFINHSC+PNC
Sbjct: 3494 EYVGELIRDVVGDQREEFYSEGQGGDGSCYMFRLDDQYIVDATRKGNMSRFINHSCDPNC 3553
Query: 883 YTKVISVE-GQKKIFIYAKRHIAAGEEITYNYKFPLEEK 920
++I+ E G K I ++AK ++ G+E+TY+Y+ ++ K
Sbjct: 3554 LCRIITCENGMKHIVVFAKSELSPGDEVTYDYQVTIQFK 3592
>gi|74218832|dbj|BAE37820.1| unnamed protein product [Mus musculus]
Length = 342
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 201 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 259
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 260 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 319
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 320 DFEDDQHKIPCHCGAVNCRKWMN 342
>gi|343960312|dbj|BAK64010.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 homolog [Pan
troglodytes]
Length = 411
Score = 126 bits (316), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 270 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 328
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 329 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 388
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 389 DFEDDQHKIPCHCGAVNCRKWMN 411
>gi|356503907|ref|XP_003520741.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 125 bits (315), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 92/147 (62%), Gaps = 4/147 (2%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K +++ +S IH WGL A +E + V+EY GE +R I+D+RE +Y G Y
Sbjct: 840 KTENQKVCLGKSGIHGWGLFARRDLEEGEMVVEYRGEQLRRSITDLREAQYRSEG-KDCY 898
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
F++ + V+DAT +G IAR INHSC PNC+ +++ + Q+ +I + AK +++AGEE+TY
Sbjct: 899 FFKISEEVVIDATDKGNIARLINHSCMPNCFARIVPLSDQENRIVLIAKTNVSAGEELTY 958
Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
NY F E E+K+ C C + C G +N
Sbjct: 959 NYSFDDERDEEKVVCRCKAPNCSGFMN 985
>gi|356522510|ref|XP_003529889.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 989
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/148 (42%), Positives = 93/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
+ K R+ F +S IH WGL A I+ + V+EY G +R ++D+RE +Y G Y
Sbjct: 843 ETEKFRVCFGKSGIHGWGLFARRDIQEGEMVVEYRGVHVRRSVADLREEKYRSEG-KDCY 901
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITY 911
LF++ + VVDAT RG IAR INHSC PNCY +++S+ Q +I + AK +++AGEE+TY
Sbjct: 902 LFKISEEVVVDATNRGNIARLINHSCMPNCYARIMSLGDQGSRIVLIAKTNVSAGEELTY 961
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E+ K+PC C + C +N
Sbjct: 962 DYLFDPDERDELKVPCLCKAPNCRRFMN 989
>gi|339244151|ref|XP_003378001.1| putative SET domain protein [Trichinella spiralis]
gi|316973127|gb|EFV56754.1| putative SET domain protein [Trichinella spiralis]
Length = 1416
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 89/143 (62%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RSKI GL A +E VIEYVGE+IR ++++ RE Y+K G Y+FR
Sbjct: 1275 KSNVYLARSKIQGLGLFANRDVEMNAMVIEYVGEVIRNEVAERREKSYQKRNRGV-YMFR 1333
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
LD +V+DAT GG AR+INHSC+PNC + I + + +I I + R I GEE+TY+Y+F
Sbjct: 1334 LDSDHVIDATVAGGPARYINHSCDPNCIAERIDFDRESRIVIMSCRPICKGEELTYDYQF 1393
Query: 916 PLEEK--KIPCYCGSKKCHGSLN 936
E++ K+PC C + C +N
Sbjct: 1394 DFEDELNKLPCLCRAPNCRKWMN 1416
>gi|145332921|ref|NP_001078326.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
gi|332646730|gb|AEE80251.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
Length = 982
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 110/190 (57%), Gaps = 19/190 (10%)
Query: 765 LSARTNRVK--LRNLLAAAEGAELLKASQVKARKKRLR-----------FQRSKIHDWGL 811
L A ++R+K + L+A E K + + ++RL+ F +S IH WGL
Sbjct: 794 LEAISHRLKGPSHHSLSAIENLNSFKEADFTSFRERLKHLQRTENFRVCFGKSGIHGWGL 853
Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
A + I+ + +IEY G +R ++D+RE Y G YLF++ + V+DAT G IA
Sbjct: 854 FARKSIQEGEMIIEYRGVKVRRSVADLREANYRSQG-KDCYLFKISEEIVIDATDSGNIA 912
Query: 872 RFINHSCNPNCYTKVISV-EGQ-KKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYC 926
R INHSC PNCY +++S+ +G+ +I + AK ++AAGEE+TY+Y F + EE K+PC C
Sbjct: 913 RLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDYLFEVDESEEIKVPCLC 972
Query: 927 GSKKCHGSLN 936
+ C +N
Sbjct: 973 KAPNCRKFMN 982
>gi|74145466|dbj|BAE36172.1| unnamed protein product [Mus musculus]
Length = 563
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 422 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 480
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 481 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 540
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 541 DFEDDQHKIPCHCGAVNCRKWMN 563
>gi|350595097|ref|XP_003134587.3| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Sus
scrofa]
Length = 3482
Score = 125 bits (313), Expect = 1e-25, Method: Composition-based stats.
Identities = 66/155 (42%), Positives = 93/155 (60%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ + K + RS+I GL A IE VIEY+G +IR ++++ +E Y
Sbjct: 3329 KSSQYRKMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLY 3388
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FR+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I
Sbjct: 3389 ESQNRGV-YMFRMDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRI 3447
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
GEE+ Y+YKF E + KIPC+CG+ C +N
Sbjct: 3448 QKGEELCYDYKFDFEDDQHKIPCHCGAVNCRKWMN 3482
>gi|403223612|dbj|BAM41742.1| predicted protein [Theileria orientalis strain Shintoku]
Length = 4555
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/130 (49%), Positives = 87/130 (66%), Gaps = 3/130 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE--KMGIGSSYL 853
++RL + S IH GL A PI A D V+EYVGELIR I D RE Y + G GS Y+
Sbjct: 4425 EQRLDVRPSLIHGLGLFASVPIRAGDPVVEYVGELIRDVIGDQREEIYSEGQGGDGSCYM 4484
Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYN 912
FRLDD +VDAT++G ++RFINHSC+PNC ++I+ E G K I ++AK + G+EITY+
Sbjct: 4485 FRLDDQLIVDATRKGNMSRFINHSCDPNCLCRIITCENGLKHIVVFAKSDLGPGDEITYD 4544
Query: 913 YKFPLEEKKI 922
Y+ + K+
Sbjct: 4545 YQVTERDDKL 4554
>gi|145339751|ref|NP_191733.3| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
gi|259016183|sp|Q9M364.2|ATX3_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX3; AltName:
Full=Protein SET DOMAIN GROUP 14; AltName:
Full=Trithorax-homolog protein 3; Short=TRX-homolog
protein 3
gi|225898735|dbj|BAH30498.1| hypothetical protein [Arabidopsis thaliana]
gi|332646729|gb|AEE80250.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
Length = 1018
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 92/144 (63%), Gaps = 6/144 (4%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F +S IH WGL A + I+ + +IEY G +R ++D+RE Y G YLF++
Sbjct: 876 RVCFGKSGIHGWGLFARKSIQEGEMIIEYRGVKVRRSVADLREANYRSQG-KDCYLFKIS 934
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQ-KKIFIYAKRHIAAGEEITYNYKF 915
+ V+DAT G IAR INHSC PNCY +++S+ +G+ +I + AK ++AAGEE+TY+Y F
Sbjct: 935 EEIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDYLF 994
Query: 916 PL---EEKKIPCYCGSKKCHGSLN 936
+ EE K+PC C + C +N
Sbjct: 995 EVDESEEIKVPCLCKAPNCRKFMN 1018
>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 4032
Score = 125 bits (313), Expect = 1e-25, Method: Composition-based stats.
Identities = 66/155 (42%), Positives = 92/155 (59%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ + K + RS+I GL A IE VIEY+G +IR ++++ +E Y
Sbjct: 3879 KSSQYRKMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLY 3938
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FR+D +V+DAT GG AR+INHSC PNC +V++ E KI I + R I
Sbjct: 3939 ESQNRGV-YMFRMDSDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRI 3997
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
GEE+ Y+YKF E + KIPC+CG+ C +N
Sbjct: 3998 QKGEELCYDYKFDFEDDQHKIPCHCGAVNCRKWMN 4032
>gi|302832672|ref|XP_002947900.1| histone H3 Lys 4 methyltransferase [Volvox carteri f. nagariensis]
gi|300266702|gb|EFJ50888.1| histone H3 Lys 4 methyltransferase [Volvox carteri f. nagariensis]
Length = 2171
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 93/152 (61%), Gaps = 5/152 (3%)
Query: 789 ASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGI 848
A+ +++ R+ F +S IH WG+ A I + V E+ GE +R ++D+RE RY+ G
Sbjct: 2021 AAALRSEPSRITFGKSGIHGWGIFARTDIPQDAIVTEFRGEAVRPVVADLRERRYKAQGR 2080
Query: 849 GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY-AKRHIAAGE 907
+LF ++ V+D+T G RF NHSC+P+ YTKV+ EG + ++ A+ I AG+
Sbjct: 2081 -DCFLFHMNGEVVLDSTHLGHYGRFTNHSCSPSLYTKVLEFEGGRVRLVFCARTDIRAGQ 2139
Query: 908 EITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
E+T++Y+F E E K+PC CG+ C G+LN
Sbjct: 2140 ELTFDYRFKEEDHPEAKVPCRCGAPNCRGTLN 2171
>gi|110742931|dbj|BAE99361.1| trithorax 3 [Arabidopsis thaliana]
Length = 1018
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 92/144 (63%), Gaps = 6/144 (4%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F +S IH WGL A + I+ + +IEY G +R ++D+RE Y G YLF++
Sbjct: 876 RVCFGKSGIHGWGLFARKSIQEGEMIIEYRGVKVRRSVADLREANYRSQG-KDCYLFKIS 934
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQ-KKIFIYAKRHIAAGEEITYNYKF 915
+ V+DAT G IAR INHSC PNCY +++S+ +G+ +I + AK ++AAGEE+TY+Y F
Sbjct: 935 EEIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDYLF 994
Query: 916 PLEEK---KIPCYCGSKKCHGSLN 936
++E K+PC C + C +N
Sbjct: 995 EVDESGEIKVPCLCKAPNCRKFMN 1018
>gi|71029598|ref|XP_764442.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351396|gb|EAN32159.1| SET-domain protein, putative [Theileria parva]
Length = 175
Score = 124 bits (312), Expect = 2e-25, Method: Composition-based stats.
Identities = 66/170 (38%), Positives = 102/170 (60%), Gaps = 3/170 (1%)
Query: 748 KYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIH 807
++L ++ + + + + R NR + R + + + + RL + S IH
Sbjct: 3 EHLSKDLREREQTDPELIDVRNNRSRTRKQYDLSNSSTQYRYLLSIPPESRLDVKPSVIH 62
Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE--KMGIGSSYLFRLDDGYVVDAT 865
GL A E I A + V+EYVGELIR + D RE Y + G GS Y+FRLDD Y+VDAT
Sbjct: 63 GLGLFATESIPAGEPVVEYVGELIRDVVGDQREELYSEGQGGDGSCYMFRLDDQYIVDAT 122
Query: 866 KRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYK 914
++G ++RFINHSC+PNC ++I+ E G K I ++AK ++ G+E+TY+Y+
Sbjct: 123 RKGNMSRFINHSCDPNCLCRIITCENGMKHIVVFAKSELSPGDEVTYDYQ 172
>gi|223974115|gb|ACN31245.1| unknown [Zea mays]
Length = 214
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 97/152 (63%), Gaps = 5/152 (3%)
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
GLS + + N+ + E ++KA+ +KRL F +SKIH +G+ A +A D +
Sbjct: 64 GLSDAVQQEAIGNVSSMVEKYTIVKATF----RKRLTFGKSKIHGYGVFAKVAHKAGDMM 119
Query: 824 IEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
+EY+GE++R ISD RE R Y + +Y+FR+DD V+DAT+ G IAR INHS PNC
Sbjct: 120 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSREPNC 179
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
Y++ I++ G + I I+AKR I EE+TY+Y+
Sbjct: 180 YSRAITILGDEHIIIFAKRDIDPWEELTYDYR 211
>gi|255076429|ref|XP_002501889.1| predicted protein [Micromonas sp. RCC299]
gi|226517153|gb|ACO63147.1| predicted protein [Micromonas sp. RCC299]
Length = 140
Score = 124 bits (311), Expect = 3e-25, Method: Composition-based stats.
Identities = 61/140 (43%), Positives = 88/140 (62%), Gaps = 6/140 (4%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK--MGIGSSYLFRLD--D 858
RS+IH G+ P+ A+ +EYVGE+I +D RE ++ + S+Y+F LD +
Sbjct: 1 RSRIHGLGVYCTVPLPAKCAALEYVGEIISDANADAREAAEKRSILARASTYMFALDPVN 60
Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-- 916
G VVDA ++G RF+NH C PNC T+V+ V G+KKI ++ R +AAGEE+TY+Y F
Sbjct: 61 GTVVDAARKGNATRFVNHCCEPNCVTRVVVVGGEKKILLFTSRDVAAGEELTYDYMFKPD 120
Query: 917 LEEKKIPCYCGSKKCHGSLN 936
L E + PC CG+ C G +N
Sbjct: 121 LPENEAPCDCGADTCRGIIN 140
>gi|255564717|ref|XP_002523353.1| trithorax, putative [Ricinus communis]
gi|223537441|gb|EEF39069.1| trithorax, putative [Ricinus communis]
Length = 1057
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/148 (42%), Positives = 92/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ F +S IH WGL A I+ + VIEY GE +R I+D+RE RY G Y
Sbjct: 911 KTECHRVCFGKSGIHGWGLFARRNIQEGEMVIEYRGEQVRRSIADLRESRYRLEG-KDCY 969
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
LF++ + V+DAT +G IAR INHSC PNCY +++SV + + +I + AK +++A +E+TY
Sbjct: 970 LFKISEEVVIDATNKGNIARLINHSCMPNCYARIMSVGDVENRIVLIAKTNVSAADELTY 1029
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E K+PC C + C +N
Sbjct: 1030 DYLFDPDEHDDLKVPCLCRAPNCRKFMN 1057
>gi|122934986|gb|ABM68250.1| MLL [Lagothrix lagotricha]
gi|122938206|gb|ABM68961.1| MLL [Lemur catta]
gi|124013641|gb|ABM88082.1| MLL [Macaca nemestrina]
Length = 91
Score = 123 bits (309), Expect = 4e-25, Method: Composition-based stats.
Identities = 56/92 (60%), Positives = 70/92 (76%), Gaps = 3/92 (3%)
Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
GIG Y+FR+DD VVDAT G ARFINHSC PNCY++VI+++GQK I I+A R I G
Sbjct: 1 GIGC-YMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRG 59
Query: 907 EEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
EE+TY+YKFP+E+ K+PC CG+KKC LN
Sbjct: 60 EELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 91
>gi|297821052|ref|XP_002878409.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324247|gb|EFH54668.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1002
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 91/144 (63%), Gaps = 6/144 (4%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
R+ F +S IH WGL A I+ + +IEY G +R ++D+RE Y G YLF++
Sbjct: 860 RVCFGKSGIHGWGLFARNSIQEGEMIIEYRGVKVRRSVADLREANYRSQG-KDCYLFKIS 918
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQ-KKIFIYAKRHIAAGEEITYNYKF 915
+ V+DAT G IAR INHSC PNCY +++S+ +G+ +I + AK +++AGEE+TY+Y F
Sbjct: 919 EEIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVSAGEELTYDYLF 978
Query: 916 PL---EEKKIPCYCGSKKCHGSLN 936
+ EE K+PC C + C +N
Sbjct: 979 EVDESEEIKVPCLCKAPNCRKFMN 1002
>gi|357142828|ref|XP_003572707.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASHR3-like [Brachypodium distachyon]
Length = 466
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 89/141 (63%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+ K+++ +S+ WG +ALEP+E DF+IEYVGE+I + R ++ G + Y+
Sbjct: 228 KDKKIKIVKSEGCGWGAIALEPLEKCDFIIEYVGEVIDDATCEQRLWEMKRRGDKNFYMC 287
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ Y +DAT +G +RF+NHSC+PNC + V+G+ ++ ++A R I GE +TY+Y+
Sbjct: 288 EISKDYTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRFIEVGEPLTYDYR 347
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F +K+ C+CG+K C G L
Sbjct: 348 FVHFGEKVKCHCGAKSCQGYL 368
>gi|449275932|gb|EMC84661.1| Histone-lysine N-methyltransferase HRX, partial [Columba livia]
Length = 87
Score = 122 bits (307), Expect = 7e-25, Method: Composition-based stats.
Identities = 54/87 (62%), Positives = 65/87 (74%), Gaps = 2/87 (2%)
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
Y+FR+DD VVDAT G ARFINHSC PNCY++VI VEG K I I+A R I GEE+TY
Sbjct: 1 YMFRIDDAEVVDATMHGSAARFINHSCEPNCYSRVIQVEGHKHIVIFALRRILRGEELTY 60
Query: 912 NYKFPLEE--KKIPCYCGSKKCHGSLN 936
+YKFP+EE K+PC CG+K+C LN
Sbjct: 61 DYKFPIEEPAAKLPCNCGAKRCRRFLN 87
>gi|384252564|gb|EIE26040.1| hypothetical protein COCSUDRAFT_13072, partial [Coccomyxa
subellipsoidea C-169]
Length = 859
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 88/149 (59%), Gaps = 10/149 (6%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFRL 856
RL +S IH WG P D ++EY+GEL+R ++D RE R Y+++ +Y+F L
Sbjct: 681 RLTCGKSAIHGWGAFTKVPAAESDMLVEYMGELLRRPVADARERRTYDRLVGAGTYVFGL 740
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG------QKKIFIYAKRHIAAGEEIT 910
D VVDAT++G +A +NHSC PN Y++ +SV + I+AKR IAAGEE+T
Sbjct: 741 SDELVVDATRKGNMAHLLNHSCEPNSYSRTVSVRCPDTGTLSDHVVIFAKRAIAAGEELT 800
Query: 911 YNYKFPLEEK---KIPCYCGSKKCHGSLN 936
Y+Y+ K+PC CG+ C G +N
Sbjct: 801 YDYRRAPSHSTPHKLPCNCGAATCRGFVN 829
>gi|443696187|gb|ELT96958.1| hypothetical protein CAPTEDRAFT_153177 [Capitella teleta]
Length = 844
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 67/138 (48%), Positives = 87/138 (63%), Gaps = 3/138 (2%)
Query: 801 FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGY 860
+RS + GL A +E VIEY+GELIR ++++ RE Y+ G Y+FR+DD
Sbjct: 708 LRRSNVQGLGLFASRDLERHTMVIEYIGELIRSEVAEARERVYDSQNRGV-YMFRIDDNT 766
Query: 861 VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE-- 918
VVDAT GG AR+INHSC PNC +V+ E KI I A+R IA GEE+TY+YKF E
Sbjct: 767 VVDATMTGGPARYINHSCAPNCVAEVVPFEKDSKIIIIARRRIARGEELTYDYKFDFEDD 826
Query: 919 EKKIPCYCGSKKCHGSLN 936
+ KIPC CG+ C +N
Sbjct: 827 QHKIPCLCGAPNCRKWMN 844
>gi|356560272|ref|XP_003548417.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 954
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 91/145 (62%), Gaps = 5/145 (3%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K R+ F +S IH WGL A I+ + V+EY G +R ++D+RE +Y G YLF+
Sbjct: 811 KFRVCFGKSGIHGWGLFARRDIQEGEMVVEYRGVHVRRSVTDLREEKYRSEG-KDCYLFK 869
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYK 914
+ + VVDAT G IAR INHSC PNCY +++S+ Q +I + AK +++AGEE+TY+Y
Sbjct: 870 ISEEVVVDATNSGNIARLINHSCMPNCYARIMSMGDQGSRIVLIAKTNVSAGEELTYDYL 929
Query: 915 FPLEEK---KIPCYCGSKKCHGSLN 936
F +E+ K+PC C + C +N
Sbjct: 930 FDPDERDELKVPCLCKAPNCRRFMN 954
>gi|295646353|gb|ADG23062.1| myeloid/lymphoid or mixed-lineage leukemia 3, partial [Gadus
morhua]
Length = 117
Score = 122 bits (307), Expect = 7e-25, Method: Composition-based stats.
Identities = 57/116 (49%), Positives = 83/116 (71%), Gaps = 3/116 (2%)
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
VIEY+G +IR ++++ +E YE G Y+FR+D+ YV+DAT GG AR++NHSC+PNC
Sbjct: 3 VIEYIGTIIRSEVANRKERLYEAQNRGV-YMFRIDNDYVIDATITGGPARYVNHSCSPNC 61
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
T+V++VE + KI I + R I GEE++Y+YKF LE + KIPC+CG+ C +N
Sbjct: 62 ITEVVTVERENKIIISSCRRIQRGEELSYDYKFDLEDDQHKIPCHCGTLNCRKWMN 117
>gi|291232347|ref|XP_002736118.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like, partial
[Saccoglossus kowalevskii]
Length = 3264
Score = 122 bits (307), Expect = 7e-25, Method: Composition-based stats.
Identities = 65/155 (41%), Positives = 95/155 (61%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
K+SQ + K+ R RS+I GL A IE VIEY+G +IR ++++ RE Y
Sbjct: 3111 KSSQYRRLKQEWRNNVYLARSRIQGLGLFATRDIEKHTMVIEYIGTIIRNEVANRREDIY 3170
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E+ G Y+FR++ YV+DAT GG AR++NHSC+PNC +V++ + ++KI I + I
Sbjct: 3171 EEANRGV-YMFRINSDYVIDATLTGGPARYVNHSCSPNCVAEVVTFDKEQKIIIISNSRI 3229
Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
A GEE+TY+YKF E+ KI C C + C +N
Sbjct: 3230 AKGEELTYDYKFDFEDDKNKISCLCKAVNCKKWMN 3264
>gi|429327493|gb|AFZ79253.1| hypothetical protein BEWA_021000 [Babesia equi]
Length = 849
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 87/146 (59%), Gaps = 5/146 (3%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+ K + +SKIH +GL A++PI + ++EY G +I ++DI E Y+ GS+YLF
Sbjct: 700 KAKHVTIGKSKIHGYGLFAVQPIPKGELILEYAGVVITEHLADILEREYDATIGGSTYLF 759
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTK---VISVEG-QKKIFIYAKRHIAAGEEIT 910
RLD Y+VD+T G ARFINHSCNPN T I +G + +YA R I AGEE+
Sbjct: 760 RLDQNYIVDSTLFGNSARFINHSCNPNTMTTNFACIEDDGFGTHVGVYASRDILAGEELY 819
Query: 911 YNYKFPL-EEKKIPCYCGSKKCHGSL 935
YNY+ KK CYCG + C G +
Sbjct: 820 YNYRLSEGSNKKEVCYCGDEMCQGYM 845
>gi|14626492|gb|AAK70214.1| MLL3-like protein [Mus musculus]
Length = 677
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 61/134 (45%), Positives = 84/134 (62%), Gaps = 3/134 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 545 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 603
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE+ Y+YKF
Sbjct: 604 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 663
Query: 916 PLE--EKKIPCYCG 927
E + KIPC+CG
Sbjct: 664 DFEDDQHKIPCHCG 677
>gi|431918576|gb|ELK17794.1| Histone-lysine N-methyltransferase MLL4 [Pteropus alecto]
Length = 100
Score = 122 bits (306), Expect = 1e-24, Method: Composition-based stats.
Identities = 56/92 (60%), Positives = 69/92 (75%), Gaps = 3/92 (3%)
Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
GIG Y+FR+DD VVDAT G ARFINHSC PNC+++VI VEGQK I I+A R I G
Sbjct: 10 GIGC-YMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRG 68
Query: 907 EEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
EE+TY+YKFP+E+ K+PC CG+K+C LN
Sbjct: 69 EELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 100
>gi|326522849|dbj|BAJ88470.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 394
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 55/141 (39%), Positives = 89/141 (63%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+ K+++ +S+ WG VALEP+E DF+IEYVGE+I + R ++ G + Y+
Sbjct: 160 KDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMC 219
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I GE +TY+Y+
Sbjct: 220 EISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGETRVGVFASRIIEVGEPLTYDYR 279
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F +K+ C+CG+K C G L
Sbjct: 280 FVHFGEKVKCHCGAKSCQGYL 300
>gi|222617992|gb|EEE54124.1| hypothetical protein OsJ_00897 [Oryza sativa Japonica Group]
Length = 1585
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 86/130 (66%), Gaps = 2/130 (1%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K KR+ RS IH WGL A + I+ VIEY G+ +R ++D+RE RY + Y
Sbjct: 753 KIENKRVSCGRSGIHGWGLFAAKKIQEGQMVIEYRGDQVRRSVADLREARYHREK-KDCY 811
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
LF++ + VVDAT++G IAR INHSC PNCY +++SV +K +I + AKR ++AGEE+TY
Sbjct: 812 LFKISEDVVVDATEKGNIARLINHSCMPNCYARIMSVGDEKSQIILIAKRDVSAGEELTY 871
Query: 912 NYKFPLEEKK 921
+Y F +E +
Sbjct: 872 DYLFDPDESE 881
>gi|222623224|gb|EEE57356.1| hypothetical protein OsJ_07493 [Oryza sativa Japonica Group]
Length = 384
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 89/141 (63%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+ K+++ ++K WG ++LEP+E DF+IEYVGE+I + R ++ G + Y+
Sbjct: 141 KDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMC 200
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSC+PNC + V+G+ ++ ++A R I GE +TY+Y+
Sbjct: 201 EISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRSIQVGEHLTYDYR 260
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F +K+ CYCG++ C G L
Sbjct: 261 FVHFGEKVKCYCGAQNCQGYL 281
>gi|357512571|ref|XP_003626574.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
gi|355501589|gb|AES82792.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
Length = 164
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/148 (39%), Positives = 92/148 (62%), Gaps = 5/148 (3%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K R+ +S I WGL A ++ D V+EY GE +R ++D+RE +Y G Y
Sbjct: 18 KTENCRVCLGKSAIQGWGLFARRDLQEGDMVVEYRGEQLRRSVADLREAKYRLEG-KDCY 76
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
LF++ + V+DAT +G IAR INHSC PNC+ +++ + Q+ +I + AK +I+AGEE+TY
Sbjct: 77 LFKISEEVVIDATDKGNIARLINHSCMPNCFARIMCLGDQESRIVLIAKTNISAGEELTY 136
Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
+Y F +E+ K+PC+C + C +N
Sbjct: 137 DYLFDTDERDELKVPCHCKAPNCRKFMN 164
>gi|297599580|ref|NP_001047405.2| Os02g0611300 [Oryza sativa Japonica Group]
gi|255671080|dbj|BAF09319.2| Os02g0611300 [Oryza sativa Japonica Group]
Length = 263
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 89/141 (63%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+ K+++ ++K WG ++LEP+E DF+IEYVGE+I + R ++ G + Y+
Sbjct: 20 KDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMC 79
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSC+PNC + V+G+ ++ ++A R I GE +TY+Y+
Sbjct: 80 EISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRSIQVGEHLTYDYR 139
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F +K+ CYCG++ C G L
Sbjct: 140 FVHFGEKVKCYCGAQNCQGYL 160
>gi|300176189|emb|CBK23500.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 85/144 (59%), Gaps = 10/144 (6%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK----------MGIGSSY 852
+S IH WGL A VIE+ GE++ +++ RE + + +G Y
Sbjct: 177 KSPIHGWGLFAKTAFRRGMMVIEFAGEVVGQDVANAREAVWAQGAAVRRRYNRRNVGGCY 236
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
FR+D ++VDAT +G +RFINH C+PNC +V+++ + KI +A R I GEEITY+
Sbjct: 237 FFRMDAEHIVDATVKGNESRFINHCCDPNCDARVVNINNENKIIFFANRDINPGEEITYD 296
Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
YKF E++++ C CG+K C G +N
Sbjct: 297 YKFQYEDRELVCLCGAKNCVGRMN 320
>gi|83286297|ref|XP_730100.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23489726|gb|EAA21665.1| similar to KIAA0304 gene product-related [Plasmodium yoelii yoelii]
Length = 1137
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/122 (50%), Positives = 83/122 (68%), Gaps = 4/122 (3%)
Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
E E VIEY+GE IR ISD RE YEK+ S Y+FRL++ ++DATK G ++RFINHS
Sbjct: 1017 EKEMPVIEYIGEYIRNIISDKREKYYEKIE-SSCYMFRLNENIIIDATKWGNVSRFINHS 1075
Query: 878 CNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGS 934
C PNC+ K++S + K I I+AK+ I EEITY+Y+F +E +K+ C CGS C G
Sbjct: 1076 CEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFGVESEGEKLICLCGSSTCLGR 1135
Query: 935 LN 936
+N
Sbjct: 1136 MN 1137
>gi|326487338|dbj|BAJ89653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 55/141 (39%), Positives = 88/141 (62%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+ K+++ +S+ WG VALEP+E DF+IEYVGE+I + R ++ G + Y+
Sbjct: 106 KDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMC 165
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSC+PNC + VEG+ + ++A R I GE +TY+Y+
Sbjct: 166 EISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGETRAGVFASRIIEVGEPLTYDYR 225
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F +K+ C+CG+K C G L
Sbjct: 226 FVHFGEKVKCHCGAKSCQGYL 246
>gi|255557755|ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
gi|223540953|gb|EEF42511.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
Length = 1125
Score = 120 bits (300), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/128 (46%), Positives = 82/128 (64%), Gaps = 4/128 (3%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
K L +S IH GL I + V+EYVGE++ L+++D RE Y+ K+ S+ Y
Sbjct: 985 KHLVVYKSGIHALGLYTARFISRGEMVVEYVGEIVGLRVADKRENEYQSGRKLQYKSACY 1044
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
FR+D ++DAT +GGIARF+NHSC PNC KVISV KK+ +A+R I GEEITY+
Sbjct: 1045 FFRIDKENIIDATHKGGIARFVNHSCLPNCVAKVISVRNDKKVVFFAERDIYPGEEITYD 1104
Query: 913 YKFPLEEK 920
Y F E++
Sbjct: 1105 YHFNHEDE 1112
>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
(Silurana) tropicalis]
Length = 6019
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5878 KTNVYLARSRIQGLGLYASKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5936
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5937 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5996
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5997 DFEDDQHKIPCHCGAWNCRKWMN 6019
>gi|297461543|ref|XP_591902.5| PREDICTED: histone-lysine N-methyltransferase SETD1B [Bos taurus]
Length = 1936
Score = 119 bits (299), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/136 (52%), Positives = 94/136 (69%), Gaps = 5/136 (3%)
Query: 747 TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
T G + A A+ + S R R + R LL++ G+ +LLK +Q+K RKK+L+F +
Sbjct: 1726 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1783
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S IHDWGL A+EPI A++ VIEYVG+ IR I+D+RE RYE GIGSSY+FR+D ++D
Sbjct: 1784 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1843
Query: 864 ATKRGGIARFINHSCN 879
ATK G ARFINHSCN
Sbjct: 1844 ATKCGNFARFINHSCN 1859
>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
Length = 5261
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5120 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5178
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5179 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5238
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5239 DFEDDQHKIPCHCGAWNCRKWMN 5261
>gi|62088568|dbj|BAD92731.1| myeloid/lymphoid or mixed-lineage leukemia 2 variant [Homo sapiens]
Length = 2704
Score = 119 bits (298), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 2563 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 2621
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 2622 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 2681
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 2682 DFEDDQHKIPCHCGAWNCRKWMN 2704
>gi|196016259|ref|XP_002117983.1| hypothetical protein TRIADDRAFT_62004 [Trichoplax adhaerens]
gi|190579456|gb|EDV19551.1| hypothetical protein TRIADDRAFT_62004 [Trichoplax adhaerens]
Length = 589
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/136 (47%), Positives = 90/136 (66%), Gaps = 3/136 (2%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
RS I GL A + IE VIEY+G +IR ++++ RE Y+K G Y+FR+D +VV
Sbjct: 455 RSLIQGLGLFARKNIEKNTMVIEYIGAMIRNEVANKRERIYQKANHGI-YMFRIDPDFVV 513
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
DAT+ GG+AR+INHSC PNC +V++ + + +I I + R I GEE+TY+YKF E+
Sbjct: 514 DATEDGGLARYINHSCQPNCVAEVVTFDSEPRIIIISNRRIDKGEELTYDYKFEYEDDLN 573
Query: 921 KIPCYCGSKKCHGSLN 936
KIPC CG+ C G +N
Sbjct: 574 KIPCMCGAPNCRGWMN 589
>gi|170047037|ref|XP_001851046.1| set domain protein [Culex quinquefasciatus]
gi|167869598|gb|EDS32981.1| set domain protein [Culex quinquefasciatus]
Length = 719
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 49/89 (55%), Positives = 71/89 (79%)
Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
V +++R ++D+RE +YE +GI S+YLFR+D ++DA+ G +ARFINHSCNPNCYTKV
Sbjct: 332 VSQVVRPSVTDLRETKYEAIGISSTYLFRIDMETIIDASTCGNLARFINHSCNPNCYTKV 391
Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
I++E +KKI IY+K+ I +EITY+YK+
Sbjct: 392 ITIESEKKIVIYSKQPIGVNKEITYDYKW 420
>gi|344254290|gb|EGW10394.1| Histone-lysine N-methyltransferase MLL2 [Cricetulus griseus]
Length = 1475
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 1334 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 1392
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 1393 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 1452
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 1453 DFEDDQHKIPCHCGAWNCRKWMN 1475
>gi|37590100|gb|AAH58659.1| Mll2 protein [Mus musculus]
Length = 1250
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 1109 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 1167
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 1168 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 1227
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 1228 DFEDDQHKIPCHCGAWNCRKWMN 1250
>gi|402577843|gb|EJW71799.1| hypothetical protein WUBG_17294 [Wuchereria bancrofti]
Length = 122
Score = 117 bits (293), Expect = 3e-23, Method: Composition-based stats.
Identities = 57/119 (47%), Positives = 81/119 (68%), Gaps = 6/119 (5%)
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
+IEY GE+IR ++ ++RE +YE G Y+FR+DD ++DAT GG AR+INHSC+PNC
Sbjct: 5 IIEYKGEVIRSEVGEMREKKYEAQNRGV-YMFRIDDERLIDATMAGGPARYINHSCDPNC 63
Query: 883 YTKVIS---VEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
T+++ KKI I A R I+AGEE+TY+Y+F +E+ KIPC CG+ C +N
Sbjct: 64 STRLVDSGPCGDDKKIIIIANRPISAGEELTYDYQFDIEDAADKIPCLCGAPNCQKWMN 122
>gi|156088645|ref|XP_001611729.1| SET domain containing protein [Babesia bovis]
gi|154798983|gb|EDO08161.1| SET domain containing protein [Babesia bovis]
Length = 866
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/165 (40%), Positives = 99/165 (60%), Gaps = 12/165 (7%)
Query: 781 AEGAELLKASQVK---ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISD 837
AE +LL + K A+ KR+ +S++H +GL A++ I D ++EY G +I ++D
Sbjct: 697 AEQRQLLGLTNKKPFMAKCKRVIIGKSRVHGYGLFAVDTINKGDLIMEYAGVVISDYMAD 756
Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIF- 896
+RE Y+++ GS Y+FRLD ++D+T G ARFINHSC+PN T S + + +F
Sbjct: 757 MREVMYQRLVCGSIYMFRLDLNRIIDSTFYGNCARFINHSCDPNTATSNFS-DIDEDVFG 815
Query: 897 ----IYAKRHIAAGEEITYNYKFPL--EEKKIPCYCGSKKCHGSL 935
+YA + I AGEEI YNY+ L E +I C CGS +C G +
Sbjct: 816 THVGVYASKVILAGEEIYYNYRLSLGSENPQI-CRCGSYQCTGYM 859
>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
Length = 4785
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 88/167 (52%), Gaps = 27/167 (16%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A IE VIEY+G +IR ++++ +E YE G Y+FR
Sbjct: 4620 KSSVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4678
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEE------- 908
+D+ +V+DAT GG AR+INHSC PNC +V++ E KI I + R I GEE
Sbjct: 4679 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEESFTNVIT 4738
Query: 909 -----------------ITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+ Y+YKF E + KIPC+CG+ C +N
Sbjct: 4739 HHGSSRAGKLPLSAMGPLCYDYKFDFEDDQHKIPCHCGAVNCRKWMN 4785
>gi|302849081|ref|XP_002956071.1| histone H3 methyltransferase/Trithorax [Volvox carteri f.
nagariensis]
gi|300258576|gb|EFJ42811.1| histone H3 methyltransferase/Trithorax [Volvox carteri f.
nagariensis]
Length = 2029
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/147 (41%), Positives = 87/147 (59%), Gaps = 13/147 (8%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFR 855
+RL +S IH WG A P + D +IEY GELIR +SD+RE R Y K+ +Y+F
Sbjct: 1729 QRLAAGKSAIHGWGAFAKVPHKRGDMLIEYAGELIRPSVSDVREKRMYNKLVGCGTYIFT 1788
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG------QKKIFIYAK-----RHIA 904
L+D +DAT+ G +A +NHSC+PNCY++ I++ + I AK R I
Sbjct: 1789 LNDDQHIDATRAGNMAHLLNHSCDPNCYSRTITLTDPVTRATSDHVIITAKVCVLWRDIE 1848
Query: 905 AGEEITYNYKFPLEEKKIPCYCGSKKC 931
A EE+TY+Y+F E ++PC CG+ C
Sbjct: 1849 AWEELTYDYRFNSSE-ELPCNCGAATC 1874
>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4301
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE G Y+FR
Sbjct: 4160 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYESQNRGI-YMFR 4218
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ V+DAT GG AR++NHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 4219 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4278
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4279 DFEDDQHKIPCHCGAWNCRKWMN 4301
>gi|242073096|ref|XP_002446484.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
gi|241937667|gb|EES10812.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
Length = 521
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 84/144 (58%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K + R R R++ WGLVA E I A FVIEY GE+I K S R YE G+ +Y
Sbjct: 81 KCQYARTRLVRTEGRGWGLVADENIMAGQFVIEYCGEVISWKESKRRAQAYETQGLKDAY 140
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DAT++G ARFINHSC PNC T+ +V G+ ++ I+AK+ I G E++Y+
Sbjct: 141 IIYLNADESIDATRKGNFARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPFGTELSYD 200
Query: 913 YKFPLEEK-KIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 201 YNFEWYGGVMVRCLCGAASCSGFL 224
>gi|148672216|gb|EDL04163.1| mCG147092 [Mus musculus]
Length = 900
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 759 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 817
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 818 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 877
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 878 DFEDDQHKIPCHCGAWNCRKWMN 900
>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
Length = 4823
Score = 116 bits (291), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE G Y+FR
Sbjct: 4682 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYESQNRGI-YMFR 4740
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ V+DAT GG AR++NHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 4741 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4800
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4801 DFEDDQHKIPCHCGAWNCRKWMN 4823
>gi|440801495|gb|ELR22513.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 981
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 55/128 (42%), Positives = 77/128 (60%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL A E I F+IEYVGE+I + R YE+MG+ Y LD +DA+++G
Sbjct: 420 WGLRAREKIPKGTFIIEYVGEVISTDMCQDRMKYYEEMGLEHYYFLTLDGSECIDASQKG 479
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSCNPN T +V+ + ++ I+A+ I G+EIT++Y+F KK C+CG
Sbjct: 480 NLARFINHSCNPNAKTHKWTVDKEIRVGIFAEEDIPVGQEITFDYQFERFGGKKQKCFCG 539
Query: 928 SKKCHGSL 935
C G L
Sbjct: 540 ETNCRGFL 547
>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Oreochromis
niloticus]
Length = 4907
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE G Y+FR
Sbjct: 4766 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYEAQNRGI-YMFR 4824
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ V+DAT GG AR++NHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 4825 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4884
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4885 DFEDDQHKIPCHCGAWNCRKWMN 4907
>gi|357163489|ref|XP_003579748.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like
[Brachypodium distachyon]
Length = 517
Score = 116 bits (290), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 85/144 (59%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K + R R +++ WGL+A E I A FVIEY GE+I K + R YE G+ +Y
Sbjct: 79 KCQYARTRLVKTEGRGWGLLAEENIMAGQFVIEYCGEVISWKEAKRRSQAYEDQGLMEAY 138
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DATK+G +ARFINHSC PNC T+ +V G+ ++ I+AK+ I G E++Y+
Sbjct: 139 IIYLNTAESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGMELSYD 198
Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 199 YNFEWFGGAIVRCLCGAASCSGFL 222
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 116 bits (290), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 69/184 (37%), Positives = 100/184 (54%), Gaps = 18/184 (9%)
Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ-----------RSKIHDWGLVAL 814
SAR+N+ N+ GA K V R + R+ RS+I GL A
Sbjct: 4694 SARSNQ----NVAQGEGGAPHNKPPVVHPRSSQYRWMKSEWRANVYLGRSRIQGLGLFAA 4749
Query: 815 EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
IE + VIEY G ++R +++ IR+ + + + ++FR+D +VVDAT GGIAR+I
Sbjct: 4750 RDIEKQTMVIEYNGTILRNEVA-IRKEKIYRSKNRAVFMFRIDSEHVVDATLNGGIARYI 4808
Query: 875 NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCH 932
NHSC PNC +V++ E KI I R I GEE+ +NY+F E KI C+CG+ +C
Sbjct: 4809 NHSCAPNCVAEVVTFERGYKIIISCIRRIEKGEELCFNYQFESVEGQHKIACHCGAPECR 4868
Query: 933 GSLN 936
+N
Sbjct: 4869 KWIN 4872
>gi|410899461|ref|XP_003963215.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Takifugu
rubripes]
Length = 3715
Score = 116 bits (290), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE G Y+FR
Sbjct: 3574 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYESQNRGI-YMFR 3632
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ V+DAT GG AR++NHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 3633 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 3692
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 3693 DFEDDQHKIPCHCGAWNCRKWMN 3715
>gi|238013758|gb|ACR37914.1| unknown [Zea mays]
Length = 252
Score = 116 bits (290), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 86/141 (60%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+ K+++ ++K WG VALEP+E DFVIEYVGE+I + R + G + Y+
Sbjct: 20 KDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIRRRGDKNFYMC 79
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G ++RF+NHSC PNC + V+G+ ++ ++A R I GE +TY+Y+
Sbjct: 80 EISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIEVGEPLTYDYR 139
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F +K+ C+C + C G L
Sbjct: 140 FVHFGEKVKCHCEAVNCQGYL 160
>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
Length = 4897
Score = 115 bits (289), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE G Y+FR
Sbjct: 4756 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYELQNRGI-YMFR 4814
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ V+DAT GG AR++NHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 4815 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4874
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4875 DFEDDQHKIPCHCGAWNCRKWMN 4897
>gi|255575537|ref|XP_002528669.1| set domain protein, putative [Ricinus communis]
gi|223531892|gb|EEF33708.1| set domain protein, putative [Ricinus communis]
Length = 495
Score = 115 bits (289), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 86/139 (61%), Gaps = 1/139 (0%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
+ R +++ WGL+A E I+A F+IEY GE+I K + R YE+ G+ +++ L+
Sbjct: 84 KTRLFKTEGRGWGLLADEDIKAGQFIIEYCGEVISWKEAKRRSQAYERQGLKDAFIISLN 143
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP- 916
+DAT++G +ARFINHSC PNC T+ +V G+ ++ I+AK+ I+ G E+ Y+Y F
Sbjct: 144 SSESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFAKQDISIGTELAYDYNFEW 203
Query: 917 LEEKKIPCYCGSKKCHGSL 935
K+ C CGS C G L
Sbjct: 204 YGGAKVRCLCGSASCSGFL 222
>gi|168027908|ref|XP_001766471.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
patens]
gi|162682380|gb|EDQ68799.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
patens]
Length = 2933
Score = 115 bits (289), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 85/157 (54%), Gaps = 31/157 (19%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK----MGIGSSY 852
KRL +S IH GL + I + V+EYVGE++ +++D RE Y G+ Y
Sbjct: 2668 KRLAVYKSGIHALGLYTTDFIAEGEVVVEYVGEIVGSRVADKREAEYHSGKRLQYQGACY 2727
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCN---------------------------PNCYTK 885
LFR+D ++DAT++GGIARF+NHSC+ PNC K
Sbjct: 2728 LFRIDTEQIIDATRKGGIARFVNHSCSLCMCGILEIANTSGLLGYGYLIAWNSWPNCVAK 2787
Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
VI VE KK+ +AKR+I A EE+TY+YKF +E I
Sbjct: 2788 VICVENLKKVVFFAKRNIYAAEEVTYDYKFNCDEHII 2824
>gi|355702682|gb|AES02013.1| myeloid/lymphoid or mixed-lineage leukemia 2 [Mustela putorius furo]
Length = 1014
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 90/138 (65%), Gaps = 3/138 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 874 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 932
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 933 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 992
Query: 916 PLE--EKKIPCYCGSKKC 931
E + KIPC+CG+ C
Sbjct: 993 DFEDDQHKIPCHCGAWNC 1010
>gi|255574954|ref|XP_002528383.1| set domain protein, putative [Ricinus communis]
gi|223532171|gb|EEF33976.1| set domain protein, putative [Ricinus communis]
Length = 500
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 88/141 (62%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
++K+++ +++ WG+ +EPI DF+IEY+GE+I + + R + G+ + Y+
Sbjct: 328 KEKKIKIVKTEFCGWGVETVEPINKGDFIIEYIGEVIDDAVCEQRLWDMKYKGVQNFYMC 387
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I GE +TY+Y+
Sbjct: 388 EIRKDFTIDATFKGNSSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKVGEPLTYDYR 447
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F ++ C+CG+ CHG L
Sbjct: 448 FVQFGPEVRCHCGAPNCHGYL 468
>gi|224067978|ref|XP_002302628.1| SET domain protein [Populus trichocarpa]
gi|222844354|gb|EEE81901.1| SET domain protein [Populus trichocarpa]
Length = 667
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 93/172 (54%), Gaps = 28/172 (16%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE-------- 844
K R+ F +S IH WGL A I+ + VIEY GE +R ++D+RE RY
Sbjct: 496 KTENHRVCFGKSGIHGWGLFARRNIQEGEMVIEYRGEKVRRSVADLREARYRLEGKDCYV 555
Query: 845 ----------------KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
MG + LF++ + V+DAT +G IAR INHSC PNCY +++S
Sbjct: 556 HHKGFESQDLLSGIEGAMGQLNLPLFKISEEVVIDATNKGNIARLINHSCMPNCYARIMS 615
Query: 889 V-EGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
V + + +I + AK ++AG E+TY+Y F +E+ K+PC C + C +N
Sbjct: 616 VGDVENRIVLIAKTDVSAGNELTYDYLFDPDERDDLKVPCLCKAPNCRKFMN 667
>gi|297842431|ref|XP_002889097.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
lyrata]
gi|297334938|gb|EFH65356.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
lyrata]
Length = 525
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 57/128 (44%), Positives = 79/128 (61%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGLVALE I+A F+IEY GE+I K + R YE G+ +Y+ L+ +DATK+G
Sbjct: 101 WGLVALEDIKAGQFIIEYCGEVISWKEAKRRAQTYETHGVKDAYIISLNASEAIDATKKG 160
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+AK I+ E+ Y+Y F K+ C CG
Sbjct: 161 SLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEWYGGAKVRCLCG 220
Query: 928 SKKCHGSL 935
+ C G L
Sbjct: 221 AVACSGFL 228
>gi|212721872|ref|NP_001132822.1| uncharacterized protein LOC100194312 [Zea mays]
gi|194695492|gb|ACF81830.1| unknown [Zea mays]
gi|413923000|gb|AFW62932.1| putative SET-domain containing protein family [Zea mays]
Length = 418
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 81/135 (60%)
Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYV 861
Q +K WG VALEP+E DFVIEYVGE+I + R + G + Y+ + +
Sbjct: 193 QHTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIRRRGDKNFYMCEISKDFT 252
Query: 862 VDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 921
+DAT +G ++RF+NHSC PNC + V+G+ ++ ++A R I GE +TY+Y+F +K
Sbjct: 253 IDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIEVGEPLTYDYRFVHFGEK 312
Query: 922 IPCYCGSKKCHGSLN 936
+ C+C + C G L
Sbjct: 313 VKCHCEAVNCQGYLG 327
>gi|2980780|emb|CAA18207.1| putative protein [Arabidopsis thaliana]
gi|7269987|emb|CAB79804.1| putative protein [Arabidopsis thaliana]
Length = 477
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 86/141 (60%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
++K+++ +++ WG+ A E I EDF++EY+GE+I + R + G+ Y+
Sbjct: 304 KEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQRLWDMKHKGMKDFYMC 363
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSCNPNC + VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 364 EIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYR 423
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F ++ C CGS+ C G L
Sbjct: 424 FVQFGPEVKCNCGSENCQGYL 444
>gi|22330671|ref|NP_177797.2| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|42572135|ref|NP_974158.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|75243465|sp|Q84WW6.1|ASHH1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH1; AltName:
Full=ASH1 homolog 1; AltName: Full=Protein SET DOMAIN
GROUP 26
gi|25054844|gb|AAN71912.1| unknown protein [Arabidopsis thaliana]
gi|225898088|dbj|BAH30376.1| hypothetical protein [Arabidopsis thaliana]
gi|332197758|gb|AEE35879.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|332197759|gb|AEE35880.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
Length = 492
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGLVALE I+A F++EY GE+I K + R YE G+ +Y+ L+ +DATK+G
Sbjct: 101 WGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKDAYIISLNASEAIDATKKG 160
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+AK I+ E+ Y+Y F K+ C CG
Sbjct: 161 SLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEWYGGAKVRCLCG 220
Query: 928 SKKCHGSL 935
+ C G L
Sbjct: 221 AVACSGFL 228
>gi|18417683|ref|NP_567859.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
gi|75164864|sp|Q949T8.1|ASHR3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR3; AltName:
Full=ASH1-related protein 3; AltName: Full=Protein SET
DOMAIN GROUP 4; AltName: Full=Protein stamen loss
gi|15292921|gb|AAK92831.1| unknown protein [Arabidopsis thaliana]
gi|20465681|gb|AAM20309.1| unknown protein [Arabidopsis thaliana]
gi|56201422|dbj|BAD72877.1| stamen loss [Arabidopsis thaliana]
gi|332660421|gb|AEE85821.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
Length = 497
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 86/141 (60%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
++K+++ +++ WG+ A E I EDF++EY+GE+I + R + G+ Y+
Sbjct: 324 KEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQRLWDMKHKGMKDFYMC 383
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSCNPNC + VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 384 EIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYR 443
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F ++ C CGS+ C G L
Sbjct: 444 FVQFGPEVKCNCGSENCQGYL 464
>gi|34782989|gb|AAH39197.1| MLL2 protein, partial [Homo sapiens]
Length = 395
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 254 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 312
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 313 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 372
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 373 DFEDDQHKIPCHCGAWNCRKWMN 395
>gi|297602734|ref|NP_001052807.2| Os04g0429100 [Oryza sativa Japonica Group]
gi|255675465|dbj|BAF14721.2| Os04g0429100, partial [Oryza sativa Japonica Group]
Length = 612
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K++ R +++ WGL+A E I A FV+EY GE+I K + R YE G+ +Y
Sbjct: 176 KSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRRSQAYENQGLTDAY 235
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DATK+G +ARFINHSC PNC T+ +V G+ ++ I+AK+ I G E++Y+
Sbjct: 236 IIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGTELSYD 295
Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 296 YNFEWFGGAMVRCLCGAGSCSGFL 319
>gi|6143888|gb|AAF04434.1|AC010718_3 unknown protein; 29143-26659 [Arabidopsis thaliana]
Length = 528
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGLVALE I+A F++EY GE+I K + R YE G+ +Y+ L+ +DATK+G
Sbjct: 101 WGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKDAYIISLNASEAIDATKKG 160
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+AK I+ E+ Y+Y F K+ C CG
Sbjct: 161 SLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEWYGGAKVRCLCG 220
Query: 928 SKKCHGSL 935
+ C G L
Sbjct: 221 AVACSGFL 228
>gi|345792161|ref|XP_543684.3| PREDICTED: histone-lysine N-methyltransferase MLL2 [Canis lupus
familiaris]
Length = 5552
Score = 114 bits (285), Expect = 2e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5411 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5469
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5470 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5529
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5530 DFEDDQHKIPCHCGAWNCRKWMN 5552
>gi|281343718|gb|EFB19302.1| hypothetical protein PANDA_017001 [Ailuropoda melanoleuca]
Length = 4932
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 4791 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 4849
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 4850 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4909
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4910 DFEDDQHKIPCHCGAWNCRKWMN 4932
>gi|15488418|gb|AAL01110.1|AF408059_1 ASH1-like protein 1 [Arabidopsis thaliana]
Length = 229
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGLVALE I+A F++EY GE+I K + R YE G+ +Y+ L+ +DATK+G
Sbjct: 19 WGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKDAYIISLNASEAIDATKKG 78
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+AK I+ E+ Y+Y F K+ C CG
Sbjct: 79 SLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEWYGGAKVRCLCG 138
Query: 928 SKKCHGSL 935
+ C G L
Sbjct: 139 AVACSGFL 146
>gi|410964289|ref|XP_003988688.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Felis catus]
Length = 5559
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5418 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5476
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5477 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5536
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5537 DFEDDQHKIPCHCGAWNCRKWMN 5559
>gi|332206905|ref|XP_003252537.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Nomascus leucogenys]
Length = 5407
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5266 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5324
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5325 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5384
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5385 DFEDDQHKIPCHCGAWNCRKWMN 5407
>gi|224082490|ref|XP_002306713.1| SET domain protein [Populus trichocarpa]
gi|222856162|gb|EEE93709.1| SET domain protein [Populus trichocarpa]
Length = 495
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 95/169 (56%), Gaps = 9/169 (5%)
Query: 776 NLLAAAEGAELLKASQVKARKKRL-RFQRSKIH-------DWGLVALEPIEAEDFVIEYV 827
NLL + E V + +R +F+ +K WGL+A E I+A F+IEY
Sbjct: 54 NLLTSTECTPGYCPCGVYCKNQRFQKFEYAKTQLFKTEGRGWGLLADEEIKAGQFIIEYC 113
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
GE+I K + R YE G+ +++ L+ +DATK+G +ARFINHSC PNC T+
Sbjct: 114 GEVISWKEAKKRSQVYENQGLKDAFIISLNSTESIDATKKGSLARFINHSCQPNCETRKW 173
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCGSKKCHGSL 935
+V G+ ++ I+AK++I+ G E+ Y+Y F K+ C CG+ C G L
Sbjct: 174 TVLGEIRVGIFAKQNISIGTELAYDYNFEWYGGAKVRCLCGAVNCSGFL 222
>gi|301783643|ref|XP_002927255.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Ailuropoda
melanoleuca]
Length = 5483
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5342 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5400
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5401 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5460
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5461 DFEDDQHKIPCHCGAWNCRKWMN 5483
>gi|170047578|ref|XP_001851293.1| Setd1a protein [Culex quinquefasciatus]
gi|167869969|gb|EDS33352.1| Setd1a protein [Culex quinquefasciatus]
Length = 373
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/96 (53%), Positives = 71/96 (73%), Gaps = 6/96 (6%)
Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
++R ++D+RE +YE +GI S+YLFR+D ++DA+ G +ARFINHSCNPNCYTKVI++
Sbjct: 284 VVRPSVTDLREIKYEAIGISSTYLFRIDMETIIDASNYGNLARFINHSCNPNCYTKVITI 343
Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCY 925
E +KKI I +EITY+YK PLE++KI Y
Sbjct: 344 ELEKKI------PIGVNKEITYDYKVPLEDEKISFY 373
>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
harrisii]
Length = 5047
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 4906 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 4964
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 4965 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5024
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5025 DFEDDQHKIPCHCGAWNCRKWMN 5047
>gi|390467630|ref|XP_002807137.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Callithrix jacchus]
Length = 5289
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5148 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5206
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5207 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5266
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5267 DFEDDQHKIPCHCGAWNCRKWMN 5289
>gi|57997471|emb|CAE45854.2| hypothetical protein [Homo sapiens]
Length = 116
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 54/116 (46%), Positives = 77/116 (66%), Gaps = 3/116 (2%)
Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
VIEY+G +IR ++++ +E YE G Y+FR+D+ +V+DAT GG AR+INHSC PNC
Sbjct: 2 VIEYIGTIIRNEVANRKEKLYESQNRGV-YMFRMDNDHVIDATLTGGPARYINHSCAPNC 60
Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
+V++ E KI I + R I GEE+ Y+YKF E + KIPC+CG+ C +N
Sbjct: 61 VAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCHCGAVNCRKWMN 116
>gi|319764851|ref|YP_004128788.1| nuclear protein set [Alicycliphilus denitrificans BC]
gi|330827056|ref|YP_004390359.1| nuclear protein SET [Alicycliphilus denitrificans K601]
gi|317119412|gb|ADV01901.1| nuclear protein SET [Alicycliphilus denitrificans BC]
gi|329312428|gb|AEB86843.1| nuclear protein SET [Alicycliphilus denitrificans K601]
Length = 183
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 60/149 (40%), Positives = 89/149 (59%), Gaps = 12/149 (8%)
Query: 794 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
A +R++ +RS +H G+ AL+PI A + VIEY GE+I K +R ++ ++
Sbjct: 13 APGRRIQTRRSGVHGKGVFALQPIAAGEVVIEYTGEVISWK-EALRRHPHDPAQPNHTFY 71
Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
F +D+ V+DA G AR+INHSCNPNCY E +IFI A R+I AGEE+ Y+Y
Sbjct: 72 FHVDEDRVIDAKFGGNSARWINHSCNPNCYAD----ERHGRIFITALRNIEAGEELNYDY 127
Query: 914 KFPLEEK-------KIPCYCGSKKCHGSL 935
++E+ + PC+CGS+ C G+L
Sbjct: 128 GLIIDERYTPKLKAEYPCWCGSRNCRGTL 156
>gi|225445763|ref|XP_002272781.1| PREDICTED: histone-lysine N-methyltransferase ASHR3 [Vitis
vinifera]
gi|297743704|emb|CBI36587.3| unnamed protein product [Vitis vinifera]
Length = 494
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 88/141 (62%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
++K+++ ++++ WG+ A E I DFVIEY+GE+I + + R + G + Y+
Sbjct: 326 KEKKIKIVKTELCGWGVDAAESINKGDFVIEYIGEVIDDALCERRLWDMKDRGDQNFYMC 385
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 386 EIRKDFTIDATFKGNASRFLNHSCDPNCKLEKWQVEGETRVGVFAARSIKAGEPLTYDYR 445
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F ++ C+CG+ CHG L
Sbjct: 446 FVRFGPEVKCHCGAPSCHGYL 466
>gi|348580193|ref|XP_003475863.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Cavia
porcellus]
Length = 5577
Score = 114 bits (285), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5436 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5494
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5495 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5554
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5555 DFEDDQHKIPCHCGAWNCRKWMN 5577
>gi|426226681|ref|XP_004007467.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Ovis aries]
Length = 5387
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5246 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5304
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5305 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5364
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5365 DFEDDQHKIPCHCGAWNCRKWMN 5387
>gi|119578438|gb|EAW58034.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_a [Homo
sapiens]
Length = 4539
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 4398 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 4456
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 4457 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4516
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 4517 DFEDDQHKIPCHCGAWNCRKWMN 4539
>gi|395744200|ref|XP_002823221.2| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 3 [Pongo
abelii]
Length = 5293
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5152 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5210
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5211 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5270
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5271 DFEDDQHKIPCHCGAWNCRKWMN 5293
>gi|414587222|tpg|DAA37793.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
Length = 489
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K + R R R+ WGLVA E I A FVIEY GE+I K + R YE + +Y
Sbjct: 57 KCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDAY 116
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DAT++G +ARFINHSC PNC T+ +V G+ ++ I+AK++I G E++Y+
Sbjct: 117 IIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSYD 176
Query: 913 YKFPLEEK-KIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 177 YNFEWYGGVMVRCLCGAASCSGFL 200
>gi|402885854|ref|XP_003919662.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Papio anubis]
Length = 5547
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5406 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5464
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5465 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5524
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5525 DFEDDQHKIPCHCGAWNCRKWMN 5547
>gi|355564192|gb|EHH20692.1| hypothetical protein EGK_03605 [Macaca mulatta]
Length = 5538
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5397 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5455
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5456 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5515
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5516 DFEDDQHKIPCHCGAWNCRKWMN 5538
>gi|194666944|ref|XP_583302.4| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|297474553|ref|XP_002687353.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|296487853|tpg|DAA29966.1| TPA: myeloid/lymphoid or mixed-lineage leukemia 2-like [Bos taurus]
Length = 5503
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5362 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5420
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5421 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5480
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5481 DFEDDQHKIPCHCGAWNCRKWMN 5503
>gi|426372409|ref|XP_004053116.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 2 [Gorilla
gorilla gorilla]
Length = 5284
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5143 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5201
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5202 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5261
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5262 DFEDDQHKIPCHCGAWNCRKWMN 5284
>gi|242065740|ref|XP_002454159.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
gi|241933990|gb|EES07135.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
Length = 328
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 85/141 (60%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+ K+++ ++K WG VALEP+E DFVIEYVGE+I + R G + Y+
Sbjct: 96 KDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIRYRGDKNFYMC 155
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G ++RF+NHSC PNC + V+G+ ++ ++A R I GE +TY+Y+
Sbjct: 156 EISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIKVGEPLTYDYR 215
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F +K+ C+C + C G L
Sbjct: 216 FVHFGEKVKCHCEAVNCQGYL 236
>gi|426372407|ref|XP_004053115.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Gorilla
gorilla gorilla]
Length = 5550
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5409 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5467
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5468 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5527
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5528 DFEDDQHKIPCHCGAWNCRKWMN 5550
>gi|395841650|ref|XP_003793647.1| PREDICTED: uncharacterized protein LOC100944849 [Otolemur garnettii]
Length = 5488
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5347 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5405
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5406 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5465
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5466 DFEDDQHKIPCHCGAWNCRKWMN 5488
>gi|297691727|ref|XP_002823219.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Pongo
abelii]
Length = 5559
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5418 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5476
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5477 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5536
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5537 DFEDDQHKIPCHCGAWNCRKWMN 5559
>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
sapiens]
Length = 5265
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5124 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5182
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5183 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5242
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5243 DFEDDQHKIPCHCGAWNCRKWMN 5265
>gi|403297007|ref|XP_003939383.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Saimiri
boliviensis boliviensis]
Length = 5498
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5357 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5415
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5416 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5475
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5476 DFEDDQHKIPCHCGAWNCRKWMN 5498
>gi|313471390|sp|Q6PDK2.2|MLL2_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=Lysine N-methyltransferase 2D; Short=KMT2D
Length = 5588
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5447 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5505
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5506 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5565
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5566 DFEDDQHKIPCHCGAWNCRKWMN 5588
>gi|359718904|ref|NP_001028448.3| histone-lysine N-methyltransferase MLL2 [Mus musculus]
Length = 5588
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5447 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5505
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5506 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5565
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5566 DFEDDQHKIPCHCGAWNCRKWMN 5588
>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=ALL1-related protein; AltName: Full=Lysine
N-methyltransferase 2D; Short=KMT2D; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 2
gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
sapiens]
Length = 5537
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5396 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5454
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5455 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5514
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5515 DFEDDQHKIPCHCGAWNCRKWMN 5537
>gi|397510996|ref|XP_003846168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Pan paniscus]
Length = 5373
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5232 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5290
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5291 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5350
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5351 DFEDDQHKIPCHCGAWNCRKWMN 5373
>gi|392355921|ref|XP_002729900.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Rattus
norvegicus]
Length = 5543
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5402 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5460
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5461 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5520
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5521 DFEDDQHKIPCHCGAWNCRKWMN 5543
>gi|392341685|ref|XP_001062568.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Rattus norvegicus]
Length = 5543
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5402 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5460
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5461 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5520
Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
E + KIPC+CG+ C +N
Sbjct: 5521 DFEDDQHKIPCHCGAWNCRKWMN 5543
>gi|397580684|gb|EJK51670.1| hypothetical protein THAOC_29137 [Thalassiosira oceanica]
Length = 1171
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 83/139 (59%), Gaps = 8/139 (5%)
Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
S++H GL A I+ V EY+GE + + +D R+ ++ G G Y FR+ V+D
Sbjct: 1022 SEVHGRGLYADCKIDKGQIVAEYIGEYVTQEAADRRD--HKGGGEGQDYQFRIAASLVLD 1079
Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQ---KKIFIYAKRHIAAGEEITYNYKFPLEEK 920
AT +GG AR+INHSC+PNC ++ E K++ I +KR I A EE+TY+Y+F LE
Sbjct: 1080 ATLKGGCARYINHSCDPNCIANIVEGEQHTHLKRVLIISKRDIDANEELTYDYQFALETN 1139
Query: 921 ---KIPCYCGSKKCHGSLN 936
++PC CGS C G LN
Sbjct: 1140 LGLRVPCNCGSACCRGFLN 1158
>gi|297802948|ref|XP_002869358.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315194|gb|EFH45617.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 497
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 86/141 (60%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
++K++R +++ WG+ A E I EDF++EY+GE+I + R + G+ Y+
Sbjct: 324 KEKKIRIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQRLWDMKHKGMKDFYMC 383
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSC+PNC + VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 384 EIQKDFTIDATFKGNASRFLNHSCSPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYR 443
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F ++ C CGS+ C G L
Sbjct: 444 FVQFGPEVKCNCGSESCQGYL 464
>gi|414587221|tpg|DAA37792.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
Length = 503
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K + R R R+ WGLVA E I A FVIEY GE+I K + R YE + +Y
Sbjct: 71 KCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDAY 130
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DAT++G +ARFINHSC PNC T+ +V G+ ++ I+AK++I G E++Y+
Sbjct: 131 IIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSYD 190
Query: 913 YKFPLEEK-KIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 191 YNFEWYGGVMVRCLCGAASCSGFL 214
>gi|162463380|ref|NP_001105665.1| SET domain-containing protein SET102 [Zea mays]
gi|22121720|gb|AAM89289.1| SET domain-containing protein SET102 [Zea mays]
gi|414587223|tpg|DAA37794.1| TPA: SET domain-containing protein SET102 [Zea mays]
Length = 513
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K + R R R+ WGLVA E I A FVIEY GE+I K + R YE + +Y
Sbjct: 81 KCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDAY 140
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DAT++G +ARFINHSC PNC T+ +V G+ ++ I+AK++I G E++Y+
Sbjct: 141 IIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSYD 200
Query: 913 YKFPLEEK-KIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 201 YNFEWYGGVMVRCLCGAASCSGFL 224
>gi|413955941|gb|AFW88590.1| putative SET-domain containing protein family [Zea mays]
Length = 730
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 123/258 (47%), Gaps = 32/258 (12%)
Query: 662 RKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISG 721
RKRT DG + + + A +Q+ K + T C + GC++S
Sbjct: 469 RKRTDDGKQCYTQYS---------PCACQQMCGKDCPCADKGT--CCEKYCGCSKSC--- 514
Query: 722 WEWHKWSLNASPAERARVRGAQ---YVHTKYLGPEVNASQWAN-GKGLSARTNRVKLRNL 777
K +++ R Q + ++ P+V + W + G G L
Sbjct: 515 ----KNKFRGCHCAKSQCRSRQCPCFAASRECDPDVCRNCWVSCGDG--------SLGEP 562
Query: 778 LAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISD 837
LA +G + + +++R+ RS + WG P+ D++ EY GELI K +D
Sbjct: 563 LARGDGYQCGNMKLLLKQQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEAD 622
Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFI 897
R Y++ SS+LF L+D YV+DA ++G +F NHS NPNCY KV+ V G ++ I
Sbjct: 623 KRGKIYDRAN--SSFLFDLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGI 680
Query: 898 YAKRHIAAGEEITYNYKF 915
YAK HI A EE+ Y+Y++
Sbjct: 681 YAKEHIEASEELFYDYRY 698
>gi|356576073|ref|XP_003556159.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
max]
Length = 480
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL+A E I+A FVIEY GE+I K + R YE G+ +++ L+ +DAT++G
Sbjct: 95 WGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYENQGLKDAFIICLNASESIDATRKG 154
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+AK I G E+ Y+Y F K+ C CG
Sbjct: 155 SLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIPIGNELAYDYNFEWFGGAKVRCLCG 214
Query: 928 SKKCHGSL 935
+ KC G L
Sbjct: 215 ALKCSGFL 222
>gi|222628880|gb|EEE61012.1| hypothetical protein OsJ_14832 [Oryza sativa Japonica Group]
Length = 518
Score = 113 bits (283), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K++ R +++ WGL+A E I A FV+EY GE+I K + R YE G+ +Y
Sbjct: 82 KSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRRSQAYENQGLTDAY 141
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DATK+G +ARFINHSC PNC T+ +V G+ ++ I+AK+ I G E++Y+
Sbjct: 142 IIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGTELSYD 201
Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 202 YNFEWFGGAMVRCLCGAGSCSGFL 225
>gi|224143229|ref|XP_002324887.1| SET domain protein [Populus trichocarpa]
gi|222866321|gb|EEF03452.1| SET domain protein [Populus trichocarpa]
Length = 87
Score = 113 bits (283), Expect = 5e-22, Method: Composition-based stats.
Identities = 50/87 (57%), Positives = 64/87 (73%), Gaps = 2/87 (2%)
Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
Y FR+D +++DAT++GGIARF+NHSC PNC KVISV +KK+ +A+R I GEEITY
Sbjct: 1 YFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDIYPGEEITY 60
Query: 912 NYKFPLEE--KKIPCYCGSKKCHGSLN 936
+Y F E+ KKIPC+C SK C LN
Sbjct: 61 DYHFNNEDEGKKIPCFCNSKNCRRYLN 87
>gi|242044114|ref|XP_002459928.1| hypothetical protein SORBIDRAFT_02g016820 [Sorghum bicolor]
gi|241923305|gb|EER96449.1| hypothetical protein SORBIDRAFT_02g016820 [Sorghum bicolor]
Length = 166
Score = 113 bits (282), Expect = 5e-22, Method: Composition-based stats.
Identities = 48/98 (48%), Positives = 69/98 (70%), Gaps = 1/98 (1%)
Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
R+ + MG G+ Y+FR+DD V+DAT+ G IA INHSC PNCY++ I++ G + I I+
Sbjct: 47 RQLLFPLMGAGT-YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRAITILGDEHIIIF 105
Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
AKR I EE+TY+Y+F ++++PCYCG KC G +N
Sbjct: 106 AKRDIDPWEELTYDYRFFSSDQRLPCYCGFPKCRGVVN 143
>gi|116309320|emb|CAH66406.1| OSIGBa0093L02.2 [Oryza sativa Indica Group]
Length = 519
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K++ R +++ WGL+A E I A FV+EY GE+I K + R YE G+ +Y
Sbjct: 83 KSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRRSQAYENQGLTDAY 142
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DATK+G +ARFINHSC PNC T+ +V G+ ++ I+AK+ I G E++Y+
Sbjct: 143 IIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGTELSYD 202
Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 203 YNFEWFGGAMVRCLCGAGSCSGFL 226
>gi|428167957|gb|EKX36908.1| hypothetical protein GUITHDRAFT_78620, partial [Guillardia theta
CCMP2712]
Length = 133
Score = 113 bits (282), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 59/130 (45%), Positives = 80/130 (61%), Gaps = 12/130 (9%)
Query: 768 RTNRVKLRN--LLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
R NR R + ++ ++ + Q+K+R KRL F++S IH WGL A E IE E+ VIE
Sbjct: 14 RANRRATRKQATVTSSISSDAFRFDQLKSRVKRLSFRKSAIHGWGLYADEVIEPEEMVIE 73
Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
Y GE E YE+ G+GSSY+FR+ V+DAT G AR+INHSC PNCYT+
Sbjct: 74 YKGE----------EKMYERSGMGSSYMFRVSKSQVIDATHCGSRARYINHSCEPNCYTR 123
Query: 886 VISVEGQKKI 895
+I V G+ K+
Sbjct: 124 IIHVGGKPKV 133
>gi|356535766|ref|XP_003536414.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
max]
gi|34529091|dbj|BAC85636.1| unnamed protein product [Homo sapiens]
Length = 480
Score = 113 bits (282), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL+A E I+A FVIEY GE+I K + R YE G+ +++ L+ +DAT++G
Sbjct: 95 WGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYENQGLKDAFIIFLNVSESIDATRKG 154
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+AK I G E+ Y+Y F K+ C CG
Sbjct: 155 SLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIPIGTELAYDYNFEWFGGAKVRCLCG 214
Query: 928 SKKCHGSL 935
+ KC G L
Sbjct: 215 ALKCSGFL 222
>gi|413955942|gb|AFW88591.1| putative SET-domain containing protein family [Zea mays]
Length = 555
Score = 113 bits (282), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 123/258 (47%), Gaps = 32/258 (12%)
Query: 662 RKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISG 721
RKRT DG + + + A +Q+ K + T C + GC++S
Sbjct: 294 RKRTDDGKQCYTQYS---------PCACQQMCGKDCPCADKGT--CCEKYCGCSKSC--- 339
Query: 722 WEWHKWSLNASPAERARVRGAQ---YVHTKYLGPEVNASQWAN-GKGLSARTNRVKLRNL 777
K +++ R Q + ++ P+V + W + G G L
Sbjct: 340 ----KNKFRGCHCAKSQCRSRQCPCFAASRECDPDVCRNCWVSCGDG--------SLGEP 387
Query: 778 LAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISD 837
LA +G + + +++R+ RS + WG P+ D++ EY GELI K +D
Sbjct: 388 LARGDGYQCGNMKLLLKQQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEAD 447
Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFI 897
R Y++ SS+LF L+D YV+DA ++G +F NHS NPNCY KV+ V G ++ I
Sbjct: 448 KRGKIYDRAN--SSFLFDLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGI 505
Query: 898 YAKRHIAAGEEITYNYKF 915
YAK HI A EE+ Y+Y++
Sbjct: 506 YAKEHIEASEELFYDYRY 523
>gi|350583914|ref|XP_003481621.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Sus scrofa]
Length = 5154
Score = 113 bits (282), Expect = 6e-22, Method: Composition-based stats.
Identities = 60/138 (43%), Positives = 90/138 (65%), Gaps = 3/138 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5015 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5073
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5074 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5133
Query: 916 PLE--EKKIPCYCGSKKC 931
E + KIPC+CG+ C
Sbjct: 5134 DFEDDQHKIPCHCGAWNC 5151
>gi|302825340|ref|XP_002994293.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
gi|300137824|gb|EFJ04637.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
Length = 820
Score = 112 bits (281), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 84/139 (60%)
Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
+RLR + ++ WGL A E IE DF+IEY+GE+I K + R ++ G + YL +
Sbjct: 176 RRLRLKETENCGWGLFADENIERGDFLIEYIGEVIDDKTCEERLWDLKERGENNFYLCEV 235
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP 916
V+DAT +G ++RFINHSCNPN + +G+ +I ++A I G+EITY+YK+
Sbjct: 236 GHDKVIDATFKGNMSRFINHSCNPNAQLRKWQCDGELRIGVFAVSRILKGQEITYDYKYI 295
Query: 917 LEEKKIPCYCGSKKCHGSL 935
+ C+CGSK C G L
Sbjct: 296 QFGTEQQCHCGSKNCKGIL 314
>gi|224143888|ref|XP_002325110.1| SET domain protein [Populus trichocarpa]
gi|222866544|gb|EEF03675.1| SET domain protein [Populus trichocarpa]
Length = 516
Score = 112 bits (281), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 86/141 (60%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
++K+++ +++ WG+ A EP+ DF+IEY+GE+I K+ + R + G+ + Y+
Sbjct: 346 KEKKIKIVKTEFCGWGVEAAEPLNKGDFIIEYIGEVIDDKLCEQRLWDMKYKGVQNFYMC 405
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
+ + +DAT +G +RF+NHSC PNC + VEG+ ++ ++A I GE +TY+Y+
Sbjct: 406 EIRKDFTIDATFKGNSSRFLNHSCKPNCILEKWDVEGETRVGVFAAGSIRVGEPLTYDYR 465
Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
F ++ CYCG+ C G L
Sbjct: 466 FVRFGPEVKCYCGAPNCQGYL 486
>gi|2358287|gb|AAC51735.1| ALR [Homo sapiens]
Length = 4957
Score = 112 bits (281), Expect = 8e-22, Method: Composition-based stats.
Identities = 60/143 (41%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 4816 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 4874
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 4875 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4934
Query: 916 PLEEKK--IPCYCGSKKCHGSLN 936
E+ + IPC+CG+ C +N
Sbjct: 4935 DFEDDQHEIPCHCGAWNCRKWMN 4957
>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
Length = 5262
Score = 112 bits (281), Expect = 8e-22, Method: Composition-based stats.
Identities = 60/143 (41%), Positives = 92/143 (64%), Gaps = 3/143 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
K + RS+I GL A + +E VIEY+G +IR ++++ RE YE+ G Y+FR
Sbjct: 5121 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5179
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+++ +V+DAT GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+F
Sbjct: 5180 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5239
Query: 916 PLEEKK--IPCYCGSKKCHGSLN 936
E+ + IPC+CG+ C +N
Sbjct: 5240 DFEDDQHEIPCHCGAWNCRKWMN 5262
>gi|340378403|ref|XP_003387717.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like
[Amphimedon queenslandica]
Length = 862
Score = 112 bits (281), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 55/134 (41%), Positives = 80/134 (59%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
++++ WGL A I FV+EY GE+ L+ + R YEK Y L ++
Sbjct: 129 KTEMKGWGLKATCDISRYSFVMEYCGEVCSLEEFERRRNIYEKESRRHYYFMSLKTDEIL 188
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
DAT++G ++RFINHSC PNC T+ +V G+ ++ +A RHI AGEE+T++Y+F E
Sbjct: 189 DATRKGNLSRFINHSCEPNCETQKWTVNGRLRVGFFALRHIPAGEELTFDYQFQRFGESV 248
Query: 922 IPCYCGSKKCHGSL 935
CYCGS+ C G L
Sbjct: 249 QKCYCGSETCRGFL 262
>gi|159470003|ref|XP_001693149.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158277407|gb|EDP03175.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 1708
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 58/146 (39%), Positives = 84/146 (57%), Gaps = 8/146 (5%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG-SSYLFRL 856
RL +S IH WG A P + D +IEY GELIR +SD+RE R +G +Y+F L
Sbjct: 1521 RLAAGKSAIHGWGAFAKVPHKRGDMLIEYAGELIRPVVSDVREKRMYNDLVGCGTYIFSL 1580
Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ------KKIFIYAKRHIAAGEEIT 910
+ +DATK G +A +NHSC+PNCY++ I++ + I AKR + EE+T
Sbjct: 1581 NGQQHIDATKAGNMAHLLNHSCDPNCYSRAITLTDPLTGATTDHVIITAKRDLQPWEELT 1640
Query: 911 YNYKFPLEEKKIPCYCGSKKCHGSLN 936
Y+Y+F ++PC CG+ C +N
Sbjct: 1641 YDYRF-NSAVELPCNCGAASCRLLVN 1665
>gi|162459885|ref|NP_001105650.1| histone-lysine N-methyltransferase EZ2 [Zea mays]
gi|33112288|sp|Q8S4P5.1|EZ2_MAIZE RecName: Full=Histone-lysine N-methyltransferase EZ2; AltName:
Full=Enhancer of zeste protein 2
gi|20152909|gb|AAM13421.1|AF443597_1 enhancer of zeste-like protein 2 [Zea mays]
gi|413955940|gb|AFW88589.1| putative SET-domain containing protein family [Zea mays]
Length = 894
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 123/258 (47%), Gaps = 32/258 (12%)
Query: 662 RKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISG 721
RKRT DG + + + A +Q+ K + T C + GC++S
Sbjct: 633 RKRTDDGKQCYTQYS---------PCACQQMCGKDCPCADKGT--CCEKYCGCSKSC--- 678
Query: 722 WEWHKWSLNASPAERARVRGAQ---YVHTKYLGPEVNASQWAN-GKGLSARTNRVKLRNL 777
K +++ R Q + ++ P+V + W + G G L
Sbjct: 679 ----KNKFRGCHCAKSQCRSRQCPCFAASRECDPDVCRNCWVSCGDG--------SLGEP 726
Query: 778 LAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISD 837
LA +G + + +++R+ RS + WG P+ D++ EY GELI K +D
Sbjct: 727 LARGDGYQCGNMKLLLKQQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEAD 786
Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFI 897
R Y++ SS+LF L+D YV+DA ++G +F NHS NPNCY KV+ V G ++ I
Sbjct: 787 KRGKIYDRAN--SSFLFDLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGI 844
Query: 898 YAKRHIAAGEEITYNYKF 915
YAK HI A EE+ Y+Y++
Sbjct: 845 YAKEHIEASEELFYDYRY 862
>gi|218194855|gb|EEC77282.1| hypothetical protein OsI_15918 [Oryza sativa Indica Group]
Length = 472
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K++ R +++ WGL+A E I A FV+EY GE+I K + R YE G+ +Y
Sbjct: 77 KSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRRSQAYENQGLTDAY 136
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DATK+G +ARFINHSC PNC T+ +V G+ ++ I+AK+ I G E++Y+
Sbjct: 137 IIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGTELSYD 196
Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
Y F + C CG+ C G L
Sbjct: 197 YNFEWFGGAMVRCLCGAGSCSGFL 220
>gi|359493199|ref|XP_003634540.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 515
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K + + R++ WGL+A E I+A FVIEY GE+I K + R Y +G+ ++
Sbjct: 93 KCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYASLGLKDAF 152
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DATK+G + RFINHSC PNC T+ +V G+ ++ I+AK+ I+ G E+ YN
Sbjct: 153 IISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDISIGTELAYN 212
Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
Y F K+ C CG+ C G L
Sbjct: 213 YNFEWYGGAKVRCLCGAISCSGFL 236
>gi|385301625|gb|EIF45804.1| putative histone lysine methyltransferase set2p [Dekkera
bruxellensis AWRI1499]
Length = 695
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 87/149 (58%), Gaps = 15/149 (10%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQR+++ D +G+ A I A F+IEY GE+I + IR+ Y K GI
Sbjct: 111 RFQRNQMADISIFLTEHKGYGMRANCDIPANTFIIEYKGEVIDEQAYKIRKEAYAKEGIK 170
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y + DG ++DATK+G + RF NHSC+PN Y + V + ++ I+AKR I GEEI
Sbjct: 171 HFYFMMIQDGQIIDATKKGSLGRFCNHSCDPNAYXEKWVVNKRYRMGIFAKRKIPKGEEI 230
Query: 910 TYNY---KFPLEEKKIPCYCGSKKCHGSL 935
T++Y ++ E +K CYCG+K C G L
Sbjct: 231 TFDYNVDRYGAEPQK--CYCGAKNCVGYL 257
>gi|449446403|ref|XP_004140961.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Cucumis
sativus]
Length = 497
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 95/169 (56%), Gaps = 9/169 (5%)
Query: 776 NLLAAAEGAELLKASQVKARKKRLR---FQRSKI-----HDWGLVALEPIEAEDFVIEYV 827
N+L + E S V R +R + + ++K+ WGL+A E I+ F+IEY
Sbjct: 54 NVLTSTECTPGHCPSGVHCRNQRFQKCEYAKTKLFKTEGRGWGLLADENIKNGQFIIEYC 113
Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
GE+I K + R YE G+ +Y+ L+ +DAT++G +ARFINHSC PNC T+
Sbjct: 114 GEVISWKEAKRRSHTYENQGLKDAYIISLNASESIDATRKGSLARFINHSCFPNCETRKW 173
Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCGSKKCHGSL 935
+V G+ ++ I+AK+ I+ G E+ Y+Y F K+ C CG+ C G L
Sbjct: 174 NVLGEIRVGIFAKQDISIGTELAYDYNFEWYGGAKVRCLCGASSCSGFL 222
>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
Length = 1179
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 84/142 (59%), Gaps = 9/142 (6%)
Query: 799 LRFQRSKIHDWG---LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
L+FQ DWG L A + I F+IEY+GE+I S IR K+G+ + Y+
Sbjct: 994 LKFQT----DWGGNGLKAKKLIPKGSFIIEYIGEIISHDESRIRLEESAKIGVTNYYILE 1049
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
LD+ ++DA RG IARFINHSC+PNC V+G +I I++KR I GEE+T+NY+
Sbjct: 1050 LDNLRMIDAGPRGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQL 1109
Query: 916 PL--EEKKIPCYCGSKKCHGSL 935
+E K C CGSK C G +
Sbjct: 1110 QQSSDEGKTKCLCGSKNCAGFI 1131
>gi|359493195|ref|XP_003634538.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 413
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 82/134 (61%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
R++ WGL+A E I+A +FV+EY GE+I + R Y G+ Y+ L+ +
Sbjct: 120 RAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDVYIIPLNARECI 179
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
DATK+G +ARFINHSC PNC T SV G+ ++ I+A R+I+ G E+TY+Y F K
Sbjct: 180 DATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTYSYNFEWYSGAK 239
Query: 922 IPCYCGSKKCHGSL 935
+ C CG+ +C G L
Sbjct: 240 VRCLCGATRCSGFL 253
>gi|147816102|emb|CAN75267.1| hypothetical protein VITISV_028162 [Vitis vinifera]
Length = 460
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
R++ WGL+A E I+A F+IEY GE+I + R Y GI +Y+ L+ +
Sbjct: 166 RTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGINDAYIISLNARECI 225
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
DATK G ARFINHSC PNC T+ SV G+ +I I+A R I+ G E+TY+Y F K
Sbjct: 226 DATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTELTYDYNFQWYGGAK 285
Query: 922 IPCYCGSKKCHGSL 935
+ C CG+ C G L
Sbjct: 286 VHCLCGATSCCGFL 299
>gi|296081208|emb|CBI18234.3| unnamed protein product [Vitis vinifera]
Length = 862
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
K + + R++ WGL+A E I+A FVIEY GE+I K + R Y +G+ ++
Sbjct: 494 KCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYASLGLKDAF 553
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
+ L+ +DATK+G + RFINHSC PNC T+ +V G+ ++ I+AK+ I+ G E+ YN
Sbjct: 554 IISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDISIGTELAYN 613
Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
Y F K+ C CG+ C G L
Sbjct: 614 YNFEWYGGAKVRCLCGAISCSGFL 637
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 81/139 (58%), Gaps = 1/139 (0%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
+ + R++ WGL+A E I+A F+IEY GE+I + R Y GI +Y+ L+
Sbjct: 76 KTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGINDAYIISLN 135
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP- 916
+DATK G ARFINHSC PNC T+ SV G+ +I I+A R I+ G E+TY+Y F
Sbjct: 136 ARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTELTYDYNFQW 195
Query: 917 LEEKKIPCYCGSKKCHGSL 935
K+ C CG+ C G L
Sbjct: 196 YGGAKVHCLCGATSCCGFL 214
>gi|342209884|gb|AEL16989.1| ASH1-like protein [Phaseolus vulgaris]
Length = 481
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 79/128 (61%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL+A E ++A FVIEY GE+I K + R YE G+ +++ L+ +DAT++G
Sbjct: 95 WGLLAGEDLKAGQFVIEYCGEVISWKEAKRRSQAYENQGLKDAFIICLNASESIDATRKG 154
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+AK + G E+ Y+Y F K+ C CG
Sbjct: 155 SLARFINHSCRPNCETRKWNVLGEIRVGIFAKHDVPIGTELAYDYNFEWFGGAKVRCLCG 214
Query: 928 SKKCHGSL 935
+ KC G L
Sbjct: 215 ALKCSGFL 222
>gi|359493197|ref|XP_003634539.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 413
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
R++ WGL+A E I+A F+IEY GE+I + R Y GI +Y+ L+ +
Sbjct: 119 RTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGINDAYIISLNARECI 178
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
DATK G ARFINHSC PNC T+ SV G+ +I I+A R I+ G E+TY+Y F K
Sbjct: 179 DATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTELTYDYNFQWYGGAK 238
Query: 922 IPCYCGSKKCHGSL 935
+ C CG+ C G L
Sbjct: 239 VHCLCGATSCCGFL 252
>gi|281206847|gb|EFA81031.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 1363
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 86/148 (58%), Gaps = 17/148 (11%)
Query: 797 KRLRFQR----------SKIHDWGLVALEPIEAEDFVIEYVGELI-RLKISD-IRECRYE 844
K RFQR +K WGL A E I A FVIEY GE+I R + D +RE E
Sbjct: 973 KNQRFQRQEYALIAPFNAKKKGWGLKAKEKISAHQFVIEYCGEVITRAQSMDRMREADGE 1032
Query: 845 KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
K Y LD V+DA+++G +ARFINHSC+PNC T+ SV+G+ +I I+A + I
Sbjct: 1033 KY----FYFLTLDSKEVLDASRKGNLARFINHSCDPNCETQKWSVDGETRIGIFALKDIE 1088
Query: 905 AGEEITYNYKFP-LEEKKIPCYCGSKKC 931
AG E+T++Y + + K CYCGS C
Sbjct: 1089 AGTELTFDYNYERVGSSKQSCYCGSVNC 1116
>gi|452846178|gb|EME48111.1| hypothetical protein DOTSEDRAFT_167709 [Dothistroma septosporum
NZE10]
Length = 963
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 82/149 (55%), Gaps = 15/149 (10%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQR K D +GL A + + DFV EY+GE+I + R +Y++ GI
Sbjct: 202 RFQRKKYADVTVIKTEKKGYGLRADKELRPGDFVYEYIGEVIGENVFRRRMQQYDEEGIK 261
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y L G VDATK+G + RF NHSCNPNCY V + ++ I+ +R+I AGEE+
Sbjct: 262 HFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVNDKLRMGIFVERNIQAGEEL 321
Query: 910 TYNY---KFPLEEKKIPCYCGSKKCHGSL 935
+NY ++ + + PCYCG C G +
Sbjct: 322 VFNYNVDRYGADPQ--PCYCGEPNCTGYI 348
>gi|296081207|emb|CBI18233.3| unnamed protein product [Vitis vinifera]
Length = 370
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 84/139 (60%), Gaps = 1/139 (0%)
Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
+ + R++ WGL+A E I+A +FV+EY GE+I + R Y G+ Y+ L+
Sbjct: 72 KTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDVYIIPLN 131
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP- 916
+DATK+G +ARFINHSC PNC T SV G+ ++ I+A R+I+ G E+TY+Y F
Sbjct: 132 ARECIDATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTYSYNFEW 191
Query: 917 LEEKKIPCYCGSKKCHGSL 935
K+ C CG+ +C G L
Sbjct: 192 YSGAKVRCLCGATRCSGFL 210
>gi|196001997|ref|XP_002110866.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
gi|190586817|gb|EDV26870.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
Length = 1004
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/128 (40%), Positives = 78/128 (60%), Gaps = 1/128 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
+GL LE +E FV+EY GE+I L+ + R+ Y K I Y L ++DA+++G
Sbjct: 135 FGLRTLEDLEDNQFVLEYCGEVIDLREFERRKRDYAKKKIKHYYFMTLSPNEIIDASRKG 194
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI-PCYCG 927
+RFINHSC+PNC T+ +V G +I + R I A E+T++Y+F +++ CYCG
Sbjct: 195 TFSRFINHSCDPNCVTQKWTVNGMLRIGFFTLRKIPANTELTFDYQFERYGREVQECYCG 254
Query: 928 SKKCHGSL 935
S+KC G L
Sbjct: 255 SEKCRGYL 262
>gi|398407533|ref|XP_003855232.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
gi|339475116|gb|EGP90208.1| histone methyltransferase [Zymoseptoria tritici IPO323]
Length = 799
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 81/150 (54%), Gaps = 15/150 (10%)
Query: 799 LRFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGI 848
+RFQR K + +GL + DF+ EY+GE+I + R +Y++ GI
Sbjct: 115 MRFQRKKYANVDVIKTEKKGYGLRTQTDLRPNDFIFEYIGEVIGENVFRRRMQQYDEDGI 174
Query: 849 GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEE 908
Y L G VDATKRG + RF NHSCNPNCY V + ++ I+A+R I AGEE
Sbjct: 175 KHFYFMSLTKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERAIQAGEE 234
Query: 909 ITYNY---KFPLEEKKIPCYCGSKKCHGSL 935
+ +NY ++ E + PCYCG C G +
Sbjct: 235 LVFNYNVDRYGAEPQ--PCYCGEANCTGYI 262
>gi|33305503|gb|AAQ02781.1|AF373874_1 Mll protein [Xenopus laevis]
Length = 84
Score = 111 bits (277), Expect = 2e-21, Method: Composition-based stats.
Identities = 50/84 (59%), Positives = 62/84 (73%), Gaps = 2/84 (2%)
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
R+DD VVDAT G ARFINHSC PNCY++VI ++GQK I I+A R I GEE+TY+YK
Sbjct: 1 RIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIYRGEELTYDYK 60
Query: 915 FPLEE--KKIPCYCGSKKCHGSLN 936
FP+E+ K+ C CG+KKC LN
Sbjct: 61 FPIEDANNKLACNCGTKKCRKFLN 84
>gi|225562106|gb|EEH10386.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
Length = 1000
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 75/145 (51%), Gaps = 11/145 (7%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQR + D +GL A + F+ EY+GE+I + R Y++ GI
Sbjct: 199 RFQRRQYADVSVIKTEKKGYGLRAESDLRPNQFIFEYIGEVINETLFRKRMLSYDEEGIK 258
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y L G VDATK+G + RF NHSCNPNCY V + ++ I+A+RHI AGEE+
Sbjct: 259 HFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEEL 318
Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHG 933
+NY PCYCG C G
Sbjct: 319 VFNYNVDRYGADPQPCYCGEPNCTG 343
>gi|407926722|gb|EKG19683.1| WW/Rsp5/WWP [Macrophomina phaseolina MS6]
Length = 972
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 74/132 (56%), Gaps = 1/132 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
R+ +GL A ++ DFV EY+GE+I + R +Y++ GI Y L G V
Sbjct: 214 RTDKKGYGLRANTDLKPNDFVYEYIGEVINERAFRGRMVQYDEEGIKHFYFMSLSKGEFV 273
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
DATK+G + RF NHSCNPNC+ V + ++ I+A+RHI AGEE+ +NY
Sbjct: 274 DATKKGNLGRFCNHSCNPNCFVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADP 333
Query: 922 IPCYCGSKKCHG 933
PCYCG C G
Sbjct: 334 QPCYCGEANCTG 345
>gi|302836397|ref|XP_002949759.1| histone H3 lys4 methyltransferase [Volvox carteri f. nagariensis]
gi|300265118|gb|EFJ49311.1| histone H3 lys4 methyltransferase [Volvox carteri f. nagariensis]
Length = 118
Score = 111 bits (277), Expect = 2e-21, Method: Composition-based stats.
Identities = 51/118 (43%), Positives = 76/118 (64%), Gaps = 5/118 (4%)
Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL----D 857
+RS I G+ A E I A+ ++EY GE +R ++D+RE RY G+G YLF D
Sbjct: 2 RRSGIAQLGVFAAERIPADTLLMEYHGEAVRTSVADLREKRYSAAGLGC-YLFNPGGLPD 60
Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
+ V+DAT RG I RF+NHSC PNC ++ + +EGQ++IF+++ + G E+TY+YK
Sbjct: 61 EAVVLDATHRGNIMRFVNHSCGPNCVSRTVVIEGQRRIFLFSLHELHPGTELTYDYKL 118
>gi|147846734|emb|CAN80636.1| hypothetical protein VITISV_017995 [Vitis vinifera]
Length = 278
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 82/134 (61%), Gaps = 1/134 (0%)
Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
R++ WGL+A E I+A +FV+EY GE+I + R Y G+ Y+ L+ +
Sbjct: 120 RAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDVYIIPLNARECI 179
Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
DATK+G +ARFINHSC PNC T SV G+ ++ I+A R+I+ G E+TY+Y F K
Sbjct: 180 DATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTYSYNFEWYSXAK 239
Query: 922 IPCYCGSKKCHGSL 935
+ C CG+ +C G L
Sbjct: 240 VRCLCGATRCSGFL 253
>gi|452985541|gb|EME85297.1| hypothetical protein MYCFIDRAFT_83386 [Pseudocercospora fijiensis
CIRAD86]
Length = 835
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 81/147 (55%), Gaps = 15/147 (10%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQR + + +GL A + A DF+ EY+GE+I + + +Y++ GI
Sbjct: 122 RFQRKQYANVSVIQTEKKGYGLRANTDLRANDFIFEYIGEVIGENVFRRKMQQYDEQGIK 181
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y L G VDATK+G + RF NHSCNPNCY V + ++ I+A+R I AGEE+
Sbjct: 182 HFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERKIKAGEEL 241
Query: 910 TYNY---KFPLEEKKIPCYCGSKKCHG 933
+NY ++ E + PCYCG C G
Sbjct: 242 VFNYNVDRYGAEPQ--PCYCGEPNCTG 266
>gi|410902883|ref|XP_003964923.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Takifugu
rubripes]
Length = 766
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/135 (41%), Positives = 80/135 (59%), Gaps = 2/135 (1%)
Query: 781 AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE 840
++G S + KK L S + WG EP++ +F+ EY GELI +D R
Sbjct: 616 SKGVSCKNCSIQRGLKKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRG 675
Query: 841 CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
Y+K SS+LF L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AK
Sbjct: 676 RIYDKYM--SSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAK 733
Query: 901 RHIAAGEEITYNYKF 915
R I GEE+ ++Y++
Sbjct: 734 RAILQGEELFFDYRY 748
>gi|449300569|gb|EMC96581.1| hypothetical protein BAUCODRAFT_481348 [Baudoinia compniacensis
UAMH 10762]
Length = 992
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 76/130 (58%), Gaps = 5/130 (3%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
+GL ++A DF+ EY+GE+I + R +Y++ GI Y L G VDATK+G
Sbjct: 222 YGLRTNTNLKANDFIFEYIGEVIGENVFRRRMQQYDEEGIKHFYFMSLTKGEFVDATKKG 281
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY---KFPLEEKKIPCY 925
+ RF NHSCNPNCY V + ++ I+A+R I AGEE+ +NY ++ E + PCY
Sbjct: 282 NLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERKIQAGEELVFNYNVDRYGAEPQ--PCY 339
Query: 926 CGSKKCHGSL 935
CG C G +
Sbjct: 340 CGEPNCTGYI 349
>gi|313221636|emb|CBY36121.1| unnamed protein product [Oikopleura dioica]
Length = 207
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 72/114 (63%), Gaps = 2/114 (1%)
Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
F+IEY+GE+I S IR K+G+ + Y+ LD+ ++DA RG IARFINHSC+PN
Sbjct: 44 FIIEYIGEIISHDESRIRLEESAKIGVTNYYILELDNLRMIDAGPRGNIARFINHSCDPN 103
Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL--EEKKIPCYCGSKKCHG 933
C V+G +I I++KR I GEE+T+NY+ +E K C CGSK C G
Sbjct: 104 CGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQLQQSSDEGKTKCLCGSKNCAG 157
>gi|159490078|ref|XP_001703016.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158270923|gb|EDO96754.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 2983
Score = 110 bits (275), Expect = 4e-21, Method: Composition-based stats.
Identities = 63/176 (35%), Positives = 96/176 (54%), Gaps = 12/176 (6%)
Query: 771 RVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGEL 830
R R L A L +A + + R+ F +S IH WG+ A I + V E+ GE+
Sbjct: 2810 RASERRLKAVRWEERLQRA--LATERARITFGKSGIHGWGVFARVDIPQDAVVTEFRGEV 2867
Query: 831 IRLKISDIRECRYEKMGIG------SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
+R ++++RE RY G YLF + V+D+T G RF NHSC+P+ YT
Sbjct: 2868 VRPVLAEMRERRYRAAGRAWACAGKDCYLFHVSRELVLDSTHLGHYGRFANHSCSPSLYT 2927
Query: 885 KVISVEGQKKIFIY--AKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
KV+ EG +++ + A+ I AG+E+T++Y+F EE K+ C CG+ C G+LN
Sbjct: 2928 KVLEFEGGRRVRLAFCARVDIRAGQELTFDYRFKEEEGSAKVACRCGAPNCKGTLN 2983
>gi|406606267|emb|CCH42258.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 1074
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 56/126 (44%), Positives = 75/126 (59%), Gaps = 3/126 (2%)
Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
GL+++ A V+EY GE+I L + R K S Y L++ YV+DA ++G
Sbjct: 259 GLLSIRSFNAGSLVVEYTGEVIHLDEVEHRLNTIYKES-DSYYFLGLEEEYVIDAGQKGS 317
Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP--CYCG 927
+ARF NHSC+PN + V G+ +I ++AKR I AGEEITY+Y F E P CYCG
Sbjct: 318 VARFANHSCDPNAEMQKWYVNGEPRIGLFAKRSIEAGEEITYDYNFEWFENGEPQKCYCG 377
Query: 928 SKKCHG 933
SK CHG
Sbjct: 378 SKNCHG 383
>gi|212535752|ref|XP_002148032.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
gi|210070431|gb|EEA24521.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
Length = 900
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 78/145 (53%), Gaps = 11/145 (7%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQR + + +GL A I+ F+ EY+GE+I R +Y+K GI
Sbjct: 211 RFQRKEYANVTVIKTAKKGFGLRAESDIQPHQFIYEYIGEVINEANFRRRMIQYDKEGIK 270
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y L+ G VDATK+G +ARF NHSCNPNCY V + ++ I+A+R+I AGEE+
Sbjct: 271 HFYFMSLNKGEFVDATKKGNLARFCNHSCNPNCYVDKWVVGEKLRMGIFAERYIQAGEEL 330
Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHG 933
+NY PCYCG C G
Sbjct: 331 VFNYNVDRYGADPQPCYCGEPNCSG 355
>gi|410927554|ref|XP_003977206.1| PREDICTED: histone-lysine N-methyltransferase EZH2-like [Takifugu
rubripes]
Length = 760
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 76/123 (61%), Gaps = 2/123 (1%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
+ KK L S + WG+ EP++ DF+ EY GE+I +D R Y+K S+
Sbjct: 622 RGAKKHLLLAPSDVAGWGIFIKEPVQKNDFISEYCGEIISQDEADRRGKVYDKYMC--SF 679
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
LF L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I GEE+ ++
Sbjct: 680 LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFD 739
Query: 913 YKF 915
Y++
Sbjct: 740 YRY 742
>gi|449510125|ref|XP_004176585.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2-like, partial [Taeniopygia guttata]
Length = 4299
Score = 110 bits (274), Expect = 4e-21, Method: Composition-based stats.
Identities = 60/144 (41%), Positives = 91/144 (63%), Gaps = 3/144 (2%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE-KMGIGSSYLF 854
K + RS+I GL A + IE VIEY+G +IR ++++ RE YE ++ Y+F
Sbjct: 4156 KNNVYLARSRIQGLGLYAAKDIEKHTMVIEYIGTIIRNEVANRREKIYEEQVRTRGIYMF 4215
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
R+++ +V+DA GG AR+INHSC PNC +V++ + + KI I + R I GEE+TY+Y+
Sbjct: 4216 RINNEHVIDAPLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 4275
Query: 915 FPLE--EKKIPCYCGSKKCHGSLN 936
F E + KIPC+CG+ C +N
Sbjct: 4276 FDFEDDQHKIPCHCGAWNCRKWMN 4299
>gi|16605541|emb|CAC86146.1| EZH2 homolog [Tetraodon nigroviridis]
Length = 759
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 76/123 (61%), Gaps = 2/123 (1%)
Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
+ KK L S + WG+ EP++ DF+ EY GE+I +D R Y+K S+
Sbjct: 621 RGAKKHLLLAPSDVAGWGIFIKEPVQKNDFISEYCGEIISQDEADRRGKVYDKYMC--SF 678
Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
LF L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I GEE+ ++
Sbjct: 679 LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFD 738
Query: 913 YKF 915
Y++
Sbjct: 739 YRY 741
>gi|296808927|ref|XP_002844802.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238844285|gb|EEQ33947.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 969
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 79/147 (53%), Gaps = 11/147 (7%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQ+ + D +GL A ++ DF+ EY+GE+I R +Y++ GI
Sbjct: 188 RFQKRQYADVTVIKTEKKGFGLRANTDLQPNDFIFEYIGEVINEPQFRKRMIQYDEEGIK 247
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y L+ G VDAT++G + RF NHSCNPNCY V + ++ I+A+RH+ AGEE+
Sbjct: 248 HFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKLRMGIFAERHVKAGEEL 307
Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHGSL 935
+NY PCYCG C G +
Sbjct: 308 VFNYNVDRYGADPQPCYCGEANCLGYI 334
>gi|432867980|ref|XP_004071355.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
EZH1-like [Oryzias latipes]
Length = 780
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG EP++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 645 KKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRGRIYDKYM--SSFLFN 702
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I GEE+ ++Y++
Sbjct: 703 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAILQGEELFFDYRY 762
>gi|295901334|dbj|BAJ07324.1| enhancer of zeste homolog 1 (Drosophila) [Oryzias latipes]
Length = 766
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG EP++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 631 KKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRGRIYDKYM--SSFLFN 688
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I GEE+ ++Y++
Sbjct: 689 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAILQGEELFFDYRY 748
>gi|449530380|ref|XP_004172173.1| PREDICTED: uncharacterized protein LOC101230765, partial [Cucumis
sativus]
Length = 627
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 64/178 (35%), Positives = 106/178 (59%), Gaps = 6/178 (3%)
Query: 89 GSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPASDY 148
GSI+ F + ++C+MLFD+ +QV+WNAV D VAEYSSAWR+++ WS P + +S Y
Sbjct: 449 GSIDNFREVHAVICQMLFDYSLQVVWNAVSYDTVAEYSSAWRRKRFWSYRPHYSLASSGY 508
Query: 149 KDDRKRMEQAPS-----RHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSK 203
+D K++E+ P+ R +SS+ +TEN + +SLSVPVG S+
Sbjct: 509 RDRVKKIEKTPAEASLPRKESSLHGVSSLSVSKFKGAQTENCARSAVISLSVPVGHKSSR 568
Query: 204 QKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINM 261
+ SC + +D+K +++ +E EL+ S K + EY++ ++E+EV +AS + +
Sbjct: 569 PTSHSCCER-PKEDLKWMVEYLEKELHSSAKVSMAEYIQDILEEEVISSCNASTDVKL 625
>gi|348509107|ref|XP_003442093.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like
[Oreochromis niloticus]
Length = 768
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG EP++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 633 KKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRGRIYDKYM--SSFLFN 690
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I GEE+ ++Y++
Sbjct: 691 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAILQGEELFFDYRY 750
>gi|340515729|gb|EGR45981.1| predicted protein [Trichoderma reesei QM6a]
Length = 933
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 80/147 (54%), Gaps = 15/147 (10%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQR + D +GL A P+E DF+ EY+GE+I R +Y++ GI
Sbjct: 173 RFQRKQYADVTVIKTEKKGFGLRANSPLEPNDFIYEYIGEVINEPTFRRRMLQYDEEGIK 232
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y L+ VDATK+G + RF NHSC+PNC+ V + ++ I+A R I AGEE+
Sbjct: 233 HFYFMSLNKNEFVDATKKGNLGRFCNHSCSPNCFVDKWVVGDKLRMGIFALRKIQAGEEL 292
Query: 910 TYNY---KFPLEEKKIPCYCGSKKCHG 933
+NY ++ E + PCYCG C G
Sbjct: 293 VFNYNVDRYGAEPQ--PCYCGEANCVG 317
>gi|320118893|ref|NP_001035072.2| enhancer of zeste 1 [Danio rerio]
Length = 756
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG EP++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 621 KKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRGRIYDKYM--SSFLFN 678
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I GEE+ ++Y++
Sbjct: 679 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQQGEELFFDYRY 738
>gi|302811554|ref|XP_002987466.1| hypothetical protein SELMODRAFT_446978 [Selaginella moellendorffii]
gi|300144872|gb|EFJ11553.1| hypothetical protein SELMODRAFT_446978 [Selaginella moellendorffii]
Length = 835
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 86/152 (56%), Gaps = 14/152 (9%)
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
G AR + + RN+ LLK Q R+ RS + WG P+ D++
Sbjct: 666 GPPARGDSYECRNMKL------LLKQQQ------RVLLGRSDVAGWGAFLKTPVNKHDYL 713
Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
EY GELI + +D R Y++ SS+LF L+D YV+DA ++G +F NHS NPNCY
Sbjct: 714 GEYTGELISHREADKRGKIYDREN--SSFLFNLNDQYVLDACRKGDKLKFANHSPNPNCY 771
Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
KVI V G ++ I+AK I+AGEE+ Y+Y++
Sbjct: 772 AKVIMVAGDHRVGIFAKERISAGEELFYDYRY 803
>gi|383316103|ref|YP_005376945.1| SET domain-containing protein [Frateuria aurantia DSM 6220]
gi|379043207|gb|AFC85263.1| SET domain-containing protein [Frateuria aurantia DSM 6220]
Length = 180
Score = 109 bits (273), Expect = 6e-21, Method: Composition-based stats.
Identities = 66/158 (41%), Positives = 91/158 (57%), Gaps = 15/158 (9%)
Query: 790 SQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG-I 848
S+ +R+R +RS IH G+ AL + D V+ Y GELI +D R Y G
Sbjct: 8 SRFPPMTQRIRARRSPIHGRGVFALADLAKGDCVVRYRGELISHAQADER---YGDDGES 64
Query: 849 GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV-ISVEG---QKKIFIYAKRHIA 904
G ++LF L++ YVVD +RG +AR+INHSC PNC + IS G + +I I A R IA
Sbjct: 65 GHTFLFTLNEQYVVDGNRRGNVARWINHSCMPNCQAVIEISAGGDPRRDRILIEAVRDIA 124
Query: 905 AGEEITYNYKFPLE-------EKKIPCYCGSKKCHGSL 935
AGEEITY+Y L ++ PC+CG+ +C G+L
Sbjct: 125 AGEEITYDYGIVLAVPHTAAMKRLWPCHCGAARCTGTL 162
>gi|156374111|ref|XP_001629652.1| predicted protein [Nematostella vectensis]
gi|156216657|gb|EDO37589.1| predicted protein [Nematostella vectensis]
Length = 463
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 91/155 (58%), Gaps = 7/155 (4%)
Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
KASQ + K R RS I GL A +E VIEY+G +IR ++++ +E Y
Sbjct: 310 KASQYRKLKTEWRQNVFLGRSNIQGLGLFANRDMEPGCMVIEYIGSIIRNEVANKKESIY 369
Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
E G Y+FR+D V+DAT GG AR+INHSC PNC +V++ E ++KI I + R I
Sbjct: 370 ESQNRGI-YMFRIDSDSVIDATIAGGPARYINHSCMPNCVAEVVTFEKEQKIIIISSRKI 428
Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
GEE+TY+YKF E E KI C CG+ C +N
Sbjct: 429 EKGEELTYDYKFDFEDDEHKISCLCGAPNCRKWMN 463
>gi|162463310|ref|NP_001105079.1| histone-lysine N-methyltransferase EZ3 [Zea mays]
gi|33112287|sp|Q8S4P4.1|EZ3_MAIZE RecName: Full=Histone-lysine N-methyltransferase EZ3; AltName:
Full=Enhancer of zeste protein 3
gi|20152911|gb|AAM13422.1|AF443598_1 enhancer of zeste-like protein 3 [Zea mays]
Length = 895
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 76/121 (62%), Gaps = 2/121 (1%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+++R+ RS + WG P+ D++ EY GELI K +D R Y++ SS+LF
Sbjct: 745 QQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEADKRGKIYDRAN--SSFLF 802
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
L+D YV+DA ++G +F NHS NPNCY KV+ V G ++ IYAK HI A EE+ Y+Y+
Sbjct: 803 DLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGIYAKEHIEASEELFYDYR 862
Query: 915 F 915
+
Sbjct: 863 Y 863
>gi|302796619|ref|XP_002980071.1| hypothetical protein SELMODRAFT_111860 [Selaginella moellendorffii]
gi|300152298|gb|EFJ18941.1| hypothetical protein SELMODRAFT_111860 [Selaginella moellendorffii]
Length = 820
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 86/152 (56%), Gaps = 14/152 (9%)
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
G AR + + RN+ LLK Q R+ RS + WG P+ D++
Sbjct: 651 GPPARGDSYECRNMKL------LLKQQQ------RVLLGRSDVAGWGAFLKTPVNKHDYL 698
Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
EY GELI + +D R Y++ SS+LF L+D YV+DA ++G +F NHS NPNCY
Sbjct: 699 GEYTGELISHREADKRGKIYDREN--SSFLFNLNDQYVLDACRKGDKLKFANHSPNPNCY 756
Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
KVI V G ++ I+AK I+AGEE+ Y+Y++
Sbjct: 757 AKVIMVAGDHRVGIFAKERISAGEELFYDYRY 788
>gi|414866465|tpg|DAA45022.1| TPA: putative SET-domain containing protein family [Zea mays]
Length = 898
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 76/121 (62%), Gaps = 2/121 (1%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+++R+ RS + WG P+ D++ EY GELI K +D R Y++ SS+LF
Sbjct: 748 QQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEADKRGKIYDRAN--SSFLF 805
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
L+D YV+DA ++G +F NHS NPNCY KV+ V G ++ IYAK HI A EE+ Y+Y+
Sbjct: 806 DLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGIYAKEHIEASEELFYDYR 865
Query: 915 F 915
+
Sbjct: 866 Y 866
>gi|67903676|ref|XP_682094.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
gi|74592565|sp|Q5ASA5.1|SET2_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|40740923|gb|EAA60113.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
gi|259482964|tpe|CBF77940.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-36 specific (EC
2.1.1.43)(SET domain-containing protein 2)
[Source:UniProtKB/Swiss-Prot;Acc:Q5ASA5] [Aspergillus
nidulans FGSC A4]
Length = 980
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 75/145 (51%), Gaps = 11/145 (7%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQR + + +GL A E + F+ EYVGE+I R +Y+ GI
Sbjct: 234 RFQRREYANVAVIKTEKKGYGLRAEEDLRPHQFIFEYVGEVINEGPFHRRMRQYDAEGIK 293
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y L G VDATK+G + RF NHSCNPNCY V + ++ I+A+RHI AGEE+
Sbjct: 294 HFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIQAGEEL 353
Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHG 933
+NY PCYCG C G
Sbjct: 354 VFNYNVDRYGADPQPCYCGEPNCTG 378
>gi|414866464|tpg|DAA45021.1| TPA: putative SET-domain containing protein family [Zea mays]
Length = 895
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 76/121 (62%), Gaps = 2/121 (1%)
Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
+++R+ RS + WG P+ D++ EY GELI K +D R Y++ SS+LF
Sbjct: 745 QQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEADKRGKIYDRAN--SSFLF 802
Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
L+D YV+DA ++G +F NHS NPNCY KV+ V G ++ IYAK HI A EE+ Y+Y+
Sbjct: 803 DLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGIYAKEHIEASEELFYDYR 862
Query: 915 F 915
+
Sbjct: 863 Y 863
>gi|384253240|gb|EIE26715.1| hypothetical protein COCSUDRAFT_59232 [Coccomyxa subellipsoidea
C-169]
Length = 798
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 55/126 (43%), Positives = 79/126 (62%), Gaps = 4/126 (3%)
Query: 789 ASQVKARKKRLRF--QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
A+ + A+KK RF +RS I G+ LE EA + ++E VGE++R ++ +RE +
Sbjct: 673 AAVIAAKKKMGRFNIRRSDISGLGVFTLEFCEAGELLMECVGEVLRRPVASMREAAAIER 732
Query: 847 GIG--SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
GI + Y+F LD +VVD T G IARF+NHSC+PNC I V+G + IF+ A R +A
Sbjct: 733 GINDVACYVFALDADHVVDTTSSGNIARFVNHSCSPNCVATAIDVKGARHIFLIAARQLA 792
Query: 905 AGEEIT 910
GEE+T
Sbjct: 793 PGEELT 798
>gi|357443579|ref|XP_003592067.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
gi|355481115|gb|AES62318.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
Length = 491
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 77/124 (62%), Gaps = 1/124 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL+A E I+A FVIEY GE+I K + R YE G+ +++ L+ +DAT++G
Sbjct: 108 WGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYEIQGLKDAFIISLNASESIDATRKG 167
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+A +I G E+ Y+Y F K+ C CG
Sbjct: 168 SLARFINHSCQPNCETRKWNVMGEIRVGIFALENIPIGTELAYDYNFEWFGGAKVRCLCG 227
Query: 928 SKKC 931
+ KC
Sbjct: 228 ALKC 231
>gi|336266160|ref|XP_003347849.1| hypothetical protein SMAC_06682 [Sordaria macrospora k-hell]
gi|380091782|emb|CCC10510.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 907
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
RFQR + D +GL A ++ DF+ EY+GE+I R +Y+K GI
Sbjct: 175 RFQRKQYADVSVIKTEKKGFGLRANTDLQVNDFIFEYIGEVINEPTFRSRMIKYDKEGIK 234
Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
Y L VDATK+G + RF NHSCNPNCY V + ++ I+A R I AGEE+
Sbjct: 235 HFYFMSLTKSEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAGRAIKAGEEL 294
Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHG 933
+NY PCYCG C G
Sbjct: 295 VFNYNVDRYGADPQPCYCGEPNCTG 319
>gi|149391157|gb|ABR25596.1| set domain containing protein [Oryza sativa Indica Group]
Length = 107
Score = 109 bits (272), Expect = 8e-21, Method: Composition-based stats.
Identities = 51/107 (47%), Positives = 72/107 (67%), Gaps = 6/107 (5%)
Query: 836 SDIRECRYE----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG 891
+D RE Y+ + + Y F++ +++DAT++GGIARFINHSC PNC KVISV
Sbjct: 1 ADKREIEYQSGKRQQYKSACYFFKIGKEHIIDATRKGGIARFINHSCQPNCVAKVISVRN 60
Query: 892 QKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
+KK+ +A+RHI GEEITY+Y F E+ ++IPC+C S+ C LN
Sbjct: 61 EKKVVFFAERHINPGEEITYDYHFNREDEGQRIPCFCRSRGCRRYLN 107
>gi|357443507|ref|XP_003592031.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
gi|355481079|gb|AES62282.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
Length = 479
Score = 109 bits (272), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 76/124 (61%), Gaps = 1/124 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL+A E I+A FVIEY GE+I K + R YE G+ +++ L+ +DAT++G
Sbjct: 96 WGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYEIQGLKDAFIISLNASESIDATRKG 155
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+A I G E+ Y+Y F K+ C CG
Sbjct: 156 SLARFINHSCQPNCETRKWNVMGEIRVGIFALEDIPIGTELAYDYNFEWFGGAKVRCLCG 215
Query: 928 SKKC 931
+ KC
Sbjct: 216 ALKC 219
>gi|242011020|ref|XP_002426255.1| protein MLP1, putative [Pediculus humanus corporis]
gi|212510318|gb|EEB13517.1| protein MLP1, putative [Pediculus humanus corporis]
Length = 2688
Score = 109 bits (272), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 77/130 (59%), Gaps = 5/130 (3%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLK-ISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
WG+ +PI+A DF++EYVGE++ K D Y + Y LD G V+D +
Sbjct: 1958 WGIRTKQPIKAGDFILEYVGEVVSDKEFKDRMASIY--VNDKHHYCLHLDGGLVIDGHRM 2015
Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL--EEKKIPCY 925
GG RF+NHSCNPNC + SV G ++ ++A R+I A +E+TY+Y F L + PCY
Sbjct: 2016 GGDGRFVNHSCNPNCEMQKWSVNGLFRMALFALRNIPAHQELTYDYNFSLFNPAEGQPCY 2075
Query: 926 CGSKKCHGSL 935
CG+ +C G +
Sbjct: 2076 CGTNECRGVI 2085
>gi|354485080|ref|XP_003504712.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Cricetulus
griseus]
gi|344251953|gb|EGW08057.1| Histone-lysine N-methyltransferase EZH1 [Cricetulus griseus]
Length = 747
Score = 109 bits (272), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|357443463|ref|XP_003592009.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
gi|355481057|gb|AES62260.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
Length = 616
Score = 109 bits (272), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 76/124 (61%), Gaps = 1/124 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
WGL+A E I+A FVIEY GE+I K + R YE G+ +++ L+ +DAT++G
Sbjct: 233 WGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYEIQGLKDAFIISLNASESIDATRKG 292
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ARFINHSC PNC T+ +V G+ ++ I+A I G E+ Y+Y F K+ C CG
Sbjct: 293 SLARFINHSCQPNCETRKWNVMGEIRVGIFALEDIPIGTELAYDYNFEWFGGAKVRCLCG 352
Query: 928 SKKC 931
+ KC
Sbjct: 353 ALKC 356
>gi|302826182|ref|XP_002994616.1| hypothetical protein SELMODRAFT_138884 [Selaginella moellendorffii]
gi|300137307|gb|EFJ04316.1| hypothetical protein SELMODRAFT_138884 [Selaginella moellendorffii]
Length = 833
Score = 109 bits (272), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 86/152 (56%), Gaps = 14/152 (9%)
Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
G AR + + RN+ LLK Q R+ RS + WG P+ D++
Sbjct: 664 GPPARGDSYECRNMKL------LLKQQQ------RVLLGRSDVAGWGAFLKTPVNKHDYL 711
Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
EY GELI + +D R Y++ SS+LF L+D YV+DA ++G +F NHS NPNCY
Sbjct: 712 GEYTGELISHREADKRGKIYDREN--SSFLFNLNDQYVLDACRKGDKLKFANHSPNPNCY 769
Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
KVI V G ++ I+AK I+AGEE+ Y+Y++
Sbjct: 770 AKVIMVAGDHRVGIFAKERISAGEELFYDYRY 801
>gi|291406159|ref|XP_002719453.1| PREDICTED: enhancer of zeste homolog 1-like [Oryctolagus cuniculus]
Length = 747
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|395826329|ref|XP_003786371.1| PREDICTED: histone-lysine N-methyltransferase EZH1 [Otolemur
garnettii]
Length = 753
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 618 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 675
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 676 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 735
>gi|325091551|gb|EGC44861.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 1000
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/126 (41%), Positives = 69/126 (54%), Gaps = 1/126 (0%)
Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
+GL A + F+ EY+GE+I + R Y++ GI Y L G VDATK+G
Sbjct: 218 YGLRAESDLRPNQFIFEYIGEVINETLFRKRMLSYDEEGIKHFYFMSLSKGEFVDATKKG 277
Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
+ RF NHSCNPNCY V + ++ I+A+RHI AGEE+ +NY PCYCG
Sbjct: 278 NLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCG 337
Query: 928 SKKCHG 933
C G
Sbjct: 338 EPNCTG 343
>gi|440903225|gb|ELR53912.1| Histone-lysine N-methyltransferase EZH1, partial [Bos grunniens
mutus]
Length = 752
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 617 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 674
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 675 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 734
>gi|348562498|ref|XP_003467047.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Cavia
porcellus]
Length = 747
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|426238089|ref|XP_004012990.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1 [Ovis
aries]
gi|223635231|sp|A7E2Z2.2|EZH1_BOVIN RecName: Full=Histone-lysine N-methyltransferase EZH1; AltName:
Full=Enhancer of zeste homolog 1
Length = 747
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|296201474|ref|XP_002748044.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1
[Callithrix jacchus]
Length = 753
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 618 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 675
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 676 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 735
>gi|344285102|ref|XP_003414302.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Loxodonta
africana]
Length = 747
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|417412559|gb|JAA52658.1| Putative transcriptional repressor ezh1, partial [Desmodus
rotundus]
Length = 751
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 616 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 673
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 674 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 733
>gi|410981177|ref|XP_003996949.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1 [Felis
catus]
Length = 747
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|281344548|gb|EFB20132.1| hypothetical protein PANDA_011171 [Ailuropoda melanoleuca]
Length = 749
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 614 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFN 671
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 672 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 731
>gi|426238091|ref|XP_004012991.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 2 [Ovis
aries]
Length = 707
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 572 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 629
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 630 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 689
>gi|403304420|ref|XP_003942795.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1
[Saimiri boliviensis boliviensis]
Length = 747
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|301773590|ref|XP_002922190.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Ailuropoda
melanoleuca]
Length = 747
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|431890589|gb|ELK01468.1| Histone-lysine N-methyltransferase EZH1 [Pteropus alecto]
Length = 744
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 609 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 666
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 667 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 726
>gi|410981179|ref|XP_003996950.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 2 [Felis
catus]
Length = 677
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 542 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 599
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 600 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 659
>gi|149723756|ref|XP_001493467.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1 [Equus
caballus]
Length = 747
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
>gi|73965665|ref|XP_849127.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 2 [Canis
lupus familiaris]
Length = 759
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 624 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 681
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 682 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 741
>gi|403304422|ref|XP_003942796.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 2
[Saimiri boliviensis boliviensis]
Length = 677
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)
Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
KK L S + WG E ++ +F+ EY GELI +D R Y+K SS+LF
Sbjct: 542 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 599
Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
L++ +VVDAT++G RF NHS NPNCY KV+ V G +I I+AKR I AGEE+ ++Y++
Sbjct: 600 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 659
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.130 0.377
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,921,200,602
Number of Sequences: 23463169
Number of extensions: 576743054
Number of successful extensions: 1592983
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3172
Number of HSP's successfully gapped in prelim test: 2562
Number of HSP's that attempted gapping in prelim test: 1572274
Number of HSP's gapped (non-prelim): 13301
length of query: 936
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 784
effective length of database: 8,792,793,679
effective search space: 6893550244336
effective search space used: 6893550244336
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 82 (36.2 bits)