BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 042571
         (936 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255539394|ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
 gi|223551463|gb|EEF52949.1| set domain protein, putative [Ricinus communis]
          Length = 1258

 Score =  753 bits (1944), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 447/863 (51%), Positives = 555/863 (64%), Gaps = 64/863 (7%)

Query: 86   KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
            KS G+I+ FWGSY +VC++LFD+CM+VMWNAVF D +A+YS++WR+RKLWS    I  PA
Sbjct: 448  KSVGTIDNFWGSYAVVCRILFDYCMEVMWNAVFYDAIADYSNSWRRRKLWSARSNIRLPA 507

Query: 146  SDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQK 205
            S  KD    +E+  S                +V ++ +N  Q ++LS  + V E  SK  
Sbjct: 508  S-IKDYGGEIEKLSSE-------------LELVCLKKDNHAQSHNLSPFLHVRERASKLN 553

Query: 206  NLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKEDV 265
             LS   H     ++ IL+ V+NEL++STK  ++EYVE L++ EV K+V  S+   + E+ 
Sbjct: 554  ALS---HKAYRGIRRILEYVKNELHMSTKPFFSEYVEFLIDKEVGKIVRVSEDDKLNEET 610

Query: 266  VDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLS----KDLLSN 321
            V+  S    T     ++    +  DS + + E  +S+D++SL QAGKPL     +DL SN
Sbjct: 611  VESFSRRCQTTDYSSSEFQDELTTDSVKLNVE--TSDDTQSLVQAGKPLGSLAPEDLFSN 668

Query: 322  ILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGE 381
             +A AF +S    VD V+ +   DEP PPGF D+ R LVPS   KF+ +  +E   K+ E
Sbjct: 669  FVASAFAKSQVD-VDFVMVDQNIDEPPPPGFGDNARTLVPSPIHKFRPTQPEESIPKIRE 727

Query: 382  YVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHNEH 441
            YVA+A+CRQKLH  V+ EWKS F+D  L QFL     +++ C+     K  G SNA+ +H
Sbjct: 728  YVAMAICRQKLHDDVLSEWKSFFIDGILNQFLRSIHTLRQHCQP--GSKMGGTSNANKDH 785

Query: 442  HGDTSTVVDKLKEGSKRFHSSEA---STMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAF 498
            +G   T + KLK G++ F+SS++   S++ +KYTY+RKKKL+RKK GS S     V+   
Sbjct: 786  NGTALTSLYKLK-GTREFNSSDSAGVSSVCDKYTYYRKKKLVRKKLGSSSQSITPVDTGL 844

Query: 499  QTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIG 558
            Q   VEK +KQ V  D+     V       +K G+ +L D                    
Sbjct: 845  QHHPVEKLQKQNVVKDIEVEPVVATLKKKKQKKGQTELSD-------------------- 884

Query: 559  KNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVP----KPSGE 614
                       A  S VKS LPS  S AK+   QKV+K   AV R  + V     KP+ +
Sbjct: 885  --------DRRAIKSIVKSSLPSDQSMAKNGTHQKVIKYKHAVPRPSINVTIDTIKPNRK 936

Query: 615  MLSTLSADGNDVGKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLEL-HA 673
              S +S D   V KV     H+ GIE+    D SK K  ATK SK KRK + DG  + H 
Sbjct: 937  NSSDVSKDHAKVKKVSDSNNHDGGIEEVPTHDYSK-KNLATKISKLKRKHSADGRSVSHP 995

Query: 674  TKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASP 733
             K LKV    +KQAASRQV   K K+ KSR SN CPRSDGCARSSI+GWEWHKWS +ASP
Sbjct: 996  MKFLKVTTSGSKQAASRQVTAGKAKSRKSRASNSCPRSDGCARSSITGWEWHKWSHSASP 1055

Query: 734  AERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVK 793
            A+RARVRG   +H  Y   E   SQ +NGK LSARTNRVK+RNLLAAAEGA+LLKA+Q+K
Sbjct: 1056 ADRARVRGIHCLHANYSVSEAYTSQLSNGKVLSARTNRVKMRNLLAAAEGADLLKATQLK 1115

Query: 794  ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
            ARKKRLRFQ+SKIHDWGLVALEPIEAEDFVIEYVGELIR +ISDIRE  YEKMGIGSSYL
Sbjct: 1116 ARKKRLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRPRISDIRERLYEKMGIGSSYL 1175

Query: 854  FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
            FRLDDGYVVDATKRGG+ARFINHSC PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY
Sbjct: 1176 FRLDDGYVVDATKRGGVARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 1235

Query: 914  KFPLEEKKIPCYCGSKKCHGSLN 936
            KFPLEEKKIPC CGS+KC GSLN
Sbjct: 1236 KFPLEEKKIPCNCGSRKCRGSLN 1258


>gi|224061805|ref|XP_002300607.1| SET domain protein [Populus trichocarpa]
 gi|222842333|gb|EEE79880.1| SET domain protein [Populus trichocarpa]
          Length = 1390

 Score =  730 bits (1885), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 443/917 (48%), Positives = 553/917 (60%), Gaps = 105/917 (11%)

Query: 81   LLLAQKSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPK 140
            L  + KS G+ + FWGSY ++C+ L D+CM+VMWNAVF D +AEY+ +WRK KLW  HP 
Sbjct: 419  LPRSTKSVGNADDFWGSYAVICRCLSDYCMEVMWNAVFYDTIAEYTISWRKSKLWFHHPY 478

Query: 141  ITGPASDYKDDRKRMEQAPSR-----HDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSV 195
            +            ++E+ PS       +S  S  DCPPGF +++ ++++ V     S   
Sbjct: 479  LC----------MKIEELPSETYFSGQESPASSVDCPPGFELLKTKSDHTVPSSITSSCA 528

Query: 196  PVGENLSKQKNLS---CNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKV 252
             VG+   +Q +LS   C D    DD+KCIL+ V  EL+ STK +  EYVEILV+++V+K+
Sbjct: 529  HVGQEPCEQNSLSFKDCPD----DDMKCILESVAYELHKSTKVSLLEYVEILVKEKVKKL 584

Query: 253  VSASKGINMKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGK 312
            V+ S+   + E++ D S       + G  ++     IDSN+  AEI  S   +S  Q  K
Sbjct: 585  VNFSEDKRLNEEIFDFSIPSSQASEYGSIEMKDEKMIDSNQIPAEIMFSSKPQSSLQVQK 644

Query: 313  PL----SKDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQ 368
                  S++ +SN LA AFKR     V N +D+   D P PPGF+D+   L PS   KF+
Sbjct: 645  SFFPFQSENEISNFLAIAFKRLRPSVV-NAIDDENIDGPPPPGFKDTA--LFPSAINKFR 701

Query: 369  FSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGN 428
             S S + T K+G YV IAMC QKLH  V+  WKS+FVD+ L +   L C+ ++  E   N
Sbjct: 702  PSKSLKLTPKVGAYVTIAMCMQKLHDDVLNVWKSIFVDEILHRSPRLCCSSEKHTEPGIN 761

Query: 429  EKAEGASNAHNEHHGDTSTVVDKLKEGSKRFHSSEASTMV---EKYTYHRKKKLLRKKFG 485
            E  EGA                K  EGS +FHS ++S +     KYTYHRK+KL+ KK G
Sbjct: 762  E--EGAF---------------KFTEGSNKFHSPDSSVLSLVSGKYTYHRKRKLVGKKLG 804

Query: 486  SPSNCSNSVENAFQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKK-- 543
            S S+ + +V++    + VEKSRKQ V  DV EN  VQP     KK G+   +DA   K  
Sbjct: 805  SSSHSTTTVDSGLLKQPVEKSRKQDVLSDVSENVVVQPVKTPKKK-GQASSVDAKPLKAT 863

Query: 544  IGANKFTSVPSKMIGKNKVTAESSASAGSSK--VKSKLPSGYSSAKSTISQKVMKVTSAV 601
            I  +   + P K        AESS + G SK  VKS L    S  K+   +KVMK+  AV
Sbjct: 864  IAESSVNARPLKA-----TIAESSVNVGPSKAAVKSTLKRDQSLPKNISRRKVMKIARAV 918

Query: 602  QRDKVPVP--KPSGEMLSTLSADGNDVG-----------KVVRGKAHNVGIEKDSILDSS 648
              DK      K S +++  +  +G D G           K +     +    K ++   S
Sbjct: 919  NDDKDAKDSIKTSRDVVGLIDCNGRDAGIKKSGTTECSKKTLNSTKVSNSKRKSTVDGGS 978

Query: 649  KSKP--------------------------------NATKESKQKRKRTMDGLEL-HATK 675
             S P                                 ATK SK KRK T++G  + H  K
Sbjct: 979  VSHPMKILKVENDVNKQAATGQVMARKTKSDHVFLCTATKVSKLKRKSTVNGGSVSHPMK 1038

Query: 676  ALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAE 735
             LKV  G  KQ A+ Q   +KTK+SKSR    CPRSDGCARSSI+GWEWH WS+ ASPAE
Sbjct: 1039 ILKVENGANKQTATGQFTARKTKSSKSRMLIPCPRSDGCARSSINGWEWHAWSVKASPAE 1098

Query: 736  RARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKAR 795
            RARVRG + +H KY G E  ASQ +NGK LSARTNRVKLRNLLAAAEG +LLKA+Q+KAR
Sbjct: 1099 RARVRGVRCIHAKYSGSEAYASQLSNGKVLSARTNRVKLRNLLAAAEGVDLLKATQLKAR 1158

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            KKRL FQRSKIHDWGLVALE IEAEDFVIEYVGELIR +ISDIRE  YEKMGIGSSYLFR
Sbjct: 1159 KKRLCFQRSKIHDWGLVALESIEAEDFVIEYVGELIRPQISDIRERLYEKMGIGSSYLFR 1218

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            LDDGYVVDATKRGGIARFINHSC PNCYTKVISVEGQKKIFIYAKR+IAAGEEITYNYKF
Sbjct: 1219 LDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRYIAAGEEITYNYKF 1278

Query: 916  PLEEKKIPCYCGSKKCH 932
            PLE+KKIPC CGS+K +
Sbjct: 1279 PLEDKKIPCNCGSRKLY 1295


>gi|356532622|ref|XP_003534870.1| PREDICTED: uncharacterized protein LOC100805708 [Glycine max]
          Length = 1213

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/862 (46%), Positives = 524/862 (60%), Gaps = 71/862 (8%)

Query: 86   KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
            KS GSIE FW SY +V K+L D+ MQVMWNAVF D +AEY S+WRK+KLWS H K    A
Sbjct: 412  KSVGSIENFWWSYAVVRKVLLDYSMQVMWNAVFFDTLAEYLSSWRKKKLWS-HRKPQPSA 470

Query: 146  SDYKDDRKRMEQAP--SRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSK 203
            ++ +D  +++E        D+S S+ D    FG+  + TE +      S S   G NL +
Sbjct: 471  NECEDHTEKIESEALVINPDTSESNVDGYNQFGV--LATEKNCPRLFSSSSSLKGGNLLE 528

Query: 204  QKNLSCNDHLLLD---DVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGIN 260
             + +SC    L D   D+ CIL+ VENEL+ S+K +  +Y++  +E EV K++       
Sbjct: 529  GQKVSC----LYDNSRDLTCILESVENELHFSSKVSLADYIQSFIEKEVNKLI------- 577

Query: 261  MKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIF---SSEDSKSLFQAGKPLSKD 317
                   P   +    +    D     ++    +  EI    S + +K+    G+  S +
Sbjct: 578  -------PFPEENKFNEVAVGDTRFSEKLADKTSVKEILNDKSVDPAKAGNSFGESASGN 630

Query: 318  LLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTT 377
             +S++ + AFK    G+VD+VV+E E D+  PPG E S + +    N KF+ S S E   
Sbjct: 631  HMSDVFSKAFKE-LCGYVDDVVEE-EIDD-LPPGLEKS-QTVALHYNSKFRPSRSAECNL 686

Query: 378  KMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNA 437
            K+ EYVA A+CRQKLH  V+ +W+SLF+D   +Q       +K+  ++DG++K +   NA
Sbjct: 687  KITEYVATALCRQKLHDEVLEKWRSLFLDSVPKQVFISSSTIKKHFKSDGHKKRKTV-NA 745

Query: 438  HNEHHGDTSTVVDKLKEGSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENA 497
              EH    ++ + ++KEG+K   SSE   ++ KYTY RKK   ++   S S   N     
Sbjct: 746  SKEHLNSATSGLGRVKEGAKS--SSEVPPVIGKYTYCRKKLSRKELISSKSVAENDSRPG 803

Query: 498  FQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMI 557
             Q   V K RK   +GDV E A+V+ ++V     GK K+I          K +   +   
Sbjct: 804  KQP--VAKLRKH-FSGDVGEAAEVKIASVIH---GKTKMIKGKKDTTSKGKSSVSVNSSS 857

Query: 558  GKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVPKPSGEMLS 617
              +++                      S K+   QKV+K +  VQ D     K + + LS
Sbjct: 858  HNDQL----------------------SLKNKAGQKVLKFSGEVQNDVKDFVKSNVKKLS 895

Query: 618  TLSADGNDVGKVVRGKAHNVGIEKDSILDS-SKSKPNAT-KESKQKRKRTMDGL-ELHAT 674
              + +   + K+V+      G  K+ +    S+   NAT K SK KRK  MDG    H T
Sbjct: 896  ASTDNSVVMKKIVKSD----GTVKEKVTSHCSREIQNATMKVSKSKRKHQMDGTASSHPT 951

Query: 675  KALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPA 734
            K LK++ G A   AS+QV +   K++KS+  NLCPRSDGCAR+SI GWEWHKWS +ASPA
Sbjct: 952  KVLKISNGGAYLGASKQVTVASRKSAKSKPLNLCPRSDGCARTSIDGWEWHKWSRSASPA 1011

Query: 735  ERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKA 794
             +ARVRG   V  K +  E N SQ +NGKGLSARTNRVKLRNLLAAAEGA+LLK  Q+KA
Sbjct: 1012 YKARVRGLPCVQNKCIDSENNLSQLSNGKGLSARTNRVKLRNLLAAAEGADLLKVPQLKA 1071

Query: 795  RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
            RKK LRFQRSKIHDWGL+ALEPIEAEDFVIEY+GELIR +ISDIRE +YEKMGIGSSYLF
Sbjct: 1072 RKKHLRFQRSKIHDWGLLALEPIEAEDFVIEYIGELIRPRISDIRERQYEKMGIGSSYLF 1131

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            RLDDGYVVDATKRGGIARF+NHSC PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK
Sbjct: 1132 RLDDGYVVDATKRGGIARFVNHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1191

Query: 915  FPLEEKKIPCYCGSKKCHGSLN 936
            FPLEEKKIPC CGS+KC GSLN
Sbjct: 1192 FPLEEKKIPCNCGSRKCRGSLN 1213


>gi|356558250|ref|XP_003547420.1| PREDICTED: uncharacterized protein LOC100806034 [Glycine max]
          Length = 1300

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 402/858 (46%), Positives = 523/858 (60%), Gaps = 60/858 (6%)

Query: 86   KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
            KS GSIE FW S+ +V K+L D+ MQVMWNAVF D +AEY S+WRK+ LWS HPK    A
Sbjct: 496  KSVGSIENFWWSFAVVRKVLLDYSMQVMWNAVFFDTLAEYLSSWRKKMLWS-HPKPQPSA 554

Query: 146  SDYKDDRKRMEQAP--SRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSK 203
            +  +D  +++E        DSS S+ D    FG+  + TE +      S S   G NL +
Sbjct: 555  NGCEDHTEKIESEALVFNPDSSESNVDGDNQFGV--LTTEKNCPLLFSSPSSLKGGNLLE 612

Query: 204  QKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKE 263
             + +SC  ++   D+ CIL+ VENEL+ S+K +  +Y+   VE EV K++          
Sbjct: 613  GQKVSC-PYVNSRDLTCILESVENELHFSSKVSLADYIRSFVEKEVNKLIPF-------- 663

Query: 264  DVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPL----SKDLL 319
                P  +  +    G    +G   I +++TS +   ++ S    +AG       S +  
Sbjct: 664  ----PEENKFNEVAVGGTHFSG---ILADKTSMKEILNDKSVDPVKAGNSFGESASGNHK 716

Query: 320  SNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKM 379
            S+I + AFK    G+VD+VV+E E D+  PPG E S + +VP  N KF+ S S E   K+
Sbjct: 717  SDIFSKAFKE-LCGYVDDVVEE-EIDD-LPPGLEKS-QTVVPHYNSKFRPSRSAESNPKI 772

Query: 380  GEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHN 439
             EYVA A+CRQKLH  V+ +W+ LF++   +Q       +K+  ++DG++K + A +A  
Sbjct: 773  TEYVATALCRQKLHDEVLEKWRLLFLNSVPKQVFISSSTVKKHFKSDGHKKRKMA-DASK 831

Query: 440  EHHGDTSTVVDKLKEGSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAFQ 499
            EH    ++ + ++KEG+K   SSE   ++ KYTY RKK   ++   S S   N      Q
Sbjct: 832  EHLNSATSGLGRVKEGAKS--SSEVPPVIGKYTYCRKKLSQKELIFSKSVAENDSRTGKQ 889

Query: 500  TEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGK 559
               V K RK  V+GDV E A+V+   ++S K GK K+I          K +   +     
Sbjct: 890  L--VTKLRKH-VSGDVGEAAEVK---IASAKHGKTKMIKGKKDTSSKGKSSVSVNSSSHN 943

Query: 560  NKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVPKPSGEMLSTL 619
            ++++ ++ A     K    +       K  +   V K++ +           S  M    
Sbjct: 944  DQLSLKNKAGRKVLKFSDDV-------KDFVKSNVKKLSVSTNN--------SVGMKKVA 988

Query: 620  SADGNDVGKVVRGKA-HNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALK 678
             +DG D    V+ K   +   E  ++  ++K      K +K KRK  MDG   H TK LK
Sbjct: 989  KSDGCDRDDTVKEKTTSHCSREIQNVFTATK------KVTKSKRKHQMDGTSSHPTKVLK 1042

Query: 679  VAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERAR 738
            ++ G A   AS+QV +   K++KS+  NLCPRSDGCAR+SI GWEWHKWS +ASPA +AR
Sbjct: 1043 ISNGGAYLGASKQVPVASRKSAKSKPLNLCPRSDGCARTSIDGWEWHKWSRSASPAYKAR 1102

Query: 739  VRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKR 798
            VRG   V  K +  E N  Q +NGKGLSARTNRVKLRNLLAAAEGA+LLK  Q+KARKK 
Sbjct: 1103 VRGLPCVRNKCIDSENNLFQLSNGKGLSARTNRVKLRNLLAAAEGADLLKVPQLKARKKH 1162

Query: 799  LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
            LRFQRSKIHDWGLVALEPIEAEDFVIEY+GELIR +ISDIRE +YEKMGIGSSYLFRLDD
Sbjct: 1163 LRFQRSKIHDWGLVALEPIEAEDFVIEYIGELIRPRISDIRERQYEKMGIGSSYLFRLDD 1222

Query: 859  GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE 918
            GYVVDATKRGGIARFINHSC PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE
Sbjct: 1223 GYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE 1282

Query: 919  EKKIPCYCGSKKCHGSLN 936
            EKKIPC CGS+KC GSLN
Sbjct: 1283 EKKIPCNCGSRKCRGSLN 1300


>gi|449458127|ref|XP_004146799.1| PREDICTED: uncharacterized protein LOC101220062 [Cucumis sativus]
          Length = 1289

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 391/885 (44%), Positives = 531/885 (60%), Gaps = 81/885 (9%)

Query: 89   GSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPASDY 148
            GSI+ F   + ++C+MLFD+ +QV+WNAV  D VAEYSSAWR+++ WS  P  +  +S Y
Sbjct: 449  GSIDNFREVHAVICQMLFDYSLQVVWNAVSYDTVAEYSSAWRRKRFWSYRPHYSLASSGY 508

Query: 149  KDDRKRMEQAPS-----RHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSK 203
            +D  K++E+ P+     R +SS+              +TEN  +   +SLSVPVG   S+
Sbjct: 509  RDRVKKIEKTPAEASLPRKESSLHGVSSLSVSKFKGAQTENCARSAVISLSVPVGHKSSR 568

Query: 204  QKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKE 263
              + SC +    +D+K +++ +E EL+ S K +  EY++ ++E+EV    +AS  + + +
Sbjct: 569  PTSHSCCERPK-EDLKWMVEYLEKELHSSAKVSMAEYIQDILEEEVISSCNASTDVKLDK 627

Query: 264  DVVDPSSHDLHTCQC----GFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLSKDLL 319
              +D S       QC     +++  G ++ DSN+T  +  S E   +L      LS D  
Sbjct: 628  VALDVS------IQCSSTDNYSNSFGELQCDSNDTHGDRNSGELKLALLPEVN-LSNDTA 680

Query: 320  SNILACAFKRSF--------SGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSW 371
             N +A +    F          F ++     + +E   PG E+     +PS   KF+ S 
Sbjct: 681  LNSVANSLYEVFKEICTNEGCAFNEDCAFNEDCNELLAPGLEEHPTFQIPSPACKFRPSS 740

Query: 372  SDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKA 431
            S++  +K+  Y+ +A+CRQKLH  V+ EW S + DD L+QF++ W   K+ C  + N   
Sbjct: 741  SNKCYSKIEGYIMLAICRQKLHDAVLKEWTSSYKDDLLRQFVSSWIASKKHC--NSNRIV 798

Query: 432  EGASNAHNEHHGDTSTVVDKLKEGSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCS 491
            EGA +      G+ S V DKL+EGS+RF   E+S +   YTY+RKK   ++K GS S+C+
Sbjct: 799  EGACDG-----GEASKVPDKLREGSERFL--ESSLVTGNYTYYRKKSS-KRKLGS-SDCA 849

Query: 492  NSVENAFQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTS 551
                   + +  EKSRK+ ++  V E    + ++++ K I KNK                
Sbjct: 850  TEGSPVVRNQPSEKSRKENISVGVCETTDSEIASLTLKSIAKNKR--------------- 894

Query: 552  VPSKMIGKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRD---KVPV 608
                   K  ++ +++     ++V   LPS +SS K+    K +K +  V+ D   K  V
Sbjct: 895  -------KKDLSIKATCKRTCAEVT--LPSSHSSGKTICGTKKLKFSPPVKDDNAKKDSV 945

Query: 609  PKPSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDSILDSSKSKPN--------------- 653
                G M+ +     N V +V+      VG ++     S                     
Sbjct: 946  KHGKGRMIGSPLMIKN-VDQVMNKCDRGVGAQEKLCSTSLLLSSLIFSLSLSSFFPGLYL 1004

Query: 654  --ATKESKQKRKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRS 711
              A   SK KRK+ +D   L   K L VA   +KQAAS++V  +K K+ KSR  N+   S
Sbjct: 1005 CAAVNVSKIKRKQKVDEASLLGNKVLTVADDFSKQAASKRVVAQKKKSDKSRKLNISIIS 1064

Query: 712  DGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNR 771
            DGCARSSI+GWEW +W+L ASPAERAR RG QY ++  +GP+V+ S   NGKGLSARTNR
Sbjct: 1065 DGCARSSINGWEWRRWTLKASPAERARNRGFQYFYSDPIGPDVSTSHLLNGKGLSARTNR 1124

Query: 772  VKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELI 831
            VKLRNLLAAA+GA+LLKASQ+KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELI
Sbjct: 1125 VKLRNLLAAADGADLLKASQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELI 1184

Query: 832  RLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG 891
            R +ISDIRE +YEKMGIGSSYLFRLDDGYVVDATKRGG+ARFINHSC PNCYTKVI+VEG
Sbjct: 1185 RPRISDIRERQYEKMGIGSSYLFRLDDGYVVDATKRGGVARFINHSCEPNCYTKVITVEG 1244

Query: 892  QKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            QKKIFIYAKRHI+AGEEITYNYKFPLEEKKIPC C S++C GSLN
Sbjct: 1245 QKKIFIYAKRHISAGEEITYNYKFPLEEKKIPCNCRSRRCRGSLN 1289


>gi|357449257|ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
 gi|355483953|gb|AES65156.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
          Length = 1232

 Score =  618 bits (1594), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 396/864 (45%), Positives = 519/864 (60%), Gaps = 75/864 (8%)

Query: 86   KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
            KS GSIE FW SY +V K+LFDH +QVMWNAVF D V E   +WRK+K WS HPK     
Sbjct: 403  KSVGSIENFWWSYAVVRKVLFDHSLQVMWNAVFFDTVTEVLFSWRKKKYWS-HPKPQSSV 461

Query: 146  SDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLS---VPVGENLS 202
            ++ KD  ++++       +  S  +         + TE D  P  L LS   + +G N++
Sbjct: 462  NESKDSVEKLKSEALALGTGSSVCNVEADIQSGAMATERDCHP-ELLLSPNNIKIG-NIA 519

Query: 203  KQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVV-SASKGINM 261
            + + +SC+ +   +D+  IL+ VENEL+ S KA+  +YV  +VE EV +++ S  K I  
Sbjct: 520  EGQRVSCS-YGNSEDLTRILESVENELHCSAKASLADYVRSVVEKEVNQLIPSPEKDIFS 578

Query: 262  KEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGK----PLSKD 317
            + DV D          C  + +  G      +TS +   S+ S    + G     P S++
Sbjct: 579  EVDVSD----------CRISKMVAG------KTSVKETLSDKSIDPVKNGDSICVPSSEN 622

Query: 318  LLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTT 377
             +S++ + AF+    G +++VVD+ E  +  PPGFE +   + P CN KF+ S + E   
Sbjct: 623  HMSDVFSKAFQE-LCGHLNDVVDDEEIGD-LPPGFEKN--SIFPHCNSKFRPSRTVECNP 678

Query: 378  KMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNA 437
            K+ EYV  A+CRQKLH  V+ +WK   +D   ++ ++  C +K+  ++ G+ K +  S A
Sbjct: 679  KITEYVTAALCRQKLHDEVLKDWKLSILDSTFKKVMSS-CTIKKNLQSRGHGKGKSFS-A 736

Query: 438  HNEHHGDTSTVVDKLKEGSKR--------FHSSEASTMVEKYTYHRKKKLLRKKFGSPSN 489
            + EH  D +  + K+KEG+K           SS     +EKYTYHRK  L RK+  S   
Sbjct: 737  NKEHLNDATLGLGKVKEGTKLGLGKVKEGAKSSGVPLAIEKYTYHRKN-LSRKELCSSKP 795

Query: 490  CSNSVENAFQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGANKF 549
              +  +N+   +      ++ V+GDV E+A+V+ +A+   K GK K+I            
Sbjct: 796  VVD--DNSGPGKKPLAKLRKDVSGDVKESAEVKVTAI---KRGKAKMIKGK--------- 841

Query: 550  TSVPSKMIGKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVP 609
                            SS  +    V +  PS   S K+   QKV K    VQ     V 
Sbjct: 842  -------------KDTSSKKSSPVNVDNSSPSVQLSLKNKTCQKVSKFAHTVQNGVTDVL 888

Query: 610  KPSGEMLSTLSADGNDVG-KVVRGKAHNVGIEKDSILDSSKSKPNATKE-SKQKRKRTMD 667
            K + + L  L +  N VG KVV+    +V I++ +    SK K NAT   SK KRK   D
Sbjct: 889  KSNKKRL--LVSSDNSVGMKVVKRNNTDVTIQRKTTGHISKEKLNATNTVSKSKRKHQPD 946

Query: 668  GL-ELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHK 726
            G+   H  K LK++   A   AS+QV   +  ++KS++ +LCPRS GCAR+SI GWEWHK
Sbjct: 947  GVTSSHPAKVLKISNSGASLEASKQVTEARRNSAKSKSLDLCPRSIGCARTSIDGWEWHK 1006

Query: 727  WSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAEL 786
            WS +ASP  RARVRG   +  K++  E N SQ +N KGLSARTNRVKLRNLLAAAEGA+L
Sbjct: 1007 WSQSASPTSRARVRGLPRLQNKFINSEKNPSQLSNSKGLSARTNRVKLRNLLAAAEGADL 1066

Query: 787  LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
            LK  Q+KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY+GELIR +ISDIRE +YEKM
Sbjct: 1067 LKVPQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYIGELIRPRISDIREVQYEKM 1126

Query: 847  GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
            GIGSSYLFRLDDGYVVDATKRGGIARFINHSC PNCY KVIS EGQKKIFIYAKRHI AG
Sbjct: 1127 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYPKVISFEGQKKIFIYAKRHINAG 1186

Query: 907  EEITYNYKFPLEEKKIPCYCGSKK 930
            EEITYNYKFPLEEKKIPC CGSKK
Sbjct: 1187 EEITYNYKFPLEEKKIPCNCGSKK 1210


>gi|115489550|ref|NP_001067262.1| Os12g0613200 [Oryza sativa Japonica Group]
 gi|108862955|gb|ABA99391.2| SET domain containing protein, expressed [Oryza sativa Japonica
            Group]
 gi|113649769|dbj|BAF30281.1| Os12g0613200 [Oryza sativa Japonica Group]
          Length = 1212

 Score =  407 bits (1047), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 319/892 (35%), Positives = 437/892 (48%), Gaps = 139/892 (15%)

Query: 101  VCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGP-ASDYKDDRKRMEQAP 159
            V + +++  M+ +W+ V  D V +Y  AW KRK  S       P ASD +  +   E +P
Sbjct: 404  VWETIYNKSMKSIWDEVLYDPVMDYCDAWLKRKNESNLLSTVVPGASDNQKMQDTDEMSP 463

Query: 160  SR---HDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQKNLSCNDHLLLD 216
                  D+  SD D PPGFG  +   E+     H +    V E    +   S    L   
Sbjct: 464  KAICDSDAPESDMDFPPGFGPNQESAEHS----HSACVEYVTEKTDGRSGSSIT--LFSG 517

Query: 217  DVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKEDVVDPSSHDLHTC 276
             +  +   + NELY++ K    ++ E ++ +E+   +      ++ ++ +    H     
Sbjct: 518  PLGRVQVMLANELYVAAKEALFQHFEEVISEEITNCLCIGFEDDINQERIRTPVHAPEPS 577

Query: 277  QCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLSKDLLSNILACAFKRSFSGFVD 336
                     G+ +    + AE+   E S     A   +S D+  + + C    + SG   
Sbjct: 578  S------PPGISVHETPSPAEMPRDEISDMAEMARDEIS-DMAVDTIPCPADMAASG--T 628

Query: 337  NVVDELETDE----------PS-------------------------PPGFEDSVRKLVP 361
            + V E+ TD+          PS                         PPG E  +  LV 
Sbjct: 629  STVPEVTTDKLIIPYVEHQSPSASHASIFEKLDAHEEAELDDSFDEVPPGTEAGLASLVI 688

Query: 362  SCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKE 421
                K+Q S S +    +  Y + A  RQ LH  V+ EW SLF   AL      W   K 
Sbjct: 689  MEKNKYQPSKSVDSVLDIYRYTSWAFFRQILHESVMKEWASLF-SGALSNCFDSWYARKN 747

Query: 422  CCEADGNEKAEGASNAHNEHHGDTSTVVDKLKEGSKRFHSSEASTMVEKYTYHRKKKLLR 481
                                     T+ D L+               ++YTY+RK+KL +
Sbjct: 748  IV---------------------AKTMDDTLRP--------------KEYTYYRKRKLRK 772

Query: 482  KKFGSPSNCSNSVENAFQTEHVEKSRKQGVAGDV-FENAKVQPSAVSSKKIGKNKLIDAS 540
                   NC  S       E + +  +  V   V  +N      A  S+++    +++  
Sbjct: 773  -------NCEASSSEKPMDEQLSRPLRDLVECKVNMKNIHRSSKAGISQRVS---VVEKP 822

Query: 541  SKKIGANKFTSVPSKMIGKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVM---KV 597
            SKK       + PS     N +  +      S KV  +  S + ++K  +S KV    + 
Sbjct: 823  SKK------RAKPSHNDNIN-LNIQQDLKLLSDKVPKRNRSSHPTSKPLVSSKVPTEDRT 875

Query: 598  TSAVQRDKVPVPKPSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDS------ILDSSKSK 651
            TSA+     P  K   + L+T   + N   K V     + G E  +      I+  +K K
Sbjct: 876  TSAM-----PAKKRKQKNLAT---ESNLKTKAVILSPESHGCEAPTEKRTTAIMPVNKRK 927

Query: 652  PNATKESKQKRKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNL---- 707
             N + ESK K K       L + ++         +  S   A K+ + + S  SNL    
Sbjct: 928  KNLSGESKLKAK------PLTSPESYVCEAPIDNRTTSTMPAKKRKQKNLSNESNLKKKP 981

Query: 708  ---CPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKG 764
               CP S GCAR+S+SGWEW  W+ NA+P+ERA+VRG + V +    PE N  + +  KG
Sbjct: 982  LVLCPESYGCARASVSGWEWRDWARNATPSERAQVRGYR-VRSILSAPENNVLKSSQVKG 1040

Query: 765  LSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
             SARTNRVKLRNLLAAAEG +LLK  Q K+RKKRLRFQRSKIH+WGLVALE I+AEDFVI
Sbjct: 1041 SSARTNRVKLRNLLAAAEGTDLLKIMQSKSRKKRLRFQRSKIHEWGLVALESIDAEDFVI 1100

Query: 825  EYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
            EYVGELIR ++SDIRE +YEK GIGSSYLFRLDD YVVDATKRGG+ARFINHSC+PNCYT
Sbjct: 1101 EYVGELIRRQVSDIREDQYEKSGIGSSYLFRLDDDYVVDATKRGGLARFINHSCDPNCYT 1160

Query: 885  KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            KVI+VEGQKKI IYAKR I AGEE+TYNYKFPLEEKKIPC+CGS++C GS+N
Sbjct: 1161 KVITVEGQKKIVIYAKRRIYAGEELTYNYKFPLEEKKIPCHCGSQRCRGSMN 1212


>gi|357161607|ref|XP_003579145.1| PREDICTED: uncharacterized protein LOC100843412 [Brachypodium
            distachyon]
          Length = 1194

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 311/902 (34%), Positives = 433/902 (48%), Gaps = 178/902 (19%)

Query: 101  VCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPASDYKDDRKRMEQAPS 160
            V + ++   M+ +W+ V  D V  Y   W KR      P    P +    + + M++   
Sbjct: 405  VWQTIYYESMKNIWDEVMYDPVMNYCGGWLKRNHQLSLPCTIVPGAP---ENRNMQETDG 461

Query: 161  RHDSSVSDD-------DCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQKNLSCNDHL 213
                 + D        D PPGFG      E+ + P  L + V   + +S +   S    +
Sbjct: 462  LSLKVICDPEAIECDMDFPPGFGPSIECPESSLPPLSLDV-VSCADQISGKPKSSST--I 518

Query: 214  LLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKEDVVD------ 267
             +  +  +   + NELY++ K T   +   ++ +E+   +      +  ++ +       
Sbjct: 519  FIGPLAGVQIMLANELYVAAKETLFHHFREVIAEEITNCLCIGLEESTDQERIRTPVHAP 578

Query: 268  --PSSHDLHTCQCGFA-DVN-----GGMRIDSNETSAEIFSSEDSKSLFQAGK------P 313
              PSS  +H        D++       M ID+ ++SAEI   E    +F A +      P
Sbjct: 579  ELPSSASVHETHISPELDMDKLLNVAKMDIDTMDSSAEIALDE----VFVAAELATDTIP 634

Query: 314  LSKDLLSNILACAFK-------------RSFSGFVDNVVDEL---ETDE------PSPPG 351
               D+ +   +C  K              S S    ++ ++L   E+DE        PPG
Sbjct: 635  SPADMATEETSCVAKVATDKMLTSHAEHHSLSASYASIFEKLDVCESDELDESFDEVPPG 694

Query: 352  FEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQ 411
             E  +  +V     K+Q S S      +  Y+ +A+CRQ LH   + EW S F+ D + +
Sbjct: 695  METGLASVVLMDKSKYQPSKSVNSVPVISRYITLAVCRQTLHQDAMKEWAS-FLSDTISE 753

Query: 412  FLALWCNMKECCEADGNEKAEGASNAHNEHHGDTSTVVDKLKEGSKRFHSSEASTMVEKY 471
                W  M+                           VV K  +GS R          +++
Sbjct: 754  CFDSWYTME--------------------------NVVPKNIDGSLR---------PKEF 778

Query: 472  TYHRKKKLLR--------KKFGSPSN------CSNSVENAFQTEHVEKSRKQGVAGDVFE 517
            TY+RK+KL +        K    P +       S  VE     ++V+ SRK         
Sbjct: 779  TYYRKRKLRKSSEVVSSKKPMEVPMDEQLSKPLSELVERKAHLKNVQGSRK--------- 829

Query: 518  NAKVQ-PSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSSKVK 576
             AK++ PS + +K +      D  ++++ +      PS  + K+K + E  A      ++
Sbjct: 830  GAKLKKPSKIQAKTL------DKEARRLNSKLDLKRPSHDVPKSKSSKEVHALNIEQDLE 883

Query: 577  SKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVPKPSGEMLSTLSADGNDVGKVVRGKAHN 636
             +L +     KS    + + +   V+R    VPK      + L    + V K        
Sbjct: 884  -RLSNDVPKRKSLKEVRTLNIEQDVKRLSNDVPK---RQRTPLLIKKHLVDK-------K 932

Query: 637  VGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTAKQAASRQVAMKK 696
            V +E DS+         ATK  K+++ + +        K L                   
Sbjct: 933  VSMENDSV---------ATKLVKKRKHKNISSDSSQKVKPLV------------------ 965

Query: 697  TKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNA 756
                      LCP SDGCAR+S SGWEW  W+ NA+P+ER RVRG  Y     L    N 
Sbjct: 966  ----------LCPESDGCARASTSGWEWRDWARNATPSERTRVRG--YRVRSILSTPAN- 1012

Query: 757  SQWANG--KGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            S W N   KG SARTNRVKLRNLLAA EGA+LLK +Q KARKKRLRFQRSKIH+WGLVAL
Sbjct: 1013 SLWKNSQVKGTSARTNRVKLRNLLAAYEGADLLKIAQSKARKKRLRFQRSKIHEWGLVAL 1072

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I+AEDFVIEYVGELIR  +SDIRE +YEK GIGSSYLFRLDD YVVDATKRGG+ARFI
Sbjct: 1073 ESIDAEDFVIEYVGELIRRPVSDIREAQYEKSGIGSSYLFRLDDDYVVDATKRGGLARFI 1132

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGS 934
            NHSC PNCYTKVI+V+GQKKIFIY+KR I AGEE+TYNYKFPLEEKKIPC+CGS +C GS
Sbjct: 1133 NHSCEPNCYTKVITVDGQKKIFIYSKRRIYAGEELTYNYKFPLEEKKIPCHCGSLRCRGS 1192

Query: 935  LN 936
            +N
Sbjct: 1193 MN 1194


>gi|157734198|gb|ABV68922.1| SDG25 [Arabidopsis thaliana]
          Length = 1388

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 176/228 (77%), Positives = 198/228 (86%), Gaps = 4/228 (1%)

Query: 709  PRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSAR 768
            P+SDGCAR+SI+GW WH WSL AS  ERARVRG+  VH ++ G + + +Q      LSAR
Sbjct: 1165 PKSDGCARTSINGWHWHAWSLKASAEERARVRGSSCVHMQHFGSKSSLTQNV----LSAR 1220

Query: 769  TNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG 828
            TNR KLRNLLAAA+GA++LK SQ+KARKK LRFQ+SKIHDWGLVALEPIEAEDFVIEYVG
Sbjct: 1221 TNRAKLRNLLAAADGADVLKMSQLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVG 1280

Query: 829  ELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
            ELIR  IS+IRE +YEKMGIGSSYLFRLDDGYV+DATKRGGIARFINHSC PNCYTK+IS
Sbjct: 1281 ELIRSSISEIRERQYEKMGIGSSYLFRLDDGYVLDATKRGGIARFINHSCEPNCYTKIIS 1340

Query: 889  VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            VEG+KKIFIYAKRHI AGEEI+YNYKFPLE+ KIPC CG+ KC GSLN
Sbjct: 1341 VEGKKKIFIYAKRHIDAGEEISYNYKFPLEDDKIPCNCGAPKCRGSLN 1388



 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 111/342 (32%), Positives = 166/342 (48%), Gaps = 38/342 (11%)

Query: 86  KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKIT--- 142
           KS GSIE F  S + VC+ L  HCMQ+MWNAVF D VA +SS WRK K+W     I+   
Sbjct: 435 KSVGSIENFQTSCSAVCRTLHHHCMQIMWNAVFYDTVATHSSCWRKNKIWFRSSDISTVN 494

Query: 143 ---GPASDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGE 199
              G  + Y D  +  E    R DSS S       F +                      
Sbjct: 495 YCKGSHTKYSDKPESFESFTCRVDSSSSKTAYSDEFDL--------------------AT 534

Query: 200 NLSKQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGI 259
           N ++ + LS + +     +  I + VENEL+LS K   T+Y  IL++D      S+++  
Sbjct: 535 NGARVRGLSSDTYGTESVIASISEHVENELFLSLKTHLTDYTSILIKDGANNTTSSARDG 594

Query: 260 NMKED---VVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKP--- 313
            M E+    +  S  + +  +      NG   + +    +  FS  DS+ L Q G+    
Sbjct: 595 KMHEENPPSLQGSFREQYNLEGSSKKKNGLNVVPAKLRFSNDFS--DSQRLLQEGESSEQ 652

Query: 314 -LSKDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWS 372
             S+D+++NI + A + S    V++ +D L   EP PPG E ++   +P    K+Q   S
Sbjct: 653 ITSEDIIANIFSTALETS-DIPVNDELDALAIHEPPPPGCESNIN--MPCLRYKYQPVRS 709

Query: 373 DEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLA 414
            E   ++  YV++A+CRQKLH  V+ +WKSLF+   L +FLA
Sbjct: 710 KESIPEIKAYVSMALCRQKLHNDVMRDWKSLFLKCYLNEFLA 751


>gi|222617469|gb|EEE53601.1| hypothetical protein OsJ_36855 [Oryza sativa Japonica Group]
          Length = 1165

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 259/613 (42%), Positives = 334/613 (54%), Gaps = 91/613 (14%)

Query: 335  VDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHA 394
            +D+  DE+      PPG E  +  LV     K+Q S S +    +  Y + A  RQ LH 
Sbjct: 627  LDDSFDEV------PPGTEAGLASLVIMEKNKYQPSKSVDSVLDIYRYTSWAFFRQILHE 680

Query: 395  IVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHNEHHGDTSTVVDKLKE 454
             V+ EW SLF   AL      W   K                          T+ D L+ 
Sbjct: 681  SVMKEWASLF-SGALSNCFDSWYARKNIV---------------------AKTMDDTLRP 718

Query: 455  GSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAFQTEHVEKSRKQGVAGD 514
                          ++YTY+RK+KL +       NC  S       E + +  +  V   
Sbjct: 719  --------------KEYTYYRKRKLRK-------NCEASSSEKPMDEQLSRPLRDLVECK 757

Query: 515  V-FENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSS 573
            V  +N      A  S+++    +++  SKK       + PS     N +  +      S 
Sbjct: 758  VNMKNIHRSSKAGISQRVS---VVEKPSKK------RAKPSHNDNIN-LNIQQDLKLLSD 807

Query: 574  KVKSKLPSGYSSAKSTISQKVM---KVTSAVQRDKVPVPKPSGEMLSTLSADGNDVGKVV 630
            KV  +  S + ++K  +S KV    + TSA+     P  K   + L+T   + N   K V
Sbjct: 808  KVPKRNRSSHPTSKPLVSSKVPTEDRTTSAM-----PAKKRKQKNLAT---ESNLKTKAV 859

Query: 631  RGKAHNVGIEKDS------ILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTA 684
                 + G E  +      I+  +K K N + ESK K K       L + ++        
Sbjct: 860  ILSPESHGCEAPTEKRTTAIMPVNKRKKNLSGESKLKAK------PLTSPESYVCEAPID 913

Query: 685  KQAASRQVAMKKTKASKSRTSNL-------CPRSDGCARSSISGWEWHKWSLNASPAERA 737
             +  S   A K+ + + S  SNL       CP S GCAR+S+SGWEW  W+ NA+P+ERA
Sbjct: 914  NRTTSTMPAKKRKQKNLSNESNLKKKPLVLCPESYGCARASVSGWEWRDWARNATPSERA 973

Query: 738  RVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
            +VRG + V +    PE N  + +  KG SARTNRVKLRNLLAAAEG +LLK  Q K+RKK
Sbjct: 974  QVRGYR-VRSILSAPENNVLKSSQVKGSSARTNRVKLRNLLAAAEGTDLLKIMQSKSRKK 1032

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            RLRFQRSKIH+WGLVALE I+AEDFVIEYVGELIR ++SDIRE +YEK GIGSSYLFRLD
Sbjct: 1033 RLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQVSDIREDQYEKSGIGSSYLFRLD 1092

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
            D YVVDATKRGG+ARFINHSC+PNCYTKVI+VEGQKKI IYAKR I AGEE+TYNYKFPL
Sbjct: 1093 DDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKKIVIYAKRRIYAGEELTYNYKFPL 1152

Query: 918  EEKKIPCYCGSKK 930
            EEKKIPC+CGS++
Sbjct: 1153 EEKKIPCHCGSQR 1165



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 101 VCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGP-ASDYKDDRKRMEQAP 159
           V + +++  M+ +W+ V  D V +Y  AW KRK  S       P ASD +  +   E +P
Sbjct: 404 VWETIYNKSMKSIWDEVLYDPVMDYCDAWLKRKNESNLLSTVVPGASDNQKMQDTDEMSP 463

Query: 160 S---RHDSSVSDDDCPPGFG 176
                 D+  SD D PPGFG
Sbjct: 464 KAICDSDAPESDMDFPPGFG 483


>gi|218187240|gb|EEC69667.1| hypothetical protein OsI_39097 [Oryza sativa Indica Group]
          Length = 1167

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 259/613 (42%), Positives = 334/613 (54%), Gaps = 91/613 (14%)

Query: 335  VDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHA 394
            +D+  DE+      PPG E  +  LV     K+Q S S +    +  Y + A  RQ LH 
Sbjct: 629  LDDSFDEV------PPGTEAGLASLVIMEKNKYQPSKSVDSVLDIYRYTSWAFFRQILHE 682

Query: 395  IVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHNEHHGDTSTVVDKLKE 454
             V+ EW SLF   AL      W   K                          T+ D L+ 
Sbjct: 683  SVMKEWASLF-SGALSNCFDSWYARKNIV---------------------AKTMDDTLRP 720

Query: 455  GSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAFQTEHVEKSRKQGVAGD 514
                          ++YTY+RK+KL +       NC  S       E + +  +  V   
Sbjct: 721  --------------KEYTYYRKRKLRK-------NCEASSSEKPMDEQLSRPLRDLVECK 759

Query: 515  V-FENAKVQPSAVSSKKIGKNKLIDASSKKIGANKFTSVPSKMIGKNKVTAESSASAGSS 573
            V  +N      A  S+++    +++  SKK       + PS     N +  +      S 
Sbjct: 760  VNMKNIHRSSKAGISQRVS---VVEKPSKK------RAKPSHNDNIN-LNIQQDLKLLSD 809

Query: 574  KVKSKLPSGYSSAKSTISQKVM---KVTSAVQRDKVPVPKPSGEMLSTLSADGNDVGKVV 630
            KV  +  S + ++K  +S KV    + TSA+     P  K   + L+T   + N   K V
Sbjct: 810  KVPKRNRSSHPTSKPLVSSKVPTEDRTTSAM-----PAKKRKQKNLAT---ESNLKTKAV 861

Query: 631  RGKAHNVGIEKDS------ILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTA 684
                 + G E  +      I+  +K K N + ESK K K       L + ++        
Sbjct: 862  ILSPESHGCEAPTEKRTTAIMPVNKRKKNLSGESKLKAK------PLTSPESYVCEAPID 915

Query: 685  KQAASRQVAMKKTKASKSRTSNL-------CPRSDGCARSSISGWEWHKWSLNASPAERA 737
             +  S   A K+ + + S  SNL       CP S GCAR+S+SGWEW  W+ NA+P+ERA
Sbjct: 916  NRTTSTMPAKKRKQKNLSNESNLKKKPLVLCPESYGCARASVSGWEWRDWARNATPSERA 975

Query: 738  RVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
            +VRG + V +    PE N  + +  KG SARTNRVKLRNLLAAAEG +LLK  Q K+RKK
Sbjct: 976  QVRGYR-VRSILSAPENNVLKSSQVKGSSARTNRVKLRNLLAAAEGTDLLKIMQSKSRKK 1034

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            RLRFQRSKIH+WGLVALE I+AEDFVIEYVGELIR ++SDIRE +YEK GIGSSYLFRLD
Sbjct: 1035 RLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQVSDIREDQYEKSGIGSSYLFRLD 1094

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
            D YVVDATKRGG+ARFINHSC+PNCYTKVI+VEGQKKI IYAKR I AGEE+TYNYKFPL
Sbjct: 1095 DDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKKIVIYAKRRIYAGEELTYNYKFPL 1154

Query: 918  EEKKIPCYCGSKK 930
            EEKKIPC+CGS++
Sbjct: 1155 EEKKIPCHCGSQR 1167



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 101 VCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGP-ASDYKDDRKRMEQAP 159
           V + +++  M+ +W+ V  D V +Y  AW KRK  S       P ASD +  +   E +P
Sbjct: 406 VWETIYNKSMKSIWDEVLYDPVMDYCDAWLKRKNESNLLSTVVPGASDNQKMQDTDEMSP 465

Query: 160 S---RHDSSVSDDDCPPGFG 176
                 D+  SD D PPGFG
Sbjct: 466 KAICDSDAPESDMDFPPGFG 485


>gi|296085637|emb|CBI29431.3| unnamed protein product [Vitis vinifera]
          Length = 1127

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 270/701 (38%), Positives = 379/701 (54%), Gaps = 62/701 (8%)

Query: 86   KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
            KS G IE F  +  +VC+ +FD CMQVMWNAVF   VAEY S WRKRK WSGHP+I  PA
Sbjct: 356  KSVGGIENFQHTCMVVCRTIFDSCMQVMWNAVFYAPVAEYCSTWRKRKRWSGHPRIMHPA 415

Query: 146  SD----YKDDRKRMEQA-----PSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVP 196
             +    ++D+ ++ E+         H+ SV + DCPPGFG+V    +  +Q   + LS  
Sbjct: 416  VEQAMLFRDNVEKSEKLIDEPLQEEHEYSVCEVDCPPGFGLVMTDQDIHIQS-SVGLSSS 474

Query: 197  VGENLSKQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSAS 256
              E +  ++    ++    DD++CI++ V+NEL LS K    E VE  +E+EV  ++ + 
Sbjct: 475  TVEGIPFKEKRPSDNVQPYDDMQCIVETVQNELQLSAKMMLVECVEAFIEEEVMNLIDSF 534

Query: 257  KGINMKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLSK 316
            K   +KE   D S    H  +   +D+  G+RI+S  T AE+  S DS +  Q+      
Sbjct: 535  KDKKLKEGTSDFSIQCPHANEDASSDMVSGLRIES--TVAEMILSVDSCTPQQSPTDFH- 591

Query: 317  DLLSNILACAFKRSFSGFVDNVVD------ELETDEPSPPGFEDSVRKLVPSCNGKFQFS 370
              L N  + +    F   ++ +        + + DEP PPGFE + R  VPS   +F+ S
Sbjct: 592  --LPNNASVSVSEHFMSKLNKLCTTDDVVDDQDIDEPPPPGFEYNSRTFVPSQICRFRPS 649

Query: 371  WSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEK 430
             SDE T  +GEYVA+A+CRQ+LH  V+ EWK L V+  L QF A W   K+ C++ G E 
Sbjct: 650  SSDECTPIIGEYVALALCRQRLHEDVLQEWKDLLVEGTLDQFFASWWTSKQRCDSTGCE- 708

Query: 431  AEGASNAHNEHHGDTSTVVDKLKEGSKRFHS---SEASTMVEKYTYHRKKKLLRKKFGSP 487
             EG SN++ E   D+S   D+ +E +K  HS    E S ++ KYTY+RKKKL+RKK GS 
Sbjct: 709  -EGVSNSNKEKPCDSSAASDQRRERTKDRHSLGSPELSLVIGKYTYYRKKKLVRKKIGSL 767

Query: 488  SNCSNSVENAFQTEHVEKSRKQGVAGDVFENAKVQPSAVSSKKIGKNKLIDASSKKIGAN 547
            S+ + SV++  Q + +EKSRKQ V GDV E  +V+   +  +KIG N             
Sbjct: 768  SHAAASVDSGSQDQLMEKSRKQDVPGDVSEITEVEMGILKRRKIGLN------------- 814

Query: 548  KFTSVPSKMIGKNKVTAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVP 607
                           T  +  ++  + V+S LP   SS +   +++  K    V+  +V 
Sbjct: 815  ---------------TCHAEDNSLQAIVQSTLPGDSSSVRIKPNRRSTKCAHVVRNGEVI 859

Query: 608  VPKPSG--EMLSTLSADGNDVGKVVR--GKAHNVGIEKDSILDSSKSKPNATKESKQKRK 663
                +   E  S  + D + V KVV   G  H+VG  K+   D SK    +TK SK+KRK
Sbjct: 860  EDDLACGREEASPFAEDCDFVDKVVNSNGNGHDVGNLKELAGDCSKKT-KSTKVSKKKRK 918

Query: 664  RTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWE 723
               D     + K LK A G AKQ   RQVA+ K+K SK +T N C RS GCARSSI+GW+
Sbjct: 919  DLKDVPSSRSAKVLKPANGAAKQDTGRQVAVHKSKFSKFKTLNPCLRSVGCARSSINGWD 978

Query: 724  WHKWSLNASPAERARVRG---AQYVHTKYLGPEVNASQWAN 761
            W  WSLNASP ERA VRG   AQ+   +Y   EV +SQ +N
Sbjct: 979  WRNWSLNASPTERAHVRGIHKAQFACDQYFRSEVVSSQLSN 1019



 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 65/75 (86%), Positives = 66/75 (88%)

Query: 862  VDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 921
            VDATKRGGIARFINHSC PNCYTKVISVEG+KKIFIYAKR I AGEEITYNYKFPLEEKK
Sbjct: 1020 VDATKRGGIARFINHSCEPNCYTKVISVEGEKKIFIYAKRQITAGEEITYNYKFPLEEKK 1079

Query: 922  IPCYCGSKKCHGSLN 936
            IPC CGSK     LN
Sbjct: 1080 IPCNCGSKSFRMPLN 1094


>gi|413916020|gb|AFW55952.1| putative SET-domain containing protein family [Zea mays]
          Length = 710

 Score =  370 bits (949), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 191/313 (61%), Positives = 227/313 (72%), Gaps = 19/313 (6%)

Query: 627 GKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTAKQ 686
            K V   A+++ I++D  L SS+  P   + S   +K  +      A +   V    A  
Sbjct: 408 AKTVANDAYDLNIQQDLTLLSSEV-PKRARSSHPTKKHMV------ANRTPTVNDNVANN 460

Query: 687 AA-SRQVAMKKTKASKSRTSN------LCPRSDGCARSSISGWEWHKWSLNASPAERARV 739
           +  ++ V  KK +   S TS        CP SDGCAR+SI+GWEW  W+ NA+P+ERARV
Sbjct: 461 SMLTKHVKKKKGRNISSETSQKVKPVISCPESDGCARASINGWEWRNWARNATPSERARV 520

Query: 740 RGAQYVHTKYLGPEVNASQWAN--GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
           RG    H + +    N + W N   K  SARTNRVKLRNLLAAAEG+ELLK +Q+KARKK
Sbjct: 521 RG---YHVRTILSASNDNVWKNSQAKVSSARTNRVKLRNLLAAAEGSELLKITQMKARKK 577

Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
           RLRFQRSKIHDWGLVALE IEAEDFVIEYVG+LI  ++SDIRE +YEK GIGSSYLFRLD
Sbjct: 578 RLRFQRSKIHDWGLVALESIEAEDFVIEYVGQLIHRRVSDIRESQYEKSGIGSSYLFRLD 637

Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
           D +VVDATKRGG+ARFINHSC PNCYTKVI+V+GQKKIFIYAKR I AGEEITYNYKFPL
Sbjct: 638 DDFVVDATKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYAKRRIYAGEEITYNYKFPL 697

Query: 918 EEKKIPCYCGSKK 930
           EEKKIPC+CGS++
Sbjct: 698 EEKKIPCHCGSRR 710



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 72/316 (22%), Positives = 115/316 (36%), Gaps = 49/316 (15%)

Query: 156 EQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQKNLSCNDHLLL 215
           E +P   D++  D D PPGFG        +       L V     L  +   S    L  
Sbjct: 28  ELSPKDLDATECDMDFPPGFG--PCWKSPESSLSPSLLEVNGSAKLDGKSESSTT--LFS 83

Query: 216 DDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINMKEDVVDPSSH---- 271
             +  +   + NELY+S+K +   Y E ++ D++   +      N+ ++ +    H    
Sbjct: 84  GPLAVVQRMLANELYISSKQSLFHYFEEVIADDITNCLCFGLESNIDQEQIGTPIHAPES 143

Query: 272 ------DLHTCQCGFADVNGG------------------MRID--------SNETSAEIF 299
                  +H       ++ GG                  M +D        +  +  E  
Sbjct: 144 PISAETSMHETANSI-EMTGGDVLNIVEIATSTRTSPLEMTVDEKLNTVEATRTSRTEPT 202

Query: 300 SSEDSKSLFQAGKPLSKDLLSNILACAFKRSFSGFVDNVVDELETDE---PSPPGFEDSV 356
           S E S     A   ++   +   L+ ++ R F     N+    E DE     PPG E  +
Sbjct: 203 SVETSTVAEMATDKMTTYHVEEHLSMSYARVFEKM--NICKTAELDEKFDEVPPGMETGL 260

Query: 357 RKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLALW 416
             L       +Q S S      +  Y+ +A+CRQKLH  VV EW  LF  D + + L  W
Sbjct: 261 VPLPLMDKNIYQPSKSVNSIPLISRYITLALCRQKLHENVVREWTYLF-SDTISKCLGSW 319

Query: 417 CNMKECC--EADGNEK 430
              +      ADG+ K
Sbjct: 320 YTRRNAVPKSADGSSK 335


>gi|297791815|ref|XP_002863792.1| hypothetical protein ARALYDRAFT_494795 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297309627|gb|EFH40051.1| hypothetical protein ARALYDRAFT_494795 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1414

 Score =  369 bits (946), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 180/260 (69%), Positives = 207/260 (79%), Gaps = 4/260 (1%)

Query: 670  ELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSL 729
            E    K L V  G  K  A + +    +K SK +    CP+ DGCAR+SI+GW W  WSL
Sbjct: 1130 ERSPIKDLSVDDGRPKSIALKPLEKLSSKPSKKKVLLSCPKCDGCARTSINGWHWRAWSL 1189

Query: 730  NASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKA 789
             AS  ER RVRG+  VH ++ G + N++Q      LSARTNR K+RNLLAAA+GA++LK 
Sbjct: 1190 KASAEERTRVRGSSSVHIQHFGSKSNSTQ----NVLSARTNRAKMRNLLAAADGADVLKI 1245

Query: 790  SQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
            SQ+KARKKRLRFQ+SKIHDWGLVALEPIEAEDFVIEYVGELIR  IS+IRE +YEKMGIG
Sbjct: 1246 SQLKARKKRLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSISEIRERQYEKMGIG 1305

Query: 850  SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
            SSYLFRLDDGYVVDATKRGGIARFINHSC PNCYTK+ISVEG+KKIFIYAKRHI AGEEI
Sbjct: 1306 SSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKIISVEGKKKIFIYAKRHIDAGEEI 1365

Query: 910  TYNYKFPLEEKKIPCYCGSK 929
            +YNYKFPLE+ KIPC CG++
Sbjct: 1366 SYNYKFPLEDDKIPCNCGAQ 1385



 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 167/343 (48%), Gaps = 41/343 (11%)

Query: 86  KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPA 145
           KS GSIE F  S + VC+ L +HCMQ+MWNAVF D VA +S +WRK+KLW          
Sbjct: 438 KSVGSIENFQTSCSAVCRTLHNHCMQIMWNAVFYDTVATHSLSWRKKKLWF--------- 488

Query: 146 SDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSKQK 205
                D   +      H +     D P  F     R ++       S  + +  N +  +
Sbjct: 489 --RSSDNSTLNYCKGSHRNY---SDKPEAFKSFTCRVDSFSSKTAHSNELDLPTNGASVR 543

Query: 206 NLSCNDHLLLDD------VKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGI 259
            +S     L D       +  I + VE+EL+LS +   T+Y  +L++D      + ++  
Sbjct: 544 GVSSRKVTLPDTDETESLIASISEHVESELFLSLETHLTDYTSVLIKDGANTTTTTARDG 603

Query: 260 NMKEDVVDPSSHDLHTCQCGF-ADVNGGMRIDSNETSAEI-FSSE--DSKSLFQAGKP-- 313
            M E           +C+  +  + +G  + D N  SA++ FS++  DS+ L Q G+   
Sbjct: 604 KMHEG----------SCREQYNLEGSGEKQNDLNVVSAKLRFSNDFSDSQRLLQEGESSE 653

Query: 314 --LSKDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSW 371
              S+D+++NI +   + S     D  +D L+  EP PPG E S++  +PS   KF+   
Sbjct: 654 QITSEDIIANIFSTTLETSDIPVSDE-LDALDIHEPPPPGCESSIK--MPSLRCKFRPVR 710

Query: 372 SDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLA 414
           S E   ++  YVA A+CRQKLH  V+ EW SLF+   L +FLA
Sbjct: 711 SKESIPEIKAYVATALCRQKLHTDVMREWISLFMKCHLNEFLA 753


>gi|9759476|dbj|BAB10481.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1421

 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 200/344 (58%), Positives = 238/344 (69%), Gaps = 10/344 (2%)

Query: 589  TISQKVMKVTSAVQR----DKVPVPKPSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDSI 644
            +ISQK    +S ++R    D+     PS   LS  S D  D   V++        EK   
Sbjct: 1058 SISQKGPSQSSILKRKHQLDEKISNVPSRRRLSLSSTDSEDA--VIKEDYDVRNEEKLPC 1115

Query: 645  LDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRT 704
              S K +    K  ++++       E    K L V  G  K  A + +    +K SK + 
Sbjct: 1116 HTSDKLQKGPNKLIRRRKPLAKHTTERSPIKDLSVDDGRPKPIALKPLEKLSSKPSKKKL 1175

Query: 705  SNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKG 764
                P+SDGCAR+SI+GW WH WSL AS  ERARVRG+  VH ++ G + + +Q      
Sbjct: 1176 FLSIPKSDGCARTSINGWHWHAWSLKASAEERARVRGSSCVHMQHFGSKSSLTQ----NV 1231

Query: 765  LSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
            LSARTNR KLRNLLAAA+GA++LK SQ+KARKK LRFQ+SKIHDWGLVALEPIEAEDFVI
Sbjct: 1232 LSARTNRAKLRNLLAAADGADVLKMSQLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVI 1291

Query: 825  EYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
            EYVGELIR  IS+IRE +YEKMGIGSSYLFRLDDGYV+DATKRGGIARFINHSC PNCYT
Sbjct: 1292 EYVGELIRSSISEIRERQYEKMGIGSSYLFRLDDGYVLDATKRGGIARFINHSCEPNCYT 1351

Query: 885  KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGS 928
            K+ISVEG+KKIFIYAKRHI AGEEI+YNYKFPLE+ KIPC CG+
Sbjct: 1352 KIISVEGKKKIFIYAKRHIDAGEEISYNYKFPLEDDKIPCNCGA 1395



 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 111/339 (32%), Positives = 164/339 (48%), Gaps = 39/339 (11%)

Query: 86  KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKIT--- 142
           KS GSIE F  S + VC+ L  HCMQ+MWNAVF D VA +SS WRK K+W     I+   
Sbjct: 435 KSVGSIENFQTSCSAVCRTLHHHCMQIMWNAVFYDTVATHSSCWRKNKIWFRSSDISTVN 494

Query: 143 ---GPASDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGE 199
              G  + Y D  +  E    R DSS S       F +                      
Sbjct: 495 YCKGSHTKYSDKPESFESFTCRVDSSSSKTAYSDEFDL--------------------AT 534

Query: 200 NLSKQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGI 259
           N ++ + LS + +     +  I + VENEL+LS K   T+Y  IL++D      S+++  
Sbjct: 535 NGARVRGLSSDTYGTESVIASISEHVENELFLSLKTHLTDYTSILIKDGANNTTSSARDG 594

Query: 260 NMKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKP----LS 315
            M E     S  + +  +      NG   + +    +  FS  DS+ L Q G+      S
Sbjct: 595 KMHEG----SFREQYNLEGSSKKKNGLNVVPAKLRFSNDFS--DSQRLLQEGESSEQITS 648

Query: 316 KDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEF 375
           +D+++NI + A + S    V++ +D L   EP PPG E ++   +P    K+Q   S E 
Sbjct: 649 EDIIANIFSTALETS-DIPVNDELDALAIHEPPPPGCESNIN--MPCLRYKYQPVRSKES 705

Query: 376 TTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLA 414
             ++  YV++A+CRQKLH  V+ +WKSLF+   L +FLA
Sbjct: 706 IPEIKAYVSMALCRQKLHNDVMRDWKSLFLKCYLNEFLA 744


>gi|30694058|ref|NP_199055.2| histone-lysine N-methyltransferase SETD1 [Arabidopsis thaliana]
 gi|332007422|gb|AED94805.1| histone-lysine N-methyltransferase SETD1 [Arabidopsis thaliana]
          Length = 1423

 Score =  367 bits (941), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 193/318 (60%), Positives = 226/318 (71%), Gaps = 6/318 (1%)

Query: 611  PSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLE 670
            PS   LS  S D  D   V++        EK     S K +    K  ++++       E
Sbjct: 1086 PSRRRLSLSSTDSEDA--VIKEDYDVRNEEKLPCHTSDKLQKGPNKLIRRRKPLAKHTTE 1143

Query: 671  LHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLN 730
                K L V  G  K  A + +    +K SK +     P+SDGCAR+SI+GW WH WSL 
Sbjct: 1144 RSPIKDLSVDDGRPKPIALKPLEKLSSKPSKKKLFLSIPKSDGCARTSINGWHWHAWSLK 1203

Query: 731  ASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKAS 790
            AS  ERARVRG+  VH ++ G + + +Q      LSARTNR KLRNLLAAA+GA++LK S
Sbjct: 1204 ASAEERARVRGSSCVHMQHFGSKSSLTQ----NVLSARTNRAKLRNLLAAADGADVLKMS 1259

Query: 791  QVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS 850
            Q+KARKK LRFQ+SKIHDWGLVALEPIEAEDFVIEYVGELIR  IS+IRE +YEKMGIGS
Sbjct: 1260 QLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSISEIRERQYEKMGIGS 1319

Query: 851  SYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEIT 910
            SYLFRLDDGYV+DATKRGGIARFINHSC PNCYTK+ISVEG+KKIFIYAKRHI AGEEI+
Sbjct: 1320 SYLFRLDDGYVLDATKRGGIARFINHSCEPNCYTKIISVEGKKKIFIYAKRHIDAGEEIS 1379

Query: 911  YNYKFPLEEKKIPCYCGS 928
            YNYKFPLE+ KIPC CG+
Sbjct: 1380 YNYKFPLEDDKIPCNCGA 1397



 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 111/339 (32%), Positives = 164/339 (48%), Gaps = 39/339 (11%)

Query: 86  KSGGSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKIT--- 142
           KS GSIE F  S + VC+ L  HCMQ+MWNAVF D VA +SS WRK K+W     I+   
Sbjct: 435 KSVGSIENFQTSCSAVCRTLHHHCMQIMWNAVFYDTVATHSSCWRKNKIWFRSSDISTVN 494

Query: 143 ---GPASDYKDDRKRMEQAPSRHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGE 199
              G  + Y D  +  E    R DSS S       F +                      
Sbjct: 495 YCKGSHTKYSDKPESFESFTCRVDSSSSKTAYSDEFDL--------------------AT 534

Query: 200 NLSKQKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGI 259
           N ++ + LS + +     +  I + VENEL+LS K   T+Y  IL++D      S+++  
Sbjct: 535 NGARVRGLSSDTYGTESVIASISEHVENELFLSLKTHLTDYTSILIKDGANNTTSSARDG 594

Query: 260 NMKEDVVDPSSHDLHTCQCGFADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKP----LS 315
            M E     S  + +  +      NG   + +    +  FS  DS+ L Q G+      S
Sbjct: 595 KMHEG----SFREQYNLEGSSKKKNGLNVVPAKLRFSNDFS--DSQRLLQEGESSEQITS 648

Query: 316 KDLLSNILACAFKRSFSGFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEF 375
           +D+++NI + A + S    V++ +D L   EP PPG E ++   +P    K+Q   S E 
Sbjct: 649 EDIIANIFSTALETS-DIPVNDELDALAIHEPPPPGCESNIN--MPCLRYKYQPVRSKES 705

Query: 376 TTKMGEYVAIAMCRQKLHAIVVGEWKSLFVDDALQQFLA 414
             ++  YV++A+CRQKLH  V+ +WKSLF+   L +FLA
Sbjct: 706 IPEIKAYVSMALCRQKLHNDVMRDWKSLFLKCYLNEFLA 744


>gi|242084688|ref|XP_002442769.1| hypothetical protein SORBIDRAFT_08g002530 [Sorghum bicolor]
 gi|241943462|gb|EES16607.1| hypothetical protein SORBIDRAFT_08g002530 [Sorghum bicolor]
          Length = 1033

 Score =  365 bits (937), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 190/311 (61%), Positives = 226/311 (72%), Gaps = 15/311 (4%)

Query: 627  GKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKGTAKQ 686
             K V   AH++ I++D  L SS+  P   + S   +K  +      A +   V    A  
Sbjct: 731  AKPVANDAHDLNIQQDLTLLSSEV-PKRARSSHPTKKHMV------ANRTPTVNDNVANN 783

Query: 687  AA-SRQVAMKKTKASKSRTSN------LCPRSDGCARSSISGWEWHKWSLNASPAERARV 739
            +  ++ V  KK +   S TS        CP SDGCAR+SI+GWEW  W+ NA+P+ERARV
Sbjct: 784  SMLTKHVKKKKGRDISSETSQKVKPMISCPESDGCARASINGWEWRNWARNATPSERARV 843

Query: 740  RGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRL 799
            RG + V T       N  + +  K  SARTNRVKLRNLLAAAEGAELLK +Q+KARKKRL
Sbjct: 844  RGYR-VRTILSASNNNVWKNSQAKVSSARTNRVKLRNLLAAAEGAELLKITQMKARKKRL 902

Query: 800  RFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDG 859
            RFQRSKIH+WGLVALE IEAEDFVIEYVG+LI  ++SDIRE +YEK GIGSSYLFRLDD 
Sbjct: 903  RFQRSKIHEWGLVALELIEAEDFVIEYVGQLIHRRVSDIRESQYEKSGIGSSYLFRLDDD 962

Query: 860  YVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE 919
            +VVDATKRGG+ARFINHSC PNCYTKVI+V+GQKKIFIYAKR I AGEEITYNYKFPLEE
Sbjct: 963  FVVDATKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYAKRRIYAGEEITYNYKFPLEE 1022

Query: 920  KKIPCYCGSKK 930
            +KIPC+CGS++
Sbjct: 1023 EKIPCHCGSRR 1033



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 65/148 (43%), Gaps = 9/148 (6%)

Query: 289 IDSNETS-AEIFSSEDSKSLFQAGKPLSKDLLSNILACAFKRSFSGFVDNVVDELETDE- 346
           +++  TS  E  S+E S +   A   +    +   L+ ++ R F     ++    E DE 
Sbjct: 514 VEATRTSLTEPTSAETSTAAEMATDKMPTSHVEEHLSMSYARIFEKM--DICKTAELDEK 571

Query: 347 --PSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQKLHAIVVGEWKSLF 404
               PPG E  +  +       +Q S S      +  Y+ +A+CRQKLH  VV EW SLF
Sbjct: 572 FDEVPPGMETGLVPVPLMDKNIYQPSKSMNSIPLISRYITLALCRQKLHENVVREWTSLF 631

Query: 405 VDDALQQFLALWCNMKECC--EADGNEK 430
             D + + L  W   +      ADG+ K
Sbjct: 632 -SDTISKCLGSWYTRRNAVPKSADGSSK 658


>gi|295913201|gb|ADG57859.1| transcription factor [Lycoris longituba]
          Length = 164

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 139/161 (86%), Positives = 152/161 (94%)

Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
           NLLAAAEGA++LK +Q+ ARKKRLRFQRS IHDWGLVALEPIE+EDFVIEYVGEL+R +I
Sbjct: 4   NLLAAAEGADILKFNQLTARKKRLRFQRSNIHDWGLVALEPIESEDFVIEYVGELVRRQI 63

Query: 836 SDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKI 895
           SDIREC+YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC PNCYTKVI+VEGQKKI
Sbjct: 64  SDIRECQYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVEGQKKI 123

Query: 896 FIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           FIYAKRHI AGEE+TYNYKFPLEE+KI C CGSK+C GS+N
Sbjct: 124 FIYAKRHIHAGEELTYNYKFPLEEQKILCNCGSKRCRGSMN 164


>gi|449525052|ref|XP_004169534.1| PREDICTED: uncharacterized LOC101220062, partial [Cucumis sativus]
          Length = 509

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 202/540 (37%), Positives = 286/540 (52%), Gaps = 70/540 (12%)

Query: 280 FADVNGGMRIDSNETSAEIFSSEDSKSLFQAGKPLSKDLLSNILACAFKRSFS------- 332
           +++  G ++ DSN+T  +  S E   +L      LS D   N +A +    F        
Sbjct: 15  YSNSFGELQCDSNDTHGDRNSGELKLALLPEVN-LSNDTALNSVANSLYEVFKEICTNEG 73

Query: 333 -GFVDNVVDELETDEPSPPGFEDSVRKLVPSCNGKFQFSWSDEFTTKMGEYVAIAMCRQK 391
             F ++     + +E   PG E+     +PS   KF+ S S++  +K+  Y+ +A+CRQK
Sbjct: 74  CAFNEDCAFNEDCNELLAPGLEEHPTFQIPSPACKFRPSSSNKCYSKIEGYIMLAICRQK 133

Query: 392 LHAIVVGEWKSLFVDDALQQFLALWCNMKECCEADGNEKAEGASNAHNEHHGDTSTVVDK 451
           LH  V+ EW S + DD L+QF++ W   K+ C  + N   EGA +      G+ S V DK
Sbjct: 134 LHDAVLKEWTSSYKDDLLRQFVSSWIASKKHC--NSNRIVEGACDG-----GEASKVPDK 186

Query: 452 LKEGSKRFHSSEASTMVEKYTYHRKKKLLRKKFGSPSNCSNSVENAFQTEHVEKSRKQGV 511
           L+EGS+RF   E+S +   YTY+RKK   ++K GS S+C+       + +  EKSRK+ +
Sbjct: 187 LREGSERFL--ESSLVTGNYTYYRKKSS-KRKLGS-SDCATEGSPVVRNQPSEKSRKENI 242

Query: 512 AGDVFENAKVQPSAVSSKKIGKNKL-----IDASSKKIGANKFTSVPSKMIGKNKVTAES 566
           +  V E    + ++++ K I KNK      I A+ K+  A              +VT   
Sbjct: 243 SVGVCETTDSEIASLTLKSIAKNKRKKDLSIKATCKRTCA--------------EVT--- 285

Query: 567 SASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRD---KVPVPKPSGEMLSTLSADG 623
                       LPS +SS K+    K +K +  V+ D   K  V    G M+ +     
Sbjct: 286 ------------LPSSHSSGKTICGTKKLKFSPPVKDDNAKKDSVKHGKGRMIGSPLMIK 333

Query: 624 NDVGKVVRGKAHNVGI-EKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKG 682
           N V +V+      VG  EK S+             SK KRK+ +D   L   K L VA  
Sbjct: 334 N-VDQVMNKCDRGVGAQEKLSV-----------NVSKIKRKQKVDEASLLGNKVLTVADD 381

Query: 683 TAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGA 742
            +KQAAS++V  +K K+ KSR  N+   SDGCARSSI+GWEW +W+L ASPAERAR RG 
Sbjct: 382 FSKQAASKRVVAQKKKSDKSRKLNISIISDGCARSSINGWEWRRWTLKASPAERARNRGF 441

Query: 743 QYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ 802
           QY ++  +GP+V+ S   NGKGLSARTNRVKLRNLLAAA+GA+LLKASQ+KARK    F 
Sbjct: 442 QYFYSDPIGPDVSTSHLLNGKGLSARTNRVKLRNLLAAADGADLLKASQLKARKNVYAFN 501


>gi|168031561|ref|XP_001768289.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
            patens subsp. patens]
 gi|162680467|gb|EDQ66903.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
            patens subsp. patens]
          Length = 1655

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 131/251 (52%), Positives = 173/251 (68%), Gaps = 31/251 (12%)

Query: 709  PRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK----- 763
            P S GCAR SI+GW+W  W+ +   A+R   R  +    K +  ++   + A  K     
Sbjct: 1388 PTSVGCARCSITGWDWRTWARDR--AKRRLQRRVKAAIRKEVKQDLKKLKRATKKNVTNV 1445

Query: 764  ----------GL-SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLV 812
                      GL +AR NR  +R L   AEG++LL+ + +KARKK+L+FQRSKIHDWGLV
Sbjct: 1446 NNSTTTTVIAGLQAARKNRADMRKLAVGAEGSDLLRFNMLKARKKQLKFQRSKIHDWGLV 1505

Query: 813  ALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYV----------- 861
            A EPI+AE+FVIEYVGE+IR +++DIRE RYE +GIGSSY+FR+DD +            
Sbjct: 1506 AAEPIDAEEFVIEYVGEVIRNRVTDIREKRYEAIGIGSSYMFRVDDEHTLNTFIVLDHVP 1565

Query: 862  --VDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE 919
              +DAT+RGG+ARFINHSC+PNCYTK+I+VEGQKK+ IY+K+ I  GEE+TY+YKF LEE
Sbjct: 1566 SQLDATRRGGLARFINHSCDPNCYTKIITVEGQKKVVIYSKQRIVPGEELTYDYKFSLEE 1625

Query: 920  KKIPCYCGSKK 930
             KIPC+CG+ K
Sbjct: 1626 VKIPCFCGAAK 1636


>gi|168037139|ref|XP_001771062.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
            patens subsp. patens]
 gi|162677595|gb|EDQ64063.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
            patens subsp. patens]
          Length = 2607

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 132/277 (47%), Positives = 172/277 (62%), Gaps = 61/277 (22%)

Query: 710  RSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLG---------PEVNASQWA 760
            +S GCAR SISGW+W +W+ + +   R R +    + T  +          P  +++ + 
Sbjct: 2335 KSFGCARCSISGWKWRRWAHHGA-KHRLRQKVQDSISTSSISRGMIDSVVRPNPSSATYL 2393

Query: 761  NGKGLSARTNRVKLRNLLAAAEGAELLKASQV---------------------------- 792
                 SARTNR  LR L++  EG++ LK SQ+                            
Sbjct: 2394 Q----SARTNRAALRKLISVVEGSDKLKLSQLTVIPAIPLPSYQTKLSIKTSLRIPGINR 2449

Query: 793  -------------------KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRL 833
                                + KK L+FQRSKIHDWGL+ALE IEAEDFVIEYVGE+IR 
Sbjct: 2450 SRRSVINAGPSQSSIAVLCSSMKKHLKFQRSKIHDWGLLALESIEAEDFVIEYVGEIIRR 2509

Query: 834  KISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK 893
            ++S+ RE +YE MGIGSSYLFR+DD  VVDAT++GG+ARFINHSCNPNCYTK+I+VEG+K
Sbjct: 2510 QVSNFRERQYEIMGIGSSYLFRVDDELVVDATQKGGLARFINHSCNPNCYTKIITVEGRK 2569

Query: 894  KIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            K+ IY+KR I AGEE+TY+YKF LE+KKIPCYCG+ +
Sbjct: 2570 KVVIYSKRAIGAGEELTYDYKFSLEDKKIPCYCGAPR 2606


>gi|302757968|ref|XP_002962407.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
 gi|300169268|gb|EFJ35870.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
          Length = 1326

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 115/228 (50%), Positives = 160/228 (70%), Gaps = 19/228 (8%)

Query: 709  PRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSAR 768
            P+  GCARSS+ G EW  W  +             Y     +  + + ++   G   SAR
Sbjct: 1118 PQLPGCARSSLPGSEWKLWKTS-------------YESYCCIDRKQSRAKERAGTLPSAR 1164

Query: 769  TNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG 828
            +NR  +R L     G+  ++  +++ARKK L+FQRSKIH WG+VA+E IE EDF++EYVG
Sbjct: 1165 SNRAGVRKL----GGSRAME--KMRARKKLLKFQRSKIHAWGVVAMEVIEPEDFIVEYVG 1218

Query: 829  ELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
            E++R K++D+RE RY + G+GSSY FR+ DG+V+DAT+RGG+ RFINHSC PNCY K+I+
Sbjct: 1219 EVLRPKVADVREVRYLRQGLGSSYFFRVGDGFVIDATQRGGLGRFINHSCEPNCYPKIIT 1278

Query: 889  VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            VEGQK++FIYA+ HIA G E+TY+YKFP E++KIPC CG+++C G LN
Sbjct: 1279 VEGQKRVFIYARTHIAPGTELTYDYKFPHEDQKIPCLCGAERCRGFLN 1326


>gi|347967460|ref|XP_307938.5| AGAP002246-PA [Anopheles gambiae str. PEST]
 gi|333466285|gb|EAA03662.5| AGAP002246-PA [Anopheles gambiae str. PEST]
          Length = 2003

 Score =  236 bits (603), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 103/176 (58%), Positives = 140/176 (79%), Gaps = 2/176 (1%)

Query: 763  KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            +G+S  AR+N+ +L N   A+  +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1828 QGISREARSNQRRLLNAFGASTESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1887

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + VIEYVG+++R  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNP
Sbjct: 1888 EMVIEYVGQMVRPSVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNP 1947

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            NCY KVI++E +KKI IY+K+ I   EEITY+YKFPLE++KIPC CG+  C G+LN
Sbjct: 1948 NCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAPGCRGTLN 2003


>gi|260791327|ref|XP_002590691.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
 gi|229275887|gb|EEN46702.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
          Length = 2482

 Score =  234 bits (598), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 119/230 (51%), Positives = 155/230 (67%), Gaps = 12/230 (5%)

Query: 713  GCARSS----ISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSAR 768
            GCARS     IS  E  K+        RAR+          L    N ++ A      AR
Sbjct: 2259 GCARSEGFYKISFKEKTKY------LHRARLASVVEKPEDDLMESQNKAKQATQSSREAR 2312

Query: 769  TNRVKLRNLLAAAEG--AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            +N+ +L     A+ G  ++LL+ +Q+K RKK+L F++S+IHDWGL ALEPI AE+ VIEY
Sbjct: 2313 SNQRRLLASFGASCGDISDLLRFNQLKFRKKQLSFRKSRIHDWGLFALEPIAAEEMVIEY 2372

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            VG+ IR  I+D RE RYE+ GIGSSYLFR+D   ++DATK G +ARFINH CNPNCY K+
Sbjct: 2373 VGQCIRQTIADERERRYEEQGIGSSYLFRVDHDMIIDATKNGNLARFINHCCNPNCYAKI 2432

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I+VEG KKI IY++R IA  EEITY+YKFP+E++KIPC CG++ C G+LN
Sbjct: 2433 ITVEGYKKIVIYSRRDIAVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 2482


>gi|157127309|ref|XP_001654916.1| hypothetical protein AaeL_AAEL010807 [Aedes aegypti]
 gi|108872954|gb|EAT37179.1| AAEL010807-PA [Aedes aegypti]
          Length = 1670

 Score =  233 bits (595), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 103/176 (58%), Positives = 140/176 (79%), Gaps = 2/176 (1%)

Query: 763  KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            +GLS  AR+N+ +L     A+  +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1495 QGLSREARSNQRRLLTAFGASTESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1554

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + VIEYVG+++R  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNP
Sbjct: 1555 EMVIEYVGQMVRPSVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNP 1614

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            NCY KVI++E +KKI IY+K+ I   EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1615 NCYAKVITIESEKKIVIYSKQAIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1670


>gi|312384250|gb|EFR29018.1| hypothetical protein AND_02361 [Anopheles darlingi]
          Length = 2074

 Score =  233 bits (593), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 102/176 (57%), Positives = 139/176 (78%), Gaps = 2/176 (1%)

Query: 763  KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            +G+S  AR+N+ +L N   A+  +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1899 QGISREARSNQRRLLNAFGASTESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1958

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + VIEYVG+++R  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNP
Sbjct: 1959 EMVIEYVGQMVRPSVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNP 2018

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            NCY K+I++E +KKI IY+K+ I   EEITY+YKFPLEE+KI C CG+  C G+LN
Sbjct: 2019 NCYAKIITIESEKKIVIYSKQPIGVNEEITYDYKFPLEEEKIRCLCGAPGCRGTLN 2074


>gi|242010887|ref|XP_002426189.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
           corporis]
 gi|212510240|gb|EEB13451.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
           corporis]
          Length = 574

 Score =  232 bits (591), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 102/177 (57%), Positives = 134/177 (75%)

Query: 760 ANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
            N     AR+N+ +L      A  ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A
Sbjct: 398 VNAASREARSNQRRLLTAFGTATDSDLLKFNQLKFRKKQLKFSKSDIHDWGLFAMEPIAA 457

Query: 820 EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
           ++ VIEYVG+++R  ++D RE  YEK GIGSSYLFR+D   ++DATK G +ARFINHSCN
Sbjct: 458 DEMVIEYVGQMVRPFLADFREKEYEKRGIGSSYLFRIDLETIIDATKCGNLARFINHSCN 517

Query: 880 PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           PNCY K+I++EGQKKI IY+K+ I   EEITY+YKFP+EE+KIPC CG+ +C G LN
Sbjct: 518 PNCYAKIITIEGQKKIVIYSKKDIKVDEEITYDYKFPIEEEKIPCLCGAAQCKGYLN 574


>gi|195388606|ref|XP_002052970.1| GJ23622 [Drosophila virilis]
 gi|194151056|gb|EDW66490.1| GJ23622 [Drosophila virilis]
          Length = 1687

 Score =  231 bits (590), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 102/175 (58%), Positives = 136/175 (77%)

Query: 762  GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1513 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1572

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 1573 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1632

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI++E +KKI IY+K+ I   EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1633 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1687


>gi|195453659|ref|XP_002073883.1| GK12911 [Drosophila willistoni]
 gi|194169968|gb|EDW84869.1| GK12911 [Drosophila willistoni]
          Length = 1765

 Score =  231 bits (589), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 102/176 (57%), Positives = 139/176 (78%), Gaps = 2/176 (1%)

Query: 763  KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            +G+S  AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1590 QGISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1649

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNP
Sbjct: 1650 EMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNP 1709

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            NCY KVI++E +KKI IY+K+ I   EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1710 NCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1765


>gi|195062427|ref|XP_001996188.1| GH22347 [Drosophila grimshawi]
 gi|193899683|gb|EDV98549.1| GH22347 [Drosophila grimshawi]
          Length = 1714

 Score =  231 bits (589), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 102/175 (58%), Positives = 136/175 (77%)

Query: 762  GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1540 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1599

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 1600 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1659

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI++E +KKI IY+K+ I   EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1660 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1714


>gi|198454568|ref|XP_002137902.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
 gi|198132853|gb|EDY68460.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
          Length = 1755

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 101/175 (57%), Positives = 136/175 (77%)

Query: 762  GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1581 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1640

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 1641 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1700

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI++E +KKI IY+K+ I   EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1701 CYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1755


>gi|195156904|ref|XP_002019336.1| GL12290 [Drosophila persimilis]
 gi|194115927|gb|EDW37970.1| GL12290 [Drosophila persimilis]
          Length = 1548

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 101/175 (57%), Positives = 136/175 (77%)

Query: 762  GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1374 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1433

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 1434 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1493

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI++E +KKI IY+K+ I   EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1494 CYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1548


>gi|195496958|ref|XP_002095897.1| GE25383 [Drosophila yakuba]
 gi|194181998|gb|EDW95609.1| GE25383 [Drosophila yakuba]
          Length = 1628

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 102/175 (58%), Positives = 136/175 (77%)

Query: 762  GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1454 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1513

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 1514 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1573

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI++E +KKI IY+K+ I   EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1574 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1628


>gi|194898301|ref|XP_001978769.1| GG11901 [Drosophila erecta]
 gi|190650472|gb|EDV47727.1| GG11901 [Drosophila erecta]
          Length = 1626

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 102/175 (58%), Positives = 136/175 (77%)

Query: 762  GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1452 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1511

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 1512 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1571

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI++E +KKI IY+K+ I   EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1572 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1626


>gi|195356446|ref|XP_002044683.1| GM18767 [Drosophila sechellia]
 gi|194133849|gb|EDW55365.1| GM18767 [Drosophila sechellia]
          Length = 1637

 Score =  230 bits (587), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 102/175 (58%), Positives = 136/175 (77%)

Query: 762  GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1463 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1522

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 1523 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1582

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI++E +KKI IY+K+ I   EEITY+YKFPLEE+KIPC CG++ C G+LN
Sbjct: 1583 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1637


>gi|332016392|gb|EGI57305.1| Histone-lysine N-methyltransferase SETD1B [Acromyrmex echinatior]
          Length = 1513

 Score =  230 bits (586), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 107/190 (56%), Positives = 142/190 (74%), Gaps = 5/190 (2%)

Query: 751  GPEVNASQWANGK--GLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
            GP+ N S+   GK   LS  AR+N+ +L         ++LLK +Q+K RKK+L+F +S I
Sbjct: 1325 GPK-NNSKALTGKMQALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGI 1383

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
            HDWGL A+EPI A++ VIEYVG+++R  ++D+RE +YE  GIGSSYLFR+D   ++DATK
Sbjct: 1384 HDWGLFAMEPIAADEMVIEYVGQMVRPVVADLREAQYEATGIGSSYLFRIDLDTIIDATK 1443

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
             G +ARFINHSCNPNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC C
Sbjct: 1444 CGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLC 1503

Query: 927  GSKKCHGSLN 936
            G+ +C G+LN
Sbjct: 1504 GAPQCRGTLN 1513


>gi|307182871|gb|EFN69932.1| Histone-lysine N-methyltransferase SETD1B [Camponotus floridanus]
          Length = 1422

 Score =  230 bits (586), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 107/190 (56%), Positives = 142/190 (74%), Gaps = 5/190 (2%)

Query: 751  GPEVNASQWANGK--GLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
            GP+ N S+   GK   LS  AR+N+ +L         ++LLK +Q+K RKK+L+F +S I
Sbjct: 1234 GPK-NNSKALTGKMQALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGI 1292

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
            HDWGL A+EPI A++ VIEYVG+++R  ++D+RE +YE  GIGSSYLFR+D   ++DATK
Sbjct: 1293 HDWGLFAMEPIAADEMVIEYVGQMVRPVVADLREAQYEATGIGSSYLFRIDLDTIIDATK 1352

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
             G +ARFINHSCNPNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC C
Sbjct: 1353 CGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLC 1412

Query: 927  GSKKCHGSLN 936
            G+ +C G+LN
Sbjct: 1413 GAPQCRGTLN 1422


>gi|340721798|ref|XP_003399301.1| PREDICTED: hypothetical protein LOC100643294 [Bombus terrestris]
          Length = 1502

 Score =  229 bits (584), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 100/170 (58%), Positives = 132/170 (77%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            AR+N+ +L         ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1333 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1392

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            VG++IR  ++D+RE +YE  GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1393 VGQMIRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1452

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1453 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1502


>gi|350408006|ref|XP_003488269.1| PREDICTED: hypothetical protein LOC100743429 [Bombus impatiens]
          Length = 1503

 Score =  229 bits (584), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 100/170 (58%), Positives = 132/170 (77%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            AR+N+ +L         ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1334 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1393

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            VG++IR  ++D+RE +YE  GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1394 VGQMIRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1453

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1454 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1503


>gi|307195485|gb|EFN77371.1| Histone-lysine N-methyltransferase SETD1B [Harpegnathos saltator]
          Length = 1490

 Score =  229 bits (583), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 107/190 (56%), Positives = 142/190 (74%), Gaps = 5/190 (2%)

Query: 751  GPEVNASQWANGK--GLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
            GP+ N S+   GK   LS  AR+N+ +L         ++LLK +Q+K RKK+L+F +S I
Sbjct: 1302 GPKSN-SKALTGKMQALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGI 1360

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
            HDWGL A+EPI A++ VIEYVG+++R  ++D+RE +YE  GIGSSYLFR+D   ++DATK
Sbjct: 1361 HDWGLFAMEPIAADEMVIEYVGQMVRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATK 1420

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
             G +ARFINHSCNPNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC C
Sbjct: 1421 CGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLC 1480

Query: 927  GSKKCHGSLN 936
            G+ +C G+LN
Sbjct: 1481 GAPQCRGTLN 1490


>gi|383849246|ref|XP_003700256.1| PREDICTED: uncharacterized protein LOC100875701 [Megachile rotundata]
          Length = 1503

 Score =  229 bits (583), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 99/170 (58%), Positives = 132/170 (77%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            AR+N+ +L         ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1334 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1393

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            VG+++R  ++D+RE +YE  GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1394 VGQMVRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1453

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1454 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1503


>gi|328778989|ref|XP_395451.4| PREDICTED: hypothetical protein LOC411985 isoform 1 [Apis mellifera]
          Length = 1503

 Score =  229 bits (583), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 99/170 (58%), Positives = 132/170 (77%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            AR+N+ +L         ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1334 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1393

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            VG+++R  ++D+RE +YE  GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1394 VGQMVRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1453

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1454 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1503


>gi|384484496|gb|EIE76676.1| hypothetical protein RO3G_01380 [Rhizopus delemar RA 99-880]
          Length = 565

 Score =  229 bits (583), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 105/178 (58%), Positives = 136/178 (76%), Gaps = 7/178 (3%)

Query: 766 SARTNRVKLRNLLA-------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIE 818
           ++RT RV  R L++       +   ++LLK +Q+K RKKRL F +S IHDWGL A E I+
Sbjct: 388 NSRTARVNNRRLVSGIMVQNKSMAESDLLKFNQLKNRKKRLIFDKSPIHDWGLYAGESID 447

Query: 819 AEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
           A D VIEY+GE+IR ++++IRE  YE++GIGSSYLFR+DD  V+DATK+GG+ARFINH C
Sbjct: 448 AHDIVIEYIGEVIRQQVAEIREKHYERIGIGSSYLFRVDDDMVIDATKKGGMARFINHCC 507

Query: 879 NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            PNC  K+I+V+ QKK+ IYA R I  GEEITY+YKFP+E +KIPC+CGSK C GSLN
Sbjct: 508 TPNCSAKIITVDKQKKVVIYANRDIEPGEEITYDYKFPIEAEKIPCFCGSKFCKGSLN 565


>gi|380013032|ref|XP_003690574.1| PREDICTED: uncharacterized protein LOC100867676 [Apis florea]
          Length = 1500

 Score =  229 bits (583), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 99/170 (58%), Positives = 132/170 (77%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            AR+N+ +L         ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEY
Sbjct: 1331 ARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEY 1390

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            VG+++R  ++D+RE +YE  GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY KV
Sbjct: 1391 VGQMVRPVVADLRESQYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKV 1450

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG+ +C G+LN
Sbjct: 1451 ITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1500


>gi|62862148|ref|NP_001015221.1| Set1, isoform A [Drosophila melanogaster]
 gi|62862150|ref|NP_001015222.1| Set1, isoform B [Drosophila melanogaster]
 gi|161076059|ref|NP_001104406.1| Set1, isoform C [Drosophila melanogaster]
 gi|281366745|ref|NP_001163846.1| Set1, isoform D [Drosophila melanogaster]
 gi|281366747|ref|NP_001163847.1| Set1, isoform E [Drosophila melanogaster]
 gi|281366749|ref|NP_001163848.1| Set1, isoform F [Drosophila melanogaster]
 gi|281366751|ref|NP_001163849.1| Set1, isoform G [Drosophila melanogaster]
 gi|281366753|ref|NP_001163850.1| Set1, isoform H [Drosophila melanogaster]
 gi|281366755|ref|NP_001163851.1| Set1, isoform I [Drosophila melanogaster]
 gi|51951109|gb|EAL24598.1| Set1, isoform A [Drosophila melanogaster]
 gi|51951110|gb|EAL24599.1| Set1, isoform B [Drosophila melanogaster]
 gi|158529717|gb|EDP28071.1| Set1, isoform C [Drosophila melanogaster]
 gi|281309231|gb|EFA98694.1| Set1, isoform D [Drosophila melanogaster]
 gi|281309232|gb|EFA98695.1| Set1, isoform E [Drosophila melanogaster]
 gi|281309233|gb|EFA98696.1| Set1, isoform F [Drosophila melanogaster]
 gi|281309234|gb|EFA98697.1| Set1, isoform G [Drosophila melanogaster]
 gi|281309235|gb|EFA98698.1| Set1, isoform H [Drosophila melanogaster]
 gi|281309236|gb|EFA98699.1| Set1, isoform I [Drosophila melanogaster]
          Length = 1641

 Score =  229 bits (583), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 101/175 (57%), Positives = 136/175 (77%)

Query: 762  GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 1467 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 1526

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 1527 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 1586

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI++E +KKI IY+K+ I   EEITY+YKFPLE++KIPC CG++ C G+LN
Sbjct: 1587 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1641


>gi|196014878|ref|XP_002117297.1| hypothetical protein TRIADDRAFT_32357 [Trichoplax adhaerens]
 gi|190580050|gb|EDV20136.1| hypothetical protein TRIADDRAFT_32357 [Trichoplax adhaerens]
          Length = 217

 Score =  228 bits (581), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 104/173 (60%), Positives = 137/173 (79%), Gaps = 3/173 (1%)

Query: 767 ARTNRVKLRNLLA---AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           +R NR   R +++   A++  +LLK +Q+K RKK+LRF RS+IH+WGL A EPI A++ V
Sbjct: 45  SRENRASNRRIISFFQASDAGDLLKLNQLKTRKKQLRFGRSQIHEWGLFAREPIAADEMV 104

Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
           IEYVG+ IR  ++D RE RYEK+GIGSSYLFR+DD Y++DATK G +ARFINHSC+PNCY
Sbjct: 105 IEYVGQTIRQTVADEREHRYEKIGIGSSYLFRIDDNYIIDATKCGNLARFINHSCSPNCY 164

Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            K+IS+E QKKI IY+K  I   EEITY+YKFP+E+ KIPC+CG+ +C G+LN
Sbjct: 165 AKIISLESQKKIVIYSKYDIQVNEEITYDYKFPIEDVKIPCHCGALQCRGALN 217


>gi|432875260|ref|XP_004072753.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Oryzias
            latipes]
          Length = 1547

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 103/167 (61%), Positives = 131/167 (78%), Gaps = 1/167 (0%)

Query: 771  RVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
            R + R LL++    ++L++ +Q+K RKKR+RF RS IH+WGL A+EPI A++ VIEYVGE
Sbjct: 1381 RSEQRRLLSSFSCDSDLVRFNQLKYRKKRIRFSRSHIHEWGLFAMEPIAADEMVIEYVGE 1440

Query: 830  LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
             IR  I+D RE RYE+ GIG SYLFR+D   ++DATK G +ARFINHSCNPNCY KVISV
Sbjct: 1441 TIRQVIADRREQRYEEEGIGGSYLFRIDQDTIIDATKCGNLARFINHSCNPNCYAKVISV 1500

Query: 890  EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            E QKKI IY+++ I+  EEITYNYKFP+E+ KIPC CG++ C GSLN
Sbjct: 1501 ESQKKIVIYSRQPISINEEITYNYKFPIEDTKIPCLCGAENCRGSLN 1547


>gi|118404602|ref|NP_001072649.1| histone-lysine N-methyltransferase SETD1B [Xenopus (Silurana)
            tropicalis]
 gi|123884540|sp|Q08D57.1|SET1B_XENTR RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=SET domain-containing protein 1B
 gi|115312893|gb|AAI23933.1| hypothetical protein MGC145850 [Xenopus (Silurana) tropicalis]
          Length = 1956

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 108/189 (57%), Positives = 140/189 (74%), Gaps = 5/189 (2%)

Query: 751  GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
            G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+LRF +S IH
Sbjct: 1770 GKSIPAQPQASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLRFCKSHIH 1827

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            DWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK 
Sbjct: 1828 DWGLFAMEPIIADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 1887

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
            G  ARFINHSCNPNCY KVI+VE QKKI IY+K++I   EEITY+YKFP+E+ KIPC CG
Sbjct: 1888 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEEITYDYKFPIEDVKIPCLCG 1947

Query: 928  SKKCHGSLN 936
            ++ C G+LN
Sbjct: 1948 AENCRGTLN 1956


>gi|189238620|ref|XP_969339.2| PREDICTED: similar to CG40351 CG40351-PC [Tribolium castaneum]
 gi|270009170|gb|EFA05618.1| hypothetical protein TcasGA2_TC015826 [Tribolium castaneum]
          Length = 1268

 Score =  226 bits (576), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 105/177 (59%), Positives = 136/177 (76%), Gaps = 4/177 (2%)

Query: 763  KGLSARTNRVKLRNLLAAAEG---AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
            +GLS R  R   R LLAA  G   ++LLK +Q+K RKK L+F +S IHDWGL A+EPI A
Sbjct: 1093 QGLS-REARSNQRRLLAAFGGDTDSDLLKFNQLKFRKKHLKFAKSAIHDWGLFAMEPIAA 1151

Query: 820  EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
            ++ VIEYVG+++R  ++D+RE +YE  GIGSSYLFR+D   ++DATK G +ARFINHSCN
Sbjct: 1152 DEMVIEYVGQMVRHSVADLRERKYEATGIGSSYLFRIDLENIIDATKCGNLARFINHSCN 1211

Query: 880  PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            PNCY KVI++E QKKI IY+K+ I   EEITY+YKFP+E++KIPC CG+  C G+LN
Sbjct: 1212 PNCYAKVITIESQKKIVIYSKQSIGVNEEITYDYKFPIEDEKIPCLCGAATCRGTLN 1268


>gi|391325531|ref|XP_003737286.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
           [Metaseiulus occidentalis]
          Length = 976

 Score =  226 bits (576), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 107/186 (57%), Positives = 138/186 (74%), Gaps = 5/186 (2%)

Query: 752 PEVNASQWANGKGLSARTNRVKLRNLLAAAEG-AELLKASQVKARKKRLRFQRSKIHDWG 810
           P V ASQ +      AR+N+ +L  +    E  ++LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 795 PVVAASQMSR----EARSNQRRLLTIFQETEVVSDLLKFNQLKFRKKQLKFAKSTIHDWG 850

Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
           L ALEPI A++ VIEYVG+ +R  ++D RE +Y K+GIGSSYLFR+D   ++DATK G +
Sbjct: 851 LFALEPIAADEMVIEYVGQSVRCSVADSREVQYTKVGIGSSYLFRVDHETIIDATKCGNL 910

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
           ARFINHSCNPNCY +VI+VEGQKKI IY+KR I+  EEITY+YKFP EE KI C CG+ +
Sbjct: 911 ARFINHSCNPNCYARVITVEGQKKIVIYSKRDISVNEEITYDYKFPREEVKITCLCGTPQ 970

Query: 931 CHGSLN 936
           C G LN
Sbjct: 971 CRGYLN 976


>gi|328711160|ref|XP_001945277.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
            [Acyrthosiphon pisum]
          Length = 1322

 Score =  226 bits (576), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 114/254 (44%), Positives = 159/254 (62%), Gaps = 16/254 (6%)

Query: 687  AASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVH 746
            A S Q+  KK K         C R++G  +        HK+    + A+  R+       
Sbjct: 1081 AVSNQIPRKKKKDEWKVHDTGCGRTEGYYKVDSKQKANHKYHFGHTIAQLNRIT------ 1134

Query: 747  TKYLGPEVNASQWANGKGLS----ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ 802
                  E+   + + GK L+    AR+N+ +L     A   ++LLK + +K RKK+L+F 
Sbjct: 1135 ------ELKRVKESTGKMLALSREARSNQRRLLTAFGATSDSDLLKFNVLKFRKKQLKFG 1188

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            +S IHDWGL A+E I A++ VIEYVG+++R  ++D+RE +YE  GIGSSYLFR+D   ++
Sbjct: 1189 KSAIHDWGLFAMESIAADEMVIEYVGQMVRPVVADLRERQYEATGIGSSYLFRIDLDTII 1248

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DATK G +ARFINHSCNPNCY K+I ++GQKKI IY+K+ I   EEITY+YKFPLE+ KI
Sbjct: 1249 DATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYKFPLEDNKI 1308

Query: 923  PCYCGSKKCHGSLN 936
            PC CG+  C G+LN
Sbjct: 1309 PCLCGTHCCRGTLN 1322


>gi|147899914|ref|NP_001087630.1| histone-lysine N-methyltransferase SETD1B [Xenopus laevis]
 gi|82234463|sp|Q66J90.1|SET1B_XENLA RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=SET domain-containing protein 1B
 gi|51703454|gb|AAH81016.1| MGC81602 protein [Xenopus laevis]
          Length = 1938

 Score =  225 bits (574), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 106/189 (56%), Positives = 140/189 (74%), Gaps = 5/189 (2%)

Query: 751  GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
            G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK++RF +S IH
Sbjct: 1752 GKSIPAQPQASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKIRFCKSHIH 1809

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            DWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK 
Sbjct: 1810 DWGLFAMEPIVADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 1869

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
            G  ARFINHSCNPNCY KV++VE QKKI IY+K++I   EEITY+YKFP+E+ KIPC CG
Sbjct: 1870 GNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEEITYDYKFPIEDVKIPCLCG 1929

Query: 928  SKKCHGSLN 936
            ++ C G+LN
Sbjct: 1930 AENCRGTLN 1938


>gi|430813239|emb|CCJ29409.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 375

 Score =  225 bits (573), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 103/180 (57%), Positives = 134/180 (74%), Gaps = 8/180 (4%)

Query: 765 LSARTNRVKLRNLLAAAE--------GAELLKASQVKARKKRLRFQRSKIHDWGLVALEP 816
           +S+R NR+  R L    E          ++L+ + +KARKK+L+F +S IH+WGL A+E 
Sbjct: 196 ISSRMNRINNRRLAVGIEMQKKISSSETDILRFNALKARKKQLKFSKSPIHNWGLYAMEH 255

Query: 817 IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
           I+  D VIEYVGE++R  ++DIRE +YE+ GIGSSYLFR+DD  VVDATK+G IARFINH
Sbjct: 256 IDMGDMVIEYVGEIVRQTVADIRERQYERQGIGSSYLFRIDDDTVVDATKKGNIARFINH 315

Query: 877 SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           SC+P+C  K+I VEG+KKI IYA R I  GEEITY+YKFP+E+ KIPC CG+K C G+LN
Sbjct: 316 SCDPSCTAKIIRVEGEKKIVIYAHRDIEKGEEITYDYKFPIEDVKIPCLCGAKACRGTLN 375


>gi|326670188|ref|XP_699940.5| PREDICTED: histone-lysine N-methyltransferase SETD1B [Danio rerio]
          Length = 753

 Score =  225 bits (573), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 115/229 (50%), Positives = 149/229 (65%), Gaps = 21/229 (9%)

Query: 713 GCARSS----ISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSAR 768
           GCARS     IS  E  ++  N           +Q  H+   G E+ A Q          
Sbjct: 541 GCARSEGYYFISKREKLQYLCNERTVSEEFTADSQVSHSSRSGSELRAEQ---------- 590

Query: 769 TNRVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
                 R LL++    ++LLK +Q+K RKKRLRF +S+IHDWGL A EPI A++ +IEYV
Sbjct: 591 ------RRLLSSFSCDSDLLKFNQLKFRKKRLRFGKSRIHDWGLFAEEPIAADEMIIEYV 644

Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
           G+ IR  I+D+RE RYE  GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY KVI
Sbjct: 645 GQSIRQVIADMRERRYETEGIGSSYLFRVDHDTIIDATKCGNLARFINHSCNPNCYAKVI 704

Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           +VE QKKI IY+++ I   EEITY+YKFP+E++KIPC C ++ C G+LN
Sbjct: 705 TVEAQKKIVIYSRQPITVNEEITYDYKFPIEDEKIPCLCAAENCRGTLN 753


>gi|123703948|ref|NP_001038599.2| histone-lysine N-methyltransferase SETD1B-A [Danio rerio]
 gi|166977691|sp|Q1LY77.2|SE1BA_DANRE RecName: Full=Histone-lysine N-methyltransferase SETD1B-A; AltName:
            Full=SET domain-containing protein 1B-A
 gi|123293815|emb|CAK10781.2| novel protein [Danio rerio]
          Length = 1844

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 101/167 (60%), Positives = 131/167 (78%), Gaps = 1/167 (0%)

Query: 771  RVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
            R + R LL++    ++LLK +Q+K RKK++RF RS IHDWGL A+EPI A++ VIEYVG+
Sbjct: 1678 RSEQRRLLSSFSCDSDLLKFNQLKFRKKKIRFCRSHIHDWGLFAMEPIAADEMVIEYVGQ 1737

Query: 830  LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
             IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI+V
Sbjct: 1738 NIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITV 1797

Query: 890  EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            E QKKI IY+++ I   EEITY+YKFP+E++KIPC CG++ C G+LN
Sbjct: 1798 ESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1844


>gi|348516272|ref|XP_003445663.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
           [Oreochromis niloticus]
          Length = 595

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 108/204 (52%), Positives = 146/204 (71%), Gaps = 11/204 (5%)

Query: 743 QYVHTKYLGPEVNASQWANGKGLSARTN---------RVKLRNLLAAAE-GAELLKASQV 792
           +Y+H   L  E+ ++  A G  + A+           R + R LL++    ++L+K +Q+
Sbjct: 393 RYLHNTRLAAELPSTS-AQGTCIPAQQPTSLRAGSDFRCEQRRLLSSFSCDSDLVKFNQL 451

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K R+KR+RF RS IH+WGL A+EPI A++ VIEYVG++IR  I+D+RE RYE+ GIGSSY
Sbjct: 452 KFRRKRIRFSRSYIHEWGLFAMEPIAADEMVIEYVGQIIRQVIADMREQRYEEEGIGSSY 511

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           LFR+D   ++DATK G +ARFINHSCNPNCY K+I+VE QKKI IY+++ I   EEITY+
Sbjct: 512 LFRVDQDTIIDATKCGNLARFINHSCNPNCYAKIITVESQKKIVIYSRQPININEEITYD 571

Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
           YKFP+EE KIPC CG+  C GSLN
Sbjct: 572 YKFPIEETKIPCLCGADGCRGSLN 595


>gi|47217812|emb|CAG07226.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1884

 Score =  223 bits (569), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 102/171 (59%), Positives = 129/171 (75%), Gaps = 5/171 (2%)

Query: 771  RVKLRNLLA-----AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
            R + R LL      A   ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A++ VIE
Sbjct: 1714 RSEQRRLLTVIGTTAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIE 1773

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            YVG+ IR  ++D RE RY + GIGSSYLFR+D   ++DATK G +ARFINH C PNCY K
Sbjct: 1774 YVGQNIRQMVADNREKRYAQQGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAK 1833

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            VI++E QKKI IY+K+ IA  EEITY+YKFPLEE KIPC CG++ C G+LN
Sbjct: 1834 VITIESQKKIVIYSKQAIAVNEEITYDYKFPLEENKIPCLCGTENCRGTLN 1884


>gi|432887915|ref|XP_004074975.1| PREDICTED: uncharacterized protein LOC101162384 [Oryzias latipes]
          Length = 1787

 Score =  223 bits (569), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 101/167 (60%), Positives = 133/167 (79%), Gaps = 1/167 (0%)

Query: 771  RVKLRNLLAA-AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
            R + R LL++ A  ++LLK +Q+K RKK++RF +S IHDWGL A+EPI A++ VIEYVG+
Sbjct: 1621 RSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSNIHDWGLFAMEPIAADEMVIEYVGQ 1680

Query: 830  LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
             IR  I+D+RE RYE+ GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI+V
Sbjct: 1681 NIRQVIADMREKRYEEEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITV 1740

Query: 890  EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            E QKKI IY+++ I   EEITY+YKFP+E++KIPC CG++ C G+LN
Sbjct: 1741 ESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1787


>gi|348510108|ref|XP_003442588.1| PREDICTED: hypothetical protein LOC100694318 [Oreochromis niloticus]
          Length = 2205

 Score =  223 bits (569), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 105/177 (59%), Positives = 132/177 (74%), Gaps = 7/177 (3%)

Query: 765  LSARTNRVKLRNLLA-----AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
            LS R  R + R LL      A   ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A
Sbjct: 2031 LSER--RSEQRRLLTVIGTTAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAA 2088

Query: 820  EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
            ++ VIEYVG+ IR  ++D RE RY + GIGSSYLFR+D   ++DATK G +ARFINH C 
Sbjct: 2089 DEMVIEYVGQNIRQMVADNREKRYAQQGIGSSYLFRVDHDTIIDATKCGNLARFINHCCT 2148

Query: 880  PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            PNCY KVI++E QKKI IY+K+ IA  EEITY+YKFPLEE KIPC CG++ C G+LN
Sbjct: 2149 PNCYAKVITIESQKKIVIYSKQPIAVNEEITYDYKFPLEENKIPCLCGTENCRGTLN 2205


>gi|328701191|ref|XP_003241521.1| PREDICTED: hypothetical protein LOC100573227 [Acyrthosiphon pisum]
          Length = 1315

 Score =  223 bits (567), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 113/254 (44%), Positives = 158/254 (62%), Gaps = 16/254 (6%)

Query: 687  AASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVH 746
            A   Q+  KK K         C R++G  +        HK+    + A+  R+       
Sbjct: 1074 ALPNQLPRKKKKDEWKVHDTGCGRTEGYYKVDSKQKANHKYHFGHTIAQLNRIT------ 1127

Query: 747  TKYLGPEVNASQWANGKGLS----ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ 802
                  E+   + + GK L+    AR+N+ +L     A   ++LLK + +K RKK+L+F 
Sbjct: 1128 ------ELKRVKESTGKMLALSREARSNQRRLLTAYGATSDSDLLKFNVLKFRKKQLKFG 1181

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            +S IHDWGL A+E I A++ VIEYVG+++R  ++D+RE +YE  GIGSSYLFR+D   ++
Sbjct: 1182 KSAIHDWGLFAMESIAADEMVIEYVGQMVRPVVADLRERQYEATGIGSSYLFRIDLDTII 1241

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DATK G +ARFINHSCNPNCY K+I ++GQKKI IY+K+ I   EEITY+YKFPLE+ KI
Sbjct: 1242 DATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYKFPLEDNKI 1301

Query: 923  PCYCGSKKCHGSLN 936
            PC CG+  C G+LN
Sbjct: 1302 PCLCGTHCCRGTLN 1315


>gi|47223666|emb|CAF99275.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1830

 Score =  223 bits (567), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 102/167 (61%), Positives = 132/167 (79%), Gaps = 1/167 (0%)

Query: 771  RVKLRNLLAA-AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
            R + R LL++ A  ++LLK +Q+K RKK++RF +S IHDWGL ALEPI A++ VIEYVG+
Sbjct: 1664 RSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSHIHDWGLFALEPIAADEMVIEYVGQ 1723

Query: 830  LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
             IR  I+D+RE RYE+ GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI+V
Sbjct: 1724 NIRQVIADMREKRYEEEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITV 1783

Query: 890  EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            E QKKI IY+++ I   EEITY+YKFP+E+ KIPC CG++ C G+LN
Sbjct: 1784 ESQKKIVIYSRQPINVNEEITYDYKFPIEDVKIPCLCGAENCRGTLN 1830


>gi|348532887|ref|XP_003453937.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
            [Oreochromis niloticus]
          Length = 1846

 Score =  223 bits (567), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 101/167 (60%), Positives = 133/167 (79%), Gaps = 1/167 (0%)

Query: 771  RVKLRNLLAA-AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
            R + R LL++ A  ++LLK +Q+K RKK++RF +S IHDWGL A+EPI A++ VIEYVG+
Sbjct: 1680 RSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSHIHDWGLFAMEPIAADEMVIEYVGQ 1739

Query: 830  LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
             IR  I+D+RE RYE+ GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI+V
Sbjct: 1740 NIRQVIADMREKRYEEEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITV 1799

Query: 890  EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            E QKKI IY+++ I   EEITY+YKFP+E++KIPC CG++ C G+LN
Sbjct: 1800 ESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1846


>gi|410903041|ref|XP_003965002.1| PREDICTED: uncharacterized protein LOC101071878 [Takifugu rubripes]
          Length = 1776

 Score =  223 bits (567), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 102/171 (59%), Positives = 129/171 (75%), Gaps = 5/171 (2%)

Query: 771  RVKLRNLLA-----AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
            R + R LL      A   ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A++ VIE
Sbjct: 1606 RSEQRRLLTVIGTTAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIE 1665

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            YVG+ IR  ++D RE RY + GIGSSYLFR+D   ++DATK G +ARFINH C PNCY K
Sbjct: 1666 YVGQNIRQMVADNREKRYAQQGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAK 1725

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            VI++E QKKI IY+K+ IA  EEITY+YKFPLEE KIPC CG++ C G+LN
Sbjct: 1726 VITIESQKKIVIYSKQAIAVNEEITYDYKFPLEENKIPCLCGTENCRGTLN 1776


>gi|194764460|ref|XP_001964347.1| GF23123 [Drosophila ananassae]
 gi|190614619|gb|EDV30143.1| GF23123 [Drosophila ananassae]
          Length = 236

 Score =  223 bits (567), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 100/175 (57%), Positives = 135/175 (77%)

Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
           G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 62  GISREARSNQRRLLTAFGSIGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 121

Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
            VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 122 MVIEYVGQMIRPVVADLRESKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPN 181

Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           CY KVI++E +KKI IY+K+ I   EEITY+YKFPLE++KIPC C ++ C G+LN
Sbjct: 182 CYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCAAQGCRGTLN 236


>gi|302764394|ref|XP_002965618.1| hypothetical protein SELMODRAFT_84518 [Selaginella moellendorffii]
 gi|300166432|gb|EFJ33038.1| hypothetical protein SELMODRAFT_84518 [Selaginella moellendorffii]
          Length = 139

 Score =  222 bits (566), Expect = 6e-55,   Method: Composition-based stats.
 Identities = 90/139 (64%), Positives = 121/139 (87%)

Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
           ++ARKK L+FQRSKIH WG+VA+E IE EDF++EYVGE++R K++D+RE RY + G+GSS
Sbjct: 1   MRARKKLLKFQRSKIHAWGVVAMEFIEPEDFIVEYVGEVLRPKVADVREVRYLRQGLGSS 60

Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
           Y FR+ DG+V+DAT+RGG+ RFINHSC PNCY K+I+VEGQK++FIYA+ HIA G E+TY
Sbjct: 61  YFFRVGDGFVIDATQRGGLGRFINHSCEPNCYAKIITVEGQKRVFIYARTHIAPGTELTY 120

Query: 912 NYKFPLEEKKIPCYCGSKK 930
           +YKFP E++KIPC CG+++
Sbjct: 121 DYKFPHEDQKIPCLCGAER 139


>gi|384499027|gb|EIE89518.1| hypothetical protein RO3G_14229 [Rhizopus delemar RA 99-880]
          Length = 1674

 Score =  222 bits (566), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 104/178 (58%), Positives = 133/178 (74%), Gaps = 6/178 (3%)

Query: 765  LSARTNRVKLRNLLAAAE------GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIE 818
            +S+R  RV  R L    E       +++LK +Q+K+RKK+L+F +S IHDWGL A E I+
Sbjct: 1497 ISSRATRVNNRRLAIGMEMQKKTIDSDILKFNQLKSRKKQLKFAKSPIHDWGLFAEEHID 1556

Query: 819  AEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
              D VIEYVGE+IR ++++ RE +YE+ GIGSSYLFR+DD  V+DATKRG IARFINH C
Sbjct: 1557 VNDMVIEYVGEMIRQQVAEEREKQYERCGIGSSYLFRVDDDTVIDATKRGSIARFINHCC 1616

Query: 879  NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +PNC  K+I+V+ QKKI IYA R I  GEEITY+YKFP+E +KIPC CGSK C G+LN
Sbjct: 1617 SPNCSAKIITVDKQKKIVIYANRDIEPGEEITYDYKFPIEAEKIPCLCGSKFCKGTLN 1674


>gi|427794953|gb|JAA62928.1| hypothetical protein, partial [Rhipicephalus pulchellus]
          Length = 1557

 Score =  221 bits (564), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 102/171 (59%), Positives = 134/171 (78%), Gaps = 1/171 (0%)

Query: 767  ARTNRVKLRNLLAAAEG-AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
            AR+N+ +L      AE  ++LLK +Q+K RKK+L+F +S+IHDWGL ALEPI A++ VIE
Sbjct: 1387 ARSNQRRLLTSFGEAEVISDLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIE 1446

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            YVG+++R  ++D RE  Y ++GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY K
Sbjct: 1447 YVGQMVRPIMADRREQFYTQIGIGSSYLFRVDVETIIDATKCGNLARFINHSCNPNCYAK 1506

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            VI+VEGQKKI IY+K+ I   EEITY+YKFPLE++KI C CG+ +C G LN
Sbjct: 1507 VITVEGQKKIVIYSKQPINVNEEITYDYKFPLEDEKIVCLCGAPQCRGFLN 1557


>gi|330797279|ref|XP_003286689.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
 gi|325083363|gb|EGC36818.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
          Length = 1340

 Score =  221 bits (564), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 98/171 (57%), Positives = 131/171 (76%), Gaps = 5/171 (2%)

Query: 766  SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
            SAR++R   R      +G++ +  + +K+R+K+++FQRS IHDWGL A+E I A+D VIE
Sbjct: 1175 SARSSRFDSR-----FQGSDSVTLASLKSRRKKIKFQRSDIHDWGLFAMETINAKDMVIE 1229

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            Y+GE+IR K++D RE RY K GIGSSYLFR+DD  ++DAT +G +ARFINH C+PNC  K
Sbjct: 1230 YIGEVIRQKVADEREKRYIKKGIGSSYLFRVDDDTIIDATLKGNLARFINHCCDPNCIAK 1289

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            V+++  QKKI IYAKR I  GEEITY+YKFP+E++KIPC C S KC G+LN
Sbjct: 1290 VLTINNQKKIIIYAKRDINIGEEITYDYKFPIEDEKIPCLCKSPKCRGTLN 1340


>gi|301622880|ref|XP_002940755.1| PREDICTED: hypothetical protein LOC100487270 [Xenopus (Silurana)
            tropicalis]
          Length = 1817

 Score =  221 bits (563), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 105/189 (55%), Positives = 136/189 (71%), Gaps = 7/189 (3%)

Query: 753  EVNASQWANGKGLSARTNRVKLRNLLAAAEG-----AELLKASQVKARKKRLRFQRSKIH 807
            E  A      + LS R  R + R LL+A        ++LLK +Q+K RKK+LRF RS IH
Sbjct: 1631 EQEADTQGTNRVLSER--RSEQRRLLSAMGSTALLDSDLLKLNQLKFRKKKLRFGRSHIH 1688

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            +WGL A+EPI A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK 
Sbjct: 1689 EWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYSQQGIGSSYLFRMDQDTIIDATKC 1748

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
            G +ARFINH C+PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG
Sbjct: 1749 GNLARFINHCCSPNCYAKVITIESQKKIVIYSKQPIGINEEITYDYKFPLEDNKIPCLCG 1808

Query: 928  SKKCHGSLN 936
            ++ C G+LN
Sbjct: 1809 TENCRGTLN 1817


>gi|326666050|ref|XP_001920852.3| PREDICTED: hypothetical protein LOC556535 [Danio rerio]
          Length = 2253

 Score =  221 bits (563), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 100/171 (58%), Positives = 129/171 (75%), Gaps = 5/171 (2%)

Query: 771  RVKLRNLLAA-----AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
            R + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A++ VIE
Sbjct: 2083 RSEQRRLLSAIGTPAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIE 2142

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            YVG+ IR  ++D RE RY + GIGSSYLFR+D   ++DATK G +ARFINH C PNCY K
Sbjct: 2143 YVGQSIRQMVADNREKRYAQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAK 2202

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            VI++E QKKI IY+K+ I   EEITY+YKFP+EE KIPC CG++ C G+LN
Sbjct: 2203 VITIESQKKIVIYSKQPIGVNEEITYDYKFPIEENKIPCLCGTESCRGTLN 2253


>gi|344294411|ref|XP_003418911.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1A-like [Loxodonta africana]
          Length = 1703

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAA-----AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1525 TNRVLSERRSEQRRLLSAIGTSAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1584

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1585 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1644

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1645 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1703


>gi|29437232|gb|AAH49883.1| Setd1a protein [Mus musculus]
          Length = 849

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)

Query: 754 VNASQWANGKGLSARTNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQ 802
           V+A Q   G   +  TNRV      + R LL+A        ++LLK +Q+K RKK+LRF 
Sbjct: 658 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 715

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS+IH+WGL A+EPI A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++
Sbjct: 716 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 775

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KI
Sbjct: 776 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 835

Query: 923 PCYCGSKKCHGSLN 936
           PC CG++ C GSLN
Sbjct: 836 PCLCGTESCRGSLN 849


>gi|357631650|gb|EHJ79119.1| hypothetical protein KGM_15585 [Danaus plexippus]
          Length = 1491

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 97/170 (57%), Positives = 129/170 (75%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            AR+N+ +L         ++LLK +Q+K RKK+L+F +S IHDWGL A E I A++ VIEY
Sbjct: 1322 ARSNQRRLLTAFGTDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAQEAIAADEMVIEY 1381

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            VG+++R  ++D+RE  YE  GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY K+
Sbjct: 1382 VGQMVRPIVADVREAHYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKI 1441

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I++E QKKI IY+K+ I   EEITY+YKFPLE++KIPC CG+ +C G LN
Sbjct: 1442 ITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDEKIPCLCGAPQCRGYLN 1491


>gi|119572565|gb|EAW52180.1| hCG1998636, isoform CRA_b [Homo sapiens]
          Length = 1189

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1011 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1070

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1071 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1130

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1131 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1189


>gi|329664196|ref|NP_001192362.1| histone-lysine N-methyltransferase SETD1A [Bos taurus]
          Length = 1708

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1530 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1589

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1590 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1649

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1650 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1708


>gi|296473196|tpg|DAA15311.1| TPA: SET domain containing 1A [Bos taurus]
          Length = 1708

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1530 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1589

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1590 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1649

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1650 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1708


>gi|6683126|dbj|BAA20797.2| KIAA0339 protein [Homo sapiens]
          Length = 1709

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1531 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1590

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1591 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1650

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1651 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1709


>gi|55741677|ref|NP_055527.1| histone-lysine N-methyltransferase SETD1A [Homo sapiens]
 gi|68052990|sp|O15047.3|SET1A_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD1A; AltName:
            Full=Lysine N-methyltransferase 2F; AltName: Full=SET
            domain-containing protein 1A; Short=hSET1A; AltName:
            Full=Set1/Ash2 histone methyltransferase complex subunit
            SET1
 gi|168272962|dbj|BAG10320.1| SET domain-containing protein 1A [synthetic construct]
          Length = 1707

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707


>gi|296220008|ref|XP_002807469.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1A [Callithrix jacchus]
          Length = 1708

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1530 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1589

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1590 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1649

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1650 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1708


>gi|348584344|ref|XP_003477932.1| PREDICTED: histone-lysine N-methyltransferase SETD1A-like [Cavia
            porcellus]
          Length = 1702

 Score =  220 bits (561), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1524 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1583

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1584 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1643

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1644 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1702


>gi|119508422|ref|NP_821172.2| SET domain containing 1A [Mus musculus]
          Length = 1716

 Score =  220 bits (561), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)

Query: 754  VNASQWANGKGLSARTNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQ 802
            V+A Q   G   +  TNRV      + R LL+A        ++LLK +Q+K RKK+LRF 
Sbjct: 1525 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 1582

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS+IH+WGL A+EPI A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++
Sbjct: 1583 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 1642

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KI
Sbjct: 1643 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 1702

Query: 923  PCYCGSKKCHGSLN 936
            PC CG++ C GSLN
Sbjct: 1703 PCLCGTESCRGSLN 1716


>gi|431906860|gb|ELK10981.1| Histone-lysine N-methyltransferase SETD1A [Pteropus alecto]
          Length = 1665

 Score =  220 bits (561), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1487 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1546

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1547 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1606

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1607 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1665


>gi|114662089|ref|XP_001155055.1| PREDICTED: histone-lysine N-methyltransferase SETD1A isoform 2 [Pan
            troglodytes]
 gi|410289610|gb|JAA23405.1| SET domain containing 1A [Pan troglodytes]
          Length = 1707

 Score =  220 bits (561), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707


>gi|410221312|gb|JAA07875.1| SET domain containing 1A [Pan troglodytes]
          Length = 1707

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707


>gi|403276868|ref|XP_003930105.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Saimiri
            boliviensis boliviensis]
          Length = 1713

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1535 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1594

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1595 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1654

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1655 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1713


>gi|354497843|ref|XP_003511027.1| PREDICTED: histone-lysine N-methyltransferase SETD1A-like [Cricetulus
            griseus]
 gi|344247133|gb|EGW03237.1| Histone-lysine N-methyltransferase SETD1A [Cricetulus griseus]
          Length = 1723

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1545 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1604

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1605 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1664

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1665 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1723


>gi|358333784|dbj|GAA31138.2| histone-lysine N-methyltransferase SETD1B [Clonorchis sinensis]
          Length = 1685

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 101/172 (58%), Positives = 127/172 (73%), Gaps = 3/172 (1%)

Query: 768  RTNRVKLRNLLAA---AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
            R  R   R LLA     E  +LLK +Q+K RKK+L F +S IH WGL+ALEPI AE+ VI
Sbjct: 1514 REARSAQRRLLAQFQDIETGDLLKFNQLKFRKKQLIFAKSPIHAWGLIALEPIAAEEMVI 1573

Query: 825  EYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
            EYVG+++R  ++++RE +YE  GIG SYLFR+DD +V+DAT  G   RFINHSC PNCY 
Sbjct: 1574 EYVGQVVRKSVAELRERQYEAKGIGGSYLFRIDDDFVIDATMCGNNGRFINHSCQPNCYA 1633

Query: 885  KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            K+I+VEG+KKI IY+KR I   EEITY+YKFP EE+KIPC CG+  C G+LN
Sbjct: 1634 KIITVEGKKKIVIYSKRDINVMEEITYDYKFPYEEEKIPCQCGASTCRGTLN 1685


>gi|194219010|ref|XP_001915080.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1A [Equus caballus]
          Length = 1707

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707


>gi|410984758|ref|XP_003998693.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Felis catus]
          Length = 1708

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1530 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1589

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1590 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1649

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1650 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1708


>gi|402908170|ref|XP_003916826.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Papio anubis]
          Length = 1707

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1529 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1588

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1589 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1648

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1649 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707


>gi|148685638|gb|EDL17585.1| mCG141846, isoform CRA_b [Mus musculus]
          Length = 1006

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)

Query: 754  VNASQWANGKGLSARTNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQ 802
            V+A Q   G   +  TNRV      + R LL+A        ++LLK +Q+K RKK+LRF 
Sbjct: 815  VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 872

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS+IH+WGL A+EPI A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++
Sbjct: 873  RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 932

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KI
Sbjct: 933  DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 992

Query: 923  PCYCGSKKCHGSLN 936
            PC CG++ C GSLN
Sbjct: 993  PCLCGTESCRGSLN 1006


>gi|332262972|ref|XP_003280531.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Nomascus
            leucogenys]
          Length = 1675

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1497 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1556

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1557 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1616

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1617 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1675


>gi|297283866|ref|XP_002808342.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1A-like [Macaca mulatta]
          Length = 1704

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1526 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1585

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1586 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1645

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1646 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1704


>gi|351711375|gb|EHB14294.1| Histone-lysine N-methyltransferase SETD1A [Heterocephalus glaber]
          Length = 1502

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1324 TNRVLSERRSEQRRLLSAIGTSAIVDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1383

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1384 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1443

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1444 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDSKIPCLCGAESCRGSLN 1502


>gi|345801544|ref|XP_848999.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1A [Canis lupus familiaris]
          Length = 1713

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1535 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1594

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1595 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1654

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1655 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1713


>gi|350581585|ref|XP_003481071.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Sus scrofa]
          Length = 1550

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1372 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1431

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1432 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1491

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1492 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1550


>gi|395846353|ref|XP_003795872.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Otolemur
            garnettii]
          Length = 1706

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1528 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1587

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1588 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1647

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1648 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1706


>gi|126334524|ref|XP_001364653.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Monodelphis
            domestica]
          Length = 1706

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1528 TNRVLSERRSEQRRLLSAIGTSAIVDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1587

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1588 AADEMVIEYVGQNIRQVVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1647

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1648 CTPNCYAKVITIEAQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1706


>gi|241612901|ref|XP_002407306.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
 gi|215502770|gb|EEC12264.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
          Length = 208

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 103/171 (60%), Positives = 134/171 (78%), Gaps = 1/171 (0%)

Query: 767 ARTNRVKLRNLLAAAEG-AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
           AR+N+ +L      AE  ++LLK +Q+K RKK+L+F +S+IHDWGL ALEPI A++ VIE
Sbjct: 38  ARSNQRRLLTSFGEAEVVSDLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIE 97

Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
           YVG+++R  ++D RE  Y ++GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY K
Sbjct: 98  YVGQMVRPVMADRREQHYTQIGIGSSYLFRVDVETIIDATKCGNLARFINHSCNPNCYAK 157

Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           VI+VEGQKKI IY+K+ I   EEITY+YKFPLEE+KI C CG+ +C G LN
Sbjct: 158 VITVEGQKKIVIYSKQPINVNEEITYDYKFPLEEEKISCLCGAPQCRGFLN 208


>gi|328768995|gb|EGF79040.1| hypothetical protein BATDEDRAFT_35515 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1367

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 95/152 (62%), Positives = 119/152 (78%)

Query: 785  ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
            + L  +Q+KARKKR++F +S IHDWGL A+EPI+A D VIEY+GE+IR K++D RE  YE
Sbjct: 1216 DALSFNQLKARKKRIKFDKSIIHDWGLFAMEPIDANDMVIEYIGEIIRQKVADHREKLYE 1275

Query: 845  KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
              GIGSSYLFR+D+  ++DATK G +ARFINH C PNC  KVISV+G K+I IYA R I 
Sbjct: 1276 ASGIGSSYLFRVDEDTIIDATKTGNLARFINHCCEPNCNAKVISVDGTKRIVIYANRDIK 1335

Query: 905  AGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             GEE+TY+YKFP+EE KIPC CG+  C G+LN
Sbjct: 1336 EGEELTYDYKFPIEEDKIPCLCGAVNCRGTLN 1367


>gi|397472033|ref|XP_003807565.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1A [Pan paniscus]
          Length = 1479

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1301 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1360

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1361 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1420

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1421 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1479


>gi|444725782|gb|ELW66336.1| Histone-lysine N-methyltransferase SETD1A [Tupaia chinensis]
          Length = 1344

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1166 TNRVLSERRSEQRRLLSAIGTSAILDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1225

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1226 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1285

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1286 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1344


>gi|119572564|gb|EAW52179.1| hCG1998636, isoform CRA_a [Homo sapiens]
 gi|119572567|gb|EAW52182.1| hCG1998636, isoform CRA_a [Homo sapiens]
          Length = 1010

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 832  TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 891

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 892  AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 951

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 952  CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1010


>gi|426255271|ref|XP_004021280.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1A [Ovis aries]
          Length = 1394

 Score =  219 bits (559), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1216 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1275

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1276 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1335

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1336 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1394


>gi|395514884|ref|XP_003761641.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Sarcophilus
            harrisii]
          Length = 1691

 Score =  219 bits (559), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769  TNRV------KLRNLLAAAE-----GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1513 TNRVLSERRSEQRRLLSAIGTSAIVDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1572

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1573 AADEMVIEYVGQNIRQVVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1632

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1633 CTPNCYAKVITIEAQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1691


>gi|321472797|gb|EFX83766.1| hypothetical protein DAPPUDRAFT_301653 [Daphnia pulex]
          Length = 303

 Score =  219 bits (558), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 100/187 (53%), Positives = 134/187 (71%)

Query: 750 LGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDW 809
           LGP              AR+N+ +L         ++LLK +Q+K R  +L+F +S IHDW
Sbjct: 117 LGPTDTGINKGQALSREARSNQRRLLTAFGLENDSDLLKFNQLKFRGAKLKFGKSSIHDW 176

Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
           GL A+E I A++ VIEYVG+++R  ++D+RE +YE +GIGSSYLFR+D  Y++DATK G 
Sbjct: 177 GLFAVESIAADEMVIEYVGQVVRPVLADLRETQYEAVGIGSSYLFRVDLEYIIDATKCGN 236

Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSK 929
           +ARFINHSCNPNCY +VI++E QKKI IY+K+ I  GEEITY+YKFP+EE KI C CGS 
Sbjct: 237 LARFINHSCNPNCYARVITIESQKKIVIYSKQPIGVGEEITYDYKFPIEEDKIICLCGSS 296

Query: 930 KCHGSLN 936
           +C G+LN
Sbjct: 297 QCRGTLN 303


>gi|27552784|gb|AAH42890.1| Setd1a protein, partial [Mus musculus]
          Length = 458

 Score =  218 bits (556), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)

Query: 754 VNASQWANGKGLSARTNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQ 802
           V+A Q   G   +  TNRV      + R LL+A        ++LLK +Q+K RKK+LRF 
Sbjct: 267 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 324

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS+IH+WGL A+EPI A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++
Sbjct: 325 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 384

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KI
Sbjct: 385 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 444

Query: 923 PCYCGSKKCHGSLN 936
           PC CG++ C GSLN
Sbjct: 445 PCLCGTESCRGSLN 458


>gi|302760998|ref|XP_002963921.1| hypothetical protein SELMODRAFT_81880 [Selaginella moellendorffii]
 gi|302769161|ref|XP_002968000.1| hypothetical protein SELMODRAFT_88044 [Selaginella moellendorffii]
 gi|300164738|gb|EFJ31347.1| hypothetical protein SELMODRAFT_88044 [Selaginella moellendorffii]
 gi|300167650|gb|EFJ34254.1| hypothetical protein SELMODRAFT_81880 [Selaginella moellendorffii]
          Length = 143

 Score =  218 bits (555), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 97/143 (67%), Positives = 123/143 (86%), Gaps = 4/143 (2%)

Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
           ++AR K L+FQRSKIH WG++AL+ IE EDFVIEY+GEL+R K++D+RE RYEKMGI SS
Sbjct: 1   MQARTKSLKFQRSKIHSWGVIALQSIEPEDFVIEYIGELVRSKVADLRERRYEKMGIDSS 60

Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCN----PNCYTKVISVEGQKKIFIYAKRHIAAGE 907
           YLFR+D   VVDATK GG+ARFINHSC+    PNCYTK+++VEGQK+IFIY+K+HI  GE
Sbjct: 61  YLFRVDAENVVDATKHGGLARFINHSCDARILPNCYTKILTVEGQKRIFIYSKKHIKVGE 120

Query: 908 EITYNYKFPLEEKKIPCYCGSKK 930
           E+TY+YKFP EE+KIPC CG+++
Sbjct: 121 ELTYDYKFPFEEQKIPCLCGAER 143


>gi|113206662|gb|ABI34482.1| SET domain containing 1Bb [Danio rerio]
          Length = 175

 Score =  218 bits (555), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 101/167 (60%), Positives = 131/167 (78%), Gaps = 1/167 (0%)

Query: 771 RVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
           R + R LL++    ++LLK +Q+K RKKRLRF +S+IHDWGL A EPI A++ +IEYVG+
Sbjct: 9   RAEQRRLLSSFSCDSDLLKFNQLKFRKKRLRFGKSRIHDWGLFAEEPIAADEMIIEYVGQ 68

Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
            IR  I+D+RE RYE  GIGSSYLFR+D   ++DATK G +ARFINHSCNPNCY KVI+V
Sbjct: 69  SIRQVIADMRERRYETEGIGSSYLFRVDHDTIIDATKCGNLARFINHSCNPNCYAKVITV 128

Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           E QKKI IY+++ I   EEITY+YKFP+E++KIPC C ++ C G+LN
Sbjct: 129 EAQKKIVIYSRQPITVNEEITYDYKFPIEDEKIPCLCAAENCRGTLN 175


>gi|340373417|ref|XP_003385238.1| PREDICTED: hypothetical protein LOC100636150 [Amphimedon
            queenslandica]
          Length = 1053

 Score =  218 bits (555), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 110/231 (47%), Positives = 154/231 (66%), Gaps = 17/231 (7%)

Query: 711  SDGCARSSISGWEWHKWSLNASPAERAR----VRGAQYVHTKYLGPEVNASQWANGKGLS 766
            S GCARS   G+          P++++R    +R ++    K    + N++      G  
Sbjct: 835  SSGCARSE--GY------YKIDPSDKSRYLHHLRRSKASEDK----DKNSAPPPTSSGRG 882

Query: 767  ARTNRVKLRNLLAAAE-GAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
             R N  ++ +L  + +   EL + +Q+KARKK+L F +S IH+WGL ALE I A++ V+E
Sbjct: 883  NRLNHRRVASLFTSGDVSRELQQYNQLKARKKQLTFAKSTIHNWGLFALETIPADEMVVE 942

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            Y+G+++R  I+D RE RYE  GIGSSYLFR+D  +V+DATK G  ARFINH C+PNCY K
Sbjct: 943  YIGQVVRHGIADERERRYEAQGIGSSYLFRVDYDHVIDATKSGNFARFINHCCDPNCYAK 1002

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +I+V  QKKI IY+KR I AGEEITY+YKFP+E++KIPC CG+ +C G+LN
Sbjct: 1003 IITVGNQKKIVIYSKRDIRAGEEITYDYKFPIEDEKIPCLCGAPQCRGTLN 1053


>gi|20071601|gb|AAH27450.1| SETD1A protein [Homo sapiens]
          Length = 469

 Score =  218 bits (555), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769 TNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 291 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 350

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 351 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 410

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 411 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 469


>gi|16307411|gb|AAH10250.1| Setd1a protein, partial [Mus musculus]
          Length = 316

 Score =  216 bits (551), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 108/194 (55%), Positives = 139/194 (71%), Gaps = 13/194 (6%)

Query: 754 VNASQWANGKGLSARTNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQ 802
           V+A Q   G   +  TNRV      + R LL+A        ++LLK +Q+K RKK+LRF 
Sbjct: 125 VSARQLEGGD--TQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFG 182

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS+IH+WGL A+EPI A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++
Sbjct: 183 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTII 242

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KI
Sbjct: 243 DATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKI 302

Query: 923 PCYCGSKKCHGSLN 936
           PC CG++ C GSLN
Sbjct: 303 PCLCGTESCRGSLN 316


>gi|301778787|ref|XP_002924825.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1A-like [Ailuropoda melanoleuca]
          Length = 1594

 Score =  216 bits (550), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 104/180 (57%), Positives = 133/180 (73%), Gaps = 12/180 (6%)

Query: 769  TNRV------KLRNLLAA-----AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 1415 TNRVLSERRSEQRRLLSAIGTSAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 1474

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 1475 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 1534

Query: 878  C-NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            C  PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1535 CMQPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1594


>gi|66805051|ref|XP_636258.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74852293|sp|Q54HS3.1|SET1_DICDI RecName: Full=Histone-lysine N-methyltransferase set1; AltName:
            Full=Histone H3 lysine 4 methyltransferase; AltName:
            Full=SET domain-containing protein 1
 gi|60464689|gb|EAL62816.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1486

 Score =  216 bits (550), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 97/171 (56%), Positives = 128/171 (74%), Gaps = 5/171 (2%)

Query: 766  SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
            SAR++R   R       G++ +  + +K+R+KR++F+RS IHDWGL A+E I A+D VIE
Sbjct: 1321 SARSSRFDNRGF-----GSDPITLASLKSRRKRIKFERSDIHDWGLFAMETISAKDMVIE 1375

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            Y+GE+IR K++D RE RY K GIGSSYLFR+DD  ++DAT +G +ARFINH C+PNC  K
Sbjct: 1376 YIGEVIRQKVADEREKRYVKKGIGSSYLFRVDDDTIIDATFKGNLARFINHCCDPNCIAK 1435

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            V+++  QKKI IYAKR I  GEEITY+YKFP+E+ KIPC C S KC  +LN
Sbjct: 1436 VLTIGNQKKIIIYAKRDINIGEEITYDYKFPIEDVKIPCLCKSPKCRQTLN 1486


>gi|71051743|gb|AAH98812.1| RGD1311624 protein, partial [Rattus norvegicus]
          Length = 216

 Score =  216 bits (550), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 108/190 (56%), Positives = 139/190 (73%), Gaps = 8/190 (4%)

Query: 752 PEVNASQWANGKGLSARTNRVKLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKI 806
           PE   +Q  N + LS R  R + R LL+A        ++LLK +Q+K RKK+LRF RS+I
Sbjct: 30  PEGVDTQGTN-RVLSER--RSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRI 86

Query: 807 HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
           H+WGL A+EPI A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK
Sbjct: 87  HEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATK 146

Query: 867 RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
            G +ARFINH C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC C
Sbjct: 147 CGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLC 206

Query: 927 GSKKCHGSLN 936
           G++ C GSLN
Sbjct: 207 GTESCRGSLN 216


>gi|195553639|ref|XP_002076709.1| GD11928 [Drosophila simulans]
 gi|194202088|gb|EDX15664.1| GD11928 [Drosophila simulans]
          Length = 374

 Score =  216 bits (549), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 102/194 (52%), Positives = 136/194 (70%), Gaps = 19/194 (9%)

Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
           G    AR+N+ +L     +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 181 GISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 240

Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN-- 879
            VIEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCN  
Sbjct: 241 MVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNVS 300

Query: 880 -----------------PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
                            PNCY KVI++E +KKI IY+K+ I   EEITY+YKFPLEE+KI
Sbjct: 301 ITGYFDIMLPTYNFVFQPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKI 360

Query: 923 PCYCGSKKCHGSLN 936
           PC CG++ C G+LN
Sbjct: 361 PCLCGAQGCRGTLN 374


>gi|380801021|gb|AFE72386.1| histone-lysine N-methyltransferase SETD1A, partial [Macaca mulatta]
          Length = 310

 Score =  216 bits (549), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769 TNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 132 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 191

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 192 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 251

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 252 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 310


>gi|355756723|gb|EHH60331.1| hypothetical protein EGM_11666, partial [Macaca fascicularis]
          Length = 256

 Score =  216 bits (549), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 104/179 (58%), Positives = 133/179 (74%), Gaps = 11/179 (6%)

Query: 769 TNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 78  TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 137

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 138 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 197

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 198 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 256


>gi|409047697|gb|EKM57176.1| hypothetical protein PHACADRAFT_142398 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1389

 Score =  215 bits (548), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 98/169 (57%), Positives = 125/169 (73%), Gaps = 2/169 (1%)

Query: 770  NRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            N+V+L   L+  E A  EL+K +Q++ RKK LRF RS IHDWGL A+E I   + VIEYV
Sbjct: 1221 NQVQLAMALSKGENAATELVKFNQLQTRKKHLRFARSPIHDWGLYAMEKISRGEMVIEYV 1280

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            GE+IR +++D RE  YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  K+I
Sbjct: 1281 GEIIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKII 1340

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            ++  +KKI IYAK+ I  G EITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 1341 TINSEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGYLN 1389


>gi|328875054|gb|EGG23419.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 1359

 Score =  215 bits (547), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 94/145 (64%), Positives = 116/145 (80%)

Query: 792  VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
            + +RKK+++F+RS IHDWGL ALEPI A D VIEY+GE+IR K++D RE RY K GIGSS
Sbjct: 1215 MTSRKKKIKFERSLIHDWGLFALEPIYARDMVIEYIGEVIRQKVADEREKRYTKQGIGSS 1274

Query: 852  YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
            YLFR+DD  ++DAT +G  ARFINH C+PNC  KVI++ GQKKI IYAKR I  GEE+TY
Sbjct: 1275 YLFRIDDDTIIDATFKGNQARFINHCCDPNCMAKVITMGGQKKIIIYAKRDINVGEELTY 1334

Query: 912  NYKFPLEEKKIPCYCGSKKCHGSLN 936
            +YKFP+E+ KIPC C S KC G+LN
Sbjct: 1335 DYKFPIEDVKIPCLCKSAKCRGTLN 1359


>gi|119572569|gb|EAW52184.1| hCG1998636, isoform CRA_e [Homo sapiens]
          Length = 1175

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 102/193 (52%), Positives = 133/193 (68%), Gaps = 2/193 (1%)

Query: 746  HTKYL--GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQR 803
            HT  L  GP  + +  A  +G    + + K + L      A  L+    + RKK+LRF R
Sbjct: 983  HTNILSDGPREHQTGSARSEGYYPISKKEKDKYLDVCPVSARQLEGVDTQFRKKKLRFGR 1042

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S+IH+WGL A+EPI A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++D
Sbjct: 1043 SRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIID 1102

Query: 864  ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
            ATK G +ARFINH C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIP
Sbjct: 1103 ATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIP 1162

Query: 924  CYCGSKKCHGSLN 936
            C CG++ C GSLN
Sbjct: 1163 CLCGTESCRGSLN 1175


>gi|410922130|ref|XP_003974536.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Takifugu
            rubripes]
          Length = 1302

 Score =  214 bits (546), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 104/203 (51%), Positives = 143/203 (70%), Gaps = 13/203 (6%)

Query: 744  YVHTKYLGPEVNASQWANGKGLSARTN---------RVKLRNLLAAAE-GAELLKASQVK 793
            Y++   L  E+ ++Q   G G++ + +         R++ R LL++    ++LLK +Q+K
Sbjct: 1103 YMNNTELSAELPSTQ---GTGVAGQQSTSLRWGSDFRLEQRRLLSSFNCDSDLLKFNQLK 1159

Query: 794  ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
             RKK  RF  S IH+WGL A+EPI AE+ V+EYVGE+IR  I+D+RE RYE+ GI SSY+
Sbjct: 1160 FRKKMTRFSPSHIHEWGLFAMEPIAAEEMVMEYVGEIIRQVIADMREQRYEESGIRSSYM 1219

Query: 854  FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
            FR+D   ++DATK G +ARFINHSCNPNCY K+I+VE QKKI IY+++ I+  EEITY+Y
Sbjct: 1220 FRIDQETIIDATKCGNVARFINHSCNPNCYAKIITVESQKKIVIYSRQPISINEEITYDY 1279

Query: 914  KFPLEEKKIPCYCGSKKCHGSLN 936
            KFP E+ KIPC C +  C GSLN
Sbjct: 1280 KFPSEDTKIPCLCRATGCRGSLN 1302


>gi|281201147|gb|EFA75361.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 1458

 Score =  214 bits (546), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 99/171 (57%), Positives = 126/171 (73%), Gaps = 10/171 (5%)

Query: 766  SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
            SAR++R + R       G +++    + +RKK+++F RS IHDWGL ALEPI A+D VIE
Sbjct: 1298 SARSSRYESRM------GGDIV----LTSRKKKIKFSRSAIHDWGLFALEPITAKDMVIE 1347

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            Y+GE+IR K++D RE RY K GIGSSYLFR+DD  ++DAT +G +ARFINH C+PNC  K
Sbjct: 1348 YIGEVIRQKVADEREKRYTKKGIGSSYLFRIDDDTIIDATFKGNLARFINHCCDPNCIAK 1407

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +I    QKKI IYAKR I  GEEITY+YKFP+E+ KIPC C S KC G+LN
Sbjct: 1408 IIQTGNQKKIVIYAKRDIMIGEEITYDYKFPIEDVKIPCLCKSAKCRGTLN 1458


>gi|403159096|ref|XP_003890756.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
            tritici CRL 75-36-700-3]
 gi|375166585|gb|EHS63201.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
            tritici CRL 75-36-700-3]
          Length = 1502

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 94/153 (61%), Positives = 119/153 (77%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            A++LK +Q++ RKK+L+F RS IHDWGL A+E I A + VIEYVGE+IR  ++D RE  Y
Sbjct: 1350 ADVLKFNQLRTRKKQLKFSRSPIHDWGLYAMETIPAGEMVIEYVGEVIRQAVADRREKLY 1409

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E+MGIGSSYLFR+DD  VVDATK+G + R INH C+PNC  K+I++ G+KKI IYAK  I
Sbjct: 1410 ERMGIGSSYLFRVDDDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTI 1469

Query: 904  AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
              G+E+TY+Y FP EE KIPC CGS KC G+LN
Sbjct: 1470 ELGDEVTYDYHFPKEEVKIPCLCGSVKCKGTLN 1502


>gi|348554403|ref|XP_003463015.1| PREDICTED: hypothetical protein LOC100714908 [Cavia porcellus]
          Length = 1931

 Score =  213 bits (543), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 104/169 (61%), Positives = 133/169 (78%), Gaps = 3/169 (1%)

Query: 771  RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1763 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1822

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            G+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI
Sbjct: 1823 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1882

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +VE QKKI IY+K+HI+  EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1883 TVESQKKIVIYSKQHISVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1931


>gi|326436327|gb|EGD81897.1| hypothetical protein PTSG_11893 [Salpingoeca sp. ATCC 50818]
          Length = 296

 Score =  213 bits (543), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 96/155 (61%), Positives = 127/155 (81%), Gaps = 2/155 (1%)

Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
           A+L+KA+Q K R+K LRF RS IH+WGL A EPI+ ++ VIEYVGE++R  +++ RE RY
Sbjct: 142 ADLVKANQFKRRRKLLRFARSMIHEWGLFAQEPIDKDELVIEYVGEIVRQTVAEDRERRY 201

Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            ++GIGSSYLFR+D+ YV+DAT+ G IARFINHSC+ NCY +V+SV+G+K+I IY+KR I
Sbjct: 202 ARIGIGSSYLFRIDEDYVIDATRMGSIARFINHSCDANCYAQVVSVDGKKRIGIYSKRPI 261

Query: 904 AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           AA EEITY+YKFP EE   KIPC+CG++ C G+LN
Sbjct: 262 AANEEITYDYKFPREEGPNKIPCFCGARTCRGTLN 296


>gi|355718717|gb|AES06362.1| SET domain containing 1A [Mustela putorius furo]
          Length = 296

 Score =  213 bits (542), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 103/178 (57%), Positives = 132/178 (74%), Gaps = 11/178 (6%)

Query: 769 TNRV------KLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           TNRV      + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI
Sbjct: 119 TNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPI 178

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ VIEYVG+ IR  ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH 
Sbjct: 179 AADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHC 238

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSL 935
           C PNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSL
Sbjct: 239 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSL 296


>gi|334327124|ref|XP_003340832.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1B-like, partial [Monodelphis domestica]
          Length = 1723

 Score =  213 bits (542), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 140/186 (75%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1540 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1597

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1598 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1657

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI+  EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1658 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIEDVKIPCLCGSEN 1717

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1718 CRGTLN 1723


>gi|331214149|ref|XP_003319756.1| Setd1a protein [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
          Length = 1014

 Score =  213 bits (542), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 94/153 (61%), Positives = 119/153 (77%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            A++LK +Q++ RKK+L+F RS IHDWGL A+E I A + VIEYVGE+IR  ++D RE  Y
Sbjct: 862  ADVLKFNQLRTRKKQLKFSRSPIHDWGLYAMETIPAGEMVIEYVGEVIRQAVADRREKLY 921

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E+MGIGSSYLFR+DD  VVDATK+G + R INH C+PNC  K+I++ G+KKI IYAK  I
Sbjct: 922  ERMGIGSSYLFRVDDDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTI 981

Query: 904  AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
              G+E+TY+Y FP EE KIPC CGS KC G+LN
Sbjct: 982  ELGDEVTYDYHFPKEEVKIPCLCGSVKCKGTLN 1014


>gi|395329295|gb|EJF61682.1| hypothetical protein DICSQDRAFT_85722 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1095

 Score =  213 bits (541), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 97/169 (57%), Positives = 125/169 (73%), Gaps = 2/169 (1%)

Query: 770  NRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            N+V+    L+  E A  + +K +Q++ RKK LRF RS IHDWGL A+E I   D VIEYV
Sbjct: 927  NQVQRAMALSKGEAAAPDSVKYNQLQTRKKHLRFARSPIHDWGLYAMEKINRGDLVIEYV 986

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            GE+IR +++D RE  YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  K+I
Sbjct: 987  GEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKII 1046

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            ++ G+KKI IYAK+ I  G EITY+Y FP+E+ KIPC CGS KC G+LN
Sbjct: 1047 TISGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGTLN 1095


>gi|71897211|ref|NP_001025832.1| histone-lysine N-methyltransferase SETD1B [Gallus gallus]
 gi|82231199|sp|Q5F3P8.1|SET1B_CHICK RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=SET domain-containing protein 1B
 gi|60098811|emb|CAH65236.1| hypothetical protein RCJMB04_10j6 [Gallus gallus]
          Length = 2008

 Score =  213 bits (541), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 109/189 (57%), Positives = 140/189 (74%), Gaps = 5/189 (2%)

Query: 751  GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
            G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IH
Sbjct: 1822 GMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIH 1879

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            DWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK 
Sbjct: 1880 DWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 1939

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
            G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CG
Sbjct: 1940 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCG 1999

Query: 928  SKKCHGSLN 936
            S+ C G+LN
Sbjct: 2000 SENCRGTLN 2008


>gi|224071200|ref|XP_002193972.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Taeniopygia
            guttata]
          Length = 2004

 Score =  212 bits (540), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1821 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1878

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1879 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1938

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1939 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1998

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1999 CRGTLN 2004


>gi|198414837|ref|XP_002125245.1| PREDICTED: similar to AGAP002246-PA, partial [Ciona intestinalis]
          Length = 754

 Score =  212 bits (539), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 109/247 (44%), Positives = 159/247 (64%), Gaps = 9/247 (3%)

Query: 695 KKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV 754
           K+ +A K   S++ P   GCARS      ++K + N    +R  +        +  G   
Sbjct: 512 KRRQARKDDISDIKPHKTGCARSE----GFYKVNDNTKLIQRKLMYRQADNTPQPKGRSK 567

Query: 755 NASQWANGKGLS---ARTNRVKLRNLLA--AAEGAELLKASQVKARKKRLRFQRSKIHDW 809
            A++ A+  G++   +R  R  +R + +   ++ ++LLK +Q+  RKK ++F+RS IH W
Sbjct: 568 YANESASTSGVATEKSRETRHMMRRIASEFGSDASDLLKYNQLMYRKKSVKFKRSHIHGW 627

Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
           GL A E I A++ VIEYVG+L+R  I+D RE  Y + GIGSSYLFR+D  +++DATK G 
Sbjct: 628 GLFAEETIGADEMVIEYVGQLVRSLIADRREVDYTRRGIGSSYLFRIDSDHIIDATKCGN 687

Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSK 929
            ARF+NHSCNP+CY KVI+V+G KKI IY+K  I   +EITY+YKFP+E+ KIPC+CG+ 
Sbjct: 688 FARFMNHSCNPSCYAKVIAVDGAKKIVIYSKDTIKPTDEITYDYKFPIEDVKIPCFCGAP 747

Query: 930 KCHGSLN 936
            C G+LN
Sbjct: 748 NCRGTLN 754


>gi|86129848|gb|ABC86576.1| KIAA0339 protein [Danio rerio]
          Length = 406

 Score =  212 bits (539), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 102/177 (57%), Positives = 131/177 (74%), Gaps = 7/177 (3%)

Query: 765 LSARTNRVKLRNLLAA-----AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
           LS R  R + R LL+A        ++LLK +Q+K RKK+LRF RS+IH+WGL A+EPI A
Sbjct: 232 LSER--RSEQRRLLSAIGTPAVMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAA 289

Query: 820 EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
           ++ VIEYVG+ IR  ++D  E RY + GIGSSYLFR+D   ++DATK G +ARFINH C 
Sbjct: 290 DEMVIEYVGQSIRQMVADNWEKRYAQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCT 349

Query: 880 PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           PNCY KVI++E QKKI IY+K+ I   EEITY+YKFP+EE KIPC CG++ C G+LN
Sbjct: 350 PNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPIEENKIPCLCGTESCRGTLN 406


>gi|443726566|gb|ELU13685.1| hypothetical protein CAPTEDRAFT_150651 [Capitella teleta]
          Length = 292

 Score =  212 bits (539), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 115/251 (45%), Positives = 151/251 (60%), Gaps = 19/251 (7%)

Query: 692 VAMKKTKASKSRTSNLCPRSDGCARSS----ISGWEWHKWSLNASPAERARVRGAQYVHT 747
           V  KK K    +   +     GCAR+     IS  E H +  +         R A     
Sbjct: 55  VPPKKRKKYSQQDEVMVAHKTGCARTEGYYKISHQEKHGYLKDFRGVNERDARDAA---- 110

Query: 748 KYLGPEVNASQWANGKGLSARTNRVKLRNLL--AAAEGAELLKASQVKARKKRLRFQRSK 805
             L  +V  S+        AR+   + +N+L     +  +LLK +Q+K RKK L+F +S 
Sbjct: 111 --LKKKVQISR-------EARSESRRWQNVLQLMGEDIGDLLKFNQLKFRKKNLKFAKSG 161

Query: 806 IHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDAT 865
           IHDWGL ALEPI   + VIEYVG ++R   +D+RE +YE MGIGSSYLFR+D   ++DAT
Sbjct: 162 IHDWGLFALEPIAEGEMVIEYVGAVVRQSTADLREKKYEAMGIGSSYLFRIDHDLIIDAT 221

Query: 866 KRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCY 925
           K G +ARFINHSCNPNC  K+I+VE  KKI IY++R I   EEITY+YKFPLE++KIPC 
Sbjct: 222 KCGNLARFINHSCNPNCVAKIITVESHKKIVIYSRRDIGVNEEITYDYKFPLEDEKIPCL 281

Query: 926 CGSKKCHGSLN 936
           CG+  C G+LN
Sbjct: 282 CGTSACRGTLN 292


>gi|441630858|ref|XP_003280765.2| PREDICTED: uncharacterized protein LOC100584028 [Nomascus leucogenys]
          Length = 1863

 Score =  211 bits (538), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1680 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1737

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1738 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1797

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1798 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1857

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1858 CRGTLN 1863


>gi|119618696|gb|EAW98290.1| hCG1812756 [Homo sapiens]
          Length = 1048

 Score =  211 bits (538), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 865  IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 922

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 923  LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 982

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 983  ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1042

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1043 CRGTLN 1048


>gi|210032580|ref|NP_055863.1| histone-lysine N-methyltransferase SETD1B [Homo sapiens]
 gi|166977692|sp|Q9UPS6.2|SET1B_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=Lysine N-methyltransferase 2G; AltName: Full=SET
            domain-containing protein 1B; Short=hSET1B
          Length = 1923

 Score =  211 bits (538), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1740 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1797

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1798 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1857

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1858 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1917

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1918 CRGTLN 1923


>gi|392352531|ref|XP_003751234.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Rattus
            norvegicus]
          Length = 1900

 Score =  211 bits (537), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 109/190 (57%), Positives = 141/190 (74%), Gaps = 5/190 (2%)

Query: 750  LGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKI 806
            +G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S I
Sbjct: 1713 MGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHI 1770

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
            HDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK
Sbjct: 1771 HDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATK 1830

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
             G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC C
Sbjct: 1831 CGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLC 1890

Query: 927  GSKKCHGSLN 936
            GS+ C G+LN
Sbjct: 1891 GSENCRGTLN 1900


>gi|334192482|gb|AEG67286.1| histone-lysine N-methyltransferase [Homo sapiens]
          Length = 1966

 Score =  211 bits (537), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1783 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1840

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1841 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1900

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1901 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1960

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1961 CRGTLN 1966


>gi|402887949|ref|XP_003907341.1| PREDICTED: uncharacterized protein LOC101023789 [Papio anubis]
          Length = 1927

 Score =  211 bits (537), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1744 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1801

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1802 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1861

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1862 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1921

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1922 CRGTLN 1927


>gi|410047437|ref|XP_003314036.2| PREDICTED: uncharacterized protein LOC473295, partial [Pan
            troglodytes]
          Length = 1955

 Score =  211 bits (537), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1772 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1829

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1830 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1889

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1890 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1949

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1950 CRGTLN 1955


>gi|403281795|ref|XP_003932362.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Saimiri
            boliviensis boliviensis]
          Length = 1823

 Score =  211 bits (537), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1640 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1697

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1698 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1757

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1758 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1817

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1818 CRGTLN 1823


>gi|344297409|ref|XP_003420391.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1B-like [Loxodonta africana]
          Length = 1750

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)

Query: 771  RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1582 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1641

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            G+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI
Sbjct: 1642 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1701

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1702 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1750


>gi|320168697|gb|EFW45596.1| Setd1a protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1312

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 98/172 (56%), Positives = 121/172 (70%), Gaps = 3/172 (1%)

Query: 768  RTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
            R NR++LR   A  E A   +      +K+RKK L+FQRS IH +GL + E I A D VI
Sbjct: 1141 RANRLQLRQFAAVIEQATNSDTFSLDMLKSRKKLLKFQRSGIHAFGLFSQENISANDLVI 1200

Query: 825  EYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
            EYVGE+IR  ISDIRE  YE+ GIGSSY FR+D+ +VVDAT +G +ARF+NH C PNCY 
Sbjct: 1201 EYVGEVIRQSISDIREHHYERRGIGSSYFFRIDEDHVVDATYKGNLARFMNHCCEPNCYA 1260

Query: 885  KVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            K+I V+G ++I IY+KR I  GEEITY+YKFP EE KIPC CG+  C   LN
Sbjct: 1261 KIIMVDGHQRIVIYSKRDIKKGEEITYDYKFPYEENKIPCLCGAVNCKKFLN 1312


>gi|405967140|gb|EKC32340.1| Histone-lysine N-methyltransferase SETD1B-A [Crassostrea gigas]
          Length = 1401

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 91/142 (64%), Positives = 113/142 (79%)

Query: 795  RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
            RKK+L+F RS IHDWGL ALEPI A++ VIEYVGE +R  ++D+RE +YE  G GSSYLF
Sbjct: 1260 RKKQLKFARSGIHDWGLFALEPIAADEMVIEYVGETLRQSLADLREKKYEAEGCGSSYLF 1319

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            R+D   ++DATK G +ARFINH CNPNCY K+I+VE QKKI IY+KR I   EEITY+YK
Sbjct: 1320 RVDTETIIDATKCGNLARFINHCCNPNCYAKIITVESQKKIVIYSKRDIDVNEEITYDYK 1379

Query: 915  FPLEEKKIPCYCGSKKCHGSLN 936
            FP+E++KIPC CG+  C G+LN
Sbjct: 1380 FPIEDEKIPCLCGAPNCRGTLN 1401


>gi|301754587|ref|XP_002913168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1B-like [Ailuropoda melanoleuca]
          Length = 1805

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 110/193 (56%), Positives = 141/193 (73%), Gaps = 5/193 (2%)

Query: 747  TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
            T   G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +
Sbjct: 1615 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1672

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S IHDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++D
Sbjct: 1673 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1732

Query: 864  ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
            ATK G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIP
Sbjct: 1733 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIP 1792

Query: 924  CYCGSKKCHGSLN 936
            C CGS+ C G+LN
Sbjct: 1793 CLCGSENCRGTLN 1805


>gi|392560212|gb|EIW53395.1| hypothetical protein TRAVEDRAFT_154887 [Trametes versicolor FP-101664
            SS1]
          Length = 1014

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 96/169 (56%), Positives = 125/169 (73%), Gaps = 2/169 (1%)

Query: 770  NRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            N+V+    L+  E A  + +K +Q++ RKK LRF RS IHDWGL A+E I   + VIEYV
Sbjct: 846  NQVQRAMALSKGEAAAPDSVKYNQLQTRKKHLRFARSPIHDWGLYAMEKISRGEMVIEYV 905

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            GE+IR +++D RE  YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  K+I
Sbjct: 906  GEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKII 965

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            ++ G+KKI IYAK+ I  G EITY+Y FP+E+ KIPC CGS KC G+LN
Sbjct: 966  TISGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGTLN 1014


>gi|5689489|dbj|BAA83028.1| KIAA1076 protein [Homo sapiens]
          Length = 804

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
           + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 621 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 678

Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
           L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 679 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 738

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
           ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 739 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 798

Query: 931 CHGSLN 936
           C G+LN
Sbjct: 799 CRGTLN 804


>gi|149063329|gb|EDM13652.1| rCG21620 [Rattus norvegicus]
          Length = 1091

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 908  IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 965

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 966  LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1025

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1026 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1085

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1086 CRGTLN 1091


>gi|396578140|ref|NP_001035488.2| histone-lysine N-methyltransferase SETD1B [Mus musculus]
 gi|166977693|sp|Q8CFT2.2|SET1B_MOUSE RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=SET domain-containing protein 1B
          Length = 1985

 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1802 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1859

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1860 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1919

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1920 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1979

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1980 CRGTLN 1985


>gi|426374487|ref|XP_004054104.1| PREDICTED: uncharacterized protein LOC101124677 [Gorilla gorilla
            gorilla]
          Length = 1922

 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)

Query: 771  RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1754 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1813

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            G+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI
Sbjct: 1814 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1873

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1874 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1922


>gi|410976579|ref|XP_003994695.1| PREDICTED: uncharacterized protein LOC101096419 [Felis catus]
          Length = 1919

 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 110/193 (56%), Positives = 141/193 (73%), Gaps = 5/193 (2%)

Query: 747  TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
            T   G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +
Sbjct: 1729 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1786

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S IHDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++D
Sbjct: 1787 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1846

Query: 864  ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
            ATK G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIP
Sbjct: 1847 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIP 1906

Query: 924  CYCGSKKCHGSLN 936
            C CGS+ C G+LN
Sbjct: 1907 CLCGSENCRGTLN 1919


>gi|50546869|ref|XP_500904.1| YALI0B14883p [Yarrowia lipolytica]
 gi|74689791|sp|Q6CEK8.1|SET1_YARLI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|49646770|emb|CAG83155.1| YALI0B14883p [Yarrowia lipolytica CLIB122]
          Length = 1170

 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 99/181 (54%), Positives = 130/181 (71%), Gaps = 10/181 (5%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R L+A         +   ++L  +Q++ RKK ++F RS IH+WGL A+EPI
Sbjct: 990  NSRVNRANNRRLVADINMQKQLLSTETDVLNFNQLRKRKKPVKFARSAIHNWGLYAIEPI 1049

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A + +IEYVGE++R +I+D+RE RY + GIGSSYLFR+D+  VVDATKRGGIARFINH 
Sbjct: 1050 AANEMIIEYVGEVVRQEIADLREARYMRSGIGSSYLFRVDESTVVDATKRGGIARFINHC 1109

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSL 935
            C P+C  K+I VEGQK+I IYA R IAA EE+TY+YKF  E  E++IPC CG+  C G L
Sbjct: 1110 CTPSCTAKIIKVEGQKRIVIYASRDIAANEELTYDYKFEKEIGEERIPCLCGAPGCKGYL 1169

Query: 936  N 936
            N
Sbjct: 1170 N 1170


>gi|390351134|ref|XP_003727587.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
           [Strongylocentrotus purpuratus]
          Length = 282

 Score =  210 bits (534), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 96/175 (54%), Positives = 128/175 (73%), Gaps = 5/175 (2%)

Query: 767 ARTNRVKLRNLLAAAEGAE-----LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
           +R  R   R  +A  +  E     L K +Q++ RKK ++F +S IH WGL A+EPI A++
Sbjct: 108 SRDARSDQRRFIAVLQQQEEQVSNLFKFNQLQFRKKDIKFCKSSIHGWGLYAMEPIAADE 167

Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
            VIEYVGE +R  I+D RE  YE+MGIGSSYLFR+D   ++DATK G +ARFINHSCNPN
Sbjct: 168 MVIEYVGESVRQSIADSREKAYERMGIGSSYLFRIDAVTIIDATKSGNLARFINHSCNPN 227

Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           CY K+I+VE +KKI IY+K+ I  G+EITY+YKFP+E++KI C CG+ +C G+LN
Sbjct: 228 CYAKIITVESEKKIVIYSKQTINVGDEITYDYKFPIEDEKISCLCGAAQCRGTLN 282


>gi|355786615|gb|EHH66798.1| hypothetical protein EGM_03852, partial [Macaca fascicularis]
          Length = 673

 Score =  210 bits (534), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
           + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 490 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 547

Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
           L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 548 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 607

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
           ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 608 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 667

Query: 931 CHGSLN 936
           C G+LN
Sbjct: 668 CRGTLN 673


>gi|392332670|ref|XP_003752655.1| PREDICTED: uncharacterized protein LOC100359816 [Rattus norvegicus]
          Length = 2265

 Score =  210 bits (534), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)

Query: 771  RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 2097 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 2156

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            G+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI
Sbjct: 2157 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 2216

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 2217 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 2265


>gi|351698529|gb|EHB01448.1| Histone-lysine N-methyltransferase SETD1B [Heterocephalus glaber]
          Length = 1486

 Score =  210 bits (534), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 109/189 (57%), Positives = 140/189 (74%), Gaps = 5/189 (2%)

Query: 751  GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
            G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IH
Sbjct: 1300 GMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIH 1357

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            DWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK 
Sbjct: 1358 DWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 1417

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
            G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CG
Sbjct: 1418 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCG 1477

Query: 928  SKKCHGSLN 936
            S+ C G+LN
Sbjct: 1478 SENCRGTLN 1486


>gi|297263735|ref|XP_002808043.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1B-like [Macaca mulatta]
          Length = 2216

 Score =  210 bits (534), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)

Query: 771  RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 2048 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 2107

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            G+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI
Sbjct: 2108 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 2167

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 2168 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 2216


>gi|440470515|gb|ELQ39582.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
 gi|440488496|gb|ELQ68221.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
          Length = 1278

 Score =  210 bits (534), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 112/242 (46%), Positives = 154/242 (63%), Gaps = 15/242 (6%)

Query: 708  CPRSDGCARSS----ISGWEWHKWSLNASPAERARV-RGAQYVHTKYLGPEVNASQWANG 762
             P + GCAR+     I   E  K+  +    ++AR  R AQ       G  + A + A+ 
Sbjct: 1039 VPNATGCARTEGVKKILNSEKSKYLPHHIKVKKAREERQAQAKAAP--GKSLTAVKVADD 1096

Query: 763  KGLS---ARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
              +S   +R NR   R  +A    + ++ L+ +Q+K RKK ++F+RS IH+WGL A+E I
Sbjct: 1097 NSISRGSSRANRANHRRAMAEMNDQKSDALRFNQLKKRKKPVKFERSAIHNWGLYAMEHI 1156

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R  ++ +RE RY++ GIGSSYLFR+D+  V+DATK+GGIARFINHS
Sbjct: 1157 PKDDMIIEYVGEEVRPSVAQVREARYDRSGIGSSYLFRIDEDAVIDATKKGGIARFINHS 1216

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF LEEK   ++PC CG+  C G 
Sbjct: 1217 CMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKEEDRVPCLCGTTNCKGF 1276

Query: 935  LN 936
            LN
Sbjct: 1277 LN 1278


>gi|389634753|ref|XP_003715029.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
 gi|351647362|gb|EHA55222.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
          Length = 1278

 Score =  209 bits (533), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 112/242 (46%), Positives = 154/242 (63%), Gaps = 15/242 (6%)

Query: 708  CPRSDGCARSS----ISGWEWHKWSLNASPAERARV-RGAQYVHTKYLGPEVNASQWANG 762
             P + GCAR+     I   E  K+  +    ++AR  R AQ       G  + A + A+ 
Sbjct: 1039 VPNATGCARTEGVKKILNSEKSKYLPHHIKVKKAREERQAQAKAAP--GKSLTAVKVADD 1096

Query: 763  KGLS---ARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
              +S   +R NR   R  +A    + ++ L+ +Q+K RKK ++F+RS IH+WGL A+E I
Sbjct: 1097 NSISRGSSRANRANHRRAMAEMNDQKSDALRFNQLKKRKKPVKFERSAIHNWGLYAMEHI 1156

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R  ++ +RE RY++ GIGSSYLFR+D+  V+DATK+GGIARFINHS
Sbjct: 1157 PKDDMIIEYVGEEVRPSVAQVREARYDRSGIGSSYLFRIDEDAVIDATKKGGIARFINHS 1216

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF LEEK   ++PC CG+  C G 
Sbjct: 1217 CMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKEEDRVPCLCGTTNCKGF 1276

Query: 935  LN 936
            LN
Sbjct: 1277 LN 1278


>gi|345321023|ref|XP_001506028.2| PREDICTED: hypothetical protein LOC100074411 [Ornithorhynchus
            anatinus]
          Length = 1258

 Score =  209 bits (533), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)

Query: 771  RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1090 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1149

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            G+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI
Sbjct: 1150 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1209

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1210 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1258


>gi|355564772|gb|EHH21272.1| hypothetical protein EGK_04290, partial [Macaca mulatta]
          Length = 663

 Score =  209 bits (533), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
           + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 480 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 537

Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
           L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 538 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 597

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
           ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 598 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 657

Query: 931 CHGSLN 936
           C G+LN
Sbjct: 658 CRGTLN 663


>gi|211830050|gb|AAH38367.2| Setd1b protein [Mus musculus]
          Length = 1103

 Score =  209 bits (533), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 920  IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 977

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 978  LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1037

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1038 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1097

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1098 CRGTLN 1103


>gi|28972602|dbj|BAC65717.1| mKIAA1076 protein [Mus musculus]
          Length = 855

 Score =  209 bits (533), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
           + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 672 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 729

Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
           L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 730 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 789

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
           ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 790 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 849

Query: 931 CHGSLN 936
           C G+LN
Sbjct: 850 CRGTLN 855


>gi|27371314|gb|AAH41681.1| Setd1b protein, partial [Mus musculus]
          Length = 917

 Score =  209 bits (533), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
           + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 734 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 791

Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
           L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 792 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 851

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
           ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 852 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 911

Query: 931 CHGSLN 936
           C G+LN
Sbjct: 912 CRGTLN 917


>gi|26251880|gb|AAH40775.1| Setd1b protein, partial [Mus musculus]
          Length = 911

 Score =  209 bits (533), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 108/186 (58%), Positives = 139/186 (74%), Gaps = 5/186 (2%)

Query: 754 VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
           + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 728 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 785

Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
           L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 786 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 845

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
           ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 846 ARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSEN 905

Query: 931 CHGSLN 936
           C G+LN
Sbjct: 906 CRGTLN 911


>gi|431912177|gb|ELK14315.1| Histone-lysine N-methyltransferase SETD1B [Pteropus alecto]
          Length = 245

 Score =  209 bits (532), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 110/203 (54%), Positives = 146/203 (71%), Gaps = 3/203 (1%)

Query: 737 ARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVK 793
           AR  G   +  K     +N+S+ +  +  +    R + R LL++  G+   +LLK +Q+K
Sbjct: 43  ARSEGFYTIDKKDKLRYLNSSRASTDEPPADTQRRSEQRRLLSSFTGSCDSDLLKFNQLK 102

Query: 794 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
            RKK+L+F +S IHDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+
Sbjct: 103 FRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYM 162

Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
           FR+D   ++DATK G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI+  EEITY+Y
Sbjct: 163 FRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDY 222

Query: 914 KFPLEEKKIPCYCGSKKCHGSLN 936
           KFP+E+ KIPC CGS+ C G+LN
Sbjct: 223 KFPIEDVKIPCLCGSENCRGTLN 245


>gi|296422581|ref|XP_002840838.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637063|emb|CAZ85029.1| unnamed protein product [Tuber melanosporum]
          Length = 1200

 Score =  209 bits (532), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 104/199 (52%), Positives = 134/199 (67%), Gaps = 14/199 (7%)

Query: 752  PEVNASQWAN---GKGLSARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLR 800
            PEV A+   N       ++R NRV  R L+          +  A++++ +Q+K RKK ++
Sbjct: 1002 PEVTAAAVTNPTKNPSSTSRMNRVNNRRLVVGLNNQKQMLSSDADVMRFNQLKKRKKPVK 1061

Query: 801  FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGY 860
            F RS IH+WGL A+E I A D +IEYVGE+IR +++D+RE +Y K GIGSSYLFR+DD  
Sbjct: 1062 FARSAIHNWGLYAMENISAGDMIIEYVGEIIRQQVADMREKKYLKSGIGSSYLFRIDDTT 1121

Query: 861  VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE-- 918
            V+DATK GGIARFINHSC PNC  K+I VEG K+I IYA R I   EE+TY+YKF  E  
Sbjct: 1122 VIDATKAGGIARFINHSCTPNCTAKIIKVEGSKRIVIYALRDIRENEELTYDYKFERELE 1181

Query: 919  -EKKIPCYCGSKKCHGSLN 936
             E++IPC CGS  C G LN
Sbjct: 1182 SEERIPCLCGSSGCKGFLN 1200


>gi|395513793|ref|XP_003761107.1| PREDICTED: uncharacterized protein LOC100928096 [Sarcophilus
            harrisii]
          Length = 1224

 Score =  209 bits (531), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 104/169 (61%), Positives = 132/169 (78%), Gaps = 3/169 (1%)

Query: 771  RVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYV
Sbjct: 1056 RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYV 1115

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            G+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI
Sbjct: 1116 GQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVI 1175

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1176 TVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 1224


>gi|345791349|ref|XP_543382.3| PREDICTED: histone-lysine N-methyltransferase SETD1B [Canis lupus
            familiaris]
          Length = 1920

 Score =  209 bits (531), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 107/186 (57%), Positives = 138/186 (74%), Gaps = 5/186 (2%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIHDWG 810
            + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IHDWG
Sbjct: 1737 IPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIHDWG 1794

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK G  
Sbjct: 1795 LFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNF 1854

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
            ARFINHSCNPNCY KVI+VE QKKI IY+ +HI   EEITY+YKFP+E+ KIPC CGS+ 
Sbjct: 1855 ARFINHSCNPNCYAKVITVESQKKIVIYSNQHINVNEEITYDYKFPIEDVKIPCLCGSEN 1914

Query: 931  CHGSLN 936
            C G+LN
Sbjct: 1915 CRGTLN 1920


>gi|156393989|ref|XP_001636609.1| predicted protein [Nematostella vectensis]
 gi|156223714|gb|EDO44546.1| predicted protein [Nematostella vectensis]
          Length = 213

 Score =  209 bits (531), Expect = 7e-51,   Method: Composition-based stats.
 Identities = 99/173 (57%), Positives = 129/173 (74%), Gaps = 3/173 (1%)

Query: 767 ARTNRVKLRNLLAAA---EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           +R NR   R L  A    +  ++LK ++++ RKK+L+F +S IHDWGL ALEPI A++ V
Sbjct: 41  SRENRAMQRRLQLAVCNEDFGDILKFNKLRVRKKQLKFAKSSIHDWGLFALEPIAADEMV 100

Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
           IEYVGE+IR  I+D RE  YE+ GIGSSY+FRLD+  ++DAT  G  ARFINH C+PNCY
Sbjct: 101 IEYVGEVIRQAIADYRERCYEERGIGSSYMFRLDETTIIDATTMGNFARFINHCCDPNCY 160

Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            KVI+VE  KKI IY+KR I   EEITY+YKFP+E++KIPC CG+ +C G+LN
Sbjct: 161 AKVIAVENMKKIVIYSKRDIQVDEEITYDYKFPIEDEKIPCLCGAPQCRGTLN 213


>gi|336372757|gb|EGO01096.1| hypothetical protein SERLA73DRAFT_50848 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 260

 Score =  209 bits (531), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 96/155 (61%), Positives = 119/155 (76%), Gaps = 1/155 (0%)

Query: 783 GAEL-LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIREC 841
            AEL LK +Q++ RKK LRF RS IHDWGL A+E I   + VIEYVGE+IR +++D RE 
Sbjct: 106 AAELNLKFNQLQTRKKHLRFARSPIHDWGLYAMERISRGEMVIEYVGEVIRAQVADKREK 165

Query: 842 RYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKR 901
            YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  K+I++ G+KKI IYAK+
Sbjct: 166 VYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQ 225

Query: 902 HIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I  GEEITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 226 DIELGEEITYDYHFPIEQDKIPCLCGSAKCRGYLN 260


>gi|449666506|ref|XP_002161122.2| PREDICTED: uncharacterized protein LOC100198749 [Hydra
            magnipapillata]
          Length = 1403

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 98/175 (56%), Positives = 127/175 (72%), Gaps = 1/175 (0%)

Query: 763  KGLSARTNRVKLRNLLAAAEGAELLK-ASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            K    R  + +L + +A  E + L+   +++  RKK LRF +S IH+WGL A EPI A++
Sbjct: 1229 KSRQKRAVQRRLASSIAQEEFSSLINYNNKLMRRKKALRFSKSSIHNWGLFACEPINADE 1288

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             V EYVG+++R  +++IRE RYEK GIGSSYLFRLD   V+DATK G  ARFINH C+PN
Sbjct: 1289 MVCEYVGQMVRSIVAEIRERRYEKQGIGSSYLFRLDSDSVIDATKDGCNARFINHCCDPN 1348

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY KVI VEG KKI IY++R I  GEEITY+YKFP+E++KIPC CG+  C G+LN
Sbjct: 1349 CYAKVILVEGAKKIVIYSRRAIKLGEEITYDYKFPIEDEKIPCLCGAALCRGTLN 1403


>gi|291001085|ref|XP_002683109.1| predicted protein [Naegleria gruberi]
 gi|284096738|gb|EFC50365.1| predicted protein [Naegleria gruberi]
          Length = 147

 Score =  208 bits (529), Expect = 1e-50,   Method: Composition-based stats.
 Identities = 96/147 (65%), Positives = 117/147 (79%)

Query: 790 SQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           +Q+  RKK L+F +S IHDWGL ALE IE  + VIEYVGE+IR  +SDIRE RYE++GIG
Sbjct: 1   NQLDTRKKNLKFAKSPIHDWGLFALEDIEQGEMVIEYVGEVIREALSDIREKRYEQIGIG 60

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
           SSYLFRLD+  ++DATKRG +ARFINHSC+PNC  ++I V+ QKK+ IYA R I  GEEI
Sbjct: 61  SSYLFRLDNDLIIDATKRGNLARFINHSCDPNCCARIIEVDKQKKVCIYALRKILVGEEI 120

Query: 910 TYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           TY+YKFP+EE KIPC CGS+KC   LN
Sbjct: 121 TYDYKFPIEESKIPCKCGSQKCKKWLN 147


>gi|432094921|gb|ELK26329.1| Histone-lysine N-methyltransferase SETD1B [Myotis davidii]
          Length = 1462

 Score =  208 bits (529), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 109/193 (56%), Positives = 140/193 (72%), Gaps = 5/193 (2%)

Query: 747  TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
            T   G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +
Sbjct: 1272 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1329

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S IHDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++D
Sbjct: 1330 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1389

Query: 864  ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
            ATK G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIP
Sbjct: 1390 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIP 1449

Query: 924  CYCGSKKCHGSLN 936
            C C S+ C G+LN
Sbjct: 1450 CLCNSENCRGTLN 1462


>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
           yFS275]
 gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
           yFS275]
          Length = 977

 Score =  207 bits (528), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 142/393 (36%), Positives = 201/393 (51%), Gaps = 62/393 (15%)

Query: 563 TAESSASAGSSKVKSKLPSGYSSAKSTISQKVMKVTSAVQRDKVPVPKPSGEMLSTLSAD 622
           T  S ++  + +V++K+ +   + KSTIS+ ++          +PV +      +T S D
Sbjct: 628 TEVSESALKAERVETKIEATTVTIKSTISESIVPA--------IPVKQ------NTWSED 673

Query: 623 GNDVGKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLELHATKALKVAKG 682
           G     V  G   N   E + +LD    +           +   D  +L   + + VA  
Sbjct: 674 GF----VFHGTLPNTYPEDEVLLDLDGFQ-----------QIVKDDEDLQYLREV-VAAE 717

Query: 683 TAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGA 742
            +K+         + K SK R  +  P +        +  E+HK +   S    AR  G 
Sbjct: 718 ESKELGDTNYWAYRRKESKIRNGDAEPSNLVLP---FNKNEYHKPNSTGS----ARTEGY 770

Query: 743 QYVHTKYLGPEVNASQW-----------ANGKG--LSARTNRVKLRNLLAAAEGA----- 784
                 Y+ P    SQ+           AN     +++R NRV  R L A  E +     
Sbjct: 771 ------YMVPAAEKSQYLPLRNRVSLEKANTSSSRITSRMNRVNNRRLAAGVEKSNLNSE 824

Query: 785 -ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            +LL+ + +KARKK+LRF  S+IH  GL A+E I+  D VIEYVGE++R +++D RE +Y
Sbjct: 825 TDLLRFNALKARKKQLRFGPSRIHTLGLFAMENIDKNDMVIEYVGEIVRQRVADTRERKY 884

Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            + GIG SYLFR+D   +VDATK+G IARFINHSC PNC  K+I VEG +KI IYA R I
Sbjct: 885 VREGIGDSYLFRIDKDAIVDATKKGNIARFINHSCAPNCIAKIIRVEGHQKIVIYADRDI 944

Query: 904 AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             GEE+TY+YKFP E  KIPC CG+  C G LN
Sbjct: 945 EEGEELTYDYKFPEEVDKIPCLCGAPTCRGYLN 977


>gi|402221447|gb|EJU01516.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
          Length = 164

 Score =  207 bits (528), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 96/158 (60%), Positives = 119/158 (75%)

Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
            AA  +EL   +Q++ARKK L+F RS IHDWGL ALE I A D VIEYVGE++R +++D 
Sbjct: 7   GAAPSSELFAFNQLRARKKELKFARSPIHDWGLYALEYIPAGDMVIEYVGEVVRQQVADK 66

Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
           RE  YE+ GIGSSYLFR+DD  VVDAT +G I R INHSC+PNC  ++I++   KKI IY
Sbjct: 67  REKVYERQGIGSSYLFRIDDDLVVDATMKGNIGRLINHSCSPNCTARIITINSSKKIVIY 126

Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           AK  I  GEEITY+Y FP+E++KIPC CGS+KC G LN
Sbjct: 127 AKTPIEPGEEITYDYHFPIEQEKIPCLCGSEKCRGFLN 164


>gi|444724926|gb|ELW65512.1| Histone-lysine N-methyltransferase SETD1B [Tupaia chinensis]
          Length = 1554

 Score =  207 bits (527), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 99/153 (64%), Positives = 125/153 (81%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            ++LLK +Q+K RKK+L+F +S IHDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RY
Sbjct: 1402 SDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRY 1461

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E  GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI
Sbjct: 1462 EDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHI 1521

Query: 904  AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +  EEITY+YKFP+E+ KIPC CGS+ C G+LN
Sbjct: 1522 SVNEEITYDYKFPIEDIKIPCLCGSENCRGTLN 1554


>gi|449477606|ref|XP_002188016.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
           [Taeniopygia guttata]
          Length = 228

 Score =  207 bits (527), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 109/189 (57%), Positives = 140/189 (74%), Gaps = 5/189 (2%)

Query: 751 GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
           G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IH
Sbjct: 42  GMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIH 99

Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
           DWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK 
Sbjct: 100 DWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 159

Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
           G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CG
Sbjct: 160 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCG 219

Query: 928 SKKCHGSLN 936
           S+ C G+LN
Sbjct: 220 SENCRGTLN 228


>gi|299746032|ref|XP_002910994.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
 gi|298406870|gb|EFI27500.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
          Length = 1614

 Score =  207 bits (527), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 99/171 (57%), Positives = 125/171 (73%), Gaps = 4/171 (2%)

Query: 770  NRVKLRNLLAAAEGA--EL-LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            N+V+    L+  E A  EL  K +Q++ RKK LRF RS IHDWGL A+E I   + VIEY
Sbjct: 1444 NQVQRAVALSKGETAASELTFKFNQLQTRKKHLRFSRSPIHDWGLYAMEKISKGEMVIEY 1503

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            VGE+IR +++D RE  YEK GIGSSYLFR+D+ +VVDATK+G + R INHSC+PNC  K+
Sbjct: 1504 VGEVIRAQVADKREKTYEKQGIGSSYLFRIDEEFVVDATKKGNLGRLINHSCDPNCTAKI 1563

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE-KKIPCYCGSKKCHGSLN 936
            I++ G KKI IYAK+ I  GEEITY+Y FP+E+  KIPC CGS +C G LN
Sbjct: 1564 ITISGVKKIVIYAKQDIELGEEITYDYHFPIEQDNKIPCLCGSARCRGYLN 1614


>gi|19075312|ref|NP_587812.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe
           972h-]
 gi|74698592|sp|Q9Y7R4.1|SET1_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
           specific; AltName: Full=COMPASS component set1; AltName:
           Full=Lysine N-methyltransferase 2; AltName: Full=SET
           domain-containing protein 1; AltName: Full=Set1 complex
           component set1; Short=Set1C component set1; AltName:
           Full=Spset1
 gi|4704279|emb|CAB41652.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe]
          Length = 920

 Score =  206 bits (523), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 97/178 (54%), Positives = 125/178 (70%), Gaps = 6/178 (3%)

Query: 765 LSARTNRVKLRNLLAAAEG------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIE 818
           +++R NRV  R L A  E       A+LL+ + +KARKK+L F  S+IH  GL A+E I+
Sbjct: 743 ITSRMNRVNNRRLAAGVEKSQLPAEADLLRFNALKARKKQLHFGPSRIHTLGLFAMENID 802

Query: 819 AEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
             D VIEY+GE+IR +++D RE  Y + GIG SYLFR+D+  +VDATK+G IARFINHSC
Sbjct: 803 KNDMVIEYIGEIIRQRVADNREKNYVREGIGDSYLFRIDEDVIVDATKKGNIARFINHSC 862

Query: 879 NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            PNC  ++I VEG++KI IYA R I  GEE+TY+YKFP E  KIPC CG+  C G LN
Sbjct: 863 APNCIARIIRVEGKRKIVIYADRDIMHGEELTYDYKFPEEADKIPCLCGAPTCRGYLN 920


>gi|328858772|gb|EGG07883.1| hypothetical protein MELLADRAFT_74594 [Melampsora larici-populina
           98AG31]
          Length = 191

 Score =  206 bits (523), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 92/153 (60%), Positives = 117/153 (76%)

Query: 784 AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
           A++LK +Q++ RKK+L+F RS IHDWGL A+E I   + VIEYVGE+IR  ++D RE  Y
Sbjct: 39  ADVLKFNQLRTRKKQLKFARSPIHDWGLYAMEMIPVGEMVIEYVGEVIRQAVADRREKAY 98

Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
           E+MGIGSSYLFR+DD  VVDATK+G + R INH C PNC  K+I++ G+KKI IYAK  I
Sbjct: 99  ERMGIGSSYLFRVDDDLVVDATKKGNLGRLINHCCAPNCTAKIITINGEKKIVIYAKATI 158

Query: 904 AAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             G+E+TY+Y FP E+ KIPC CGS KC G+LN
Sbjct: 159 ELGDEVTYDYHFPKEDVKIPCLCGSSKCKGTLN 191


>gi|355718741|gb|AES06369.1| SET domain containing 1B [Mustela putorius furo]
          Length = 359

 Score =  205 bits (521), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 109/192 (56%), Positives = 140/192 (72%), Gaps = 5/192 (2%)

Query: 747 TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
           T   G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +
Sbjct: 170 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 227

Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
           S IHDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++D
Sbjct: 228 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 287

Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
           ATK G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIP
Sbjct: 288 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIP 347

Query: 924 CYCGSKKCHGSL 935
           C CGS+ C G+L
Sbjct: 348 CLCGSENCRGTL 359


>gi|320593249|gb|EFX05658.1| set domain containing protein [Grosmannia clavigera kw1407]
          Length = 1450

 Score =  205 bits (521), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 99/188 (52%), Positives = 127/188 (67%), Gaps = 12/188 (6%)

Query: 760  ANGKGLSARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGL 811
            A G+G S R NRV  R  +A          + A+ L+ +Q+K RKK ++F RS IH+WGL
Sbjct: 1264 ATGRGTS-RANRVNQRRYVADLNDQKRTLGQDADALRFNQLKKRKKTVKFARSAIHNWGL 1322

Query: 812  VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
             A+E I  +D +IEYVG+ +R  ++ IRE  Y + GIGSSYLFR+DDG V+DATK+GGIA
Sbjct: 1323 YAMESIPKDDMIIEYVGQEVRQSVATIREKAYIRAGIGSSYLFRIDDGTVIDATKKGGIA 1382

Query: 872  RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGS 928
            RFINHSC PNC  K+I VEG K+I IYA R I   EE+TY+YKF  E   E ++PC CG+
Sbjct: 1383 RFINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEPEDNPEDRVPCLCGT 1442

Query: 929  KKCHGSLN 936
              C G LN
Sbjct: 1443 TACKGFLN 1450


>gi|296805347|ref|XP_002843498.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
 gi|238844800|gb|EEQ34462.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
          Length = 1344

 Score =  205 bits (521), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 102/189 (53%), Positives = 129/189 (68%), Gaps = 16/189 (8%)

Query: 763  KGLSARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWG 810
            K  S+R+NRV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WG
Sbjct: 1157 KSTSSRSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWG 1215

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK GGI
Sbjct: 1216 LYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGI 1275

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCG 927
            ARFINHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CG
Sbjct: 1276 ARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCG 1335

Query: 928  SKKCHGSLN 936
            S  C G LN
Sbjct: 1336 STGCKGFLN 1344


>gi|302682536|ref|XP_003030949.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
 gi|300104641|gb|EFI96046.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
          Length = 171

 Score =  205 bits (521), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 93/156 (59%), Positives = 116/156 (74%)

Query: 781 AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE 840
           A G    K +Q++ RKK LRF RS IHDWGL A+E I   + VIEYVGE+IR +++D RE
Sbjct: 16  AAGELSFKFNQLQTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQVADKRE 75

Query: 841 CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
             YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  K+I++ G+KKI IYAK
Sbjct: 76  ATYERQGIGSSYLFRIDEEIVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAK 135

Query: 901 RHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           R I  G+EITY+Y FP E+ KIPC CG+ KC G LN
Sbjct: 136 RDIELGDEITYDYHFPFEQDKIPCLCGTAKCRGFLN 171


>gi|350294046|gb|EGZ75131.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Neurospora
            tetrasperma FGSC 2509]
          Length = 1313

 Score =  204 bits (520), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NRV  R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1132 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1191

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1192 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1251

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1252 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1311

Query: 935  LN 936
            LN
Sbjct: 1312 LN 1313


>gi|336472713|gb|EGO60873.1| hypothetical protein NEUTE1DRAFT_144212 [Neurospora tetrasperma FGSC
            2508]
          Length = 1282

 Score =  204 bits (520), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NRV  R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1101 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1160

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1161 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1220

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1221 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1280

Query: 935  LN 936
            LN
Sbjct: 1281 LN 1282


>gi|74697791|sp|Q8X0S9.1|SET1_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|18376303|emb|CAD21415.1| related to regulatory protein SET1 [Neurospora crassa]
          Length = 1313

 Score =  204 bits (520), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NRV  R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1132 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1191

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1192 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1251

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1252 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1311

Query: 935  LN 936
            LN
Sbjct: 1312 LN 1313


>gi|336259450|ref|XP_003344526.1| hypothetical protein SMAC_07534 [Sordaria macrospora k-hell]
 gi|380093240|emb|CCC08898.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1314

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NRV  R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1133 NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1192

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1193 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1252

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1253 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1312

Query: 935  LN 936
            LN
Sbjct: 1313 LN 1314


>gi|353243391|emb|CCA74938.1| related to regulatory protein SET1 [Piriformospora indica DSM 11827]
          Length = 1224

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 94/161 (58%), Positives = 121/161 (75%), Gaps = 1/161 (0%)

Query: 776  NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
            + LAAA     +K +Q+++RKK+L+F RS IHDWGL ALE I   + VIEYVGE+IR ++
Sbjct: 1065 DTLAAAPEV-TIKFNQLQSRKKQLQFARSPIHDWGLYALERIPKGEMVIEYVGEVIRQQV 1123

Query: 836  SDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKI 895
            ++ RE  YE+ GIGSSYLFR+DD  VVDATK G + R INHSC+PNC  K+I++ GQKKI
Sbjct: 1124 AEKRERAYERSGIGSSYLFRIDDDLVVDATKIGNLGRLINHSCDPNCTAKIITIGGQKKI 1183

Query: 896  FIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             IYAK  I  G+E+TY+Y FP+E +KIPC CG+ KC G LN
Sbjct: 1184 VIYAKVDIHPGDEVTYDYHFPIENEKIPCLCGAAKCRGFLN 1224


>gi|389746109|gb|EIM87289.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 191

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 102/185 (55%), Positives = 131/185 (70%), Gaps = 6/185 (3%)

Query: 755 NASQWANGKGLSARTNRVKLRNLLAAAE--GAEL-LKASQVKARKKRLRFQRSKIHDWGL 811
           NA + A G       N+V+    L+  E   AEL +K +Q++ RKK LRF RS IHDWGL
Sbjct: 10  NARRQAQG---LEEINQVQRAMALSKGETAAAELTIKFNQLQTRKKHLRFARSPIHDWGL 66

Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
            A+E I   + VIEYVGE+IR +I+D RE  YE+ GIGSSYLFR+D+  VVDATK+G + 
Sbjct: 67  YAMERIARGEMVIEYVGEVIRAQIADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLG 126

Query: 872 RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKC 931
           R INHSC+PNC  K+I++ G+KKI IYAK+ I  G+EITY+Y FP+E+ KIPC CGS +C
Sbjct: 127 RLINHSCDPNCTAKIITILGEKKIVIYAKQDIELGDEITYDYHFPIEQDKIPCLCGSARC 186

Query: 932 HGSLN 936
            G LN
Sbjct: 187 RGYLN 191


>gi|345567899|gb|EGX50801.1| hypothetical protein AOL_s00054g887 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1338

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 98/187 (52%), Positives = 130/187 (69%), Gaps = 5/187 (2%)

Query: 753  EVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLV 812
            + + S  AN + L+A  N  K    LAA +  +++  +Q+K RKK +RF RS IH+WGL 
Sbjct: 1154 DTSRSNRANNRRLAAEVNLQK--QTLAAVDVHDVINFNQLKKRKKPVRFARSAIHNWGLY 1211

Query: 813  ALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIAR 872
            A+E I   + +IEYVGE++R +++D+RE  Y + GIGSSYLFR+D+  V+DATK+GGIAR
Sbjct: 1212 AMENISNNEMIIEYVGEIVRQQVADLREKNYLRSGIGSSYLFRIDETTVIDATKKGGIAR 1271

Query: 873  FINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSK 929
            FINHSC PNC  K+I VEG K+I IYA R I   EE+TY+YKF  E   E++IPC CGS 
Sbjct: 1272 FINHSCTPNCTAKIIKVEGTKRIVIYALRDIHKDEELTYDYKFEREIDSEERIPCLCGSS 1331

Query: 930  KCHGSLN 936
             C G LN
Sbjct: 1332 GCKGFLN 1338


>gi|408391029|gb|EKJ70413.1| hypothetical protein FPSE_09407 [Fusarium pseudograminearum CS3096]
          Length = 1263

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 98/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1082 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1141

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1142 AKDDMIIEYVGEQVRQQISEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1201

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1202 CMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1261

Query: 935  LN 936
            LN
Sbjct: 1262 LN 1263


>gi|410516926|sp|Q4I5R3.2|SET1_GIBZE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
          Length = 1263

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 98/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1082 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1141

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1142 AKDDMIIEYVGEQVRQQISEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1201

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1202 CMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1261

Query: 935  LN 936
            LN
Sbjct: 1262 LN 1263


>gi|164426120|ref|XP_961572.2| hypothetical protein NCU01206 [Neurospora crassa OR74A]
 gi|157071206|gb|EAA32336.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1150

 Score =  204 bits (518), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NRV  R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 969  NSRANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1028

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1029 NKDDMIIEYVGEEVRQQIAELREARYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1088

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1089 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1148

Query: 935  LN 936
            LN
Sbjct: 1149 LN 1150


>gi|170591502|ref|XP_001900509.1| SET domain containing protein [Brugia malayi]
 gi|158592121|gb|EDP30723.1| SET domain containing protein [Brugia malayi]
          Length = 1056

 Score =  203 bits (517), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 105/234 (44%), Positives = 146/234 (62%), Gaps = 16/234 (6%)

Query: 707  LCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGL- 765
            + P S GCAR+      ++K S      +R  +R      T     E+N       + L 
Sbjct: 835  VVPHSSGCARTE----GYYKLSYKE---KRGVLRRPDVFLT-----EINERDDEKARYLV 882

Query: 766  -SARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
             S R  R   R LL +     +++ K +Q+K RKK ++F RS+IH WGL ALE I  ++ 
Sbjct: 883  QSTREARSMNRRLLTSMGDTSSDIFKVNQLKFRKKLIKFARSRIHGWGLYALEAIAPDEM 942

Query: 823  VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
            ++EY+G+ IR  ++D RE RYE+ G+GSSYLFR+D   V+DAT+ G +ARFINHSC PNC
Sbjct: 943  IVEYIGQKIRPTVADEREKRYERRGMGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNC 1002

Query: 883  YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            Y K++ V+G+K+I IY+K  I  G+EITY+YKFP+EE KI C CG+  C GSLN
Sbjct: 1003 YAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 1056


>gi|390605099|gb|EIN14490.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 164

 Score =  203 bits (517), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 94/155 (60%), Positives = 119/155 (76%), Gaps = 1/155 (0%)

Query: 783 GAEL-LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIREC 841
            AEL +K +Q++ RKK LRF RS IHDWGL A+E I   + VIEYVGE+IR +++D RE 
Sbjct: 10  AAELTIKFNQLQTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQVADKREK 69

Query: 842 RYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKR 901
            YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  K+I++ G+KKI IYAK+
Sbjct: 70  AYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQ 129

Query: 902 HIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I  G+EITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 130 DIELGDEITYDYHFPIEQDKIPCLCGSAKCRGYLN 164


>gi|17552320|ref|NP_498039.1| Protein SET-2, isoform c [Caenorhabditis elegans]
 gi|351058302|emb|CCD65736.1| Protein SET-2, isoform c [Caenorhabditis elegans]
          Length = 1510

 Score =  203 bits (517), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 91/174 (52%), Positives = 127/174 (72%), Gaps = 2/174 (1%)

Query: 765  LSARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
            L+++  R+  R LL +   A  +  K +Q+K RKK ++F RS+IH WGL A+E I  ++ 
Sbjct: 1337 LASKDMRLLQRRLLTSLGDANNDFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEM 1396

Query: 823  VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
            ++EY+G+ IR  +++ RE  YE+ GIGSSYLFR+D  +V+DATKRG  ARFINHSC PNC
Sbjct: 1397 IVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNC 1456

Query: 883  YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            Y KV+++EG+K+I IY++  I  GEEITY+YKFP+E+ KI C CG+K C G LN
Sbjct: 1457 YAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1510


>gi|17552318|ref|NP_498040.1| Protein SET-2, isoform a [Caenorhabditis elegans]
 gi|30173238|sp|Q18221.2|SET2_CAEEL RecName: Full=Probable histone-lysine N-methyltransferase set-2;
            AltName: Full=SET domain-containing protein 2
 gi|351058300|emb|CCD65734.1| Protein SET-2, isoform a [Caenorhabditis elegans]
          Length = 1507

 Score =  203 bits (517), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 91/174 (52%), Positives = 127/174 (72%), Gaps = 2/174 (1%)

Query: 765  LSARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
            L+++  R+  R LL +   A  +  K +Q+K RKK ++F RS+IH WGL A+E I  ++ 
Sbjct: 1334 LASKDMRLLQRRLLTSLGDANNDFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEM 1393

Query: 823  VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
            ++EY+G+ IR  +++ RE  YE+ GIGSSYLFR+D  +V+DATKRG  ARFINHSC PNC
Sbjct: 1394 IVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNC 1453

Query: 883  YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            Y KV+++EG+K+I IY++  I  GEEITY+YKFP+E+ KI C CG+K C G LN
Sbjct: 1454 YAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1507


>gi|327304525|ref|XP_003236954.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
 gi|326459952|gb|EGD85405.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
          Length = 1337

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 101/186 (54%), Positives = 128/186 (68%), Gaps = 16/186 (8%)

Query: 766  SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            S+R+NRV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1153 SSRSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1211

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK GGIARF
Sbjct: 1212 EENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARF 1271

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
            INHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  
Sbjct: 1272 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1331

Query: 931  CHGSLN 936
            C G LN
Sbjct: 1332 CKGFLN 1337


>gi|302666919|ref|XP_003025054.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
 gi|291189136|gb|EFE44443.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
          Length = 1376

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 101/186 (54%), Positives = 128/186 (68%), Gaps = 16/186 (8%)

Query: 766  SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            S+R+NRV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1192 SSRSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1250

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK GGIARF
Sbjct: 1251 EENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARF 1310

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
            INHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  
Sbjct: 1311 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1370

Query: 931  CHGSLN 936
            C G LN
Sbjct: 1371 CKGFLN 1376


>gi|315045626|ref|XP_003172188.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
 gi|311342574|gb|EFR01777.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
          Length = 1334

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 101/186 (54%), Positives = 128/186 (68%), Gaps = 16/186 (8%)

Query: 766  SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            S+R+NRV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1150 SSRSNRVNNRRLVADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1208

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK GGIARF
Sbjct: 1209 EENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARF 1268

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
            INHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  
Sbjct: 1269 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1328

Query: 931  CHGSLN 936
            C G LN
Sbjct: 1329 CKGFLN 1334


>gi|400596097|gb|EJP63881.1| histone H3 methyltransferase complex protein [Beauveria bassiana
            ARSEF 2860]
          Length = 1220

 Score =  203 bits (516), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 117/254 (46%), Positives = 151/254 (59%), Gaps = 30/254 (11%)

Query: 708  CPRSDGCAR---------SSISGWEWHKWSLNASPAER-ARVR-GAQYVHTKYLGPEVNA 756
             P   GCAR         S  S +  H   +     ER ARV+ G   V T      V+A
Sbjct: 972  VPNPTGCARTEGVKKILNSEKSKYLPHHIKVQKVREEREARVKNGKDSVAT-----SVDA 1026

Query: 757  SQWANGKGLS---ARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSK 805
            S+ A  K ++   +R NR   R  +A          + +++ K +Q+K RKK ++F RS 
Sbjct: 1027 SKIAADKLVAKGNSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSA 1086

Query: 806  IHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDAT 865
            IH+WGL  +E I  +D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+D+  V+DAT
Sbjct: 1087 IHNWGLYTMEDIHKDDMIIEYVGEEVRQQISEIRENRYLKSGIGSSYLFRIDENTVIDAT 1146

Query: 866  KRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKI 922
            K+GGIARFINHSC PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +I
Sbjct: 1147 KKGGIARFINHSCLPNCTAKIIKVEGSKRIVIYALREIAMNEELTYDYKFEREIGSLDRI 1206

Query: 923  PCYCGSKKCHGSLN 936
            PC CG+  C G LN
Sbjct: 1207 PCLCGTAACKGFLN 1220


>gi|367037743|ref|XP_003649252.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
            NRRL 8126]
 gi|346996513|gb|AEO62916.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
            NRRL 8126]
          Length = 1286

 Score =  202 bits (515), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 126/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1105 NSRANRANNRRFVADLNDQRKTLGQDSDVLRFNQLKKRKKPVKFDRSAIHNWGLYAMENI 1164

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1165 PKDDMIIEYVGEEVRQQIAEIREHRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1224

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1225 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLCGTAACKGF 1284

Query: 935  LN 936
            LN
Sbjct: 1285 LN 1286


>gi|407921620|gb|EKG14761.1| hypothetical protein MPH_08036 [Macrophomina phaseolina MS6]
          Length = 1167

 Score =  202 bits (515), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 98/185 (52%), Positives = 130/185 (70%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLL---------AAAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+NR   R L+          AA+G++   ++ +Q+K RKK ++F RS IH+WGL A 
Sbjct: 983  TSRSNRANNRRLVNEISTQKQALAADGSDNIAIRFNQLKKRKKLVKFDRSAIHNWGLYAQ 1042

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R K++DIRE +Y+K G+GSSYLFR+D+  VVDATK+GGIARFI
Sbjct: 1043 EDISANDMIIEYVGEKVRQKVADIREIKYDKQGVGSSYLFRIDEDSVVDATKKGGIARFI 1102

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC+PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1103 NHSCSPNCTAKIIRVDGTKRIVIYALRDIKTNEELTYDYKFEREIGSDDRIPCLCGSVNC 1162

Query: 932  HGSLN 936
             G LN
Sbjct: 1163 KGFLN 1167


>gi|444318419|ref|XP_004179867.1| hypothetical protein TBLA_0C05500 [Tetrapisispora blattae CBS 6284]
 gi|387512908|emb|CCH60348.1| hypothetical protein TBLA_0C05500 [Tetrapisispora blattae CBS 6284]
          Length = 1216

 Score =  202 bits (515), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 95/161 (59%), Positives = 117/161 (72%), Gaps = 3/161 (1%)

Query: 779  AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
            A    +ELL  +Q+  RKK + F RS IH+WGL ALEPI A++ VIEYVGE IR  +S++
Sbjct: 1056 AIGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPITAKEMVIEYVGERIRQPVSEM 1115

Query: 839  RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
            RE RY K GIGSSYLFR+D+  V+DATKRGGIARFINHSC P+C  K+I V G K+I IY
Sbjct: 1116 REKRYIKGGIGSSYLFRIDENTVIDATKRGGIARFINHSCEPSCTAKIIKVGGMKRIVIY 1175

Query: 899  AKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            A R I   EE+TY+YKF  E   E+++PCYCG+  C G LN
Sbjct: 1176 ALRDIGLNEELTYDYKFEREIDAEERLPCYCGAPSCKGFLN 1216


>gi|25395700|pir||H88444 protein C26E6.12 [imported] - Caenorhabditis elegans
          Length = 1802

 Score =  202 bits (515), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 91/174 (52%), Positives = 127/174 (72%), Gaps = 2/174 (1%)

Query: 765  LSARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
            L+++  R+  R LL +   A  +  K +Q+K RKK ++F RS+IH WGL A+E I  ++ 
Sbjct: 1629 LASKDMRLLQRRLLTSLGDANNDFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEM 1688

Query: 823  VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
            ++EY+G+ IR  +++ RE  YE+ GIGSSYLFR+D  +V+DATKRG  ARFINHSC PNC
Sbjct: 1689 IVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNC 1748

Query: 883  YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            Y KV+++EG+K+I IY++  I  GEEITY+YKFP+E+ KI C CG+K C G LN
Sbjct: 1749 YAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1802


>gi|317037780|ref|XP_001399137.2| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
            niger CBS 513.88]
          Length = 1239

 Score =  202 bits (514), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 100/186 (53%), Positives = 128/186 (68%), Gaps = 16/186 (8%)

Query: 766  SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            ++R+ RV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1055 TSRSTRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1113

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARF
Sbjct: 1114 EENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARF 1173

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
            INHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  
Sbjct: 1174 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1233

Query: 931  CHGSLN 936
            C G LN
Sbjct: 1234 CKGFLN 1239


>gi|261201264|ref|XP_002627032.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
            SLH14081]
 gi|239592091|gb|EEQ74672.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
            SLH14081]
 gi|239611745|gb|EEQ88732.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
          Length = 1259

 Score =  202 bits (514), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 100/185 (54%), Positives = 127/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLA---------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A           +G E  +L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1075 TSRSTRVNNRRLIADINAQKQALPMQGGEGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1134

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1135 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1194

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1195 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1254

Query: 932  HGSLN 936
             G LN
Sbjct: 1255 KGFLN 1259


>gi|350630881|gb|EHA19253.1| hypothetical protein ASPNIDRAFT_56859 [Aspergillus niger ATCC 1015]
          Length = 1101

 Score =  202 bits (514), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 100/186 (53%), Positives = 128/186 (68%), Gaps = 16/186 (8%)

Query: 766  SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            ++R+ RV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 917  TSRSTRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 975

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARF
Sbjct: 976  EENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARF 1035

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
            INHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  
Sbjct: 1036 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTG 1095

Query: 931  CHGSLN 936
            C G LN
Sbjct: 1096 CKGFLN 1101


>gi|134084734|emb|CAK43391.1| unnamed protein product [Aspergillus niger]
          Length = 1079

 Score =  202 bits (514), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 111/261 (42%), Positives = 148/261 (56%), Gaps = 43/261 (16%)

Query: 718  SISGWEWHKWSLNA--SPAERARVRGAQYVHTKYLGPEVNASQWANGKGL---------- 765
            ++S W W +  + A   P E   VR    +   Y+     A++    K +          
Sbjct: 820  NLSAWAWRQKEIKALNRPEESGPVREETIIPGYYVSNTTGAARTEGRKRILESEKSKYLP 879

Query: 766  ---------------SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKR 798
                           ++R+ RV  R L+A            + EG ++L+ +Q+K RKK 
Sbjct: 880  HRIKVQKAPKTISKSTSRSTRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKP 938

Query: 799  LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
            +RF RS IH+WGL A E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+
Sbjct: 939  VRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDE 998

Query: 859  GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE 918
              V+DATKRGGIARFINHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E
Sbjct: 999  NTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFERE 1058

Query: 919  ---EKKIPCYCGSKKCHGSLN 936
               + +IPC CGS  C G LN
Sbjct: 1059 WDSDDRIPCLCGSTGCKGFLN 1079


>gi|358401203|gb|EHK50509.1| hypothetical protein TRIATDRAFT_171650, partial [Trichoderma
            atroviride IMI 206040]
          Length = 1241

 Score =  202 bits (513), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1060 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1119

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1120 NKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1179

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1180 CMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLDRIPCLCGTAACKGF 1239

Query: 935  LN 936
            LN
Sbjct: 1240 LN 1241


>gi|367024877|ref|XP_003661723.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
 gi|347008991|gb|AEO56478.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
          Length = 1260

 Score =  202 bits (513), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 96/182 (52%), Positives = 126/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1079 NSRANRANNRRFVADLNDQRKTLGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1138

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1139 PKDDMIIEYVGEEVRQQIAELREHRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1198

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1199 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLCGTAACKGF 1258

Query: 935  LN 936
            LN
Sbjct: 1259 LN 1260


>gi|255938628|ref|XP_002560084.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584705|emb|CAP74231.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1202

 Score =  202 bits (513), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 100/185 (54%), Positives = 127/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLA----------AAEG-AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A           A G  ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1018 TSRSTRVNNRRLIADINAQKQALPTANGEGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1077

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1078 ENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1137

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1138 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1197

Query: 932  HGSLN 936
             G LN
Sbjct: 1198 KGFLN 1202


>gi|212543321|ref|XP_002151815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210066722|gb|EEA20815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 1188

 Score =  202 bits (513), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 100/185 (54%), Positives = 127/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLA---------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV+ R L A           +G +   L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1004 TSRSTRVQNRRLAADINAQKQVLPPQGGDGDALRFNQLKKRKKPVRFARSAIHNWGLYAE 1063

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1064 ENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENAVIDATKRGGIARFI 1123

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I+  EE+TY+YKF  E   E +IPC CGS  C
Sbjct: 1124 NHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDSEDRIPCLCGSAGC 1183

Query: 932  HGSLN 936
             G LN
Sbjct: 1184 KGFLN 1188


>gi|358389897|gb|EHK27489.1| hypothetical protein TRIVIDRAFT_34353 [Trichoderma virens Gv29-8]
          Length = 1221

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1040 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1099

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1100 NKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1159

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1160 CMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLDRIPCLCGTAACKGF 1219

Query: 935  LN 936
            LN
Sbjct: 1220 LN 1221


>gi|17552316|ref|NP_498041.1| Protein SET-2, isoform b [Caenorhabditis elegans]
 gi|351058301|emb|CCD65735.1| Protein SET-2, isoform b [Caenorhabditis elegans]
          Length = 739

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 91/174 (52%), Positives = 127/174 (72%), Gaps = 2/174 (1%)

Query: 765 LSARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
           L+++  R+  R LL +   A  +  K +Q+K RKK ++F RS+IH WGL A+E I  ++ 
Sbjct: 566 LASKDMRLLQRRLLTSLGDANNDFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEM 625

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           ++EY+G+ IR  +++ RE  YE+ GIGSSYLFR+D  +V+DATKRG  ARFINHSC PNC
Sbjct: 626 IVEYIGQTIRSLVAEEREKAYERRGIGSSYLFRIDLHHVIDATKRGNFARFINHSCQPNC 685

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y KV+++EG+K+I IY++  I  GEEITY+YKFP+E+ KI C CG+K C G LN
Sbjct: 686 YAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 739


>gi|346322948|gb|EGX92546.1| histone-lysine N-methyltransferase [Cordyceps militaris CM01]
          Length = 1151

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 114/251 (45%), Positives = 149/251 (59%), Gaps = 24/251 (9%)

Query: 708  CPRSDGCAR---------SSISGWEWHKWSLNASPAERA-RVR-GAQYVHTKYLGPEVNA 756
             P   GCAR         S  S +  H   +     ERA RV+ G   V+T     ++ A
Sbjct: 903  VPNPTGCARTEGVKKILNSEKSKYLPHHIKVQKVREERAARVKNGKDGVNTTMDASKIAA 962

Query: 757  SQWANGKGLSARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHD 808
             +    KG ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+
Sbjct: 963  EKMI-AKG-NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHN 1020

Query: 809  WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
            WGL  +E I  +D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+D+  V+DATK+G
Sbjct: 1021 WGLYTMEDIHKDDMIIEYVGEEVRQQISEIRENRYLKSGIGSSYLFRIDENTVIDATKKG 1080

Query: 869  GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCY 925
            GIARFINHSC PNC  K+I VEG K+I IYA R I   EE+TY+YKF  E     +IPC 
Sbjct: 1081 GIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDITTNEELTYDYKFEREIGSLDRIPCL 1140

Query: 926  CGSKKCHGSLN 936
            CG+  C G LN
Sbjct: 1141 CGTAACKGFLN 1151


>gi|340514680|gb|EGR44940.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1236

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1055 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1114

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1115 NKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1174

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1175 CMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLDRIPCLCGTAACKGF 1234

Query: 935  LN 936
            LN
Sbjct: 1235 LN 1236


>gi|358373521|dbj|GAA90119.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 1239

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 98/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1055 TSRSTRVNNRRLIADINAQKQALPTQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1114

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1115 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1174

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1175 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1234

Query: 932  HGSLN 936
             G LN
Sbjct: 1235 KGFLN 1239


>gi|326472906|gb|EGD96915.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
            112818]
          Length = 1330

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 100/184 (54%), Positives = 126/184 (68%), Gaps = 16/184 (8%)

Query: 768  RTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALE 815
            R+NRV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WGL A E
Sbjct: 1148 RSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYAEE 1206

Query: 816  PIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFIN 875
             I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK GGIARFIN
Sbjct: 1207 NITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARFIN 1266

Query: 876  HSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCH 932
            HSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C 
Sbjct: 1267 HSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1326

Query: 933  GSLN 936
            G LN
Sbjct: 1327 GFLN 1330


>gi|342887802|gb|EGU87231.1| hypothetical protein FOXB_02213 [Fusarium oxysporum Fo5176]
          Length = 1258

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 99/187 (52%), Positives = 128/187 (68%), Gaps = 8/187 (4%)

Query: 758  QWANGKGLSAR-TNRVKLRNL----LAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLV 812
            Q AN      R TNR  +++L        + +++ K +Q+K RKK ++F RS IH+WGL 
Sbjct: 1072 QAANDSERQKRATNRRYVQDLNDQKKTLGQDSDIFKFNQLKKRKKPVKFARSAIHNWGLY 1131

Query: 813  ALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIAR 872
            A+E I  +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD  V+DATK+GGIAR
Sbjct: 1132 AMENIAKDDMIIEYVGEQVRQQIAEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIAR 1191

Query: 873  FINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSK 929
            FINHSC PNC  K+I VEG K+I IYA + IA  EE+TY+YKF  E     +IPC CG+ 
Sbjct: 1192 FINHSCEPNCTAKIIKVEGSKRIVIYALQDIAMSEELTYDYKFEREIGSLDRIPCLCGTA 1251

Query: 930  KCHGSLN 936
             C G LN
Sbjct: 1252 ACKGFLN 1258


>gi|326477398|gb|EGE01408.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS 127.97]
          Length = 1331

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 100/184 (54%), Positives = 126/184 (68%), Gaps = 16/184 (8%)

Query: 768  RTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALE 815
            R+NRV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WGL A E
Sbjct: 1149 RSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYAEE 1207

Query: 816  PIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFIN 875
             I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK GGIARFIN
Sbjct: 1208 NITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARFIN 1267

Query: 876  HSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCH 932
            HSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C 
Sbjct: 1268 HSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1327

Query: 933  GSLN 936
            G LN
Sbjct: 1328 GFLN 1331


>gi|341896007|gb|EGT51942.1| hypothetical protein CAEBREN_26218 [Caenorhabditis brenneri]
          Length = 1670

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 91/168 (54%), Positives = 123/168 (73%), Gaps = 2/168 (1%)

Query: 771  RVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG 828
            R+  R LL +   A  +  K + +K+RKK ++F RS+IH +GL A+E I  ++ +IEY+G
Sbjct: 1503 RLLQRRLLTSLGDANSDFFKINMLKSRKKMIKFARSRIHGYGLYAMETIAQDEMIIEYIG 1562

Query: 829  ELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
            + IR  ++D RE  YE+ GIGSSYLFR+D+  V+DATKRG  ARFINHSC PNCY KV++
Sbjct: 1563 QKIRSLVADEREKAYERRGIGSSYLFRIDEHTVIDATKRGNFARFINHSCQPNCYAKVLT 1622

Query: 889  VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +EG+K+I IY++  I  GEEITY+YKFP+EE KI C CG+K C G LN
Sbjct: 1623 IEGEKRIVIYSRSTINKGEEITYDYKFPIEEDKIDCLCGAKTCRGYLN 1670


>gi|225554361|gb|EEH02660.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
          Length = 1267

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 98/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1083 TSRSTRVNNRRLIADINAQKQALPMQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1142

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1143 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1202

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1203 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1262

Query: 932  HGSLN 936
             G LN
Sbjct: 1263 KGFLN 1267


>gi|154278862|ref|XP_001540244.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
 gi|150412187|gb|EDN07574.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
          Length = 1266

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 98/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1082 TSRSTRVNNRRLIADINAQKQALPMQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1141

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1142 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1201

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1202 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1261

Query: 932  HGSLN 936
             G LN
Sbjct: 1262 KGFLN 1266


>gi|325089235|gb|EGC42545.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
          Length = 1267

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 98/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1083 TSRSTRVNNRRLIADINAQKQALPMQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1142

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1143 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1202

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1203 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1262

Query: 932  HGSLN 936
             G LN
Sbjct: 1263 KGFLN 1267


>gi|358058803|dbj|GAA95766.1| hypothetical protein E5Q_02423 [Mixia osmundae IAM 14324]
          Length = 2083

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 100/183 (54%), Positives = 124/183 (67%), Gaps = 7/183 (3%)

Query: 753  EVNASQWANGKGLSARTNRVKL-----RNLLAAAEGAELLKASQVKARKKRLRFQRSKIH 807
            +  AS  A  +   A T R+ L     R   A   G++LLK +Q+K+RKK+L+F RS IH
Sbjct: 1339 QAPASSTAISRLARANTRRLVLDMEQQRKSAATEAGSDLLKFNQLKSRKKQLKFARSPIH 1398

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            DWGL A+E I A D +IEYVGELIR +++D RE  YEKMGIGSSYLFR+DD  VVDATK+
Sbjct: 1399 DWGLYAMEHIPARDMIIEYVGELIRQQVADKREKAYEKMGIGSSYLFRVDDDLVVDATKK 1458

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCY 925
            G  AR INH C PNC  ++I++ G KKI IYA   I  G+EITY+Y F  E  + KIPC 
Sbjct: 1459 GTYARLINHCCAPNCTARIITIGGHKKIVIYALTDIEPGDEITYDYHFATESDDLKIPCL 1518

Query: 926  CGS 928
            CGS
Sbjct: 1519 CGS 1521


>gi|242786320|ref|XP_002480782.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218720929|gb|EED20348.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 1155

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 100/185 (54%), Positives = 127/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLA---------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV+ R L A           +G +   L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 971  TSRSTRVQNRRLAADINAQKQVLPPQGGDGDALRFNQLKKRKKPVRFARSAIHNWGLYAE 1030

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1031 ENIAANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENAVIDATKRGGIARFI 1090

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I+  EE+TY+YKF  E   E +IPC CGS  C
Sbjct: 1091 NHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDSEDRIPCLCGSAGC 1150

Query: 932  HGSLN 936
             G LN
Sbjct: 1151 KGFLN 1155


>gi|366990781|ref|XP_003675158.1| hypothetical protein NCAS_0B07030 [Naumovozyma castellii CBS 4309]
 gi|342301022|emb|CCC68787.1| hypothetical protein NCAS_0B07030 [Naumovozyma castellii CBS 4309]
          Length = 1082

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 98/182 (53%), Positives = 125/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL ALEPI
Sbjct: 901  SSRDNRASNRRFQQNIEAQRAAIGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPI 960

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A + +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 961  AAREMIIEYVGESIRQPVAEMRERRYLKSGIGSSYLFRVDENNVIDATKKGGIARFINHC 1020

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+P+C  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PCYCGS  C G 
Sbjct: 1021 CDPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERELDAEERLPCYCGSVNCKGF 1080

Query: 935  LN 936
            LN
Sbjct: 1081 LN 1082


>gi|402081815|gb|EJT76960.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
            tritici R3-111a-1]
          Length = 1319

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 92/182 (50%), Positives = 125/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R ++A          + +++L+ +Q++ RKK ++F RS IH+WGL A+E I
Sbjct: 1138 TSRANRANHRRVVADLNDQKKTLGQDSDVLRFNQLRKRKKPVKFARSAIHNWGLYAMENI 1197

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R  ++ +RE RY K GIGS+YLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1198 PKDDMIIEYVGEEVRPSVAKVREARYLKSGIGSTYLFRIDDEAVIDATKKGGIARFINHS 1257

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R I   EE+TY+YKF  EE    ++PC CG+  C G 
Sbjct: 1258 CMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEPEEDQKDRVPCLCGTTACKGF 1317

Query: 935  LN 936
            LN
Sbjct: 1318 LN 1319


>gi|303313714|ref|XP_003066866.1| SET domain containing protein [Coccidioides posadasii C735 delta
            SOWgp]
 gi|240106533|gb|EER24721.1| SET domain containing protein [Coccidioides posadasii C735 delta
            SOWgp]
          Length = 1271

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 98/185 (52%), Positives = 126/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A               +++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1087 TSRSTRVNNRRLIADINAQKQALPMQNGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1146

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1147 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1206

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1207 NHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDDRIPCLCGSAGC 1266

Query: 932  HGSLN 936
             G LN
Sbjct: 1267 KGFLN 1271


>gi|320032561|gb|EFW14513.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
            Silveira]
          Length = 1271

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 98/185 (52%), Positives = 126/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A               +++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1087 TSRSTRVNNRRLIADINAQKQALPMQNGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1146

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1147 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1206

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1207 NHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDDRIPCLCGSAGC 1266

Query: 932  HGSLN 936
             G LN
Sbjct: 1267 KGFLN 1271


>gi|392590566|gb|EIW79895.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 160

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 88/150 (58%), Positives = 116/150 (77%)

Query: 787 LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
           +K +Q++ RKK LRF RS IHDWGL A+E +   + VIEYVGE++R +++D RE  YE+ 
Sbjct: 11  VKFNQLQTRKKHLRFARSPIHDWGLYAMERVSRGEMVIEYVGEVVRAQVADKREKAYERQ 70

Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
           GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  ++I++ G+KKI IYAK+ I  G
Sbjct: 71  GIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTARIITISGEKKIVIYAKQDIELG 130

Query: 907 EEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           +EITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 131 DEITYDYHFPIEQDKIPCLCGSAKCRGYLN 160


>gi|119185079|ref|XP_001243361.1| hypothetical protein CIMG_07257 [Coccidioides immitis RS]
 gi|121936913|sp|Q1DR06.1|SET1_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|392866240|gb|EAS28850.2| histone-lysine N-methyltransferase, H3 lysine-4 specific
            [Coccidioides immitis RS]
          Length = 1271

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 98/185 (52%), Positives = 126/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A               +++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1087 TSRSTRVNNRRLIADINAQKQALPMQNGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1146

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1147 ENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1206

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1207 NHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDDRIPCLCGSAGC 1266

Query: 932  HGSLN 936
             G LN
Sbjct: 1267 KGFLN 1271


>gi|363756170|ref|XP_003648301.1| hypothetical protein Ecym_8199 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356891501|gb|AET41484.1| Hypothetical protein Ecym_8199 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 995

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 127/182 (69%), Gaps = 11/182 (6%)

Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           S+R NR   R      E         +ELL  +Q+  RKK++ F RS IH+WGL ALEPI
Sbjct: 814 SSRVNRALQRRFQQDIEAQKAAIGTESELLSLNQLTKRKKQVTFARSPIHNWGLYALEPI 873

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ +IEYVGE+IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 874 AAKEMIIEYVGEMIRQPVAEMRENRYLKSGIGSSYLFRIDEYTVIDATKKGGIARFINHC 933

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
           C+P+C  K+I V G+K+I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 934 CDPSCTAKIIKVGGRKRIVIYALRDIAANEELTYDYKFEREVDDEERLPCLCGAATCKGF 993

Query: 935 LN 936
           LN
Sbjct: 994 LN 995


>gi|116199091|ref|XP_001225357.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
 gi|121922631|sp|Q2GWF3.1|SET1_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|88178980|gb|EAQ86448.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
          Length = 1076

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 96/182 (52%), Positives = 126/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 895  NSRANRANNRRFVADLNDQRKTLGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 954

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 955  PKDDMIIEYVGEEVRQQIAELRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1014

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1015 CMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLCGTAACKGF 1074

Query: 935  LN 936
            LN
Sbjct: 1075 LN 1076


>gi|327348240|gb|EGE77097.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
            18188]
          Length = 1280

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 99/185 (53%), Positives = 127/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLA---------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A           +G E  +L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1096 TSRSTRVNNRRLIADINAQKQALPMQGGEGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1155

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++++RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1156 ENISANDMIIEYVGEKVRQQVANMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1215

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1216 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1275

Query: 932  HGSLN 936
             G LN
Sbjct: 1276 KGFLN 1280


>gi|402593200|gb|EJW87127.1| SET domain-containing protein, partial [Wuchereria bancrofti]
          Length = 602

 Score =  200 bits (508), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 105/234 (44%), Positives = 146/234 (62%), Gaps = 16/234 (6%)

Query: 707 LCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGL- 765
           + P S GCAR+      ++K S      +R  +R      T     E+N       + L 
Sbjct: 381 VVPHSSGCARTE----GYYKLSYKE---KRGVLRRPDVFLT-----EINERDDEKARYLV 428

Query: 766 -SARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
            S R  R   R LL +     +++ K +Q+K RKK ++F RS+IH WGL ALE I  ++ 
Sbjct: 429 QSTREARSMNRRLLTSMGDTSSDIFKVNQLKFRKKLIKFARSRIHGWGLYALEVIAPDEM 488

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           ++EY+G+ IR  ++D RE RYE+ G+GSSYLFR+D   V+DAT+ G +ARFINHSC PNC
Sbjct: 489 IVEYIGQKIRPTVADEREKRYERRGMGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNC 548

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y K++ V+G+K+I IY+K  I  G+EITY+YKFP+EE KI C CG+  C GSLN
Sbjct: 549 YAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 602


>gi|393244480|gb|EJD51992.1| histone methyltransferase [Auricularia delicata TFB-10046 SS5]
          Length = 153

 Score =  199 bits (507), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 93/151 (61%), Positives = 115/151 (76%)

Query: 786 LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK 845
           L+K +Q++ RKK+LRF RS IHDWGL A+E I   + VIEYVGE++R  I+D RE  YE+
Sbjct: 3   LIKFNQLQTRKKQLRFARSPIHDWGLYAMERIARGEMVIEYVGEIVRAPIADRREKLYER 62

Query: 846 MGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAA 905
            GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  K+ISV G KKI IYAK+ I  
Sbjct: 63  QGIGSSYLFRIDEELVVDATKKGNLGRLINHSCDPNCTAKIISVNGVKKIVIYAKQDIEL 122

Query: 906 GEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           G+E+TY+Y FP EE KIPC CG+ KC G LN
Sbjct: 123 GDELTYDYHFPREEAKIPCLCGAAKCRGFLN 153


>gi|115400872|ref|XP_001216024.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
 gi|114189965|gb|EAU31665.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
          Length = 1230

 Score =  199 bits (506), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 97/185 (52%), Positives = 126/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A               +++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1046 TSRSTRVNNRRLIADINAQKQALPTQSGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1105

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1106 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1165

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1166 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1225

Query: 932  HGSLN 936
             G LN
Sbjct: 1226 KGFLN 1230


>gi|340959767|gb|EGS20948.1| hypothetical protein CTHT_0027870 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1295

 Score =  199 bits (506), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 126/182 (69%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1114 NSRANRAHNRRFVADLNDQRKTLGQDSDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1173

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              ++ +IEYVGE +R +I+++RE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1174 PKDEMIIEYVGEEVRQQIAELRESRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINHS 1233

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1234 CMPNCTAKIIKVEGSKRIVIYALRDIAKNEELTYDYKFERELGSADRIPCLCGTAACKGF 1293

Query: 935  LN 936
            LN
Sbjct: 1294 LN 1295


>gi|302910631|ref|XP_003050330.1| histone H3 methyltransferase complex protein [Nectria haematococca
            mpVI 77-13-4]
 gi|256731267|gb|EEU44617.1| histone H3 methyltransferase complex protein [Nectria haematococca
            mpVI 77-13-4]
          Length = 1281

 Score =  199 bits (506), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 96/182 (52%), Positives = 125/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1100 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1159

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+D+  V+DATK+GGIARFINHS
Sbjct: 1160 AKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHS 1219

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1220 CMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLDRIPCLCGTAACKGF 1279

Query: 935  LN 936
            LN
Sbjct: 1280 LN 1281


>gi|409078063|gb|EKM78427.1| hypothetical protein AGABI1DRAFT_41599 [Agaricus bisporus var.
           burnettii JB137-S8]
 gi|426194069|gb|EKV44001.1| histone methyltransferase [Agaricus bisporus var. bisporus H97]
          Length = 163

 Score =  199 bits (506), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 92/158 (58%), Positives = 115/158 (72%)

Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
           A A     +K +Q++ RKK LRF +S IHDWGL A+E I   + VIEYVGE+IR  ++D 
Sbjct: 6   AIANELTFIKFNQLQTRKKHLRFAKSPIHDWGLYAMERIARGEMVIEYVGEVIRAAVADK 65

Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
           RE  YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC  K+I++ G KKI IY
Sbjct: 66  REKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIY 125

Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           AK+ I  G+EITY+Y FP E+ KIPC CGS KC G LN
Sbjct: 126 AKQDIELGDEITYDYHFPFEQDKIPCLCGSAKCRGFLN 163


>gi|171692915|ref|XP_001911382.1| hypothetical protein [Podospora anserina S mat+]
 gi|170946406|emb|CAP73207.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1083

 Score =  199 bits (506), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 98/191 (51%), Positives = 126/191 (65%), Gaps = 14/191 (7%)

Query: 757  SQWANGKGLSARTNRVKLRNLLAA--------AEGAELLKASQVKARKKRLRFQRSKIHD 808
            SQ A G   ++R NR   R  +A          + ++  + +Q+K RKK ++F RS IH+
Sbjct: 896  SQVARG---TSRANRANNRRFVAGLHDQIRGLGQDSDAFRFNQLKKRKKPVKFARSAIHN 952

Query: 809  WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
            WGL  +E I  +D +IEYVGE +R  I+++RE RY K GIGSSYLFR+DD  V+DATK+G
Sbjct: 953  WGLYTMENIPKDDMIIEYVGEEVRQVIAELREARYLKSGIGSSYLFRIDDNTVIDATKKG 1012

Query: 869  GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCY 925
            GIARFINHSC PNC  K+I VEG K+I IYA R IA  EE+TY+YKF  E     +IPC 
Sbjct: 1013 GIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGATDRIPCL 1072

Query: 926  CGSKKCHGSLN 936
            CG+  C G LN
Sbjct: 1073 CGTAACKGFLN 1083


>gi|169769549|ref|XP_001819244.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
            oryzae RIB40]
 gi|121933328|sp|Q2UMH3.1|SET1_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|83767103|dbj|BAE57242.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1229

 Score =  199 bits (505), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1045 TSRSTRVNNRRLIADINAQKQALPTQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1104

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1105 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1164

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1165 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1224

Query: 932  HGSLN 936
             G LN
Sbjct: 1225 KGFLN 1229


>gi|367010698|ref|XP_003679850.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
 gi|359747508|emb|CCE90639.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
          Length = 1019

 Score =  199 bits (505), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 99/193 (51%), Positives = 130/193 (67%), Gaps = 11/193 (5%)

Query: 755  NASQWANGKGLSARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKI 806
            + S+ A  +  S+R NR   R      E         +ELL  +Q+  RKK + F RS I
Sbjct: 827  DPSEPAPQELSSSRDNRASNRRFQQDIEAQRAAIGTESELLSLNQLNKRKKPVTFARSAI 886

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
            H+WGL ALEPI A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK
Sbjct: 887  HNWGLYALEPIAAKEMIIEYVGERIRQPVAEMREIRYIKKGIGSSYLFRVDENTVIDATK 946

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIP 923
            +GGIARFINH C+P+C  K+I V G+K+I IYA R IAA EE+TY+YKF  E   E+++P
Sbjct: 947  KGGIARFINHCCDPSCTAKIIKVGGKKRIVIYALRDIAANEELTYDYKFERETDDEERLP 1006

Query: 924  CYCGSKKCHGSLN 936
            C CG+  C G LN
Sbjct: 1007 CLCGAPTCKGFLN 1019


>gi|391863483|gb|EIT72791.1| histone H3 (Lys4) methyltransferase complex, subunit SET1
            [Aspergillus oryzae 3.042]
          Length = 1223

 Score =  199 bits (505), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1039 TSRSTRVNNRRLIADINAQKQALPTQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1098

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1099 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1158

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1159 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGC 1218

Query: 932  HGSLN 936
             G LN
Sbjct: 1219 KGFLN 1223


>gi|378725927|gb|EHY52386.1| histone-lysine N-methyltransferase SETD1 [Exophiala dermatitidis
            NIH/UT8656]
          Length = 1277

 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 94/170 (55%), Positives = 124/170 (72%), Gaps = 6/170 (3%)

Query: 770  NRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
             R  LR+L    + A++L+ +Q++ RKK ++F RS IH+WGL +LE I+  + +IEYVGE
Sbjct: 1111 QREMLRDL---GDDADVLRFNQLQKRKKPVKFARSAIHNWGLYSLERIQTSEMIIEYVGE 1167

Query: 830  LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
             IR +I+D+RE +Y + GIGSSYLFR+D+G VVDATK+GGIARFINHSC+PNC  K+I V
Sbjct: 1168 KIRQEIADLREIKYTESGIGSSYLFRIDEGTVVDATKKGGIARFINHSCSPNCTAKIIRV 1227

Query: 890  EGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
             G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C G LN
Sbjct: 1228 GGTKRIVIYALRDIEKDEELTYDYKFEREIDSDDRIPCLCGSAVCKGFLN 1277


>gi|365983760|ref|XP_003668713.1| hypothetical protein NDAI_0B04360 [Naumovozyma dairenensis CBS 421]
 gi|343767480|emb|CCD23470.1| hypothetical protein NDAI_0B04360 [Naumovozyma dairenensis CBS 421]
          Length = 1105

 Score =  198 bits (504), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 102/182 (56%), Positives = 131/182 (71%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLR----NLLA--AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R    N+ A  AA GAE  LL  +Q+  RKK + F RS IH+WGL ALEPI
Sbjct: 924  SSRDNRASNRRFQQNIEAQRAAIGAESDLLSLNQLNKRKKPVTFARSAIHNWGLYALEPI 983

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 984  AAKEMIIEYVGESIRQAVAEMRERRYLKNGIGSSYLFRVDENNVIDATKKGGIARFINHC 1043

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+P+C  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC+CGS  C G 
Sbjct: 1044 CDPSCTAKIIKVGGRRRIVIYALRDIAANEELTYDYKFEREIDAEERLPCHCGSVNCKGF 1103

Query: 935  LN 936
            LN
Sbjct: 1104 LN 1105


>gi|156061849|ref|XP_001596847.1| hypothetical protein SS1G_03070 [Sclerotinia sclerotiorum 1980]
 gi|154700471|gb|EDO00210.1| hypothetical protein SS1G_03070 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1264

 Score =  198 bits (504), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 96/182 (52%), Positives = 123/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S R NRV  R  +A             ++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1083 SGRANRVNNRRYVADLNDQKKTLGGDTDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1142

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
               D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINHS
Sbjct: 1143 AMNDMIIEYVGEKVRQQVADLRENRYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHS 1202

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I+VE  K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1203 CMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTPACKGF 1262

Query: 935  LN 936
            LN
Sbjct: 1263 LN 1264


>gi|119467882|ref|XP_001257747.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119405899|gb|EAW15850.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 1241

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1057 TSRSTRVNNRRLVADINAQKQALPMQGGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1116

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1117 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1176

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1177 NHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREWDSDDRIPCLCGSTGC 1236

Query: 932  HGSLN 936
             G LN
Sbjct: 1237 KGFLN 1241


>gi|312091131|ref|XP_003146871.1| histone methyltransferase [Loa loa]
 gi|307757965|gb|EFO17199.1| histone methyltransferase [Loa loa]
          Length = 278

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 105/234 (44%), Positives = 146/234 (62%), Gaps = 16/234 (6%)

Query: 707 LCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGL- 765
           + P S GCAR+      ++K S      +R  +R      T     E+N       + L 
Sbjct: 57  VIPHSSGCARTE----GYYKLS---HKEKRGILRRPDVFLT-----EINEKDDEKARYLV 104

Query: 766 -SARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDF 822
            S R  R   R LL +     +++ K +Q+K RKK ++F RS+IH WGL ALE I  ++ 
Sbjct: 105 QSTREARSMNRRLLTSMGDTSSDIFKVNQLKFRKKLIKFARSRIHGWGLYALEAIAPDEM 164

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           ++EY+G+ IR  ++D RE RYE+ G+GSSYLFR+D   V+DAT+ G +ARFINHSC PNC
Sbjct: 165 IVEYIGQKIRPTVADEREKRYERRGMGSSYLFRIDSDNVIDATQMGNLARFINHSCQPNC 224

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y K++ V+G+K+I IY+K  I  G+EITY+YKFP+EE KI C CG+  C GSLN
Sbjct: 225 YAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 278


>gi|159124080|gb|EDP49198.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 1241

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1057 TSRSTRVNNRRLVADINAQKQALPMQGGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1116

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1117 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1176

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1177 NHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREWDSDDRIPCLCGSTGC 1236

Query: 932  HGSLN 936
             G LN
Sbjct: 1237 KGFLN 1241


>gi|70991351|ref|XP_750524.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|74671075|sp|Q4WNH8.1|SET1_ASPFU RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component set1; AltName:
            Full=SET domain-containing protein 1
 gi|66848157|gb|EAL88486.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 1241

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 97/185 (52%), Positives = 125/185 (67%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1057 TSRSTRVNNRRLVADINAQKQALPMQGGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1116

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1117 ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1176

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1177 NHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREWDSDDRIPCLCGSTGC 1236

Query: 932  HGSLN 936
             G LN
Sbjct: 1237 KGFLN 1241


>gi|254569422|ref|XP_002491821.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238031618|emb|CAY69541.1| hypothetical protein PAS_chr2-2_0494 [Komagataella pastoris GS115]
 gi|328351679|emb|CCA38078.1| histone-lysine N-methyltransferase SETD1 [Komagataella pastoris CBS
            7435]
          Length = 1020

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 97/187 (51%), Positives = 126/187 (67%), Gaps = 11/187 (5%)

Query: 761  NGKGLSARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLV 812
            N K  S+R NR   R   A             ++L  + +  RKK + F RS IH+WGL 
Sbjct: 834  NPKLQSSRVNRANNRRFAADINAQKQMLNSETDILDLNHLTKRKKPVSFARSAIHNWGLY 893

Query: 813  ALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIAR 872
            ALEPI A++ +IEYVGE++R K+S++RE +Y K GIGSSYLFR+D+  V+DATK+GGIAR
Sbjct: 894  ALEPIAAKEMIIEYVGEILRQKVSEVREKKYLKSGIGSSYLFRVDEDTVIDATKKGGIAR 953

Query: 873  FINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSK 929
            FINH C P+C  K+I VEG+K+I IYA + IAA EE+TY+YKF  E   E++IPC CG  
Sbjct: 954  FINHCCQPSCTAKIIKVEGKKRIVIYALKDIAANEELTYDYKFEREDNNEERIPCLCGVP 1013

Query: 930  KCHGSLN 936
             C G LN
Sbjct: 1014 GCKGYLN 1020


>gi|121709862|ref|XP_001272547.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119400697|gb|EAW11121.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 1232

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 98/183 (53%), Positives = 125/183 (68%), Gaps = 4/183 (2%)

Query: 757  SQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEP 816
            S   N + L A  N  K    + + +G + L+ +Q+K RKK +RF RS IH+WGL A E 
Sbjct: 1051 STRVNNRRLVADINAQKQALPMQSGDG-DALRFNQLKKRKKPVRFARSAIHNWGLYAEEN 1109

Query: 817  IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
            I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+  V+DATKRGGIARFINH
Sbjct: 1110 ISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFINH 1169

Query: 877  SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHG 933
            SC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C G
Sbjct: 1170 SCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCKG 1229

Query: 934  SLN 936
             LN
Sbjct: 1230 FLN 1232


>gi|451848788|gb|EMD62093.1| hypothetical protein COCSADRAFT_162605 [Cochliobolus sativus ND90Pr]
          Length = 1180

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 95/190 (50%), Positives = 126/190 (66%), Gaps = 8/190 (4%)

Query: 754  VNASQWANGKGLSARTNR----VKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDW 809
             N +  AN +   A   R    + +   L+ AEG + L+ +Q+K RKK ++F RS IH+W
Sbjct: 992  ANKASTANSRANRANNRRMVNDINISKNLSGAEG-DALRFNQLKKRKKLVKFDRSAIHNW 1050

Query: 810  GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
            GL A EPI   D +IEYVGE +R +++D+RE +Y+  G+GSSYLFR+D+  V+DATK GG
Sbjct: 1051 GLYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGSSYLFRIDEDTVIDATKMGG 1110

Query: 870  IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYC 926
            IARFINHSC PNC  K+I V+  K+I IYA R I + EE+TY+YKF  E     +IPC C
Sbjct: 1111 IARFINHSCTPNCTAKIIRVDNTKRIVIYALRDIQSDEELTYDYKFEREIDATDRIPCLC 1170

Query: 927  GSKKCHGSLN 936
            GS  C G LN
Sbjct: 1171 GSIGCKGFLN 1180


>gi|50293843|ref|XP_449333.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74637287|sp|Q6FKB1.1|SET1_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|49528646|emb|CAG62307.1| unnamed protein product [Candida glabrata]
          Length = 1111

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 101/182 (55%), Positives = 129/182 (70%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLR----NLLA--AAEG--AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R    N+ A  AA G  +ELL  +Q+  RKK + F RS IH+WGL ALEPI
Sbjct: 930  SSRDNRASNRRFQQNIEAQKAATGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPI 989

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ VIEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 990  NAKEMVIEYVGERIRQPVAEMRERRYIKNGIGSSYLFRIDEHTVIDATKKGGIARFINHC 1049

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C P+C  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1050 CEPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERETDAEERLPCLCGAPSCKGF 1109

Query: 935  LN 936
            LN
Sbjct: 1110 LN 1111


>gi|255711468|ref|XP_002552017.1| KLTH0B05280p [Lachancea thermotolerans]
 gi|238933395|emb|CAR21579.1| KLTH0B05280p [Lachancea thermotolerans CBS 6340]
          Length = 986

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 126/182 (69%), Gaps = 11/182 (6%)

Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL ALEPI
Sbjct: 805 SSRVNRAINRRFQQDIEAQKAMIGSESELLTLNQLTKRKKPVTFARSAIHNWGLYALEPI 864

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ +IEYVGE++R  ++++RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 865 AAKEMIIEYVGEILRQPVAEMRERTYLKSGIGSSYLFRVDESTVIDATKKGGIARFINHC 924

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
           C+P+C  K+I V G+K+I IYA R IAA EE+TY+YKF  E   E+++PC+CG+  C G 
Sbjct: 925 CDPSCTAKIIRVGGRKRIVIYALRDIAANEELTYDYKFERETDDEERLPCFCGAPTCKGF 984

Query: 935 LN 936
           LN
Sbjct: 985 LN 986


>gi|324507672|gb|ADY43247.1| Histone-lysine N-methyltransferase set-2 [Ascaris suum]
          Length = 539

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 91/174 (52%), Positives = 122/174 (70%), Gaps = 3/174 (1%)

Query: 766 SARTNRVKLRNLLAAA--EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           + R  R   R LL +     ++  K +Q+K RKK ++F RS+IH WGL ALEPI  ++ +
Sbjct: 366 ATREARSDNRRLLTSMGETSSDFFKVNQLKYRKKMIKFARSRIHGWGLYALEPITPDEMI 425

Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
           +EYVG+ IR  ++D RE RY + G+GSSYLFR+D   V+DAT  G  ARFINHSC PNCY
Sbjct: 426 VEYVGQKIRPTVADEREKRYIRKGMGSSYLFRIDSDNVIDATNMGNFARFINHSCQPNCY 485

Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK-KIPCYCGSKKCHGSLN 936
            KV+ V+G+K+I IY+K  I  G+EITY+YKFP+EE+ KI C CG+  C G+LN
Sbjct: 486 AKVVVVDGEKRIVIYSKTPINKGDEITYDYKFPIEEEDKIDCLCGAPSCRGTLN 539


>gi|385305977|gb|EIF49918.1| putative compass histone methyltransferase subunit set1p [Dekkera
            bruxellensis AWRI1499]
          Length = 1104

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 97/183 (53%), Positives = 128/183 (69%), Gaps = 11/183 (6%)

Query: 765  LSARTNRVKLRNLLAAAE------GAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEP 816
             S+R NR   R   A         G+E  LL+ +Q+  R+K ++F RS IH+WGL A EP
Sbjct: 922  FSSRVNRAASRRFAADVSMQKQMLGSESDLLELNQLLKRQKPVQFARSAIHNWGLYAXEP 981

Query: 817  IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
            I A++ + EYVGE IR K++++RE RY K GIGSSYLFR+DD  V+DA+K+GGIARFINH
Sbjct: 982  IAAKEMIXEYVGERIRQKVAEVREKRYLKSGIGSSYLFRVDDNTVIDASKKGGIARFINH 1041

Query: 877  SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHG 933
             C+P+C  K+I V+G+K+I IYA R IAA EE+TY+YKF  E   E++IPC CG+  C G
Sbjct: 1042 CCDPSCTAKIIKVDGKKRIVIYALRDIAANEELTYDYKFEKETNPEERIPCLCGAPNCKG 1101

Query: 934  SLN 936
             LN
Sbjct: 1102 YLN 1104


>gi|440632035|gb|ELR01954.1| hypothetical protein GMDG_05127 [Geomyces destructans 20631-21]
          Length = 1301

 Score =  197 bits (500), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 122/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NRV  R  +A            A+ L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1120 SSRANRVNNRRFVADLNDQKKTLGGEADALRFNQLKKRKKPVKFARSAIHNWGLYAMENI 1179

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
               D +IEYVGE +R  ++D+RE  Y K GIGSSYLFR+D+  VVDATKRGGIARFINHS
Sbjct: 1180 AMNDMIIEYVGEKLRQSVADLRERIYLKSGIGSSYLFRIDENTVVDATKRGGIARFINHS 1239

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C PNC  K+I VEG ++I IYA R I   EE+TY+YKF  E   + +IPC CG+  C G 
Sbjct: 1240 CMPNCTAKIIKVEGTRRIVIYALRDIKLNEELTYDYKFEREIGSDDRIPCLCGTVACKGF 1299

Query: 935  LN 936
            LN
Sbjct: 1300 LN 1301


>gi|67539250|ref|XP_663399.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
 gi|74680884|sp|Q5B0Y5.1|SET1_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|40743698|gb|EAA62888.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
 gi|259484715|tpe|CBF81174.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC
            2.1.1.43)(COMPASS component SET1)(SET domain-containing
            protein 1) [Source:UniProtKB/Swiss-Prot;Acc:Q5B0Y5]
            [Aspergillus nidulans FGSC A4]
          Length = 1220

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 97/185 (52%), Positives = 128/185 (69%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNLLA---------AAEG--AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A          ++G  +++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1036 TSRSTRVNNRRLIADINAQKQALPSQGGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1095

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
              I A + +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1096 VNISANEMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1155

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E   + +IPC CGS  C
Sbjct: 1156 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSAGC 1215

Query: 932  HGSLN 936
             G LN
Sbjct: 1216 KGFLN 1220


>gi|451998610|gb|EMD91074.1| hypothetical protein COCHEDRAFT_1156416 [Cochliobolus heterostrophus
            C5]
          Length = 1251

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 91/170 (53%), Positives = 120/170 (70%), Gaps = 4/170 (2%)

Query: 770  NRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
            N + +   L+ AEG + L+ +Q+K RKK ++F RS IH+WGL A EPI   D +IEYVGE
Sbjct: 1083 NDINISKNLSGAEG-DALRFNQLKKRKKLVKFDRSAIHNWGLYAQEPIAVNDMIIEYVGE 1141

Query: 830  LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
             +R +++D+RE +Y+  G+GSSYLFR+D+  V+DATK GGIARFINHSC PNC  K+I V
Sbjct: 1142 KVRQRVADLREAKYDMQGVGSSYLFRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRV 1201

Query: 890  EGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +  K+I IYA R I + EE+TY+YKF  E     +IPC CGS  C G LN
Sbjct: 1202 DNTKRIVIYALRDIQSDEELTYDYKFEREIDATDRIPCLCGSIGCKGFLN 1251


>gi|189205485|ref|XP_001939077.1| SET domain containing protein [Pyrenophora tritici-repentis
            Pt-1C-BFP]
 gi|187975170|gb|EDU41796.1| SET domain containing protein [Pyrenophora tritici-repentis
            Pt-1C-BFP]
          Length = 1274

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 94/189 (49%), Positives = 126/189 (66%), Gaps = 8/189 (4%)

Query: 755  NASQWANGKGLSARTNR----VKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWG 810
            N +  AN +   A   R    + +   ++ AEG + L+ +Q+K RKK ++F RS IH+WG
Sbjct: 1087 NKTSTANSRANRANNRRMVNDINISKNMSGAEG-DALRFNQLKKRKKLVKFDRSAIHNWG 1145

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A EPI   D +IEYVGE +R +++D+RE +Y+  G+GSSYLFR+D+  V+DATK GGI
Sbjct: 1146 LYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGSSYLFRIDEDTVIDATKMGGI 1205

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCG 927
            ARFINHSC PNC  K+I V+  K+I IYA R I + EE+TY+YKF  E     +IPC CG
Sbjct: 1206 ARFINHSCTPNCTAKIIRVDNTKRIVIYALRDINSDEELTYDYKFEREMDATDRIPCLCG 1265

Query: 928  SKKCHGSLN 936
            S  C G LN
Sbjct: 1266 SIGCKGFLN 1274


>gi|330906701|ref|XP_003295568.1| hypothetical protein PTT_01686 [Pyrenophora teres f. teres 0-1]
 gi|311333048|gb|EFQ96340.1| hypothetical protein PTT_01686 [Pyrenophora teres f. teres 0-1]
          Length = 1272

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 94/189 (49%), Positives = 126/189 (66%), Gaps = 8/189 (4%)

Query: 755  NASQWANGKGLSARTNR----VKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWG 810
            N +  AN +   A   R    + +   ++ AEG + L+ +Q+K RKK ++F RS IH+WG
Sbjct: 1085 NKTSTANSRANRANNRRMVNDINISKNMSGAEG-DALRFNQLKKRKKLVKFDRSAIHNWG 1143

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A EPI   D +IEYVGE +R +++D+RE +Y+  G+GSSYLFR+D+  V+DATK GGI
Sbjct: 1144 LYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGSSYLFRIDEDTVIDATKMGGI 1203

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCG 927
            ARFINHSC PNC  K+I V+  K+I IYA R I + EE+TY+YKF  E     +IPC CG
Sbjct: 1204 ARFINHSCTPNCTAKIIRVDNTKRIVIYALRDINSDEELTYDYKFEREMDATDRIPCLCG 1263

Query: 928  SKKCHGSLN 936
            S  C G LN
Sbjct: 1264 SIGCKGFLN 1272


>gi|320580861|gb|EFW95083.1| histone-lysine n-methyltransferase, h3 lysine-4 specific, putative
           [Ogataea parapolymorpha DL-1]
          Length = 658

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 94/182 (51%), Positives = 126/182 (69%), Gaps = 11/182 (6%)

Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           S+R NR   R   A             ++L  +Q+  RKK ++F RS IH+WGL ALEPI
Sbjct: 477 SSRVNRANNRRFAADISAQKQMLGSETDILDLNQLTKRKKPVQFARSAIHNWGLYALEPI 536

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ +IEYVGE IR +++++RE +Y + GIGSSYLFR+D+  V+DA+K+GGIARFINH 
Sbjct: 537 AAKEMIIEYVGERIRQQVAEVREKKYLRSGIGSSYLFRIDENTVIDASKKGGIARFINHC 596

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
           C P+C  K+I VEG+K+I IYA R IAA EE+TY+YKF  E   E++IPC CG+  C G 
Sbjct: 597 CVPSCTAKIIKVEGKKRIVIYALRDIAANEELTYDYKFERETNDEERIPCLCGAPGCKGY 656

Query: 935 LN 936
           LN
Sbjct: 657 LN 658


>gi|302306708|ref|NP_983083.2| ABR136Wp [Ashbya gossypii ATCC 10895]
 gi|442570023|sp|Q75D88.2|SET1_ASHGO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
           specific; AltName: Full=COMPASS component SET1; AltName:
           Full=SET domain-containing protein 1
 gi|299788647|gb|AAS50907.2| ABR136Wp [Ashbya gossypii ATCC 10895]
          Length = 975

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL ALEPI
Sbjct: 794 SSRVNRALQRRFQQDIEAQKAAIGTESELLSLNQLTKRKKPVTFARSAIHNWGLYALEPI 853

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 854 SAKEMIIEYVGERIRQPVAEMREKRYLKSGIGSSYLFRVDESTVIDATKKGGIARFINHC 913

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
           C+P+C  K+I V G K+I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 914 CDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERETDDEERLPCLCGAPNCKGF 973

Query: 935 LN 936
           LN
Sbjct: 974 LN 975


>gi|374106286|gb|AEY95196.1| FABR136Wp [Ashbya gossypii FDAG1]
          Length = 975

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL ALEPI
Sbjct: 794 SSRVNRALQRRFQQDIEAQKAAIGTESELLSLNQLTKRKKPVTFARSAIHNWGLYALEPI 853

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 854 SAKEMIIEYVGERIRQPVAEMREKRYLKSGIGSSYLFRVDESTVIDATKKGGIARFINHC 913

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
           C+P+C  K+I V G K+I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 914 CDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERETDDEERLPCLCGAPNCKGF 973

Query: 935 LN 936
           LN
Sbjct: 974 LN 975


>gi|254585415|ref|XP_002498275.1| ZYRO0G06446p [Zygosaccharomyces rouxii]
 gi|238941169|emb|CAR29342.1| ZYRO0G06446p [Zygosaccharomyces rouxii]
          Length = 1015

 Score =  196 bits (498), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 96/182 (52%), Positives = 122/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL ALEPI
Sbjct: 834  SSRDNRASNRRFQQDIEAQRAAIGTESELLSLNQLNKRKKPVTFARSTIHNWGLYALEPI 893

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 894  AAKEMIIEYVGERIRQPVAEMREIRYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHC 953

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C P+C  K+I V G K+I IYA R I A EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 954  CEPSCTAKIIKVGGMKRIVIYALRDIGANEELTYDYKFEREIDAEERLPCLCGAPSCKGF 1013

Query: 935  LN 936
            LN
Sbjct: 1014 LN 1015


>gi|347827280|emb|CCD42977.1| similar to histone-lysine N-methyltransferase [Botryotinia
            fuckeliana]
          Length = 1265

 Score =  196 bits (498), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 89/155 (57%), Positives = 115/155 (74%), Gaps = 3/155 (1%)

Query: 785  ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
            ++L+ +Q+K RKK ++F RS IH+WGL A+E I   D +IEYVGE +R +++D+RE RY 
Sbjct: 1111 DVLRFNQLKKRKKPVKFARSAIHNWGLYAMENIAMNDMIIEYVGEKVRQQVADLRENRYL 1170

Query: 845  KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
            K GIGSSYLFR+D+  V+DATK+GGIARFINHSC PNC  K+I+VE  K+I IYA R IA
Sbjct: 1171 KSGIGSSYLFRIDENTVIDATKKGGIARFINHSCMPNCTAKIITVEKSKRIVIYALRDIA 1230

Query: 905  AGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
              EE+TY+YKF  E     +IPC CG+  C G LN
Sbjct: 1231 QNEELTYDYKFEREIGSTDRIPCLCGTPACKGFLN 1265


>gi|367004711|ref|XP_003687088.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
 gi|357525391|emb|CCE64654.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
          Length = 1030

 Score =  196 bits (497), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 123/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+PI
Sbjct: 849  SSRVNRAMNRRFQQDIEAQKAVIGSESELLTLNQLNKRKKPVTFARSAIHNWGLYALQPI 908

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  +S++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 909  AAKEMIIEYVGERIRQPVSEMREIRYIKNGIGSSYLFRIDENTVIDATKKGGIARFINHC 968

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGS 934
            C+P+C  K+I V G+K+I IYA R I   EE+TY+YKF  EE   +++PC CG+  C G 
Sbjct: 969  CDPSCTAKIIKVGGKKRIVIYALRDIDVNEELTYDYKFEREEDDQERLPCLCGAPNCKGF 1028

Query: 935  LN 936
            LN
Sbjct: 1029 LN 1030


>gi|259146872|emb|CAY80128.1| Set1p [Saccharomyces cerevisiae EC1118]
          Length = 1080

 Score =  196 bits (497), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 96/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 899  SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 959  AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            CNPNC  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1019 CNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078

Query: 935  LN 936
            LN
Sbjct: 1079 LN 1080


>gi|323348281|gb|EGA82530.1| Set1p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 980

 Score =  196 bits (497), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 96/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766 SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 799 SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 858

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 859 AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 918

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
           CNPNC  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 919 CNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 978

Query: 935 LN 936
           LN
Sbjct: 979 LN 980


>gi|380806421|gb|AFE75086.1| histone-lysine N-methyltransferase SETD1B, partial [Macaca mulatta]
          Length = 367

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 104/180 (57%), Positives = 133/180 (73%), Gaps = 5/180 (2%)

Query: 751 GPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQRSKIH 807
           G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +S IH
Sbjct: 190 GMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCKSHIH 247

Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
           DWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++DATK 
Sbjct: 248 DWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKC 307

Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
           G  ARFINHSCNPNCY KVI+VE QKKI IY+K+HI   EEITY+YKFP+E+ KIPC CG
Sbjct: 308 GNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCG 367


>gi|169601854|ref|XP_001794349.1| hypothetical protein SNOG_03803 [Phaeosphaeria nodorum SN15]
 gi|111067888|gb|EAT89008.1| hypothetical protein SNOG_03803 [Phaeosphaeria nodorum SN15]
          Length = 1168

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 94/185 (50%), Positives = 123/185 (66%), Gaps = 4/185 (2%)

Query: 755  NASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
             A+  AN        N + +   +  AEG + L+ +Q+K RKK ++F RS IH+WGL A 
Sbjct: 985  TANSRANRANNRRMVNDINISKNMTGAEG-DALRFNQLKKRKKLVKFDRSAIHNWGLYAQ 1043

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY++ G+GSSYLFR+D+  V+DATK GGIARFI
Sbjct: 1044 ENIVANDMIIEYVGEKVRQRVADLREVRYDQQGVGSSYLFRIDEDTVIDATKMGGIARFI 1103

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC PNC  K+I V+  K+I IYA R I   EE+TY+YKF  E     +IPC CGS  C
Sbjct: 1104 NHSCTPNCTAKIIRVDNTKRIVIYALRDIGQDEELTYDYKFEREMDATDRIPCLCGSVGC 1163

Query: 932  HGSLN 936
             G LN
Sbjct: 1164 KGFLN 1168


>gi|346974289|gb|EGY17741.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
          Length = 1148

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 91/172 (52%), Positives = 121/172 (70%), Gaps = 3/172 (1%)

Query: 768  RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R +   L++ + ++   ++ K +Q+K RKK ++F RS IH+WGL   E I  +D +IEYV
Sbjct: 977  RRHVTALQDEMKSSGDPDVFKFNQLKKRKKPVKFARSAIHNWGLYTEENINKDDMIIEYV 1036

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            GE +R +IS+IRE RY K G+GSSYLFR+D+  V+DATK+GGIARFINHSC PNC  K+I
Sbjct: 1037 GEQVRQQISEIREVRYLKQGMGSSYLFRIDENTVIDATKKGGIARFINHSCMPNCTAKII 1096

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
             V+G K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G LN
Sbjct: 1097 KVDGSKRIVIYALRDIARTEELTYDYKFEREIGSLDRIPCLCGTALCKGFLN 1148


>gi|429862241|gb|ELA36898.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
            Nara gc5]
          Length = 1270

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 95/181 (52%), Positives = 122/181 (67%), Gaps = 10/181 (5%)

Query: 766  SARTNRVKLRNLLAAAEG-------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIE 818
            ++R NR   R  ++A +        ++  K +Q+K RKK ++F RS IH+WGL   E I 
Sbjct: 1090 NSRANRANNRRHVSALQDEKKMSGDSDAFKFNQLKKRKKPVKFARSAIHNWGLYTEENIN 1149

Query: 819  AEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
             +D +IEYVGE +R  IS+IRE RY K G+GSSYLFR+DD  V+DATK+GGIARFINHSC
Sbjct: 1150 KDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATKKGGIARFINHSC 1209

Query: 879  NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSL 935
             PNC  K+I V+G K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G L
Sbjct: 1210 MPNCTAKIIKVDGSKRIVIYALRDIAQHEELTYDYKFEREIGSLDRIPCLCGTAACKGFL 1269

Query: 936  N 936
            N
Sbjct: 1270 N 1270


>gi|398394325|ref|XP_003850621.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
 gi|339470500|gb|EGP85597.1| histone methyltransferase [Zymoseptoria tritici IPO323]
          Length = 1163

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 95/186 (51%), Positives = 123/186 (66%), Gaps = 15/186 (8%)

Query: 766  SARTNRVKLRNLL------------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            ++R+NRV  R L+            A ++    ++ +Q+K RKK ++F RS IH WGL A
Sbjct: 978  TSRSNRVNNRRLVNDINLQKQTLATANSDADVAIRFNQLKKRKKLVKFDRSAIHGWGLYA 1037

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I   D +IEYVGE +R KI+D+RE RYEK G+GSSYLFR+ D  +VDATK+GGIARF
Sbjct: 1038 EENIAVNDLIIEYVGEKVRQKIADLREIRYEKQGVGSSYLFRMIDDEIVDATKKGGIARF 1097

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
            INHSC+PNC  K+I VEG  +I IYA + I   +E+TY+YKF  E     +IPC CGS  
Sbjct: 1098 INHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDYKFEREMDSTDRIPCLCGSAN 1157

Query: 931  CHGSLN 936
            C G LN
Sbjct: 1158 CKGFLN 1163


>gi|403217274|emb|CCK71769.1| hypothetical protein KNAG_0H03550 [Kazachstania naganishii CBS 8797]
          Length = 1111

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 96/180 (53%), Positives = 125/180 (69%), Gaps = 11/180 (6%)

Query: 765  LSARTNRVKLRNLLA------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEP 816
            LS+R NR + R          AA G E  LL  +Q+  RKK + F RS IH+WGL ALEP
Sbjct: 931  LSSRDNRAQNRRFQQDIEAQKAAIGTESDLLSLNQLNKRKKPVTFARSAIHNWGLYALEP 990

Query: 817  IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
            I A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATKRGGIARFINH
Sbjct: 991  IAAKEMIIEYVGERIRQPVAEMREMRYLKSGIGSSYLFRVDENNVIDATKRGGIARFINH 1050

Query: 877  SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHG 933
             C+P+C  K+I V G+++I IYA R I   EE+TY+YKF  E   E+++PC+CG+  C G
Sbjct: 1051 CCDPSCTAKIIKVGGKRRIVIYALRDIGKNEELTYDYKFEREQDDEERLPCHCGAPTCKG 1110


>gi|302416827|ref|XP_003006245.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
 gi|261355661|gb|EEY18089.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
          Length = 1135

 Score =  194 bits (494), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 91/172 (52%), Positives = 121/172 (70%), Gaps = 3/172 (1%)

Query: 768  RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            R +   L++ + ++   ++ K +Q+K RKK ++F RS IH+WGL   E I  +D +IEYV
Sbjct: 964  RRHVTALQDEMKSSGDPDVFKFNQLKKRKKPVKFARSAIHNWGLYTEENINKDDMIIEYV 1023

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            GE +R +IS+IRE RY K G+GSSYLFR+D+  V+DATK+GGIARFINHSC PNC  K+I
Sbjct: 1024 GEQVRQQISEIREVRYLKQGMGSSYLFRIDENTVIDATKKGGIARFINHSCMPNCTAKII 1083

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
             V+G K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G LN
Sbjct: 1084 KVDGSKRIVIYALRDIARTEELTYDYKFEREIGSLDRIPCLCGTALCKGFLN 1135


>gi|50312247|ref|XP_456155.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74636430|sp|Q6CIT4.1|SET1_KLULA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|49645291|emb|CAG98863.1| KLLA0F24134p [Kluyveromyces lactis]
          Length = 1000

 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 123/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NRV  R      E         ++LL  +Q+  RKK + F RS IH+WGL ALEPI
Sbjct: 819  SSRLNRVFQRRFQQDIEAQRAAIGFESDLLSLNQLTKRKKPVTFARSAIHNWGLYALEPI 878

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATKRGGIARFINH 
Sbjct: 879  AAKEMIIEYVGESIRQPVAEMREKRYIKSGIGSSYLFRIDENTVIDATKRGGIARFINHC 938

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGS 934
            C P+C  K+I V+G+K+I IYA R I   EE+TY+YKF  E    +++PC CG+  C G 
Sbjct: 939  CEPSCTAKIIKVDGRKRIVIYALRDIGTNEELTYDYKFERETDEGERLPCLCGAPSCKGF 998

Query: 935  LN 936
            LN
Sbjct: 999  LN 1000


>gi|308487582|ref|XP_003105986.1| CRE-SET-2 protein [Caenorhabditis remanei]
 gi|308254560|gb|EFO98512.1| CRE-SET-2 protein [Caenorhabditis remanei]
          Length = 1505

 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 102/247 (41%), Positives = 145/247 (58%), Gaps = 23/247 (9%)

Query: 707  LCPRSDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLS 766
            + P ++GCAR+      + K S+    +   R      V T         +     + L+
Sbjct: 1265 VIPVAEGCARAR----PYKKMSMKQKRSLVRRPENESSVSTAIFSERDETA--MRHQHLA 1318

Query: 767  ARTNRVKLRNLLAAAEGA--ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
             +  R+  R LL +   A  +  K +Q+K RKK ++F RS+IH WGL A+E I  ++ ++
Sbjct: 1319 NKDMRLLQRRLLTSLGDASNDFFKINQLKFRKKMIKFARSRIHGWGLYAMETIAQDEMIV 1378

Query: 825  EYVGEL---------------IRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
            EY+G+                IR  ++D RE  YE+ GIGSSYLFR+D+  V+DATKRG 
Sbjct: 1379 EYIGQTVIVFQNFSSILFHLQIRSLVADEREKAYERRGIGSSYLFRIDENSVIDATKRGN 1438

Query: 870  IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSK 929
             ARFINHSC PNCY KV+++EG+K+I IY++  I  GEEITY+YKFP+E+ KI C CG+K
Sbjct: 1439 FARFINHSCQPNCYAKVLTIEGEKRIVIYSRSVINKGEEITYDYKFPIEDDKIDCLCGAK 1498

Query: 930  KCHGSLN 936
             C G LN
Sbjct: 1499 ACRGYLN 1505


>gi|154311696|ref|XP_001555177.1| hypothetical protein BC1G_06307 [Botryotinia fuckeliana B05.10]
          Length = 451

 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 122/182 (67%), Gaps = 11/182 (6%)

Query: 766 SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
           S R  RV  R  +A             ++L+ +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 270 SGRATRVNNRRYVADLNDQKKTLGGDTDVLRFNQLKKRKKPVKFARSAIHNWGLYAMENI 329

Query: 818 EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINHS
Sbjct: 330 AMNDMIIEYVGEKVRQQVADLRENRYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHS 389

Query: 878 CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
           C PNC  K+I+VE  K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 390 CMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTPACKGF 449

Query: 935 LN 936
           LN
Sbjct: 450 LN 451


>gi|256271664|gb|EEU06704.1| Set1p [Saccharomyces cerevisiae JAY291]
          Length = 1080

 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 899  SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 959  AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+PNC  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078

Query: 935  LN 936
            LN
Sbjct: 1079 LN 1080


>gi|151944065|gb|EDN62358.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
          Length = 1080

 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 899  SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 959  AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+PNC  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078

Query: 935  LN 936
            LN
Sbjct: 1079 LN 1080


>gi|6321911|ref|NP_011987.1| Set1p [Saccharomyces cerevisiae S288c]
 gi|731707|sp|P38827.1|SET1_YEAST RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=Lysine N-methyltransferase 2; AltName: Full=SET
            domain-containing protein 1
 gi|529135|gb|AAB68867.1| Set1p [Saccharomyces cerevisiae]
 gi|190405898|gb|EDV09165.1| histone-lysine N-methyltransferase [Saccharomyces cerevisiae RM11-1a]
 gi|285810026|tpg|DAA06813.1| TPA: Set1p [Saccharomyces cerevisiae S288c]
 gi|392298926|gb|EIW10021.1| Set1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1080

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 899  SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 959  AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+PNC  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078

Query: 935  LN 936
            LN
Sbjct: 1079 LN 1080


>gi|349578671|dbj|GAA23836.1| K7_Set1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1080

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 899  SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 958

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 959  AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+PNC  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078

Query: 935  LN 936
            LN
Sbjct: 1079 LN 1080


>gi|156847122|ref|XP_001646446.1| hypothetical protein Kpol_1048p18 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156117123|gb|EDO18588.1| hypothetical protein Kpol_1048p18 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 1074

 Score =  194 bits (492), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 91/161 (56%), Positives = 118/161 (73%), Gaps = 3/161 (1%)

Query: 779  AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
            A    +ELL  +Q+  RKK + F RS IH+WGL ALEPI A++ +IEYVGE IR  ++++
Sbjct: 914  AIGTESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQPVAEM 973

Query: 839  RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
            RE RY K GIGSSYLFR+D+  V+DATKRGGIARFINH C+P+C  K+I V G K+I IY
Sbjct: 974  RERRYIKNGIGSSYLFRVDENTVIDATKRGGIARFINHCCDPSCTAKIIKVGGMKRIVIY 1033

Query: 899  AKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            A R IA+ EE+TY+YKF  E   ++++PC CG+  C G LN
Sbjct: 1034 ALRDIASNEELTYDYKFEREMDDKERLPCLCGAATCKGFLN 1074


>gi|410080444|ref|XP_003957802.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
 gi|372464389|emb|CCF58667.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
          Length = 1133

 Score =  194 bits (492), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 90/161 (55%), Positives = 118/161 (73%), Gaps = 3/161 (1%)

Query: 779  AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
            A    +ELL  +Q+  RKK + F RS IH+WGL ALEPI A++ +IEYVGE IR  ++++
Sbjct: 973  AIGSESELLSLNQLNKRKKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQPVAEM 1032

Query: 839  RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
            RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH C+P+C  K+I V G+++I IY
Sbjct: 1033 REKRYLKNGIGSSYLFRVDENNVIDATKKGGIARFINHCCDPSCTAKIIKVGGKRRIVIY 1092

Query: 899  AKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            A R IA  EE+TY+YKF  E   E+++PC CG+  C G LN
Sbjct: 1093 ALRDIAKNEELTYDYKFEREQDDEERLPCLCGAPNCKGFLN 1133


>gi|380494835|emb|CCF32851.1| SET domain-containing protein [Colletotrichum higginsianum]
          Length = 1257

 Score =  193 bits (491), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 97/193 (50%), Positives = 127/193 (65%), Gaps = 11/193 (5%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEG-------AELLKASQVKARKKRLRFQRSKI 806
            +  ++ A  KG ++R NR   R  ++A +        ++  K +Q+K RKK ++F RS I
Sbjct: 1066 LTVAEKAVAKG-NSRANRANNRRHVSALQDEKKISGDSDAFKFNQLKKRKKPVKFARSAI 1124

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
            H+WGL   E I  +D +IEYVGE +R  IS+IRE RY K G+GSSYLFR+DD  V+DATK
Sbjct: 1125 HNWGLYTEENINKDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATK 1184

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIP 923
            +GGIARFINHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E     +IP
Sbjct: 1185 KGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKFEREIGSLDRIP 1244

Query: 924  CYCGSKKCHGSLN 936
            C CG+  C G LN
Sbjct: 1245 CLCGTAACKGFLN 1257


>gi|310792530|gb|EFQ28057.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 1262

 Score =  193 bits (491), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 97/193 (50%), Positives = 127/193 (65%), Gaps = 11/193 (5%)

Query: 754  VNASQWANGKGLSARTNRVKLRNLLAAAEG-------AELLKASQVKARKKRLRFQRSKI 806
            + A++    KG ++R NR   R  ++A +        ++  K +Q+K RKK ++F RS I
Sbjct: 1071 LTAAEKVVAKG-NSRANRANNRRHVSALQDEKKISGDSDAFKFNQLKKRKKPVKFARSAI 1129

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
            H+WGL   E I  +D +IEYVGE +R  IS+IRE RY K G+GSSYLFR+DD  V+DATK
Sbjct: 1130 HNWGLYTEENINKDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATK 1189

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIP 923
            +GGIARFINHSC PNC  K+I V+G K+I IYA R I   EE+TY+YKF  E     +IP
Sbjct: 1190 KGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKFEREIGSLDRIP 1249

Query: 924  CYCGSKKCHGSLN 936
            C CG+  C G LN
Sbjct: 1250 CLCGTAACKGFLN 1262


>gi|302501458|ref|XP_003012721.1| hypothetical protein ARB_00972 [Arthroderma benhamiae CBS 112371]
 gi|291176281|gb|EFE32081.1| hypothetical protein ARB_00972 [Arthroderma benhamiae CBS 112371]
          Length = 1363

 Score =  193 bits (490), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 102/207 (49%), Positives = 129/207 (62%), Gaps = 37/207 (17%)

Query: 766  SARTNRVKLRNLLA------------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            S+R+NRV  R L+A            + EG ++L+ +Q+K RKK +RF RS IH+WGL A
Sbjct: 1158 SSRSNRVNNRRLIADINAQKQALPTQSGEG-DVLRFNQLKKRKKPVRFARSAIHNWGLYA 1216

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATK GGIARF
Sbjct: 1217 EENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGGIARF 1276

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG---------------------EEITYN 912
            INHSC PNC  K+I V+G K+I IYA R I  G                     EE+TY+
Sbjct: 1277 INHSCTPNCTAKIIKVDGSKRIVIYALRDIERGGLIHSKLLLSIYDRLLTFCVDEELTYD 1336

Query: 913  YKFPLE---EKKIPCYCGSKKCHGSLN 936
            YKF  E   + +IPC CGS  C G LN
Sbjct: 1337 YKFEREWDSDDRIPCLCGSTGCKGFLN 1363


>gi|207344594|gb|EDZ71692.1| YHR119Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 1080

 Score =  193 bits (490), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 95/182 (52%), Positives = 124/182 (68%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 899  SSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDFI 958

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 959  AAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1018

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+PNC  K+I V G+++I IYA R IAA EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1019 CDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGF 1078

Query: 935  LN 936
            LN
Sbjct: 1079 LN 1080


>gi|396457972|ref|XP_003833599.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
 gi|312210147|emb|CBX90234.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
          Length = 1372

 Score =  192 bits (489), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 89/170 (52%), Positives = 121/170 (71%), Gaps = 4/170 (2%)

Query: 770  NRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
            N + +   ++ A+G + L+ +Q+K RKK ++F RS IH+WGL A E I A D +IEYVGE
Sbjct: 1204 NDINISKNMSGADG-DALRFNQLKKRKKLVKFDRSAIHNWGLYAEENIAANDMIIEYVGE 1262

Query: 830  LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
             +R +++D+RE +Y++ G+GSSYLFR+D+  V+DATK GGIARFINHSC PNC  K+I V
Sbjct: 1263 KVRQRVADLREAKYDQQGVGSSYLFRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRV 1322

Query: 890  EGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +  K+I IYA R I+  EE+TY+YKF  E     +IPC CGS  C G LN
Sbjct: 1323 DTSKRIVIYALRDISQDEELTYDYKFEREMDATDRIPCLCGSIGCKGFLN 1372


>gi|299117155|emb|CBN75119.1| histone methyltransferase [Ectocarpus siliculosus]
          Length = 2067

 Score =  192 bits (488), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 86/156 (55%), Positives = 117/156 (75%), Gaps = 3/156 (1%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            A LL    +++R+K+LRF RS +H WG+ A EPI A D VIEY GE+I   ++D RE +Y
Sbjct: 1912 AGLLNLDVLQSRRKKLRFGRSSVHAWGVFADEPIAAGDLVIEYRGEIIGNAVADKREKQY 1971

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E M IGS Y+FR+D+  VVDAT +G +ARFINHSC+P+C T++I+VEG KKI IYA+R +
Sbjct: 1972 EDMQIGSDYMFRVDEDTVVDATFKGSLARFINHSCDPSCTTRIITVEGSKKIVIYAERDV 2031

Query: 904  AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
            A GEE++Y+YKFP E     ++PC+CGS+KC G +N
Sbjct: 2032 AMGEELSYDYKFPPEPDEAARVPCHCGSEKCRGFIN 2067


>gi|406863054|gb|EKD16102.1| SET domain-containing protein [Marssonina brunnea f. sp.
            'multigermtubi' MB_m1]
          Length = 1280

 Score =  192 bits (487), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 87/155 (56%), Positives = 113/155 (72%), Gaps = 3/155 (1%)

Query: 785  ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
            ++L+ +Q++ R+K + F+RS IH WGL A+E I   D +IEYVGE +R  ++D+RE RY 
Sbjct: 1126 DVLRFNQLQKRQKLVNFERSAIHKWGLFAMENINQNDMIIEYVGEKVRQTVADLREHRYL 1185

Query: 845  KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
            K GIGSSYLFR+D+  V+DATK+GGIARFINHSC PNC  K+I+VE  K+I IYA R IA
Sbjct: 1186 KSGIGSSYLFRIDESTVIDATKKGGIARFINHSCMPNCTAKIITVEKGKRIVIYALRDIA 1245

Query: 905  AGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
              EE+TY+YKF  E     +IPC CG+  C G LN
Sbjct: 1246 QNEELTYDYKFEREIGSTDRIPCLCGTAACKGFLN 1280


>gi|453082196|gb|EMF10244.1| hypothetical protein SEPMUDRAFT_151237 [Mycosphaerella populorum
            SO2202]
          Length = 1254

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 94/185 (50%), Positives = 126/185 (68%), Gaps = 14/185 (7%)

Query: 766  SARTNRVKLRNL----------LAAAEGAEL-LKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+NRV  R L          LA+   A++ ++ +Q+K RKK ++F RS IH WGL A 
Sbjct: 1070 TSRSNRVNNRRLVNDINLQKQSLASTGDADVAIRFNQLKKRKKLVKFDRSAIHGWGLYAE 1129

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I   D +IEYVGE +R K++D+RE +Y+K G+GSSYLFR+ D  +VDATK+GGIARFI
Sbjct: 1130 ENIAINDLIIEYVGEKVRQKVADMREIKYDKQGVGSSYLFRMLDDEIVDATKKGGIARFI 1189

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKC 931
            NHSC+PNC  K+I VEG  +I IYA + I+  +E+TY+YKF  E     +IPC CGS  C
Sbjct: 1190 NHSCSPNCTAKIIKVEGTPRIVIYALKDISKNDELTYDYKFEREIGATDRIPCLCGSANC 1249

Query: 932  HGSLN 936
             G LN
Sbjct: 1250 KGFLN 1254


>gi|68473736|ref|XP_718971.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
            albicans SC5314]
 gi|68473945|ref|XP_718869.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
            albicans SC5314]
 gi|74586641|sp|Q5ABG1.1|SET1_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|46440662|gb|EAK99965.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
            albicans SC5314]
 gi|46440768|gb|EAL00070.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
            albicans SC5314]
          Length = 1040

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 124/179 (69%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A   A  G+E  +L  + +  RKK + F RS IH+WGL A+EPI A+
Sbjct: 862  SSRVNRANNRRFAADITAQIGSESDVLSLNALTKRKKPVTFARSAIHNWGLYAMEPIAAK 921

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR ++++ RE  Y K GIGSSYLFR+DD  V+DATK+GGIARFINH C+P
Sbjct: 922  EMIIEYVGERIRQQVAEHREKSYLKTGIGSSYLFRIDDNTVIDATKKGGIARFINHCCSP 981

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +C  K+I VEG+K+I IYA R I A EE+TY+YKF  E   E++I C CG+  C G LN
Sbjct: 982  SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1040


>gi|406607680|emb|CCH40952.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
          Length = 1071

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 93/178 (52%), Positives = 118/178 (66%), Gaps = 7/178 (3%)

Query: 766  SARTNRVKLRNLLAAA----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
            S+R NR   R             E+L  +Q+  RKK + F RS IH+WGL ALEPI  ++
Sbjct: 894  SSRVNRANNRRFAYEVSTFNNQNEILTLNQLNKRKKPVSFARSAIHNWGLYALEPIAQKE 953

Query: 822  FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
             +IEYVGE IR +++D RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH C P+
Sbjct: 954  MIIEYVGERIRQQVADFREKAYLKSGIGSSYLFRIDENTVIDATKKGGIARFINHCCQPS 1013

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            C  K+I VEGQK+I IYA R I A EE+TY+YKF  E    +++ C CG+  C G LN
Sbjct: 1014 CTAKIIKVEGQKRIVIYALRDIGANEELTYDYKFERETNDNERVRCLCGAPGCKGYLN 1071


>gi|238879404|gb|EEQ43042.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1040

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 124/179 (69%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A   A  G+E  +L  + +  RKK + F RS IH+WGL A+EPI A+
Sbjct: 862  SSRVNRANNRRFAADITAQIGSESDVLSLNALTKRKKPVTFARSAIHNWGLYAMEPIAAK 921

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR ++++ RE  Y K GIGSSYLFR+DD  V+DATK+GGIARFINH C+P
Sbjct: 922  EMIIEYVGERIRQQVAEHREKSYLKTGIGSSYLFRIDDNTVIDATKKGGIARFINHCCSP 981

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +C  K+I VEG+K+I IYA R I A EE+TY+YKF  E   E++I C CG+  C G LN
Sbjct: 982  SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1040


>gi|449295340|gb|EMC91362.1| hypothetical protein BAUCODRAFT_80239 [Baudoinia compniacensis UAMH
            10762]
          Length = 1279

 Score =  191 bits (485), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 87/153 (56%), Positives = 110/153 (71%), Gaps = 3/153 (1%)

Query: 787  LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
            ++ +Q+K RKK ++F RS IH WGL A E I   D +IEYVGE +R K++D+RE RYEK 
Sbjct: 1127 VRFNQLKKRKKLVKFDRSAIHGWGLYAEENITVNDLIIEYVGEKVRQKVADLRELRYEKQ 1186

Query: 847  GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
            G+GSSYLFR+ D  +VDATK+GGIARFINHSC+PNC  K+I VEG  +I IYA + I   
Sbjct: 1187 GVGSSYLFRMMDDEIVDATKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKN 1246

Query: 907  EEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            EE+TY+YKF  E     +IPC CG+  C G LN
Sbjct: 1247 EELTYDYKFEREYGSTDRIPCLCGTANCKGFLN 1279


>gi|170095481|ref|XP_001878961.1| histone methyltransferase [Laccaria bicolor S238N-H82]
 gi|164646265|gb|EDR10511.1| histone methyltransferase [Laccaria bicolor S238N-H82]
          Length = 144

 Score =  191 bits (484), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 87/144 (60%), Positives = 109/144 (75%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           + RKK LRF RS IHDWGL A+E I   + VIEYVGE+IR ++++ RE  YE+ GIGSSY
Sbjct: 1   QTRKKHLRFARSPIHDWGLYAMEKISKGEMVIEYVGEVIRAQVAEKREKTYERQGIGSSY 60

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           LFR+D+  VVDATK+G + R INHSC+PNC  K+I++ G+KKI IYAK+ I  G+EITY+
Sbjct: 61  LFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGDEITYD 120

Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
           Y FP E+ KI C CGS KC G LN
Sbjct: 121 YHFPFEQDKILCLCGSVKCRGFLN 144


>gi|401625463|gb|EJS43472.1| set1p [Saccharomyces arboricola H-6]
          Length = 1089

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 93/182 (51%), Positives = 122/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 908  SSRDNRASNRRFQQNIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 967

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              ++ +IEYVGE IR  ++++RE RY K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 968  ATKEMIIEYVGERIRQPVAEMRERRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHC 1027

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+PNC  K+I V G+++I IYA R I A EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1028 CDPNCTAKIIKVGGRRRIVIYALRDIGANEELTYDYKFEREQDDEERLPCLCGAPNCKGF 1087

Query: 935  LN 936
            LN
Sbjct: 1088 LN 1089


>gi|448520177|ref|XP_003868242.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis Co
            90-125]
 gi|380352581|emb|CCG22808.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis]
          Length = 1038

 Score =  190 bits (483), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 92/179 (51%), Positives = 122/179 (68%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLAAAEG-----AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A         +E++  + +  RKK + F RS IH+WGL A+EPI A+
Sbjct: 860  SSRVNRANNRRFAADITAQIGSESEVMSLNALTKRKKPVTFARSTIHNWGLYAMEPIAAK 919

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR ++++ RE  Y + GIGSSYLFR+D+  V+DATK+GGIARFINH CNP
Sbjct: 920  EMIIEYVGERIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGIARFINHCCNP 979

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +C  K+I VEG+K+I IYA R I A EE+TY+YKF  E   E++I C CG+  C G LN
Sbjct: 980  SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1038


>gi|401842102|gb|EJT44375.1| SET1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 1087

 Score =  190 bits (483), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 93/182 (51%), Positives = 122/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R      E         +ELL  +Q+  RKK + F RS IH+WGL AL+ I
Sbjct: 906  SSRDNRASNRRFQQNIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSI 965

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR  ++++RE RY   GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 966  AAKEMIIEYVGERIRQPVAEMRERRYLMNGIGSSYLFRVDESTVIDATKKGGIARFINHC 1025

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+PNC  K+I V G+++I IYA R I A EE+TY+YKF  E   E+++PC CG+  C G 
Sbjct: 1026 CDPNCTAKIIKVGGRRRIVIYALRDIGANEELTYDYKFEREQDDEERLPCLCGASNCKGF 1085

Query: 935  LN 936
            LN
Sbjct: 1086 LN 1087


>gi|354544237|emb|CCE40960.1| hypothetical protein CPAR2_109980 [Candida parapsilosis]
          Length = 1042

 Score =  189 bits (481), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 92/179 (51%), Positives = 122/179 (68%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLAAAEG-----AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A         +E++  + +  RKK + F RS IH+WGL A+EPI A+
Sbjct: 864  SSRVNRANNRRFAADITAQIGSESEVMSLNALTKRKKPVTFARSTIHNWGLYAMEPIAAK 923

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR ++++ RE  Y + GIGSSYLFR+D+  V+DATK+GGIARFINH CNP
Sbjct: 924  EMIIEYVGERIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGIARFINHCCNP 983

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +C  K+I VEG+K+I IYA R I A EE+TY+YKF  E   E++I C CG+  C G LN
Sbjct: 984  SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1042


>gi|452980621|gb|EME80382.1| hypothetical protein MYCFIDRAFT_204567 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 1200

 Score =  189 bits (481), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 91/186 (48%), Positives = 123/186 (66%), Gaps = 15/186 (8%)

Query: 766  SARTNRVKLRNLL------------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            ++R+NRV  R L+            + ++    ++ +Q+K RKK ++F RS IH WGL A
Sbjct: 1015 TSRSNRVNNRRLVNDINLQKQAFATSNSDADVAIRFNQLKKRKKLVKFDRSAIHGWGLYA 1074

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I   D +IEYVGE +R K++D+RE +Y+K G+GSSYLFR+ D  +VDATK+GGIARF
Sbjct: 1075 EENIAVNDLIIEYVGEKVRQKVADMREIKYDKQGVGSSYLFRMIDDEIVDATKKGGIARF 1134

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
            INHSC+PNC  K+I VEG  +I IYA + I   +E+TY+YKF  E     +IPC CGS  
Sbjct: 1135 INHSCDPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDYKFEREYGSTDRIPCLCGSAN 1194

Query: 931  CHGSLN 936
            C G LN
Sbjct: 1195 CKGFLN 1200


>gi|452837203|gb|EME39145.1| hypothetical protein DOTSEDRAFT_75034 [Dothistroma septosporum NZE10]
          Length = 1275

 Score =  189 bits (481), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 92/186 (49%), Positives = 123/186 (66%), Gaps = 15/186 (8%)

Query: 766  SARTNRVKLRNLL------------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVA 813
            ++R+NRV  R L+            A ++    ++ +Q+K RKK ++F RS IH WGL A
Sbjct: 1090 TSRSNRVNNRRLVNDINLQKQSFATANSDADVAIRFNQLKKRKKLVKFDRSAIHGWGLYA 1149

Query: 814  LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARF 873
             E I   + +IEYVGE +R K++D+RE +YEK G+GSSYLFR+ D  +VDATK+GGIARF
Sbjct: 1150 EENIGINELIIEYVGEKVRQKVADMREIKYEKQGVGSSYLFRMMDDEIVDATKKGGIARF 1209

Query: 874  INHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKK 930
            INHSC+PNC  K+I VEG  +I IYA + I   +E+TY+YKF  E     +IPC CGS  
Sbjct: 1210 INHSCDPNCTAKIIKVEGTPRIVIYALKDIYKNDELTYDYKFEREIGSTDRIPCLCGSAN 1269

Query: 931  CHGSLN 936
            C G LN
Sbjct: 1270 CKGFLN 1275


>gi|241948091|ref|XP_002416768.1| COMPASS complex histone methyltransferase subunit, putative;
            histone-lysine n-methyltransferase, h3 lysine-4 specific,
            putative [Candida dubliniensis CD36]
 gi|223640106|emb|CAX44352.1| COMPASS complex histone methyltransferase subunit, putative [Candida
            dubliniensis CD36]
          Length = 1032

 Score =  189 bits (479), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 95/179 (53%), Positives = 124/179 (69%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A   A  G+E  +L  + +  RKK + F RS IH+WGL A+EPI A+
Sbjct: 854  SSRVNRANNRRFAADITAQIGSESDVLSLNALTKRKKPVTFARSAIHNWGLYAMEPIAAK 913

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR ++++ RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH C+P
Sbjct: 914  EMIIEYVGERIRQQVAEHREKSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 973

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +C  K+I VEG+K+I IYA R I A EE+TY+YKF  E   E++I C CG+  C G LN
Sbjct: 974  SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1032


>gi|294658913|ref|XP_461254.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
 gi|218511781|sp|Q6BKL7.2|SET1_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|202953480|emb|CAG89643.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
          Length = 1088

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 94/182 (51%), Positives = 120/182 (65%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R   A             ++L  + +  RKK + F RS IH+WGL ALEPI
Sbjct: 907  SSRVNRANNRRFAADISAQKQMLGSETDILNLNALTKRKKPVSFARSAIHNWGLYALEPI 966

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR ++++ RE  Y K GIGSSYLFR+D+  VVDATK+GGIARFINH 
Sbjct: 967  AAKEMIIEYVGESIRQQVAEHRERSYLKTGIGSSYLFRIDENTVVDATKKGGIARFINHC 1026

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGS 934
            CNP+C  K+I VEG+K+I IYA R I A EE+TY+YKF  E    ++I C CG+  C G 
Sbjct: 1027 CNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFEKETNDAERIRCLCGAPGCKGY 1086

Query: 935  LN 936
            LN
Sbjct: 1087 LN 1088


>gi|388853505|emb|CCF52904.1| related to regulatory protein SET1 [Ustilago hordei]
          Length = 1489

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 88/177 (49%), Positives = 119/177 (67%), Gaps = 11/177 (6%)

Query: 766  SARTNRVKLRNLL--------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            SAR NR   R L+          A   ++ K +Q++ RKK+L+F +S IHDWGL A+E I
Sbjct: 1309 SARNNRADSRRLVLGIEQHKRETATDTDIFKFNQLRTRKKQLKFAKSPIHDWGLYAMEYI 1368

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A D VIEYVGE++R +++D RE +YE+ G  S+YLFR+DD  VVDAT +G IAR +NH 
Sbjct: 1369 PAGDMVIEYVGEMVRQQVADNREKQYERQGNFSTYLFRVDDDLVVDATHKGNIARLMNHC 1428

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYCGSKKC 931
            C PNC  K+++V G+K+I ++AK  I AGEE+TY+YKF     +E  IPC CGS  C
Sbjct: 1429 CTPNCNAKILTVNGEKRIVLFAKSPIKAGEELTYDYKFQSSADDEDAIPCLCGSDGC 1485


>gi|452820772|gb|EME27810.1| histone-lysine N-methyltransferase isoform 2 [Galdieria
           sulphuraria]
          Length = 797

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 92/178 (51%), Positives = 122/178 (68%), Gaps = 7/178 (3%)

Query: 766 SARTNRVKLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
           SAR +R + R L           ++ L  SQ+K+RKKR+RF RS IH WGL A++ IE  
Sbjct: 620 SARESRQEYRKLRQGVGELGIRHSDSLIISQLKSRKKRVRFGRSIIHGWGLYAMQDIEPN 679

Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
           +F+IEYVGE+IR KISD RE RY + GIG SY+FRLD+  ++DAT++G +ARF+NHSC  
Sbjct: 680 EFIIEYVGEIIRQKISDEREKRYFRQGIGDSYMFRLDEDQIIDATRKGSVARFVNHSCES 739

Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
           N   K+I+++  KKI  Y+KR I AGEEITY+YKF  E++  KI C CG+  C   LN
Sbjct: 740 NAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYKFNTEDENNKILCLCGAPTCRKFLN 797


>gi|452820773|gb|EME27811.1| histone-lysine N-methyltransferase isoform 1 [Galdieria
           sulphuraria]
          Length = 769

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 92/178 (51%), Positives = 122/178 (68%), Gaps = 7/178 (3%)

Query: 766 SARTNRVKLRNLLAAA-----EGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
           SAR +R + R L           ++ L  SQ+K+RKKR+RF RS IH WGL A++ IE  
Sbjct: 592 SARESRQEYRKLRQGVGELGIRHSDSLIISQLKSRKKRVRFGRSIIHGWGLYAMQDIEPN 651

Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
           +F+IEYVGE+IR KISD RE RY + GIG SY+FRLD+  ++DAT++G +ARF+NHSC  
Sbjct: 652 EFIIEYVGEIIRQKISDEREKRYFRQGIGDSYMFRLDEDQIIDATRKGSVARFVNHSCES 711

Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
           N   K+I+++  KKI  Y+KR I AGEEITY+YKF  E++  KI C CG+  C   LN
Sbjct: 712 NAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYKFNTEDENNKILCLCGAPTCRKFLN 769


>gi|313228693|emb|CBY17844.1| unnamed protein product [Oikopleura dioica]
          Length = 473

 Score =  187 bits (475), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 140/232 (60%), Gaps = 12/232 (5%)

Query: 711 SDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV-----NASQWANGKGL 765
           S GCAR+   G+  + W        + R++  +      L PE      +A+    G   
Sbjct: 248 SSGCARTE--GYFKYTWEEKL----QMRLKLHRLGQLTDLKPEATSQGNDAAGRKEGVQA 301

Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
           + R  R   R +      + +++ +Q+  RKK+++F +S IH WGL ALE I A+DFVIE
Sbjct: 302 TDRETRNYHRRVADEFSDSSIMRYNQLTMRKKKVKFMKSAIHGWGLFALEDIPADDFVIE 361

Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
           YVG+ IR+ I+D RE  Y K GIGSSYLFRLD  +V+DATK G  +RFINH C PNC  K
Sbjct: 362 YVGQKIRIGIADNREEEYTKCGIGSSYLFRLDTTHVIDATKHGSSSRFINHCCVPNCIAK 421

Query: 886 VIS-VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           VI+  +G+KKI IY+K  IA   E+TY+YKFP+E++KI C C  + C G LN
Sbjct: 422 VINGADGEKKIIIYSKAPIAKNAEVTYDYKFPIEDEKIRCLCFHENCRGYLN 473


>gi|344304500|gb|EGW34732.1| hypothetical protein SPAPADRAFT_133304 [Spathaspora passalidarum NRRL
            Y-27907]
          Length = 1060

 Score =  187 bits (475), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 92/179 (51%), Positives = 121/179 (67%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLA-----AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A         +++L  + +  RKK + F RS IH+WGL A+EPI A+
Sbjct: 882  SSRVNRANNRRFAADITAQIGNESDILSLNALTKRKKPVTFARSAIHNWGLYAMEPIAAK 941

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR +++  RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH C+P
Sbjct: 942  EMIIEYVGERIRQQVAQHREESYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 1001

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +C  K+I VEG+K+I IYA R I A EE+TY+YKF  E   E++I C CG+  C G LN
Sbjct: 1002 SCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1060


>gi|255730355|ref|XP_002550102.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
 gi|240132059|gb|EER31617.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
          Length = 1056

 Score =  187 bits (474), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 96/179 (53%), Positives = 122/179 (68%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A   A  G E  +L  + +  RKK + F RS IH+WGL ALEPI A+
Sbjct: 878  SSRVNRANNRRFAADITAQIGTESDVLSLNALTKRKKPVTFARSAIHNWGLYALEPIAAK 937

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR ++++ RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH C+P
Sbjct: 938  EMIIEYVGERIRQQVAEHREKSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 997

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +C  K+I VEG K+I IYA R I A EE+TY+YKF  E   E++I C CG+  C G LN
Sbjct: 998  SCTAKIIKVEGIKRIVIYALRDIEANEELTYDYKFERETNDEERIRCLCGAPGCKGYLN 1056


>gi|443894422|dbj|GAC71770.1| histone H3 (Lys4) methyltransferase complex, subunit SET1 [Pseudozyma
            antarctica T-34]
          Length = 1366

 Score =  186 bits (473), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 89/184 (48%), Positives = 122/184 (66%), Gaps = 12/184 (6%)

Query: 760  ANGKGL-SARTNRVKLRNLLAAAE--------GAELLKASQVKARKKRLRFQRSKIHDWG 810
            AN + L SAR NR   R L+   E          ++ K +Q++ RKK+L+F +S IHDWG
Sbjct: 1179 ANAQVLQSARNNRADSRRLVLGIEQHKRETATDTDMFKFNQLRTRKKQLKFAKSPIHDWG 1238

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
            L A+E I A D VIEYVGE++R +++D RE +YE+ G  S+YLFR+DD  VVDAT +G I
Sbjct: 1239 LYAMELIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRVDDDLVVDATHKGNI 1298

Query: 871  ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYCG 927
            AR +NH C PNC  K++++ G+K+I ++AK  I  GEE+TY+YKF     +E  IPC CG
Sbjct: 1299 ARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRPGEELTYDYKFQSSADDEDAIPCLCG 1358

Query: 928  SKKC 931
            S  C
Sbjct: 1359 SPGC 1362


>gi|343429488|emb|CBQ73061.1| related to regulatory protein SET1 [Sporisorium reilianum SRZ2]
          Length = 1453

 Score =  186 bits (473), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 87/177 (49%), Positives = 119/177 (67%), Gaps = 11/177 (6%)

Query: 766  SARTNRVKLRNLL--------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            SAR NR   R L+          A   ++ K +Q++ RKK+L+F +S IHDWGL A+E I
Sbjct: 1273 SARNNRADSRRLVLGIEQHKRETATDTDMFKFNQLRTRKKQLKFAKSPIHDWGLYAMEFI 1332

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A D VIEYVGE++R +++D RE +YE+ G  S+YLFR+DD  VVDAT +G IAR +NH 
Sbjct: 1333 PAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRVDDDLVVDATHKGNIARLMNHC 1392

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYCGSKKC 931
            C PNC  K++++ G+K+I ++AK  I AGEE+TY+YKF     +E  IPC CGS  C
Sbjct: 1393 CTPNCNAKILTLNGEKRIVLFAKSPIRAGEELTYDYKFQSSADDEDAIPCLCGSPGC 1449


>gi|190344535|gb|EDK36223.2| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1055

 Score =  186 bits (473), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 93/182 (51%), Positives = 122/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R L+A             ++L  + +  RKK + F RS IH+WGL ALE I
Sbjct: 874  SSRVNRANNRRLVADISAQKQMLGSETDILDLNALTKRKKPVTFARSAIHNWGLYALESI 933

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR ++++ RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 934  AAKEMIIEYVGESIRQQVAEHREKSYLKTGIGSSYLFRIDENSVIDATKKGGIARFINHC 993

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            CNP+C  K+I VEG+K+I IYA R I A EE+TY+YKF  E   +++I C CG+  C G 
Sbjct: 994  CNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDDERIRCLCGAPGCKGY 1053

Query: 935  LN 936
            LN
Sbjct: 1054 LN 1055


>gi|146422003|ref|XP_001486944.1| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1055

 Score =  186 bits (473), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 93/182 (51%), Positives = 122/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R L+A             ++L  + +  RKK + F RS IH+WGL ALE I
Sbjct: 874  SSRVNRANNRRLVADISAQKQMLGSETDILDLNALTKRKKPVTFARSAIHNWGLYALESI 933

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR ++++ RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 934  AAKEMIIEYVGESIRQQVAEHREKSYLKTGIGSSYLFRIDENSVIDATKKGGIARFINHC 993

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            CNP+C  K+I VEG+K+I IYA R I A EE+TY+YKF  E   +++I C CG+  C G 
Sbjct: 994  CNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETNDDERIRCLCGAPGCKGY 1053

Query: 935  LN 936
            LN
Sbjct: 1054 LN 1055


>gi|313228694|emb|CBY17845.1| unnamed protein product [Oikopleura dioica]
          Length = 1034

 Score =  186 bits (473), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 102/232 (43%), Positives = 140/232 (60%), Gaps = 12/232 (5%)

Query: 711  SDGCARSSISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEV-----NASQWANGKGL 765
            S GCAR+   G+  + W        + R++  +      L PE      +A+    G   
Sbjct: 809  SSGCARTE--GYFKYTWEEKL----QMRLKLHRLGQLTDLKPEATSQGNDAAGRKEGVQA 862

Query: 766  SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
            + R  R   R +      + +++ +Q+  RKK+++F +S IH WGL ALE I A+DFVIE
Sbjct: 863  TDRETRNYHRRVADEFSDSSIMRYNQLTMRKKKVKFMKSAIHGWGLFALEDIPADDFVIE 922

Query: 826  YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
            YVG+ IR+ I+D RE  Y K GIGSSYLFRLD  +V+DATK G  +RFINH C PNC  K
Sbjct: 923  YVGQKIRIGIADNREEEYTKCGIGSSYLFRLDTTHVIDATKHGSSSRFINHCCVPNCIAK 982

Query: 886  VIS-VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            VI+  +G+KKI IY+K  IA   E+TY+YKFP+E++KI C C  + C G LN
Sbjct: 983  VINGADGEKKIIIYSKAPIAKNAEVTYDYKFPIEDEKIRCLCFHENCRGYLN 1034


>gi|71015569|ref|XP_758824.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
 gi|74702458|sp|Q4PB36.1|SET1_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|46098614|gb|EAK83847.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
          Length = 1468

 Score =  185 bits (470), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 87/177 (49%), Positives = 119/177 (67%), Gaps = 11/177 (6%)

Query: 766  SARTNRVKLRNLL--------AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            SAR NR   R L+          A   ++ K +Q++ RKK+L+F +S IHDWGL A+E I
Sbjct: 1288 SARNNRADSRRLVLGIEQHKRETATDTDIFKFNQLRTRKKQLKFAKSPIHDWGLYAMELI 1347

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A D VIEYVGE++R +++D RE +YE+ G  S+YLFR+DD  VVDAT +G IAR +NH 
Sbjct: 1348 PAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRVDDDLVVDATHKGNIARLMNHC 1407

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYCGSKKC 931
            C PNC  K++++ G+K+I ++AK  I AGEE+TY+YKF     +E  IPC CGS  C
Sbjct: 1408 CTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQSSADDEDAIPCLCGSPGC 1464


>gi|150866258|ref|XP_001385792.2| histone methyltransferase involved in gene regulation
            [Scheffersomyces stipitis CBS 6054]
 gi|149387514|gb|ABN67763.2| histone methyltransferase involved in gene regulation
            [Scheffersomyces stipitis CBS 6054]
          Length = 1055

 Score =  185 bits (470), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 92/179 (51%), Positives = 119/179 (66%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLAAAEG-----AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A          E+L  + +  RKK + F RS IH+WGL ALEPI A+
Sbjct: 877  SSRVNRANNRRFAADITAQLGNETEVLSLNTLTKRKKPVSFARSAIHNWGLYALEPIAAK 936

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR ++++ RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH C+P
Sbjct: 937  EMIIEYVGESIRQQVAEHREKSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 996

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGSLN 936
            +C  K+I V+ QK+I IYA R I A EE+TY+YKF  E    ++I C CG+  C G LN
Sbjct: 997  SCTAKIIKVDNQKRIVIYALRDIDANEELTYDYKFERETNDAERIRCLCGAPGCKGYLN 1055


>gi|344228738|gb|EGV60624.1| histone H3-K4 methyltransferase Set1 [Candida tenuis ATCC 10573]
          Length = 1037

 Score =  185 bits (470), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 94/182 (51%), Positives = 123/182 (67%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLA------AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R   A       A G+E  +L  + +  RKK + F RS IH+WGL ALEPI
Sbjct: 856  SSRVNRANNRRFAADISAQKQAIGSETDILSLNTLAKRKKPVSFARSAIHNWGLYALEPI 915

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR ++++ RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 916  AAKEMIIEYVGESIRQQVAEHRERSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHC 975

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+P+C  K+I V+G+K+I IYA R I A EE+TY+YKF  E    ++I C CG+  C G 
Sbjct: 976  CSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERETNDSERIRCLCGAPGCKGY 1035

Query: 935  LN 936
            LN
Sbjct: 1036 LN 1037


>gi|388580816|gb|EIM21128.1| histone methyltransferase, partial [Wallemia sebi CBS 633.66]
          Length = 144

 Score =  185 bits (469), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 82/144 (56%), Positives = 108/144 (75%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           + RKK LRF +S IH WGL + + I   + VIEYVGE++R +++D RE  YEK GIGSSY
Sbjct: 1   RTRKKELRFAKSSIHSWGLYSCQVIPKGEMVIEYVGEVVRQQVADRREKAYEKQGIGSSY 60

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           LF++DD  +VDAT +G +AR INH C PNC  K+I++ G+KKI IYAK  I+ G+EITY+
Sbjct: 61  LFKIDDDNIVDATMKGSVARLINHCCQPNCTAKIITILGEKKIIIYAKTEISPGDEITYD 120

Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
           Y FP+E++KIPC CG + C GSLN
Sbjct: 121 YHFPIEDEKIPCLCGVEGCRGSLN 144


>gi|260944792|ref|XP_002616694.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
 gi|238850343|gb|EEQ39807.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
          Length = 469

 Score =  184 bits (467), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 93/189 (49%), Positives = 124/189 (65%), Gaps = 12/189 (6%)

Query: 760 ANGKGL-SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWG 810
           A G  L S+R NR   R   A            +++L  + +  RKK + + RS IH+WG
Sbjct: 281 AGGNSLQSSRVNRANNRRFAADISAQKQILSTESDILNLNALTKRKKPVSYARSAIHNWG 340

Query: 811 LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGI 870
           L ALEPI A++ +IEYVGE IR ++++ RE  Y + GIGSSYLFR+D+  V+DATK+GGI
Sbjct: 341 LYALEPIAAKEMIIEYVGESIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGI 400

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCG 927
           ARFINH CNP+C  K+I V+G+K+I IYA R I A EE+TY+YKF  E    ++I C CG
Sbjct: 401 ARFINHCCNPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERETNDAERIRCLCG 460

Query: 928 SKKCHGSLN 936
           +  C G LN
Sbjct: 461 APGCKGYLN 469


>gi|149243887|ref|XP_001526541.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
 gi|146448935|gb|EDK43191.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
          Length = 1156

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 92/179 (51%), Positives = 123/179 (68%), Gaps = 8/179 (4%)

Query: 766  SARTNRVKLRNLLA---AAEGAE--LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            S+R NR   R   A   A  G E  +L  + +  RKK + F RS IH+WGL A+EPI A+
Sbjct: 978  SSRVNRANNRRFAADITAQIGTESDVLSLNALTKRKKPVTFARSSIHNWGLYAMEPIAAK 1037

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
            + +IEYVGE IR ++++ RE  Y + GIGSSYLFR+D+  V+DATK+GGIARFINH C+P
Sbjct: 1038 EMIIEYVGERIRQQVAEHREKSYLRTGIGSSYLFRIDENTVIDATKKGGIARFINHCCSP 1097

Query: 881  NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            +C  K+I V+G+K+I IYA R I A EE+TY+YKF  E   +++I C CG+  C G LN
Sbjct: 1098 SCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERETNDDERIRCLCGAPGCKGFLN 1156


>gi|224086152|ref|XP_002307834.1| SET domain protein [Populus trichocarpa]
 gi|222857283|gb|EEE94830.1| SET domain protein [Populus trichocarpa]
          Length = 90

 Score =  184 bits (466), Expect = 2e-43,   Method: Composition-based stats.
 Identities = 85/90 (94%), Positives = 86/90 (95%)

Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
           RE  YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC PNCYTKVISVEGQKKIFIY
Sbjct: 1   RERLYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIY 60

Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGS 928
           AKRHIAAGEEITYNYKFPLE+KKIPC CGS
Sbjct: 61  AKRHIAAGEEITYNYKFPLEDKKIPCNCGS 90


>gi|448106516|ref|XP_004200765.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
 gi|448109616|ref|XP_004201396.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
 gi|359382187|emb|CCE81024.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
 gi|359382952|emb|CCE80259.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
          Length = 1062

 Score =  183 bits (465), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 91/182 (50%), Positives = 120/182 (65%), Gaps = 11/182 (6%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            S+R NR   R   A             ++L  + +  RKK + F RS IH+WGL ALEPI
Sbjct: 881  SSRVNRANNRRFAADISAQKQMIGSETDILNLNALTKRKKPVSFARSAIHNWGLYALEPI 940

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
             A++ +IEYVGE IR ++++ RE  Y K GIGSSYLFR+D+  V+DATK+GGIARFINH 
Sbjct: 941  AAKEMIIEYVGESIRQQVAEHRERSYLKTGIGSSYLFRIDENTVIDATKKGGIARFINHC 1000

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
            C+P+C  K+I V+G+K+I IYA R I   EE+TY+YKF  E   E++I C CG+  C G 
Sbjct: 1001 CSPSCTAKIIKVDGKKRIVIYALRDIDKNEELTYDYKFERETNDEERIRCLCGAPGCKGY 1060

Query: 935  LN 936
            LN
Sbjct: 1061 LN 1062


>gi|321257952|ref|XP_003193759.1| hypothetical protein CGB_D6310C [Cryptococcus gattii WM276]
 gi|317460229|gb|ADV21972.1| conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 1477

 Score =  180 bits (456), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 82/156 (52%), Positives = 110/156 (70%), Gaps = 3/156 (1%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            ++L   +Q++ RKK+LRF RS I  +GL A+E I A + V EYVG+L+R  ++D+RE RY
Sbjct: 1322 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1381

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
             K GIGSSYLFR+D+  V DAT +G ++R INHSC+P+   K+I V GQ KI IYA+R +
Sbjct: 1382 MKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTL 1441

Query: 904  AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
              GEEI Y+YKFPLE     ++PC CG+  C G LN
Sbjct: 1442 YPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 1477


>gi|357139465|ref|XP_003571302.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-4
           specific-like [Brachypodium distachyon]
          Length = 319

 Score =  179 bits (455), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 80/145 (55%), Positives = 108/145 (74%), Gaps = 1/145 (0%)

Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
           + AR+K+LR +RSKIH+ G++A+  I   + ++EY+GE +   ++D+R  RYEK G G  
Sbjct: 176 LSARRKKLRVERSKIHELGVMAVTDIMKGELIVEYIGERMPKWVADLRGLRYEKAGKGD- 234

Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
           Y F++D G V+DAT RGGIAR+INHSC PNC T+VI   GQ++IFIYA + I AG E+TY
Sbjct: 235 YFFKIDAGLVIDATLRGGIARYINHSCEPNCETRVILSNGQRRIFIYANQKIKAGTELTY 294

Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
           +YKFP EE KIPC CGSK+C  S+N
Sbjct: 295 DYKFPFEENKIPCSCGSKRCRKSMN 319


>gi|338819402|sp|P0CO27.1|SET1_CRYNB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
          Length = 1469

 Score =  179 bits (455), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 82/156 (52%), Positives = 110/156 (70%), Gaps = 3/156 (1%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            ++L   +Q++ RKK+LRF RS I  +GL A+E I A + V EYVG+L+R  ++D+RE RY
Sbjct: 1314 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1373

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
             K GIGSSYLFR+D+  V DAT +G ++R INHSC+P+   K+I V GQ KI IYA+R +
Sbjct: 1374 LKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTL 1433

Query: 904  AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
              GEEI Y+YKFPLE     ++PC CG+  C G LN
Sbjct: 1434 YPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 1469


>gi|58266804|ref|XP_570558.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
 gi|338819403|sp|P0CO26.1|SET1_CRYNJ RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|57226791|gb|AAW43251.1| conserved hypothetical protein [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 1469

 Score =  179 bits (455), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 82/156 (52%), Positives = 110/156 (70%), Gaps = 3/156 (1%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            ++L   +Q++ RKK+LRF RS I  +GL A+E I A + V EYVG+L+R  ++D+RE RY
Sbjct: 1314 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1373

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
             K GIGSSYLFR+D+  V DAT +G ++R INHSC+P+   K+I V GQ KI IYA+R +
Sbjct: 1374 LKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTL 1433

Query: 904  AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
              GEEI Y+YKFPLE     ++PC CG+  C G LN
Sbjct: 1434 YPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 1469


>gi|134110798|ref|XP_775863.1| hypothetical protein CNBD2720 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50258529|gb|EAL21216.1| hypothetical protein CNBD2720 [Cryptococcus neoformans var.
            neoformans B-3501A]
          Length = 1483

 Score =  179 bits (454), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 82/156 (52%), Positives = 110/156 (70%), Gaps = 3/156 (1%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            ++L   +Q++ RKK+LRF RS I  +GL A+E I A + V EYVG+L+R  ++D+RE RY
Sbjct: 1328 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1387

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
             K GIGSSYLFR+D+  V DAT +G ++R INHSC+P+   K+I V GQ KI IYA+R +
Sbjct: 1388 LKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIVIYAERTL 1447

Query: 904  AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
              GEEI Y+YKFPLE     ++PC CG+  C G LN
Sbjct: 1448 YPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 1483


>gi|345490501|ref|XP_001604610.2| PREDICTED: hypothetical protein LOC100121025 [Nasonia vitripennis]
          Length = 1677

 Score =  179 bits (453), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 92/190 (48%), Positives = 120/190 (63%), Gaps = 31/190 (16%)

Query: 751  GPEVNASQWANGK--GLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
            GP+VN S+   GK   LS  AR+N+ +L         ++LLK +Q+K             
Sbjct: 1515 GPKVN-SKALTGKMQALSREARSNQRRLLTAFGIDTDSDLLKFNQLKXXX---------- 1563

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
                            VIEYVG++IR  ++D+RE +YE  GIGSSYLFR+D   ++DATK
Sbjct: 1564 ----------------VIEYVGQMIRPIVADLRETQYEATGIGSSYLFRIDLDTIIDATK 1607

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
             G +ARFINHSCNPNCY KVI++E QKKI IY+K+ I   EEITY+YKFPLE+ KIPC C
Sbjct: 1608 CGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLC 1667

Query: 927  GSKKCHGSLN 936
            G+ +C G+LN
Sbjct: 1668 GAPQCRGTLN 1677


>gi|46126135|ref|XP_387621.1| hypothetical protein FG07445.1 [Gibberella zeae PH-1]
          Length = 1252

 Score =  178 bits (451), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 91/182 (50%), Positives = 117/182 (64%), Gaps = 22/182 (12%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1082 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1141

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +IS+IRE RY K GIGSSYLFR+DD  V+DATK+GGIARFINH 
Sbjct: 1142 AKDDMIIEYVGEQVRQQISEIRENRYLKSGIGSSYLFRIDDNTVIDATKKGGIARFINH- 1200

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGS 934
                      S EG K+I IYA R IA  EE+TY+YKF  E     +IPC CG+  C G 
Sbjct: 1201 ----------SFEGSKRIVIYALRDIALNEELTYDYKFEREIGSTDRIPCLCGTAACKGF 1250

Query: 935  LN 936
            LN
Sbjct: 1251 LN 1252


>gi|224003381|ref|XP_002291362.1| set domain-containing protein [Thalassiosira pseudonana CCMP1335]
 gi|220973138|gb|EED91469.1| set domain-containing protein, partial [Thalassiosira pseudonana
           CCMP1335]
          Length = 189

 Score =  176 bits (445), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 85/178 (47%), Positives = 121/178 (67%), Gaps = 4/178 (2%)

Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
           G+ + A   R+ L++L +  + ++ +     + R++RLRF +S IH WG+ A E I A D
Sbjct: 7   GRAVRASQRRM-LKSLASLGDASKTVDRLVSRDREERLRFDKSLIHGWGVFAEEAINAGD 65

Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
            +IEY GELI   ++D RE  YE+  IGS Y+FR+D+ +V DATK G +ARFIN SC PN
Sbjct: 66  MIIEYRGELIGNAVADKRELEYERAKIGSDYMFRIDEFWVCDATKLGNVARFINASCCPN 125

Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
           CYTK+I+    K+I I+AK++I  GEE+ Y+YKF +E   EK+IPCYCGS +C G +N
Sbjct: 126 CYTKIITANETKRIVIFAKKNIQRGEELCYDYKFQMEHEKEKRIPCYCGSSECRGFMN 183


>gi|443918070|gb|ELU38649.1| Setd1a protein [Rhizoctonia solani AG-1 IA]
          Length = 425

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 81/132 (61%), Positives = 101/132 (76%)

Query: 780 AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIR 839
           AA+  ++LK +Q++ RKK+L F RS IHDWGL A EPI A D VIEYVGE+IR +++D R
Sbjct: 198 AADAGDVLKFNQLRTRKKQLTFARSPIHDWGLYAAEPIPAGDMVIEYVGEVIRQQVADKR 257

Query: 840 ECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYA 899
           E  YEK GIGSSYLFR+DD  VVDATK+G + R INH C PNC  K+I++ G+KKI IYA
Sbjct: 258 EKYYEKTGIGSSYLFRVDDDSVVDATKKGNLGRLINHCCAPNCTAKIITINGEKKIVIYA 317

Query: 900 KRHIAAGEEITY 911
           K +I  G+EITY
Sbjct: 318 KTNIDVGDEITY 329


>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
          Length = 4387

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 78/138 (56%), Positives = 95/138 (68%), Gaps = 4/138 (2%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM----GIGSSYLFRLDD 858
            RS IH  GL     IE  D VIEY GE+IR  ++D RE +YE M    G+G  Y+FR+DD
Sbjct: 4250 RSHIHGRGLFCKRDIEEGDMVIEYAGEVIRAVLADQREKKYEAMSGRRGVGGCYMFRIDD 4309

Query: 859  GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE 918
              VVDAT +G  ARFINHSC+PNCY++V+ + G K I I+A R I  GEE+TY+YKFP E
Sbjct: 4310 NLVVDATLKGNAARFINHSCDPNCYSRVVDIHGHKHILIFALRRITIGEELTYDYKFPFE 4369

Query: 919  EKKIPCYCGSKKCHGSLN 936
            E KIPC CG+KKC   LN
Sbjct: 4370 EVKIPCTCGAKKCRKYLN 4387


>gi|406694364|gb|EKC97692.1| hypothetical protein A1Q2_08004 [Trichosporon asahii var. asahii CBS
            8904]
          Length = 1218

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 85/171 (49%), Positives = 112/171 (65%), Gaps = 4/171 (2%)

Query: 769  TNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVG 828
            TNR        AAE ++L   +Q++ RKK+LRF RS I  +GL A+E I+  + V EYVG
Sbjct: 1049 TNRKAGGGTATAAE-SDLFMFNQLRIRKKQLRFARSAIEGYGLYAMETIQPGEMVCEYVG 1107

Query: 829  ELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
            EL R  ++D+RE RY K GIGSSYLFR+D   V DAT +G ++R INHSCNP    K+I+
Sbjct: 1108 ELCRSSVADVREQRYTKQGIGSSYLFRIDGDIVCDATFKGSVSRLINHSCNPTANAKIIN 1167

Query: 889  VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
            + G  KI IYAKR +  G+E+TY+Y FPLE+    ++ C CG   C G LN
Sbjct: 1168 INGHNKIVIYAKRTLYPGDEVTYSYNFPLEQDESLRVRCLCGEPTCLGFLN 1218


>gi|392580378|gb|EIW73505.1| hypothetical protein TREMEDRAFT_24920 [Tremella mesenterica DSM
           1558]
          Length = 180

 Score =  174 bits (440), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 82/161 (50%), Positives = 112/161 (69%), Gaps = 3/161 (1%)

Query: 779 AAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
           +A   ++L   +Q++ RKK+LRF RS I  +GL A+E I+  + V EYVGE+ R  +++I
Sbjct: 20  SATAESDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIQPGEMVCEYVGEICRSAVAEI 79

Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
           RE RY + GIGSSYLFR+D+  V DAT +G ++R INHSC+P+   K+IS+ GQ KI IY
Sbjct: 80  REQRYLRQGIGSSYLFRIDNDLVCDATFKGSVSRLINHSCDPSASAKIISINGQSKIVIY 139

Query: 899 AKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
           AKR +  GEEI Y+YKFPLE     ++PC CG+  C G LN
Sbjct: 140 AKRTLHPGEEILYDYKFPLESDPALRVPCLCGAATCRGWLN 180


>gi|194764639|ref|XP_001964436.1| GF23177 [Drosophila ananassae]
 gi|190614708|gb|EDV30232.1| GF23177 [Drosophila ananassae]
          Length = 3708

 Score =  173 bits (439), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 79/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3576 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3634

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP EE+KI
Sbjct: 3635 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKI 3694

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3695 PCSCGSKRCRKYLN 3708


>gi|195064789|ref|XP_001996640.1| GH19675 [Drosophila grimshawi]
 gi|193892772|gb|EDV91638.1| GH19675 [Drosophila grimshawi]
          Length = 3837

 Score =  172 bits (437), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3705 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3763

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3764 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3823

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3824 PCSCGSKRCRKYLN 3837


>gi|195109821|ref|XP_001999480.1| GI24532 [Drosophila mojavensis]
 gi|193916074|gb|EDW14941.1| GI24532 [Drosophila mojavensis]
          Length = 3756

 Score =  172 bits (437), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3624 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3682

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3683 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3742

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3743 PCSCGSKRCRKYLN 3756


>gi|195392284|ref|XP_002054789.1| trx [Drosophila virilis]
 gi|194152875|gb|EDW68309.1| trx [Drosophila virilis]
          Length = 3822

 Score =  172 bits (436), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3690 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3748

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3749 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3808

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3809 PCSCGSKRCRKYLN 3822


>gi|10720313|sp|Q24742.1|TRX_DROVI RecName: Full=Histone-lysine N-methyltransferase trithorax
 gi|899254|emb|CAA90349.1| predicted trithorax protein [Drosophila virilis]
          Length = 3828

 Score =  172 bits (436), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3696 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3754

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3755 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3814

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3815 PCSCGSKRCRKYLN 3828


>gi|194900731|ref|XP_001979909.1| GG21380 [Drosophila erecta]
 gi|190651612|gb|EDV48867.1| GG21380 [Drosophila erecta]
          Length = 3741

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3609 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3667

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP EE+KI
Sbjct: 3668 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKI 3727

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3728 PCSCGSKRCRKYLN 3741


>gi|198452207|ref|XP_002137435.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198131831|gb|EDY67993.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 3779

 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3647 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3705

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3706 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3765

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3766 PCSCGSKRCRKYLN 3779


>gi|347968475|ref|XP_563394.4| AGAP002741-PA [Anopheles gambiae str. PEST]
 gi|333467986|gb|EAL40845.4| AGAP002741-PA [Anopheles gambiae str. PEST]
          Length = 4925

 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 79/144 (54%), Positives = 99/144 (68%), Gaps = 1/144 (0%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            ++ K+ +   RS IH  GL     IEA + VIEY GELIR  ++D RE  Y+  GIG  Y
Sbjct: 4783 ESSKESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-Y 4841

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +F++D+ +VVDAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+
Sbjct: 4842 MFKIDENFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYD 4901

Query: 913  YKFPLEEKKIPCYCGSKKCHGSLN 936
            YKFP E+ KIPC CGSKKC   LN
Sbjct: 4902 YKFPFEDVKIPCSCGSKKCRKYLN 4925


>gi|390178053|ref|XP_003736554.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388859306|gb|EIM52627.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 3474

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3342 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3400

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3401 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3460

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3461 PCSCGSKRCRKYLN 3474


>gi|158818|gb|AAA29025.1| zinc-binding protein [Drosophila melanogaster]
          Length = 3759

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3627 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3685

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3686 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKI 3745

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3746 PCSCGSKRCRKYLN 3759


>gi|17136556|ref|NP_476769.1| trithorax, isoform D [Drosophila melanogaster]
 gi|19550184|ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster]
 gi|290457684|sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName:
            Full=Lysine N-methyltransferase 2A
 gi|10726522|gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster]
 gi|23171244|gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster]
          Length = 3726

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3594 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3652

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3653 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3712

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3713 PCSCGSKRCRKYLN 3726


>gi|469800|emb|CAA83516.1| predicted trithorax protein [Drosophila melanogaster]
 gi|1052593|emb|CAA90513.1| trithorax protein trxII [Drosophila melanogaster]
 gi|1311653|gb|AAB35873.1| large trx isoform=trithorax gene product large isoform {alternatively
            spliced, exon II-containing isoform} [Drosophila,
            embryos, Peptide, 3726 aa]
          Length = 3726

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3594 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3652

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3653 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKI 3712

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3713 PCSCGSKRCRKYLN 3726


>gi|17136558|ref|NP_476770.1| trithorax, isoform B [Drosophila melanogaster]
 gi|19550181|ref|NP_599108.1| trithorax, isoform C [Drosophila melanogaster]
 gi|62472551|ref|NP_001014621.1| trithorax, isoform E [Drosophila melanogaster]
 gi|23171245|gb|AAN13600.1| trithorax, isoform B [Drosophila melanogaster]
 gi|23171246|gb|AAN13601.1| trithorax, isoform C [Drosophila melanogaster]
 gi|61679333|gb|AAX52951.1| trithorax, isoform E [Drosophila melanogaster]
          Length = 3358

 Score =  170 bits (430), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3226 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3284

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3285 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 3344

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3345 PCSCGSKRCRKYLN 3358


>gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster]
 gi|1052594|emb|CAA90514.1| trithorax protein trxI [Drosophila melanogaster]
          Length = 3358

 Score =  170 bits (430), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 3226 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 3284

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 3285 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKI 3344

Query: 923  PCYCGSKKCHGSLN 936
            PC CGSK+C   LN
Sbjct: 3345 PCSCGSKRCRKYLN 3358


>gi|122937787|gb|ABM68621.1| AAEL000054-PA [Aedes aegypti]
          Length = 3489

 Score =  170 bits (430), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 79/141 (56%), Positives = 97/141 (68%), Gaps = 1/141 (0%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K+ +   RS IH  GL     IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F+
Sbjct: 3350 KESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFK 3408

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +VVDAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKF
Sbjct: 3409 IDEHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKF 3468

Query: 916  PLEEKKIPCYCGSKKCHGSLN 936
            P E+ KIPC CGSKKC   LN
Sbjct: 3469 PFEDVKIPCSCGSKKCRKYLN 3489


>gi|449016155|dbj|BAM79557.1| unknown RNA binding protein [Cyanidioschyzon merolae strain 10D]
          Length = 1151

 Score =  170 bits (430), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 83/188 (44%), Positives = 117/188 (62%), Gaps = 6/188 (3%)

Query: 752  PEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGL 811
            P   A +W+    L  R  RV+ R LL+A    E      ++ R++ LRF RS IH +GL
Sbjct: 967  PADEARRWSATGSL--RGTRVEQRQLLSAVSDREQ-TVLYLQQRERFLRFYRSGIHGYGL 1023

Query: 812  VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
             ALE I A D++IEY GEL+R  ++D+RE  Y + G+G S++FR+D   VVDAT  G +A
Sbjct: 1024 WALEDIPAGDYIIEYRGELVRSAVADLRERAYRQQGMGDSFMFRIDADTVVDATHIGSVA 1083

Query: 872  RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGS 928
            RF+NHSC+PN   +++ + G   I  Y+KR I  GEEITY+Y F +E+   +K+PC CG+
Sbjct: 1084 RFVNHSCDPNAIARIVQLGGASHILFYSKRSICVGEEITYDYNFDIEDDASEKVPCLCGA 1143

Query: 929  KKCHGSLN 936
              C   LN
Sbjct: 1144 PNCRQYLN 1151


>gi|157103255|ref|XP_001647894.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
 gi|108884726|gb|EAT48951.1| AAEL000054-PA, partial [Aedes aegypti]
          Length = 3069

 Score =  169 bits (428), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 79/141 (56%), Positives = 97/141 (68%), Gaps = 1/141 (0%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K+ +   RS IH  GL     IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F+
Sbjct: 2930 KESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFK 2988

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +VVDAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKF
Sbjct: 2989 IDEHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKF 3048

Query: 916  PLEEKKIPCYCGSKKCHGSLN 936
            P E+ KIPC CGSKKC   LN
Sbjct: 3049 PFEDVKIPCSCGSKKCRKYLN 3069


>gi|322699770|gb|EFY91529.1| Histone-lysine N-methyltransferase, H3 lysine-4 specific [Metarhizium
            acridum CQMa 102]
          Length = 1231

 Score =  169 bits (428), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 81/157 (51%), Positives = 109/157 (69%), Gaps = 8/157 (5%)

Query: 766  SARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH+WGL A+E I
Sbjct: 1073 NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIHNWGLYAMENI 1132

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
              +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD  V+DATK+GGIARFINHS
Sbjct: 1133 NKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDDTVIDATKKGGIARFINHS 1192

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            C PNC  K+I VEG K+I IYA R IA  ++++   K
Sbjct: 1193 CMPNCTAKIIKVEGSKRIVIYALRDIAMSKQLSLESK 1229


>gi|322706019|gb|EFY97601.1| Histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF 23]
          Length = 1222

 Score =  168 bits (426), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 130/220 (59%), Gaps = 23/220 (10%)

Query: 708  CPRSDGCAR---------SSISGWEWHKWSLNASPAER---ARVRGAQYVHTKYLGPEVN 755
             P + GCAR         S  S +  H   +  +  ER   ++  G   VH      ++ 
Sbjct: 1006 VPNATGCARTEGVKKILNSEKSKYLPHHIKVQKAREERQARSKRNGKDSVHAASEAAKIA 1065

Query: 756  ASQWANGKGLSARTNRVKLRNLLA--------AAEGAELLKASQVKARKKRLRFQRSKIH 807
            A + A G   ++R NR   R  +A          + +++ K +Q+K RKK ++F RS IH
Sbjct: 1066 AEKIAKG---NSRANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSAIH 1122

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            +WGL A+E I  +D +IEYVGE +R +I++IRE RY K GIGSSYLFR+DD  V+DATK+
Sbjct: 1123 NWGLYAMENINKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGSSYLFRIDDDTVIDATKK 1182

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGE 907
            GGIARFINHSC PNC  K+I VEG K+I IYA R IA  E
Sbjct: 1183 GGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMSE 1222


>gi|425770041|gb|EKV08516.1| Histone-lysine N-methyltransferase, H3 lysine-4 specific [Penicillium
            digitatum Pd1]
 gi|425771732|gb|EKV10169.1| Histone-lysine N-methyltransferase, H3 lysine-4 specific [Penicillium
            digitatum PHI26]
          Length = 1178

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 83/152 (54%), Positives = 106/152 (69%), Gaps = 11/152 (7%)

Query: 766  SARTNRVKLRNLLA----------AAEG-AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A           A G  ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1026 TSRSTRVNNRRLIADINAQKQALPTANGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1085

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1086 ENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1145

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
            NHSC PNC  K+I V+G K+I IYA R I  G
Sbjct: 1146 NHSCTPNCTAKIIKVDGSKRIVIYALRDIERG 1177


>gi|270015132|gb|EFA11580.1| trithorax [Tribolium castaneum]
          Length = 2343

 Score =  168 bits (425), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RSKIH  GL  L   EA + VIEY GE+IR  ++D RE  Y   GIG  Y+FR+DD  VV
Sbjct: 2211 RSKIHRRGLFCLRDFEAGEMVIEYSGEVIRSVLTDKREKYYNSKGIGC-YMFRIDDNLVV 2269

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT  G  ARFINHSC+PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP+EE KI
Sbjct: 2270 DATMTGNAARFINHSCDPNCYSKVVEILGHKHIIIFALRRIICGEELTYDYKFPIEEDKI 2329

Query: 923  PCYCGSKKCHGSLN 936
            PC CG+++C   LN
Sbjct: 2330 PCTCGTRRCRKFLN 2343


>gi|91076142|ref|XP_970289.1| PREDICTED: similar to mixed-lineage leukemia protein, mll [Tribolium
            castaneum]
          Length = 1824

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RSKIH  GL  L   EA + VIEY GE+IR  ++D RE  Y   GIG  Y+FR+DD  VV
Sbjct: 1692 RSKIHRRGLFCLRDFEAGEMVIEYSGEVIRSVLTDKREKYYNSKGIGC-YMFRIDDNLVV 1750

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT  G  ARFINHSC+PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP+EE KI
Sbjct: 1751 DATMTGNAARFINHSCDPNCYSKVVEILGHKHIIIFALRRIICGEELTYDYKFPIEEDKI 1810

Query: 923  PCYCGSKKCHGSLN 936
            PC CG+++C   LN
Sbjct: 1811 PCTCGTRRCRKFLN 1824


>gi|258564606|ref|XP_002583048.1| hypothetical protein UREG_07821 [Uncinocarpus reesii 1704]
 gi|237908555|gb|EEP82956.1| hypothetical protein UREG_07821 [Uncinocarpus reesii 1704]
          Length = 1311

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 82/159 (51%), Positives = 106/159 (66%), Gaps = 11/159 (6%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A               +++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 1092 TSRSTRVNNRRLIADINAQKQALPMQNGDSDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 1151

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE RY K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 1152 ENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 1211

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
            NHSC PNC  K+I V+G K+I IYA R I  G      Y
Sbjct: 1212 NHSCTPNCTAKIIKVDGSKRIVIYALRDIDRGNSARQKY 1250


>gi|340381930|ref|XP_003389474.1| PREDICTED: histone-lysine N-methyltransferase trithorax-like
           [Amphimedon queenslandica]
          Length = 192

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 78/133 (58%), Positives = 97/133 (72%), Gaps = 1/133 (0%)

Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
           S IH  GL  L+ I++ D VIEY G +IR  ++D RE  YE  GIG  Y+FR+D   VVD
Sbjct: 61  SHIHGLGLFCLQEIDSGDMVIEYAGTVIRSTLTDYRERFYESRGIGC-YMFRIDSDEVVD 119

Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
           AT  G +ARFINHSC PNCY+KV++V+GQKKI I+A R I  GEE+TY+YKFP+EE KIP
Sbjct: 120 ATMSGNMARFINHSCEPNCYSKVVAVDGQKKIMIFALRRIVPGEELTYDYKFPIEEAKIP 179

Query: 924 CYCGSKKCHGSLN 936
           C CGS +C  +LN
Sbjct: 180 CKCGSARCRKTLN 192


>gi|195446231|ref|XP_002070688.1| GK10891 [Drosophila willistoni]
 gi|194166773|gb|EDW81674.1| GK10891 [Drosophila willistoni]
          Length = 447

 Score =  167 bits (423), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 79/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 315 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 373

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP EE+KI
Sbjct: 374 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKI 433

Query: 923 PCYCGSKKCHGSLN 936
           PC CGSK+C   LN
Sbjct: 434 PCSCGSKRCRKYLN 447


>gi|15292119|gb|AAK93328.1| LD39445p [Drosophila melanogaster]
          Length = 751

 Score =  167 bits (423), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 619 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 677

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 678 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 737

Query: 923 PCYCGSKKCHGSLN 936
           PC CGSK+C   LN
Sbjct: 738 PCSCGSKRCRKYLN 751


>gi|238488134|ref|XP_002375305.1| SET domain protein [Aspergillus flavus NRRL3357]
 gi|220700184|gb|EED56523.1| SET domain protein [Aspergillus flavus NRRL3357]
          Length = 1024

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 81/153 (52%), Positives = 105/153 (68%), Gaps = 11/153 (7%)

Query: 766  SARTNRVKLRNLLAAAEG-----------AELLKASQVKARKKRLRFQRSKIHDWGLVAL 814
            ++R+ RV  R L+A                ++L+ +Q+K RKK +RF RS IH+WGL A 
Sbjct: 867  TSRSTRVNNRRLIADINAQKQALPTQSGDGDVLRFNQLKKRKKPVRFARSAIHNWGLYAE 926

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
            E I A D +IEYVGE +R +++D+RE +Y K GIGSSYLFR+D+  V+DATKRGGIARFI
Sbjct: 927  ENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGGIARFI 986

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGE 907
            NHSC PNC  K+I V+G K+I IYA R I  GE
Sbjct: 987  NHSCTPNCTAKIIKVDGSKRIVIYALRDIERGE 1019


>gi|336385606|gb|EGO26753.1| hypothetical protein SERLADRAFT_385814 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 115

 Score =  166 bits (421), Expect = 5e-38,   Method: Composition-based stats.
 Identities = 73/114 (64%), Positives = 91/114 (79%)

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           VIEYVGE+IR +++D RE  YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC
Sbjct: 2   VIEYVGEVIRAQVADKREKVYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNC 61

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             K+I++ G+KKI IYAK+ I  GEEITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 62  TAKIITINGEKKIVIYAKQDIELGEEITYDYHFPIEQDKIPCLCGSAKCRGYLN 115


>gi|291240989|ref|XP_002740398.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like,
           partial [Saccoglossus kowalevskii]
          Length = 357

 Score =  166 bits (419), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 80/144 (55%), Positives = 99/144 (68%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL    PI+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 215 KTSKEAVGVYRSGIHGRGLFCKRPIDAAEMVIEYAGMVIRSSLTDKREKYYDSKGIGC-Y 273

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNC+++VI V+G+K I I+A R I  GEE+TY+
Sbjct: 274 MFRIDDYDVVDATMHGNAARFINHSCEPNCFSRVIQVDGKKHIVIFASRKIMPGEELTYD 333

Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
           YKFP EE+KIPC CGSKKC   LN
Sbjct: 334 YKFPFEEEKIPCTCGSKKCRKHLN 357


>gi|195570949|ref|XP_002103466.1| GD20433 [Drosophila simulans]
 gi|194199393|gb|EDX12969.1| GD20433 [Drosophila simulans]
          Length = 152

 Score =  165 bits (417), Expect = 1e-37,   Method: Composition-based stats.
 Identities = 77/134 (57%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 20  RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 78

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 79  DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 138

Query: 923 PCYCGSKKCHGSLN 936
           PC CGSK+C   LN
Sbjct: 139 PCSCGSKRCRKYLN 152


>gi|405951732|gb|EKC19620.1| Histone-lysine N-methyltransferase MLL4 [Crassostrea gigas]
          Length = 4493

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/185 (48%), Positives = 115/185 (62%), Gaps = 6/185 (3%)

Query: 757  SQWANGKGLSARTNRVKLRNLLAA-AEGAELLKASQVKARKKRLR----FQRSKIHDWGL 811
            SQ+    G   R N V++ +  A  A   +L  A + +  K+  R      RS IH  GL
Sbjct: 4310 SQYREMPGSKERKNDVEMVHKSARRATSMDLPMAMRFRKLKEHAREAVGVYRSHIHGRGL 4369

Query: 812  VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
                 I+  + VIEY GE+IR  ++D RE  YE  GIG  Y+FR+DD  V+DAT  G  A
Sbjct: 4370 YCKRNIDEGEMVIEYSGEVIRGSLTDKREKYYEGKGIGC-YMFRIDDYDVIDATLHGNAA 4428

Query: 872  RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKC 931
            RFINHSC PNCY+KVI+V+G+K I I+A + I  GEE+TY+YKFP+EE KIPC CG+KKC
Sbjct: 4429 RFINHSCEPNCYSKVINVDGKKHIVIFAMKSIKRGEELTYDYKFPIEEVKIPCTCGAKKC 4488

Query: 932  HGSLN 936
               LN
Sbjct: 4489 RRYLN 4493


>gi|242221772|ref|XP_002476627.1| predicted protein [Postia placenta Mad-698-R]
 gi|220724099|gb|EED78169.1| predicted protein [Postia placenta Mad-698-R]
          Length = 115

 Score =  164 bits (416), Expect = 2e-37,   Method: Composition-based stats.
 Identities = 72/114 (63%), Positives = 90/114 (78%)

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           VIEYVGE+IR +++D RE  YE+ GIGSSYLFR+D+  VVDATK+G + R INHSC+PNC
Sbjct: 2   VIEYVGEIIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNC 61

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             K+I++ G+KKI IYAK+ I  G EITY+Y FP+E+ KIPC CGS KC G LN
Sbjct: 62  TAKIITINGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLCGSAKCRGFLN 115


>gi|195145308|ref|XP_002013638.1| GL23289 [Drosophila persimilis]
 gi|194102581|gb|EDW24624.1| GL23289 [Drosophila persimilis]
          Length = 293

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 161 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 219

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP E++KI
Sbjct: 220 DATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKI 279

Query: 923 PCYCGSKKCHGSLN 936
           PC CGSK+C   LN
Sbjct: 280 PCSCGSKRCRKYLN 293


>gi|195501654|ref|XP_002097885.1| GE26460 [Drosophila yakuba]
 gi|194183986|gb|EDW97597.1| GE26460 [Drosophila yakuba]
          Length = 343

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL   + IEA + VIEY GELIR  ++D RE  Y+  GIG  Y+F++DD  VV
Sbjct: 211 RSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERYYDSRGIGC-YMFKIDDNLVV 269

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT RG  ARFINH C PNCY+KV+ + G K I I+A R I  GEE+TY+YKFP EE+KI
Sbjct: 270 DATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKI 329

Query: 923 PCYCGSKKCHGSLN 936
           PC CGSK+C   LN
Sbjct: 330 PCSCGSKRCRKYLN 343


>gi|340710026|ref|XP_003393599.1| PREDICTED: hypothetical protein LOC100646252 [Bombus terrestris]
          Length = 3530

 Score =  163 bits (413), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 82/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            A + RV   NL  A     L + S     K+ +    S IH  GL  L  IEA + VIEY
Sbjct: 3367 AESRRVASTNLPMAMRFRILKETS-----KESVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3421

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
             GE+IR  ++D RE  Y+   IG  Y+F++DD  VVDAT +G  ARFINHSC PNCY++V
Sbjct: 3422 AGEVIRASLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3480

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            + + G+K I I+A R I  GEE+TY+YKFP E+ KIPC CGS++C   LN
Sbjct: 3481 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3530


>gi|383861703|ref|XP_003706324.1| PREDICTED: uncharacterized protein LOC100882965 [Megachile rotundata]
          Length = 3434

 Score =  163 bits (413), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 82/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            A + RV   NL  A     L + S     K+ +    S IH  GL  L  IEA + VIEY
Sbjct: 3271 AESRRVASTNLPMAMRFRILKETS-----KESVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3325

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
             GE+IR  ++D RE  Y+   IG  Y+F++DD  VVDAT +G  ARFINHSC PNCY++V
Sbjct: 3326 AGEVIRASLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3384

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            + + G+K I I+A R I  GEE+TY+YKFP E+ KIPC CGS++C   LN
Sbjct: 3385 VDILGKKHILIFALRRINQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3434


>gi|157126650|ref|XP_001654691.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
 gi|108873214|gb|EAT37439.1| AAEL010578-PA [Aedes aegypti]
          Length = 172

 Score =  163 bits (413), Expect = 4e-37,   Method: Composition-based stats.
 Identities = 84/167 (50%), Positives = 106/167 (63%), Gaps = 5/167 (2%)

Query: 774 LRNLLAAAEGAELLKASQVKA----RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
           LR +     G+ L  A + +      K+ +   RS IH  GL     IEA + VIEY GE
Sbjct: 7   LRKIKRRGSGSNLPMAMRYRTLKETSKESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGE 66

Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
           LIR  ++D RE  Y+  GIG  Y+F++D+ +VVDAT RG  ARFINHSC PNCY+KV+ +
Sbjct: 67  LIRSTLTDKRERYYDSRGIGC-YMFKIDEHFVVDATMRGNAARFINHSCEPNCYSKVVDI 125

Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            G K I I+A R I  GEE+TY+YKFP E+ KIPC CGSKKC   LN
Sbjct: 126 LGHKHIIIFALRRIVQGEELTYDYKFPFEDVKIPCSCGSKKCRKYLN 172


>gi|350413847|ref|XP_003490133.1| PREDICTED: hypothetical protein LOC100748492 [Bombus impatiens]
          Length = 3522

 Score =  163 bits (413), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 82/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            A + RV   NL  A     L + S     K+ +    S IH  GL  L  IEA + VIEY
Sbjct: 3359 AESRRVASTNLPMAMRFRILKETS-----KESVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3413

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
             GE+IR  ++D RE  Y+   IG  Y+F++DD  VVDAT +G  ARFINHSC PNCY++V
Sbjct: 3414 AGEVIRASLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3472

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            + + G+K I I+A R I  GEE+TY+YKFP E+ KIPC CGS++C   LN
Sbjct: 3473 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3522


>gi|307180358|gb|EFN68384.1| Histone-lysine N-methyltransferase trithorax [Camponotus floridanus]
          Length = 3218

 Score =  163 bits (412), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 83/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            A + RV   NL  A     L + S     KK +    S IH  GL  L  IEA + VIEY
Sbjct: 3055 AESRRVASTNLPMAMRFRILKETS-----KKSVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3109

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
             GE+IR  ++D RE  Y+   IG  Y+F++DD  VVDAT +G  ARFINHSC PNCY++V
Sbjct: 3110 AGEVIRSSLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3168

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            + + G+K I I+A R I  GEE+TY+YKFP E+ KIPC CGS++C   LN
Sbjct: 3169 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3218


>gi|241687915|ref|XP_002401626.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
 gi|215504523|gb|EEC14017.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
          Length = 1036

 Score =  163 bits (412), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 77/141 (54%), Positives = 97/141 (68%), Gaps = 1/141 (0%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K+ +   RS IH  GL     I+  + +IEY GE+IR  ++D RE  YE  GIG  Y+FR
Sbjct: 897  KEAVGVYRSSIHGRGLYCKRNIDGGEMIIEYAGEVIRAALTDKREKYYESKGIGC-YMFR 955

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +DD  VVDAT  G  ARFINHSC PNCY+KVI+V+ +K I I+A R I  GEE+TY+YKF
Sbjct: 956  IDDHEVVDATMHGNAARFINHSCEPNCYSKVITVDNKKHIVIFALRSILKGEELTYDYKF 1015

Query: 916  PLEEKKIPCYCGSKKCHGSLN 936
            P+EE KIPC CGS++C   LN
Sbjct: 1016 PIEEVKIPCSCGSRRCRKFLN 1036


>gi|403360488|gb|EJY79922.1| Histone-lysine N-methyltransferase [Oxytricha trifallax]
          Length = 2438

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 85/199 (42%), Positives = 119/199 (59%), Gaps = 5/199 (2%)

Query: 741  GAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAEL---LKASQVKARKK 797
            G +  ++K    E+N ++  N +  +    +++L + +   +  +L   +K    KA+ K
Sbjct: 2242 GKKMFYSKSHQDEINENK--NQQKSTVLAKKIRLTDTIDTNKEIDLPIAMKYRLSKAQPK 2299

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+    SKIH  GL  LE +   D VIEYVGE IR K++D RE  YE+ GIG  YLFRLD
Sbjct: 2300 RVEVGPSKIHRNGLFTLEDLMPGDIVIEYVGEKIRNKVADKREIYYEQKGIGDCYLFRLD 2359

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
              Y++DAT  G  AR++NHSC+ NC  K+I+V+ QK I I   R I +GEE+TYNY F  
Sbjct: 2360 KEYIIDATFFGNKARYLNHSCDANCSAKIINVQQQKHIIISTNRQIKSGEELTYNYNFDY 2419

Query: 918  EEKKIPCYCGSKKCHGSLN 936
            E  KI C+CG+  C G LN
Sbjct: 2420 ETDKIACFCGAPTCSGRLN 2438


>gi|328778088|ref|XP_392252.4| PREDICTED: histone-lysine N-methyltransferase trithorax [Apis
            mellifera]
          Length = 3195

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 82/170 (48%), Positives = 106/170 (62%), Gaps = 6/170 (3%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            A + RV   NL  A     L + S     K+ +    S IH  GL  L  IEA + VIEY
Sbjct: 3032 AESRRVASTNLPMAMRFRILKETS-----KESVGVYHSHIHGRGLFCLRDIEAGEMVIEY 3086

Query: 827  VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
             GE+IR  ++D RE  Y+   IG  Y+F++DD  VVDAT +G  ARFINHSC PNCY++V
Sbjct: 3087 AGEVIRASLTDKREKYYDSKNIGC-YMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRV 3145

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            + + G+K I I+A R I  GEE+TY+YKFP E+ KIPC CGS++C   LN
Sbjct: 3146 VDILGKKHILIFALRRINQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3195


>gi|242005679|ref|XP_002423690.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
            corporis]
 gi|212506866|gb|EEB10952.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
            corporis]
          Length = 3311

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 76/134 (56%), Positives = 96/134 (71%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS+IH  GL  L  IEA + VIEY GE+IR  +SD RE  Y + GIG  Y+FR+DD +VV
Sbjct: 3179 RSQIHGRGLFCLREIEAGEMVIEYAGEVIRANLSDKREKYYTEKGIGC-YMFRIDDHFVV 3237

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT +G  ARFINHSC PNCY++V+ + G+K I I+A R I   EE+TY+YKFP E++KI
Sbjct: 3238 DATMKGNAARFINHSCEPNCYSRVVDILGKKHIVIFALRKINIMEELTYDYKFPFEDEKI 3297

Query: 923  PCYCGSKKCHGSLN 936
             C+C SKKC   LN
Sbjct: 3298 SCHCLSKKCKKYLN 3311


>gi|432892259|ref|XP_004075732.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oryzias
            latipes]
          Length = 4536

 Score =  162 bits (411), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 80/149 (53%), Positives = 103/149 (69%), Gaps = 4/149 (2%)

Query: 791  QVKA-RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
            Q+KA  K+ +   RS IH  GL   + IEA + VIEY G +IR  ++D RE  Y+  GIG
Sbjct: 4389 QLKAMSKETVGVYRSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREKYYDAKGIG 4448

Query: 850  SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
              Y+FR+DD  VVDAT  G  ARFINHSC PNCY++V++V+GQK I I+A R I  GEE+
Sbjct: 4449 C-YMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVLTVDGQKHIVIFASRRICCGEEL 4507

Query: 910  TYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
            TY+YKFP+E+   K+PC CG+KKC   LN
Sbjct: 4508 TYDYKFPIEDASNKLPCNCGTKKCRKFLN 4536


>gi|4127850|emb|CAA09454.1| MLL protein [Gallus gallus]
          Length = 945

 Score =  162 bits (410), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 801 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 859

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 860 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 919

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+KKC   LN
Sbjct: 920 YKFPIEDASNKLPCNCGAKKCRKFLN 945


>gi|322792358|gb|EFZ16342.1| hypothetical protein SINV_07789 [Solenopsis invicta]
          Length = 3272

 Score =  162 bits (409), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 93/133 (69%), Gaps = 1/133 (0%)

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S+IH  GL  L  IE  + VIEY GE+IR  ++D RE  Y+   IG  Y+F++DD  VVD
Sbjct: 3141 SRIHGRGLFCLRDIEPGEMVIEYAGEVIRSSLTDKREKYYDSKNIGC-YMFKIDDHLVVD 3199

Query: 864  ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
            AT +G  ARFINHSC PNCY++V+ + G+K I I+A R I  GEE+TY+YKFP E+ KIP
Sbjct: 3200 ATMKGNAARFINHSCEPNCYSRVVDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIP 3259

Query: 924  CYCGSKKCHGSLN 936
            C CGS+KC   LN
Sbjct: 3260 CTCGSRKCRKYLN 3272


>gi|15488422|gb|AAL01112.1|AF408061_1 trithorax-related protein 7 [Arabidopsis thaliana]
          Length = 259

 Score =  162 bits (409), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 102/211 (48%), Positives = 126/211 (59%), Gaps = 6/211 (2%)

Query: 611 PSGEMLSTLSADGNDVGKVVRGKAHNVGIEKDSILDSSKSKPNATKESKQKRKRTMDGLE 670
           PS   LS  S D  D   V++        EK     S K K    K  ++++       E
Sbjct: 55  PSRRRLSLSSTDSEDA--VIKEDYDVRNEEKLPCHTSDKLKKGPNKLIRRRKPLAKHTTE 112

Query: 671 LHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLN 730
               K L V  G  K  A + +    +K SK +     P+SDGCAR+SI+GW WH WSL 
Sbjct: 113 RSPIKDLSVDDGRPKPIALKPLEKLSSKPSKKKLFLSIPKSDGCARTSINGWHWHAWSLK 172

Query: 731 ASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKAS 790
           AS  ERARVRG+  VH ++ G + + +Q      LSARTNR KLRNLLAAA+GA++LK S
Sbjct: 173 ASAEERARVRGSSCVHMQHFGSKSSLTQ----NVLSARTNRAKLRNLLAAADGADVLKMS 228

Query: 791 QVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
           Q+KARKK LRFQ+SKIHDWGLVALEPIEAED
Sbjct: 229 QLKARKKHLRFQQSKIHDWGLVALEPIEAED 259


>gi|363742545|ref|XP_417896.3| PREDICTED: histone-lysine N-methyltransferase MLL [Gallus gallus]
          Length = 3871

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 3727 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 3785

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3786 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3845

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3846 YKFPIEDASNKLPCNCGAKKCRKFLN 3871


>gi|334330381|ref|XP_001380704.2| PREDICTED: histone-lysine N-methyltransferase MLL [Monodelphis
            domestica]
          Length = 3960

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 78/146 (53%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  YE  GIG  Y
Sbjct: 3816 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYESKGIGC-Y 3874

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3875 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3934

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3935 YKFPIEDASNKLPCNCGAKKCRKFLN 3960


>gi|449267369|gb|EMC78314.1| Histone-lysine N-methyltransferase HRX, partial [Columba livia]
          Length = 3786

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 3642 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 3700

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3701 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3760

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3761 YKFPIEDASNKLPCNCGAKKCRKFLN 3786


>gi|32394672|gb|AAN39003.1| SET1 protein [Griffithsia japonica]
          Length = 201

 Score =  161 bits (407), Expect = 2e-36,   Method: Composition-based stats.
 Identities = 90/191 (47%), Positives = 122/191 (63%), Gaps = 8/191 (4%)

Query: 754 VNASQWANGKG--LSARTNRVKLRNLLAAAEGA----ELLKASQVKARKKRLRFQRSKIH 807
           VNA++    +G   S+R +R + R L    +      EL+  S ++ R+K +  +RS IH
Sbjct: 11  VNATEVVKPEGGVESSRESRRQQRWLRKGMQQVKARHELVTGSMLQERRKAVFCRRSGIH 70

Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            +GL A E IEA +FVIEYVG +IR  ++D+RE  YE+ G+G SYLFRL+   VVDAT+R
Sbjct: 71  GFGLYAQEEIEAREFVIEYVGVVIRQSVADVREREYEEGGVGDSYLFRLNGEMVVDATRR 130

Query: 868 GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCY 925
           GGIARFINHSC+PN       V G ++I  Y++RHI   +E+TY+YKF LE  +KKI C 
Sbjct: 131 GGIARFINHSCDPNLTATTQRVGGTERIVFYSRRHIGKYDELTYDYKFALEGDDKKIRCL 190

Query: 926 CGSKKCHGSLN 936
           C S  C   LN
Sbjct: 191 CKSLNCRKFLN 201


>gi|395520196|ref|XP_003764223.1| PREDICTED: histone-lysine N-methyltransferase MLL [Sarcophilus
            harrisii]
          Length = 3995

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 78/146 (53%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  YE  GIG  Y
Sbjct: 3851 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYESKGIGC-Y 3909

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3910 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3969

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3970 YKFPIEDASNKLPCNCGAKKCRKFLN 3995


>gi|326933334|ref|XP_003212761.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Meleagris gallopavo]
          Length = 3851

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 3707 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 3765

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3766 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3825

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3826 YKFPIEDASNKLPCNCGAKKCRKFLN 3851


>gi|241554585|ref|XP_002399516.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
 gi|215501703|gb|EEC11197.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
          Length = 544

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 77/134 (57%), Positives = 96/134 (71%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL +   I+A + VIEY G+++R  ++D RE  YE  GIG  Y+FR+D+  VV
Sbjct: 412 RSLIHGRGLYSKRNIDAGEMVIEYAGQVVRSVLTDKRERLYESRGIGC-YMFRMDEDQVV 470

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT  G  ARFINHSC+PNCY+KVI+V GQK I IYA R I  GEE+TY+YKFP EE KI
Sbjct: 471 DATTHGNAARFINHSCDPNCYSKVIAVFGQKHIIIYALRKIYKGEELTYDYKFPKEEVKI 530

Query: 923 PCYCGSKKCHGSLN 936
           PC CG+++C   LN
Sbjct: 531 PCSCGARRCRKFLN 544


>gi|224083075|ref|XP_002188579.1| PREDICTED: histone-lysine N-methyltransferase MLL [Taeniopygia
            guttata]
          Length = 3849

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 3705 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-Y 3763

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3764 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3823

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3824 YKFPIEDASNKLPCNCGAKKCRKFLN 3849


>gi|320166419|gb|EFW43318.1| mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1858

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 71/161 (44%), Positives = 105/161 (65%), Gaps = 5/161 (3%)

Query: 780  AAEGAELLKASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
            A +  ++  A Q +  +   R     +RS IH  GL A   +E +  V+EY+GE IR  +
Sbjct: 1699 ATQSIQMPMAMQYRVLRDNYRTYSVVRRSPIHGCGLYAARRLEKDSMVVEYMGERIRDIL 1758

Query: 836  SDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKI 895
            +D RE  Y+  GIG  Y+FR+DD Y++DAT +   ARF+NHSC PNCYT++++ +G K+I
Sbjct: 1759 TDYRERMYDARGIGC-YMFRIDDDYIIDATMKANQARFMNHSCEPNCYTRIVNPDGVKRI 1817

Query: 896  FIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
              +A R +  GEE+T +YK P+E+ KIPCYCG++ C GS+N
Sbjct: 1818 IYFASRVVLEGEELTVDYKMPIEDVKIPCYCGTRSCRGSMN 1858


>gi|156353190|ref|XP_001622957.1| predicted protein [Nematostella vectensis]
 gi|156209595|gb|EDO30857.1| predicted protein [Nematostella vectensis]
          Length = 288

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 75/134 (55%), Positives = 96/134 (71%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL     I A + VIEY G LIR  ++D RE  YE  GIG  Y+FR+D  YVV
Sbjct: 156 RSGIHGRGLYCTRHIAAGEMVIEYSGMLIRSTLTDKREAYYESKGIGC-YMFRIDGTYVV 214

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT  G  ARFINHSC PNCY++V++++G KKI I+A + I+ GEE+TY+YKFPLE++K+
Sbjct: 215 DATTSGNAARFINHSCEPNCYSRVVTIDGNKKILIFASKSISRGEELTYDYKFPLEDEKL 274

Query: 923 PCYCGSKKCHGSLN 936
           PC+C SK+C   LN
Sbjct: 275 PCHCKSKRCRKYLN 288


>gi|156354350|ref|XP_001623359.1| predicted protein [Nematostella vectensis]
 gi|156210050|gb|EDO31259.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score =  160 bits (406), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 75/134 (55%), Positives = 96/134 (71%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL     I A + VIEY G LIR  ++D RE  YE  GIG  Y+FR+D  YVV
Sbjct: 179 RSGIHGRGLYCTRHIAAGEMVIEYSGMLIRSTLTDKREAYYESKGIGC-YMFRIDGTYVV 237

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT  G  ARFINHSC PNCY++V++++G KKI I+A + I+ GEE+TY+YKFPLE++K+
Sbjct: 238 DATTSGNAARFINHSCEPNCYSRVVTIDGNKKILIFASKSISRGEELTYDYKFPLEDEKL 297

Query: 923 PCYCGSKKCHGSLN 936
           PC+C SK+C   LN
Sbjct: 298 PCHCKSKRCRKYLN 311


>gi|156353192|ref|XP_001622958.1| predicted protein [Nematostella vectensis]
 gi|156209596|gb|EDO30858.1| predicted protein [Nematostella vectensis]
          Length = 314

 Score =  160 bits (406), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 75/134 (55%), Positives = 96/134 (71%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL     I A + VIEY G LIR  ++D RE  YE  GIG  Y+FR+D  YVV
Sbjct: 182 RSGIHGRGLYCTRHIAAGEMVIEYSGMLIRSTLTDKREAYYESKGIGC-YMFRIDGTYVV 240

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT  G  ARFINHSC PNCY++V++++G KKI I+A + I+ GEE+TY+YKFPLE++K+
Sbjct: 241 DATTSGNAARFINHSCEPNCYSRVVTIDGNKKILIFASKSISRGEELTYDYKFPLEDEKL 300

Query: 923 PCYCGSKKCHGSLN 936
           PC+C SK+C   LN
Sbjct: 301 PCHCKSKRCRKYLN 314


>gi|332025910|gb|EGI66066.1| Histone-lysine N-methyltransferase trithorax [Acromyrmex echinatior]
          Length = 3452

 Score =  160 bits (406), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 1/133 (0%)

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S IH  GL  L  IE  + VIEY GE+IR  ++D RE  Y+   IG  Y+F++DD  VVD
Sbjct: 3321 SHIHGRGLFCLRDIEPGEMVIEYAGEVIRSSLTDKREKYYDSKNIGC-YMFKIDDHLVVD 3379

Query: 864  ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
            AT +G  ARFINHSC PNCY++V+ + G+K I I+A R I  GEE+TY+YKFP E+ KIP
Sbjct: 3380 ATMKGNAARFINHSCEPNCYSRVVDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIP 3439

Query: 924  CYCGSKKCHGSLN 936
            C CGS+KC   LN
Sbjct: 3440 CTCGSRKCRKYLN 3452


>gi|348526824|ref|XP_003450919.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oreochromis
            niloticus]
          Length = 4517

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/171 (49%), Positives = 112/171 (65%), Gaps = 5/171 (2%)

Query: 770  NRVKLRNLLAAAEGAEL-LKASQVKA-RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            + V+LR    A+    L ++  Q+KA  K+ +   RS IH  GL   + IEA + VIEY 
Sbjct: 4348 DEVQLRTARRASMELPLAVRFKQLKATSKETVGVYRSPIHGRGLFCKKTIEAGEMVIEYS 4407

Query: 828  GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
            G +IR  ++D RE  Y+  GIG  Y+FR+DD  VVDAT  G  ARFINHSC PNCY++VI
Sbjct: 4408 GNVIRSVLTDKREKYYDGKGIGC-YMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVI 4466

Query: 888  SVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
            +V+GQK I I+A R I  GEE+TY+YKFP+E+   K+PC C +KKC   LN
Sbjct: 4467 TVDGQKHIVIFASRRIYCGEELTYDYKFPIEDASNKLPCNCSAKKCRKFLN 4517


>gi|225380774|gb|ACN88688.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 4219

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 78/146 (53%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            KA +  +   RS IH  GL   + IE  + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 4075 KASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDKGIGC-Y 4133

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++V++V+GQK I I+A R I  GEE+TY+
Sbjct: 4134 MFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVVNVDGQKHIVIFATRKIYKGEELTYD 4193

Query: 913  YKFPLEEK--KIPCYCGSKKCHGSLN 936
            YKFP+EE   K+PC CG+KKC   LN
Sbjct: 4194 YKFPIEEPGNKLPCNCGAKKCRKFLN 4219


>gi|160333334|ref|NP_001103749.1| histone-lysine N-methyltransferase MLL [Danio rerio]
 gi|158714185|gb|ABW79914.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 4218

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 78/146 (53%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            KA +  +   RS IH  GL   + IE  + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 4074 KASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDKGIGC-Y 4132

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++V++V+GQK I I+A R I  GEE+TY+
Sbjct: 4133 MFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVVNVDGQKHIVIFATRKIYKGEELTYD 4192

Query: 913  YKFPLEEK--KIPCYCGSKKCHGSLN 936
            YKFP+EE   K+PC CG+KKC   LN
Sbjct: 4193 YKFPIEEPGNKLPCNCGAKKCRKFLN 4218


>gi|444725290|gb|ELW65863.1| Histone-lysine N-methyltransferase MLL [Tupaia chinensis]
          Length = 3806

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3662 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3720

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3721 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3780

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3781 YKFPIEDASNKLPCNCGAKKCRKFLN 3806


>gi|426370676|ref|XP_004052287.1| PREDICTED: histone-lysine N-methyltransferase MLL [Gorilla gorilla
            gorilla]
          Length = 3837

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3693 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3751

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3752 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3811

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3812 YKFPIEDASNKLPCNCGAKKCRKFLN 3837


>gi|426244626|ref|XP_004016122.1| PREDICTED: histone-lysine N-methyltransferase MLL [Ovis aries]
          Length = 3710

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3566 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3624

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3625 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3684

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3685 YKFPIEDASNKLPCNCGAKKCRKFLN 3710


>gi|297458806|ref|XP_585092.4| PREDICTED: histone-lysine N-methyltransferase MLL [Bos taurus]
          Length = 3826

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3682 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3740

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3741 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3800

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3801 YKFPIEDASNKLPCNCGAKKCRKFLN 3826


>gi|297482744|ref|XP_002693122.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Bos
            taurus]
 gi|296480196|tpg|DAA22311.1| TPA: myeloid/lymphoid or mixed-lineage leukemia-like [Bos taurus]
          Length = 3821

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3677 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3735

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3736 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3795

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3796 YKFPIEDASNKLPCNCGAKKCRKFLN 3821


>gi|440904942|gb|ELR55394.1| Histone-lysine N-methyltransferase MLL, partial [Bos grunniens mutus]
          Length = 3846

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3702 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3760

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3761 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3820

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3821 YKFPIEDASNKLPCNCGAKKCRKFLN 3846


>gi|351705860|gb|EHB08779.1| Histone-lysine N-methyltransferase HRX [Heterocephalus glaber]
          Length = 3899

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3755 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3813

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3814 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3873

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3874 YKFPIEDASNKLPCNCGAKKCRKFLN 3899


>gi|76161881|gb|AAX30110.2| KIAA1076 protein [Schistosoma japonicum]
          Length = 123

 Score =  160 bits (404), Expect = 4e-36,   Method: Composition-based stats.
 Identities = 72/114 (63%), Positives = 88/114 (77%)

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           VIEYVG ++R  ++++RE +YE  GIG SYLFR+DD +V+DAT  G  ARFINHSC PNC
Sbjct: 10  VIEYVGHVVRKGVAELREHQYEAKGIGGSYLFRIDDDFVIDATMCGNNARFINHSCQPNC 69

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y K+I VE +KKI IY+KR I   EEITY+YKFP EE+KIPC CGS  C G+LN
Sbjct: 70  YAKIIMVESKKKIVIYSKRDINVMEEITYDYKFPYEEEKIPCQCGSSSCRGTLN 123


>gi|325192337|emb|CCA26782.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
          Length = 2128

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 72/140 (51%), Positives = 99/140 (70%), Gaps = 1/140 (0%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            RL+  +S IH +GL   E +     ++EY GELI  +++D+RE  YE+MG+GS YLFRLD
Sbjct: 1989 RLKVCKSAIHGYGLFTKEALSEGQAIVEYQGELISQQVADVREALYEEMGVGSCYLFRLD 2048

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEG-QKKIFIYAKRHIAAGEEITYNYKFP 916
               ++DAT RG +ARFINHSC+P  + + + VE  +KKI I+AKR I AGEEITY+YKFP
Sbjct: 2049 ATTIIDATTRGNLARFINHSCDPKAFARSVIVENDKKKILIFAKRAIMAGEEITYDYKFP 2108

Query: 917  LEEKKIPCYCGSKKCHGSLN 936
            +EE+ + C CG+  C G +N
Sbjct: 2109 IEEEALRCDCGAPNCVGRMN 2128


>gi|344293012|ref|XP_003418218.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Loxodonta africana]
          Length = 3962

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3818 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3876

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3877 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3936

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3937 YKFPIEDASNKLPCNCGAKKCRKFLN 3962


>gi|410972021|ref|XP_003992459.1| PREDICTED: histone-lysine N-methyltransferase MLL [Felis catus]
          Length = 3554

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3410 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3468

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3469 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3528

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3529 YKFPIEDASNKLPCNCGAKKCRKFLN 3554


>gi|431908264|gb|ELK11862.1| Histone-lysine N-methyltransferase HRX [Pteropus alecto]
          Length = 3459

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3315 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3373

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3374 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3433

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3434 YKFPIEDASNKLPCNCGAKKCRKFLN 3459


>gi|355752689|gb|EHH56809.1| hypothetical protein EGM_06289 [Macaca fascicularis]
          Length = 3844

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3700 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3758

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3759 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3818

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3819 YKFPIEDASNKLPCNCGAKKCRKFLN 3844


>gi|355567103|gb|EHH23482.1| hypothetical protein EGK_06957, partial [Macaca mulatta]
          Length = 3824

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3680 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3738

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3739 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3798

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3799 YKFPIEDASNKLPCNCGAKKCRKFLN 3824


>gi|395848655|ref|XP_003796965.1| PREDICTED: histone-lysine N-methyltransferase MLL [Otolemur
            garnettii]
          Length = 4062

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3918 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3976

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3977 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 4036

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 4037 YKFPIEDASNKLPCNCGAKKCRKFLN 4062


>gi|403263194|ref|XP_003923935.1| PREDICTED: histone-lysine N-methyltransferase MLL [Saimiri
            boliviensis boliviensis]
          Length = 3985

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3841 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3899

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3900 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3959

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3960 YKFPIEDASNKLPCNCGAKKCRKFLN 3985


>gi|350588548|ref|XP_003357368.2| PREDICTED: histone-lysine N-methyltransferase MLL [Sus scrofa]
          Length = 2525

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 2381 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 2439

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 2440 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 2499

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 2500 YKFPIEDASNKLPCNCGAKKCRKFLN 2525


>gi|417414196|gb|JAA53397.1| Putative histone-lysine n-methyltransferase mll, partial [Desmodus
            rotundus]
          Length = 3966

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3822 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3880

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3881 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3940

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3941 YKFPIEDASNKLPCNCGAKKCRKFLN 3966


>gi|301785015|ref|XP_002927929.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Ailuropoda
            melanoleuca]
          Length = 3981

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3837 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3895

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3896 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3955

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3956 YKFPIEDASNKLPCNCGAKKCRKFLN 3981


>gi|390469747|ref|XP_002754504.2| PREDICTED: histone-lysine N-methyltransferase MLL [Callithrix
            jacchus]
          Length = 3994

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3850 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3908

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3909 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3968

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3969 YKFPIEDASNKLPCNCGAKKCRKFLN 3994


>gi|354496911|ref|XP_003510567.1| PREDICTED: histone-lysine N-methyltransferase MLL [Cricetulus
            griseus]
          Length = 3907

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3763 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3821

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3822 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3881

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3882 YKFPIEDASNKLPCNCGAKKCRKFLN 3907


>gi|345799715|ref|XP_536554.3| PREDICTED: histone-lysine N-methyltransferase MLL [Canis lupus
            familiaris]
          Length = 3829

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3685 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3743

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3744 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3803

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3804 YKFPIEDASNKLPCNCGAKKCRKFLN 3829


>gi|402895434|ref|XP_003910832.1| PREDICTED: histone-lysine N-methyltransferase MLL [Papio anubis]
          Length = 3968

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3824 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3882

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3883 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3942

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3943 YKFPIEDASNKLPCNCGAKKCRKFLN 3968


>gi|297269329|ref|XP_001093874.2| PREDICTED: histone-lysine N-methyltransferase MLL [Macaca mulatta]
          Length = 3986

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3842 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3900

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3901 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3960

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3961 YKFPIEDASNKLPCNCGAKKCRKFLN 3986


>gi|114640631|ref|XP_508792.2| PREDICTED: histone-lysine N-methyltransferase MLL [Pan troglodytes]
          Length = 3969

 Score =  159 bits (403), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3825 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3883

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3884 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3943

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3944 YKFPIEDASNKLPCNCGAKKCRKFLN 3969


>gi|332208875|ref|XP_003253537.1| PREDICTED: histone-lysine N-methyltransferase MLL [Nomascus
            leucogenys]
          Length = 3968

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3824 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3882

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3883 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3942

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3943 YKFPIEDASNKLPCNCGAKKCRKFLN 3968


>gi|308199413|ref|NP_001184033.1| histone-lysine N-methyltransferase MLL isoform 1 precursor [Homo
            sapiens]
          Length = 3972

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3828 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3886

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3887 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3946

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3947 YKFPIEDASNKLPCNCGAKKCRKFLN 3972


>gi|119587788|gb|EAW67384.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_e [Homo sapiens]
          Length = 3972

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3828 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3886

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3887 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3946

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3947 YKFPIEDASNKLPCNCGAKKCRKFLN 3972


>gi|119587787|gb|EAW67383.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_d [Homo sapiens]
          Length = 4002

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3858 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3916

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3917 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3976

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3977 YKFPIEDASNKLPCNCGAKKCRKFLN 4002


>gi|56550039|ref|NP_005924.2| histone-lysine N-methyltransferase MLL isoform 2 precursor [Homo
            sapiens]
 gi|146345435|sp|Q03164.5|MLL1_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
            Full=ALL-1; AltName: Full=CXXC-type zinc finger protein
            7; AltName: Full=Lysine N-methyltransferase 2A;
            Short=KMT2A; AltName: Full=Trithorax-like protein;
            AltName: Full=Zinc finger protein HRX; Contains: RecName:
            Full=MLL cleavage product N320; AltName: Full=N-terminal
            cleavage product of 320 kDa; Short=p320; Contains:
            RecName: Full=MLL cleavage product C180; AltName:
            Full=C-terminal cleavage product of 180 kDa; Short=p180
 gi|34305635|gb|AAQ63624.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila) [Homo sapiens]
          Length = 3969

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3825 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3883

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3884 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3943

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3944 YKFPIEDASNKLPCNCGAKKCRKFLN 3969


>gi|119587784|gb|EAW67380.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_a [Homo sapiens]
          Length = 3969

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3825 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3883

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3884 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3943

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3944 YKFPIEDASNKLPCNCGAKKCRKFLN 3969


>gi|1490271|emb|CAA93625.1| ALL-1 protein [Homo sapiens]
          Length = 4005

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3861 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3919

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3920 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3979

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3980 YKFPIEDASNKLPCNCGAKKCRKFLN 4005


>gi|299115811|emb|CBN74374.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 3157

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 71/140 (50%), Positives = 104/140 (74%), Gaps = 2/140 (1%)

Query: 799  LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
            L+ +RS IH WGL ALE    +  ++EY+GE++R  ++D+RE +YE+MG+GS YLFR D 
Sbjct: 3018 LKVRRSHIHGWGLFALEEFPKDAMLVEYMGEVVRQCVADLREVKYEEMGVGSCYLFRADA 3077

Query: 859  GYVVDATKRGGIARFINHSCNPNCYTKVISVEG--QKKIFIYAKRHIAAGEEITYNYKFP 916
              +VDAT++G +ARFINH C+PN   +++++EG  +KKI I AKR I  GEEITY+YKF 
Sbjct: 3078 DAIVDATRKGNLARFINHCCDPNAIARIVNLEGTKKKKIIIVAKRQIKPGEEITYDYKFE 3137

Query: 917  LEEKKIPCYCGSKKCHGSLN 936
             E+ ++ C+CG+  C+GS+N
Sbjct: 3138 REDGQLSCHCGADVCYGSMN 3157


>gi|688443|gb|AAA62593.1| All-1 protein, partial [Mus musculus]
          Length = 3866

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3722 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3780

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3781 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3840

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3841 YKFPIEDASNKLPCNCGAKKCRKFLN 3866


>gi|395743560|ref|XP_002822597.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL [Pongo abelii]
          Length = 4012

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3868 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3926

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3927 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3986

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3987 YKFPIEDASNKLPCNCGAKKCRKFLN 4012


>gi|627837|pir||A48205 All-1 protein +GTE form - mouse (fragment)
          Length = 3869

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3725 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3783

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3784 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3843

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3844 YKFPIEDASNKLPCNCGAKKCRKFLN 3869


>gi|397498815|ref|XP_003820170.1| PREDICTED: histone-lysine N-methyltransferase MLL [Pan paniscus]
          Length = 4202

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 4058 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 4116

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 4117 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 4176

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 4177 YKFPIEDASNKLPCNCGAKKCRKFLN 4202


>gi|361127849|gb|EHK99806.1| putative Histone-lysine N-methyltransferase, H3 lysine-4 specific
            [Glarea lozoyensis 74030]
          Length = 1072

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 78/153 (50%), Positives = 105/153 (68%), Gaps = 8/153 (5%)

Query: 766  SARTNRVKLRNLLAAAEG--------AELLKASQVKARKKRLRFQRSKIHDWGLVALEPI 817
            ++R +RV  R  +A  E         ++ L+ +Q++ RKK ++F RS IH+WGL A+E I
Sbjct: 909  NSRASRVNNRRFVAGLEDQKKALGGESDALRFNQLQKRKKPVKFARSAIHNWGLYAMENI 968

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
               D +IEYVGE IR +++D+RE  Y + GIGSSYLFR+D+  VVDATK+GGIARFINHS
Sbjct: 969  AINDMIIEYVGEKIRQQVADLRENNYLRSGIGSSYLFRIDENTVVDATKKGGIARFINHS 1028

Query: 878  CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEIT 910
            C PNC  K+I+VE  K+I IYA R IA  +  T
Sbjct: 1029 CMPNCTAKIITVEKSKRIVIYALRDIAMSKFFT 1061


>gi|357150782|ref|XP_003575574.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Brachypodium
            distachyon]
          Length = 1055

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 72/142 (50%), Positives = 101/142 (71%), Gaps = 1/142 (0%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLF 854
            ++RL F +S+IH +G+ A    +A D +IEY+GEL+R  +SDIRE R Y  +    +Y+F
Sbjct: 892  RRRLAFGKSRIHGFGVFAKVAHKAGDMMIEYIGELVRPPVSDIREGRIYNSLVGAGTYMF 951

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            R+DD  V+DAT+ G IA  INHSC PNCY++VISV G + I I+AKR IA  EE+TY+Y+
Sbjct: 952  RIDDERVIDATRSGSIAHLINHSCEPNCYSRVISVLGDEHIIIFAKRDIAPWEELTYDYR 1011

Query: 915  FPLEEKKIPCYCGSKKCHGSLN 936
            F   ++++PCYCG  KC G +N
Sbjct: 1012 FVSNDQRLPCYCGFPKCRGVVN 1033


>gi|432105765|gb|ELK31956.1| Histone-lysine N-methyltransferase MLL, partial [Myotis davidii]
          Length = 3463

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3319 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3377

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3378 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3437

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3438 YKFPIEDASNKLPCNCGAKKCRKFLN 3463


>gi|119587786|gb|EAW67382.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_c [Homo sapiens]
          Length = 3130

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 2986 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3044

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3045 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3104

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3105 YKFPIEDASNKLPCNCGAKKCRKFLN 3130


>gi|392350034|ref|XP_003750554.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Rattus
            norvegicus]
          Length = 3894

 Score =  159 bits (402), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3750 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3808

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3809 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3868

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3869 YKFPIEDASNKLPCNCGAKKCRKFLN 3894


>gi|62088596|dbj|BAD92745.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila) variant [Homo sapiens]
          Length = 2880

 Score =  159 bits (402), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 2736 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 2794

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 2795 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 2854

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 2855 YKFPIEDASNKLPCNCGAKKCRKFLN 2880


>gi|348573849|ref|XP_003472703.1| PREDICTED: histone-lysine N-methyltransferase MLL-like, partial
            [Cavia porcellus]
          Length = 2799

 Score =  159 bits (402), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 2655 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 2713

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 2714 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 2773

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 2774 YKFPIEDASNKLPCNCGAKKCRKFLN 2799


>gi|124486682|ref|NP_001074518.1| histone-lysine N-methyltransferase MLL [Mus musculus]
          Length = 3963

 Score =  159 bits (402), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3819 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3877

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3878 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3937

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3938 YKFPIEDASNKLPCNCGAKKCRKFLN 3963


>gi|148693675|gb|EDL25622.1| mCG1547 [Mus musculus]
          Length = 3706

 Score =  159 bits (401), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3562 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3620

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3621 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3680

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3681 YKFPIEDASNKLPCNCGAKKCRKFLN 3706


>gi|326489957|dbj|BAJ94052.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1081

 Score =  159 bits (401), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 92/238 (38%), Positives = 137/238 (57%), Gaps = 19/238 (7%)

Query: 711  SDGCARSSISGWEWHKWSLNASP-------AERARVRGAQYVHTKY----LGPEVNASQW 759
            S GCAR+    + +H+      P        +R  V    Y+ + Y    +G + +    
Sbjct: 828  SSGCARTE--PYNFHRRRGQQQPQVTATASVKRLYVENMPYIVSGYCQNKVGCDTSCEPI 885

Query: 760  ANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
             +   L A +    + N+ + AE  + +KA+     +KRL F +S+IH +G+ A    +A
Sbjct: 886  QSVALLDAASQEASV-NVSSMAEKYKSMKAT----FRKRLAFGKSRIHGFGVFAKVGHKA 940

Query: 820  EDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
             D +IEY+GEL+R  ISD+RE R Y  +    +Y+FR+DD  V+DAT+ G IA  INHSC
Sbjct: 941  GDMMIEYIGELVRPPISDLRERRIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 1000

Query: 879  NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             PNCY++VISV G + I I+AKR I   EE+TY+Y+F   E+++PCYCG  KC G +N
Sbjct: 1001 EPNCYSRVISVLGDEHIIIFAKRDIDPWEELTYDYRFVSNEQRLPCYCGFPKCRGVVN 1058


>gi|341940997|sp|P55200.3|MLL1_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
            Full=ALL-1; AltName: Full=Zinc finger protein HRX;
            Contains: RecName: Full=MLL cleavage product N320;
            AltName: Full=N-terminal cleavage product of 320 kDa;
            Short=p320; Contains: RecName: Full=MLL cleavage product
            C180; AltName: Full=C-terminal cleavage product of 180
            kDa; Short=p180
          Length = 3966

 Score =  159 bits (401), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3822 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3880

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3881 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3940

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3941 YKFPIEDASNKLPCNCGAKKCRKFLN 3966


>gi|332077887|gb|AED99887.1| Trx1 [Hordeum vulgare]
          Length = 1029

 Score =  159 bits (401), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 92/238 (38%), Positives = 137/238 (57%), Gaps = 19/238 (7%)

Query: 711  SDGCARSSISGWEWHKWSLNASP-------AERARVRGAQYVHTKY----LGPEVNASQW 759
            S GCAR+    + +H+      P        +R  V    Y+ + Y    +G + +    
Sbjct: 776  SSGCARTE--PYNFHRRRGQQQPQVTATASVKRLYVENMPYIVSGYCQNKVGCDTSCEPI 833

Query: 760  ANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEA 819
             +   L A +    + N+ + AE  + +KA+     +KRL F +S+IH +G+ A    +A
Sbjct: 834  QSVALLDAASQEASV-NVSSMAEKYKSMKAT----FRKRLAFGKSRIHGFGVFAKVGHKA 888

Query: 820  EDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSC 878
             D +IEY+GEL+R  ISD+RE R Y  +    +Y+FR+DD  V+DAT+ G IA  INHSC
Sbjct: 889  GDMMIEYIGELVRPPISDLRERRIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 948

Query: 879  NPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             PNCY++VISV G + I I+AKR I   EE+TY+Y+F   E+++PCYCG  KC G +N
Sbjct: 949  EPNCYSRVISVLGDEHIIIFAKRDIDPWEELTYDYRFVSNEQRLPCYCGFPKCRGVVN 1006


>gi|149041498|gb|EDL95339.1| myeloid/lymphoid or mixed-lineage leukemia (mapped) [Rattus
            norvegicus]
          Length = 3725

 Score =  159 bits (401), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3581 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3639

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3640 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3699

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3700 YKFPIEDASNKLPCNCGAKKCRKFLN 3725


>gi|184394|gb|AAA58669.1| HRX [Homo sapiens]
          Length = 3969

 Score =  159 bits (401), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3825 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3883

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3884 MFRIDDSEVVDATMHGNRARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3943

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3944 YKFPIEDASNKLPCNCGAKKCRKFLN 3969


>gi|392341954|ref|XP_003754471.1| PREDICTED: histone-lysine N-methyltransferase MLL [Rattus norvegicus]
          Length = 3987

 Score =  159 bits (401), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 3843 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 3901

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3902 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 3961

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3962 YKFPIEDASNKLPCNCGAKKCRKFLN 3987


>gi|3309543|gb|AAC41377.1| MLL [Takifugu rubripes]
          Length = 4498

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 76/136 (55%), Positives = 95/136 (69%), Gaps = 3/136 (2%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY G +IR  ++D RE  Y+  GIG  Y+FR+DD  VV
Sbjct: 4364 RSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREKYYDGKGIGC-YMFRIDDYEVV 4422

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
            DAT  G  ARFINHSC PNCY++VI+V+G+K I I+A R I  GEE+TY+YKFP+E+   
Sbjct: 4423 DATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFASRRIYRGEELTYDYKFPIEDASS 4482

Query: 921  KIPCYCGSKKCHGSLN 936
            K+PC C SKKC   LN
Sbjct: 4483 KLPCNCNSKKCRKFLN 4498


>gi|301090958|ref|XP_002895674.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
            T30-4]
 gi|262097084|gb|EEY55136.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
            T30-4]
          Length = 2943

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 67/141 (47%), Positives = 102/141 (72%), Gaps = 1/141 (0%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
            +R+  ++S IH +GL   EP+     ++EY G++I   I+D RE RYE+ G+GS Y+FRL
Sbjct: 2803 ERMLVRKSSIHGYGLFLKEPVSDGQMIVEYQGQMINQTIADERERRYEEQGVGSCYMFRL 2862

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYKF 915
            D+  ++DAT+ G +ARFINHSC+P  + ++++VE G+KKI I+AKR IA G+E+TY+YKF
Sbjct: 2863 DEKTIIDATRCGNLARFINHSCDPKAFARIVAVEGGEKKIVIFAKRAIAVGDEVTYDYKF 2922

Query: 916  PLEEKKIPCYCGSKKCHGSLN 936
            P+E++ I C C +  C G +N
Sbjct: 2923 PIEDEAIRCDCNAPNCIGRMN 2943


>gi|449679772|ref|XP_002161520.2| PREDICTED: histone-lysine N-methyltransferase MLL-like [Hydra
           magnipapillata]
          Length = 281

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 74/134 (55%), Positives = 98/134 (73%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RSKI   GL   + IEA + +IEY GE IR  ++D RE  YEK G G  Y+FR+DD  VV
Sbjct: 149 RSKIQGRGLFCKKTIEAGEMIIEYSGEKIRSSLTDRREKYYEKKGYGC-YMFRIDDTDVV 207

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT +G  ARFINHSC PNC++++IS++G KKI IYA++ +  GEE+TY+YKF +E+ K+
Sbjct: 208 DATTKGNAARFINHSCEPNCFSRIISIDGCKKIIIYAQKRVTVGEELTYDYKFAIEDDKL 267

Query: 923 PCYCGSKKCHGSLN 936
           PC+CG+KKC   LN
Sbjct: 268 PCFCGAKKCRKYLN 281


>gi|410910074|ref|XP_003968515.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Takifugu rubripes]
          Length = 4478

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 76/136 (55%), Positives = 95/136 (69%), Gaps = 3/136 (2%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + IEA + VIEY G +IR  ++D RE  Y+  GIG  Y+FR+DD  VV
Sbjct: 4344 RSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREKYYDGKGIGC-YMFRIDDYEVV 4402

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
            DAT  G  ARFINHSC PNCY++VI+V+G+K I I+A R I  GEE+TY+YKFP+E+   
Sbjct: 4403 DATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFASRRIYRGEELTYDYKFPIEDASS 4462

Query: 921  KIPCYCGSKKCHGSLN 936
            K+PC C SKKC   LN
Sbjct: 4463 KLPCNCNSKKCRKFLN 4478


>gi|356526623|ref|XP_003531916.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Glycine max]
          Length = 1088

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 95/256 (37%), Positives = 140/256 (54%), Gaps = 26/256 (10%)

Query: 703  RTSNLC------PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLG 751
            R S LC      P   GCARS    +   +      +L A+  +R  V    Y+   Y  
Sbjct: 813  RVSGLCSDYEPPPNPSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 872

Query: 752  PEVNASQWANGKGLSA---------RTNRVKLRN-LLAAAEGAELLKASQVKARKKRLRF 801
                 +   +G+G+ +         RT+R+   N +L+ +E    ++    +  +KRL F
Sbjct: 873  HGSLNNLEPSGRGVCSKFFCSQQRLRTSRIDTSNSILSISEKYTYMR----ETFRKRLAF 928

Query: 802  QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGY 860
             +S+IH +G+ A  P +  D VIEY GEL+R  I+D RE   Y  +    +Y+FR+DD  
Sbjct: 929  GKSRIHGFGIFAKHPYKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDER 988

Query: 861  VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK 920
            V+DAT+ G IA  INHSC PNCY++VISV G + I I+AKR I   EE+TY+Y+F   ++
Sbjct: 989  VIDATRAGSIAHLINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE 1048

Query: 921  KIPCYCGSKKCHGSLN 936
            ++PCYCG  KC G +N
Sbjct: 1049 RLPCYCGFPKCRGIVN 1064


>gi|256078227|ref|XP_002575398.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
 gi|353230389|emb|CCD76560.1| putative mixed-lineage leukemia protein, mll [Schistosoma mansoni]
          Length = 3002

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 76/157 (48%), Positives = 99/157 (63%), Gaps = 3/157 (1%)

Query: 780  AAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIR 839
            A + + LL  S   AR  R+    S+IH  GL AL     ++ VIEY GELIR  I D R
Sbjct: 2849 ATQFSRLLSGSL--ARFYRVSVHPSRIHGRGLFALREFREDEMVIEYTGELIRSIICDAR 2906

Query: 840  ECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYA 899
            E +Y   G+   Y+FR+D  +V+DAT  G  ARFINH+C+PNCY KV+S++ +K I I A
Sbjct: 2907 ELKYRATGVDC-YMFRIDPDWVIDATYAGNAARFINHACDPNCYAKVVSIDDKKHIVILA 2965

Query: 900  KRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            +R I  GEE+TY+Y+FP E  K+PC CGS  C   LN
Sbjct: 2966 QRKIYPGEELTYDYRFPKESDKLPCNCGSYSCRKYLN 3002


>gi|410911878|ref|XP_003969417.1| PREDICTED: uncharacterized protein LOC101064190 [Takifugu rubripes]
          Length = 2720

 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 102/146 (69%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS+IH  GL     IEA + VIEY G +IR  ++D R+  Y+  GIG  Y
Sbjct: 2576 KISKEAVGVYRSEIHGRGLFCKRNIEAGEMVIEYAGTVIRAVLTDKRQKYYDGKGIGC-Y 2634

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT +G  ARFINHSC PNCY++VI+V+G+K I I+A R I  GEE+TY+
Sbjct: 2635 MFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYRGEELTYD 2694

Query: 913  YKFPLE--EKKIPCYCGSKKCHGSLN 936
            YKFP+E  E K+ C CG+++C GSLN
Sbjct: 2695 YKFPIEDDESKLHCNCGTRRCRGSLN 2720


>gi|219127798|ref|XP_002184115.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404346|gb|EEC44293.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 154

 Score =  156 bits (395), Expect = 4e-35,   Method: Composition-based stats.
 Identities = 75/145 (51%), Positives = 100/145 (68%), Gaps = 4/145 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           ++RL  +RS IH WGL     I+ +D +IEY+GE+IR  I+D RE  YE  G GS Y+FR
Sbjct: 10  EQRLVARRSHIHGWGLFTKSDIQKDDPIIEYMGEVIRQPIADQRERAYEISGEGSCYMFR 69

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ----KKIFIYAKRHIAAGEEITY 911
           LD   +VDATK G +ARF+NH C PN Y K+ISV+ +    +KI ++A R I+AGEEITY
Sbjct: 70  LDMQRIVDATKIGCMARFMNHCCGPNAYAKIISVDAELGQDRKIVVFANRDISAGEEITY 129

Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
           +YKFP+E+  + C CG+  C G LN
Sbjct: 130 DYKFPVEDGSLRCTCGAPNCIGRLN 154


>gi|391333600|ref|XP_003741200.1| PREDICTED: uncharacterized protein LOC100900769 [Metaseiulus
           occidentalis]
          Length = 748

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 72/134 (53%), Positives = 95/134 (70%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RSKIH  G+ A   I+A + VIEY GELIR  ++D RE  Y+  GI   Y+F++D+  VV
Sbjct: 616 RSKIHRRGVFAKRAIDAGEMVIEYSGELIRAVLTDKREQLYKARGI-DCYMFKIDEDEVV 674

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT  G  ARFINHSC+PNCY+K I + G+K I IY+++ I  GEE+TY+YKFP E+ K+
Sbjct: 675 DATMHGNAARFINHSCDPNCYSKCIEIFGKKHIVIYSQKRIKVGEELTYDYKFPKEDVKV 734

Query: 923 PCYCGSKKCHGSLN 936
           PC CG++KC   LN
Sbjct: 735 PCTCGARKCRRYLN 748


>gi|428181744|gb|EKX50607.1| hypothetical protein GUITHDRAFT_66907 [Guillardia theta CCMP2712]
          Length = 182

 Score =  156 bits (395), Expect = 5e-35,   Method: Composition-based stats.
 Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 2/158 (1%)

Query: 780 AAEGAELLKASQVKARKK-RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDI 838
           A E    +KA  ++ R + R   +RS IH+ GL A   I+  D VIEYVGELIR  + DI
Sbjct: 26  AKEEERSIKAKYLQLRSEPRTVVRRSPIHNLGLFATRRIDRNDMVIEYVGELIRPIVGDI 85

Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
           R+    + G  S+Y+FRLDD Y+VDA   G  +RFINH C+PNCY +V++V+G K I ++
Sbjct: 86  RDDLALEKG-KSTYMFRLDDNYIVDAMFAGNASRFINHCCDPNCYCQVVTVDGVKHIVLF 144

Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           A R I A EEITY+YK P+EE K+ C+CGS KC G +N
Sbjct: 145 AMRDIEADEEITYDYKLPIEEVKVICHCGSAKCRGYMN 182


>gi|323450935|gb|EGB06814.1| hypothetical protein AURANDRAFT_15476, partial [Aureococcus
           anophagefferens]
          Length = 136

 Score =  156 bits (395), Expect = 5e-35,   Method: Composition-based stats.
 Identities = 66/136 (48%), Positives = 95/136 (69%), Gaps = 1/136 (0%)

Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYV 861
           +RS IH WGL     +  + F+IEYVG+++R  + D RE  Y+  G+GS YLFRLD+  +
Sbjct: 1   RRSHIHGWGLYCKTDVPKDAFIIEYVGQVVRQAVGDKREKYYDDAGVGSCYLFRLDEDAI 60

Query: 862 VDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFPLEEK 920
           VDAT+RG I RFINH C PN Y K+++++   KKI I A + + AG+E+ Y+YKFP+E+ 
Sbjct: 61  VDATRRGNIGRFINHCCRPNAYAKIVALDSNTKKIVIIALQDLKAGDEVMYDYKFPIEDD 120

Query: 921 KIPCYCGSKKCHGSLN 936
           K+ CYCG+  C G++N
Sbjct: 121 KVKCYCGAPNCRGTMN 136


>gi|301606679|ref|XP_002932944.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 1 [Xenopus
            (Silurana) tropicalis]
          Length = 3855

 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 76/146 (52%), Positives = 97/146 (66%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 3711 KTSKEAVGVYRSPIHGRGLFCRRNIDAGEMVIEYSGNVIRSILTDKREKYYDGKGIGC-Y 3769

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI ++GQK I I+A R I  GEE+TY+
Sbjct: 3770 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIYRGEELTYD 3829

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+ C CG+KKC   LN
Sbjct: 3830 YKFPIEDANNKLACNCGTKKCRKFLN 3855


>gi|301606681|ref|XP_002932945.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 2 [Xenopus
            (Silurana) tropicalis]
          Length = 3840

 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 76/146 (52%), Positives = 97/146 (66%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 3696 KTSKEAVGVYRSPIHGRGLFCRRNIDAGEMVIEYSGNVIRSILTDKREKYYDGKGIGC-Y 3754

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI ++GQK I I+A R I  GEE+TY+
Sbjct: 3755 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIYRGEELTYD 3814

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+ C CG+KKC   LN
Sbjct: 3815 YKFPIEDANNKLACNCGTKKCRKFLN 3840


>gi|301622725|ref|XP_002940678.1| PREDICTED: hypothetical protein LOC100144721 [Xenopus (Silurana)
            tropicalis]
          Length = 2771

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 79/146 (54%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2627 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDSKGIGC-Y 2685

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2686 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVIHVEGQKHIVIFALRSIYRGEELTYD 2745

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 2746 YKFPIEDASNKLPCNCGAKKCRRFLN 2771


>gi|326676474|ref|XP_003200588.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Danio
           rerio]
          Length = 820

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     IEA + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 676 KTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGNVIRAVLTDKREKYYDSKGIGC-Y 734

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC+PNCY++VI+VEGQK I I+A R I  GEE+TY+
Sbjct: 735 MFRIDDFDVVDATMHGNAARFINHSCDPNCYSRVINVEGQKHIVIFALRKIYRGEELTYD 794

Query: 913 YKFPLE--EKKIPCYCGSKKCHGSLN 936
           YKFP+E  + K+ C CG+++C   LN
Sbjct: 795 YKFPIEDADNKLHCNCGARRCRRFLN 820


>gi|6850313|gb|AAF29390.1|AC009999_10 Contains similarity to MLL protein from Fugu rubripes gb|AF036382,
            and contains a PWWP PF|00855 and a SET PF|00856 domain
            [Arabidopsis thaliana]
          Length = 1193

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 130/234 (55%), Gaps = 13/234 (5%)

Query: 709  PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
            P   GCAR+    +   +      +L  + ++R  V    Y+   Y   E +  +   G 
Sbjct: 810  PNPSGCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIVGGYSRHEFSTYERIYGS 869

Query: 764  GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
             +S  T      N+L+ AE    +K +     +KRL F +S IH +G+ A  P  A D V
Sbjct: 870  KMSQITTP---SNILSMAEKYTFMKETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMV 922

Query: 824  IEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
            IEY GEL+R  I+D RE   Y  M    +Y+FR+D+  V+DAT+ G IA  INHSC PNC
Sbjct: 923  IEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNC 982

Query: 883  YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            Y++VISV G + I I+AKR +A  EE+TY+Y+F   ++++ CYCG  +C G +N
Sbjct: 983  YSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVN 1036


>gi|240254018|ref|NP_172074.6| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
 gi|240254020|ref|NP_001077464.4| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
 gi|257096236|sp|P0CB22.1|ATX2_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX2; AltName:
            Full=Protein SET DOMAIN GROUP 30; AltName:
            Full=Trithorax-homolog protein 2; Short=TRX-homolog
            protein 2
 gi|332189779|gb|AEE27900.1| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
 gi|332189780|gb|AEE27901.1| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
          Length = 1083

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 130/234 (55%), Gaps = 13/234 (5%)

Query: 709  PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
            P   GCAR+    +   +      +L  + ++R  V    Y+   Y   E +  +   G 
Sbjct: 833  PNPSGCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIVGGYSRHEFSTYERIYGS 892

Query: 764  GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
             +S  T      N+L+ AE    +K +     +KRL F +S IH +G+ A  P  A D V
Sbjct: 893  KMSQITTP---SNILSMAEKYTFMKETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMV 945

Query: 824  IEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
            IEY GEL+R  I+D RE   Y  M    +Y+FR+D+  V+DAT+ G IA  INHSC PNC
Sbjct: 946  IEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNC 1005

Query: 883  YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            Y++VISV G + I I+AKR +A  EE+TY+Y+F   ++++ CYCG  +C G +N
Sbjct: 1006 YSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVN 1059


>gi|327288610|ref|XP_003229019.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Anolis carolinensis]
          Length = 3817

 Score =  155 bits (393), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 75/146 (51%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+   IG  Y
Sbjct: 3673 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSILTDKREKYYDSKDIGC-Y 3731

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 3732 MFRIDDAEVVDATMHGNAARFINHSCEPNCFSRVINIDGQKHIVIFAMRKIFRGEELTYD 3791

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 3792 YKFPIEDASNKLPCNCGAKKCRRFLN 3817


>gi|115478080|ref|NP_001062635.1| Os09g0134500 [Oryza sativa Japonica Group]
 gi|47848462|dbj|BAD22318.1| trithorax-like [Oryza sativa Japonica Group]
 gi|113630868|dbj|BAF24549.1| Os09g0134500 [Oryza sativa Japonica Group]
 gi|215704333|dbj|BAG93767.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 1022

 Score =  155 bits (392), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 72/142 (50%), Positives = 100/142 (70%), Gaps = 1/142 (0%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLF 854
            ++RL F +S+IH +G+ A    +A D +IEY+GEL+R  ISDIRE R Y  +    +Y+F
Sbjct: 860  RRRLAFGKSRIHGFGVFAKVSHKAGDMMIEYIGELVRPPISDIRERRIYNSLVGAGTYMF 919

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            R+DD  V+DAT+ G IA  INHSC PNCY++VISV G + I I+AKR I   EE+TY+Y+
Sbjct: 920  RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVLGDEHIIIFAKRDINPWEELTYDYR 979

Query: 915  FPLEEKKIPCYCGSKKCHGSLN 936
            F   ++++PCYCG  KC G +N
Sbjct: 980  FVSSDQRLPCYCGFPKCRGVVN 1001


>gi|256077052|ref|XP_002574822.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
          Length = 1720

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/180 (47%), Positives = 111/180 (61%), Gaps = 15/180 (8%)

Query: 768  RTNRVKLRNLLAA---AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
            R  R   R LLA     E  +LLK +Q+K RKK+L F +S IH WGL+ALEPI AE+ VI
Sbjct: 1545 REARSVQRRLLAEFQDIETGDLLKFNQLKFRKKQLIFAKSPIHAWGLIALEPIAAEEMVI 1604

Query: 825  EYVGELIRLKISDIRECRYE-----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHS-- 877
            EYVG ++R  ++++RE +YE     K+   S +L  +   + +   +      F  HS  
Sbjct: 1605 EYVGHVVRKGVAELREHQYEAXXXXKLFTESIFLVLMSLRFYLYPRQNT----FYYHSEF 1660

Query: 878  -CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
              +PNCY K+I VE +KKI IY+KR I   EEITY+YKFP EE KIPC CGS  C G+LN
Sbjct: 1661 YLHPNCYAKIIMVESKKKIVIYSKRDINVMEEITYDYKFPYEEDKIPCQCGSSSCRGTLN 1720


>gi|154417121|ref|XP_001581581.1| SET domain containing protein [Trichomonas vaginalis G3]
 gi|121915810|gb|EAY20595.1| SET domain containing protein [Trichomonas vaginalis G3]
          Length = 762

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 78/187 (41%), Positives = 116/187 (62%), Gaps = 21/187 (11%)

Query: 771 RVKLRNLLAAAEGAELLKASQVKARK-------------------KRLRFQRSKIHDWGL 811
           R +L+ L+ A +    ++ +Q  A K                   KR+ F++S I  +GL
Sbjct: 545 RARLQPLVVARDSGLKIRQTQFPAPKLMMQRRGAANNQMRRGINGKRVYFEKSTIQGYGL 604

Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
            ALEPI ++  + EY GELIR +I+D+RE +YE++G    +LFR+D+  VVDAT RGG +
Sbjct: 605 FALEPISSDSLICEYNGELIRSRIADLREKQYEQLGFPHMFLFRIDNDTVVDATMRGGKS 664

Query: 872 RFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSK 929
           RF+NHSC+PNC +K+I+V   + I  YA R+I   +EIT+NY+   E+  K+  CYCG+K
Sbjct: 665 RFLNHSCHPNCRSKIINVGKTQTISFYAIRNIKPHDEITFNYQMEFEDRSKRERCYCGAK 724

Query: 930 KCHGSLN 936
           +C G LN
Sbjct: 725 QCLGYLN 731


>gi|397614973|gb|EJK63132.1| hypothetical protein THAOC_16229, partial [Thalassiosira oceanica]
          Length = 962

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 78/179 (43%), Positives = 109/179 (60%), Gaps = 5/179 (2%)

Query: 761 NGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
           NG+   A   R+ L++L      +  +     + R++ LRF +S IH WG+ A E I + 
Sbjct: 786 NGRAARASQRRM-LKSLATLGSASSSVDRLAGRDREQELRFDKSSIHGWGVFAEEHINSG 844

Query: 821 DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNP 880
           D +IEY GELIR  ++D RE  YE   +   Y+FR+D   V DAT RG +AR+IN SC+P
Sbjct: 845 DLIIEYRGELIRNAVADKREKEYEAARM-DDYMFRIDANTVCDATMRGNVARYINASCDP 903

Query: 881 NCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGSLN 936
           NCYT++I+    K+I IYAKR I  G E+ Y+YKF LE    K++ C CG+K C G +N
Sbjct: 904 NCYTQIITAGDNKRIVIYAKRDIPKGGELVYDYKFSLESDPGKRLICNCGAKMCKGFMN 962


>gi|327280514|ref|XP_003224997.1| PREDICTED: hypothetical protein LOC100556600 [Anolis carolinensis]
          Length = 2812

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 78/146 (53%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2668 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKYYDSKGIGC-Y 2726

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2727 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVIHVEGQKHIVIFALRRIFRGEELTYD 2786

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2787 YKFPIEDAGSKLPCNCGAKRCRRFLN 2812


>gi|256077050|ref|XP_002574821.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
          Length = 1822

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/180 (47%), Positives = 111/180 (61%), Gaps = 15/180 (8%)

Query: 768  RTNRVKLRNLLAA---AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
            R  R   R LLA     E  +LLK +Q+K RKK+L F +S IH WGL+ALEPI AE+ VI
Sbjct: 1647 REARSVQRRLLAEFQDIETGDLLKFNQLKFRKKQLIFAKSPIHAWGLIALEPIAAEEMVI 1706

Query: 825  EYVGELIRLKISDIRECRYE-----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHS-- 877
            EYVG ++R  ++++RE +YE     K+   S +L  +   + +   +      F  HS  
Sbjct: 1707 EYVGHVVRKGVAELREHQYEAXXXXKLFTESIFLVLMSLRFYLYPRQNT----FYYHSEF 1762

Query: 878  -CNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
              +PNCY K+I VE +KKI IY+KR I   EEITY+YKFP EE KIPC CGS  C G+LN
Sbjct: 1763 YLHPNCYAKIIMVESKKKIVIYSKRDINVMEEITYDYKFPYEEDKIPCQCGSSSCRGTLN 1822


>gi|414884804|tpg|DAA60818.1| TPA: putative trithorax-like family protein isoform 1 [Zea mays]
 gi|414884805|tpg|DAA60819.1| TPA: putative trithorax-like family protein isoform 2 [Zea mays]
          Length = 976

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 75/174 (43%), Positives = 113/174 (64%), Gaps = 5/174 (2%)

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           GLS    +  + N+ +  E   ++KA+     +KRL F +SKIH +G+ A    +A D +
Sbjct: 783 GLSDAVQQEAIGNVSSMVEKYTIVKAT----FRKRLTFGKSKIHGYGVFAKVAHKAGDMM 838

Query: 824 IEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           +EY+GE++R  ISD RE R Y  +    +Y+FR+DD  V+DAT+ G IAR INHSC PNC
Sbjct: 839 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSCEPNC 898

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y++ I++ G + I I+AKR I   EE+TY+Y+F   ++++PC+CG  KC G +N
Sbjct: 899 YSRAITILGDEHIIIFAKRDIDPWEELTYDYRFFSSDQRLPCFCGFPKCRGVVN 952


>gi|166796317|gb|AAI59185.1| mll4 protein [Xenopus (Silurana) tropicalis]
          Length = 1622

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 79/146 (54%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 1478 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDSKGIGC-Y 1536

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 1537 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVIHVEGQKHIVIFALRSIYRGEELTYD 1596

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+KKC   LN
Sbjct: 1597 YKFPIEDASNKLPCNCGAKKCRRFLN 1622


>gi|195059316|ref|XP_001995609.1| GH17848 [Drosophila grimshawi]
 gi|193896395|gb|EDV95261.1| GH17848 [Drosophila grimshawi]
          Length = 2535

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2382 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2441

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2442 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2500

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 2501 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2535


>gi|348667410|gb|EGZ07235.1| hypothetical protein PHYSODRAFT_306469 [Phytophthora sojae]
          Length = 3106

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 65/141 (46%), Positives = 101/141 (71%), Gaps = 1/141 (0%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
            +R+  ++S IH +GL   E +     ++EY G++I   ++D RE RYE+ G+GS Y+FRL
Sbjct: 2966 ERMLVRKSSIHGYGLFMKEAVSEGQMIVEYQGQMINQAVADERERRYEEQGVGSCYMFRL 3025

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYKF 915
            D+  ++DAT+ G +ARFINHSC+P  + ++++VE G+KKI I+AKR IA G+E+TY+YKF
Sbjct: 3026 DEKTIIDATRCGNLARFINHSCDPKAFARIVAVEGGEKKIVIFAKRAIAVGDEVTYDYKF 3085

Query: 916  PLEEKKIPCYCGSKKCHGSLN 936
            P+E++ I C C +  C G +N
Sbjct: 3086 PIEDEAIRCDCNAPNCIGRMN 3106


>gi|24639197|ref|NP_525040.2| trithorax-related, isoform C [Drosophila melanogaster]
 gi|22831528|gb|AAF45684.2| trithorax-related, isoform C [Drosophila melanogaster]
          Length = 2410

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2257 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2316

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2317 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2375

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 2376 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2410


>gi|28571451|ref|NP_726773.2| trithorax-related, isoform D [Drosophila melanogaster]
 gi|74865454|sp|Q8IRW8.2|TRR_DROME RecName: Full=Histone-lysine N-methyltransferase trr; AltName:
            Full=Lysine N-methyltransferase 2C; AltName:
            Full=Trithorax-related protein
 gi|28381556|gb|AAN09063.2| trithorax-related, isoform D [Drosophila melanogaster]
          Length = 2431

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2278 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2337

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2338 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2396

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 2397 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2431


>gi|412986144|emb|CCO17344.1| predicted protein [Bathycoccus prasinos]
          Length = 1990

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 71/140 (50%), Positives = 91/140 (65%), Gaps = 1/140 (0%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            KKRL F +S IH WGL++  PI+A   VI Y GE +R  I+D+RE RYE+ G    YL R
Sbjct: 1851 KKRLTFGKSAIHGWGLISKVPIKAGAMVIIYRGEAVRTPIADLREARYERDGT-DCYLLR 1909

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
             DD  VVD T  G IARF NHSC+PN YTK+I   G+  +  + +  + AG E+TYNY+F
Sbjct: 1910 ADDHTVVDCTNMGNIARFTNHSCDPNMYTKIIKSGGEHHVCFFTRVDVPAGTEMTYNYRF 1969

Query: 916  PLEEKKIPCYCGSKKCHGSL 935
             +E+ K+PCYC S  C G L
Sbjct: 1970 EIEDGKVPCYCASHNCRGYL 1989


>gi|3256105|emb|CAA15944.1| EG:63B12.3 [Drosophila melanogaster]
          Length = 2422

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2269 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2328

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2329 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2387

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 2388 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2422


>gi|198467813|ref|XP_001354516.2| GA17728 [Drosophila pseudoobscura pseudoobscura]
 gi|198146119|gb|EAL31569.2| GA17728 [Drosophila pseudoobscura pseudoobscura]
          Length = 2566

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2413 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2472

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2473 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2531

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KIPC CG+  C   +N
Sbjct: 2532 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2566


>gi|426381958|ref|XP_004057596.1| PREDICTED: uncharacterized protein LOC101126745 [Gorilla gorilla
            gorilla]
          Length = 1635

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 65/102 (63%), Positives = 81/102 (79%)

Query: 835  ISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
            ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH C PNCY KVI++E QKK
Sbjct: 1534 VADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKK 1593

Query: 895  IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1594 IVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1635


>gi|405120132|gb|AFR94903.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
            grubii H99]
          Length = 1471

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 74/156 (47%), Positives = 100/156 (64%), Gaps = 17/156 (10%)

Query: 784  AELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            ++L   +Q++ RKK+LRF RS I  +GL A+E I A + V EYVG+L+R  ++D+RE RY
Sbjct: 1330 SDLFAFNQLRIRKKQLRFARSAIEGYGLYAMETIHAGEMVCEYVGDLVRATVADVREQRY 1389

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
             K GIGSSYLFR+D+  V DAT +G ++R INHSC+P+   K+I V GQ KI        
Sbjct: 1390 LKQGIGSSYLFRIDNDIVCDATFKGSVSRLINHSCDPSANAKIIKVNGQSKIL------- 1442

Query: 904  AAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
                   Y+YKFPLE     ++PC CG+  C G LN
Sbjct: 1443 -------YDYKFPLESDPSLRVPCLCGAATCRGWLN 1471


>gi|297698601|ref|XP_002826411.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Pongo abelii]
          Length = 1576

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 65/102 (63%), Positives = 81/102 (79%)

Query: 835  ISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
            ++D+RE RY + GIGSSYLFR+D   ++DATK G +ARFINH C PNCY KVI++E QKK
Sbjct: 1475 VADMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKK 1534

Query: 895  IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I IY+K+ I   EEITY+YKFPLE+ KIPC CG++ C GSLN
Sbjct: 1535 IVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1576


>gi|194767856|ref|XP_001966030.1| GF19475 [Drosophila ananassae]
 gi|190622915|gb|EDV38439.1| GF19475 [Drosophila ananassae]
          Length = 2490

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2337 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2396

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2397 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2455

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KIPC CG+  C   +N
Sbjct: 2456 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2490


>gi|297843378|ref|XP_002889570.1| hypothetical protein ARALYDRAFT_311665 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297335412|gb|EFH65829.1| hypothetical protein ARALYDRAFT_311665 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1206

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/234 (37%), Positives = 128/234 (54%), Gaps = 13/234 (5%)

Query: 709  PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
            P   GCAR+    +   +      +L    ++R  V    Y+   Y   E    +   G 
Sbjct: 823  PNPSGCARTEPYNYLGRRGRKEPEALAGPSSKRLFVENQPYIVGGYSRHEFATYERIYGS 882

Query: 764  GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
             +S     +   N+L+ AE    +K +     +KRL F +S IH +G+ A  P  A D V
Sbjct: 883  KMS---QIITPSNILSMAEKYTFMKETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMV 935

Query: 824  IEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
            IEY GEL+R  I+D RE   Y  M    +Y+FR+D+  V+DAT+ G IA  INHSC PNC
Sbjct: 936  IEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPNC 995

Query: 883  YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            Y++VISV G + I I+AKR +A  EE+TY+Y+F   ++++ CYCG  +C G +N
Sbjct: 996  YSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVN 1049


>gi|449496792|ref|XP_004160228.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            ATX2-like [Cucumis sativus]
          Length = 1095

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 75/170 (44%), Positives = 112/170 (65%), Gaps = 5/170 (2%)

Query: 768  RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            +T ++  RN+L+ AE  + ++    +  +KRL F +S IH +G+ A  P  A D VIEY 
Sbjct: 906  KTCQLDPRNILSVAEKYKFMR----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYT 961

Query: 828  GELIRLKISDIRECRYEKMGIGS-SYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            GE++R  I+D RE     + +G+ +Y+FR+DD  V+DAT+ G IA  INHSC PNCY++V
Sbjct: 962  GEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRV 1021

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            ISV G + I I+AKR I   EE+TY+Y+F   ++++ CYCG  +C G +N
Sbjct: 1022 ISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGVVN 1071


>gi|194912727|ref|XP_001982563.1| GG12672 [Drosophila erecta]
 gi|190648239|gb|EDV45532.1| GG12672 [Drosophila erecta]
          Length = 2406

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2253 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2312

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2313 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2371

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 2372 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2406


>gi|195438882|ref|XP_002067361.1| GK16377 [Drosophila willistoni]
 gi|194163446|gb|EDW78347.1| GK16377 [Drosophila willistoni]
          Length = 2510

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2357 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2416

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2417 EAKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2475

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 2476 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2510


>gi|384246087|gb|EIE19578.1| Mll1 protein, partial [Coccomyxa subellipsoidea C-169]
          Length = 132

 Score =  154 bits (388), Expect = 3e-34,   Method: Composition-based stats.
 Identities = 75/136 (55%), Positives = 94/136 (69%), Gaps = 6/136 (4%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS +H WGL A +PI A +FVIEYVGELIR  ++D+RE    ++   S Y FR+D  +VV
Sbjct: 1   RSGVHAWGLFAHKPIPANEFVIEYVGELIRRTVNDVRE----RVDADSDYRFRVDADWVV 56

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EK 920
           DAT+RGG ARFINH C+PNC TK +SV GQ  I I +KR IA GEE++Y+YKF     E 
Sbjct: 57  DATRRGGKARFINHCCDPNCVTKTVSVGGQLHIMICSKRAIAEGEELSYDYKFQPTPGEA 116

Query: 921 KIPCYCGSKKCHGSLN 936
            IPC CG++ C   LN
Sbjct: 117 PIPCTCGARNCRKRLN 132


>gi|302786940|ref|XP_002975241.1| hypothetical protein SELMODRAFT_415365 [Selaginella moellendorffii]
 gi|300157400|gb|EFJ24026.1| hypothetical protein SELMODRAFT_415365 [Selaginella moellendorffii]
          Length = 1184

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 68/141 (48%), Positives = 98/141 (69%), Gaps = 1/141 (0%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFR 855
            +RL F +S IH WGL A EP  A D VIEY GE+IR  ++D+RE R Y  +    +Y+F 
Sbjct: 1022 RRLTFGKSAIHGWGLFAKEPHGAGDMVIEYAGEIIRPTVADVREKRCYNSLVGAGTYMFC 1081

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+  VVDAT+ G IA  INHSC PNCY++V++  G+++I I+AK+ IA G+E+TY+Y+F
Sbjct: 1082 IDNERVVDATRAGSIAHLINHSCEPNCYSRVVTTNGKERIVIFAKQDIAGGDEVTYDYRF 1141

Query: 916  PLEEKKIPCYCGSKKCHGSLN 936
                 ++PC+CG+  C G +N
Sbjct: 1142 TSIGDQLPCHCGTAGCRGIVN 1162


>gi|167533965|ref|XP_001748661.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772902|gb|EDQ86548.1| predicted protein [Monosiga brevicollis MX1]
          Length = 706

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 73/137 (53%), Positives = 98/137 (71%), Gaps = 10/137 (7%)

Query: 810 GLVA-LEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           GL+  L+ I A + VIEYVGE+IR  ++D RE RY ++ IGSSYLFR+D+  V+DAT+RG
Sbjct: 570 GLIQQLKDIPANEVVIEYVGEVIRQSVADERERRYTQVKIGSSYLFRIDELNVIDATRRG 629

Query: 869 GIARFINHSCN-------PNC--YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE 919
            IARF+NHSC+       PN   + KV     +K I IY+KR I  G+EITY+Y+FP+E+
Sbjct: 630 HIARFMNHSCDVGLPRSIPNTPFHPKVFESHDEKHIVIYSKRDIREGQEITYDYQFPMED 689

Query: 920 KKIPCYCGSKKCHGSLN 936
           +KIPCYCG++ C G LN
Sbjct: 690 EKIPCYCGAENCKGFLN 706


>gi|168012124|ref|XP_001758752.1| histone-lysine N-methyltransferase-like protein [Physcomitrella
            patens subsp. patens]
 gi|162689889|gb|EDQ76258.1| histone-lysine N-methyltransferase-like protein [Physcomitrella
            patens subsp. patens]
          Length = 2373

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 116/192 (60%), Gaps = 9/192 (4%)

Query: 752  PEVNASQWANGKGLSARTNRV-KLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWG 810
            PE  A+ W    G    T R+ K  +L    +  E L+  Q K  KK L   +S IH  G
Sbjct: 2184 PEQVAA-WLRISGRKLSTRRLHKSASLAMKFDHKEYLRFKQKKGWKK-LAVYKSIIHALG 2241

Query: 811  LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK----MGIGSSYLFRLDDGYVVDATK 866
            L   + I   + V+EYVGE++  +++D RE  Y         G+ YLFR+D   ++DAT+
Sbjct: 2242 LYTTDFIAEREVVVEYVGEIVGHRVADKREVEYHSRKRLQYQGACYLFRIDTEQIIDATR 2301

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPC 924
             GGIARF+NHSC+PNC  KVI VE  KK+  +AKR+I AGEE+TY+YKF  +E   KIPC
Sbjct: 2302 NGGIARFVNHSCSPNCVAKVICVENLKKVIFFAKRNIDAGEEVTYDYKFNYDEVGDKIPC 2361

Query: 925  YCGSKKCHGSLN 936
            +CG+ +C G+LN
Sbjct: 2362 FCGTPECRGTLN 2373


>gi|124111218|gb|ABM91999.1| MLL [Pan troglodytes]
          Length = 338

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 194 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 252

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 253 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 312

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+KKC   LN
Sbjct: 313 YKFPIEDASNKLPCNCGAKKCRKFLN 338


>gi|449441169|ref|XP_004138356.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Cucumis
            sativus]
          Length = 1036

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 75/170 (44%), Positives = 112/170 (65%), Gaps = 5/170 (2%)

Query: 768  RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
            +T ++  RN+L+ AE  + ++    +  +KRL F +S IH +G+ A  P  A D VIEY 
Sbjct: 847  KTCQLDPRNILSVAEKYKFMR----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYT 902

Query: 828  GELIRLKISDIRECRYEKMGIGS-SYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
            GE++R  I+D RE     + +G+ +Y+FR+DD  V+DAT+ G IA  INHSC PNCY++V
Sbjct: 903  GEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRV 962

Query: 887  ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            ISV G + I I+AKR I   EE+TY+Y+F   ++++ CYCG  +C G +N
Sbjct: 963  ISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGVVN 1012


>gi|326430870|gb|EGD76440.1| mixed-lineage leukemia protein [Salpingoeca sp. ATCC 50818]
          Length = 2027

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 107/155 (69%), Gaps = 3/155 (1%)

Query: 784  AELLKASQVKAR-KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR 842
            ++ ++  Q K R ++ +R   SKIH +GL A   I   + ++EYVGE+IR +++D RE  
Sbjct: 1874 SDAMQYRQCKERMRETVRVGYSKIHGFGLYAQRDISGGEMIVEYVGEVIRPELTDKREQF 1933

Query: 843  YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRH 902
            Y+   +GS Y+FR+DD  VVDAT  GG ARF+NHSC+PNC +++IS +  KKI I AK+H
Sbjct: 1934 YDARNMGS-YMFRIDDKQVVDATLTGGQARFVNHSCDPNCISRIISTDRGKKIVIVAKQH 1992

Query: 903  IAAGEEITYNYKFPL-EEKKIPCYCGSKKCHGSLN 936
            I  G+E+TY+Y+FPL ++ K+ C CG+  C G +N
Sbjct: 1993 ICKGDELTYDYQFPLDDDNKVRCSCGAANCRGFMN 2027


>gi|40225368|gb|AAH09337.2| MLL4 protein [Homo sapiens]
          Length = 795

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 651 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 709

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 710 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 769

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+K+C   LN
Sbjct: 770 YKFPIEDASNKLPCNCGAKRCRRFLN 795


>gi|120974668|gb|ABM46716.1| MLL [Gorilla gorilla]
          Length = 338

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 194 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 252

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 253 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 312

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+KKC   LN
Sbjct: 313 YKFPIEDASNKLPCNCGAKKCRKFLN 338


>gi|224128834|ref|XP_002320433.1| SET domain protein [Populus trichocarpa]
 gi|222861206|gb|EEE98748.1| SET domain protein [Populus trichocarpa]
          Length = 1050

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 100/283 (35%), Positives = 146/283 (51%), Gaps = 22/283 (7%)

Query: 672  HATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSD--GCARSSISGWEWHKWS- 728
               + L   K   + +  R V  ++      R S+  P  +  GCAR+    +   +   
Sbjct: 751  QCIRLLSFCKKHRQPSNERVVTDERVGQIPRRCSDYIPPCNLSGCARTEPYNYFGRRGRK 810

Query: 729  ----LNASPAERARVRGAQYVHTKYLGPEVNASQWA-NG---KGLSARTNRVKLR----- 775
                L A+  +R  V    Y+   Y   E +    A NG    G S+   R++       
Sbjct: 811  EPEVLAAASLKRLFVENQPYLVGGYSQHESSGCTLASNGLINSGFSSSLQRLRASQLDAP 870

Query: 776  -NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLK 834
             N+L+ AE  + ++ +     +KRL F +S IH +G+ A  P  A D VIEY GEL+R  
Sbjct: 871  SNILSMAEKYQHMRHTF----RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPP 926

Query: 835  ISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK 893
            I+D RE   Y  +    +Y+FR+DD  V+DAT+ G IA  INHSC PNCY++VISV G +
Sbjct: 927  IADRREHFIYNSLVGAGTYMFRIDDKRVIDATRAGSIAHLINHSCEPNCYSRVISVNGDE 986

Query: 894  KIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
             I I+AKR I   EE+TY+Y+F   E+K+ CYCG  +C G +N
Sbjct: 987  HIIIFAKRDIKRWEELTYDYRFFSIEEKLACYCGFSRCRGVVN 1029


>gi|195392728|ref|XP_002055009.1| GJ19139 [Drosophila virilis]
 gi|194149519|gb|EDW65210.1| GJ19139 [Drosophila virilis]
          Length = 2531

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2378 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2437

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2438 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2496

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KIPC CG+  C   +N
Sbjct: 2497 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2531


>gi|308044237|ref|NP_001182990.1| uncharacterized protein LOC100501309 [Zea mays]
 gi|238008654|gb|ACR35362.1| unknown [Zea mays]
          Length = 531

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 75/174 (43%), Positives = 113/174 (64%), Gaps = 5/174 (2%)

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           GLS    +  + N+ +  E   ++KA+     +KRL F +SKIH +G+ A    +A D +
Sbjct: 338 GLSDAVQQEAIGNVSSMVEKYTIVKAT----FRKRLTFGKSKIHGYGVFAKVAHKAGDMM 393

Query: 824 IEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           +EY+GE++R  ISD RE R Y  +    +Y+FR+DD  V+DAT+ G IAR INHSC PNC
Sbjct: 394 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSCEPNC 453

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y++ I++ G + I I+AKR I   EE+TY+Y+F   ++++PC+CG  KC G +N
Sbjct: 454 YSRAITILGDEHIIIFAKRDIDPWEELTYDYRFFSSDQRLPCFCGFPKCRGVVN 507


>gi|195132807|ref|XP_002010834.1| GI21761 [Drosophila mojavensis]
 gi|193907622|gb|EDW06489.1| GI21761 [Drosophila mojavensis]
          Length = 2748

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 2595 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 2654

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 2655 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 2713

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KIPC CG+  C   +N
Sbjct: 2714 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2748


>gi|255075355|ref|XP_002501352.1| set domain protein [Micromonas sp. RCC299]
 gi|226516616|gb|ACO62610.1| set domain protein [Micromonas sp. RCC299]
          Length = 2166

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 67/147 (45%), Positives = 98/147 (66%), Gaps = 1/147 (0%)

Query: 789  ASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGI 848
            A  ++ R++RL F +S IH WGL+A + ++A   V+EYVGE +R  ++D+RE  YE+ G 
Sbjct: 2020 AECLRLRRERLTFGKSNIHGWGLIAKQFLKAGSMVVEYVGERLRPSVADLREKVYERTGH 2079

Query: 849  GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEE 908
               YL   DD  V+D T +G IARF NHSC PN YTK+++++G  +IF + +  +  G+E
Sbjct: 2080 -DVYLLAADDKTVIDTTVKGSIARFTNHSCTPNMYTKLVALDGDSRIFFFTRIDVHPGQE 2138

Query: 909  ITYNYKFPLEEKKIPCYCGSKKCHGSL 935
            +TYNY+F  E  K+PCYCG+  C G L
Sbjct: 2139 LTYNYRFDAESGKVPCYCGANNCRGFL 2165


>gi|194380712|dbj|BAG58509.1| unnamed protein product [Homo sapiens]
          Length = 323

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 179 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 237

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 238 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 297

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+KKC   LN
Sbjct: 298 YKFPIEDASNKLPCNCGAKKCRKFLN 323


>gi|349604316|gb|AEP99904.1| Histone-lysine N-methyltransferase HRX-like protein, partial [Equus
           caballus]
          Length = 297

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 153 KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 211

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 212 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 271

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+KKC   LN
Sbjct: 272 YKFPIEDASNKLPCNCGAKKCRKFLN 297


>gi|357502739|ref|XP_003621658.1| Histone-lysine N-methyltransferase ATX2, partial [Medicago
           truncatula]
 gi|355496673|gb|AES77876.1| Histone-lysine N-methyltransferase ATX2, partial [Medicago
           truncatula]
          Length = 791

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 90/240 (37%), Positives = 133/240 (55%), Gaps = 12/240 (5%)

Query: 709 PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
           P   GCARS    +   +      +L AS  +R  V    Y+   Y    +      +G+
Sbjct: 528 PNPSGCARSEPYDYFGRRGRKEPEALAASSLKRLFVENQPYLVGGYCQHGLLNDSEPSGR 587

Query: 764 GLSAR--TNRVKLRNLLAAAEGAELLKASQVK----ARKKRLRFQRSKIHDWGLVALEPI 817
           G+ ++   +  +LR  +  A  + L  A + K      +K+L F +S+IH +G+ A  P 
Sbjct: 588 GVCSKFFCSEQRLRTSMVDAADSILTVAEKYKYMSETFRKQLAFGKSRIHGFGIFAKHPY 647

Query: 818 EAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
           +  D VIEY GEL+R  I+D RE   Y  +    +Y+FR+DD  V+DAT+ G IA  INH
Sbjct: 648 KGGDMVIEYTGELVRPPIADRRERFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINH 707

Query: 877 SCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           SC PNCY++VISV G + I I+AKR I   EE+TY+Y+F   ++++ CYCG  KC G +N
Sbjct: 708 SCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDERLSCYCGFPKCRGVVN 767


>gi|115495457|ref|NP_083550.2| histone-lysine N-methyltransferase MLL4 [Mus musculus]
 gi|341940998|sp|O08550.3|MLL4_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
            Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 4
            homolog; AltName: Full=Trithorax homolog 2; AltName:
            Full=WW domain-binding protein 7; Short=WBP-7
          Length = 2713

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2569 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2627

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2628 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2687

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2688 YKFPIEDASNKLPCNCGAKRCRRFLN 2713


>gi|56744180|dbj|BAD81031.1| mixed lineage leukemia 2 [Mus musculus]
          Length = 2713

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2569 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2627

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2628 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2687

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2688 YKFPIEDASNKLPCNCGAKRCRRFLN 2713


>gi|452822785|gb|EME29801.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 [Galdieria
           sulphuraria]
          Length = 969

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 73/138 (52%), Positives = 100/138 (72%), Gaps = 2/138 (1%)

Query: 800 RFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDG 859
           + ++S I   GL  LE +  E+FVIEY GELIR  I+DIRE  Y++  IG  Y+FRL+D 
Sbjct: 833 KVKQSGIQGLGLYTLENLPDEEFVIEYAGELIRPVIADIREKFYDRRKIGC-YMFRLNDD 891

Query: 860 YVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIF-IYAKRHIAAGEEITYNYKFPLE 918
           ++VDAT +G  ARFINHSC PNC +K+I+V+G K++  I+AKR+IAAGEE+TY+Y+F   
Sbjct: 892 FIVDATMKGNYARFINHSCEPNCRSKIITVDGDKQVIGIFAKRNIAAGEELTYDYQFEEF 951

Query: 919 EKKIPCYCGSKKCHGSLN 936
            + IPC CG+  C G +N
Sbjct: 952 GETIPCNCGAPNCRGKMN 969


>gi|225380776|gb|ACN88689.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 148

 Score =  152 bits (385), Expect = 6e-34,   Method: Composition-based stats.
 Identities = 78/146 (53%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           KA +  +   RS IH  GL   + IE  + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 4   KASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDKGIGC-Y 62

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+V+G+K I I+A R I  GEE+TY+
Sbjct: 63  MFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVINVDGRKHIVIFATRKIYKGEELTYD 122

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+EE   K+PC CG+KKC   LN
Sbjct: 123 YKFPIEEPGNKLPCNCGAKKCRKFLN 148


>gi|292621658|ref|XP_002664717.1| PREDICTED: hypothetical protein LOC566825 [Danio rerio]
          Length = 3750

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 76/136 (55%), Positives = 94/136 (69%), Gaps = 3/136 (2%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL     IEA + VIEY G +IR  ++D RE  Y+  GIG  Y+FR+DD  VV
Sbjct: 3616 RSAIHGRGLFCKRNIEAGEMVIEYSGIVIRSVLTDKREKYYDGKGIGC-YMFRIDDFDVV 3674

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
            DAT  G  ARFINHSC PNCY++VI+VEGQK I I+A R I  GEE+TY+YKFP+E+   
Sbjct: 3675 DATMHGNAARFINHSCEPNCYSRVINVEGQKHIVIFALRKIYRGEELTYDYKFPIEDASN 3734

Query: 921  KIPCYCGSKKCHGSLN 936
            K+ C CG+K+C   LN
Sbjct: 3735 KLGCNCGAKRCRRFLN 3750


>gi|395846912|ref|XP_003796132.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Otolemur
            garnettii]
          Length = 2714

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2570 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKYYDGKGIGC-Y 2628

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2629 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2688

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2689 YKFPIEDASNKLPCNCGAKRCRRFLN 2714


>gi|342209882|gb|AEL16988.1| trithorax-like protein [Phaseolus vulgaris]
          Length = 1089

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 92/256 (35%), Positives = 140/256 (54%), Gaps = 26/256 (10%)

Query: 703  RTSNLC------PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLG 751
            R + LC      P   GCARS    +   +      +L A+  +R  V    Y+   Y  
Sbjct: 814  RVAGLCSDYEPPPNPSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYIVGGYCQ 873

Query: 752  PEVNASQWANGKGLSAR--TNRVKLR--------NLLAAAEGAELLKASQVKARKKRLRF 801
                     +G+G+ ++   ++ +LR        N+L+ +E  + ++    +  +KRL F
Sbjct: 874  HGALNDPEPSGRGVCSKFFCSQQRLRTSLIDTSNNILSISEKYKYMR----ETFRKRLAF 929

Query: 802  QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGY 860
             +S+IH +G+ A  P +  D VIEY GEL+R  I+D RE   Y  +    +Y+FR+DD  
Sbjct: 930  GKSRIHGFGIFAKHPHKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDER 989

Query: 861  VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK 920
            V+DAT+ G IA  INHSC PNCY++VISV G + I I+AKR I   EE+TY+Y+F   ++
Sbjct: 990  VIDATRAGSIAHLINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE 1049

Query: 921  KIPCYCGSKKCHGSLN 936
            ++ CYCG  KC G +N
Sbjct: 1050 RLSCYCGFPKCRGIVN 1065


>gi|170058057|ref|XP_001864756.1| Mll1 protein [Culex quinquefasciatus]
 gi|167877297|gb|EDS40680.1| Mll1 protein [Culex quinquefasciatus]
          Length = 114

 Score =  152 bits (385), Expect = 7e-34,   Method: Composition-based stats.
 Identities = 69/114 (60%), Positives = 84/114 (73%), Gaps = 1/114 (0%)

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           VIEY GELIR  ++D RE  Y+  GIG  Y+F++D+ +VVDAT RG  ARFINHSC PNC
Sbjct: 2   VIEYAGELIRSTLTDKRERYYDGRGIGC-YMFKIDENFVVDATMRGNAARFINHSCEPNC 60

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y+KV+ + G K I I+A R I  GEE+TY+YKFP E+ KIPC CGSKKC   LN
Sbjct: 61  YSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDVKIPCSCGSKKCRKYLN 114


>gi|121483959|gb|ABM54292.1| MLL [Pan paniscus]
          Length = 162

 Score =  152 bits (385), Expect = 7e-34,   Method: Composition-based stats.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 18  KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 76

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 77  MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 136

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+KKC   LN
Sbjct: 137 YKFPIEDASNKLPCNCGAKKCRKFLN 162


>gi|410904194|ref|XP_003965577.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like,
           partial [Takifugu rubripes]
          Length = 109

 Score =  152 bits (385), Expect = 7e-34,   Method: Composition-based stats.
 Identities = 66/102 (64%), Positives = 83/102 (81%)

Query: 835 ISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
           I+D+RE RYE+ GIGSSY+FR+D   ++DATK G  ARFINHSCNPNCY KVI+VE QKK
Sbjct: 8   IADMREKRYEEEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKK 67

Query: 895 IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           I IY+++ I   EEITY+YKFP+E+ KIPC CG++ C G+LN
Sbjct: 68  IVIYSRQPINVNEEITYDYKFPIEDVKIPCLCGAENCRGTLN 109


>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
            occidentalis]
          Length = 2686

 Score =  152 bits (385), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 70/134 (52%), Positives = 94/134 (70%), Gaps = 1/134 (0%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL   + I   + +IEY GE+IR  + D RE  YE  G+G  Y+FR+D+  VV
Sbjct: 2554 RSGIHGRGLYCKKDIAKGEMIIEYAGEVIRASLCDRREKYYEGRGLGC-YMFRMDNDEVV 2612

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            DAT +G  ARFINHSC+PNCY+K+I+V+ +K I IYA R I  GEE+TY+YKFP+E+ K+
Sbjct: 2613 DATVKGNAARFINHSCDPNCYSKMITVDNKKHIVIYALREIRTGEELTYDYKFPIEDDKL 2672

Query: 923  PCYCGSKKCHGSLN 936
             C CGS++C   +N
Sbjct: 2673 HCTCGSRRCRKFMN 2686


>gi|302785193|ref|XP_002974368.1| hypothetical protein SELMODRAFT_101022 [Selaginella moellendorffii]
 gi|300157966|gb|EFJ24590.1| hypothetical protein SELMODRAFT_101022 [Selaginella moellendorffii]
          Length = 508

 Score =  152 bits (385), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 68/141 (48%), Positives = 98/141 (69%), Gaps = 1/141 (0%)

Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFR 855
           +RL F +S IH WGL A EP  A D VIEY GE+IR  ++D+RE R Y  +    +Y+F 
Sbjct: 346 RRLTFGKSAIHGWGLFAKEPHGAGDMVIEYAGEIIRPTVADVREKRCYNSLVGAGTYMFC 405

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +D+  VVDAT+ G IA  INHSC PNCY++V++  G+++I I+AK+ IA G+E+TY+Y+F
Sbjct: 406 IDNERVVDATRAGSIAHLINHSCEPNCYSRVVTTNGKERIVIFAKQDIAGGDEVTYDYRF 465

Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
                ++PC+CG+  C G +N
Sbjct: 466 TSIGDQLPCHCGTAGCRGIVN 486


>gi|149056302|gb|EDM07733.1| rCG63528 [Rattus norvegicus]
          Length = 2270

 Score =  152 bits (385), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2126 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2184

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2185 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2244

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2245 YKFPIEDASNKLPCNCGAKRCRRFLN 2270


>gi|47225089|emb|CAF97504.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 352

 Score =  152 bits (385), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 77/136 (56%), Positives = 95/136 (69%), Gaps = 3/136 (2%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL   + IEA + VIEY G +IR  ++D RE  Y+  GIG  Y+FR+DD  VV
Sbjct: 218 RSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREKYYDGKGIGC-YMFRIDDYEVV 276

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
           DAT  G  ARFINHSC PNCY++VI+V+G+K I I+A R I  GEE+TY+YKFP+EE   
Sbjct: 277 DATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFASRRIYQGEELTYDYKFPIEEASS 336

Query: 921 KIPCYCGSKKCHGSLN 936
           K+PC C SKKC   LN
Sbjct: 337 KLPCNCNSKKCRKFLN 352


>gi|348563138|ref|XP_003467365.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Cavia
            porcellus]
          Length = 2692

 Score =  152 bits (385), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2548 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2606

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2607 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2666

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2667 YKFPIEDASNKLPCNCGAKRCRRFLN 2692


>gi|431918577|gb|ELK17795.1| Histone-lysine N-methyltransferase MLL4 [Pteropus alecto]
          Length = 3017

 Score =  152 bits (385), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2873 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2931

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2932 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2991

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2992 YKFPIEDASNKLPCNCGAKRCRRFLN 3017


>gi|296233585|ref|XP_002807874.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4 [Callithrix jacchus]
          Length = 2660

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2516 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2574

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2575 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2634

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2635 YKFPIEDASNKLPCNCGAKRCRRFLN 2660


>gi|432100936|gb|ELK29286.1| Histone-lysine N-methyltransferase MLL4 [Myotis davidii]
          Length = 2566

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2422 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2480

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2481 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2540

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2541 YKFPIEDASNKLPCNCGAKRCRRFLN 2566


>gi|5923931|gb|AAD56420.1|AF186605_1 MLL2 protein [Homo sapiens]
          Length = 2605

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2461 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2519

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2520 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2579

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2580 YKFPIEDASNKLPCNCGAKRCRRFLN 2605


>gi|7662046|ref|NP_055542.1| histone-lysine N-methyltransferase MLL4 [Homo sapiens]
 gi|12643900|sp|Q9UMN6.1|MLL4_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
            Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein
            4; AltName: Full=Trithorax homolog 2; AltName: Full=WW
            domain-binding protein 7; Short=WBP-7
 gi|5123787|emb|CAB45385.1| trithorax homologue 2 [Homo sapiens]
          Length = 2715

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2571 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2629

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2630 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2689

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2690 YKFPIEDASNKLPCNCGAKRCRRFLN 2715


>gi|403293026|ref|XP_003937525.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
            boliviensis boliviensis]
          Length = 2665

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2521 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2579

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2580 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2639

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2640 YKFPIEDASNKLPCNCGAKRCRRFLN 2665


>gi|402905199|ref|XP_003915410.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Papio
            anubis]
          Length = 2716

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2572 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2630

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2631 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2690

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2691 YKFPIEDASNKLPCNCGAKRCRRFLN 2716


>gi|332855019|ref|XP_512597.3| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Pan
            troglodytes]
          Length = 2526

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2382 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2440

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2441 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2500

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2501 YKFPIEDASNKLPCNCGAKRCRRFLN 2526


>gi|297276803|ref|XP_001112093.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Macaca
            mulatta]
          Length = 2789

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2645 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2703

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2704 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2763

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2764 YKFPIEDASNKLPCNCGAKRCRRFLN 2789


>gi|255573673|ref|XP_002527758.1| phd finger protein, putative [Ricinus communis]
 gi|223532845|gb|EEF34619.1| phd finger protein, putative [Ricinus communis]
          Length = 1103

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 98/288 (34%), Positives = 147/288 (51%), Gaps = 22/288 (7%)

Query: 667  DGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSD--GCARSSISGWEW 724
            D +E    + L   K   + +  R V  ++      R S+  P  +  GCARS    +  
Sbjct: 796  DDVEDQCIRLLSFCKRHKQPSNERPVTEERIGRITHRYSDYIPPCNPSGCARSEPYNYFG 855

Query: 725  HKW-----SLNASPAERARVRGAQYVHTKYLGPEVNA----SQWANGKGLSARTNRVKL- 774
             +      +L A+  +R  V    Y+   Y   E +     S    G   S+    +K  
Sbjct: 856  RRGRKEPEALAAASLKRLFVENQPYLVGGYCQHESSGITLPSNGVEGSRFSSNLQWLKTS 915

Query: 775  -----RNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGE 829
                  N+++ AE  E ++    +  +KRL F +S IH +G+ A  P  A D VIEY GE
Sbjct: 916  QLDAPNNIISMAEKYEYMR----QTFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGE 971

Query: 830  LIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
            L+R  I+D RE   Y  +    +Y+FR++D  V+DAT+ G IA  INHSC PNCY++VIS
Sbjct: 972  LVRPPIADRREHFIYNSLVGAGTYMFRINDERVIDATRAGSIAHLINHSCEPNCYSRVIS 1031

Query: 889  VEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            V G + I I+AKR I   EE+TY+Y+F   ++++ CYCG  +C G +N
Sbjct: 1032 VNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 1079


>gi|301771069|ref|XP_002920938.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4-like [Ailuropoda melanoleuca]
          Length = 2611

 Score =  152 bits (384), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2467 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2525

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2526 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2585

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2586 YKFPIEDASNKLPCNCGAKRCRRFLN 2611


>gi|426388428|ref|XP_004060643.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Gorilla
            gorilla gorilla]
          Length = 2536

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2392 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2450

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2451 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2510

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2511 YKFPIEDASNKLPCNCGAKRCRRFLN 2536


>gi|351711122|gb|EHB14041.1| Histone-lysine N-methyltransferase MLL4, partial [Heterocephalus
            glaber]
          Length = 2592

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2448 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2506

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2507 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2566

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2567 YKFPIEDASNKLPCNCGAKRCRRFLN 2592


>gi|335289510|ref|XP_003127115.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Sus scrofa]
          Length = 2721

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2577 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2635

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2636 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2695

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2696 YKFPIEDASNKLPCNCGAKRCRRFLN 2721


>gi|71891784|dbj|BAA20763.3| KIAA0304 protein [Homo sapiens]
          Length = 2415

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2271 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2329

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2330 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2389

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2390 YKFPIEDASNKLPCNCGAKRCRRFLN 2415


>gi|397490588|ref|XP_003816282.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4-like [Pan paniscus]
          Length = 2776

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2632 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2690

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2691 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2750

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2751 YKFPIEDASNKLPCNCGAKRCRRFLN 2776


>gi|33990004|gb|AAH56344.1| Wbp7 protein, partial [Mus musculus]
          Length = 2013

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 1869 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 1927

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 1928 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 1987

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 1988 YKFPIEDASNKLPCNCGAKRCRRFLN 2013


>gi|359318839|ref|XP_003432729.2| PREDICTED: histone-lysine N-methyltransferase MLL4, partial [Canis
            lupus familiaris]
          Length = 2713

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2569 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2627

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2628 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2687

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2688 YKFPIEDASNKLPCNCGAKRCRRFLN 2713


>gi|358416718|ref|XP_003583467.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
          Length = 2688

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2544 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2602

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2603 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2662

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2663 YKFPIEDASNKLPCNCGAKRCRRFLN 2688


>gi|344298323|ref|XP_003420843.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4-like [Loxodonta africana]
          Length = 2200

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2056 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2114

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2115 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2174

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2175 YKFPIEDASNKLPCNCGAKRCRRFLN 2200


>gi|359075420|ref|XP_003587289.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
          Length = 2711

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2567 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2625

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2626 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2685

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2686 YKFPIEDASNKLPCNCGAKRCRRFLN 2711


>gi|224065044|ref|XP_002301643.1| SET domain protein [Populus trichocarpa]
 gi|222843369|gb|EEE80916.1| SET domain protein [Populus trichocarpa]
          Length = 1014

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 76/162 (46%), Positives = 105/162 (64%), Gaps = 5/162 (3%)

Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
           N+L+ AE  + ++    +  +KRL F +S IH +G+ A  P  A D VIEY GEL+R  I
Sbjct: 833 NILSMAEKYQHMR----QTFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPPI 888

Query: 836 SDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
           +D RE   Y  +    +Y+FR+DD  V+DAT+ G IA  INHSC PNCY++VISV G + 
Sbjct: 889 ADRRERFIYNSLVGAGTYMFRIDDKRVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 948

Query: 895 IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           I I+AKR I   EE+TY+Y+F   E+K+ CYCG  +C G +N
Sbjct: 949 IIIFAKRDIKRWEELTYDYRFFSIEEKLACYCGFPRCRGVVN 990


>gi|441627688|ref|XP_003280142.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4-like [Nomascus leucogenys]
          Length = 2433

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2289 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2347

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2348 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2407

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2408 YKFPIEDASNKLPCNCGAKRCRRFLN 2433


>gi|312371947|gb|EFR20005.1| hypothetical protein AND_20789 [Anopheles darlingi]
          Length = 4717

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 70/131 (53%), Positives = 90/131 (68%), Gaps = 1/131 (0%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            ++ K+ +   RS IH  GL     IEA + VIEY GELIR  ++D RE  Y+  GIG  Y
Sbjct: 4568 ESSKESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTLTDKRERYYDGRGIGC-Y 4626

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +F++D+ +VVDAT RG  ARFINHSC PNCY+KV+ + G K I I+A R I  GEE+TY+
Sbjct: 4627 MFKIDENFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYD 4686

Query: 913  YKFPLEEKKIP 923
            YKFP E+ KIP
Sbjct: 4687 YKFPFEDVKIP 4697


>gi|223365716|pdb|2W5Y|A Chain A, Binary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
           Domain With The Cofactor Product S-Adenosylhomocysteine.
 gi|223365717|pdb|2W5Z|A Chain A, Ternary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
           Domain With The Cofactor Product S-Adenosylhomocysteine
           And Histone Peptide
          Length = 192

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y
Sbjct: 48  KTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-Y 106

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+
Sbjct: 107 MFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYD 166

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+KKC   LN
Sbjct: 167 YKFPIEDASNKLPCNCGAKKCRKFLN 192


>gi|410905295|ref|XP_003966127.1| PREDICTED: uncharacterized protein LOC101073293 [Takifugu rubripes]
          Length = 3463

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 76/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     IEA + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 3319 KTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDSKGIGC-Y 3377

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+VEG+K I I+A R I  GEE+TY+
Sbjct: 3378 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYD 3437

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+ C CG+++C   LN
Sbjct: 3438 YKFPIEDASNKLNCNCGARRCRRFLN 3463


>gi|28277052|gb|AAH44818.1| Mll1 protein, partial [Mus musculus]
          Length = 142

 Score =  151 bits (382), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 76/143 (53%), Positives = 97/143 (67%), Gaps = 3/143 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K+ +   RS IH  GL     I+A + VIEY G +IR   +D RE  Y+  GIG  Y+FR
Sbjct: 1   KEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGC-YMFR 59

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  GEE+TY+YKF
Sbjct: 60  IDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKF 119

Query: 916 PLEE--KKIPCYCGSKKCHGSLN 936
           P+E+   K+PC CG+KKC   LN
Sbjct: 120 PIEDASNKLPCNCGAKKCRKFLN 142


>gi|348533938|ref|XP_003454461.1| PREDICTED: hypothetical protein LOC100700132 [Oreochromis niloticus]
          Length = 2924

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 76/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     IEA + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2780 KTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDGKGIGC-Y 2838

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+VEG+K I I+A R I  GEE+TY+
Sbjct: 2839 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYD 2898

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+ C CG+++C   LN
Sbjct: 2899 YKFPIEDASNKLNCNCGARRCRRFLN 2924


>gi|443723097|gb|ELU11678.1| hypothetical protein CAPTEDRAFT_179499 [Capitella teleta]
          Length = 323

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 69/134 (51%), Positives = 93/134 (69%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL ++   +  + +IEY G +IR  ++D RE  YE  GIG  Y+FR+DD +V+
Sbjct: 191 RSLIHGRGLFSIRDFDVGEMIIEYAGMVIRSSLTDSREKYYETKGIGC-YMFRIDDHFVI 249

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT  G  ARFINHSC+PNC+++VI V G K I I+A + + AGEE+TY+YKFP E+ KI
Sbjct: 250 DATVNGNAARFINHSCDPNCFSRVIDVSGVKHIIIFALKAVKAGEELTYDYKFPYEDVKI 309

Query: 923 PCYCGSKKCHGSLN 936
           PC C +K+C   LN
Sbjct: 310 PCTCQAKRCRKYLN 323


>gi|440894918|gb|ELR47236.1| Histone-lysine N-methyltransferase MLL4, partial [Bos grunniens
            mutus]
          Length = 2524

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2380 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2438

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2439 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2498

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2499 YKFPIEDASNKLPCNCGAKRCRRFLN 2524


>gi|444509617|gb|ELV09373.1| Histone-lysine N-methyltransferase MLL4, partial [Tupaia chinensis]
          Length = 2209

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2065 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 2123

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 2124 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 2183

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 2184 YKFPIEDASNKLPCNCGAKRCRRFLN 2209


>gi|26006129|dbj|BAC41407.1| mKIAA0304 protein [Mus musculus]
          Length = 1744

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 1600 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 1658

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 1659 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 1718

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            YKFP+E+   K+PC CG+K+C   LN
Sbjct: 1719 YKFPIEDASNKLPCNCGAKRCRRFLN 1744


>gi|195404288|ref|XP_002060445.1| GJ19853 [Drosophila virilis]
 gi|194156330|gb|EDW71514.1| GJ19853 [Drosophila virilis]
          Length = 1720

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 1567 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 1626

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 1627 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 1685

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KIPC CG+  C   +N
Sbjct: 1686 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 1720


>gi|168007105|ref|XP_001756249.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
            patens]
 gi|162692759|gb|EDQ79115.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
            patens]
          Length = 1900

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 70/145 (48%), Positives = 97/145 (66%), Gaps = 6/145 (4%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK----MGIGSSYL 853
            +L   +S IH  GL   + I   + V+EYVGE++  +++D RE  Y         G+ YL
Sbjct: 1756 KLGVYKSGIHALGLYTTDFIAEGEVVVEYVGEIVGSRVADKREAEYHSGKRLQYQGACYL 1815

Query: 854  FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
            FR+D   ++DAT++GGIARF+NHSC+PNC  KVI VE  KK+  +AKR I AGEE+TY+Y
Sbjct: 1816 FRIDTEQIIDATRKGGIARFVNHSCSPNCVAKVICVENLKKVVFFAKRDIYAGEEVTYDY 1875

Query: 914  KFPLEE--KKIPCYCGSKKCHGSLN 936
            KF  +E   KIPC+CG+ +C G+LN
Sbjct: 1876 KFNCDEVGDKIPCFCGTPECRGTLN 1900


>gi|170050212|ref|XP_001859676.1| mixed-lineage leukemia protein [Culex quinquefasciatus]
 gi|167871728|gb|EDS35111.1| mixed-lineage leukemia protein [Culex quinquefasciatus]
          Length = 2977

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 102/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K   R      RSKI   GL A   +E    VIEY+GE+IR+++S++RE +Y
Sbjct: 2824 KSSQYKKMKLEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRVEVSELREKQY 2883

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ VE + +I I+AKR I
Sbjct: 2884 EARNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVEREVRIIIFAKRRI 2942

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KI C CG+  C   +N
Sbjct: 2943 NRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 2977


>gi|154422490|ref|XP_001584257.1| SET domain containing protein [Trichomonas vaginalis G3]
 gi|121918503|gb|EAY23271.1| SET domain containing protein [Trichomonas vaginalis G3]
          Length = 259

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 71/147 (48%), Positives = 98/147 (66%), Gaps = 2/147 (1%)

Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
           ++++ K LRF++S IH WG+ +       + ++EY GEL+RL +++ R+  YE  G   S
Sbjct: 92  IRSQLKYLRFEKSGIHLWGVFSACYFAPGEPIVEYTGELVRLSVTEARQKYYETEGNHGS 151

Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
           Y+FRLDD   +DAT +GGIARF+NHSC+PNC T V+   GQ+ I I+AK+ I   EE+TY
Sbjct: 152 YIFRLDDDLYIDATHKGGIARFLNHSCDPNCKTCVVEAGGQRHIVIFAKKKIEPFEELTY 211

Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
           +Y  P E  EK I C CGS KC G LN
Sbjct: 212 DYNLPYESKEKAIVCLCGSPKCRGYLN 238


>gi|432909101|ref|XP_004078112.1| PREDICTED: uncharacterized protein LOC101174945 [Oryzias latipes]
          Length = 3692

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 74/136 (54%), Positives = 94/136 (69%), Gaps = 3/136 (2%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL     IEA + VIEY G +IR  ++D RE  Y+  GIG  Y+FR+DD  VV
Sbjct: 3558 RSAIHGRGLFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDGKGIGC-YMFRIDDFDVV 3616

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
            DAT  G  ARFINHSC PNCY++VI+VEG+K I I+A R I  GEE+TY+YKFP+E+   
Sbjct: 3617 DATMHGNAARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYDYKFPIEDASN 3676

Query: 921  KIPCYCGSKKCHGSLN 936
            K+ C CG+++C   LN
Sbjct: 3677 KLNCNCGARRCRRFLN 3692


>gi|432880997|ref|XP_004073754.1| PREDICTED: uncharacterized protein LOC101157226 [Oryzias latipes]
          Length = 2812

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/146 (51%), Positives = 100/146 (68%), Gaps = 3/146 (2%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     IEA + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 2668 KTSKEAVGVYRSLIHGRGLFCKRNIEAGEMVIEYAGTVIRSVLTDKREKFYDGKGIGC-Y 2726

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT +G  ARFINHSC PNCY++VI+V+G+K I I+A R I  GEE+TY+
Sbjct: 2727 MFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYRGEELTYD 2786

Query: 913  YKFPLEEK--KIPCYCGSKKCHGSLN 936
            YKFP+E++  K+ C CG+++C   LN
Sbjct: 2787 YKFPIEDEDNKLHCNCGTRRCRRFLN 2812


>gi|38565948|gb|AAH62210.1| Wbp7 protein, partial [Mus musculus]
          Length = 442

 Score =  150 bits (380), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 298 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 356

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 357 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 416

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+K+C   LN
Sbjct: 417 YKFPIEDASNKLPCNCGAKRCRRFLN 442


>gi|195477920|ref|XP_002100342.1| GE16999 [Drosophila yakuba]
 gi|194187866|gb|EDX01450.1| GE16999 [Drosophila yakuba]
          Length = 1421

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 1268 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 1327

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 1328 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 1386

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 1387 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1421


>gi|221508321|gb|EEE33908.1| SET domain-containing protein / bromodomain-containing protein,
            putative [Toxoplasma gondii VEG]
          Length = 7555

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 73/145 (50%), Positives = 97/145 (66%), Gaps = 6/145 (4%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG---IGSSYLF 854
            RL  + S IH  GL A  P+   + VIEYVG+++R  +SD+RE  YEK G    G+ Y+F
Sbjct: 7411 RLSVRSSTIHGQGLFANVPLAEGEPVIEYVGDMVRNCVSDLREALYEKQGGGGDGACYMF 7470

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
            RLDD +VVDAT+ G ++RFINHSC PNC  +++  E GQK I I AK  I AGEEITY+Y
Sbjct: 7471 RLDDNFVVDATRAGNVSRFINHSCEPNCTCRILVCEAGQKHIVIIAKTAIRAGEEITYDY 7530

Query: 914  KFPL--EEKKIPCYCGSKKCHGSLN 936
            +F +  E  K+ C CG++ C G +N
Sbjct: 7531 QFGIGNETDKLACLCGARSCLGRMN 7555


>gi|237834073|ref|XP_002366334.1| SET domain-containing protein / Bromodomain-containing protein
            [Toxoplasma gondii ME49]
 gi|211963998|gb|EEA99193.1| SET domain-containing protein / Bromodomain-containing protein
            [Toxoplasma gondii ME49]
          Length = 7551

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 73/145 (50%), Positives = 97/145 (66%), Gaps = 6/145 (4%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG---IGSSYLF 854
            RL  + S IH  GL A  P+   + VIEYVG+++R  +SD+RE  YEK G    G+ Y+F
Sbjct: 7407 RLSVRSSTIHGQGLFANVPLAEGEPVIEYVGDMVRNCVSDLREALYEKQGGGGDGACYMF 7466

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
            RLDD +VVDAT+ G ++RFINHSC PNC  +++  E GQK I I AK  I AGEEITY+Y
Sbjct: 7467 RLDDNFVVDATRAGNVSRFINHSCEPNCTCRILVCEAGQKHIVIIAKTAIRAGEEITYDY 7526

Query: 914  KFPL--EEKKIPCYCGSKKCHGSLN 936
            +F +  E  K+ C CG++ C G +N
Sbjct: 7527 QFGIGNETDKLACLCGARSCLGRMN 7551


>gi|221486558|gb|EEE24819.1| SET domain-containing protein / bromodomain-containing protein,
            putative [Toxoplasma gondii GT1]
          Length = 7565

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 73/145 (50%), Positives = 97/145 (66%), Gaps = 6/145 (4%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG---IGSSYLF 854
            RL  + S IH  GL A  P+   + VIEYVG+++R  +SD+RE  YEK G    G+ Y+F
Sbjct: 7421 RLSVRSSTIHGQGLFANVPLAEGEPVIEYVGDMVRNCVSDLREALYEKQGGGGDGACYMF 7480

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
            RLDD +VVDAT+ G ++RFINHSC PNC  +++  E GQK I I AK  I AGEEITY+Y
Sbjct: 7481 RLDDNFVVDATRAGNVSRFINHSCEPNCTCRILVCEAGQKHIVIIAKTAIRAGEEITYDY 7540

Query: 914  KFPL--EEKKIPCYCGSKKCHGSLN 936
            +F +  E  K+ C CG++ C G +N
Sbjct: 7541 QFGIGNETDKLACLCGARSCLGRMN 7565


>gi|148692059|gb|EDL24006.1| WW domain binding protein 7 [Mus musculus]
          Length = 209

 Score =  150 bits (379), Expect = 3e-33,   Method: Composition-based stats.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 65  KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 123

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 124 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 183

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+K+C   LN
Sbjct: 184 YKFPIEDASNKLPCNCGAKRCRRFLN 209


>gi|211828317|gb|AAH07353.3| MLL4 protein [Homo sapiens]
          Length = 172

 Score =  150 bits (379), Expect = 4e-33,   Method: Composition-based stats.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 28  KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 86

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 87  MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 146

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+K+C   LN
Sbjct: 147 YKFPIEDASNKLPCNCGAKRCRRFLN 172


>gi|380796799|gb|AFE70275.1| histone-lysine N-methyltransferase MLL4, partial [Macaca mulatta]
          Length = 396

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 77/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 252 KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 310

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 311 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 370

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+K+C   LN
Sbjct: 371 YKFPIEDASNKLPCNCGAKRCRRFLN 396


>gi|260836403|ref|XP_002613195.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
 gi|229298580|gb|EEN69204.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
          Length = 313

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 75/136 (55%), Positives = 94/136 (69%), Gaps = 3/136 (2%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL     I++ + VIEY G +IR  ++D RE  Y   GIG  Y+FR+DD  VV
Sbjct: 179 RSPIHGRGLFCKRNIDSGEMVIEYAGMVIRSVLTDKRENYYNSKGIGC-YMFRIDDYEVV 237

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK-- 920
           DAT  G  ARFINHSC+PNCY++VI VEG+K I I+A R I  GEE+TY+YKFP+E++  
Sbjct: 238 DATMHGNAARFINHSCDPNCYSRVIQVEGKKHIVIFAMRKIYKGEELTYDYKFPIEDQNS 297

Query: 921 KIPCYCGSKKCHGSLN 936
           KI C CGSK+C   LN
Sbjct: 298 KIDCTCGSKRCRKYLN 313


>gi|158287424|ref|XP_309452.4| AGAP011192-PA [Anopheles gambiae str. PEST]
 gi|157019644|gb|EAA05242.4| AGAP011192-PA [Anopheles gambiae str. PEST]
          Length = 2808

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 102/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++S++RE +Y
Sbjct: 2655 KSSQYKKMKQEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSEMREKQY 2714

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+ + +I I+AKR I
Sbjct: 2715 EARNRGI-YMFRLDEERVVDATLSGGLARYINHSCNPNCVTEIVEVDRELRIIIFAKRRI 2773

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KI C CG+  C   +N
Sbjct: 2774 NRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 2808


>gi|328714763|ref|XP_003245446.1| PREDICTED: hypothetical protein LOC100571907 [Acyrthosiphon pisum]
          Length = 2204

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++ D RE  Y
Sbjct: 2051 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSQLCDYREKLY 2110

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLDD  VVDAT  GG+AR+INHSCNPNC T+ + V+ + +I I+AKR I
Sbjct: 2111 EAKNRGI-YMFRLDDDRVVDATISGGLARYINHSCNPNCVTEKVEVDRELRIIIFAKRRI 2169

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            A GEE+ Y+Y+F +E  + KIPC CG+  C   +N
Sbjct: 2170 ARGEELAYDYQFDIEDDQHKIPCNCGAPNCRKWMN 2204


>gi|170055483|ref|XP_001863602.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167875425|gb|EDS38808.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1357

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 67/119 (56%), Positives = 93/119 (78%), Gaps = 2/119 (1%)

Query: 763  KGLS--ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAE 820
            +GLS  AR+N+ +L     A+  +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A+
Sbjct: 1216 QGLSREARSNQRRLLTAFGASTESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAAD 1275

Query: 821  DFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
            + VIEYVG+++R  ++D+RE +YE +GIGSSYLFR+D   ++DATK G +ARFINHSCN
Sbjct: 1276 EMVIEYVGQMVRPSVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCN 1334


>gi|195170125|ref|XP_002025864.1| GL18348 [Drosophila persimilis]
 gi|194110717|gb|EDW32760.1| GL18348 [Drosophila persimilis]
          Length = 1046

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 102/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 893  KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 952

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 953  ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 1011

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KIPC CG+  C   +N
Sbjct: 1012 YRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 1046


>gi|21064853|gb|AAM29656.1| SD13650p [Drosophila melanogaster]
          Length = 1019

 Score =  149 bits (376), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 866  KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 925

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 926  ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 984

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 985  YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1019


>gi|195564658|ref|XP_002105931.1| GD16408 [Drosophila simulans]
 gi|194203296|gb|EDX16872.1| GD16408 [Drosophila simulans]
          Length = 1076

 Score =  149 bits (376), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 923  KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 982

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 983  ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 1041

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 1042 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1076


>gi|195347745|ref|XP_002040412.1| GM18944 [Drosophila sechellia]
 gi|194121840|gb|EDW43883.1| GM18944 [Drosophila sechellia]
          Length = 898

 Score =  149 bits (376), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
           K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 745 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 804

Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
           E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 805 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 863

Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
             GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 864 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 898


>gi|358338843|dbj|GAA57433.1| histone-lysine N-methyltransferase trithorax, partial [Clonorchis
           sinensis]
          Length = 328

 Score =  149 bits (376), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 71/143 (49%), Positives = 91/143 (63%), Gaps = 1/143 (0%)

Query: 794 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
           AR  R+    S IH  GL AL     ++ VIEY+GELIR  + + RE RY   G+   Y+
Sbjct: 187 ARYLRVAVHPSHIHGRGLFALRGFREDEMVIEYMGELIRNFVCETREIRYRSAGV-DCYM 245

Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
           FR+D   V+DAT  G  ARFINHSC+PNCY KV++V+ +K I I A+R I  GEE+TY+Y
Sbjct: 246 FRIDSDLVIDATYAGNAARFINHSCDPNCYAKVVTVDDKKHIVILAQRRIYPGEELTYDY 305

Query: 914 KFPLEEKKIPCYCGSKKCHGSLN 936
           +FP E  K+ C CGS  C   LN
Sbjct: 306 RFPKESDKLLCNCGSYNCRKYLN 328


>gi|12847486|dbj|BAB27589.1| unnamed protein product [Mus musculus]
          Length = 209

 Score =  149 bits (375), Expect = 9e-33,   Method: Composition-based stats.
 Identities = 76/146 (52%), Positives = 97/146 (66%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 65  KTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-Y 123

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+
Sbjct: 124 MFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYD 183

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+PC CG+K+C    N
Sbjct: 184 YKFPIEDASNKLPCNCGAKRCRRFFN 209


>gi|399218649|emb|CCF75536.1| unnamed protein product [Babesia microti strain RI]
          Length = 2180

 Score =  149 bits (375), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 80/156 (51%), Positives = 107/156 (68%), Gaps = 5/156 (3%)

Query: 786  LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY-E 844
            L +  Q  ++  RLR + S IH  GL A + I++ + VIEY+GELIR  ISD+RE  Y +
Sbjct: 2025 LYRYIQSLSQPSRLRVRGSPIHGIGLFANQLIQSGEPVIEYIGELIRDVISDVREKFYND 2084

Query: 845  KMGI-GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRH 902
            K GI GS Y+F+L++  VVDAT+ G +ARFINHSC PNC  ++I+ E G K I I+AKR 
Sbjct: 2085 KWGIDGSCYMFKLNEQLVVDATRMGNMARFINHSCEPNCICRIITCESGLKHIVIFAKRD 2144

Query: 903  IAAGEEITYNYKFPL--EEKKIPCYCGSKKCHGSLN 936
            I  GEEITYNY+F +  E KK+ C+CG+  C G +N
Sbjct: 2145 INPGEEITYNYQFGIEAEAKKLSCHCGAPSCLGRMN 2180


>gi|303286289|ref|XP_003062434.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226455951|gb|EEH53253.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 491

 Score =  149 bits (375), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 65/139 (46%), Positives = 93/139 (66%), Gaps = 1/139 (0%)

Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
           +R  F +S IH WGL+A +P++A   VIE+ GE+++  ++D+RE  Y+   I   YL + 
Sbjct: 353 ERFTFGKSNIHGWGLIAKKPVKAGSMVIEFRGEIVKPHVADLREKAYDDANI-DCYLLKA 411

Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP 916
           D+  V+D T RG IARF NHSCNPN YTK++SV+G   I  +A+  +  GEE+TY+Y+F 
Sbjct: 412 DEKTVIDTTMRGNIARFTNHSCNPNMYTKIVSVDGSNHIIFFARVDVQPGEEMTYDYRFD 471

Query: 917 LEEKKIPCYCGSKKCHGSL 935
            E  K+PCYCG+  C G L
Sbjct: 472 AESGKVPCYCGAHNCRGFL 490


>gi|30685011|ref|NP_850170.1| histone-lysine N-methyltransferase ATX1 [Arabidopsis thaliana]
 gi|95147534|sp|Q9C5X4.2|ATX1_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX1; AltName:
            Full=Protein SET DOMAIN GROUP 27; AltName:
            Full=Trithorax-homolog protein 1; Short=TRX-homolog
            protein 1
 gi|330253475|gb|AEC08569.1| histone-lysine N-methyltransferase ATX1 [Arabidopsis thaliana]
          Length = 1062

 Score =  149 bits (375), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 93/249 (37%), Positives = 137/249 (55%), Gaps = 15/249 (6%)

Query: 696  KTKASKSRTSNLCPRSD--GCARS---SISGWEWHKW--SLNASPAERARVRGAQYVHTK 748
            + K++  +TS   P  +  GCAR+   +  G    K   +L A+ ++R  V    YV   
Sbjct: 797  RIKSATHKTSEYLPPPNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGG 856

Query: 749  YLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHD 808
            Y   E +  +  +G  +S         N+L+ AE    ++ +     +KRL F +S IH 
Sbjct: 857  YSRLEFSTYKSIHGSKVSQMNTP---SNILSMAEKYRYMRETY----RKRLAFGKSGIHG 909

Query: 809  WGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKR 867
            +G+ A  P  A D +IEY GEL+R  I+D RE   Y  M    +Y+FR+DD  V+DAT+ 
Sbjct: 910  FGIFAKLPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRT 969

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
            G IA  INHSC PNCY++VI+V G + I I+AKRHI   EE+TY+Y+F    +++ C CG
Sbjct: 970  GSIAHLINHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCG 1029

Query: 928  SKKCHGSLN 936
               C G +N
Sbjct: 1030 FPGCRGVVN 1038


>gi|12659210|gb|AAK01237.1|AF329273_1 trithorax-like protein 1 [Arabidopsis thaliana]
          Length = 1062

 Score =  149 bits (375), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 93/249 (37%), Positives = 137/249 (55%), Gaps = 15/249 (6%)

Query: 696  KTKASKSRTSNLCPRSD--GCARS---SISGWEWHKW--SLNASPAERARVRGAQYVHTK 748
            + K++  +TS   P  +  GCAR+   +  G    K   +L A+ ++R  V    YV   
Sbjct: 797  RIKSATHKTSEYLPPPNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGG 856

Query: 749  YLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHD 808
            Y   E +  +  +G  +S         N+L+ AE    ++ +     +KRL F +S IH 
Sbjct: 857  YSRLEFSTYKSIHGSKVSQMNTP---SNILSMAEKYRYMRETY----RKRLAFGKSGIHG 909

Query: 809  WGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKR 867
            +G+ A  P  A D +IEY GEL+R  I+D RE   Y  M    +Y+FR+DD  V+DAT+ 
Sbjct: 910  FGIFAKLPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRT 969

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCG 927
            G IA  INHSC PNCY++VI+V G + I I+AKRHI   EE+TY+Y+F    +++ C CG
Sbjct: 970  GSIAHLINHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCG 1029

Query: 928  SKKCHGSLN 936
               C G +N
Sbjct: 1030 FPGCRGVVN 1038


>gi|225433774|ref|XP_002268621.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Vitis
            vinifera]
          Length = 1084

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 79/196 (40%), Positives = 115/196 (58%), Gaps = 4/196 (2%)

Query: 744  YVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQV--KARKKRLRF 801
            Y   + LG  +++S  + G   S R  ++K   L A      +++      +  +KRL F
Sbjct: 866  YCQHESLGNPLSSSALS-GSKFSFRNQKIKASQLDAPKSILSMVEKYNYMRETFRKRLAF 924

Query: 802  QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGY 860
             +S IH +G+ A +P  A D VIEY GEL+R  I+D RE   Y  +    +Y+FR+DD  
Sbjct: 925  GKSGIHGFGIFAKQPHRAGDMVIEYTGELVRPSIADRRERLIYNSLVGAGTYMFRIDDER 984

Query: 861  VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK 920
            V+DAT+ G IA  INHSC PNCY++VIS  G   I I+AKR I   EE+TY+Y+F   ++
Sbjct: 985  VIDATRAGSIAHLINHSCEPNCYSRVISFNGDDHIIIFAKRDIKRWEELTYDYRFFSIDE 1044

Query: 921  KIPCYCGSKKCHGSLN 936
            ++ CYCG  +C G +N
Sbjct: 1045 QLACYCGFPRCRGVVN 1060


>gi|157109807|ref|XP_001650833.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
 gi|108878935|gb|EAT43160.1| AAEL005380-PA [Aedes aegypti]
          Length = 2874

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K   R      RSKI   GL A   +E    VIEY+GE+IR ++S++RE +Y
Sbjct: 2721 KSSQYKKMKLEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSELREKQY 2780

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+ + VE   +I I+AKR I
Sbjct: 2781 EARNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTETVEVERDLRIIIFAKRRI 2839

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KI C CG+  C   +N
Sbjct: 2840 NRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 2874


>gi|47225575|emb|CAG12058.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 480

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 76/146 (52%), Positives = 98/146 (67%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     IEA + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 336 KTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDSKGIGC-Y 394

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT  G  ARFINHSC PNCY++VI+VEG+K I I+A R I  GEE+TY+
Sbjct: 395 MFRIDDFDVVDATMHGNAARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYD 454

Query: 913 YKFPLEE--KKIPCYCGSKKCHGSLN 936
           YKFP+E+   K+ C CG+++C   LN
Sbjct: 455 YKFPIEDASNKLNCNCGARRCRRFLN 480


>gi|4582456|gb|AAD24840.1| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
          Length = 186

 Score =  148 bits (374), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 75/162 (46%), Positives = 104/162 (64%), Gaps = 6/162 (3%)

Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
           N+L+ AE    ++ +     +KRL F +S IH +G+ A  P  A D +IEY GEL+R  I
Sbjct: 6   NILSMAEKYRYMRETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMMIEYTGELVRPSI 61

Query: 836 SDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK 894
           +D RE   Y  MG G+ Y+FR+DD  V+DAT+ G IA  INHSC PNCY++VI+V G + 
Sbjct: 62  ADKREQLIYNSMGAGT-YMFRIDDERVIDATRTGSIAHLINHSCVPNCYSRVITVNGDEH 120

Query: 895 IFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           I I+AKRHI   EE+TY+Y+F    +++ C CG   C G +N
Sbjct: 121 IIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGCRGVVN 162


>gi|297826607|ref|XP_002881186.1| hypothetical protein ARALYDRAFT_902198 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297327025|gb|EFH57445.1| hypothetical protein ARALYDRAFT_902198 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1066

 Score =  148 bits (373), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 83/210 (39%), Positives = 122/210 (58%), Gaps = 8/210 (3%)

Query: 728  SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELL 787
            +L A+ ++R  V    YV   Y   E +  +  +G  +S         N+L+ AE    +
Sbjct: 841  ALAAASSKRLFVENQPYVIGGYSKIEFSTYECIHGSKVSQMNTP---SNILSMAEKYRYM 897

Query: 788  KASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKM 846
            K +     +KRL F +S IH +G+ A  P +A D +IEY GEL+R  I+D RE   Y  M
Sbjct: 898  KETY----RKRLAFGKSGIHGFGIFAKLPHKAGDMMIEYTGELVRPSIADKRERLIYNSM 953

Query: 847  GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
                +Y+FR+DD  V+DAT+ G IA  INHSC PNCY++VI+V G++ I I+AKR I   
Sbjct: 954  VGAGTYMFRIDDKRVIDATRAGSIAHLINHSCVPNCYSRVITVNGEEHIIIFAKRDIPIW 1013

Query: 907  EEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            EE+TY+Y+F    +++ C CG + C G +N
Sbjct: 1014 EELTYDYRFFSIGERLSCSCGFQGCRGVVN 1043


>gi|17861882|gb|AAL39418.1| GM10003p [Drosophila melanogaster]
          Length = 421

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 103/155 (66%), Gaps = 7/155 (4%)

Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
           K+SQ K  K+  R      RSKI   GL A   IE    +IEY+GE+IR ++S+IRE +Y
Sbjct: 268 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQY 327

Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
           E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+++ V+   +I I+AKR I
Sbjct: 328 ESKNRGI-YMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKI 386

Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
             GEE++Y+YKF +E++  KIPC CG+  C   +N
Sbjct: 387 YRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 421


>gi|302755933|ref|XP_002961390.1| hypothetical protein SELMODRAFT_164635 [Selaginella moellendorffii]
 gi|300170049|gb|EFJ36650.1| hypothetical protein SELMODRAFT_164635 [Selaginella moellendorffii]
          Length = 749

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 68/145 (46%), Positives = 100/145 (68%), Gaps = 2/145 (1%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K R+ F +S IH WGL A   IE  + V+EY GE +R  ++D+RE RY   G    Y
Sbjct: 606 KTEKSRVCFGKSGIHGWGLFARRTIEEGEIVVEYRGEQVRRSVADLREKRYRDQG-KDCY 664

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
           LF++ +  V+DAT++G I R INHSC+P+CY +++ V+G++ +I + AKR++AAGEE+TY
Sbjct: 665 LFKISEEIVIDATEKGNIGRLINHSCSPSCYARILCVDGEESRIVLIAKRNVAAGEELTY 724

Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
           +Y+F  E+KK+PC CGS  C   +N
Sbjct: 725 DYQFEEEDKKVPCLCGSDSCRQYMN 749


>gi|302798240|ref|XP_002980880.1| hypothetical protein SELMODRAFT_178454 [Selaginella moellendorffii]
 gi|300151419|gb|EFJ18065.1| hypothetical protein SELMODRAFT_178454 [Selaginella moellendorffii]
          Length = 749

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 68/145 (46%), Positives = 100/145 (68%), Gaps = 2/145 (1%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K R+ F +S IH WGL A   IE  + V+EY GE +R  ++D+RE RY   G    Y
Sbjct: 606 KTEKSRVCFGKSGIHGWGLFARRTIEEGEIVVEYRGEQVRRSVADLREKRYRDQG-KDCY 664

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
           LF++ +  V+DAT++G I R INHSC+P+CY +++ V+G++ +I + AKR++AAGEE+TY
Sbjct: 665 LFKISEEIVIDATEKGNIGRLINHSCSPSCYARILCVDGEESRIVLIAKRNVAAGEELTY 724

Query: 912 NYKFPLEEKKIPCYCGSKKCHGSLN 936
           +Y+F  E+KK+PC CGS  C   +N
Sbjct: 725 DYQFEEEDKKVPCLCGSDSCRQYMN 749


>gi|321466655|gb|EFX77649.1| hypothetical protein DAPPUDRAFT_105888 [Daphnia pulex]
          Length = 208

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 71/134 (52%), Positives = 91/134 (67%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS IH  GL     IE  + V+EY G++IR  + D RE  YE  G+G  Y+FR+D+  VV
Sbjct: 76  RSHIHGRGLFCKRDIECGEMVVEYAGQVIRSVLCDKREKEYEAKGMGC-YMFRIDEQTVV 134

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
           DAT  G  ARFINHSC PNCY++++ + G+K I I+A R I  GEE+TY+YKFPLE+ KI
Sbjct: 135 DATVHGNAARFINHSCEPNCYSRIVDIFGKKHIIIFALRRILRGEELTYDYKFPLEDVKI 194

Query: 923 PCYCGSKKCHGSLN 936
            C CGS+KC   LN
Sbjct: 195 RCTCGSRKCRKYLN 208


>gi|322792928|gb|EFZ16758.1| hypothetical protein SINV_09282 [Solenopsis invicta]
          Length = 3429

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++++ RE +Y
Sbjct: 3276 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 3335

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC  +++ VE   +I I+AKR I
Sbjct: 3336 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 3394

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            + GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 3395 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 3429


>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
            finger protein HRX) (ALL-1) (Trithorax-like protein)
            (Lysine N-methyltransferase 2A) (CXXC-type zinc finger
            protein 7) [Ciona intestinalis]
          Length = 3406

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 67/136 (49%), Positives = 94/136 (69%), Gaps = 3/136 (2%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH  GL      ++ + ++EY G++IR +++D RE  YE   IG  Y+FR+DD YVV
Sbjct: 3272 RSTIHGRGLYCKRDFDSGEMIMEYTGQIIRQELTDKREKYYESKSIGC-YMFRMDDFYVV 3330

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK-- 920
            DAT  G  ARFINHSC+PNCY++++  EG+K I I+A R I  GEE+TY+YKFP+E++  
Sbjct: 3331 DATVLGSGARFINHSCDPNCYSRIVQFEGKKHIVIFALREIYKGEELTYDYKFPIEDENH 3390

Query: 921  KIPCYCGSKKCHGSLN 936
            KI C CG++ C   +N
Sbjct: 3391 KIACTCGARLCRKFMN 3406


>gi|356568903|ref|XP_003552647.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Glycine max]
          Length = 1088

 Score =  146 bits (369), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 90/252 (35%), Positives = 136/252 (53%), Gaps = 18/252 (7%)

Query: 703  RTSNLC------PRSDGCARSSISGWEWHKW-----SLNASPAERARVRGAQYVHTKYLG 751
            R + LC      P   GCARS    +   +      +L A+  +R  V    Y+   Y  
Sbjct: 813  RVAGLCSDYEPPPNLSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 872

Query: 752  PEVNASQWANGKGLSAR--TNRVKLRNLLAAAEGAELLKASQV----KARKKRLRFQRSK 805
                 +   +G+G+ ++   ++ +LR  L     + L  + +     +  +KRL F +S+
Sbjct: 873  HGSLNNLEPSGRGVCSKFFCSQQRLRTSLVDTSNSILSISEKYTYMRETFRKRLAFGKSR 932

Query: 806  IHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDA 864
            IH +G+ A    +  D VIEY GEL+R  I+D RE   Y  +    +Y+FR+DD  V+DA
Sbjct: 933  IHGFGIFAKHAYKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDERVIDA 992

Query: 865  TKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPC 924
            T+ G IA  INHSC PNCY++VISV G + I I+AKR I   EE+TY+Y+F   ++++ C
Sbjct: 993  TRAGSIAHLINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDERLAC 1052

Query: 925  YCGSKKCHGSLN 936
            YCG  KC G +N
Sbjct: 1053 YCGFPKCRGIVN 1064


>gi|348530102|ref|XP_003452550.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like
           [Oreochromis niloticus]
          Length = 399

 Score =  146 bits (369), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 74/146 (50%), Positives = 100/146 (68%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K+ +   RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 255 KISKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYAGTVIRAVLTDKREKYYDSKGIGC-Y 313

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +FR+DD  VVDAT +G  ARFINHSC PNCY++VI+V+G+K I I+A R I  GEE+TY+
Sbjct: 314 MFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYRGEELTYD 373

Query: 913 YKFPLEEK--KIPCYCGSKKCHGSLN 936
           YKFP+E++  K+ C CG+++C   LN
Sbjct: 374 YKFPIEDESNKLLCNCGARRCRRYLN 399


>gi|297745169|emb|CBI39161.3| unnamed protein product [Vitis vinifera]
          Length = 1068

 Score =  146 bits (368), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 69/142 (48%), Positives = 94/142 (66%), Gaps = 1/142 (0%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE-CRYEKMGIGSSYLF 854
            +KRL F +S IH +G+ A +P  A D VIEY GEL+R  I+D RE   Y  +    +Y+F
Sbjct: 903  RKRLAFGKSGIHGFGIFAKQPHRAGDMVIEYTGELVRPSIADRRERLIYNSLVGAGTYMF 962

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            R+DD  V+DAT+ G IA  INHSC PNCY++VIS  G   I I+AKR I   EE+TY+Y+
Sbjct: 963  RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISFNGDDHIIIFAKRDIKRWEELTYDYR 1022

Query: 915  FPLEEKKIPCYCGSKKCHGSLN 936
            F   ++++ CYCG  +C G +N
Sbjct: 1023 FFSIDEQLACYCGFPRCRGVVN 1044


>gi|168029907|ref|XP_001767466.1| trithorax-like protein, histone-lysine N-methyltransferase
           [Physcomitrella patens subsp. patens]
 gi|162681362|gb|EDQ67790.1| trithorax-like protein, histone-lysine N-methyltransferase
           [Physcomitrella patens subsp. patens]
          Length = 924

 Score =  145 bits (367), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 66/141 (46%), Positives = 97/141 (68%), Gaps = 1/141 (0%)

Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS-SYLFR 855
           +RL F +S IH  G+   +   A D +IEY GE++R  I+DIRE R+    +G+ +Y+FR
Sbjct: 754 QRLVFGKSAIHGMGVFTKQVHYANDMIIEYAGEVVRPVIADIRERRFYDSLVGAGTYMFR 813

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +DD  VVDAT+ G IA  INHSC PNCY++ ++  G+ +I I+AKR+I  GEE+TY+Y+F
Sbjct: 814 IDDERVVDATRAGSIAHLINHSCEPNCYSRTVTASGEDRIIIFAKRNIEIGEELTYDYRF 873

Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
             +++ + CYCG   C GS+N
Sbjct: 874 MSKDEVLTCYCGCAGCRGSVN 894


>gi|27754617|gb|AAO22754.1| putative trithorax protein 1 [Arabidopsis thaliana]
          Length = 479

 Score =  145 bits (367), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 90/234 (38%), Positives = 129/234 (55%), Gaps = 13/234 (5%)

Query: 709 PRSDGCARS---SISGWEWHKW--SLNASPAERARVRGAQYVHTKYLGPEVNASQWANGK 763
           P   GCAR+   +  G    K   +L A+ ++R  V    YV   Y   E +  +  +G 
Sbjct: 229 PNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGGYSRLEFSTYKSIHGS 288

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
            +S         N+L+ AE    ++ +     +KRL F +S IH +G+ A  P  A D +
Sbjct: 289 KVSQMNTP---SNILSMAEKYRYMRETY----RKRLAFGKSGIHGFGIFAKLPHRAGDMM 341

Query: 824 IEYVGELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           IEY GEL+R  I+D RE   Y  M    +Y+FR+DD  V+DAT+ G IA  INHSC PNC
Sbjct: 342 IEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRTGSIAHLINHSCVPNC 401

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y++VI+V G + I I+AKRHI   EE+TY+Y+F    +++ C CG   C G +N
Sbjct: 402 YSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGCRGVVN 455


>gi|429329891|gb|AFZ81650.1| hypothetical protein BEWA_010670 [Babesia equi]
          Length = 3609

 Score =  145 bits (367), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 78/175 (44%), Positives = 109/175 (62%), Gaps = 5/175 (2%)

Query: 767  ARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEY 826
            +RTNR + R     +  A   +       +KRL  + S IH  GL   E I A + V+EY
Sbjct: 3435 SRTNRTRSRKQYDMSNNAMQYRYLISLPPEKRLDVKPSAIHGLGLFTTEDITAGEPVVEY 3494

Query: 827  VGELIRLKISDIRECRYEKM--GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
            VGELIR  ISD RE  Y +   G GS Y+FRLDD  +VDAT++G ++RFINHSC+PNC  
Sbjct: 3495 VGELIRDIISDKREEIYSESQGGDGSCYMFRLDDELIVDATRKGNMSRFINHSCDPNCLC 3554

Query: 885  KVISVE-GQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
            ++I+ E G K I ++AK  + AG+E+TY+Y+F +E   +K+ C CG+  C G +N
Sbjct: 3555 RIITCEYGLKHIVVFAKSDLKAGDEVTYDYQFGVESETRKLQCLCGAPNCLGRMN 3609


>gi|383848022|ref|XP_003699651.1| PREDICTED: uncharacterized protein LOC100881339 [Megachile rotundata]
          Length = 4805

 Score =  145 bits (366), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++++ RE +Y
Sbjct: 4652 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 4711

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC  +++ VE   +I I+AKR I
Sbjct: 4712 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 4770

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            + GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 4771 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 4805


>gi|328776663|ref|XP_394941.4| PREDICTED: hypothetical protein LOC411466 [Apis mellifera]
          Length = 4678

 Score =  145 bits (366), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++++ RE +Y
Sbjct: 4525 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 4584

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC  +++ VE   +I I+AKR I
Sbjct: 4585 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 4643

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            + GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 4644 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 4678


>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
          Length = 5713

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++++ RE +Y
Sbjct: 5560 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 5619

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC  +++ VE   +I I+AKR I
Sbjct: 5620 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 5678

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            + GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 5679 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5713


>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
          Length = 5622

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++++ RE +Y
Sbjct: 5469 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 5528

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC  +++ VE   +I I+AKR I
Sbjct: 5529 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 5587

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            + GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 5588 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5622


>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
          Length = 5619

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++++ RE +Y
Sbjct: 5466 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQY 5525

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC  +++ VE   +I I+AKR I
Sbjct: 5526 EARNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRI 5584

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            + GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 5585 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5619


>gi|168050289|ref|XP_001777592.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671077|gb|EDQ57635.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 907

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 73/158 (46%), Positives = 103/158 (65%), Gaps = 8/158 (5%)

Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
           E LK  Q K  KK + F +S IH WGL +   I+  + VIEY GE +R  ++D+RE RY 
Sbjct: 752 ERLKFLQ-KTEKKTVCFGKSAIHGWGLFSRRAIQEGEMVIEYRGERVRGSVADLREIRYH 810

Query: 845 KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-----EGQKKIFIYA 899
           K G    YLF++++  V+DAT +G IAR INHSC+P+CY K++       EG  +I + A
Sbjct: 811 KEG-KDCYLFKINEEIVIDATDKGNIARLINHSCDPSCYAKILDFQRDDGEGDSRIVLIA 869

Query: 900 KRHIAAGEEITYNYKFPLEE-KKIPCYCGSKKCHGSLN 936
           +++IAAGEE+TYNY+F  E+ +K+PC CGS  C   +N
Sbjct: 870 RKYIAAGEELTYNYRFDEEDTQKVPCLCGSSNCRKYMN 907


>gi|168057166|ref|XP_001780587.1| trithorax-like protein, histone-lysine N-methyltransferase
           [Physcomitrella patens subsp. patens]
 gi|162667953|gb|EDQ54570.1| trithorax-like protein, histone-lysine N-methyltransferase
           [Physcomitrella patens subsp. patens]
          Length = 902

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 66/141 (46%), Positives = 95/141 (67%), Gaps = 1/141 (0%)

Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFR 855
           +RL F +S IH  G+   +   A D +IEY GE++R  I+DIRE R Y+ +    +Y+FR
Sbjct: 732 QRLAFGKSAIHGMGVFTKQVHYANDMIIEYAGEVVRPVIADIRERRCYDSLVGAGTYMFR 791

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +DD  VVDAT  G IA  INHSC PNCY++ ++  G+ +I I+AKR+I  GEE+TY+Y+F
Sbjct: 792 IDDERVVDATHAGTIAHLINHSCEPNCYSRTVTASGEDRIIIFAKRNIEVGEELTYDYRF 851

Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
             +++ + CYCG   C GS+N
Sbjct: 852 MSKDEVLTCYCGCAGCRGSVN 872


>gi|345490044|ref|XP_001603865.2| PREDICTED: LOW QUALITY PROTEIN: hypothetical protein LOC100120205
            [Nasonia vitripennis]
          Length = 5138

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE++R +++DIRE +Y
Sbjct: 4985 KSSQYKKMKQDWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEIVRNELADIREKQY 5044

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC  + + VE + ++ I+AKR I
Sbjct: 5045 EAKNRGI-YMFRLDENRVVDATLCGGLARYINHSCNPNCVVENVEVERKLRLIIFAKRRI 5103

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
              GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 5104 LRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5138


>gi|242016925|ref|XP_002428945.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
 gi|212513774|gb|EEB16207.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
          Length = 6073

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 98/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K   R      RSKI   GL A   +E    VIEY+GE+IR ++S+ RE +Y
Sbjct: 5920 KSSQYKKMKLEWRNNVYLARSKIQGLGLYAARDLERHTMVIEYIGEIIRHELSESREKQY 5979

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  V+DAT  GG+AR+INHSCNPNC  +++ V+   +I I+AKR I
Sbjct: 5980 EARNRGI-YMFRLDEERVIDATICGGLARYINHSCNPNCVAEIVEVDRDLRIIIFAKRRI 6038

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
              GEE+ Y+YKF +E  + KIPC CG+  C   +N
Sbjct: 6039 TRGEELAYDYKFDIEDDQHKIPCLCGAPNCRKWMN 6073


>gi|355702679|gb|AES02012.1| myeloid/lymphoid or mixed-lineage leukemia [Mustela putorius furo]
          Length = 374

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 71/130 (54%), Positives = 90/130 (69%), Gaps = 3/130 (2%)

Query: 801 FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGY 860
           + RS IH  GL     I+A + VIEY G +IR  ++D RE  Y+  GIG  Y+FR+DD  
Sbjct: 246 YSRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGC-YMFRMDDFD 304

Query: 861 VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE- 919
           VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  GEE+TY+YKFP+E+ 
Sbjct: 305 VVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDA 364

Query: 920 -KKIPCYCGS 928
             K+PC CG+
Sbjct: 365 SNKLPCNCGA 374


>gi|198413251|ref|XP_002126315.1| PREDICTED: transcription factor protein, partial [Ciona intestinalis]
          Length = 1130

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ +  K+  R      RS+I   GL A   IE    VIEY+G++IR ++++ RE  Y
Sbjct: 977  KSSQYRRMKQEWRSNVYLARSRIQGLGLFASRDIEPGTMVIEYIGDIIRSEVAEKREKNY 1036

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD  Y+VDAT  GG AR+INHSCNPNC  +V++ E +KKI I + RHI
Sbjct: 1037 EAANRGV-YMFRLDSDYIVDATVTGGPARYINHSCNPNCVAEVVNFEKEKKIMIISNRHI 1095

Query: 904  AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
             +GEE+ Y+YKF  E++  KIPC CG+  C   +N
Sbjct: 1096 LSGEELNYDYKFDFEDEGNKIPCLCGAINCRKWMN 1130


>gi|302800676|ref|XP_002982095.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
 gi|300150111|gb|EFJ16763.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
          Length = 1045

 Score =  143 bits (360), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 65/139 (46%), Positives = 95/139 (68%), Gaps = 1/139 (0%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS-SYLFRL 856
            RL F +S IH WG+ A +  +A D + EY GE++R  I+DIRE R+    +G+ +Y+FR+
Sbjct: 882  RLTFGKSGIHGWGVFAKQIHKAGDMMAEYAGEIVRSNIADIREQRHYDSLVGAGTYMFRI 941

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP 916
            DD  VVDAT  G +A  INHSC PNCY+++I+V+ +  I I+AKR I   EE+TY+Y+F 
Sbjct: 942  DDERVVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFA 1001

Query: 917  LEEKKIPCYCGSKKCHGSL 935
             +  ++ C CG+ KC GS+
Sbjct: 1002 SKGAELVCNCGALKCRGSV 1020


>gi|219121873|ref|XP_002181282.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407268|gb|EEC47205.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 141

 Score =  143 bits (360), Expect = 6e-31,   Method: Composition-based stats.
 Identities = 65/117 (55%), Positives = 88/117 (75%), Gaps = 3/117 (2%)

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           VIEY GELI   +++ R+ +Y+   IGS Y+FR+D   V DATK+G +ARF+N SCNPNC
Sbjct: 2   VIEYRGELIGNAVAEKRDKQYDASKIGSDYMFRIDGSGVCDATKQGNVARFVNASCNPNC 61

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCHGSLN 936
           YTK+I+++G K+I IYAKR I  GEE++Y+YKFPLE    K++ CYCG+K C G +N
Sbjct: 62  YTKIITLDGIKRIVIYAKRDILPGEELSYDYKFPLERIEAKRVSCYCGAKDCRGYMN 118


>gi|328875267|gb|EGG23632.1| hypothetical protein DFA_05766 [Dictyostelium fasciculatum]
          Length = 1603

 Score =  142 bits (359), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 87/250 (34%), Positives = 130/250 (52%), Gaps = 30/250 (12%)

Query: 712  DGCARSSISGWEWHKWSLNASPA----------ERARVRG---------------AQYVH 746
            DGCAR+++   +    + + +PA          ER R++                 + + 
Sbjct: 1359 DGCARTTVFNRKTPIQTTSGNPADTERERKIMIERERLKSEIISFKTRERTIALLTKKIQ 1418

Query: 747  TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKI 806
            + Y   ++N       K +   +    + N L +     LL    +K    R+  + S I
Sbjct: 1419 SVYSMEKINQIGIPGLKDVLGFSTNSGIHNHLLSDNAQYLL----LKGEDSRVVVKPSSI 1474

Query: 807  HDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATK 866
            H  G  A + I A +F+IEYVGE+IR KI+D RE +Y+  G+   ++F ++   +VDAT 
Sbjct: 1475 HSLGAFAKKSIVAGEFIIEYVGEIIRQKIADERERKYQNDGV-DCFMFAIEKDVIVDATF 1533

Query: 867  RGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYC 926
            +G  ARF NHSC PN  TK+IS++G KKI I A + IA  EEITY+Y+FP E+ KI C C
Sbjct: 1534 KGNRARFANHSCEPNAKTKIISIDGVKKIIIVASKDIAKNEEITYDYQFPREKMKIKCLC 1593

Query: 927  GSKKCHGSLN 936
             S +C G LN
Sbjct: 1594 QSTRCKGYLN 1603


>gi|321469515|gb|EFX80495.1| hypothetical protein DAPPUDRAFT_51465 [Daphnia pulex]
          Length = 927

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 74/154 (48%), Positives = 98/154 (63%), Gaps = 6/154 (3%)

Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
           K+SQ K  K+  R      RSKI   GL A   IE    VIEY+GE+IR ++++ RE RY
Sbjct: 775 KSSQYKKMKQEWRHNIYLARSKIQGLGLYAARDIEKHTMVIEYIGEMIRAELAECREKRY 834

Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
           E    G  Y+FRLD+  V+DAT  GG+AR+INHSC PNC  + + VE   +I I+A R I
Sbjct: 835 EAANRGI-YMFRLDEQRVIDATLCGGLARYINHSCGPNCVAEAVEVERDLRIIIFASRRI 893

Query: 904 AAGEEITYNYKFPLE-EKKIPCYCGSKKCHGSLN 936
           A GEE++Y+YKF +E + KIPC CG+  C   +N
Sbjct: 894 ARGEELSYDYKFDIEDDHKIPCLCGAASCRKWMN 927


>gi|302821061|ref|XP_002992195.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
 gi|300139962|gb|EFJ06692.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
          Length = 1052

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 65/139 (46%), Positives = 95/139 (68%), Gaps = 1/139 (0%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS-SYLFRL 856
            RL F +S IH WG+ A +  +A D + EY GE++R  I+DIRE R+    +G+ +Y+FR+
Sbjct: 889  RLTFGKSGIHGWGVFAKQIHKAGDMMAEYAGEIVRSNIADIRERRHYDSLVGAGTYMFRI 948

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP 916
            DD  VVDAT  G +A  INHSC PNCY+++I+V+ +  I I+AKR I   EE+TY+Y+F 
Sbjct: 949  DDERVVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFA 1008

Query: 917  LEEKKIPCYCGSKKCHGSL 935
             +  ++ C CG+ KC GS+
Sbjct: 1009 SKGAELVCNCGALKCRGSV 1027


>gi|70568862|dbj|BAE06306.1| transcription factor protein [Ciona intestinalis]
          Length = 589

 Score =  142 bits (358), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
           K+SQ +  K+  R      RS+I   GL A   IE    VIEY+G++IR ++++ RE  Y
Sbjct: 436 KSSQYRRMKQEWRSNVYLARSRIQGLGLFASRDIEPGTMVIEYIGDIIRSEVAEKREKNY 495

Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
           E    G  Y+FRLD  Y+VDAT  GG AR+INHSCNPNC  +V++ E +KKI I + RHI
Sbjct: 496 EAANRGV-YMFRLDSDYIVDATVTGGPARYINHSCNPNCVAEVVNFEKEKKIMIISNRHI 554

Query: 904 AAGEEITYNYKFPLEEK--KIPCYCGSKKCHGSLN 936
            +GEE+ Y+YKF  E++  KIPC CG+  C   +N
Sbjct: 555 LSGEELNYDYKFDFEDEGNKIPCLCGAINCRKWMN 589


>gi|428171302|gb|EKX40220.1| hypothetical protein GUITHDRAFT_75734 [Guillardia theta CCMP2712]
          Length = 156

 Score =  142 bits (358), Expect = 8e-31,   Method: Composition-based stats.
 Identities = 67/161 (41%), Positives = 100/161 (62%), Gaps = 16/161 (9%)

Query: 787 LKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
           +KA+Q     +RL   RS I  +G+ A +PI   + V+EY GE IR  ++D+RE +YE  
Sbjct: 1   MKANQ----NRRLVLGRSGIEGYGIFARQPIFKGEMVVEYTGEAIRPVVADVREEQYEAA 56

Query: 847 GIGSSYLFRLDD-----------GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKI 895
           G+G+ Y +RL++             +VDAT R  I  +INH C+PNC  K++ + GQ++I
Sbjct: 57  GLGT-YFWRLEEYLGAGEPPEGRAAIVDATIRHNIGHYINHCCDPNCEAKILKINGQRRI 115

Query: 896 FIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            I A   +  GEE+TY+YK P E+KKIPC+CG+  C G++N
Sbjct: 116 IISAIHDVQFGEELTYDYKLPFEDKKIPCHCGAPTCRGTMN 156


>gi|218201707|gb|EEC84134.1| hypothetical protein OsI_30479 [Oryza sativa Indica Group]
          Length = 1057

 Score =  142 bits (358), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 73/175 (41%), Positives = 101/175 (57%), Gaps = 34/175 (19%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK---------- 845
            ++RL F +S+IH +G+ A    +A D +IEY+GEL+R  ISDIRE R             
Sbjct: 862  RRRLAFGKSRIHGFGVFAKVSHKAGDMMIEYIGELVRPPISDIRERRIYNSLVPEGYMWK 921

Query: 846  -----------------MGI-------GSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
                             MG+         +Y+FR+DD  V+DAT+ G IA  INHSC PN
Sbjct: 922  TLVKLIGLFMEFNLKALMGVIITTFLGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 981

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY++VISV G + I I+AKR I   EE+TY+Y+F   ++++PCYCG  KC G +N
Sbjct: 982  CYSRVISVLGDEHIIIFAKRDINPWEELTYDYRFVSSDQRLPCYCGFPKCRGVVN 1036


>gi|222641096|gb|EEE69228.1| hypothetical protein OsJ_28463 [Oryza sativa Japonica Group]
          Length = 1057

 Score =  142 bits (358), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 73/175 (41%), Positives = 101/175 (57%), Gaps = 34/175 (19%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK---------- 845
            ++RL F +S+IH +G+ A    +A D +IEY+GEL+R  ISDIRE R             
Sbjct: 862  RRRLAFGKSRIHGFGVFAKVSHKAGDMMIEYIGELVRPPISDIRERRIYNSLVPEGYMWK 921

Query: 846  -----------------MGI-------GSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
                             MG+         +Y+FR+DD  V+DAT+ G IA  INHSC PN
Sbjct: 922  TLVKLIGLFMEFNLKALMGVIITTFLGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 981

Query: 882  CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            CY++VISV G + I I+AKR I   EE+TY+Y+F   ++++PCYCG  KC G +N
Sbjct: 982  CYSRVISVLGDEHIIIFAKRDINPWEELTYDYRFVSSDQRLPCYCGFPKCRGVVN 1036


>gi|209877148|ref|XP_002140016.1| SET domain-containing protein [Cryptosporidium muris RN66]
 gi|209555622|gb|EEA05667.1| SET domain-containing protein [Cryptosporidium muris RN66]
          Length = 2678

 Score =  142 bits (358), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 74/145 (51%), Positives = 98/145 (67%), Gaps = 5/145 (3%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGI--GSSYLF 854
            KRL  ++SKIH +GL A E I   + ++EYVGE+IR  ++D RE  YE+ G   GS Y+F
Sbjct: 2534 KRLDVKKSKIHGYGLFAKECIHPNEPIVEYVGEVIRNSVADKREKIYEQEGQRDGSCYMF 2593

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVI-SVEGQKKIFIYAKRHIAAGEEITYNY 913
            RLD+  V+DAT  G +ARF+NH C PNC  KVI S +G K I I++K  I   EE+TY+Y
Sbjct: 2594 RLDEHRVIDATNCGNLARFMNHCCQPNCVCKVITSDDGNKHIVIFSKTEIKPDEEVTYDY 2653

Query: 914  KFPLEE--KKIPCYCGSKKCHGSLN 936
            +F LEE  +KI CYCG+  C G +N
Sbjct: 2654 QFNLEEESEKIYCYCGAPNCLGRMN 2678


>gi|312384477|gb|EFR29200.1| hypothetical protein AND_02075 [Anopheles darlingi]
          Length = 5086

 Score =  142 bits (358), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 74/155 (47%), Positives = 101/155 (65%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++S++RE +Y
Sbjct: 4933 KSSQYKKMKQEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSELREKQY 4992

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC T+ + V+ + +I I+AKR I
Sbjct: 4993 EAKNRGI-YMFRLDEERVVDATLSGGLARYINHSCNPNCVTETVEVDRELRIIIFAKRRI 5051

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE++Y+YKF +E+   KI C CG+  C   +N
Sbjct: 5052 NRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 5086


>gi|449512913|ref|XP_004175517.1| PREDICTED: histone-lysine N-methyltransferase SETD1A-like
           [Taeniopygia guttata]
          Length = 99

 Score =  142 bits (357), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 61/99 (61%), Positives = 77/99 (77%)

Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFI 897
           +RE RY + GIGSSYLFR+D   ++DATK G +ARFINH C PNCY KVI++E QKKI I
Sbjct: 1   MREKRYAQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIEAQKKIVI 60

Query: 898 YAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y+K+ I   EEITY+YKFP+E+ KIPC C ++ C G+LN
Sbjct: 61  YSKQPIGVNEEITYDYKFPIEDTKIPCLCRTESCRGTLN 99


>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
          Length = 5215

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 97/155 (62%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++++ RE +Y
Sbjct: 5062 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKKY 5121

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  VVDAT  GG+AR+INHSCNPNC  + + V+   +I I+AKR I
Sbjct: 5122 EAKNRGI-YMFRLDEERVVDATLCGGLARYINHSCNPNCVAETVEVDRDYRIIIFAKRRI 5180

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
              GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 5181 QRGEELAYDYKFDIEDDQHKISCMCGAPNCRKWMN 5215


>gi|159488034|ref|XP_001702027.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158271484|gb|EDO97302.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 2487

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 65/132 (49%), Positives = 87/132 (65%), Gaps = 5/132 (3%)

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            D GL A+EPIEA +FVIEY G  +R  + D+R+ +Y+  G    Y+F +D  +VVDAT  
Sbjct: 2358 DPGLFAVEPIEAGEFVIEYCGVRLRKPLDDVRQRQYDAAGY-MDYMFAVDGAWVVDATLA 2416

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF---PLEEKKIPC 924
            G  ARFINH C+PNC  +++ V G+KKI IY  R + AGEE+ Y+YK    P  E +IPC
Sbjct: 2417 GNAARFINHCCDPNCVARIVEVGGEKKIAIYTLRRVEAGEELAYDYKLSCVPGAE-RIPC 2475

Query: 925  YCGSKKCHGSLN 936
             CG+K C G +N
Sbjct: 2476 GCGAKNCRGFMN 2487


>gi|294948371|ref|XP_002785717.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus ATCC
            50983]
 gi|239899765|gb|EER17513.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus ATCC
            50983]
          Length = 1340

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 76/158 (48%), Positives = 97/158 (61%), Gaps = 20/158 (12%)

Query: 799  LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK-MGIGSS-YLFRL 856
            L  +RS+IH++GL A       D V+EY GE++R  ++D RE  YE+ M +G   Y+FRL
Sbjct: 1183 LEVKRSRIHNYGLFAKVRFAKGDMVVEYAGEIVRHSVADCRERYYEEEMLMGQCCYMFRL 1242

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKV------------ISVEG------QKKIFIY 898
            D+ YVVDAT RG  ARFINHSCNPNC  KV            IS  G      +K I I 
Sbjct: 1243 DEHYVVDATLRGNTARFINHSCNPNCVCKVVEDPVPLASEDDISFRGVVRASHKKHIMIV 1302

Query: 899  AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
            AK  I+AGEEITY+Y+F +E +K+ C CG+  C G LN
Sbjct: 1303 AKNDISAGEEITYDYQFAVESEKLACKCGAPNCLGRLN 1340


>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
          Length = 3474

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/155 (46%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ K  K+  R      RSKI   GL A   +E    VIEY+GE+IR ++++ RE +Y
Sbjct: 3321 KSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSELAETREKQY 3380

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  V+DAT  GG+AR+INHSCNPNC  +++ VE   +I I+AKR I
Sbjct: 3381 EARNRGI-YMFRLDEERVIDATLCGGLARYINHSCNPNCVAEIVEVERDFRIIIFAKRRI 3439

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            + GEE+ Y+YKF +E  + KI C CG+  C   +N
Sbjct: 3440 SRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 3474


>gi|145344711|ref|XP_001416870.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577096|gb|ABO95163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1782

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 112/203 (55%), Gaps = 13/203 (6%)

Query: 740  RGA---QYVHTKYLGPE----VNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQV 792
            RGA   + + +K   PE    V A + A G    A  ++V  ++ L   E  E + AS  
Sbjct: 1585 RGAFKRELIASKKYTPEEFAAVQAERKARGVN-GASDDKVVEQHGLTMREKLEQMTASY- 1642

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
                 RL F +S IH WGLVA    +A   V ++ GE  R  ++D+RE  YE  G+   Y
Sbjct: 1643 ---SDRLTFAKSNIHGWGLVAKVFHKAGSIVTQFKGETCRSTVADLRETFYEDNGV-DCY 1698

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            L + DD  VVD T +G  ARF NHSCNPN Y+K++ V+    I  +A+  +  GEE+TYN
Sbjct: 1699 LLKQDDDTVVDCTFQGNFARFTNHSCNPNMYSKIVKVDDANHIIFFARTDVRPGEELTYN 1758

Query: 913  YKFPLEEKKIPCYCGSKKCHGSL 935
            Y+F  E+ K+PCYCG+  C G L
Sbjct: 1759 YRFESEDGKVPCYCGADNCRGYL 1781


>gi|311788758|gb|ADQ12920.1| trithorax [Solanum lycopersicum]
          Length = 873

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 71/170 (41%), Positives = 106/170 (62%), Gaps = 5/170 (2%)

Query: 768 RTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYV 827
           R +++  R++++  E    +K    +   +RL F +S IH +G+ A  P +A D VIEY 
Sbjct: 684 RCSQLTSRSIVSMVEKYNYMK----ETLGQRLAFGKSGIHGFGIFAKLPQKAGDMVIEYT 739

Query: 828 GELIRLKISDIRE-CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
           GEL+R  I+D RE   Y  +    +Y+FR+DD  V+DAT+ G IA  INHSC PNCY++V
Sbjct: 740 GELVRPPIADRREHLIYNSLVGAGTYMFRIDDQRVIDATRAGSIAHLINHSCEPNCYSRV 799

Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           ISV     I I++KR I   EE+TY+Y+F   ++++ CYCG  +C G +N
Sbjct: 800 ISVNSIDHIIIFSKRDIEQWEELTYDYRFLSIDEQLACYCGFPRCRGVVN 849


>gi|294868764|ref|XP_002765683.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
           ATCC 50983]
 gi|239865762|gb|EEQ98400.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
           ATCC 50983]
          Length = 643

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 77/158 (48%), Positives = 97/158 (61%), Gaps = 20/158 (12%)

Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK-MGIGSS-YLFRL 856
           L  +RS+IH++GL A       D V+EY GE++R  I+D RE  YE+ M +G   Y+FRL
Sbjct: 486 LEVKRSRIHNYGLFAKVRFAKGDMVVEYAGEIVRHSIADCRERYYEEEMLMGQCCYMFRL 545

Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKV------------ISVEG------QKKIFIY 898
           D+ YVVDAT RG  ARFINHSCNPNC  KV            IS  G      +K I I 
Sbjct: 546 DEHYVVDATLRGNTARFINHSCNPNCVCKVVEDPVPLASEDDISFRGVVRASHKKHIMIA 605

Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           AK  I+AGEEITY+Y+F +E +K+ C CG+  C G LN
Sbjct: 606 AKNDISAGEEITYDYQFAVESEKLACKCGAPNCLGRLN 643


>gi|133902336|gb|ABO41859.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 4137

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 68/127 (53%), Positives = 86/127 (67%), Gaps = 1/127 (0%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            KA +  +   RS IH  GL   + IE  + VIEY G +IR  ++D RE  Y+  GIG  Y
Sbjct: 4007 KASRDAVGAYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDKGIGC-Y 4065

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT  G  ARFINHSC PNCY+ V++V+GQK I I+A R I  GEE+TY+
Sbjct: 4066 MFRIDDYEVVDATIHGNSARFINHSCEPNCYSHVVNVDGQKHIVIFATRRIYKGEELTYD 4125

Query: 913  YKFPLEE 919
            YKFP+EE
Sbjct: 4126 YKFPIEE 4132


>gi|340501484|gb|EGR28266.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 956

 Score =  140 bits (353), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/150 (46%), Positives = 93/150 (62%), Gaps = 2/150 (1%)

Query: 789 ASQVKARKKRLRFQR--SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
           A+ +  +KK L      SKIH +GL A +    +D VIEY+GE IR  ++D RE  Y + 
Sbjct: 807 ANYIADKKKHLNLYAGPSKIHKYGLFAKKHFLQDDIVIEYLGETIRQVLADYREKIYNQK 866

Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
           G G  Y+F+     ++DAT +G  ARF+NHSC PNC ++VI  E   KI IYAK  I  G
Sbjct: 867 GFGDCYMFKAGPNKIIDATFKGNEARFLNHSCQPNCASQVIEYEKDTKIIIYAKTEIQPG 926

Query: 907 EEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           EE+TY+Y F +EE+K+ C CG  KC G LN
Sbjct: 927 EELTYDYCFDIEEEKLVCNCGDPKCTGKLN 956


>gi|123454343|ref|XP_001314927.1| SET domain containing protein [Trichomonas vaginalis G3]
 gi|121897588|gb|EAY02704.1| SET domain containing protein [Trichomonas vaginalis G3]
          Length = 486

 Score =  140 bits (352), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 66/140 (47%), Positives = 94/140 (67%), Gaps = 2/140 (1%)

Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
           +RF++SKI  +G+    PI   + VIEY+GE+IR  I+D R+  YEKMG   +Y+F+ D 
Sbjct: 293 IRFEKSKIEGFGVKTTIPIRKGEKVIEYIGEVIRPIIADKRQINYEKMGNHGTYVFKADS 352

Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL- 917
            + +DAT RGGIAR+INHSC+PNC +++I + G+  + + A + I   EE+TY+YK P  
Sbjct: 353 DHYLDATFRGGIARWINHSCDPNCESRIIKLNGRFAVVLVAIKDINPCEELTYDYKLPYE 412

Query: 918 -EEKKIPCYCGSKKCHGSLN 936
            E+K I C CGS  C G LN
Sbjct: 413 PEDKAIKCLCGSPNCRGWLN 432


>gi|221057732|ref|XP_002261374.1| SET-domain protein [Plasmodium knowlesi strain H]
 gi|194247379|emb|CAQ40779.1| SET-domain protein, putative [Plasmodium knowlesi strain H]
          Length = 6442

 Score =  140 bits (352), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 71/142 (50%), Positives = 95/142 (66%), Gaps = 4/142 (2%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            RL  ++S IH +GL   E I   + VIEY+GE IR  ISD RE  Y+K+   S Y+FRL+
Sbjct: 6302 RLYVKKSSIHGYGLYTCEFINEGEPVIEYIGEYIRNIISDKREKYYDKIE-SSCYMFRLN 6360

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
            +  ++DATK G ++RFINHSC PNC+ K++S +   K I I+AKR I A EEITY+Y+F 
Sbjct: 6361 ENIIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIVAHEEITYDYQFG 6420

Query: 917  LEE--KKIPCYCGSKKCHGSLN 936
            +E   KK+ C CGS  C G +N
Sbjct: 6421 VESEGKKLICLCGSSTCLGRMN 6442


>gi|168027193|ref|XP_001766115.1| trithorax-like protein, histone-lysine N-methyltransferase
           [Physcomitrella patens subsp. patens]
 gi|162682758|gb|EDQ69174.1| trithorax-like protein, histone-lysine N-methyltransferase
           [Physcomitrella patens subsp. patens]
          Length = 514

 Score =  139 bits (351), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 71/186 (38%), Positives = 109/186 (58%), Gaps = 5/186 (2%)

Query: 752 PEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGL 811
           P    + + NG  L +        ++L+ ++    +K S      +RL F +S IH  G+
Sbjct: 303 PPPKVAGYTNGGSLWSLHWEASKGSILSMSDKFRRMKGSL----SQRLVFGKSAIHGMGV 358

Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGS-SYLFRLDDGYVVDATKRGGI 870
                  A D +IEY GE++R  ++D RE R+    +G+ +Y+FR+DD  VVDAT  G I
Sbjct: 359 FTKRVHYANDMIIEYAGEVVRPVVADSRERRHYDSLVGAGTYMFRIDDERVVDATTTGSI 418

Query: 871 ARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKK 930
           A  INHSC PNCY++ ++  G+ +I I+AKR +  GEE+TY+Y+F  + + + CYCGS  
Sbjct: 419 AHLINHSCEPNCYSRTVTASGEDRIIIFAKRDLEIGEELTYDYRFMSKTEVLTCYCGSAG 478

Query: 931 CHGSLN 936
           C GS+N
Sbjct: 479 CRGSVN 484


>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 3691

 Score =  139 bits (351), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 68/143 (47%), Positives = 94/143 (65%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 3550 KSNVYLARSRIQGLGLYAARDIEKCTMVIEYIGTIIRSEVANRKERLYESQNRGV-YMFR 3608

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ YV+DAT  GG AR+INHSC+PNC T+V+SVE + KI I + R I  GEE++Y+YKF
Sbjct: 3609 IDNDYVIDATITGGPARYINHSCSPNCITEVVSVEKENKIIISSCRRIQRGEELSYDYKF 3668

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
             LE  + KIPC+CG+  C   +N
Sbjct: 3669 DLEDDQHKIPCHCGAVNCRKWMN 3691


>gi|401409540|ref|XP_003884218.1| Multidomain chromatinic protein with the following architecture: 3x
            PHD-bromo-3xPHD-SET domain and associated cysteine
            cluster at the C-terminus, related [Neospora caninum
            Liverpool]
 gi|325118636|emb|CBZ54187.1| Multidomain chromatinic protein with the following architecture: 3x
            PHD-bromo-3xPHD-SET domain and associated cysteine
            cluster at the C-terminus, related [Neospora caninum
            Liverpool]
          Length = 6755

 Score =  139 bits (350), Expect = 6e-30,   Method: Composition-based stats.
 Identities = 73/145 (50%), Positives = 97/145 (66%), Gaps = 6/145 (4%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG---IGSSYLF 854
            RL  + S IH  GL A  P+   + VIEYVG+++R  +SD+RE  YEK G    G+ Y+F
Sbjct: 6611 RLSVRSSTIHGQGLFANVPLAEGEPVIEYVGDMVRNCVSDLREALYEKQGGGGDGACYMF 6670

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
            RLDD +VVDAT+ G ++RFINHSC PNC  +++  E GQK I I AK  I AGEEITY+Y
Sbjct: 6671 RLDDNFVVDATRAGNVSRFINHSCEPNCTCRILVCEAGQKHIVIIAKTAIRAGEEITYDY 6730

Query: 914  KFPL--EEKKIPCYCGSKKCHGSLN 936
            +F +  E  K+ C CG++ C G +N
Sbjct: 6731 QFGIGNETDKLACLCGARTCLGRMN 6755


>gi|156101223|ref|XP_001616305.1| SET domain containing protein [Plasmodium vivax Sal-1]
 gi|148805179|gb|EDL46578.1| SET domain containing protein [Plasmodium vivax]
          Length = 6587

 Score =  139 bits (350), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 71/142 (50%), Positives = 95/142 (66%), Gaps = 4/142 (2%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            RL  ++S IH +GL   E I   + VIEY+GE IR  ISD RE  Y+K+   S Y+FRL+
Sbjct: 6447 RLYVKKSSIHGYGLYTCEFINEGEPVIEYIGEYIRNIISDKREKYYDKIE-SSCYMFRLN 6505

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
            +  ++DATK G ++RFINHSC PNC+ K++S +   K I I+AKR I A EEITY+Y+F 
Sbjct: 6506 ENIIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIVAHEEITYDYQFG 6565

Query: 917  LEE--KKIPCYCGSKKCHGSLN 936
            +E   KK+ C CGS  C G +N
Sbjct: 6566 VESEGKKLICLCGSSTCLGRMN 6587


>gi|449015700|dbj|BAM79102.1| similar to Trithorax protein [Cyanidioschyzon merolae strain 10D]
          Length = 1910

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 71/170 (41%), Positives = 104/170 (61%), Gaps = 6/170 (3%)

Query: 771  RVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGEL 830
            R   R + A A    L +  Q++ RK  +   RS+IH +GL A   ++A+  ++EY GE 
Sbjct: 1743 RDAWRVVSATATDRRLYRQMQLEHRKYCVPL-RSRIHGFGLFARCDLQADQMIVEYAGER 1801

Query: 831  IRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE 890
            I   ++D+RE  YE+ GIG  Y+FR+D  +V+DAT  G +AR+INHSC PNCY++++ + 
Sbjct: 1802 IASPVADVRERHYERRGIGC-YMFRIDADWVIDATMIGSVARYINHSCRPNCYSEILRIS 1860

Query: 891  GQKK---IFIYAKRHIAAGEEITYNYKFPLEEK-KIPCYCGSKKCHGSLN 936
             + K   I I + R I  GEE+TYNY F ++   KIPC CG   C G +N
Sbjct: 1861 DEPKQDVIVIRSARAIKRGEELTYNYMFDVDNAMKIPCLCGEPGCVGFMN 1910


>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
            latipes]
          Length = 4455

 Score =  139 bits (349), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 75/165 (45%), Positives = 104/165 (63%), Gaps = 7/165 (4%)

Query: 778  LAAAEGAELLKASQ---VKAR-KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRL 833
            +AAA      KA+Q   +KA  K  +   RS+I   GL A   IE    VIEY+G +IR 
Sbjct: 4292 VAAAHQGRHSKAAQYRRMKAEWKSNVYLARSRIQGLGLYAARDIEKCTMVIEYIGAIIRS 4351

Query: 834  KISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK 893
            ++++ +E  YE    G  Y+FR+D+ YV+DAT  GG AR+INHSC PNC T+V++VE + 
Sbjct: 4352 EVANRKERLYESQNRGV-YMFRIDNDYVIDATITGGPARYINHSCAPNCITEVVNVEKEN 4410

Query: 894  KIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            KI I + R I  GEE++Y+YKF LE  + KIPC+CG+  C   +N
Sbjct: 4411 KIIISSCRRIQRGEELSYDYKFDLEDDQHKIPCHCGAVNCRKWMN 4455


>gi|302846845|ref|XP_002954958.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
 gi|300259721|gb|EFJ43946.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
          Length = 128

 Score =  139 bits (349), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 64/130 (49%), Positives = 89/130 (68%), Gaps = 5/130 (3%)

Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
           GL A+EPIE  +FVIEY G+LIR  + D+R+ +Y+  G  + Y+F +DD +VVDAT  G 
Sbjct: 1   GLFAVEPIEPSEFVIEYCGQLIRKPLDDVRQRQYDAAGY-ADYMFAVDDNWVVDATLAGN 59

Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF---PLEEKKIPCYC 926
            ARFINH C+PNC  +++ V G+K+I IY  R + AGEE+ Y+YK    P  E +IPC C
Sbjct: 60  AARFINHCCDPNCVARIVEVGGEKRIAIYTLRRVEAGEELHYDYKLSSVPGAE-RIPCGC 118

Query: 927 GSKKCHGSLN 936
           G++ C G +N
Sbjct: 119 GARNCKGFMN 128


>gi|67586322|ref|XP_665182.1| SET-domain protein [Cryptosporidium hominis TU502]
 gi|54655690|gb|EAL34953.1| SET-domain protein [Cryptosporidium hominis]
          Length = 209

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/176 (43%), Positives = 112/176 (63%), Gaps = 9/176 (5%)

Query: 766 SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
           S++ N+++L ++       +L +   +    KRL  ++S IH +GL A E I+  + +IE
Sbjct: 38  SSKRNKIQLEDMGPT----KLYRYLDLLPYDKRLNIKKSSIHGFGLFAKELIKTGEPIIE 93

Query: 826 YVGELIRLKISDIRECRYEKMG--IGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
           YVGELIR  ++D RE  Y+  G   GS Y+FRLD+  V+DAT  G  ARF+NH C+PN  
Sbjct: 94  YVGELIRNSVADKRESLYKSNGNRDGSCYMFRLDESSVIDATNIGNHARFMNHCCDPNSI 153

Query: 884 TKVISVEGQKK-IFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
            KVIS++ Q K I I++K+ I   EEITY+Y+F +EE  +KI C+CG+  C G +N
Sbjct: 154 CKVISIDSQNKHIVIFSKKTINKDEEITYDYQFNVEEASEKIICHCGASNCLGRMN 209


>gi|356554223|ref|XP_003545448.1| PREDICTED: uncharacterized protein LOC100812602 [Glycine max]
          Length = 1985

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 68/145 (46%), Positives = 95/145 (65%), Gaps = 5/145 (3%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
            K L   +S+IH  GL     I   + V+EY+GE++ L+++D RE  Y+   K+   ++ Y
Sbjct: 1841 KHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKTACY 1900

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
             FR+D  +++DAT++GGIARF+NHSC PNC  KVI+V  +KK+   A+R I  GEEITY+
Sbjct: 1901 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYD 1960

Query: 913  YKFPLE-EKKIPCYCGSKKCHGSLN 936
            Y F  E E KIPCYC SK C   +N
Sbjct: 1961 YHFNHEDEGKIPCYCNSKNCRRYMN 1985


>gi|296082099|emb|CBI21104.3| unnamed protein product [Vitis vinifera]
          Length = 1111

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 69/146 (47%), Positives = 95/146 (65%), Gaps = 6/146 (4%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
            K L   +S IH  GL     I     V+EYVGE++ L+++D RE  Y+   K+   ++ Y
Sbjct: 966  KHLVVYKSGIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQSGRKLQYKTACY 1025

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
             FR+D  +++DAT++GGIARF+NHSC PNC  KVISV  +KK+  +A+R I  GEEITY+
Sbjct: 1026 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEITYD 1085

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            Y F  E+  KKIPC+C S+ C   LN
Sbjct: 1086 YHFNHEDEGKKIPCFCNSRNCRRYLN 1111


>gi|196004646|ref|XP_002112190.1| hypothetical protein TRIADDRAFT_50258 [Trichoplax adhaerens]
 gi|190586089|gb|EDV26157.1| hypothetical protein TRIADDRAFT_50258 [Trichoplax adhaerens]
          Length = 152

 Score =  137 bits (345), Expect = 3e-29,   Method: Composition-based stats.
 Identities = 68/141 (48%), Positives = 90/141 (63%), Gaps = 1/141 (0%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K+ +   RS IH  GL     IEA + ++EY G  IR  ++D +E  YE  G+G  Y+FR
Sbjct: 13  KETVGVYRSDIHGRGLFCKRIIEAGEMIMEYAGTKIRSILTDKQEKYYEGKGLGC-YMFR 71

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +D   V+DAT  G  ARFINHSC+PNCY K++ ++  KKI I+A R I  GEE+TY+Y F
Sbjct: 72  IDRDEVIDATLSGNAARFINHSCDPNCYCKIVPIDNIKKIIIFALRRIYPGEELTYDYNF 131

Query: 916 PLEEKKIPCYCGSKKCHGSLN 936
             E+ K+PC C SKKC   LN
Sbjct: 132 AKEDIKLPCNCSSKKCRKFLN 152


>gi|384244737|gb|EIE18235.1| Mll1 protein, partial [Coccomyxa subellipsoidea C-169]
          Length = 137

 Score =  137 bits (345), Expect = 3e-29,   Method: Composition-based stats.
 Identities = 61/136 (44%), Positives = 91/136 (66%), Gaps = 3/136 (2%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           +S IH WGL A + ++A+  VIEY G+++R   +D RE RY+  G    YLF + D +V+
Sbjct: 3   QSGIHGWGLFARKAMKADSMVIEYRGDVLRRTAADAREKRYQAAG-KDCYLFTVQDDFVI 61

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EK 920
           DAT RGGI RF NH C+P+ YTKV+ +EG  ++  +A+  +  G+E+TY+Y+F  E  + 
Sbjct: 62  DATMRGGIGRFTNHCCSPSMYTKVLEIEGTPRLVFFARFDLVPGQELTYDYRFKEEDGDN 121

Query: 921 KIPCYCGSKKCHGSLN 936
           K+PC CG+  C G+LN
Sbjct: 122 KLPCGCGAPNCRGTLN 137


>gi|403271117|ref|XP_003927487.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
           boliviensis boliviensis]
          Length = 206

 Score =  137 bits (344), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 81/179 (45%), Positives = 108/179 (60%), Gaps = 8/179 (4%)

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQV----KARKKRLRFQRSKIHDWGLVALEPIEA 819
           G S  + +V+LR+   A    EL  A Q     K  K+ +   RS IH   L     I+A
Sbjct: 30  GKSLPSAQVQLRSTRRATS-LELPMAMQFRHLKKTSKESVGVYRSTIHRRALFCKGDIDA 88

Query: 820 EDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
            + +I+Y G +IR  ++D RE  Y+  GI   Y+FR+DD  V+DAT  G  ARFINHSC 
Sbjct: 89  GEMIIDYSGIVIRSVLTDKREKFYDGKGI-RCYMFRMDDFDVLDATMHGHAARFINHSCE 147

Query: 880 PNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           PNC+++VI VEGQK I I+A R I  GEE+TY+YKFP+E+   K+PC  G K+C   LN
Sbjct: 148 PNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNYGGKRCRRFLN 206


>gi|302787360|ref|XP_002975450.1| hypothetical protein SELMODRAFT_174869 [Selaginella moellendorffii]
 gi|300157024|gb|EFJ23651.1| hypothetical protein SELMODRAFT_174869 [Selaginella moellendorffii]
          Length = 582

 Score =  137 bits (344), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 66/146 (45%), Positives = 96/146 (65%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K R+ F +S IH WGL A   I+  + V+EY GE IR  ++D+RE RY   G    Y
Sbjct: 438 KTEKTRVCFGKSGIHGWGLFARRYIKEGEMVVEYRGERIRRSVADLREKRYCLEG-KHCY 496

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK--IFIYAKRHIAAGEEIT 910
           LF++ +  V+DAT+ G I R INHSC PNCY +++SVEG+ +  I + A++ ++ GEE+T
Sbjct: 497 LFKISEEVVIDATENGNIGRLINHSCEPNCYARIVSVEGEGESHIVLIARKDVSVGEELT 556

Query: 911 YNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y+Y+F  E+KK+ C CGS +C   +N
Sbjct: 557 YDYQFDKEDKKVLCLCGSSRCRRFMN 582


>gi|313239304|emb|CBY14252.1| unnamed protein product [Oikopleura dioica]
          Length = 2763

 Score =  137 bits (344), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 69/147 (46%), Positives = 93/147 (63%), Gaps = 3/147 (2%)

Query: 792  VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
            V+  +  L+   S IH  GL A  P E  + VIEY G +IR ++ D+RE  Y+  GIG+ 
Sbjct: 2618 VETSRNSLQVLPSAIHGRGLFARRPYEPGELVIEYSGTVIRGELCDMREKYYDDRGIGT- 2676

Query: 852  YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
            Y+FR+DD  VVDAT  GG ARFINHSC PNC +K+I+V+ +K I I A R I  GEE+TY
Sbjct: 2677 YMFRVDDDLVVDATMEGGRARFINHSCGPNCLSKIITVDSRKHICIIAGRFIDFGEELTY 2736

Query: 912  NYKFPLE--EKKIPCYCGSKKCHGSLN 936
            +YKF  +   ++I C C +  C   +N
Sbjct: 2737 DYKFDRDFGSERIECGCQAPNCRRFMN 2763


>gi|302761298|ref|XP_002964071.1| hypothetical protein SELMODRAFT_142359 [Selaginella moellendorffii]
 gi|300167800|gb|EFJ34404.1| hypothetical protein SELMODRAFT_142359 [Selaginella moellendorffii]
          Length = 564

 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 66/146 (45%), Positives = 96/146 (65%), Gaps = 3/146 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K R+ F +S IH WGL A   I+  + V+EY GE IR  ++D+RE RY   G    Y
Sbjct: 420 KTEKTRVCFGKSGIHGWGLFARRYIKEGEMVVEYRGERIRRSVADLREKRYCLEG-KHCY 478

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK--IFIYAKRHIAAGEEIT 910
           LF++ +  V+DAT+ G I R INHSC PNCY +++SVEG+ +  I + A++ ++ GEE+T
Sbjct: 479 LFKISEEVVIDATENGNIGRLINHSCEPNCYARIVSVEGEGESHIVLIARKDVSVGEELT 538

Query: 911 YNYKFPLEEKKIPCYCGSKKCHGSLN 936
           Y+Y+F  E+KK+ C CGS +C   +N
Sbjct: 539 YDYQFDKEDKKVLCLCGSSRCRRFMN 564


>gi|449445828|ref|XP_004140674.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
            sativus]
 gi|449487413|ref|XP_004157614.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
            sativus]
          Length = 1055

 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 70/167 (41%), Positives = 104/167 (62%), Gaps = 13/167 (7%)

Query: 782  EGAELLKASQVKAR--------KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRL 833
            EGA+    S  K R        K R+ F +S IH WGL A   ++  + V+EY GE +R 
Sbjct: 890  EGADPKSFSSFKERLQYLQRTEKDRVCFGKSGIHGWGLFARRNVQEGEMVVEYRGEQVRR 949

Query: 834  KISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQ 892
             ++D+RE RY+  G    YLF++ +  V+DAT++G IAR INHSC PNCY +++SV + +
Sbjct: 950  SVADLREARYQLEG-KDCYLFKISEEVVIDATEKGNIARLINHSCMPNCYARIMSVGDNE 1008

Query: 893  KKIFIYAKRHIAAGEEITYNYKF---PLEEKKIPCYCGSKKCHGSLN 936
             +I + AK ++AAGEE+TY+Y F    L+E K+PC+C +  C   +N
Sbjct: 1009 SRIVLIAKTNVAAGEELTYDYLFDPDELDELKVPCHCNAPNCRKFMN 1055


>gi|356562088|ref|XP_003549306.1| PREDICTED: uncharacterized protein LOC100816713 [Glycine max]
          Length = 992

 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 69/145 (47%), Positives = 95/145 (65%), Gaps = 5/145 (3%)

Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
           K L   +S+IH  GL     I   + V+EY+GE++ L+++D RE  Y+   K+   S+ Y
Sbjct: 848 KHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKSACY 907

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            FR+D  +++DAT++GGIARF+NHSC PNC  KVI+V  +KK+   A+R I  GEEITY+
Sbjct: 908 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYD 967

Query: 913 YKFPLE-EKKIPCYCGSKKCHGSLN 936
           Y F  E E KIPCYC SK C   +N
Sbjct: 968 YHFNHEDEGKIPCYCYSKNCRRYMN 992


>gi|225430418|ref|XP_002283013.1| PREDICTED: histone-lysine N-methyltransferase ATX1-like [Vitis
           vinifera]
          Length = 496

 Score =  136 bits (343), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 72/158 (45%), Positives = 99/158 (62%), Gaps = 7/158 (4%)

Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
           E  +  Q K  K  L   +S IH  GL     I     V+EYVGE++ L+++D RE  Y+
Sbjct: 340 EFARYKQAKGWK-HLVVYKSGIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQ 398

Query: 845 ---KMGIGSS-YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
              K+   ++ Y FR+D  +++DAT++GGIARF+NHSC PNC  KVISV  +KK+  +A+
Sbjct: 399 SGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAE 458

Query: 901 RHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           R I  GEEITY+Y F  E+  KKIPC+C S+ C   LN
Sbjct: 459 RDINPGEEITYDYHFNHEDEGKKIPCFCNSRNCRRYLN 496


>gi|195445276|ref|XP_002070254.1| GK11134 [Drosophila willistoni]
 gi|194166339|gb|EDW81240.1| GK11134 [Drosophila willistoni]
          Length = 290

 Score =  136 bits (343), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 62/118 (52%), Positives = 86/118 (72%)

Query: 762 GKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAED 821
           G    AR+N+ +      +   +ELLK +Q+K RKK+L+F +S IHDWGL A+EPI A++
Sbjct: 169 GISREARSNQRRQLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADE 228

Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCN 879
             IEYVG++IR  ++D+RE +YE +GIGSSYLFR+D   ++DATK G + RFINHSCN
Sbjct: 229 MAIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLERFINHSCN 286


>gi|255635862|gb|ACU18278.1| unknown [Glycine max]
          Length = 245

 Score =  136 bits (343), Expect = 5e-29,   Method: Composition-based stats.
 Identities = 69/145 (47%), Positives = 95/145 (65%), Gaps = 5/145 (3%)

Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
           K L   +S+IH  GL     I   + V+EY+GE++ L+++D RE  Y+   K+   S+ Y
Sbjct: 101 KHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKSACY 160

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            FR+D  +++DAT++GGIARF+NHSC PNC  KVI+V  +KK+   A+R I  GEEITY+
Sbjct: 161 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYD 220

Query: 913 YKFPLE-EKKIPCYCGSKKCHGSLN 936
           Y F  E E KIPCYC SK C   +N
Sbjct: 221 YHFNHEDEGKIPCYCYSKNCRRYMN 245


>gi|357127667|ref|XP_003565500.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like
           [Brachypodium distachyon]
          Length = 972

 Score =  136 bits (342), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 65/148 (43%), Positives = 95/148 (64%), Gaps = 5/148 (3%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K   KR+   RS +H WGL A   I+    VIEY G+ +R  ++D+RE RY +      Y
Sbjct: 826 KTENKRVSCGRSGVHGWGLFAARKIQEGQMVIEYRGDQVRQSVADLREARYHREN-KDCY 884

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
           LF++ +  V+DAT+RG IAR INHSC PNCY +++SV + + +I + AKR ++AGEE+TY
Sbjct: 885 LFKISEDVVIDATERGNIARLINHSCMPNCYARIVSVGDDKSQIILIAKRDVSAGEELTY 944

Query: 912 NYKFPL---EEKKIPCYCGSKKCHGSLN 936
           +YKF     E++K+ C C +  C G +N
Sbjct: 945 DYKFDRDESEDRKVLCLCKAPNCRGYMN 972


>gi|123478084|ref|XP_001322206.1| SET domain containing protein [Trichomonas vaginalis G3]
 gi|121905048|gb|EAY09983.1| SET domain containing protein [Trichomonas vaginalis G3]
          Length = 351

 Score =  136 bits (342), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 64/141 (45%), Positives = 93/141 (65%), Gaps = 3/141 (2%)

Query: 799 LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
           L F++S+I  WG+ +   I+    V EYVGE+IR  ++D R+   EK G   +Y+F+LD 
Sbjct: 166 LMFEKSEIEGWGVRSTCSIDKNQIVAEYVGEIIRPVVADKRQVYNEKHGNHGTYIFKLDS 225

Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFPL 917
              +DAT+RGGIARFINHSC+PNC ++++++  G+K + I + ++I    EITY+YK P 
Sbjct: 226 QNYLDATQRGGIARFINHSCDPNCRSELVTMSNGRKAVVIISNQYIPPNTEITYDYKLPY 285

Query: 918 E--EKKIPCYCGSKKCHGSLN 936
           E  +K I C CGS KC   LN
Sbjct: 286 ESPDKAIKCLCGSDKCRHYLN 306


>gi|218202612|gb|EEC85039.1| hypothetical protein OsI_32352 [Oryza sativa Indica Group]
          Length = 1741

 Score =  135 bits (341), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 65/146 (44%), Positives = 92/146 (63%), Gaps = 6/146 (4%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSY 852
            K L   +S IH  GL   E I     V++YVGE++   ++D RE  Y+    +    + Y
Sbjct: 1596 KHLVVYKSSIHGLGLYTSEFIPRGSMVVQYVGEIVGQCVADKREIEYQSGKRQQYKSACY 1655

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
             F++   +++DAT++GGIARFINHSC PNC  KVISV  +KK+  +A+RHI  GEEITY+
Sbjct: 1656 FFKIGKEHIIDATRKGGIARFINHSCQPNCVAKVISVRNEKKVVFFAERHINPGEEITYD 1715

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            Y F  E+  ++IPC+C S+ C   LN
Sbjct: 1716 YHFNREDEGQRIPCFCRSRGCRRYLN 1741


>gi|170067214|ref|XP_001868394.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863427|gb|EDS26810.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 284

 Score =  135 bits (341), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 57/99 (57%), Positives = 79/99 (79%)

Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
           VG+++R  ++D+RE +YE +GI S+YLFR+D   ++DATK G +ARF+NHSCNPNCYTKV
Sbjct: 186 VGQVVRPSVTDLRETKYEAIGISSTYLFRIDMETIIDATKCGNLARFLNHSCNPNCYTKV 245

Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCY 925
           I++E +KKI IY+K+ I   EEITY+YK PLE++ I  Y
Sbjct: 246 ITIESEKKIVIYSKQPIGVNEEITYDYKVPLEDENISFY 284


>gi|357617692|gb|EHJ70932.1| hypothetical protein KGM_14792 [Danaus plexippus]
          Length = 147

 Score =  135 bits (341), Expect = 8e-29,   Method: Composition-based stats.
 Identities = 68/143 (47%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           +  +   RSKI   GL A   +E    VIEY+GE+IR ++S+IRE +YE    G  Y+FR
Sbjct: 6   RNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSELSEIREKKYEAKNRGI-YMFR 64

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           LD+  VVDAT  GG+AR+INHSC PNC  + + V+   +I I+AKR IA GEE+ Y+YKF
Sbjct: 65  LDERRVVDATLCGGLARYINHSCQPNCVAETVEVDRHLRIIIFAKRRIARGEELAYDYKF 124

Query: 916 PLEEK--KIPCYCGSKKCHGSLN 936
            +E+   KI C CG+  C   +N
Sbjct: 125 DIEDDAHKIMCMCGAPNCRKWMN 147


>gi|356507582|ref|XP_003522543.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
          Length = 1035

 Score =  135 bits (340), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 94/258 (36%), Positives = 132/258 (51%), Gaps = 20/258 (7%)

Query: 683  TAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKWSLNASPAERARVRGA 742
            T K+  SR ++  + K   S   N        AR  I     H     A  A   RVRG 
Sbjct: 794  TKKKTGSRLISSSRKKQDDSPVDNTEHEPFSAARCRIFQRTNHTKKRAADEAVSHRVRGP 853

Query: 743  QYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ 802
             Y H       ++A +       S  T+R  +    A +   E L   Q +   +R+ F 
Sbjct: 854  -YHHP------LDAIE-------SLNTHRQVVHEPQAFSSFRERLYHLQ-RTENERVCFG 898

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS IH+WGL A   I+  D V+EY GE +R  I+D+RE RY   G    YLF++ +  VV
Sbjct: 899  RSGIHEWGLFARRNIQEGDMVLEYRGEQVRRSIADLREARYRLEG-KDCYLFKISEEVVV 957

Query: 863  DATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF---PLE 918
            DAT +G IAR INHSC PNCY +++SV + + +I + AK ++AAG+E+TY+Y F     E
Sbjct: 958  DATDKGNIARLINHSCMPNCYARIMSVGDEESRIVLIAKTNVAAGDELTYDYLFDPDEPE 1017

Query: 919  EKKIPCYCGSKKCHGSLN 936
            E K+PC C +  C   +N
Sbjct: 1018 ENKVPCLCKAPNCRKFMN 1035


>gi|66359990|ref|XP_627173.1| multidomain chromatinic protein with the following architecture: 3x
            PHD-bromo-3xPHD-SET domain and associated cysteine
            cluster at the C-terminus [Cryptosporidium parvum Iowa
            II]
 gi|46228588|gb|EAK89458.1| multidomain chromatinic protein with the following architecture: 3x
            PHD-bromo-3xPHD-SET domain and associated cysteine
            cluster at the C-terminus [Cryptosporidium parvum Iowa
            II]
          Length = 2244

 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 72/145 (49%), Positives = 98/145 (67%), Gaps = 5/145 (3%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG--IGSSYLF 854
            KRL  ++S IH +GL A E I+  + +IEYVGELIR  ++D RE  Y+  G   GS Y+F
Sbjct: 2100 KRLNIKKSSIHGFGLFAKELIKTGEPIIEYVGELIRNSVADKRESLYKSNGNRDGSCYMF 2159

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK-IFIYAKRHIAAGEEITYNY 913
            RLD+  V+DAT  G  ARF+NH C+PN   KVIS++ Q K I I++K+ I   EEITY+Y
Sbjct: 2160 RLDESSVIDATNIGNHARFMNHCCDPNSICKVISIDSQNKHIVIFSKKTINKDEEITYDY 2219

Query: 914  KFPLEE--KKIPCYCGSKKCHGSLN 936
            +F +EE  +KI C+CG+  C G +N
Sbjct: 2220 QFNVEEASEKIICHCGASNCLGRMN 2244


>gi|222642073|gb|EEE70205.1| hypothetical protein OsJ_30300 [Oryza sativa Japonica Group]
          Length = 1792

 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 92/146 (63%), Gaps = 6/146 (4%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSY 852
            K L   +S IH  GL   E I     V++YVGE++   ++D RE  Y+    +    + Y
Sbjct: 1647 KHLVVYKSSIHGLGLYTSEFIPRGSMVVQYVGEIVGQCVADKREIEYQSGKRQQYKSACY 1706

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
             F++   +++DAT++GGIARFINHSC PNC  K+ISV  +KK+  +A+RHI  GEEITY+
Sbjct: 1707 FFKIGKEHIIDATRKGGIARFINHSCQPNCVAKIISVRNEKKVVFFAERHINPGEEITYD 1766

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            Y F  E+  ++IPC+C S+ C   LN
Sbjct: 1767 YHFNREDEGQRIPCFCRSRGCRRYLN 1792


>gi|260794338|ref|XP_002592166.1| hypothetical protein BRAFLDRAFT_125138 [Branchiostoma floridae]
 gi|229277381|gb|EEN48177.1| hypothetical protein BRAFLDRAFT_125138 [Branchiostoma floridae]
          Length = 1891

 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 99/155 (63%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K++Q +  K+  R      RS+I   GL A + I+    VIEY+G +IR ++ + RE  Y
Sbjct: 1738 KSAQYRKLKQEWRNNVVLGRSRIQGLGLFAAKDIDKHVMVIEYIGVIIRNEVCNKREHIY 1797

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E+   G  Y+FR+D   V+DAT  GG AR+INHSCNPNC  +V++ E ++KI I + R +
Sbjct: 1798 EEQNRGV-YMFRIDSDLVIDATLAGGPARYINHSCNPNCVAEVVNFEKEQKIIIISSRRL 1856

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            + GEE+TY+YKF +E  E+KIPC CG+  C   +N
Sbjct: 1857 SKGEELTYDYKFDIEDDEQKIPCCCGAPNCRKWMN 1891


>gi|427785301|gb|JAA58102.1| Putative histone-lysine n-methyltransferase mll3 [Rhipicephalus
            pulchellus]
          Length = 2867

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 93/155 (60%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ +  K   R      RS I   GL A   IE    +IEY+G+LIR +I++  E  Y
Sbjct: 2714 KSSQYRRMKTEWRNNVYLARSGIQGLGLYAARDIECHTMIIEYIGQLIRNEIAERNEAIY 2773

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  V+DAT  GG+AR+INHSC+PNC  +++ ++ + KI I A R I
Sbjct: 2774 EAQNRGV-YMFRLDENRVIDATLSGGLARYINHSCSPNCVAELVQIDRENKILIIANRRI 2832

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE+TY+YKF  E+   KIPC C +  C   +N
Sbjct: 2833 TRGEELTYDYKFDYEDDGHKIPCLCSASNCRKWMN 2867


>gi|145518147|ref|XP_001444951.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412384|emb|CAK77554.1| unnamed protein product [Paramecium tetraurelia]
          Length = 841

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 96/153 (62%), Gaps = 6/153 (3%)

Query: 786 LLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK 845
           L + S+ + +K +L    S IH +GL   +  +  DFVIEY GE+IR  ++D RE  Y +
Sbjct: 693 LKQESKNQKQKNKLYVAPSHIHKYGLFTKQDFKKGDFVIEYTGEVIRNALADYRELTYNE 752

Query: 846 MGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAA 905
            G G  Y+FR     V+DAT +G  ARF+NHSC PNC + ++     +KI IYA++ I+ 
Sbjct: 753 QGFGDCYMFRASKTKVIDATFKGSEARFLNHSCQPNCDSLLLD----EKILIYARKDISV 808

Query: 906 GEEITYNYKFPL--EEKKIPCYCGSKKCHGSLN 936
           GEE+TY+Y+F +  E +KI C CG+K C G LN
Sbjct: 809 GEELTYDYQFEIEAESQKIQCSCGAKNCIGRLN 841


>gi|242045554|ref|XP_002460648.1| hypothetical protein SORBIDRAFT_02g032470 [Sorghum bicolor]
 gi|241924025|gb|EER97169.1| hypothetical protein SORBIDRAFT_02g032470 [Sorghum bicolor]
          Length = 1658

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 63/144 (43%), Positives = 92/144 (63%), Gaps = 6/144 (4%)

Query: 799  LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSYLF 854
            L   +S IH  GL     I     V+EYVGE++  +++D RE  Y+    +    + Y F
Sbjct: 1515 LVVYKSGIHGLGLYTSVFIPRGSMVVEYVGEIVGQRVADKREIEYQSGKRQQYKSACYFF 1574

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            ++D  ++VDAT++GGIARF+NHSC PNC  K+IS+  +KK+  +A+RHI  GEEITY+Y 
Sbjct: 1575 KIDREHIVDATRKGGIARFVNHSCQPNCVAKIISIRNEKKVMFFAERHINPGEEITYDYH 1634

Query: 915  FPLEE--KKIPCYCGSKKCHGSLN 936
            F  E+  ++IPC+C S+ C   LN
Sbjct: 1635 FNREDEGQRIPCFCRSRYCRRYLN 1658


>gi|68071179|ref|XP_677503.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497646|emb|CAH98690.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 1325

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 70/142 (49%), Positives = 94/142 (66%), Gaps = 4/142 (2%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            RL  ++S IH +GL A E I   + VIEY+GE IR  ISD RE  YEK+   S Y+FRL+
Sbjct: 1185 RLYVKKSSIHGYGLYAREFINEGEPVIEYIGEYIRNIISDKRETYYEKIE-SSCYMFRLN 1243

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
            +  ++DATK G  +RFINHSC PNC+ K++S +   K I I+AK+ I   EEITY+Y+F 
Sbjct: 1244 ENIIIDATKWGNASRFINHSCEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFG 1303

Query: 917  LEE--KKIPCYCGSKKCHGSLN 936
            +E   +K+ C CGS  C G +N
Sbjct: 1304 VESEGEKLICLCGSNTCLGRMN 1325


>gi|297803296|ref|XP_002869532.1| hypothetical protein ARALYDRAFT_913734 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297315368|gb|EFH45791.1| hypothetical protein ARALYDRAFT_913734 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1024

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 93/143 (65%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A   I+  + V+EY GE +R  I+D+RE RY ++G    YLF++ 
Sbjct: 883  RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREARYRRVG-KDCYLFKIS 941

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF- 915
            +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++A GEE+TY+Y F 
Sbjct: 942  EEVVVDATDKGNIARLINHSCTPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDYLFD 1001

Query: 916  --PLEEKKIPCYCGSKKCHGSLN 936
                EE K+PC C +  C   +N
Sbjct: 1002 PDEAEELKVPCLCKAPNCRKFMN 1024


>gi|392897209|ref|NP_499819.3| Protein SET-16 [Caenorhabditis elegans]
 gi|316891988|emb|CAB03348.3| Protein SET-16 [Caenorhabditis elegans]
          Length = 2475

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 68/141 (48%), Positives = 92/141 (65%), Gaps = 6/141 (4%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K R+   RS+I   GL A   I   DF+IEY GE+IR ++ ++RE RY     G  Y+FR
Sbjct: 2331 KDRVYLARSRIAGLGLYAKVDISMGDFIIEYKGEIIRSEVCEVREIRYVAQNRGV-YMFR 2389

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVI---SVEGQKKIFIYAKRHIAAGEEITYN 912
            +D+ +V+DAT  GG AR+INHSC+PNC T+++   S   +KKI I A R I+A EE+TY+
Sbjct: 2390 IDEEWVIDATMAGGPARYINHSCDPNCSTQILDAGSGAREKKIIITANRPISANEELTYD 2449

Query: 913  YKFPLE--EKKIPCYCGSKKC 931
            Y+F LE    KIPC CG+  C
Sbjct: 2450 YQFELEGTTDKIPCLCGAPNC 2470


>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
 gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
 gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
          Length = 6753

 Score =  134 bits (338), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 72/142 (50%), Positives = 96/142 (67%), Gaps = 4/142 (2%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            RL  ++S IH +GL   E I   + VIEY+GE IR  ISD RE  Y+K+   S Y+FRL+
Sbjct: 6613 RLYVKKSSIHGYGLYTCEFINEGEPVIEYIGEYIRNIISDKREKYYDKIE-SSCYMFRLN 6671

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
            +  ++DATK G ++RFINHSC PNC+ K++S +   K I I+AKR IAA EEITY+Y+F 
Sbjct: 6672 ENIIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIAAHEEITYDYQFG 6731

Query: 917  LEE--KKIPCYCGSKKCHGSLN 936
            +E   KK+ C CGS  C G +N
Sbjct: 6732 VESEGKKLICLCGSSTCLGRMN 6753


>gi|42567196|ref|NP_194520.3| histone-lysine N-methyltransferase ATX4 [Arabidopsis thaliana]
 gi|229488104|sp|Q9SUE7.3|ATX4_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX4; AltName:
            Full=Protein SET DOMAIN GROUP 16; AltName:
            Full=Trithorax-homolog protein 4; Short=TRX-homolog
            protein 4; Short=Trithorax 4
 gi|332660008|gb|AEE85408.1| histone-lysine N-methyltransferase ATX4 [Arabidopsis thaliana]
          Length = 1027

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 93/143 (65%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A   I+  + V+EY GE +R  I+D+RE RY ++G    YLF++ 
Sbjct: 886  RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREARYRRVG-KDCYLFKIS 944

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF- 915
            +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++A GEE+TY+Y F 
Sbjct: 945  EEVVVDATDKGNIARLINHSCTPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDYLFD 1004

Query: 916  --PLEEKKIPCYCGSKKCHGSLN 936
                EE K+PC C +  C   +N
Sbjct: 1005 PDEAEELKVPCLCKAPNCRKFMN 1027


>gi|218187758|gb|EEC70185.1| hypothetical protein OsI_00917 [Oryza sativa Indica Group]
          Length = 991

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 66/148 (44%), Positives = 95/148 (64%), Gaps = 5/148 (3%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K   KR+   RS IH WGL A + I+    VIEY G+ +R  ++D+RE RY +      Y
Sbjct: 845 KIENKRVSCGRSGIHGWGLFAAKKIQEGQMVIEYRGDQVRRSVADLREARYHREK-KDCY 903

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
           LF++ +  VVDAT++G IAR INHSC PNCY +++SV  +K +I + AKR ++AGEE+TY
Sbjct: 904 LFKISEDVVVDATEKGNIARLINHSCMPNCYARIMSVGDEKSQIILIAKRDVSAGEELTY 963

Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
           +Y F  +E    ++PC C +  C G +N
Sbjct: 964 DYLFDPDESEDCRVPCLCKAPNCRGYMN 991


>gi|256076153|ref|XP_002574378.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
 gi|350644878|emb|CCD60414.1| mixed-lineage leukemia protein, mll, putative [Schistosoma mansoni]
          Length = 1560

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/191 (38%), Positives = 104/191 (54%), Gaps = 18/191 (9%)

Query: 752  PEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLR----FQRSKIH 807
            P  N +  +NG           + N+  A++  +  ++ Q +  K  +       RS+I 
Sbjct: 1382 PSFNGTTCSNG-----------VYNIDTASKQHQSSRSQQYRKLKTEVNSNVILGRSRIQ 1430

Query: 808  DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
              GL A   +E +  VIEYVGELIRL+I++ RE  YE    G  Y+FRLDD  V+DAT  
Sbjct: 1431 GLGLFAARDLEQQTMVIEYVGELIRLEIANKREKNYEAHNRGI-YMFRLDDDTVIDATVC 1489

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK--KIPCY 925
            GG+AR+INHSC PNC  + ++  G   I I   R I  GEE+ Y+Y F LE++  KIPC 
Sbjct: 1490 GGLARYINHSCQPNCLAEFVNFGGHSHIVIITNRRIKKGEELCYDYNFDLEDRSDKIPCL 1549

Query: 926  CGSKKCHGSLN 936
            C +  C   +N
Sbjct: 1550 CRAPNCRKWMN 1560


>gi|357154561|ref|XP_003576824.1| PREDICTED: uncharacterized protein LOC100834435 [Brachypodium
           distachyon]
          Length = 973

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 64/147 (43%), Positives = 92/147 (62%), Gaps = 6/147 (4%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSS 851
           +K L   +S IH  GL   E I     VIEYVGE++  +++D RE  Y     +      
Sbjct: 827 QKHLVVYKSGIHGLGLYTSEFIPRGSMVIEYVGEIVGQRVADKREIEYHSGKRQQYKSVC 886

Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
           Y F++D  +++DAT++GG+ARFINHSC PNC  K+ISV  +KK+  +++R I  GEEITY
Sbjct: 887 YFFKIDREHIIDATQKGGVARFINHSCQPNCVAKIISVRNEKKVVFFSERQINPGEEITY 946

Query: 912 NYKFPLEE--KKIPCYCGSKKCHGSLN 936
           +Y F  E+  ++IPC+C S+ C   LN
Sbjct: 947 DYHFTQEDEGQRIPCFCRSRSCRRYLN 973


>gi|118394814|ref|XP_001029767.1| SET domain containing protein [Tetrahymena thermophila]
 gi|89284034|gb|EAR82104.1| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 2437

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 85/133 (63%)

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            SKIH +GL A    + +D V+EY+GE IR  ++D RE  Y++ G G  Y+F+     ++D
Sbjct: 2305 SKIHKYGLFAKTYFKQDDIVVEYLGETIRQVLADYREKIYKQRGFGDCYMFKACPDKIID 2364

Query: 864  ATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP 923
            AT +G  AR++NHSCNPNC + VI  E   KI IYAKR I  GEE+TY+Y F +EE+KI 
Sbjct: 2365 ATFKGNEARYLNHSCNPNCSSLVIEYEKDSKIIIYAKRDIKPGEELTYDYCFDIEEEKIN 2424

Query: 924  CYCGSKKCHGSLN 936
            C C    C   +N
Sbjct: 2425 CNCNDPNCTRIMN 2437


>gi|70950103|ref|XP_744403.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56524343|emb|CAH80265.1| SET-domain protein, putative [Plasmodium chabaudi chabaudi]
          Length = 870

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 69/142 (48%), Positives = 94/142 (66%), Gaps = 4/142 (2%)

Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
           RL  ++S IH +GL   E I   + VIEY+GE IR  ISD RE  YEK+   S Y+FRL+
Sbjct: 730 RLYVKKSSIHGYGLYTREFINEGEPVIEYIGEYIRNIISDKREKYYEKIE-SSCYMFRLN 788

Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFP 916
           +  ++DATK G ++RFINHSC PNC+ K++S +   K I I+AK+ I   EEITY+Y+F 
Sbjct: 789 ENIIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFG 848

Query: 917 LEE--KKIPCYCGSKKCHGSLN 936
           +E   +K+ C CGS  C G +N
Sbjct: 849 VESEGEKLICLCGSSTCLGRMN 870


>gi|302144034|emb|CBI23139.3| unnamed protein product [Vitis vinifera]
          Length = 1018

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 93/143 (65%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A + I+  D V+EY GE +R  I+D+RE RY   G    YLF++ 
Sbjct: 877  RVCFGRSGIHGWGLFARQAIQEGDMVLEYRGEQVRRSIADMREVRYRLEG-KDCYLFKIS 935

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
            +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++AAG+E+TY+Y F 
Sbjct: 936  EEVVVDATDKGNIARLINHSCAPNCYARIMSVGDDESRIVLIAKTNVAAGDELTYDYLFD 995

Query: 917  LEEK---KIPCYCGSKKCHGSLN 936
             +E    K+PC C +  C   +N
Sbjct: 996  PDEPDECKVPCLCKAPNCRKFMN 1018


>gi|224005356|ref|XP_002296329.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209586361|gb|ACI65046.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 161

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/140 (47%), Positives = 90/140 (64%), Gaps = 8/140 (5%)

Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
           S IH WGL A  P   +D V EY+GE I   ++D+RE  Y K  I   Y F++D   V+D
Sbjct: 8   SGIHGWGLFADYPFRKDDIVAEYIGEYISDAVADVREKYYRKHRI-QDYQFKVDGSLVID 66

Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQ----KKIFIYAKRHIAAGEEITYNYKFPLE- 918
           AT +GG AR+INHSCNPNC  K++S +      K++ I A+R I A EE++Y+Y+FPLE 
Sbjct: 67  ATLKGGYARYINHSCNPNCIAKIVSGKAPNQHLKRVIIVAQRDIQATEELSYDYQFPLET 126

Query: 919 --EKKIPCYCGSKKCHGSLN 936
             + +IPC CGS+ C G +N
Sbjct: 127 DLDLRIPCNCGSRHCRGFMN 146


>gi|308497100|ref|XP_003110737.1| CRE-SET-16 protein [Caenorhabditis remanei]
 gi|308242617|gb|EFO86569.1| CRE-SET-16 protein [Caenorhabditis remanei]
          Length = 2509

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/141 (46%), Positives = 92/141 (65%), Gaps = 6/141 (4%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K R+   RS+I   GL A   I   +F+IEY GE+IR ++ ++RE RY     G  Y+FR
Sbjct: 2365 KDRVYLARSRIAGLGLYAKNDISMGEFIIEYKGEIIRAEVCEVREIRYTAQNRGV-YMFR 2423

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVI---SVEGQKKIFIYAKRHIAAGEEITYN 912
            +D+ +V+DAT  GG AR+INHSC+PNC T+++   S   +KKI I A R I++ EE+TY+
Sbjct: 2424 IDEEWVIDATMAGGPARYINHSCDPNCSTQILDAGSSGREKKIIITANRPISSNEELTYD 2483

Query: 913  YKFPLE--EKKIPCYCGSKKC 931
            Y+F LE    KIPC CG+  C
Sbjct: 2484 YQFELEGTTDKIPCLCGAPNC 2504


>gi|359490859|ref|XP_002268525.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
            vinifera]
          Length = 1094

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 66/148 (44%), Positives = 94/148 (63%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            +    R+ F RS IH WGL A + I+  D V+EY GE +R  I+D+RE RY   G    Y
Sbjct: 948  RTENDRVCFGRSGIHGWGLFARQAIQEGDMVLEYRGEQVRRSIADMREVRYRLEG-KDCY 1006

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++AAG+E+TY
Sbjct: 1007 LFKISEEVVVDATDKGNIARLINHSCAPNCYARIMSVGDDESRIVLIAKTNVAAGDELTY 1066

Query: 912  NYKFPLEEK---KIPCYCGSKKCHGSLN 936
            +Y F  +E    K+PC C +  C   +N
Sbjct: 1067 DYLFDPDEPDECKVPCLCKAPNCRKFMN 1094


>gi|115435312|ref|NP_001042414.1| Os01g0218800 [Oryza sativa Japonica Group]
 gi|56784088|dbj|BAD81417.1| putative trithorax 3 [Oryza sativa Japonica Group]
 gi|113531945|dbj|BAF04328.1| Os01g0218800 [Oryza sativa Japonica Group]
          Length = 991

 Score =  134 bits (336), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 66/148 (44%), Positives = 95/148 (64%), Gaps = 5/148 (3%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K   KR+   RS IH WGL A + I+    VIEY G+ +R  ++D+RE RY +      Y
Sbjct: 845 KIENKRVSCGRSGIHGWGLFAAKKIQEGQMVIEYRGDQVRRSVADLREARYHREK-KDCY 903

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
           LF++ +  VVDAT++G IAR INHSC PNCY +++SV  +K +I + AKR ++AGEE+TY
Sbjct: 904 LFKISEDVVVDATEKGNIARLINHSCMPNCYARIMSVGDEKSQIILIAKRDVSAGEELTY 963

Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
           +Y F  +E    ++PC C +  C G +N
Sbjct: 964 DYLFDPDESEDCRVPCLCKALNCRGYMN 991


>gi|313247613|emb|CBY15785.1| unnamed protein product [Oikopleura dioica]
          Length = 386

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 69/147 (46%), Positives = 93/147 (63%), Gaps = 3/147 (2%)

Query: 792 VKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSS 851
           V+  +  L+   S IH  GL A  P E  + VIEY G +IR ++ D+RE  Y+  GIG+ 
Sbjct: 241 VETSRNSLQVLPSAIHGRGLFARRPYEPGELVIEYSGTVIRGELCDMREKYYDDRGIGT- 299

Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
           Y+FR+DD  VVDAT  GG ARFINHSC PNC +K+I+V+ +K I I A R I  GEE+TY
Sbjct: 300 YMFRVDDDLVVDATMEGGRARFINHSCGPNCLSKIITVDSRKHICIIAGRFIDFGEELTY 359

Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
           +YKF  +   ++I C C +  C   +N
Sbjct: 360 DYKFDRDFGSERIECGCQAPNCRRFMN 386


>gi|353232161|emb|CCD79516.1| hypothetical protein Smp_070170 [Schistosoma mansoni]
          Length = 1430

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 64/144 (44%), Positives = 91/144 (63%), Gaps = 4/144 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   I    F+IEY+GE+IR ++++ RE  YE    G  Y+FR
Sbjct: 1288 KTNVVLARSRIQGLGLYAARDISKSTFIIEYLGEVIRNEVANRRERLYESQNRGI-YMFR 1346

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +DD ++VDAT  GG+AR+INHSC+PNC  +++  +    I I A ++I  G+E+TY+YKF
Sbjct: 1347 VDDDWIVDATMSGGLARYINHSCDPNCTAEILHCDNSNHIVIIASKNIEKGDELTYDYKF 1406

Query: 916  PLEE---KKIPCYCGSKKCHGSLN 936
             LEE    +IPC CGS  C   +N
Sbjct: 1407 DLEEDRWDRIPCLCGSINCRKWMN 1430


>gi|16118405|gb|AAL12215.1| trithorax 4 [Arabidopsis thaliana]
          Length = 285

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 92/143 (64%), Gaps = 5/143 (3%)

Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
           R+ F RS IH WGL     I+  + V+EY GE +R  I+D+RE RY ++G    YLF++ 
Sbjct: 144 RVCFGRSGIHGWGLFGRRNIQEGEMVLEYRGEQVRGSIADLREARYRRVG-KDCYLFKIS 202

Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF- 915
           +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++A GEE+TY+Y F 
Sbjct: 203 EELVVDATDKGNIARLINHSCTPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDYLFD 262

Query: 916 --PLEEKKIPCYCGSKKCHGSLN 936
               EE K+PC C +  C   +N
Sbjct: 263 PDEAEELKVPCLCKAPNCRKFMN 285


>gi|301116291|ref|XP_002905874.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
            T30-4]
 gi|262109174|gb|EEY67226.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
            T30-4]
          Length = 1659

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 57/118 (48%), Positives = 87/118 (73%), Gaps = 1/118 (0%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
            +R+  ++S IH +GL   EP+     ++EY G++I   I+D RE RYE+ G+GS Y+FRL
Sbjct: 1539 ERMLVRKSSIHGYGLFLKEPVSDGQMIVEYQGQMINQTIADERERRYEEQGVGSCYMFRL 1598

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNY 913
            D+  ++DAT+ G +ARFINHSC+P  + ++++VE G+KKI I+AKR IA G+E+TY+Y
Sbjct: 1599 DEKTIIDATRCGNLARFINHSCDPKAFARIVAVEGGEKKIVIFAKRAIAVGDEVTYDY 1656


>gi|356518627|ref|XP_003527980.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
          Length = 1067

 Score =  133 bits (334), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 91/143 (63%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A   I+  D V+EY GE +R  I+D+RE RY   G    YLF++ 
Sbjct: 926  RVCFGRSGIHGWGLFARRNIQEGDMVLEYRGEQVRRSIADLREARYRLEG-KDCYLFKIS 984

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF- 915
            +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ AG+E+TY+Y F 
Sbjct: 985  EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTNVVAGDELTYDYLFD 1044

Query: 916  --PLEEKKIPCYCGSKKCHGSLN 936
                EE K+PC C +  C   +N
Sbjct: 1045 PDEPEENKVPCLCKAPNCRKYMN 1067


>gi|405970535|gb|EKC35431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
          Length = 1927

 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 64/136 (47%), Positives = 86/136 (63%), Gaps = 3/136 (2%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS+I   GL A   +E    VIEY+G LIR ++++  E  YE+   G  Y+FR+D+  VV
Sbjct: 1793 RSRIQGLGLFAARDLEKHTMVIEYIGYLIRNEVANRTEVVYEEQNRGV-YMFRIDNETVV 1851

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEK-- 920
            DAT  GG AR+INHSCNPNC  +V+  + + KI I   R I  GEE+TY+YKF  E++  
Sbjct: 1852 DATMAGGPARYINHSCNPNCVAEVVPFDKESKIIIITNRRIPRGEELTYDYKFDFEDEQH 1911

Query: 921  KIPCYCGSKKCHGSLN 936
            KIPC CG+  C   +N
Sbjct: 1912 KIPCCCGAPGCRKWMN 1927


>gi|341878859|gb|EGT34794.1| CBN-SET-16 protein [Caenorhabditis brenneri]
          Length = 2498

 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 66/141 (46%), Positives = 91/141 (64%), Gaps = 6/141 (4%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K R+   RS+I   GL A + I   +++IEY GE+IR ++ ++RE RY     G  Y FR
Sbjct: 2354 KDRVLLARSQIAGLGLYAKKAISMGEYIIEYKGEVIRSEVCEVREKRYIAQNRGV-YQFR 2412

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG---QKKIFIYAKRHIAAGEEITYN 912
            +D+ +VVDAT  GG+AR+INHSC+PNC T+++   G    +KI I A R I+  EE+TYN
Sbjct: 2413 IDENWVVDATMSGGVARYINHSCDPNCSTEILPTGGGPSNQKIIIMANRPISEMEELTYN 2472

Query: 913  YKFPLEEK--KIPCYCGSKKC 931
            Y F LE+   KIPC CG+  C
Sbjct: 2473 YNFDLEDPTDKIPCLCGAPNC 2493


>gi|326679526|ref|XP_001919281.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Danio rerio]
          Length = 3915

 Score =  132 bits (333), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 90/143 (62%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 3774 KSNVYLARSRIQGLGLYAARDIEKYTMVIEYIGTIIRSEVANRKEKMYEAQNRGV-YMFR 3832

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D  +V+DAT  GG AR+INHSC PNC T+V+++E   KI I + R I  GEE+ Y+YKF
Sbjct: 3833 IDSEHVIDATITGGPARYINHSCAPNCITEVVALERGHKIIISSNRRIQRGEELCYDYKF 3892

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
             LE  + KIPC+CG+  C   +N
Sbjct: 3893 DLEDDQHKIPCHCGAVNCRKWMN 3915


>gi|414590164|tpg|DAA40735.1| TPA: putative trithorax-like family protein [Zea mays]
          Length = 1591

 Score =  132 bits (332), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 63/146 (43%), Positives = 92/146 (63%), Gaps = 6/146 (4%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSY 852
            K L   +S IH  GL     I     V+EYVGE++  +++D RE  Y+    +    + Y
Sbjct: 1446 KHLVVYKSGIHGLGLYTSVFIPRGSMVVEYVGEIVGQRVADRREIEYQSGKRQQYKSACY 1505

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
             F++D  +++DAT++GGIARF+NHSC PNC  K+ISV  +KK+  +A+RHI  GEEITY+
Sbjct: 1506 FFKIDREHIIDATRKGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYD 1565

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            Y F  E+  ++I C+C S+ C   LN
Sbjct: 1566 YHFNREDEGQRILCFCRSRYCRRYLN 1591


>gi|414590165|tpg|DAA40736.1| TPA: putative trithorax-like family protein [Zea mays]
          Length = 1566

 Score =  132 bits (332), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 63/146 (43%), Positives = 92/146 (63%), Gaps = 6/146 (4%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE----KMGIGSSY 852
            K L   +S IH  GL     I     V+EYVGE++  +++D RE  Y+    +    + Y
Sbjct: 1421 KHLVVYKSGIHGLGLYTSVFIPRGSMVVEYVGEIVGQRVADRREIEYQSGKRQQYKSACY 1480

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
             F++D  +++DAT++GGIARF+NHSC PNC  K+ISV  +KK+  +A+RHI  GEEITY+
Sbjct: 1481 FFKIDREHIIDATRKGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYD 1540

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            Y F  E+  ++I C+C S+ C   LN
Sbjct: 1541 YHFNREDEGQRILCFCRSRYCRRYLN 1566


>gi|393908177|gb|EJD74941.1| F/Y-rich family protein [Loa loa]
          Length = 2288

 Score =  132 bits (332), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 66/146 (45%), Positives = 92/146 (63%), Gaps = 6/146 (4%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    +IEY GE+IR ++ ++RE +YE    G  Y+FR
Sbjct: 2144 KNTVYLARSRIQGLGLYASRDIEMNSMIIEYKGEVIRSEVGEMREKKYEAQNRGV-YMFR 2202

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVIS---VEGQKKIFIYAKRHIAAGEEITYN 912
            +DD  ++DAT  GG AR+INHSC+PNC T+++        KKI I A R I+AGEE+TY+
Sbjct: 2203 IDDEKLIDATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEELTYD 2262

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            Y+F +E+   KIPC CG+  C   +N
Sbjct: 2263 YQFDIEDAADKIPCLCGAPNCQKWMN 2288


>gi|115480613|ref|NP_001063900.1| Os09g0556700 [Oryza sativa Japonica Group]
 gi|113632133|dbj|BAF25814.1| Os09g0556700 [Oryza sativa Japonica Group]
          Length = 424

 Score =  132 bits (332), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 67/158 (42%), Positives = 97/158 (61%), Gaps = 7/158 (4%)

Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
           E +   Q+K   K L   +S IH  GL   E I     V++YVGE++   ++D RE  Y+
Sbjct: 268 EYVHYKQLKG-WKHLVVYKSSIHGLGLYTSEFIPRGSMVVQYVGEIVGQCVADKREIEYQ 326

Query: 845 ----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
               +    + Y F++   +++DAT++GGIARFINHSC PNC  K+ISV  +KK+  +A+
Sbjct: 327 SGKRQQYKSACYFFKIGKEHIIDATRKGGIARFINHSCQPNCVAKIISVRNEKKVVFFAE 386

Query: 901 RHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           RHI  GEEITY+Y F  E+  ++IPC+C S+ C   LN
Sbjct: 387 RHINPGEEITYDYHFNREDEGQRIPCFCRSRGCRRYLN 424


>gi|242051571|ref|XP_002454931.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
 gi|241926906|gb|EES00051.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
          Length = 993

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 64/148 (43%), Positives = 92/148 (62%), Gaps = 5/148 (3%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K    R+   RS IH WGL A   I+    VIEY GE +R  ++D+RE +Y +      Y
Sbjct: 847 KTENLRVSCGRSGIHGWGLFAARNIQEGQMVIEYRGEQVRRCVADLREAQYHREK-KDCY 905

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
           LF++ +  V+DAT +G IAR INHSC PNCY ++++V G + +I + AKR ++AGEE+TY
Sbjct: 906 LFKISEDVVIDATDKGNIARLINHSCMPNCYARIMTVSGDRNQIILIAKRDVSAGEELTY 965

Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
           +Y F  +E    K+PC C +  C G +N
Sbjct: 966 DYLFDPDESEDCKVPCLCKAPNCRGYMN 993


>gi|297792715|ref|XP_002864242.1| hypothetical protein ARALYDRAFT_918421 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297310077|gb|EFH40501.1| hypothetical protein ARALYDRAFT_918421 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1049

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 93/143 (65%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A   I+  + V+EY GE +R  I+D+RE RY + G    YLF++ 
Sbjct: 908  RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRREG-KDCYLFKIS 966

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
            +  VVDAT++G IAR INHSC PNCY +++SV + + +I + AK  +A+GEE+TY+Y F 
Sbjct: 967  EEVVVDATEKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASGEELTYDYLFD 1026

Query: 917  LEEK---KIPCYCGSKKCHGSLN 936
             +E    K+PC C S  C   +N
Sbjct: 1027 PDEPDEFKVPCLCKSPNCRKFMN 1049


>gi|170581736|ref|XP_001895813.1| F/Y-rich N-terminus family protein [Brugia malayi]
 gi|158597106|gb|EDP35332.1| F/Y-rich N-terminus family protein [Brugia malayi]
          Length = 2144

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 66/146 (45%), Positives = 92/146 (63%), Gaps = 6/146 (4%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    +IEY GE+IR ++ ++RE +YE    G  Y+FR
Sbjct: 2000 KNTVYLARSRIQGLGLYASRDIEMNSMIIEYKGEVIRSEVGEMREKKYEAQNRGV-YMFR 2058

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVIS---VEGQKKIFIYAKRHIAAGEEITYN 912
            +DD  ++DAT  GG AR+INHSC+PNC T+++        KKI I A R I+AGEE+TY+
Sbjct: 2059 IDDEKLIDATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEELTYD 2118

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            Y+F +E+   KIPC CG+  C   +N
Sbjct: 2119 YQFDIEDVADKIPCLCGAPNCQKWMN 2144


>gi|402865476|ref|XP_003919660.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3, partial [Papio anubis]
          Length = 3229

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 3088 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3146

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 3147 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 3206

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 3207 DFEDDQHKIPCHCGAVNCRKWMN 3229


>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
            porcellus]
          Length = 4878

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4737 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4795

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4796 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQTGEELCYDYKF 4855

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4856 DFEDDQHKIPCHCGAVNCRKWMN 4878


>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
          Length = 4499

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4358 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4416

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4417 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4476

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4477 DFEDDQHKIPCHCGAVNCRKWMN 4499


>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
          Length = 4906

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4765 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4823

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4824 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4883

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4884 DFEDDQHKIPCHCGAVNCRKWMN 4906


>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
            boliviensis boliviensis]
          Length = 4029

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 3888 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3946

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 3947 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4006

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4007 DFEDDQHKIPCHCGAVNCRKWMN 4029


>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
            garnettii]
          Length = 4945

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4804 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4862

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4863 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4922

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4923 DFEDDQHKIPCHCGAVNCRKWMN 4945


>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
            [Oryctolagus cuniculus]
          Length = 4865

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4724 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4782

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4783 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4842

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4843 DFEDDQHKIPCHCGAVNCRKWMN 4865


>gi|351695440|gb|EHA98358.1| Histone-lysine N-methyltransferase MLL3 [Heterocephalus glaber]
          Length = 4724

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4583 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4641

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4642 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4701

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4702 DFEDDQHKIPCHCGAVNCRKWMN 4724


>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
            guttata]
          Length = 4871

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4730 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4788

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4789 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4848

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4849 DFEDDQHKIPCHCGAVNCRKWMN 4871


>gi|219109809|ref|XP_002176658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411193|gb|EEC51121.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 139

 Score =  131 bits (329), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 64/140 (45%), Positives = 90/140 (64%), Gaps = 8/140 (5%)

Query: 804 SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
           SK+H WGL A +P    D V EY+GE I   ++D RE  Y++  I   Y FRLD+  V+D
Sbjct: 1   SKVHGWGLFADQPFIKGDVVAEYLGEYISYAVADNREKIYQEQRI-QDYQFRLDEKLVID 59

Query: 864 ATKRGGIARFINHSCNPNCYTKVISVEGQ----KKIFIYAKRHIAAGEEITYNYKFPLE- 918
           AT +GG AR+INH+C PNC+ K++S E      K++ I A++ I+  EEI+Y+Y+FPLE 
Sbjct: 60  ATLKGGPARYINHNCTPNCFAKIVSGEPPSPCLKRVIIIAQQEISINEEISYDYQFPLEL 119

Query: 919 --EKKIPCYCGSKKCHGSLN 936
              ++IPC C S  C G +N
Sbjct: 120 DLSERIPCNCQSDACRGFMN 139


>gi|167526642|ref|XP_001747654.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773758|gb|EDQ87394.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1547

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 59/116 (50%), Positives = 81/116 (69%), Gaps = 1/116 (0%)

Query: 799  LRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDD 858
            +R   SKIH  GL AL P++  + +IEY GE IR +++D RE  Y+  GIG  Y+FR+D 
Sbjct: 1432 VRVSHSKIHGIGLFALRPLKPGEMIIEYAGEQIRPELTDKREAYYDSRGIGC-YMFRVDA 1490

Query: 859  GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
              VVDAT  G  ARF+NHSC+PNC +++I  +  K I I+A+R+IA GEE+TY+YK
Sbjct: 1491 NLVVDATLTGNPARFVNHSCDPNCASRIIQTDVGKHIVIFAERNIAVGEELTYDYK 1546


>gi|399217787|emb|CCF74674.1| unnamed protein product [Babesia microti strain RI]
          Length = 602

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 77/183 (42%), Positives = 99/183 (54%), Gaps = 20/183 (10%)

Query: 771 RVKLRNLLAAAEGAELLKASQVKARK--------------KRLRFQRSKIHDWGLVALEP 816
           R K+ NL A      LL     + R               K LR  +S +H +GL A E 
Sbjct: 422 RNKMNNLYAKHRRHNLLHNICSRIRNITPKNGFLSPFPVNKVLRLAKSGVHGFGLFAGED 481

Query: 817 IEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINH 876
           I+  +F+IEY G L+   ++D+RE  YEK    S+YLFR DD  V+DATK G +ARF+NH
Sbjct: 482 IKKGEFIIEYAGVLVTDAMTDVREDAYEKCMGSSTYLFRADDNRVIDATKTGNLARFVNH 541

Query: 877 SCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYKFPLEE---KKIPCYCGSKKCH 932
           SC+PN YT  +  E     + IYA R IA G+EI YNY+  L E    K  CYCGS  C 
Sbjct: 542 SCDPNAYTNTVHDEYNGPHVGIYASRDIAPGQEIFYNYR--LSESVYNKEYCYCGSSNCS 599

Query: 933 GSL 935
           G +
Sbjct: 600 GRM 602


>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
          Length = 4532

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4391 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4449

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4450 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4509

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4510 DFEDDQHKIPCHCGAVNCRKWMN 4532


>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Equus caballus]
          Length = 4910

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4769 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4827

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4828 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4887

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4888 DFEDDQHKIPCHCGAVNCRKWMN 4910


>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan troglodytes]
          Length = 4026

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 3885 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3943

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 3944 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4003

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4004 DFEDDQHKIPCHCGAVNCRKWMN 4026


>gi|224120768|ref|XP_002318412.1| SET domain protein [Populus trichocarpa]
 gi|222859085|gb|EEE96632.1| SET domain protein [Populus trichocarpa]
          Length = 1078

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 67/148 (45%), Positives = 94/148 (63%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K    R+ F RS IH WGL A   I+  + V+EY GE +R  I+D+RE RY   G    Y
Sbjct: 932  KTENDRVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREARYRLEG-KDCY 990

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY
Sbjct: 991  LFKISEEVVVDATDKGNIARLINHSCMPNCYARIMSVGDNESRIVLIAKTNVSAGDELTY 1050

Query: 912  NYKF-PLE--EKKIPCYCGSKKCHGSLN 936
            +Y F P E  E K+PC C +  C   +N
Sbjct: 1051 DYLFDPNEPDEFKVPCLCKAPNCRKYMN 1078


>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
          Length = 4575

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4434 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4492

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4493 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4552

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4553 DFEDDQHKIPCHCGAVNCRKWMN 4575


>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Loxodonta africana]
          Length = 4785

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4644 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4702

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4703 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4762

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4763 DFEDDQHKIPCHCGAVNCRKWMN 4785


>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
          Length = 4025

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 3884 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3942

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 3943 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4002

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4003 DFEDDQHKIPCHCGAVNCRKWMN 4025


>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
          Length = 4912

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4771 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4829

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4830 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4889

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4890 DFEDDQHKIPCHCGAVNCRKWMN 4912


>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
            jacchus]
          Length = 4909

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4768 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4826

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4827 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4886

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4887 DFEDDQHKIPCHCGAVNCRKWMN 4909


>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
            leucogenys]
          Length = 4856

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4715 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4773

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4774 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4833

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4834 DFEDDQHKIPCHCGAVNCRKWMN 4856


>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
            gorilla]
          Length = 4782

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4641 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4699

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4700 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4759

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4760 DFEDDQHKIPCHCGAVNCRKWMN 4782


>gi|358342811|dbj|GAA39455.2| histone-lysine N-methyltransferase MLL3 [Clonorchis sinensis]
          Length = 1443

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/167 (41%), Positives = 99/167 (59%), Gaps = 13/167 (7%)

Query: 781  AEGAELLKASQVKARKKRLRFQ--------RSKIHDWGLVALEPIEAEDFVIEYVGELIR 832
            +  A  L  S+ + + +RLR +        RS+I   GL A   +    F+IEY+GELIR
Sbjct: 1279 STAASFLSVSKSQ-QYRRLRLEWRYNVILARSQIQGLGLYAARNLSKHAFIIEYLGELIR 1337

Query: 833  LKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ 892
             ++ + RE  YE    G  Y+FR+D+  +VDAT  GG+AR+INHSC PNC  +V++ E  
Sbjct: 1338 NEVGNRRERLYELQNRGI-YMFRVDEDSIVDATMCGGLARYINHSCEPNCTAEVLNCENG 1396

Query: 893  KKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
              I I A R I  GEE+TY+YKF +EE+   +IPC CG+  C   +N
Sbjct: 1397 SHIIIIASRDIEKGEELTYDYKFDIEEERGDRIPCLCGAPNCRRWMN 1443


>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3, partial [Pan paniscus]
          Length = 4810

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4669 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4727

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4728 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4787

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4788 DFEDDQHKIPCHCGAVNCRKWMN 4810


>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
            harrisii]
          Length = 4951

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4810 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4868

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4869 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4928

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4929 DFEDDQHKIPCHCGAVNCRKWMN 4951


>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
            norvegicus]
          Length = 4930

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4789 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4847

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4848 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4907

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4908 DFEDDQHKIPCHCGAVNCRKWMN 4930


>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Ailuropoda melanoleuca]
          Length = 4927

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4786 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4844

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4845 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4904

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4905 DFEDDQHKIPCHCGAVNCRKWMN 4927


>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
          Length = 4916

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4775 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4833

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4834 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4893

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4894 DFEDDQHKIPCHCGAVNCRKWMN 4916


>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
            norvegicus]
          Length = 4931

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4790 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4848

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4849 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4908

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4909 DFEDDQHKIPCHCGAVNCRKWMN 4931


>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3 [Canis lupus familiaris]
          Length = 4874

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4733 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4791

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4792 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4851

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4852 DFEDDQHKIPCHCGAVNCRKWMN 4874


>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
            homolog
          Length = 4903

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4762 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4820

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4821 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4880

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4881 DFEDDQHKIPCHCGAVNCRKWMN 4903


>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
          Length = 4911

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4770 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4828

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4829 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4888

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4889 DFEDDQHKIPCHCGAVNCRKWMN 4911


>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
 gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
            Full=Homologous to ALR protein; AltName: Full=Lysine
            N-methyltransferase 2C; Short=KMT2C; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
          Length = 4911

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4770 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4828

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4829 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4888

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4889 DFEDDQHKIPCHCGAVNCRKWMN 4911


>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
          Length = 4904

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4763 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4821

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4822 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4881

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4882 DFEDDQHKIPCHCGAVNCRKWMN 4904


>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
          Length = 2526

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/221 (37%), Positives = 121/221 (54%), Gaps = 19/221 (8%)

Query: 719  ISGWEWHKWSLNASPAERARVRGAQYVHTKYLGPEVNASQWANGKGLSARTNRVKLRNLL 778
            I   +  + S + + A   R RG +  +T+    E  A      + LSA ++RV+     
Sbjct: 2312 IETLQSRQASTSLTSANSGRTRGVRSYYTE----EAAARARGFPQDLSAISHRVE----- 2362

Query: 779  AAAEGAELLKASQVKAR---KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
              A+ A     +  K R   K+ +   RS+I   GL A   I   +++IEY GE+IR ++
Sbjct: 2363 -PAQSATAQYTAYQKMRREWKELVYLARSRIAGLGLYAKTDIPMGEYIIEYKGEIIRSEL 2421

Query: 836  SDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI---SVEGQ 892
             ++RE RY     G  Y+FRLD+ +V+DAT  GG AR++NHSC+PNC T +    S    
Sbjct: 2422 CEVREKRYNAQNRGV-YMFRLDEEWVIDATMSGGPARYVNHSCDPNCSTMLFDSNSGARD 2480

Query: 893  KKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKC 931
            KKI I A R I+A EE+TY+Y+F LE+   K+PC CG+  C
Sbjct: 2481 KKILITANRPISANEELTYDYQFELEDATDKVPCLCGAPNC 2521


>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
            mulatta]
          Length = 4785

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4644 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4702

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4703 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4762

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4763 DFEDDQHKIPCHCGAVNCRKWMN 4785


>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
          Length = 4911

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4770 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4828

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4829 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4888

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4889 DFEDDQHKIPCHCGAVNCRKWMN 4911


>gi|312089370|ref|XP_003146221.1| histone methyltransferase [Loa loa]
          Length = 689

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 67/149 (44%), Positives = 93/149 (62%), Gaps = 6/149 (4%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K  K  +   RS+I   GL A   IE    +IEY GE+IR ++ ++RE +YE    G  Y
Sbjct: 542 KEWKNTVYLARSRIQGLGLYASRDIEMNSMIIEYKGEVIRSEVGEMREKKYEAQNRGV-Y 600

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS---VEGQKKIFIYAKRHIAAGEEI 909
           +FR+DD  ++DAT  GG AR+INHSC+PNC T+++        KKI I A R I+AGEE+
Sbjct: 601 MFRIDDEKLIDATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEEL 660

Query: 910 TYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           TY+Y+F +E+   KIPC CG+  C   +N
Sbjct: 661 TYDYQFDIEDAADKIPCLCGAPNCQKWMN 689


>gi|414884806|tpg|DAA60820.1| TPA: putative trithorax-like family protein isoform 1 [Zea mays]
 gi|414884807|tpg|DAA60821.1| TPA: putative trithorax-like family protein isoform 2 [Zea mays]
          Length = 982

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/152 (43%), Positives = 98/152 (64%), Gaps = 5/152 (3%)

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           GLS    +  + N+ +  E   ++KA+     +KRL F +SKIH +G+ A    +A D +
Sbjct: 832 GLSDAVQQEAIGNVSSMVEKYTIVKAT----FRKRLTFGKSKIHGYGVFAKVAHKAGDMM 887

Query: 824 IEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           +EY+GE++R  ISD RE R Y  +    +Y+FR+DD  V+DAT+ G IAR INHSC PNC
Sbjct: 888 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSCEPNC 947

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
           Y++ I++ G + I I+AKR I   EE+TY+Y+
Sbjct: 948 YSRAITILGDEHIIIFAKRDIDPWEELTYDYR 979


>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Ornithorhynchus anatinus]
          Length = 4910

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4769 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4827

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4828 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4887

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4888 DFEDDQHKIPCHCGAVNCRKWMN 4910


>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
            (Silurana) tropicalis]
          Length = 5215

 Score =  130 bits (328), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 62/143 (43%), Positives = 89/143 (62%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 5074 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 5132

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   +I I + R I  GEE++Y+YKF
Sbjct: 5133 IDNEHVIDATLTGGPARYINHSCAPNCVAEVVTFEKGHRIIISSNRRIQKGEELSYDYKF 5192

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5193 DFEDDQHKIPCHCGAVNCRKWMN 5215


>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
            domestica]
          Length = 4862

 Score =  130 bits (328), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4721 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4779

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4780 IDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4839

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4840 DFEDDQHKIPCHCGAVNCRKWMN 4862


>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
          Length = 4780

 Score =  130 bits (328), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4639 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4697

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4698 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4757

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4758 DFEDDQHKIPCHCGAVNCRKWMN 4780


>gi|308801407|ref|XP_003078017.1| trithorax-like (ISS) [Ostreococcus tauri]
 gi|116056468|emb|CAL52757.1| trithorax-like (ISS) [Ostreococcus tauri]
          Length = 2007

 Score =  130 bits (328), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 60/126 (47%), Positives = 83/126 (65%), Gaps = 1/126 (0%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            RL F +S IH WGL+A    +A   V E+ GE  R  ++D+RE  YE+ G+   YL + D
Sbjct: 1603 RLTFCKSNIHGWGLLAKTAHKAGSIVTEFKGETCRSTVADMRETAYEEEGV-DCYLLKQD 1661

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL 917
            D  VVD T +G +ARF NHSCNPN Y+K++ V+G+  I  +A+  I AGEE+TYNY+F  
Sbjct: 1662 DDTVVDCTFQGNLARFTNHSCNPNMYSKIVKVDGENHIIFFARNDIKAGEELTYNYRFES 1721

Query: 918  EEKKIP 923
            E+ K+P
Sbjct: 1722 EDGKVP 1727


>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
            griseus]
          Length = 4871

 Score =  130 bits (328), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4730 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4788

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4789 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4848

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4849 DFEDDQHKIPCHCGAVNCRKWMN 4871


>gi|326431279|gb|EGD76849.1| hypothetical protein PTSG_12693 [Salpingoeca sp. ATCC 50818]
          Length = 3557

 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 68/170 (40%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 770  NRVKLRNLLAAAEGAELLKASQVKAR------KKRLRFQRSKIHDWGLVALEPIEAEDFV 823
            +R+K R     A   E L + +++AR      K +LR  RS+I   GL A +  +A + V
Sbjct: 3377 HRIKPRRRPTRAYTVERLPSERIEARALAKMRKSKLRVGRSRISGHGLYAEQDFKAGECV 3436

Query: 824  IEYVGELIRLKISDIRECRYEKMGIGSSYLFRL-DDGYVVDATKRGGIARFINHSCNPNC 882
             EY G   R  ++D+RE  Y   G+G  Y+F + ++  V+DAT +G   RFINHSC+PNC
Sbjct: 3437 AEYAGVRFRTTLADVREGEYRDRGLGC-YMFSVGNNTVVIDATLQGNWTRFINHSCDPNC 3495

Query: 883  YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL-EEKKIPCYCGSKKC 931
             ++ I +EGQ  + I+A+R I AGEE+TY+YK  + ++ K PC CG+  C
Sbjct: 3496 CSEEIDIEGQPYVVIFARRDIRAGEELTYDYKHQMDDDDKAPCLCGAIDC 3545


>gi|427785297|gb|JAA58100.1| Putative histone-lysine n-methyltransferase mll3 [Rhipicephalus
            pulchellus]
          Length = 3936

 Score =  130 bits (327), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 68/155 (43%), Positives = 93/155 (60%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ +  K   R      RS I   GL A   IE    +IEY+G+LIR +I++  E  Y
Sbjct: 3783 KSSQYRRMKTEWRNNVYLARSGIQGLGLYAARDIECHTMIIEYIGQLIRNEIAERNEAIY 3842

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FRLD+  V+DAT  GG+AR+INHSC+PNC  +++ ++ + KI I A R I
Sbjct: 3843 EAQNRGV-YMFRLDENRVIDATLSGGLARYINHSCSPNCVAELVQIDRENKILIIANRRI 3901

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
              GEE+TY+YKF  E+   KIPC C +  C   +N
Sbjct: 3902 TRGEELTYDYKFDYEDDGHKIPCLCSASNCRKWMN 3936


>gi|256084014|ref|XP_002578229.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
          Length = 399

 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 64/144 (44%), Positives = 91/144 (63%), Gaps = 4/144 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K  +   RS+I   GL A   I    F+IEY+GE+IR ++++ RE  YE    G  Y+FR
Sbjct: 257 KTNVVLARSRIQGLGLYAARDISKSTFIIEYLGEVIRNEVANRRERLYESQNRGI-YMFR 315

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +DD ++VDAT  GG+AR+INHSC+PNC  +++  +    I I A ++I  G+E+TY+YKF
Sbjct: 316 VDDDWIVDATMSGGLARYINHSCDPNCTAEILHCDNSNHIVIIASKNIEKGDELTYDYKF 375

Query: 916 PLEE---KKIPCYCGSKKCHGSLN 936
            LEE    +IPC CGS  C   +N
Sbjct: 376 DLEEDRWDRIPCLCGSINCRKWMN 399


>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
            [Strongylocentrotus purpuratus]
          Length = 4860

 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/144 (45%), Positives = 89/144 (61%), Gaps = 4/144 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G LIR ++++  E  YE    G  Y+FR
Sbjct: 4718 KTNVYLARSQIQGLGLYAAHDIEKHTMVIEYIGTLIRNEVANKWERDYEAANRGV-YMFR 4776

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYK 914
            +DD  VVDAT+ G  AR+INHSCNPNC  +V++ +  QKKI I + R +  GEE+TY+YK
Sbjct: 4777 IDDYTVVDATRSGNPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEELTYDYK 4836

Query: 915  FPLE--EKKIPCYCGSKKCHGSLN 936
            F +E  + KIPC C +  C   +N
Sbjct: 4837 FEIENDQNKIPCLCKAPNCRKWMN 4860


>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
            [Strongylocentrotus purpuratus]
          Length = 4873

 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/144 (45%), Positives = 89/144 (61%), Gaps = 4/144 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G LIR ++++  E  YE    G  Y+FR
Sbjct: 4731 KTNVYLARSQIQGLGLYAAHDIEKHTMVIEYIGTLIRNEVANKWERDYEAANRGV-YMFR 4789

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYK 914
            +DD  VVDAT+ G  AR+INHSCNPNC  +V++ +  QKKI I + R +  GEE+TY+YK
Sbjct: 4790 IDDYTVVDATRSGNPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEELTYDYK 4849

Query: 915  FPLE--EKKIPCYCGSKKCHGSLN 936
            F +E  + KIPC C +  C   +N
Sbjct: 4850 FEIENDQNKIPCLCKAPNCRKWMN 4873


>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
            [Strongylocentrotus purpuratus]
          Length = 4856

 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/144 (45%), Positives = 89/144 (61%), Gaps = 4/144 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G LIR ++++  E  YE    G  Y+FR
Sbjct: 4714 KTNVYLARSQIQGLGLYAAHDIEKHTMVIEYIGTLIRNEVANKWERDYEAANRGV-YMFR 4772

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYK 914
            +DD  VVDAT+ G  AR+INHSCNPNC  +V++ +  QKKI I + R +  GEE+TY+YK
Sbjct: 4773 IDDYTVVDATRSGNPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEELTYDYK 4832

Query: 915  FPLE--EKKIPCYCGSKKCHGSLN 936
            F +E  + KIPC C +  C   +N
Sbjct: 4833 FEIENDQNKIPCLCKAPNCRKWMN 4856


>gi|255544976|ref|XP_002513549.1| trithorax, putative [Ricinus communis]
 gi|223547457|gb|EEF48952.1| trithorax, putative [Ricinus communis]
          Length = 1018

 Score =  130 bits (327), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 92/143 (64%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A   I+  + V+EY GE +R  I+D+RE RY   G    YLF++ 
Sbjct: 877  RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRSIADLREARYRLEG-KDCYLFKIS 935

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
            +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY+Y F 
Sbjct: 936  EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTNVSAGDELTYDYLFD 995

Query: 917  LEEK---KIPCYCGSKKCHGSLN 936
             +E    K+PC C +  C   +N
Sbjct: 996  PDEPDEFKVPCLCKAPNCRQFMN 1018


>gi|47219426|emb|CAG10790.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1776

 Score =  130 bits (327), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 69/146 (47%), Positives = 90/146 (61%), Gaps = 17/146 (11%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K  K+ +   RS IH  GL     IEA + VIEY G +IR  ++D R+  Y+  GIG  Y
Sbjct: 1646 KISKEAVGVYRSDIHGRGLFCKRNIEAGEMVIEYAGTVIRAVLTDKRQKFYDSKGIGC-Y 1704

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            +FR+DD  VVDAT +G  ARFINHSC PNCY++VI+              I  GEE+TY+
Sbjct: 1705 MFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVIN--------------IYRGEELTYD 1750

Query: 913  YKFPLE--EKKIPCYCGSKKCHGSLN 936
            YKFP+E  E K+ C CG+++C G LN
Sbjct: 1751 YKFPIEDDESKLHCNCGTRRCRGFLN 1776


>gi|356570970|ref|XP_003553655.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 985

 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 62/147 (42%), Positives = 93/147 (63%), Gaps = 4/147 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K   +++   +S IH WGL A   +E  + V+EY GE +R  I+D+RE +Y   G    Y
Sbjct: 840 KTENQKVCLGKSGIHGWGLFARRDLEEGEMVVEYRGEQLRRSITDLREAQYRSEG-KDCY 898

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
            F++++  V+DAT +G IAR INHSC PNC+ +++    QK +I + AK +++AGEE+TY
Sbjct: 899 FFKINEEVVIDATDKGNIARLINHSCMPNCFARIVPSGDQKNRIVLIAKTNVSAGEELTY 958

Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
           NY F  E  E+K+PC C +  C G +N
Sbjct: 959 NYSFDDERDEEKVPCRCKAPNCSGFMN 985


>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
            mutus]
          Length = 4905

 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 87/143 (60%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4764 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4822

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D  +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4823 MDSDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 4882

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4883 DFEDDQHKIPCHCGAVNCRKWMN 4905


>gi|449455758|ref|XP_004145618.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Cucumis
            sativus]
          Length = 1073

 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 91/143 (63%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE RY   G    YLF++ 
Sbjct: 932  RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRTVADLREARYRLAG-KDCYLFKIS 990

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
            +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ AGEE+TY+Y F 
Sbjct: 991  EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKANVPAGEELTYDYLFD 1050

Query: 917  LEEK---KIPCYCGSKKCHGSLN 936
             +E    K+PC C +  C   +N
Sbjct: 1051 PDEPDEFKVPCLCKAPNCRKFMN 1073


>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
          Length = 4884

 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 87/143 (60%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4743 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4801

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D  +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 4802 MDSDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 4861

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4862 DFEDDQHKIPCHCGAVNCRKWMN 4884


>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
          Length = 3396

 Score =  130 bits (326), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 3255 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 3313

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 3314 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 3373

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 3374 DFEDDQHKIPCHCGAVNCRKWMN 3396


>gi|226500304|ref|NP_001141695.1| uncharacterized protein LOC100273824 [Zea mays]
 gi|194705578|gb|ACF86873.1| unknown [Zea mays]
          Length = 418

 Score =  130 bits (326), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 66/158 (41%), Positives = 97/158 (61%), Gaps = 7/158 (4%)

Query: 785 ELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE 844
           E +   Q+K  K  L   +S IH  GL     I     V+EYVGE++  +++D RE  Y+
Sbjct: 262 EYIHYKQLKGWK-HLVVYKSGIHGLGLYTSVFIPRGSMVVEYVGEIVGQRVADRREIEYQ 320

Query: 845 ----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
               +    + Y F++D  +++DAT++GGIARF+NHSC PNC  K+ISV  +KK+  +A+
Sbjct: 321 SGKRQQYKSACYFFKIDREHIIDATRKGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAE 380

Query: 901 RHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           RHI  GEEITY+Y F  E+  ++I C+C S+ C   LN
Sbjct: 381 RHINPGEEITYDYHFNREDEGQRILCFCRSRYCRRYLN 418


>gi|5630077|gb|AAD45822.1|AC006017_2 similar to ALR; similar to AAC51735 (PID:g2358287) [Homo sapiens]
          Length = 1813

 Score =  130 bits (326), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 1672 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 1730

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 1731 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 1790

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 1791 DFEDDQHKIPCHCGAVNCRKWMN 1813


>gi|359489550|ref|XP_002278335.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
            vinifera]
          Length = 1073

 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 62/148 (41%), Positives = 95/148 (64%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            +    R+ F +S IH WGL A   I+  + VIEY GE +R  ++D+RE +Y   G    Y
Sbjct: 927  RTENHRVCFGKSGIHGWGLFARRSIQEGEMVIEYRGEQVRRSVADLREAKYRLEG-KDCY 985

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  V+DAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY
Sbjct: 986  LFKISEEVVIDATNKGNIARLINHSCFPNCYARIMSVGDEESRIVLIAKINVSAGDELTY 1045

Query: 912  NYKFPLEEK---KIPCYCGSKKCHGSLN 936
            +Y F  +E+   K+PC CG+  C   +N
Sbjct: 1046 DYLFDPDERDESKVPCLCGAPNCRKFMN 1073


>gi|307105682|gb|EFN53930.1| hypothetical protein CHLNCDRAFT_25365, partial [Chlorella
           variabilis]
          Length = 188

 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 64/149 (42%), Positives = 91/149 (61%), Gaps = 4/149 (2%)

Query: 789 ASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMG 847
           A  V    +R+   +S IH WG  A       D VIEYVGEL+R  +SD+RE R Y+ M 
Sbjct: 10  AECVATLGQRVTIGKSGIHGWGAFAKRRHAEHDMVIEYVGELVRPSVSDLREARCYDDMV 69

Query: 848 IGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGE 907
              +Y+FRL+    VDAT+ G +A  +NHSC+PNC+++ I V     + I+AK+ I   E
Sbjct: 70  GAGTYVFRLNKALCVDATRAGNLAHMLNHSCDPNCFSRTIRV--VDHVIIFAKKDIEVAE 127

Query: 908 EITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           E+TY+Y+F  EE ++PC CG+  C G +N
Sbjct: 128 ELTYDYRFCGEE-QLPCNCGAANCRGRVN 155


>gi|357463899|ref|XP_003602231.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
 gi|355491279|gb|AES72482.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
          Length = 1053

 Score =  129 bits (325), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 92/143 (64%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE RY   G    YLF++ 
Sbjct: 912  RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRSVADLREARYRAEG-KDCYLFKIS 970

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
            +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY+Y F 
Sbjct: 971  EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTNVSAGDELTYDYLFD 1030

Query: 917  LEEK---KIPCYCGSKKCHGSLN 936
             +E    K+PC C +  C   +N
Sbjct: 1031 PDEPDEFKVPCMCKAPNCRKFMN 1053


>gi|449528994|ref|XP_004171486.1| PREDICTED: histone-lysine N-methyltransferase ATX5-like, partial
           [Cucumis sativus]
          Length = 588

 Score =  129 bits (325), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 91/143 (63%), Gaps = 5/143 (3%)

Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
           R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE RY   G    YLF++ 
Sbjct: 447 RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRTVADLREARYRLAG-KDCYLFKIS 505

Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
           +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ AGEE+TY+Y F 
Sbjct: 506 EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKANVPAGEELTYDYLFD 565

Query: 917 LEEK---KIPCYCGSKKCHGSLN 936
            +E    K+PC C +  C   +N
Sbjct: 566 PDEPDEFKVPCLCKAPNCRKFMN 588


>gi|37360418|dbj|BAC98187.1| mKIAA1506 protein [Mus musculus]
          Length = 1520

 Score =  129 bits (325), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 1379 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 1437

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 1438 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 1497

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 1498 DFEDDQHKIPCHCGAVNCRKWMN 1520


>gi|218188776|gb|EEC71203.1| hypothetical protein OsI_03117 [Oryza sativa Indica Group]
          Length = 1012

 Score =  129 bits (325), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 63/148 (42%), Positives = 94/148 (63%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K    R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE +Y   G    Y
Sbjct: 866  KTEHTRVCFGRSGIHRWGLFARRGIQEGEMVLEYRGEQVRRSVADLREEQYRVQG-KDCY 924

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  VVDAT +G +AR INHSC PNCY +++SV   + +I + AK++++AGEE+TY
Sbjct: 925  LFKISEEVVVDATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKNVSAGEELTY 984

Query: 912  NYKFPLEE---KKIPCYCGSKKCHGSLN 936
            +Y F  +E   +K+PC C +  C   +N
Sbjct: 985  DYLFDPDEADDRKVPCLCQTANCRKFMN 1012


>gi|297745446|emb|CBI40526.3| unnamed protein product [Vitis vinifera]
          Length = 1003

 Score =  129 bits (325), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 62/143 (43%), Positives = 94/143 (65%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F +S IH WGL A   I+  + VIEY GE +R  ++D+RE +Y   G    YLF++ 
Sbjct: 862  RVCFGKSGIHGWGLFARRSIQEGEMVIEYRGEQVRRSVADLREAKYRLEG-KDCYLFKIS 920

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
            +  V+DAT +G IAR INHSC PNCY +++SV + + +I + AK +++AG+E+TY+Y F 
Sbjct: 921  EEVVIDATNKGNIARLINHSCFPNCYARIMSVGDEESRIVLIAKINVSAGDELTYDYLFD 980

Query: 917  LEEK---KIPCYCGSKKCHGSLN 936
             +E+   K+PC CG+  C   +N
Sbjct: 981  PDERDESKVPCLCGAPNCRKFMN 1003


>gi|401396171|ref|XP_003879769.1| putative SET domain-containing protein [Neospora caninum Liverpool]
 gi|325114176|emb|CBZ49734.1| putative SET domain-containing protein [Neospora caninum Liverpool]
          Length = 5146

 Score =  129 bits (324), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 116/230 (50%), Gaps = 18/230 (7%)

Query: 722  WEWHKWSLNASPAERARVRGAQYVHTKYLG-----PEVNASQWANGK---GLSARTNRVK 773
            W+    +L A  AER + R     H   LG     PE    Q    +    L ++    +
Sbjct: 4919 WQTADPALLAQAAERMQERNLSAFHGLLLGEDGRLPESAELQLKRNELFFALESQDRLTR 4978

Query: 774  LRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRL 833
              NLL A       +      R K L+  +S++H WG+ A EPI  ++FVIEY   ++  
Sbjct: 4979 TANLLEATAARSGARTPPFVVRTKLLKLGKSRVHGWGVFAAEPIYKDEFVIEYSAVVVSE 5038

Query: 834  KISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK 893
             +++ RE +Y +   GS+YLF+L +  +VDAT+ G + RFINHSC PNC T+ +S     
Sbjct: 5039 AMANFREWQYMQSMGGSTYLFKLRNSTIVDATQSGAVTRFINHSCRPNCQTRDLSGGSDD 5098

Query: 894  K-----IFIYAKRHIAAGEEITYNYKFPLEEKKI---PCYCGSKKCHGSL 935
                  + I+A R IA GEE+ YNY   L E  +    CYCG++ C G +
Sbjct: 5099 DNRHCHVGIFALRDIAIGEELFYNYS--LSEGALGHEACYCGAEGCKGVM 5146


>gi|222618974|gb|EEE55106.1| hypothetical protein OsJ_02868 [Oryza sativa Japonica Group]
          Length = 1032

 Score =  129 bits (324), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 63/148 (42%), Positives = 94/148 (63%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K    R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE +Y   G    Y
Sbjct: 886  KTEHTRVCFGRSGIHRWGLFARRGIQEGEMVLEYRGEQVRRSVADLREEQYRVQG-KDCY 944

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  VVDAT +G +AR INHSC PNCY +++SV   + +I + AK++++AGEE+TY
Sbjct: 945  LFKISEEVVVDATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKNVSAGEELTY 1004

Query: 912  NYKFPLEE---KKIPCYCGSKKCHGSLN 936
            +Y F  +E   +K+PC C +  C   +N
Sbjct: 1005 DYLFDPDEADDRKVPCLCQTANCRKFMN 1032


>gi|30696333|ref|NP_200155.2| histone-lysine N-methyltransferase ATX5 [Arabidopsis thaliana]
 gi|75244456|sp|Q8GZ42.1|ATX5_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX5; AltName:
            Full=Protein SET DOMAIN GROUP 29; AltName:
            Full=Trithorax-homolog protein 5; Short=TRX-homolog
            protein 5
 gi|26449542|dbj|BAC41897.1| putative trithorax 5 TX5 [Arabidopsis thaliana]
 gi|29029074|gb|AAO64916.1| At5g53430 [Arabidopsis thaliana]
 gi|332008970|gb|AED96353.1| histone-lysine N-methyltransferase ATX5 [Arabidopsis thaliana]
          Length = 1043

 Score =  129 bits (324), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 65/144 (45%), Positives = 93/144 (64%), Gaps = 5/144 (3%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
            +R+ F RS IH WGL A   I+  + V+EY GE +R  I+D+RE RY + G    YLF++
Sbjct: 901  ERVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRREG-KDCYLFKI 959

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF 915
             +  VVDAT++G IAR INHSC PNCY +++SV + + +I + AK  +A+ EE+TY+Y F
Sbjct: 960  SEEVVVDATEKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDYLF 1019

Query: 916  PLEEK---KIPCYCGSKKCHGSLN 936
              +E    K+PC C S  C   +N
Sbjct: 1020 DPDEPDEFKVPCLCKSPNCRKFMN 1043


>gi|224132822|ref|XP_002321418.1| SET domain protein [Populus trichocarpa]
 gi|222868414|gb|EEF05545.1| SET domain protein [Populus trichocarpa]
          Length = 1070

 Score =  129 bits (324), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 91/143 (63%), Gaps = 5/143 (3%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F RS IH WGL A   I+  + V+EY GE +R  I+D+RE RY   G    YLF++ 
Sbjct: 929  RVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGSIADLREVRYRLEG-KDCYLFKIS 987

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFP 916
            +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK ++ AG+E+TY+Y F 
Sbjct: 988  EEVVVDATDKGNIARLINHSCMPNCYARIMSVGDNESRIVLIAKTNVPAGDELTYDYLFD 1047

Query: 917  LEEK---KIPCYCGSKKCHGSLN 936
             +E    K+PC C +  C   +N
Sbjct: 1048 PDEPDEFKVPCLCKAPNCRKFMN 1070


>gi|8843772|dbj|BAA97320.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1040

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/144 (45%), Positives = 93/144 (64%), Gaps = 5/144 (3%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
            +R+ F RS IH WGL A   I+  + V+EY GE +R  I+D+RE RY + G    YLF++
Sbjct: 898  ERVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRREG-KDCYLFKI 956

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKF 915
             +  VVDAT++G IAR INHSC PNCY +++SV + + +I + AK  +A+ EE+TY+Y F
Sbjct: 957  SEEVVVDATEKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDYLF 1016

Query: 916  PLEEK---KIPCYCGSKKCHGSLN 936
              +E    K+PC C S  C   +N
Sbjct: 1017 DPDEPDEFKVPCLCKSPNCRKFMN 1040


>gi|358333116|dbj|GAA32467.2| histone-lysine N-methyltransferase MLL3, partial [Clonorchis
            sinensis]
          Length = 1763

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 63/136 (46%), Positives = 84/136 (61%), Gaps = 3/136 (2%)

Query: 803  RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
            RS+I   GL A   +E +  VIEY+GELIRL++++ RE  YE    G  Y+FRL+D  V+
Sbjct: 1629 RSRIQGLGLFAARNLEPQTMVIEYIGELIRLELANKREKDYEAHNRGI-YMFRLNDDTVI 1687

Query: 863  DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
            DAT  GG+AR+INHSC PNC+ + ++      I I   R I  GEE+ Y+Y F LE+   
Sbjct: 1688 DATVCGGLARYINHSCQPNCFAEFLNFGDHSHIVIITNRLIEKGEELCYDYNFDLEDGGS 1747

Query: 921  KIPCYCGSKKCHGSLN 936
            KIPC C S  C   +N
Sbjct: 1748 KIPCLCRSTNCRKWMN 1763


>gi|326519903|dbj|BAK03876.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1025

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 64/148 (43%), Positives = 92/148 (62%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K    R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE +Y   G    Y
Sbjct: 879  KTEHSRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREAQYRVQG-KDCY 937

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  VVDAT +G +AR INHSC PN Y +++SV   Q +I + AKR+++AGEE+TY
Sbjct: 938  LFKISEEVVVDATDKGNVARLINHSCTPNFYARIMSVGHDQSRIVLIAKRNVSAGEELTY 997

Query: 912  NYKFPLEEK---KIPCYCGSKKCHGSLN 936
            +Y F  +E    K+PC C +  C   +N
Sbjct: 998  DYLFDPDEADDCKVPCLCQTADCRKFMN 1025


>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3 [Ovis aries]
          Length = 4922

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 62/143 (43%), Positives = 86/143 (60%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4781 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4839

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D  +V+DAT  GG AR+INHSC PNC  +V++     KI I + R I  GEE+ Y+YKF
Sbjct: 4840 MDSDHVIDATLTGGPARYINHSCAPNCVAEVVAFRAGHKIIISSSRRIQKGEELCYDYKF 4899

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4900 DFEDDQHKIPCHCGAVNCRKWMN 4922


>gi|356573885|ref|XP_003555086.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
          Length = 1003

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 63/148 (42%), Positives = 93/148 (62%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            +   +R+ F RS IH WGL A + I+  + V+EY GE +R  I+D+RE RY   G    Y
Sbjct: 857  RTENERVCFGRSGIHGWGLFARQNIQEGEMVLEYRGEQVRRSIADLREARYRLEG-KDCY 915

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  VVDAT +G IAR INHSC PNCY +++SV + + +I + AK  ++ G+E+TY
Sbjct: 916  LFKISEEVVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTDVSTGDELTY 975

Query: 912  NYKFPLEEK---KIPCYCGSKKCHGSLN 936
            +Y F  +E    K+PC C +  C   +N
Sbjct: 976  DYLFDPDEPDEFKVPCLCKASNCRKFMN 1003


>gi|414875701|tpg|DAA52832.1| TPA: hypothetical protein ZEAMMB73_838843 [Zea mays]
          Length = 971

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 111/193 (57%), Gaps = 11/193 (5%)

Query: 748 KYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIH 807
           + +GP  ++  + +G  L+A  +R   ++     E    L+    K    R+   RS IH
Sbjct: 786 RIMGPRHHSQDFIDG--LNACMDRKDDQSFSTFKERLRYLQ----KTENLRVSCGRSGIH 839

Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            WGL A   I+    VIEY GE +R  ++D+RE +Y +      YLF++ +  V+DAT +
Sbjct: 840 GWGLFAARNIQEGQMVIEYRGEQVRRCVADLREAQYHREK-KDCYLFKISEDVVIDATDK 898

Query: 868 GGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIP 923
           G IAR INHSC PNCY ++++V + + +I + AKR ++AGEE+TY+Y F  +E    ++P
Sbjct: 899 GNIARLINHSCMPNCYARIMTVSDDRSEIILIAKRDVSAGEELTYDYLFDPDESEDCRVP 958

Query: 924 CYCGSKKCHGSLN 936
           C C +  C G +N
Sbjct: 959 CLCKAPNCRGYMN 971


>gi|95007060|emb|CAJ20276.1| SET-domain protein, putative [Toxoplasma gondii RH]
          Length = 4382

 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/264 (32%), Positives = 130/264 (49%), Gaps = 20/264 (7%)

Query: 688  ASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKW--SLNASPAERARVRGAQYV 745
            A+  +  K+  +S    S   P   G + S +   +W     +L +  AER + R     
Sbjct: 4119 ATLNLGKKRDTSSPGENSRFAPPLVGSSSSRLLAAQWQAADPALLSLAAERMQERNLSAF 4178

Query: 746  HTKYLG-----PEVNASQWANGK---GLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
            H   LG     PE    Q    +    L +++   +  NLL A       +A     R K
Sbjct: 4179 HGLLLGEDGRLPENAELQLKRNELFFALESQSRLSRTANLLEATVLRSGARAPSFVVRTK 4238

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
             L+  +S++H WG+ A EPI  ++FVIEY   ++   +++ RE +Y +   GS+YLF+L 
Sbjct: 4239 LLKLGKSRVHGWGVFAAEPIYKDEFVIEYSAVVVSEAMANFREWQYMQSMGGSTYLFKLK 4298

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK-----IFIYAKRHIAAGEEITYN 912
            +  +VDAT+ G + RFINHSC PNC T+ +S           + I+A R IA GEE+ YN
Sbjct: 4299 NSAIVDATQSGAVTRFINHSCRPNCQTRDLSGGSDDDSRHCHVGIFALRDIAIGEELFYN 4358

Query: 913  YKFPLEEKKI---PCYCGSKKCHG 933
            Y   L E  +    CYCG++ C G
Sbjct: 4359 YS--LSEGALGHEACYCGAEGCKG 4380


>gi|297484769|ref|XP_002694621.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Bos taurus]
 gi|296478627|tpg|DAA20742.1| TPA: SET domain containing 1B [Bos taurus]
          Length = 1980

 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 89/208 (42%), Positives = 118/208 (56%), Gaps = 24/208 (11%)

Query: 747  TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
            T   G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +
Sbjct: 1669 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1726

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S IHDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++D
Sbjct: 1727 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1786

Query: 864  ATKRGGIARFINHSCN---------PNCYTKVISVEG---QKKIFIYAKRHIAAGE---- 907
            ATK G  ARFINHSCN         P    + +   G       F    R   A E    
Sbjct: 1787 ATKCGNFARFINHSCNVSVRLGATPPGACREDLEAAGTGASMASFPSMGRGRGAQEGGRS 1846

Query: 908  --EITYNYKFPLEEKKIPCYCGSKKCHG 933
                  +YKFP+E+ KIP   GS+ C G
Sbjct: 1847 XXXXXXDYKFPIEDVKIPA-LGSENCRG 1873


>gi|324499811|gb|ADY39929.1| Histone-lysine N-methyltransferase trr [Ascaris suum]
          Length = 2347

 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/146 (44%), Positives = 92/146 (63%), Gaps = 6/146 (4%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    +IEY GE+IR ++ ++RE +YE    G  Y+FR
Sbjct: 2203 KNNVYLARSRIQGLGLYAKRDIEMNAMIIEYKGEVIRSEVGEMREKKYEAQNRGV-YMFR 2261

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISV---EGQKKIFIYAKRHIAAGEEITYN 912
            +D+  +VDAT  GG AR+INHSC+PNC T++++       KKI I A R I+AGEE+TY+
Sbjct: 2262 IDEERLVDATMAGGPARYINHSCDPNCSTRIVASGPNGDDKKIIIIANRPISAGEELTYD 2321

Query: 913  YKFPLEE--KKIPCYCGSKKCHGSLN 936
            Y+F +E+   KI C CG+  C   +N
Sbjct: 2322 YQFDIEDTADKIACLCGAPNCQKWMN 2347


>gi|358349267|ref|XP_003638660.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
 gi|355504595|gb|AES85798.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
          Length = 1149

 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 62/148 (41%), Positives = 92/148 (62%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K    R+ F +S IH WGL A   ++  D V+EY GE +R  ++D+RE +Y   G    Y
Sbjct: 1003 KTENLRICFGKSGIHGWGLFARRDLQEGDMVVEYRGEQVRRSVADLREAKYRSEG-KDCY 1061

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITY 911
            LF++ +  V+DAT +G IAR INHSC PNCY +++S+  Q  +I + AK +++AG+E+TY
Sbjct: 1062 LFKISEEVVIDATHKGNIARLINHSCMPNCYARILSLGNQGNRIVLIAKTNVSAGQELTY 1121

Query: 912  NYKF---PLEEKKIPCYCGSKKCHGSLN 936
            +Y F     EE K+PC C +  C   +N
Sbjct: 1122 DYLFDPDEPEELKVPCLCRASNCRKFMN 1149


>gi|237844405|ref|XP_002371500.1| SET domain-containing protein [Toxoplasma gondii ME49]
 gi|211969164|gb|EEB04360.1| SET domain-containing protein [Toxoplasma gondii ME49]
 gi|221481217|gb|EEE19618.1| SET domain-containing protein, putative [Toxoplasma gondii GT1]
 gi|221501765|gb|EEE27525.1| SET domain-containing protein, putative [Toxoplasma gondii VEG]
          Length = 4827

 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/264 (32%), Positives = 130/264 (49%), Gaps = 20/264 (7%)

Query: 688  ASRQVAMKKTKASKSRTSNLCPRSDGCARSSISGWEWHKW--SLNASPAERARVRGAQYV 745
            A+  +  K+  +S    S   P   G + S +   +W     +L +  AER + R     
Sbjct: 4564 ATLNLGKKRDTSSPGENSRFAPPLVGSSSSRLLAAQWQAADPALLSLAAERMQERNLSAF 4623

Query: 746  HTKYLG-----PEVNASQWANGK---GLSARTNRVKLRNLLAAAEGAELLKASQVKARKK 797
            H   LG     PE    Q    +    L +++   +  NLL A       +A     R K
Sbjct: 4624 HGLLLGEDGRLPENAELQLKRNELFFALESQSRLSRTANLLEATVLRSGARAPSFVVRTK 4683

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
             L+  +S++H WG+ A EPI  ++FVIEY   ++   +++ RE +Y +   GS+YLF+L 
Sbjct: 4684 LLKLGKSRVHGWGVFAAEPIYKDEFVIEYSAVVVSEAMANFREWQYMQSMGGSTYLFKLK 4743

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKK-----IFIYAKRHIAAGEEITYN 912
            +  +VDAT+ G + RFINHSC PNC T+ +S           + I+A R IA GEE+ YN
Sbjct: 4744 NSAIVDATQSGAVTRFINHSCRPNCQTRDLSGGSDDDSRHCHVGIFALRDIAIGEELFYN 4803

Query: 913  YKFPLEEKKI---PCYCGSKKCHG 933
            Y   L E  +    CYCG++ C G
Sbjct: 4804 YS--LSEGALGHEACYCGAEGCKG 4825


>gi|357135761|ref|XP_003569477.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Brachypodium
            distachyon]
          Length = 1037

 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 64/148 (43%), Positives = 91/148 (61%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K    R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE +Y   G    Y
Sbjct: 891  KTEHSRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREAQYRVQG-KDCY 949

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  VVDAT +G +AR INHSC PN Y +++SV   Q +I + AKR++ AGEE+TY
Sbjct: 950  LFKISEEVVVDATDKGNVARLINHSCTPNFYARIMSVGHDQSRIVLIAKRNVRAGEELTY 1009

Query: 912  NYKFPLEEK---KIPCYCGSKKCHGSLN 936
            +Y F  +E    K+PC C +  C   +N
Sbjct: 1010 DYLFDPDEADDCKVPCLCQTADCRKFMN 1037


>gi|21739477|emb|CAD38780.1| hypothetical protein [Homo sapiens]
          Length = 1033

 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 892  KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 950

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 951  MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKF 1010

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 1011 DFEDDQHKIPCHCGAVNCRKWMN 1033


>gi|242053849|ref|XP_002456070.1| hypothetical protein SORBIDRAFT_03g029850 [Sorghum bicolor]
 gi|241928045|gb|EES01190.1| hypothetical protein SORBIDRAFT_03g029850 [Sorghum bicolor]
          Length = 1051

 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 61/148 (41%), Positives = 93/148 (62%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K    R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE +Y   G    Y
Sbjct: 905  KTEHGRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREEKYRVQG-KDCY 963

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  VVDAT +G +AR INHSC PNCY +++SV   + +I + A++++ AG+E+TY
Sbjct: 964  LFKISEEVVVDATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIARKNVCAGDELTY 1023

Query: 912  NYKF---PLEEKKIPCYCGSKKCHGSLN 936
            +Y F     +E+K+PC C +  C   +N
Sbjct: 1024 DYLFDPDEADERKVPCLCQTANCRKFMN 1051


>gi|293333544|ref|NP_001169542.1| uncharacterized protein LOC100383419 [Zea mays]
 gi|193735372|gb|ACF20188.1| histone methylase [Zea mays]
 gi|224030013|gb|ACN34082.1| unknown [Zea mays]
          Length = 641

 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 62/148 (41%), Positives = 92/148 (62%), Gaps = 5/148 (3%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K    R+ F RS IH WGL A   I+  + V+EY GE +R  ++D+RE +Y   G    Y
Sbjct: 495 KTEHGRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREEKYRVQG-KDCY 553

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
           LF++ +  VVDAT +G +AR INHSC PNCY +++SV   + +I + AK+ + AG+E+TY
Sbjct: 554 LFKISEEVVVDATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKKVCAGDELTY 613

Query: 912 NYKF---PLEEKKIPCYCGSKKCHGSLN 936
           +Y F     +E+K+PC C +  C   +N
Sbjct: 614 DYLFDPDEADERKVPCLCQTANCRKFMN 641


>gi|300176194|emb|CBK23505.2| unnamed protein product [Blastocystis hominis]
          Length = 400

 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 96/171 (56%), Gaps = 10/171 (5%)

Query: 776 NLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKI 835
           + L   E     +A Q +A  ++ +  +S IH WGL A         VIE+ GE++   +
Sbjct: 230 DTLGTTELLRQYRALQCRAESEQPKVIKSPIHGWGLFAKTAFRRGMMVIEFAGEVVGQDV 289

Query: 836 SDIRECRYEK----------MGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
           ++ RE  + +            +G  Y FR+D  ++VDAT +G  +RFINH C+PNC  +
Sbjct: 290 ANAREAVWAQGAAVRRRYNRRNVGGCYFFRMDAEHIVDATVKGNESRFINHCCDPNCDAR 349

Query: 886 VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           V+++  + KI  +A R I  GEEITY+YKF  E++++ C CG+K C G +N
Sbjct: 350 VVNINNENKIIFFANRDINPGEEITYDYKFQYEDRELVCLCGAKNCVGRMN 400


>gi|74193661|dbj|BAE22782.1| unnamed protein product [Mus musculus]
          Length = 865

 Score =  127 bits (318), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 724 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 782

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 783 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 842

Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
             E  + KIPC+CG+  C   +N
Sbjct: 843 DFEDDQHKIPCHCGAVNCRKWMN 865


>gi|113470963|gb|ABI34883.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 96

 Score =  127 bits (318), Expect = 4e-26,   Method: Composition-based stats.
 Identities = 59/96 (61%), Positives = 72/96 (75%), Gaps = 3/96 (3%)

Query: 843 YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRH 902
           Y+  GIG  Y+FR+DD  VVDAT  G  ARFINHSC PNCY++VI+V+GQK I I+A R 
Sbjct: 2   YDDKGIGC-YMFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVINVDGQKHIVIFATRK 60

Query: 903 IAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           I  GEE+TY+YKFP+EE   K+PC CG+KKC   LN
Sbjct: 61  IYKGEELTYDYKFPIEEPGNKLPCNCGAKKCRKFLN 96


>gi|84997431|ref|XP_953437.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65304433|emb|CAI76812.1| hypothetical protein, conserved [Theileria annulata]
          Length = 3595

 Score =  127 bits (318), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 67/159 (42%), Positives = 97/159 (61%), Gaps = 3/159 (1%)

Query: 765  LSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVI 824
            L  R NR + R     +  +   +       + RL  + S IH  GL A E I A + V+
Sbjct: 3434 LETRNNRSRSRKQYDLSNTSTQYRYLLSIPPENRLDVKPSVIHGLGLFATENIPAGEPVV 3493

Query: 825  EYVGELIRLKISDIRECRYE--KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
            EYVGELIR  + D RE  Y   + G GS Y+FRLDD Y+VDAT++G ++RFINHSC+PNC
Sbjct: 3494 EYVGELIRDVVGDQREEFYSEGQGGDGSCYMFRLDDQYIVDATRKGNMSRFINHSCDPNC 3553

Query: 883  YTKVISVE-GQKKIFIYAKRHIAAGEEITYNYKFPLEEK 920
              ++I+ E G K I ++AK  ++ G+E+TY+Y+  ++ K
Sbjct: 3554 LCRIITCENGMKHIVVFAKSELSPGDEVTYDYQVTIQFK 3592


>gi|74218832|dbj|BAE37820.1| unnamed protein product [Mus musculus]
          Length = 342

 Score =  127 bits (318), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 201 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 259

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 260 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 319

Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
             E  + KIPC+CG+  C   +N
Sbjct: 320 DFEDDQHKIPCHCGAVNCRKWMN 342


>gi|343960312|dbj|BAK64010.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 homolog [Pan
           troglodytes]
          Length = 411

 Score =  126 bits (316), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 270 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 328

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 329 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 388

Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
             E  + KIPC+CG+  C   +N
Sbjct: 389 DFEDDQHKIPCHCGAVNCRKWMN 411


>gi|356503907|ref|XP_003520741.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 985

 Score =  125 bits (315), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 60/147 (40%), Positives = 92/147 (62%), Gaps = 4/147 (2%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K   +++   +S IH WGL A   +E  + V+EY GE +R  I+D+RE +Y   G    Y
Sbjct: 840 KTENQKVCLGKSGIHGWGLFARRDLEEGEMVVEYRGEQLRRSITDLREAQYRSEG-KDCY 898

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
            F++ +  V+DAT +G IAR INHSC PNC+ +++ +  Q+ +I + AK +++AGEE+TY
Sbjct: 899 FFKISEEVVIDATDKGNIARLINHSCMPNCFARIVPLSDQENRIVLIAKTNVSAGEELTY 958

Query: 912 NYKFPLE--EKKIPCYCGSKKCHGSLN 936
           NY F  E  E+K+ C C +  C G +N
Sbjct: 959 NYSFDDERDEEKVVCRCKAPNCSGFMN 985


>gi|356522510|ref|XP_003529889.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 989

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/148 (42%), Positives = 93/148 (62%), Gaps = 5/148 (3%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           +  K R+ F +S IH WGL A   I+  + V+EY G  +R  ++D+RE +Y   G    Y
Sbjct: 843 ETEKFRVCFGKSGIHGWGLFARRDIQEGEMVVEYRGVHVRRSVADLREEKYRSEG-KDCY 901

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITY 911
           LF++ +  VVDAT RG IAR INHSC PNCY +++S+  Q  +I + AK +++AGEE+TY
Sbjct: 902 LFKISEEVVVDATNRGNIARLINHSCMPNCYARIMSLGDQGSRIVLIAKTNVSAGEELTY 961

Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
           +Y F  +E+   K+PC C +  C   +N
Sbjct: 962 DYLFDPDERDELKVPCLCKAPNCRRFMN 989


>gi|339244151|ref|XP_003378001.1| putative SET domain protein [Trichinella spiralis]
 gi|316973127|gb|EFV56754.1| putative SET domain protein [Trichinella spiralis]
          Length = 1416

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 89/143 (62%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RSKI   GL A   +E    VIEYVGE+IR ++++ RE  Y+K   G  Y+FR
Sbjct: 1275 KSNVYLARSKIQGLGLFANRDVEMNAMVIEYVGEVIRNEVAERREKSYQKRNRGV-YMFR 1333

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            LD  +V+DAT  GG AR+INHSC+PNC  + I  + + +I I + R I  GEE+TY+Y+F
Sbjct: 1334 LDSDHVIDATVAGGPARYINHSCDPNCIAERIDFDRESRIVIMSCRPICKGEELTYDYQF 1393

Query: 916  PLEEK--KIPCYCGSKKCHGSLN 936
              E++  K+PC C +  C   +N
Sbjct: 1394 DFEDELNKLPCLCRAPNCRKWMN 1416


>gi|145332921|ref|NP_001078326.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
 gi|332646730|gb|AEE80251.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
          Length = 982

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/190 (37%), Positives = 110/190 (57%), Gaps = 19/190 (10%)

Query: 765 LSARTNRVK--LRNLLAAAEGAELLKASQVKARKKRLR-----------FQRSKIHDWGL 811
           L A ++R+K    + L+A E     K +   + ++RL+           F +S IH WGL
Sbjct: 794 LEAISHRLKGPSHHSLSAIENLNSFKEADFTSFRERLKHLQRTENFRVCFGKSGIHGWGL 853

Query: 812 VALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIA 871
            A + I+  + +IEY G  +R  ++D+RE  Y   G    YLF++ +  V+DAT  G IA
Sbjct: 854 FARKSIQEGEMIIEYRGVKVRRSVADLREANYRSQG-KDCYLFKISEEIVIDATDSGNIA 912

Query: 872 RFINHSCNPNCYTKVISV-EGQ-KKIFIYAKRHIAAGEEITYNYKFPL---EEKKIPCYC 926
           R INHSC PNCY +++S+ +G+  +I + AK ++AAGEE+TY+Y F +   EE K+PC C
Sbjct: 913 RLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDYLFEVDESEEIKVPCLC 972

Query: 927 GSKKCHGSLN 936
            +  C   +N
Sbjct: 973 KAPNCRKFMN 982


>gi|74145466|dbj|BAE36172.1| unnamed protein product [Mus musculus]
          Length = 563

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 88/143 (61%), Gaps = 3/143 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 422 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 480

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 481 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 540

Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
             E  + KIPC+CG+  C   +N
Sbjct: 541 DFEDDQHKIPCHCGAVNCRKWMN 563


>gi|350595097|ref|XP_003134587.3| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Sus
            scrofa]
          Length = 3482

 Score =  125 bits (313), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 66/155 (42%), Positives = 93/155 (60%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ +  K   +      RS+I   GL A   IE    VIEY+G +IR ++++ +E  Y
Sbjct: 3329 KSSQYRKMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLY 3388

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FR+D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I
Sbjct: 3389 ESQNRGV-YMFRMDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRI 3447

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
              GEE+ Y+YKF  E  + KIPC+CG+  C   +N
Sbjct: 3448 QKGEELCYDYKFDFEDDQHKIPCHCGAVNCRKWMN 3482


>gi|403223612|dbj|BAM41742.1| predicted protein [Theileria orientalis strain Shintoku]
          Length = 4555

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 64/130 (49%), Positives = 87/130 (66%), Gaps = 3/130 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE--KMGIGSSYL 853
            ++RL  + S IH  GL A  PI A D V+EYVGELIR  I D RE  Y   + G GS Y+
Sbjct: 4425 EQRLDVRPSLIHGLGLFASVPIRAGDPVVEYVGELIRDVIGDQREEIYSEGQGGDGSCYM 4484

Query: 854  FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYN 912
            FRLDD  +VDAT++G ++RFINHSC+PNC  ++I+ E G K I ++AK  +  G+EITY+
Sbjct: 4485 FRLDDQLIVDATRKGNMSRFINHSCDPNCLCRIITCENGLKHIVVFAKSDLGPGDEITYD 4544

Query: 913  YKFPLEEKKI 922
            Y+    + K+
Sbjct: 4545 YQVTERDDKL 4554


>gi|145339751|ref|NP_191733.3| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
 gi|259016183|sp|Q9M364.2|ATX3_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX3; AltName:
            Full=Protein SET DOMAIN GROUP 14; AltName:
            Full=Trithorax-homolog protein 3; Short=TRX-homolog
            protein 3
 gi|225898735|dbj|BAH30498.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646729|gb|AEE80250.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
          Length = 1018

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/144 (43%), Positives = 92/144 (63%), Gaps = 6/144 (4%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F +S IH WGL A + I+  + +IEY G  +R  ++D+RE  Y   G    YLF++ 
Sbjct: 876  RVCFGKSGIHGWGLFARKSIQEGEMIIEYRGVKVRRSVADLREANYRSQG-KDCYLFKIS 934

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQ-KKIFIYAKRHIAAGEEITYNYKF 915
            +  V+DAT  G IAR INHSC PNCY +++S+ +G+  +I + AK ++AAGEE+TY+Y F
Sbjct: 935  EEIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDYLF 994

Query: 916  PL---EEKKIPCYCGSKKCHGSLN 936
             +   EE K+PC C +  C   +N
Sbjct: 995  EVDESEEIKVPCLCKAPNCRKFMN 1018


>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
          Length = 4032

 Score =  125 bits (313), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 66/155 (42%), Positives = 92/155 (59%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ +  K   +      RS+I   GL A   IE    VIEY+G +IR ++++ +E  Y
Sbjct: 3879 KSSQYRKMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLY 3938

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E    G  Y+FR+D  +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I
Sbjct: 3939 ESQNRGV-YMFRMDSDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRI 3997

Query: 904  AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
              GEE+ Y+YKF  E  + KIPC+CG+  C   +N
Sbjct: 3998 QKGEELCYDYKFDFEDDQHKIPCHCGAVNCRKWMN 4032


>gi|302832672|ref|XP_002947900.1| histone H3 Lys 4 methyltransferase [Volvox carteri f. nagariensis]
 gi|300266702|gb|EFJ50888.1| histone H3 Lys 4 methyltransferase [Volvox carteri f. nagariensis]
          Length = 2171

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 93/152 (61%), Gaps = 5/152 (3%)

Query: 789  ASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGI 848
            A+ +++   R+ F +S IH WG+ A   I  +  V E+ GE +R  ++D+RE RY+  G 
Sbjct: 2021 AAALRSEPSRITFGKSGIHGWGIFARTDIPQDAIVTEFRGEAVRPVVADLRERRYKAQGR 2080

Query: 849  GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY-AKRHIAAGE 907
               +LF ++   V+D+T  G   RF NHSC+P+ YTKV+  EG +   ++ A+  I AG+
Sbjct: 2081 -DCFLFHMNGEVVLDSTHLGHYGRFTNHSCSPSLYTKVLEFEGGRVRLVFCARTDIRAGQ 2139

Query: 908  EITYNYKFPLE---EKKIPCYCGSKKCHGSLN 936
            E+T++Y+F  E   E K+PC CG+  C G+LN
Sbjct: 2140 ELTFDYRFKEEDHPEAKVPCRCGAPNCRGTLN 2171


>gi|110742931|dbj|BAE99361.1| trithorax 3 [Arabidopsis thaliana]
          Length = 1018

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/144 (42%), Positives = 92/144 (63%), Gaps = 6/144 (4%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F +S IH WGL A + I+  + +IEY G  +R  ++D+RE  Y   G    YLF++ 
Sbjct: 876  RVCFGKSGIHGWGLFARKSIQEGEMIIEYRGVKVRRSVADLREANYRSQG-KDCYLFKIS 934

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQ-KKIFIYAKRHIAAGEEITYNYKF 915
            +  V+DAT  G IAR INHSC PNCY +++S+ +G+  +I + AK ++AAGEE+TY+Y F
Sbjct: 935  EEIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDYLF 994

Query: 916  PLEEK---KIPCYCGSKKCHGSLN 936
             ++E    K+PC C +  C   +N
Sbjct: 995  EVDESGEIKVPCLCKAPNCRKFMN 1018


>gi|71029598|ref|XP_764442.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351396|gb|EAN32159.1| SET-domain protein, putative [Theileria parva]
          Length = 175

 Score =  124 bits (312), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 66/170 (38%), Positives = 102/170 (60%), Gaps = 3/170 (1%)

Query: 748 KYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIH 807
           ++L  ++   +  + + +  R NR + R     +  +   +       + RL  + S IH
Sbjct: 3   EHLSKDLREREQTDPELIDVRNNRSRTRKQYDLSNSSTQYRYLLSIPPESRLDVKPSVIH 62

Query: 808 DWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE--KMGIGSSYLFRLDDGYVVDAT 865
             GL A E I A + V+EYVGELIR  + D RE  Y   + G GS Y+FRLDD Y+VDAT
Sbjct: 63  GLGLFATESIPAGEPVVEYVGELIRDVVGDQREELYSEGQGGDGSCYMFRLDDQYIVDAT 122

Query: 866 KRGGIARFINHSCNPNCYTKVISVE-GQKKIFIYAKRHIAAGEEITYNYK 914
           ++G ++RFINHSC+PNC  ++I+ E G K I ++AK  ++ G+E+TY+Y+
Sbjct: 123 RKGNMSRFINHSCDPNCLCRIITCENGMKHIVVFAKSELSPGDEVTYDYQ 172


>gi|223974115|gb|ACN31245.1| unknown [Zea mays]
          Length = 214

 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/152 (42%), Positives = 97/152 (63%), Gaps = 5/152 (3%)

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           GLS    +  + N+ +  E   ++KA+     +KRL F +SKIH +G+ A    +A D +
Sbjct: 64  GLSDAVQQEAIGNVSSMVEKYTIVKATF----RKRLTFGKSKIHGYGVFAKVAHKAGDMM 119

Query: 824 IEYVGELIRLKISDIRECR-YEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           +EY+GE++R  ISD RE R Y  +    +Y+FR+DD  V+DAT+ G IAR INHS  PNC
Sbjct: 120 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSREPNC 179

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
           Y++ I++ G + I I+AKR I   EE+TY+Y+
Sbjct: 180 YSRAITILGDEHIIIFAKRDIDPWEELTYDYR 211


>gi|255076429|ref|XP_002501889.1| predicted protein [Micromonas sp. RCC299]
 gi|226517153|gb|ACO63147.1| predicted protein [Micromonas sp. RCC299]
          Length = 140

 Score =  124 bits (311), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 61/140 (43%), Positives = 88/140 (62%), Gaps = 6/140 (4%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK--MGIGSSYLFRLD--D 858
           RS+IH  G+    P+ A+   +EYVGE+I    +D RE   ++  +   S+Y+F LD  +
Sbjct: 1   RSRIHGLGVYCTVPLPAKCAALEYVGEIISDANADAREAAEKRSILARASTYMFALDPVN 60

Query: 859 GYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-- 916
           G VVDA ++G   RF+NH C PNC T+V+ V G+KKI ++  R +AAGEE+TY+Y F   
Sbjct: 61  GTVVDAARKGNATRFVNHCCEPNCVTRVVVVGGEKKILLFTSRDVAAGEELTYDYMFKPD 120

Query: 917 LEEKKIPCYCGSKKCHGSLN 936
           L E + PC CG+  C G +N
Sbjct: 121 LPENEAPCDCGADTCRGIIN 140


>gi|255564717|ref|XP_002523353.1| trithorax, putative [Ricinus communis]
 gi|223537441|gb|EEF39069.1| trithorax, putative [Ricinus communis]
          Length = 1057

 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/148 (42%), Positives = 92/148 (62%), Gaps = 5/148 (3%)

Query: 793  KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
            K    R+ F +S IH WGL A   I+  + VIEY GE +R  I+D+RE RY   G    Y
Sbjct: 911  KTECHRVCFGKSGIHGWGLFARRNIQEGEMVIEYRGEQVRRSIADLRESRYRLEG-KDCY 969

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQKKIFIYAKRHIAAGEEITY 911
            LF++ +  V+DAT +G IAR INHSC PNCY +++SV + + +I + AK +++A +E+TY
Sbjct: 970  LFKISEEVVIDATNKGNIARLINHSCMPNCYARIMSVGDVENRIVLIAKTNVSAADELTY 1029

Query: 912  NYKFPLEEK---KIPCYCGSKKCHGSLN 936
            +Y F  +E    K+PC C +  C   +N
Sbjct: 1030 DYLFDPDEHDDLKVPCLCRAPNCRKFMN 1057


>gi|122934986|gb|ABM68250.1| MLL [Lagothrix lagotricha]
 gi|122938206|gb|ABM68961.1| MLL [Lemur catta]
 gi|124013641|gb|ABM88082.1| MLL [Macaca nemestrina]
          Length = 91

 Score =  123 bits (309), Expect = 4e-25,   Method: Composition-based stats.
 Identities = 56/92 (60%), Positives = 70/92 (76%), Gaps = 3/92 (3%)

Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
           GIG  Y+FR+DD  VVDAT  G  ARFINHSC PNCY++VI+++GQK I I+A R I  G
Sbjct: 1   GIGC-YMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRG 59

Query: 907 EEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           EE+TY+YKFP+E+   K+PC CG+KKC   LN
Sbjct: 60  EELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 91


>gi|297821052|ref|XP_002878409.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324247|gb|EFH54668.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1002

 Score =  123 bits (309), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 61/144 (42%), Positives = 91/144 (63%), Gaps = 6/144 (4%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
            R+ F +S IH WGL A   I+  + +IEY G  +R  ++D+RE  Y   G    YLF++ 
Sbjct: 860  RVCFGKSGIHGWGLFARNSIQEGEMIIEYRGVKVRRSVADLREANYRSQG-KDCYLFKIS 918

Query: 858  DGYVVDATKRGGIARFINHSCNPNCYTKVISV-EGQ-KKIFIYAKRHIAAGEEITYNYKF 915
            +  V+DAT  G IAR INHSC PNCY +++S+ +G+  +I + AK +++AGEE+TY+Y F
Sbjct: 919  EEIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVSAGEELTYDYLF 978

Query: 916  PL---EEKKIPCYCGSKKCHGSLN 936
             +   EE K+PC C +  C   +N
Sbjct: 979  EVDESEEIKVPCLCKAPNCRKFMN 1002


>gi|357142828|ref|XP_003572707.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           ASHR3-like [Brachypodium distachyon]
          Length = 466

 Score =  123 bits (308), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 54/141 (38%), Positives = 89/141 (63%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           + K+++  +S+   WG +ALEP+E  DF+IEYVGE+I     + R    ++ G  + Y+ 
Sbjct: 228 KDKKIKIVKSEGCGWGAIALEPLEKCDFIIEYVGEVIDDATCEQRLWEMKRRGDKNFYMC 287

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   Y +DAT +G  +RF+NHSC+PNC  +   V+G+ ++ ++A R I  GE +TY+Y+
Sbjct: 288 EISKDYTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRFIEVGEPLTYDYR 347

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F    +K+ C+CG+K C G L
Sbjct: 348 FVHFGEKVKCHCGAKSCQGYL 368


>gi|449275932|gb|EMC84661.1| Histone-lysine N-methyltransferase HRX, partial [Columba livia]
          Length = 87

 Score =  122 bits (307), Expect = 7e-25,   Method: Composition-based stats.
 Identities = 54/87 (62%), Positives = 65/87 (74%), Gaps = 2/87 (2%)

Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
           Y+FR+DD  VVDAT  G  ARFINHSC PNCY++VI VEG K I I+A R I  GEE+TY
Sbjct: 1   YMFRIDDAEVVDATMHGSAARFINHSCEPNCYSRVIQVEGHKHIVIFALRRILRGEELTY 60

Query: 912 NYKFPLEE--KKIPCYCGSKKCHGSLN 936
           +YKFP+EE   K+PC CG+K+C   LN
Sbjct: 61  DYKFPIEEPAAKLPCNCGAKRCRRFLN 87


>gi|384252564|gb|EIE26040.1| hypothetical protein COCSUDRAFT_13072, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 859

 Score =  122 bits (307), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 88/149 (59%), Gaps = 10/149 (6%)

Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFRL 856
           RL   +S IH WG     P    D ++EY+GEL+R  ++D RE R Y+++    +Y+F L
Sbjct: 681 RLTCGKSAIHGWGAFTKVPAAESDMLVEYMGELLRRPVADARERRTYDRLVGAGTYVFGL 740

Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG------QKKIFIYAKRHIAAGEEIT 910
            D  VVDAT++G +A  +NHSC PN Y++ +SV           + I+AKR IAAGEE+T
Sbjct: 741 SDELVVDATRKGNMAHLLNHSCEPNSYSRTVSVRCPDTGTLSDHVVIFAKRAIAAGEELT 800

Query: 911 YNYKFPLEEK---KIPCYCGSKKCHGSLN 936
           Y+Y+         K+PC CG+  C G +N
Sbjct: 801 YDYRRAPSHSTPHKLPCNCGAATCRGFVN 829


>gi|443696187|gb|ELT96958.1| hypothetical protein CAPTEDRAFT_153177 [Capitella teleta]
          Length = 844

 Score =  122 bits (307), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 67/138 (48%), Positives = 87/138 (63%), Gaps = 3/138 (2%)

Query: 801 FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGY 860
            +RS +   GL A   +E    VIEY+GELIR ++++ RE  Y+    G  Y+FR+DD  
Sbjct: 708 LRRSNVQGLGLFASRDLERHTMVIEYIGELIRSEVAEARERVYDSQNRGV-YMFRIDDNT 766

Query: 861 VVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE-- 918
           VVDAT  GG AR+INHSC PNC  +V+  E   KI I A+R IA GEE+TY+YKF  E  
Sbjct: 767 VVDATMTGGPARYINHSCAPNCVAEVVPFEKDSKIIIIARRRIARGEELTYDYKFDFEDD 826

Query: 919 EKKIPCYCGSKKCHGSLN 936
           + KIPC CG+  C   +N
Sbjct: 827 QHKIPCLCGAPNCRKWMN 844


>gi|356560272|ref|XP_003548417.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 954

 Score =  122 bits (307), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 62/145 (42%), Positives = 91/145 (62%), Gaps = 5/145 (3%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K R+ F +S IH WGL A   I+  + V+EY G  +R  ++D+RE +Y   G    YLF+
Sbjct: 811 KFRVCFGKSGIHGWGLFARRDIQEGEMVVEYRGVHVRRSVTDLREEKYRSEG-KDCYLFK 869

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYK 914
           + +  VVDAT  G IAR INHSC PNCY +++S+  Q  +I + AK +++AGEE+TY+Y 
Sbjct: 870 ISEEVVVDATNSGNIARLINHSCMPNCYARIMSMGDQGSRIVLIAKTNVSAGEELTYDYL 929

Query: 915 FPLEEK---KIPCYCGSKKCHGSLN 936
           F  +E+   K+PC C +  C   +N
Sbjct: 930 FDPDERDELKVPCLCKAPNCRRFMN 954


>gi|295646353|gb|ADG23062.1| myeloid/lymphoid or mixed-lineage leukemia 3, partial [Gadus
           morhua]
          Length = 117

 Score =  122 bits (307), Expect = 7e-25,   Method: Composition-based stats.
 Identities = 57/116 (49%), Positives = 83/116 (71%), Gaps = 3/116 (2%)

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           VIEY+G +IR ++++ +E  YE    G  Y+FR+D+ YV+DAT  GG AR++NHSC+PNC
Sbjct: 3   VIEYIGTIIRSEVANRKERLYEAQNRGV-YMFRIDNDYVIDATITGGPARYVNHSCSPNC 61

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
            T+V++VE + KI I + R I  GEE++Y+YKF LE  + KIPC+CG+  C   +N
Sbjct: 62  ITEVVTVERENKIIISSCRRIQRGEELSYDYKFDLEDDQHKIPCHCGTLNCRKWMN 117


>gi|291232347|ref|XP_002736118.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like, partial
            [Saccoglossus kowalevskii]
          Length = 3264

 Score =  122 bits (307), Expect = 7e-25,   Method: Composition-based stats.
 Identities = 65/155 (41%), Positives = 95/155 (61%), Gaps = 7/155 (4%)

Query: 788  KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
            K+SQ +  K+  R      RS+I   GL A   IE    VIEY+G +IR ++++ RE  Y
Sbjct: 3111 KSSQYRRLKQEWRNNVYLARSRIQGLGLFATRDIEKHTMVIEYIGTIIRNEVANRREDIY 3170

Query: 844  EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
            E+   G  Y+FR++  YV+DAT  GG AR++NHSC+PNC  +V++ + ++KI I +   I
Sbjct: 3171 EEANRGV-YMFRINSDYVIDATLTGGPARYVNHSCSPNCVAEVVTFDKEQKIIIISNSRI 3229

Query: 904  AAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
            A GEE+TY+YKF  E+   KI C C +  C   +N
Sbjct: 3230 AKGEELTYDYKFDFEDDKNKISCLCKAVNCKKWMN 3264


>gi|429327493|gb|AFZ79253.1| hypothetical protein BEWA_021000 [Babesia equi]
          Length = 849

 Score =  122 bits (307), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 87/146 (59%), Gaps = 5/146 (3%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           + K +   +SKIH +GL A++PI   + ++EY G +I   ++DI E  Y+    GS+YLF
Sbjct: 700 KAKHVTIGKSKIHGYGLFAVQPIPKGELILEYAGVVITEHLADILEREYDATIGGSTYLF 759

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTK---VISVEG-QKKIFIYAKRHIAAGEEIT 910
           RLD  Y+VD+T  G  ARFINHSCNPN  T     I  +G    + +YA R I AGEE+ 
Sbjct: 760 RLDQNYIVDSTLFGNSARFINHSCNPNTMTTNFACIEDDGFGTHVGVYASRDILAGEELY 819

Query: 911 YNYKFPL-EEKKIPCYCGSKKCHGSL 935
           YNY+      KK  CYCG + C G +
Sbjct: 820 YNYRLSEGSNKKEVCYCGDEMCQGYM 845


>gi|14626492|gb|AAK70214.1| MLL3-like protein [Mus musculus]
          Length = 677

 Score =  122 bits (307), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 61/134 (45%), Positives = 84/134 (62%), Gaps = 3/134 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 545 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 603

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE+ Y+YKF
Sbjct: 604 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKF 663

Query: 916 PLE--EKKIPCYCG 927
             E  + KIPC+CG
Sbjct: 664 DFEDDQHKIPCHCG 677


>gi|431918576|gb|ELK17794.1| Histone-lysine N-methyltransferase MLL4 [Pteropus alecto]
          Length = 100

 Score =  122 bits (306), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 56/92 (60%), Positives = 69/92 (75%), Gaps = 3/92 (3%)

Query: 847 GIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAG 906
           GIG  Y+FR+DD  VVDAT  G  ARFINHSC PNC+++VI VEGQK I I+A R I  G
Sbjct: 10  GIGC-YMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRG 68

Query: 907 EEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           EE+TY+YKFP+E+   K+PC CG+K+C   LN
Sbjct: 69  EELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 100


>gi|326522849|dbj|BAJ88470.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 394

 Score =  122 bits (305), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 55/141 (39%), Positives = 89/141 (63%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           + K+++  +S+   WG VALEP+E  DF+IEYVGE+I     + R    ++ G  + Y+ 
Sbjct: 160 KDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMC 219

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSC+PNC  +   VEG+ ++ ++A R I  GE +TY+Y+
Sbjct: 220 EISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGETRVGVFASRIIEVGEPLTYDYR 279

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F    +K+ C+CG+K C G L
Sbjct: 280 FVHFGEKVKCHCGAKSCQGYL 300


>gi|222617992|gb|EEE54124.1| hypothetical protein OsJ_00897 [Oryza sativa Japonica Group]
          Length = 1585

 Score =  122 bits (305), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/130 (46%), Positives = 86/130 (66%), Gaps = 2/130 (1%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K   KR+   RS IH WGL A + I+    VIEY G+ +R  ++D+RE RY +      Y
Sbjct: 753 KIENKRVSCGRSGIHGWGLFAAKKIQEGQMVIEYRGDQVRRSVADLREARYHREK-KDCY 811

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
           LF++ +  VVDAT++G IAR INHSC PNCY +++SV  +K +I + AKR ++AGEE+TY
Sbjct: 812 LFKISEDVVVDATEKGNIARLINHSCMPNCYARIMSVGDEKSQIILIAKRDVSAGEELTY 871

Query: 912 NYKFPLEEKK 921
           +Y F  +E +
Sbjct: 872 DYLFDPDESE 881


>gi|222623224|gb|EEE57356.1| hypothetical protein OsJ_07493 [Oryza sativa Japonica Group]
          Length = 384

 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 52/141 (36%), Positives = 89/141 (63%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           + K+++  ++K   WG ++LEP+E  DF+IEYVGE+I     + R    ++ G  + Y+ 
Sbjct: 141 KDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMC 200

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSC+PNC  +   V+G+ ++ ++A R I  GE +TY+Y+
Sbjct: 201 EISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRSIQVGEHLTYDYR 260

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F    +K+ CYCG++ C G L
Sbjct: 261 FVHFGEKVKCYCGAQNCQGYL 281


>gi|357512571|ref|XP_003626574.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
 gi|355501589|gb|AES82792.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
          Length = 164

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/148 (39%), Positives = 92/148 (62%), Gaps = 5/148 (3%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K    R+   +S I  WGL A   ++  D V+EY GE +R  ++D+RE +Y   G    Y
Sbjct: 18  KTENCRVCLGKSAIQGWGLFARRDLQEGDMVVEYRGEQLRRSVADLREAKYRLEG-KDCY 76

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQK-KIFIYAKRHIAAGEEITY 911
           LF++ +  V+DAT +G IAR INHSC PNC+ +++ +  Q+ +I + AK +I+AGEE+TY
Sbjct: 77  LFKISEEVVIDATDKGNIARLINHSCMPNCFARIMCLGDQESRIVLIAKTNISAGEELTY 136

Query: 912 NYKFPLEEK---KIPCYCGSKKCHGSLN 936
           +Y F  +E+   K+PC+C +  C   +N
Sbjct: 137 DYLFDTDERDELKVPCHCKAPNCRKFMN 164


>gi|297599580|ref|NP_001047405.2| Os02g0611300 [Oryza sativa Japonica Group]
 gi|255671080|dbj|BAF09319.2| Os02g0611300 [Oryza sativa Japonica Group]
          Length = 263

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 52/141 (36%), Positives = 89/141 (63%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           + K+++  ++K   WG ++LEP+E  DF+IEYVGE+I     + R    ++ G  + Y+ 
Sbjct: 20  KDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMC 79

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSC+PNC  +   V+G+ ++ ++A R I  GE +TY+Y+
Sbjct: 80  EISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRSIQVGEHLTYDYR 139

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F    +K+ CYCG++ C G L
Sbjct: 140 FVHFGEKVKCYCGAQNCQGYL 160


>gi|300176189|emb|CBK23500.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 56/144 (38%), Positives = 85/144 (59%), Gaps = 10/144 (6%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK----------MGIGSSY 852
           +S IH WGL A         VIE+ GE++   +++ RE  + +            +G  Y
Sbjct: 177 KSPIHGWGLFAKTAFRRGMMVIEFAGEVVGQDVANAREAVWAQGAAVRRRYNRRNVGGCY 236

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
            FR+D  ++VDAT +G  +RFINH C+PNC  +V+++  + KI  +A R I  GEEITY+
Sbjct: 237 FFRMDAEHIVDATVKGNESRFINHCCDPNCDARVVNINNENKIIFFANRDINPGEEITYD 296

Query: 913 YKFPLEEKKIPCYCGSKKCHGSLN 936
           YKF  E++++ C CG+K C G +N
Sbjct: 297 YKFQYEDRELVCLCGAKNCVGRMN 320


>gi|83286297|ref|XP_730100.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23489726|gb|EAA21665.1| similar to KIAA0304 gene product-related [Plasmodium yoelii yoelii]
          Length = 1137

 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 62/122 (50%), Positives = 83/122 (68%), Gaps = 4/122 (3%)

Query: 818  EAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHS 877
            E E  VIEY+GE IR  ISD RE  YEK+   S Y+FRL++  ++DATK G ++RFINHS
Sbjct: 1017 EKEMPVIEYIGEYIRNIISDKREKYYEKIE-SSCYMFRLNENIIIDATKWGNVSRFINHS 1075

Query: 878  CNPNCYTKVISVEGQ-KKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGS 934
            C PNC+ K++S +   K I I+AK+ I   EEITY+Y+F +E   +K+ C CGS  C G 
Sbjct: 1076 CEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFGVESEGEKLICLCGSSTCLGR 1135

Query: 935  LN 936
            +N
Sbjct: 1136 MN 1137


>gi|326487338|dbj|BAJ89653.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 340

 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 55/141 (39%), Positives = 88/141 (62%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           + K+++  +S+   WG VALEP+E  DF+IEYVGE+I     + R    ++ G  + Y+ 
Sbjct: 106 KDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMC 165

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSC+PNC  +   VEG+ +  ++A R I  GE +TY+Y+
Sbjct: 166 EISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGETRAGVFASRIIEVGEPLTYDYR 225

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F    +K+ C+CG+K C G L
Sbjct: 226 FVHFGEKVKCHCGAKSCQGYL 246


>gi|255557755|ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
 gi|223540953|gb|EEF42511.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
          Length = 1125

 Score =  120 bits (300), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 60/128 (46%), Positives = 82/128 (64%), Gaps = 4/128 (3%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE---KMGIGSS-Y 852
            K L   +S IH  GL     I   + V+EYVGE++ L+++D RE  Y+   K+   S+ Y
Sbjct: 985  KHLVVYKSGIHALGLYTARFISRGEMVVEYVGEIVGLRVADKRENEYQSGRKLQYKSACY 1044

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
             FR+D   ++DAT +GGIARF+NHSC PNC  KVISV   KK+  +A+R I  GEEITY+
Sbjct: 1045 FFRIDKENIIDATHKGGIARFVNHSCLPNCVAKVISVRNDKKVVFFAERDIYPGEEITYD 1104

Query: 913  YKFPLEEK 920
            Y F  E++
Sbjct: 1105 YHFNHEDE 1112


>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
            (Silurana) tropicalis]
          Length = 6019

 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5878 KTNVYLARSRIQGLGLYASKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5936

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5937 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5996

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5997 DFEDDQHKIPCHCGAWNCRKWMN 6019


>gi|297461543|ref|XP_591902.5| PREDICTED: histone-lysine N-methyltransferase SETD1B [Bos taurus]
          Length = 1936

 Score =  119 bits (299), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 71/136 (52%), Positives = 94/136 (69%), Gaps = 5/136 (3%)

Query: 747  TKYLGPEVNASQWANGKGLSARTNRVKLRNLLAAAEGA---ELLKASQVKARKKRLRFQR 803
            T   G  + A   A+ +  S R  R + R LL++  G+   +LLK +Q+K RKK+L+F +
Sbjct: 1726 TDTQGMSIPAQPHASTRAGSER--RSEQRRLLSSFTGSCDSDLLKFNQLKFRKKKLKFCK 1783

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S IHDWGL A+EPI A++ VIEYVG+ IR  I+D+RE RYE  GIGSSY+FR+D   ++D
Sbjct: 1784 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 1843

Query: 864  ATKRGGIARFINHSCN 879
            ATK G  ARFINHSCN
Sbjct: 1844 ATKCGNFARFINHSCN 1859


>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
          Length = 5261

 Score =  119 bits (298), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5120 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5178

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5179 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5238

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5239 DFEDDQHKIPCHCGAWNCRKWMN 5261


>gi|62088568|dbj|BAD92731.1| myeloid/lymphoid or mixed-lineage leukemia 2 variant [Homo sapiens]
          Length = 2704

 Score =  119 bits (298), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 2563 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 2621

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 2622 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 2681

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 2682 DFEDDQHKIPCHCGAWNCRKWMN 2704


>gi|196016259|ref|XP_002117983.1| hypothetical protein TRIADDRAFT_62004 [Trichoplax adhaerens]
 gi|190579456|gb|EDV19551.1| hypothetical protein TRIADDRAFT_62004 [Trichoplax adhaerens]
          Length = 589

 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/136 (47%), Positives = 90/136 (66%), Gaps = 3/136 (2%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           RS I   GL A + IE    VIEY+G +IR ++++ RE  Y+K   G  Y+FR+D  +VV
Sbjct: 455 RSLIQGLGLFARKNIEKNTMVIEYIGAMIRNEVANKRERIYQKANHGI-YMFRIDPDFVV 513

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--K 920
           DAT+ GG+AR+INHSC PNC  +V++ + + +I I + R I  GEE+TY+YKF  E+   
Sbjct: 514 DATEDGGLARYINHSCQPNCVAEVVTFDSEPRIIIISNRRIDKGEELTYDYKFEYEDDLN 573

Query: 921 KIPCYCGSKKCHGSLN 936
           KIPC CG+  C G +N
Sbjct: 574 KIPCMCGAPNCRGWMN 589


>gi|170047037|ref|XP_001851046.1| set domain protein [Culex quinquefasciatus]
 gi|167869598|gb|EDS32981.1| set domain protein [Culex quinquefasciatus]
          Length = 719

 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 49/89 (55%), Positives = 71/89 (79%)

Query: 827 VGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV 886
           V +++R  ++D+RE +YE +GI S+YLFR+D   ++DA+  G +ARFINHSCNPNCYTKV
Sbjct: 332 VSQVVRPSVTDLRETKYEAIGISSTYLFRIDMETIIDASTCGNLARFINHSCNPNCYTKV 391

Query: 887 ISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           I++E +KKI IY+K+ I   +EITY+YK+
Sbjct: 392 ITIESEKKIVIYSKQPIGVNKEITYDYKW 420


>gi|344254290|gb|EGW10394.1| Histone-lysine N-methyltransferase MLL2 [Cricetulus griseus]
          Length = 1475

 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 1334 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 1392

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 1393 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 1452

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 1453 DFEDDQHKIPCHCGAWNCRKWMN 1475


>gi|37590100|gb|AAH58659.1| Mll2 protein [Mus musculus]
          Length = 1250

 Score =  118 bits (295), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 1109 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 1167

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 1168 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 1227

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 1228 DFEDDQHKIPCHCGAWNCRKWMN 1250


>gi|402577843|gb|EJW71799.1| hypothetical protein WUBG_17294 [Wuchereria bancrofti]
          Length = 122

 Score =  117 bits (293), Expect = 3e-23,   Method: Composition-based stats.
 Identities = 57/119 (47%), Positives = 81/119 (68%), Gaps = 6/119 (5%)

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           +IEY GE+IR ++ ++RE +YE    G  Y+FR+DD  ++DAT  GG AR+INHSC+PNC
Sbjct: 5   IIEYKGEVIRSEVGEMREKKYEAQNRGV-YMFRIDDERLIDATMAGGPARYINHSCDPNC 63

Query: 883 YTKVIS---VEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
            T+++        KKI I A R I+AGEE+TY+Y+F +E+   KIPC CG+  C   +N
Sbjct: 64  STRLVDSGPCGDDKKIIIIANRPISAGEELTYDYQFDIEDAADKIPCLCGAPNCQKWMN 122


>gi|156088645|ref|XP_001611729.1| SET domain containing protein [Babesia bovis]
 gi|154798983|gb|EDO08161.1| SET domain containing protein [Babesia bovis]
          Length = 866

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 66/165 (40%), Positives = 99/165 (60%), Gaps = 12/165 (7%)

Query: 781 AEGAELLKASQVK---ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISD 837
           AE  +LL  +  K   A+ KR+   +S++H +GL A++ I   D ++EY G +I   ++D
Sbjct: 697 AEQRQLLGLTNKKPFMAKCKRVIIGKSRVHGYGLFAVDTINKGDLIMEYAGVVISDYMAD 756

Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIF- 896
           +RE  Y+++  GS Y+FRLD   ++D+T  G  ARFINHSC+PN  T   S +  + +F 
Sbjct: 757 MREVMYQRLVCGSIYMFRLDLNRIIDSTFYGNCARFINHSCDPNTATSNFS-DIDEDVFG 815

Query: 897 ----IYAKRHIAAGEEITYNYKFPL--EEKKIPCYCGSKKCHGSL 935
               +YA + I AGEEI YNY+  L  E  +I C CGS +C G +
Sbjct: 816 THVGVYASKVILAGEEIYYNYRLSLGSENPQI-CRCGSYQCTGYM 859


>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
          Length = 4785

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 63/167 (37%), Positives = 88/167 (52%), Gaps = 27/167 (16%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A   IE    VIEY+G +IR ++++ +E  YE    G  Y+FR
Sbjct: 4620 KSSVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFR 4678

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEE------- 908
            +D+ +V+DAT  GG AR+INHSC PNC  +V++ E   KI I + R I  GEE       
Sbjct: 4679 MDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEESFTNVIT 4738

Query: 909  -----------------ITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
                             + Y+YKF  E  + KIPC+CG+  C   +N
Sbjct: 4739 HHGSSRAGKLPLSAMGPLCYDYKFDFEDDQHKIPCHCGAVNCRKWMN 4785


>gi|302849081|ref|XP_002956071.1| histone H3 methyltransferase/Trithorax [Volvox carteri f.
            nagariensis]
 gi|300258576|gb|EFJ42811.1| histone H3 methyltransferase/Trithorax [Volvox carteri f.
            nagariensis]
          Length = 2029

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 61/147 (41%), Positives = 87/147 (59%), Gaps = 13/147 (8%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECR-YEKMGIGSSYLFR 855
            +RL   +S IH WG  A  P +  D +IEY GELIR  +SD+RE R Y K+    +Y+F 
Sbjct: 1729 QRLAAGKSAIHGWGAFAKVPHKRGDMLIEYAGELIRPSVSDVREKRMYNKLVGCGTYIFT 1788

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG------QKKIFIYAK-----RHIA 904
            L+D   +DAT+ G +A  +NHSC+PNCY++ I++           + I AK     R I 
Sbjct: 1789 LNDDQHIDATRAGNMAHLLNHSCDPNCYSRTITLTDPVTRATSDHVIITAKVCVLWRDIE 1848

Query: 905  AGEEITYNYKFPLEEKKIPCYCGSKKC 931
            A EE+TY+Y+F   E ++PC CG+  C
Sbjct: 1849 AWEELTYDYRFNSSE-ELPCNCGAATC 1874


>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 4301

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE    G  Y+FR
Sbjct: 4160 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYESQNRGI-YMFR 4218

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++  V+DAT  GG AR++NHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 4219 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4278

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4279 DFEDDQHKIPCHCGAWNCRKWMN 4301


>gi|242073096|ref|XP_002446484.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
 gi|241937667|gb|EES10812.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
          Length = 521

 Score =  116 bits (291), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 61/144 (42%), Positives = 84/144 (58%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K +  R R  R++   WGLVA E I A  FVIEY GE+I  K S  R   YE  G+  +Y
Sbjct: 81  KCQYARTRLVRTEGRGWGLVADENIMAGQFVIEYCGEVISWKESKRRAQAYETQGLKDAY 140

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DAT++G  ARFINHSC PNC T+  +V G+ ++ I+AK+ I  G E++Y+
Sbjct: 141 IIYLNADESIDATRKGNFARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPFGTELSYD 200

Query: 913 YKFPLEEK-KIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 201 YNFEWYGGVMVRCLCGAASCSGFL 224


>gi|148672216|gb|EDL04163.1| mCG147092 [Mus musculus]
          Length = 900

 Score =  116 bits (291), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 759 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 817

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 818 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 877

Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
             E  + KIPC+CG+  C   +N
Sbjct: 878 DFEDDQHKIPCHCGAWNCRKWMN 900


>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
          Length = 4823

 Score =  116 bits (291), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE    G  Y+FR
Sbjct: 4682 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYESQNRGI-YMFR 4740

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++  V+DAT  GG AR++NHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 4741 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4800

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4801 DFEDDQHKIPCHCGAWNCRKWMN 4823


>gi|440801495|gb|ELR22513.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 981

 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 77/128 (60%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGL A E I    F+IEYVGE+I   +   R   YE+MG+   Y   LD    +DA+++G
Sbjct: 420 WGLRAREKIPKGTFIIEYVGEVISTDMCQDRMKYYEEMGLEHYYFLTLDGSECIDASQKG 479

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSCNPN  T   +V+ + ++ I+A+  I  G+EIT++Y+F     KK  C+CG
Sbjct: 480 NLARFINHSCNPNAKTHKWTVDKEIRVGIFAEEDIPVGQEITFDYQFERFGGKKQKCFCG 539

Query: 928 SKKCHGSL 935
              C G L
Sbjct: 540 ETNCRGFL 547


>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Oreochromis
            niloticus]
          Length = 4907

 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE    G  Y+FR
Sbjct: 4766 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYEAQNRGI-YMFR 4824

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++  V+DAT  GG AR++NHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 4825 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4884

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4885 DFEDDQHKIPCHCGAWNCRKWMN 4907


>gi|357163489|ref|XP_003579748.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like
           [Brachypodium distachyon]
          Length = 517

 Score =  116 bits (290), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 59/144 (40%), Positives = 85/144 (59%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K +  R R  +++   WGL+A E I A  FVIEY GE+I  K +  R   YE  G+  +Y
Sbjct: 79  KCQYARTRLVKTEGRGWGLLAEENIMAGQFVIEYCGEVISWKEAKRRSQAYEDQGLMEAY 138

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DATK+G +ARFINHSC PNC T+  +V G+ ++ I+AK+ I  G E++Y+
Sbjct: 139 IIYLNTAESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGMELSYD 198

Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 199 YNFEWFGGAIVRCLCGAASCSGFL 222


>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
            niloticus]
          Length = 4872

 Score =  116 bits (290), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 69/184 (37%), Positives = 100/184 (54%), Gaps = 18/184 (9%)

Query: 766  SARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQ-----------RSKIHDWGLVAL 814
            SAR+N+    N+     GA   K   V  R  + R+            RS+I   GL A 
Sbjct: 4694 SARSNQ----NVAQGEGGAPHNKPPVVHPRSSQYRWMKSEWRANVYLGRSRIQGLGLFAA 4749

Query: 815  EPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFI 874
              IE +  VIEY G ++R +++ IR+ +  +    + ++FR+D  +VVDAT  GGIAR+I
Sbjct: 4750 RDIEKQTMVIEYNGTILRNEVA-IRKEKIYRSKNRAVFMFRIDSEHVVDATLNGGIARYI 4808

Query: 875  NHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCH 932
            NHSC PNC  +V++ E   KI I   R I  GEE+ +NY+F   E   KI C+CG+ +C 
Sbjct: 4809 NHSCAPNCVAEVVTFERGYKIIISCIRRIEKGEELCFNYQFESVEGQHKIACHCGAPECR 4868

Query: 933  GSLN 936
              +N
Sbjct: 4869 KWIN 4872


>gi|410899461|ref|XP_003963215.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Takifugu
            rubripes]
          Length = 3715

 Score =  116 bits (290), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE    G  Y+FR
Sbjct: 3574 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYESQNRGI-YMFR 3632

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++  V+DAT  GG AR++NHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 3633 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 3692

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 3693 DFEDDQHKIPCHCGAWNCRKWMN 3715


>gi|238013758|gb|ACR37914.1| unknown [Zea mays]
          Length = 252

 Score =  116 bits (290), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 53/141 (37%), Positives = 86/141 (60%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           + K+++  ++K   WG VALEP+E  DFVIEYVGE+I     + R     + G  + Y+ 
Sbjct: 20  KDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIRRRGDKNFYMC 79

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G ++RF+NHSC PNC  +   V+G+ ++ ++A R I  GE +TY+Y+
Sbjct: 80  EISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIEVGEPLTYDYR 139

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F    +K+ C+C +  C G L
Sbjct: 140 FVHFGEKVKCHCEAVNCQGYL 160


>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
          Length = 4897

 Score =  115 bits (289), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 60/143 (41%), Positives = 90/143 (62%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE    G  Y+FR
Sbjct: 4756 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYELQNRGI-YMFR 4814

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++  V+DAT  GG AR++NHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 4815 INNEQVIDATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4874

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4875 DFEDDQHKIPCHCGAWNCRKWMN 4897


>gi|255575537|ref|XP_002528669.1| set domain protein, putative [Ricinus communis]
 gi|223531892|gb|EEF33708.1| set domain protein, putative [Ricinus communis]
          Length = 495

 Score =  115 bits (289), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 56/139 (40%), Positives = 86/139 (61%), Gaps = 1/139 (0%)

Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
           + R  +++   WGL+A E I+A  F+IEY GE+I  K +  R   YE+ G+  +++  L+
Sbjct: 84  KTRLFKTEGRGWGLLADEDIKAGQFIIEYCGEVISWKEAKRRSQAYERQGLKDAFIISLN 143

Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP- 916
               +DAT++G +ARFINHSC PNC T+  +V G+ ++ I+AK+ I+ G E+ Y+Y F  
Sbjct: 144 SSESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFAKQDISIGTELAYDYNFEW 203

Query: 917 LEEKKIPCYCGSKKCHGSL 935
               K+ C CGS  C G L
Sbjct: 204 YGGAKVRCLCGSASCSGFL 222


>gi|168027908|ref|XP_001766471.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
            patens]
 gi|162682380|gb|EDQ68799.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
            patens]
          Length = 2933

 Score =  115 bits (289), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 85/157 (54%), Gaps = 31/157 (19%)

Query: 797  KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEK----MGIGSSY 852
            KRL   +S IH  GL   + I   + V+EYVGE++  +++D RE  Y         G+ Y
Sbjct: 2668 KRLAVYKSGIHALGLYTTDFIAEGEVVVEYVGEIVGSRVADKREAEYHSGKRLQYQGACY 2727

Query: 853  LFRLDDGYVVDATKRGGIARFINHSCN---------------------------PNCYTK 885
            LFR+D   ++DAT++GGIARF+NHSC+                           PNC  K
Sbjct: 2728 LFRIDTEQIIDATRKGGIARFVNHSCSLCMCGILEIANTSGLLGYGYLIAWNSWPNCVAK 2787

Query: 886  VISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI 922
            VI VE  KK+  +AKR+I A EE+TY+YKF  +E  I
Sbjct: 2788 VICVENLKKVVFFAKRNIYAAEEVTYDYKFNCDEHII 2824


>gi|355702682|gb|AES02013.1| myeloid/lymphoid or mixed-lineage leukemia 2 [Mustela putorius furo]
          Length = 1014

 Score =  115 bits (289), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 90/138 (65%), Gaps = 3/138 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 874  KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 932

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 933  INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 992

Query: 916  PLE--EKKIPCYCGSKKC 931
              E  + KIPC+CG+  C
Sbjct: 993  DFEDDQHKIPCHCGAWNC 1010


>gi|255574954|ref|XP_002528383.1| set domain protein, putative [Ricinus communis]
 gi|223532171|gb|EEF33976.1| set domain protein, putative [Ricinus communis]
          Length = 500

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 88/141 (62%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           ++K+++  +++   WG+  +EPI   DF+IEY+GE+I   + + R    +  G+ + Y+ 
Sbjct: 328 KEKKIKIVKTEFCGWGVETVEPINKGDFIIEYIGEVIDDAVCEQRLWDMKYKGVQNFYMC 387

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSC+PNC  +   VEG+ ++ ++A R I  GE +TY+Y+
Sbjct: 388 EIRKDFTIDATFKGNSSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKVGEPLTYDYR 447

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F     ++ C+CG+  CHG L
Sbjct: 448 FVQFGPEVRCHCGAPNCHGYL 468


>gi|224067978|ref|XP_002302628.1| SET domain protein [Populus trichocarpa]
 gi|222844354|gb|EEE81901.1| SET domain protein [Populus trichocarpa]
          Length = 667

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/172 (36%), Positives = 93/172 (54%), Gaps = 28/172 (16%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE-------- 844
           K    R+ F +S IH WGL A   I+  + VIEY GE +R  ++D+RE RY         
Sbjct: 496 KTENHRVCFGKSGIHGWGLFARRNIQEGEMVIEYRGEKVRRSVADLREARYRLEGKDCYV 555

Query: 845 ----------------KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVIS 888
                            MG  +  LF++ +  V+DAT +G IAR INHSC PNCY +++S
Sbjct: 556 HHKGFESQDLLSGIEGAMGQLNLPLFKISEEVVIDATNKGNIARLINHSCMPNCYARIMS 615

Query: 889 V-EGQKKIFIYAKRHIAAGEEITYNYKFPLEEK---KIPCYCGSKKCHGSLN 936
           V + + +I + AK  ++AG E+TY+Y F  +E+   K+PC C +  C   +N
Sbjct: 616 VGDVENRIVLIAKTDVSAGNELTYDYLFDPDERDDLKVPCLCKAPNCRKFMN 667


>gi|297842431|ref|XP_002889097.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334938|gb|EFH65356.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 525

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 57/128 (44%), Positives = 79/128 (61%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGLVALE I+A  F+IEY GE+I  K +  R   YE  G+  +Y+  L+    +DATK+G
Sbjct: 101 WGLVALEDIKAGQFIIEYCGEVISWKEAKRRAQTYETHGVKDAYIISLNASEAIDATKKG 160

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+AK  I+   E+ Y+Y F      K+ C CG
Sbjct: 161 SLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEWYGGAKVRCLCG 220

Query: 928 SKKCHGSL 935
           +  C G L
Sbjct: 221 AVACSGFL 228


>gi|212721872|ref|NP_001132822.1| uncharacterized protein LOC100194312 [Zea mays]
 gi|194695492|gb|ACF81830.1| unknown [Zea mays]
 gi|413923000|gb|AFW62932.1| putative SET-domain containing protein family [Zea mays]
          Length = 418

 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 53/135 (39%), Positives = 81/135 (60%)

Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYV 861
           Q +K   WG VALEP+E  DFVIEYVGE+I     + R     + G  + Y+  +   + 
Sbjct: 193 QHTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIRRRGDKNFYMCEISKDFT 252

Query: 862 VDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 921
           +DAT +G ++RF+NHSC PNC  +   V+G+ ++ ++A R I  GE +TY+Y+F    +K
Sbjct: 253 IDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIEVGEPLTYDYRFVHFGEK 312

Query: 922 IPCYCGSKKCHGSLN 936
           + C+C +  C G L 
Sbjct: 313 VKCHCEAVNCQGYLG 327


>gi|2980780|emb|CAA18207.1| putative protein [Arabidopsis thaliana]
 gi|7269987|emb|CAB79804.1| putative protein [Arabidopsis thaliana]
          Length = 477

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 86/141 (60%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           ++K+++  +++   WG+ A E I  EDF++EY+GE+I     + R    +  G+   Y+ 
Sbjct: 304 KEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQRLWDMKHKGMKDFYMC 363

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSCNPNC  +   VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 364 EIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYR 423

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F     ++ C CGS+ C G L
Sbjct: 424 FVQFGPEVKCNCGSENCQGYL 444


>gi|22330671|ref|NP_177797.2| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
 gi|42572135|ref|NP_974158.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
 gi|75243465|sp|Q84WW6.1|ASHH1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH1; AltName:
           Full=ASH1 homolog 1; AltName: Full=Protein SET DOMAIN
           GROUP 26
 gi|25054844|gb|AAN71912.1| unknown protein [Arabidopsis thaliana]
 gi|225898088|dbj|BAH30376.1| hypothetical protein [Arabidopsis thaliana]
 gi|332197758|gb|AEE35879.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
 gi|332197759|gb|AEE35880.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
          Length = 492

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGLVALE I+A  F++EY GE+I  K +  R   YE  G+  +Y+  L+    +DATK+G
Sbjct: 101 WGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKDAYIISLNASEAIDATKKG 160

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+AK  I+   E+ Y+Y F      K+ C CG
Sbjct: 161 SLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEWYGGAKVRCLCG 220

Query: 928 SKKCHGSL 935
           +  C G L
Sbjct: 221 AVACSGFL 228


>gi|18417683|ref|NP_567859.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
 gi|75164864|sp|Q949T8.1|ASHR3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR3; AltName:
           Full=ASH1-related protein 3; AltName: Full=Protein SET
           DOMAIN GROUP 4; AltName: Full=Protein stamen loss
 gi|15292921|gb|AAK92831.1| unknown protein [Arabidopsis thaliana]
 gi|20465681|gb|AAM20309.1| unknown protein [Arabidopsis thaliana]
 gi|56201422|dbj|BAD72877.1| stamen loss [Arabidopsis thaliana]
 gi|332660421|gb|AEE85821.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
          Length = 497

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 86/141 (60%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           ++K+++  +++   WG+ A E I  EDF++EY+GE+I     + R    +  G+   Y+ 
Sbjct: 324 KEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQRLWDMKHKGMKDFYMC 383

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSCNPNC  +   VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 384 EIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYR 443

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F     ++ C CGS+ C G L
Sbjct: 444 FVQFGPEVKCNCGSENCQGYL 464


>gi|34782989|gb|AAH39197.1| MLL2 protein, partial [Homo sapiens]
          Length = 395

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 254 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 312

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 313 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 372

Query: 916 PLE--EKKIPCYCGSKKCHGSLN 936
             E  + KIPC+CG+  C   +N
Sbjct: 373 DFEDDQHKIPCHCGAWNCRKWMN 395


>gi|297602734|ref|NP_001052807.2| Os04g0429100 [Oryza sativa Japonica Group]
 gi|255675465|dbj|BAF14721.2| Os04g0429100, partial [Oryza sativa Japonica Group]
          Length = 612

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K++    R  +++   WGL+A E I A  FV+EY GE+I  K +  R   YE  G+  +Y
Sbjct: 176 KSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRRSQAYENQGLTDAY 235

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DATK+G +ARFINHSC PNC T+  +V G+ ++ I+AK+ I  G E++Y+
Sbjct: 236 IIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGTELSYD 295

Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 296 YNFEWFGGAMVRCLCGAGSCSGFL 319


>gi|6143888|gb|AAF04434.1|AC010718_3 unknown protein; 29143-26659 [Arabidopsis thaliana]
          Length = 528

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGLVALE I+A  F++EY GE+I  K +  R   YE  G+  +Y+  L+    +DATK+G
Sbjct: 101 WGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKDAYIISLNASEAIDATKKG 160

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+AK  I+   E+ Y+Y F      K+ C CG
Sbjct: 161 SLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEWYGGAKVRCLCG 220

Query: 928 SKKCHGSL 935
           +  C G L
Sbjct: 221 AVACSGFL 228


>gi|345792161|ref|XP_543684.3| PREDICTED: histone-lysine N-methyltransferase MLL2 [Canis lupus
            familiaris]
          Length = 5552

 Score =  114 bits (285), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5411 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5469

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5470 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5529

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5530 DFEDDQHKIPCHCGAWNCRKWMN 5552


>gi|281343718|gb|EFB19302.1| hypothetical protein PANDA_017001 [Ailuropoda melanoleuca]
          Length = 4932

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 4791 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 4849

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 4850 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4909

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4910 DFEDDQHKIPCHCGAWNCRKWMN 4932


>gi|15488418|gb|AAL01110.1|AF408059_1 ASH1-like protein 1 [Arabidopsis thaliana]
          Length = 229

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGLVALE I+A  F++EY GE+I  K +  R   YE  G+  +Y+  L+    +DATK+G
Sbjct: 19  WGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKDAYIISLNASEAIDATKKG 78

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+AK  I+   E+ Y+Y F      K+ C CG
Sbjct: 79  SLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELAYDYNFEWYGGAKVRCLCG 138

Query: 928 SKKCHGSL 935
           +  C G L
Sbjct: 139 AVACSGFL 146


>gi|410964289|ref|XP_003988688.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Felis catus]
          Length = 5559

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5418 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5476

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5477 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5536

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5537 DFEDDQHKIPCHCGAWNCRKWMN 5559


>gi|332206905|ref|XP_003252537.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Nomascus leucogenys]
          Length = 5407

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5266 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5324

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5325 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5384

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5385 DFEDDQHKIPCHCGAWNCRKWMN 5407


>gi|224082490|ref|XP_002306713.1| SET domain protein [Populus trichocarpa]
 gi|222856162|gb|EEE93709.1| SET domain protein [Populus trichocarpa]
          Length = 495

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 63/169 (37%), Positives = 95/169 (56%), Gaps = 9/169 (5%)

Query: 776 NLLAAAEGAELLKASQVKARKKRL-RFQRSKIH-------DWGLVALEPIEAEDFVIEYV 827
           NLL + E         V  + +R  +F+ +K          WGL+A E I+A  F+IEY 
Sbjct: 54  NLLTSTECTPGYCPCGVYCKNQRFQKFEYAKTQLFKTEGRGWGLLADEEIKAGQFIIEYC 113

Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
           GE+I  K +  R   YE  G+  +++  L+    +DATK+G +ARFINHSC PNC T+  
Sbjct: 114 GEVISWKEAKKRSQVYENQGLKDAFIISLNSTESIDATKKGSLARFINHSCQPNCETRKW 173

Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCGSKKCHGSL 935
           +V G+ ++ I+AK++I+ G E+ Y+Y F      K+ C CG+  C G L
Sbjct: 174 TVLGEIRVGIFAKQNISIGTELAYDYNFEWYGGAKVRCLCGAVNCSGFL 222


>gi|301783643|ref|XP_002927255.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Ailuropoda
            melanoleuca]
          Length = 5483

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5342 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5400

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5401 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5460

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5461 DFEDDQHKIPCHCGAWNCRKWMN 5483


>gi|170047578|ref|XP_001851293.1| Setd1a protein [Culex quinquefasciatus]
 gi|167869969|gb|EDS33352.1| Setd1a protein [Culex quinquefasciatus]
          Length = 373

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/96 (53%), Positives = 71/96 (73%), Gaps = 6/96 (6%)

Query: 830 LIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISV 889
           ++R  ++D+RE +YE +GI S+YLFR+D   ++DA+  G +ARFINHSCNPNCYTKVI++
Sbjct: 284 VVRPSVTDLREIKYEAIGISSTYLFRIDMETIIDASNYGNLARFINHSCNPNCYTKVITI 343

Query: 890 EGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCY 925
           E +KKI       I   +EITY+YK PLE++KI  Y
Sbjct: 344 ELEKKI------PIGVNKEITYDYKVPLEDEKISFY 373


>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
            harrisii]
          Length = 5047

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 4906 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 4964

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 4965 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5024

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5025 DFEDDQHKIPCHCGAWNCRKWMN 5047


>gi|390467630|ref|XP_002807137.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Callithrix jacchus]
          Length = 5289

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5148 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5206

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5207 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5266

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5267 DFEDDQHKIPCHCGAWNCRKWMN 5289


>gi|57997471|emb|CAE45854.2| hypothetical protein [Homo sapiens]
          Length = 116

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 54/116 (46%), Positives = 77/116 (66%), Gaps = 3/116 (2%)

Query: 823 VIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNC 882
           VIEY+G +IR ++++ +E  YE    G  Y+FR+D+ +V+DAT  GG AR+INHSC PNC
Sbjct: 2   VIEYIGTIIRNEVANRKEKLYESQNRGV-YMFRMDNDHVIDATLTGGPARYINHSCAPNC 60

Query: 883 YTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
             +V++ E   KI I + R I  GEE+ Y+YKF  E  + KIPC+CG+  C   +N
Sbjct: 61  VAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCHCGAVNCRKWMN 116


>gi|319764851|ref|YP_004128788.1| nuclear protein set [Alicycliphilus denitrificans BC]
 gi|330827056|ref|YP_004390359.1| nuclear protein SET [Alicycliphilus denitrificans K601]
 gi|317119412|gb|ADV01901.1| nuclear protein SET [Alicycliphilus denitrificans BC]
 gi|329312428|gb|AEB86843.1| nuclear protein SET [Alicycliphilus denitrificans K601]
          Length = 183

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 60/149 (40%), Positives = 89/149 (59%), Gaps = 12/149 (8%)

Query: 794 ARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYL 853
           A  +R++ +RS +H  G+ AL+PI A + VIEY GE+I  K   +R   ++      ++ 
Sbjct: 13  APGRRIQTRRSGVHGKGVFALQPIAAGEVVIEYTGEVISWK-EALRRHPHDPAQPNHTFY 71

Query: 854 FRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY 913
           F +D+  V+DA   G  AR+INHSCNPNCY      E   +IFI A R+I AGEE+ Y+Y
Sbjct: 72  FHVDEDRVIDAKFGGNSARWINHSCNPNCYAD----ERHGRIFITALRNIEAGEELNYDY 127

Query: 914 KFPLEEK-------KIPCYCGSKKCHGSL 935
              ++E+       + PC+CGS+ C G+L
Sbjct: 128 GLIIDERYTPKLKAEYPCWCGSRNCRGTL 156


>gi|225445763|ref|XP_002272781.1| PREDICTED: histone-lysine N-methyltransferase ASHR3 [Vitis
           vinifera]
 gi|297743704|emb|CBI36587.3| unnamed protein product [Vitis vinifera]
          Length = 494

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 88/141 (62%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           ++K+++  ++++  WG+ A E I   DFVIEY+GE+I   + + R    +  G  + Y+ 
Sbjct: 326 KEKKIKIVKTELCGWGVDAAESINKGDFVIEYIGEVIDDALCERRLWDMKDRGDQNFYMC 385

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSC+PNC  +   VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 386 EIRKDFTIDATFKGNASRFLNHSCDPNCKLEKWQVEGETRVGVFAARSIKAGEPLTYDYR 445

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F     ++ C+CG+  CHG L
Sbjct: 446 FVRFGPEVKCHCGAPSCHGYL 466


>gi|348580193|ref|XP_003475863.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Cavia
            porcellus]
          Length = 5577

 Score =  114 bits (285), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5436 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5494

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5495 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5554

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5555 DFEDDQHKIPCHCGAWNCRKWMN 5577


>gi|426226681|ref|XP_004007467.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Ovis aries]
          Length = 5387

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5246 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5304

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5305 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5364

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5365 DFEDDQHKIPCHCGAWNCRKWMN 5387


>gi|119578438|gb|EAW58034.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_a [Homo
            sapiens]
          Length = 4539

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 4398 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 4456

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 4457 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4516

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 4517 DFEDDQHKIPCHCGAWNCRKWMN 4539


>gi|395744200|ref|XP_002823221.2| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 3 [Pongo
            abelii]
          Length = 5293

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5152 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5210

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5211 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5270

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5271 DFEDDQHKIPCHCGAWNCRKWMN 5293


>gi|414587222|tpg|DAA37793.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
          Length = 489

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K +  R R  R+    WGLVA E I A  FVIEY GE+I  K +  R   YE   +  +Y
Sbjct: 57  KCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDAY 116

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DAT++G +ARFINHSC PNC T+  +V G+ ++ I+AK++I  G E++Y+
Sbjct: 117 IIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSYD 176

Query: 913 YKFPLEEK-KIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 177 YNFEWYGGVMVRCLCGAASCSGFL 200


>gi|402885854|ref|XP_003919662.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Papio anubis]
          Length = 5547

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5406 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5464

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5465 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5524

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5525 DFEDDQHKIPCHCGAWNCRKWMN 5547


>gi|355564192|gb|EHH20692.1| hypothetical protein EGK_03605 [Macaca mulatta]
          Length = 5538

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5397 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5455

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5456 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5515

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5516 DFEDDQHKIPCHCGAWNCRKWMN 5538


>gi|194666944|ref|XP_583302.4| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
 gi|297474553|ref|XP_002687353.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
 gi|296487853|tpg|DAA29966.1| TPA: myeloid/lymphoid or mixed-lineage leukemia 2-like [Bos taurus]
          Length = 5503

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5362 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5420

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5421 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5480

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5481 DFEDDQHKIPCHCGAWNCRKWMN 5503


>gi|426372409|ref|XP_004053116.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 2 [Gorilla
            gorilla gorilla]
          Length = 5284

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5143 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5201

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5202 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5261

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5262 DFEDDQHKIPCHCGAWNCRKWMN 5284


>gi|242065740|ref|XP_002454159.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
 gi|241933990|gb|EES07135.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
          Length = 328

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 53/141 (37%), Positives = 85/141 (60%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           + K+++  ++K   WG VALEP+E  DFVIEYVGE+I     + R       G  + Y+ 
Sbjct: 96  KDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIRYRGDKNFYMC 155

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G ++RF+NHSC PNC  +   V+G+ ++ ++A R I  GE +TY+Y+
Sbjct: 156 EISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIKVGEPLTYDYR 215

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F    +K+ C+C +  C G L
Sbjct: 216 FVHFGEKVKCHCEAVNCQGYL 236


>gi|426372407|ref|XP_004053115.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Gorilla
            gorilla gorilla]
          Length = 5550

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5409 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5467

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5468 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5527

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5528 DFEDDQHKIPCHCGAWNCRKWMN 5550


>gi|395841650|ref|XP_003793647.1| PREDICTED: uncharacterized protein LOC100944849 [Otolemur garnettii]
          Length = 5488

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5347 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5405

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5406 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5465

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5466 DFEDDQHKIPCHCGAWNCRKWMN 5488


>gi|297691727|ref|XP_002823219.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Pongo
            abelii]
          Length = 5559

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5418 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5476

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5477 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5536

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5537 DFEDDQHKIPCHCGAWNCRKWMN 5559


>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
            sapiens]
          Length = 5265

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5124 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5182

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5183 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5242

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5243 DFEDDQHKIPCHCGAWNCRKWMN 5265


>gi|403297007|ref|XP_003939383.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Saimiri
            boliviensis boliviensis]
          Length = 5498

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5357 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5415

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5416 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5475

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5476 DFEDDQHKIPCHCGAWNCRKWMN 5498


>gi|313471390|sp|Q6PDK2.2|MLL2_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
            Full=Lysine N-methyltransferase 2D; Short=KMT2D
          Length = 5588

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5447 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5505

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5506 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5565

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5566 DFEDDQHKIPCHCGAWNCRKWMN 5588


>gi|359718904|ref|NP_001028448.3| histone-lysine N-methyltransferase MLL2 [Mus musculus]
          Length = 5588

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5447 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5505

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5506 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5565

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5566 DFEDDQHKIPCHCGAWNCRKWMN 5588


>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
 gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
            Full=ALL1-related protein; AltName: Full=Lysine
            N-methyltransferase 2D; Short=KMT2D; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 2
 gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
            sapiens]
          Length = 5537

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5396 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5454

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5455 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5514

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5515 DFEDDQHKIPCHCGAWNCRKWMN 5537


>gi|397510996|ref|XP_003846168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Pan paniscus]
          Length = 5373

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5232 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5290

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5291 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5350

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5351 DFEDDQHKIPCHCGAWNCRKWMN 5373


>gi|392355921|ref|XP_002729900.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Rattus
            norvegicus]
          Length = 5543

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5402 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5460

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5461 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5520

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5521 DFEDDQHKIPCHCGAWNCRKWMN 5543


>gi|392341685|ref|XP_001062568.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Rattus norvegicus]
          Length = 5543

 Score =  114 bits (284), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 61/143 (42%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5402 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5460

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5461 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5520

Query: 916  PLE--EKKIPCYCGSKKCHGSLN 936
              E  + KIPC+CG+  C   +N
Sbjct: 5521 DFEDDQHKIPCHCGAWNCRKWMN 5543


>gi|397580684|gb|EJK51670.1| hypothetical protein THAOC_29137 [Thalassiosira oceanica]
          Length = 1171

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 83/139 (59%), Gaps = 8/139 (5%)

Query: 804  SKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVD 863
            S++H  GL A   I+    V EY+GE +  + +D R+  ++  G G  Y FR+    V+D
Sbjct: 1022 SEVHGRGLYADCKIDKGQIVAEYIGEYVTQEAADRRD--HKGGGEGQDYQFRIAASLVLD 1079

Query: 864  ATKRGGIARFINHSCNPNCYTKVISVEGQ---KKIFIYAKRHIAAGEEITYNYKFPLEEK 920
            AT +GG AR+INHSC+PNC   ++  E     K++ I +KR I A EE+TY+Y+F LE  
Sbjct: 1080 ATLKGGCARYINHSCDPNCIANIVEGEQHTHLKRVLIISKRDIDANEELTYDYQFALETN 1139

Query: 921  ---KIPCYCGSKKCHGSLN 936
               ++PC CGS  C G LN
Sbjct: 1140 LGLRVPCNCGSACCRGFLN 1158


>gi|297802948|ref|XP_002869358.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315194|gb|EFH45617.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 497

 Score =  114 bits (284), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 86/141 (60%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           ++K++R  +++   WG+ A E I  EDF++EY+GE+I     + R    +  G+   Y+ 
Sbjct: 324 KEKKIRIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQRLWDMKHKGMKDFYMC 383

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSC+PNC  +   VEG+ ++ ++A R I AGE +TY+Y+
Sbjct: 384 EIQKDFTIDATFKGNASRFLNHSCSPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYR 443

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F     ++ C CGS+ C G L
Sbjct: 444 FVQFGPEVKCNCGSESCQGYL 464


>gi|414587221|tpg|DAA37792.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
          Length = 503

 Score =  114 bits (284), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K +  R R  R+    WGLVA E I A  FVIEY GE+I  K +  R   YE   +  +Y
Sbjct: 71  KCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDAY 130

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DAT++G +ARFINHSC PNC T+  +V G+ ++ I+AK++I  G E++Y+
Sbjct: 131 IIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSYD 190

Query: 913 YKFPLEEK-KIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 191 YNFEWYGGVMVRCLCGAASCSGFL 214


>gi|162463380|ref|NP_001105665.1| SET domain-containing protein SET102 [Zea mays]
 gi|22121720|gb|AAM89289.1| SET domain-containing protein SET102 [Zea mays]
 gi|414587223|tpg|DAA37794.1| TPA: SET domain-containing protein SET102 [Zea mays]
          Length = 513

 Score =  113 bits (283), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 59/144 (40%), Positives = 84/144 (58%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K +  R R  R+    WGLVA E I A  FVIEY GE+I  K +  R   YE   +  +Y
Sbjct: 81  KCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDAY 140

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DAT++G +ARFINHSC PNC T+  +V G+ ++ I+AK++I  G E++Y+
Sbjct: 141 IIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSYD 200

Query: 913 YKFPLEEK-KIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 201 YNFEWYGGVMVRCLCGAASCSGFL 224


>gi|413955941|gb|AFW88590.1| putative SET-domain containing protein family [Zea mays]
          Length = 730

 Score =  113 bits (283), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 78/258 (30%), Positives = 123/258 (47%), Gaps = 32/258 (12%)

Query: 662 RKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISG 721
           RKRT DG + +   +           A +Q+  K    +   T   C +  GC++S    
Sbjct: 469 RKRTDDGKQCYTQYS---------PCACQQMCGKDCPCADKGT--CCEKYCGCSKSC--- 514

Query: 722 WEWHKWSLNASPAERARVRGAQ---YVHTKYLGPEVNASQWAN-GKGLSARTNRVKLRNL 777
               K         +++ R  Q   +  ++   P+V  + W + G G         L   
Sbjct: 515 ----KNKFRGCHCAKSQCRSRQCPCFAASRECDPDVCRNCWVSCGDG--------SLGEP 562

Query: 778 LAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISD 837
           LA  +G +      +  +++R+   RS +  WG     P+   D++ EY GELI  K +D
Sbjct: 563 LARGDGYQCGNMKLLLKQQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEAD 622

Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFI 897
            R   Y++    SS+LF L+D YV+DA ++G   +F NHS NPNCY KV+ V G  ++ I
Sbjct: 623 KRGKIYDRAN--SSFLFDLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGI 680

Query: 898 YAKRHIAAGEEITYNYKF 915
           YAK HI A EE+ Y+Y++
Sbjct: 681 YAKEHIEASEELFYDYRY 698


>gi|356576073|ref|XP_003556159.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
           max]
          Length = 480

 Score =  113 bits (283), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGL+A E I+A  FVIEY GE+I  K +  R   YE  G+  +++  L+    +DAT++G
Sbjct: 95  WGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYENQGLKDAFIICLNASESIDATRKG 154

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+AK  I  G E+ Y+Y F      K+ C CG
Sbjct: 155 SLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIPIGNELAYDYNFEWFGGAKVRCLCG 214

Query: 928 SKKCHGSL 935
           + KC G L
Sbjct: 215 ALKCSGFL 222


>gi|222628880|gb|EEE61012.1| hypothetical protein OsJ_14832 [Oryza sativa Japonica Group]
          Length = 518

 Score =  113 bits (283), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K++    R  +++   WGL+A E I A  FV+EY GE+I  K +  R   YE  G+  +Y
Sbjct: 82  KSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRRSQAYENQGLTDAY 141

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DATK+G +ARFINHSC PNC T+  +V G+ ++ I+AK+ I  G E++Y+
Sbjct: 142 IIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGTELSYD 201

Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 202 YNFEWFGGAMVRCLCGAGSCSGFL 225


>gi|224143229|ref|XP_002324887.1| SET domain protein [Populus trichocarpa]
 gi|222866321|gb|EEF03452.1| SET domain protein [Populus trichocarpa]
          Length = 87

 Score =  113 bits (283), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 50/87 (57%), Positives = 64/87 (73%), Gaps = 2/87 (2%)

Query: 852 YLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITY 911
           Y FR+D  +++DAT++GGIARF+NHSC PNC  KVISV  +KK+  +A+R I  GEEITY
Sbjct: 1   YFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDIYPGEEITY 60

Query: 912 NYKFPLEE--KKIPCYCGSKKCHGSLN 936
           +Y F  E+  KKIPC+C SK C   LN
Sbjct: 61  DYHFNNEDEGKKIPCFCNSKNCRRYLN 87


>gi|242044114|ref|XP_002459928.1| hypothetical protein SORBIDRAFT_02g016820 [Sorghum bicolor]
 gi|241923305|gb|EER96449.1| hypothetical protein SORBIDRAFT_02g016820 [Sorghum bicolor]
          Length = 166

 Score =  113 bits (282), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 48/98 (48%), Positives = 69/98 (70%), Gaps = 1/98 (1%)

Query: 839 RECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIY 898
           R+  +  MG G+ Y+FR+DD  V+DAT+ G IA  INHSC PNCY++ I++ G + I I+
Sbjct: 47  RQLLFPLMGAGT-YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRAITILGDEHIIIF 105

Query: 899 AKRHIAAGEEITYNYKFPLEEKKIPCYCGSKKCHGSLN 936
           AKR I   EE+TY+Y+F   ++++PCYCG  KC G +N
Sbjct: 106 AKRDIDPWEELTYDYRFFSSDQRLPCYCGFPKCRGVVN 143


>gi|116309320|emb|CAH66406.1| OSIGBa0093L02.2 [Oryza sativa Indica Group]
          Length = 519

 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K++    R  +++   WGL+A E I A  FV+EY GE+I  K +  R   YE  G+  +Y
Sbjct: 83  KSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRRSQAYENQGLTDAY 142

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DATK+G +ARFINHSC PNC T+  +V G+ ++ I+AK+ I  G E++Y+
Sbjct: 143 IIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGTELSYD 202

Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 203 YNFEWFGGAMVRCLCGAGSCSGFL 226


>gi|428167957|gb|EKX36908.1| hypothetical protein GUITHDRAFT_78620, partial [Guillardia theta
           CCMP2712]
          Length = 133

 Score =  113 bits (282), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 59/130 (45%), Positives = 80/130 (61%), Gaps = 12/130 (9%)

Query: 768 RTNRVKLRN--LLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIE 825
           R NR   R    + ++  ++  +  Q+K+R KRL F++S IH WGL A E IE E+ VIE
Sbjct: 14  RANRRATRKQATVTSSISSDAFRFDQLKSRVKRLSFRKSAIHGWGLYADEVIEPEEMVIE 73

Query: 826 YVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTK 885
           Y GE          E  YE+ G+GSSY+FR+    V+DAT  G  AR+INHSC PNCYT+
Sbjct: 74  YKGE----------EKMYERSGMGSSYMFRVSKSQVIDATHCGSRARYINHSCEPNCYTR 123

Query: 886 VISVEGQKKI 895
           +I V G+ K+
Sbjct: 124 IIHVGGKPKV 133


>gi|356535766|ref|XP_003536414.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
           max]
 gi|34529091|dbj|BAC85636.1| unnamed protein product [Homo sapiens]
          Length = 480

 Score =  113 bits (282), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGL+A E I+A  FVIEY GE+I  K +  R   YE  G+  +++  L+    +DAT++G
Sbjct: 95  WGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYENQGLKDAFIIFLNVSESIDATRKG 154

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+AK  I  G E+ Y+Y F      K+ C CG
Sbjct: 155 SLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIPIGTELAYDYNFEWFGGAKVRCLCG 214

Query: 928 SKKCHGSL 935
           + KC G L
Sbjct: 215 ALKCSGFL 222


>gi|413955942|gb|AFW88591.1| putative SET-domain containing protein family [Zea mays]
          Length = 555

 Score =  113 bits (282), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 78/258 (30%), Positives = 123/258 (47%), Gaps = 32/258 (12%)

Query: 662 RKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISG 721
           RKRT DG + +   +           A +Q+  K    +   T   C +  GC++S    
Sbjct: 294 RKRTDDGKQCYTQYS---------PCACQQMCGKDCPCADKGT--CCEKYCGCSKSC--- 339

Query: 722 WEWHKWSLNASPAERARVRGAQ---YVHTKYLGPEVNASQWAN-GKGLSARTNRVKLRNL 777
               K         +++ R  Q   +  ++   P+V  + W + G G         L   
Sbjct: 340 ----KNKFRGCHCAKSQCRSRQCPCFAASRECDPDVCRNCWVSCGDG--------SLGEP 387

Query: 778 LAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISD 837
           LA  +G +      +  +++R+   RS +  WG     P+   D++ EY GELI  K +D
Sbjct: 388 LARGDGYQCGNMKLLLKQQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEAD 447

Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFI 897
            R   Y++    SS+LF L+D YV+DA ++G   +F NHS NPNCY KV+ V G  ++ I
Sbjct: 448 KRGKIYDRAN--SSFLFDLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGI 505

Query: 898 YAKRHIAAGEEITYNYKF 915
           YAK HI A EE+ Y+Y++
Sbjct: 506 YAKEHIEASEELFYDYRY 523


>gi|350583914|ref|XP_003481621.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Sus scrofa]
          Length = 5154

 Score =  113 bits (282), Expect = 6e-22,   Method: Composition-based stats.
 Identities = 60/138 (43%), Positives = 90/138 (65%), Gaps = 3/138 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5015 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5073

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5074 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5133

Query: 916  PLE--EKKIPCYCGSKKC 931
              E  + KIPC+CG+  C
Sbjct: 5134 DFEDDQHKIPCHCGAWNC 5151


>gi|302825340|ref|XP_002994293.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
 gi|300137824|gb|EFJ04637.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
          Length = 820

 Score =  112 bits (281), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 57/139 (41%), Positives = 84/139 (60%)

Query: 797 KRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL 856
           +RLR + ++   WGL A E IE  DF+IEY+GE+I  K  + R    ++ G  + YL  +
Sbjct: 176 RRLRLKETENCGWGLFADENIERGDFLIEYIGEVIDDKTCEERLWDLKERGENNFYLCEV 235

Query: 857 DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP 916
               V+DAT +G ++RFINHSCNPN   +    +G+ +I ++A   I  G+EITY+YK+ 
Sbjct: 236 GHDKVIDATFKGNMSRFINHSCNPNAQLRKWQCDGELRIGVFAVSRILKGQEITYDYKYI 295

Query: 917 LEEKKIPCYCGSKKCHGSL 935
               +  C+CGSK C G L
Sbjct: 296 QFGTEQQCHCGSKNCKGIL 314


>gi|224143888|ref|XP_002325110.1| SET domain protein [Populus trichocarpa]
 gi|222866544|gb|EEF03675.1| SET domain protein [Populus trichocarpa]
          Length = 516

 Score =  112 bits (281), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 86/141 (60%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           ++K+++  +++   WG+ A EP+   DF+IEY+GE+I  K+ + R    +  G+ + Y+ 
Sbjct: 346 KEKKIKIVKTEFCGWGVEAAEPLNKGDFIIEYIGEVIDDKLCEQRLWDMKYKGVQNFYMC 405

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            +   + +DAT +G  +RF+NHSC PNC  +   VEG+ ++ ++A   I  GE +TY+Y+
Sbjct: 406 EIRKDFTIDATFKGNSSRFLNHSCKPNCILEKWDVEGETRVGVFAAGSIRVGEPLTYDYR 465

Query: 915 FPLEEKKIPCYCGSKKCHGSL 935
           F     ++ CYCG+  C G L
Sbjct: 466 FVRFGPEVKCYCGAPNCQGYL 486


>gi|2358287|gb|AAC51735.1| ALR [Homo sapiens]
          Length = 4957

 Score =  112 bits (281), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 60/143 (41%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 4816 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 4874

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 4875 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 4934

Query: 916  PLEEKK--IPCYCGSKKCHGSLN 936
              E+ +  IPC+CG+  C   +N
Sbjct: 4935 DFEDDQHEIPCHCGAWNCRKWMN 4957


>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
          Length = 5262

 Score =  112 bits (281), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 60/143 (41%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            K  +   RS+I   GL A + +E    VIEY+G +IR ++++ RE  YE+   G  Y+FR
Sbjct: 5121 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFR 5179

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            +++ +V+DAT  GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+F
Sbjct: 5180 INNEHVIDATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQF 5239

Query: 916  PLEEKK--IPCYCGSKKCHGSLN 936
              E+ +  IPC+CG+  C   +N
Sbjct: 5240 DFEDDQHEIPCHCGAWNCRKWMN 5262


>gi|340378403|ref|XP_003387717.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like
           [Amphimedon queenslandica]
          Length = 862

 Score =  112 bits (281), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 55/134 (41%), Positives = 80/134 (59%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           ++++  WGL A   I    FV+EY GE+  L+  + R   YEK      Y   L    ++
Sbjct: 129 KTEMKGWGLKATCDISRYSFVMEYCGEVCSLEEFERRRNIYEKESRRHYYFMSLKTDEIL 188

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
           DAT++G ++RFINHSC PNC T+  +V G+ ++  +A RHI AGEE+T++Y+F    E  
Sbjct: 189 DATRKGNLSRFINHSCEPNCETQKWTVNGRLRVGFFALRHIPAGEELTFDYQFQRFGESV 248

Query: 922 IPCYCGSKKCHGSL 935
             CYCGS+ C G L
Sbjct: 249 QKCYCGSETCRGFL 262


>gi|159470003|ref|XP_001693149.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158277407|gb|EDP03175.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 1708

 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 58/146 (39%), Positives = 84/146 (57%), Gaps = 8/146 (5%)

Query: 798  RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG-SSYLFRL 856
            RL   +S IH WG  A  P +  D +IEY GELIR  +SD+RE R     +G  +Y+F L
Sbjct: 1521 RLAAGKSAIHGWGAFAKVPHKRGDMLIEYAGELIRPVVSDVREKRMYNDLVGCGTYIFSL 1580

Query: 857  DDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQ------KKIFIYAKRHIAAGEEIT 910
            +    +DATK G +A  +NHSC+PNCY++ I++           + I AKR +   EE+T
Sbjct: 1581 NGQQHIDATKAGNMAHLLNHSCDPNCYSRAITLTDPLTGATTDHVIITAKRDLQPWEELT 1640

Query: 911  YNYKFPLEEKKIPCYCGSKKCHGSLN 936
            Y+Y+F     ++PC CG+  C   +N
Sbjct: 1641 YDYRF-NSAVELPCNCGAASCRLLVN 1665


>gi|162459885|ref|NP_001105650.1| histone-lysine N-methyltransferase EZ2 [Zea mays]
 gi|33112288|sp|Q8S4P5.1|EZ2_MAIZE RecName: Full=Histone-lysine N-methyltransferase EZ2; AltName:
           Full=Enhancer of zeste protein 2
 gi|20152909|gb|AAM13421.1|AF443597_1 enhancer of zeste-like protein 2 [Zea mays]
 gi|413955940|gb|AFW88589.1| putative SET-domain containing protein family [Zea mays]
          Length = 894

 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 78/258 (30%), Positives = 123/258 (47%), Gaps = 32/258 (12%)

Query: 662 RKRTMDGLELHATKALKVAKGTAKQAASRQVAMKKTKASKSRTSNLCPRSDGCARSSISG 721
           RKRT DG + +   +           A +Q+  K    +   T   C +  GC++S    
Sbjct: 633 RKRTDDGKQCYTQYS---------PCACQQMCGKDCPCADKGT--CCEKYCGCSKSC--- 678

Query: 722 WEWHKWSLNASPAERARVRGAQ---YVHTKYLGPEVNASQWAN-GKGLSARTNRVKLRNL 777
               K         +++ R  Q   +  ++   P+V  + W + G G         L   
Sbjct: 679 ----KNKFRGCHCAKSQCRSRQCPCFAASRECDPDVCRNCWVSCGDG--------SLGEP 726

Query: 778 LAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISD 837
           LA  +G +      +  +++R+   RS +  WG     P+   D++ EY GELI  K +D
Sbjct: 727 LARGDGYQCGNMKLLLKQQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEAD 786

Query: 838 IRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFI 897
            R   Y++    SS+LF L+D YV+DA ++G   +F NHS NPNCY KV+ V G  ++ I
Sbjct: 787 KRGKIYDRAN--SSFLFDLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGI 844

Query: 898 YAKRHIAAGEEITYNYKF 915
           YAK HI A EE+ Y+Y++
Sbjct: 845 YAKEHIEASEELFYDYRY 862


>gi|218194855|gb|EEC77282.1| hypothetical protein OsI_15918 [Oryza sativa Indica Group]
          Length = 472

 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K++    R  +++   WGL+A E I A  FV+EY GE+I  K +  R   YE  G+  +Y
Sbjct: 77  KSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRRSQAYENQGLTDAY 136

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DATK+G +ARFINHSC PNC T+  +V G+ ++ I+AK+ I  G E++Y+
Sbjct: 137 IIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGTELSYD 196

Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
           Y F       + C CG+  C G L
Sbjct: 197 YNFEWFGGAMVRCLCGAGSCSGFL 220


>gi|359493199|ref|XP_003634540.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
           vinifera]
          Length = 515

 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K    + +  R++   WGL+A E I+A  FVIEY GE+I  K +  R   Y  +G+  ++
Sbjct: 93  KCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYASLGLKDAF 152

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DATK+G + RFINHSC PNC T+  +V G+ ++ I+AK+ I+ G E+ YN
Sbjct: 153 IISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDISIGTELAYN 212

Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
           Y F      K+ C CG+  C G L
Sbjct: 213 YNFEWYGGAKVRCLCGAISCSGFL 236


>gi|385301625|gb|EIF45804.1| putative histone lysine methyltransferase set2p [Dekkera
           bruxellensis AWRI1499]
          Length = 695

 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 87/149 (58%), Gaps = 15/149 (10%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQR+++ D          +G+ A   I A  F+IEY GE+I  +   IR+  Y K GI 
Sbjct: 111 RFQRNQMADISIFLTEHKGYGMRANCDIPANTFIIEYKGEVIDEQAYKIRKEAYAKEGIK 170

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   + DG ++DATK+G + RF NHSC+PN Y +   V  + ++ I+AKR I  GEEI
Sbjct: 171 HFYFMMIQDGQIIDATKKGSLGRFCNHSCDPNAYXEKWVVNKRYRMGIFAKRKIPKGEEI 230

Query: 910 TYNY---KFPLEEKKIPCYCGSKKCHGSL 935
           T++Y   ++  E +K  CYCG+K C G L
Sbjct: 231 TFDYNVDRYGAEPQK--CYCGAKNCVGYL 257


>gi|449446403|ref|XP_004140961.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Cucumis
           sativus]
          Length = 497

 Score =  112 bits (280), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 95/169 (56%), Gaps = 9/169 (5%)

Query: 776 NLLAAAEGAELLKASQVKARKKRLR---FQRSKI-----HDWGLVALEPIEAEDFVIEYV 827
           N+L + E       S V  R +R +   + ++K+       WGL+A E I+   F+IEY 
Sbjct: 54  NVLTSTECTPGHCPSGVHCRNQRFQKCEYAKTKLFKTEGRGWGLLADENIKNGQFIIEYC 113

Query: 828 GELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVI 887
           GE+I  K +  R   YE  G+  +Y+  L+    +DAT++G +ARFINHSC PNC T+  
Sbjct: 114 GEVISWKEAKRRSHTYENQGLKDAYIISLNASESIDATRKGSLARFINHSCFPNCETRKW 173

Query: 888 SVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCGSKKCHGSL 935
           +V G+ ++ I+AK+ I+ G E+ Y+Y F      K+ C CG+  C G L
Sbjct: 174 NVLGEIRVGIFAKQDISIGTELAYDYNFEWYGGAKVRCLCGASSCSGFL 222


>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
          Length = 1179

 Score =  112 bits (280), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/142 (43%), Positives = 84/142 (59%), Gaps = 9/142 (6%)

Query: 799  LRFQRSKIHDWG---LVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
            L+FQ     DWG   L A + I    F+IEY+GE+I    S IR     K+G+ + Y+  
Sbjct: 994  LKFQT----DWGGNGLKAKKLIPKGSFIIEYIGEIISHDESRIRLEESAKIGVTNYYILE 1049

Query: 856  LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            LD+  ++DA  RG IARFINHSC+PNC      V+G  +I I++KR I  GEE+T+NY+ 
Sbjct: 1050 LDNLRMIDAGPRGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQL 1109

Query: 916  PL--EEKKIPCYCGSKKCHGSL 935
                +E K  C CGSK C G +
Sbjct: 1110 QQSSDEGKTKCLCGSKNCAGFI 1131


>gi|359493195|ref|XP_003634538.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
           vinifera]
          Length = 413

 Score =  112 bits (280), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 57/134 (42%), Positives = 82/134 (61%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           R++   WGL+A E I+A +FV+EY GE+I    +  R   Y   G+   Y+  L+    +
Sbjct: 120 RAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDVYIIPLNARECI 179

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
           DATK+G +ARFINHSC PNC T   SV G+ ++ I+A R+I+ G E+TY+Y F      K
Sbjct: 180 DATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTYSYNFEWYSGAK 239

Query: 922 IPCYCGSKKCHGSL 935
           + C CG+ +C G L
Sbjct: 240 VRCLCGATRCSGFL 253


>gi|147816102|emb|CAN75267.1| hypothetical protein VITISV_028162 [Vitis vinifera]
          Length = 460

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           R++   WGL+A E I+A  F+IEY GE+I    +  R   Y   GI  +Y+  L+    +
Sbjct: 166 RTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGINDAYIISLNARECI 225

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
           DATK G  ARFINHSC PNC T+  SV G+ +I I+A R I+ G E+TY+Y F      K
Sbjct: 226 DATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTELTYDYNFQWYGGAK 285

Query: 922 IPCYCGSKKCHGSL 935
           + C CG+  C G L
Sbjct: 286 VHCLCGATSCCGFL 299


>gi|296081208|emb|CBI18234.3| unnamed protein product [Vitis vinifera]
          Length = 862

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 85/144 (59%), Gaps = 1/144 (0%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           K    + +  R++   WGL+A E I+A  FVIEY GE+I  K +  R   Y  +G+  ++
Sbjct: 494 KCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYASLGLKDAF 553

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           +  L+    +DATK+G + RFINHSC PNC T+  +V G+ ++ I+AK+ I+ G E+ YN
Sbjct: 554 IISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDISIGTELAYN 613

Query: 913 YKFP-LEEKKIPCYCGSKKCHGSL 935
           Y F      K+ C CG+  C G L
Sbjct: 614 YNFEWYGGAKVRCLCGAISCSGFL 637



 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 81/139 (58%), Gaps = 1/139 (0%)

Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
           + +  R++   WGL+A E I+A  F+IEY GE+I    +  R   Y   GI  +Y+  L+
Sbjct: 76  KTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGINDAYIISLN 135

Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP- 916
               +DATK G  ARFINHSC PNC T+  SV G+ +I I+A R I+ G E+TY+Y F  
Sbjct: 136 ARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTELTYDYNFQW 195

Query: 917 LEEKKIPCYCGSKKCHGSL 935
               K+ C CG+  C G L
Sbjct: 196 YGGAKVHCLCGATSCCGFL 214


>gi|342209884|gb|AEL16989.1| ASH1-like protein [Phaseolus vulgaris]
          Length = 481

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 54/128 (42%), Positives = 79/128 (61%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGL+A E ++A  FVIEY GE+I  K +  R   YE  G+  +++  L+    +DAT++G
Sbjct: 95  WGLLAGEDLKAGQFVIEYCGEVISWKEAKRRSQAYENQGLKDAFIICLNASESIDATRKG 154

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+AK  +  G E+ Y+Y F      K+ C CG
Sbjct: 155 SLARFINHSCRPNCETRKWNVLGEIRVGIFAKHDVPIGTELAYDYNFEWFGGAKVRCLCG 214

Query: 928 SKKCHGSL 935
           + KC G L
Sbjct: 215 ALKCSGFL 222


>gi|359493197|ref|XP_003634539.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
           vinifera]
          Length = 413

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           R++   WGL+A E I+A  F+IEY GE+I    +  R   Y   GI  +Y+  L+    +
Sbjct: 119 RTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGINDAYIISLNARECI 178

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
           DATK G  ARFINHSC PNC T+  SV G+ +I I+A R I+ G E+TY+Y F      K
Sbjct: 179 DATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTELTYDYNFQWYGGAK 238

Query: 922 IPCYCGSKKCHGSL 935
           + C CG+  C G L
Sbjct: 239 VHCLCGATSCCGFL 252


>gi|281206847|gb|EFA81031.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 1363

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/148 (43%), Positives = 86/148 (58%), Gaps = 17/148 (11%)

Query: 797  KRLRFQR----------SKIHDWGLVALEPIEAEDFVIEYVGELI-RLKISD-IRECRYE 844
            K  RFQR          +K   WGL A E I A  FVIEY GE+I R +  D +RE   E
Sbjct: 973  KNQRFQRQEYALIAPFNAKKKGWGLKAKEKISAHQFVIEYCGEVITRAQSMDRMREADGE 1032

Query: 845  KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
            K      Y   LD   V+DA+++G +ARFINHSC+PNC T+  SV+G+ +I I+A + I 
Sbjct: 1033 KY----FYFLTLDSKEVLDASRKGNLARFINHSCDPNCETQKWSVDGETRIGIFALKDIE 1088

Query: 905  AGEEITYNYKFP-LEEKKIPCYCGSKKC 931
            AG E+T++Y +  +   K  CYCGS  C
Sbjct: 1089 AGTELTFDYNYERVGSSKQSCYCGSVNC 1116


>gi|452846178|gb|EME48111.1| hypothetical protein DOTSEDRAFT_167709 [Dothistroma septosporum
           NZE10]
          Length = 963

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 82/149 (55%), Gaps = 15/149 (10%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQR K  D          +GL A + +   DFV EY+GE+I   +   R  +Y++ GI 
Sbjct: 202 RFQRKKYADVTVIKTEKKGYGLRADKELRPGDFVYEYIGEVIGENVFRRRMQQYDEEGIK 261

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   L  G  VDATK+G + RF NHSCNPNCY     V  + ++ I+ +R+I AGEE+
Sbjct: 262 HFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVNDKLRMGIFVERNIQAGEEL 321

Query: 910 TYNY---KFPLEEKKIPCYCGSKKCHGSL 935
            +NY   ++  + +  PCYCG   C G +
Sbjct: 322 VFNYNVDRYGADPQ--PCYCGEPNCTGYI 348


>gi|296081207|emb|CBI18233.3| unnamed protein product [Vitis vinifera]
          Length = 370

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 57/139 (41%), Positives = 84/139 (60%), Gaps = 1/139 (0%)

Query: 798 RLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLD 857
           + +  R++   WGL+A E I+A +FV+EY GE+I    +  R   Y   G+   Y+  L+
Sbjct: 72  KTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDVYIIPLN 131

Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP- 916
               +DATK+G +ARFINHSC PNC T   SV G+ ++ I+A R+I+ G E+TY+Y F  
Sbjct: 132 ARECIDATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTYSYNFEW 191

Query: 917 LEEKKIPCYCGSKKCHGSL 935
               K+ C CG+ +C G L
Sbjct: 192 YSGAKVRCLCGATRCSGFL 210


>gi|196001997|ref|XP_002110866.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
 gi|190586817|gb|EDV26870.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
          Length = 1004

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 52/128 (40%), Positives = 78/128 (60%), Gaps = 1/128 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           +GL  LE +E   FV+EY GE+I L+  + R+  Y K  I   Y   L    ++DA+++G
Sbjct: 135 FGLRTLEDLEDNQFVLEYCGEVIDLREFERRKRDYAKKKIKHYYFMTLSPNEIIDASRKG 194

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKI-PCYCG 927
             +RFINHSC+PNC T+  +V G  +I  +  R I A  E+T++Y+F    +++  CYCG
Sbjct: 195 TFSRFINHSCDPNCVTQKWTVNGMLRIGFFTLRKIPANTELTFDYQFERYGREVQECYCG 254

Query: 928 SKKCHGSL 935
           S+KC G L
Sbjct: 255 SEKCRGYL 262


>gi|398407533|ref|XP_003855232.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
 gi|339475116|gb|EGP90208.1| histone methyltransferase [Zymoseptoria tritici IPO323]
          Length = 799

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/150 (38%), Positives = 81/150 (54%), Gaps = 15/150 (10%)

Query: 799 LRFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGI 848
           +RFQR K  +          +GL     +   DF+ EY+GE+I   +   R  +Y++ GI
Sbjct: 115 MRFQRKKYANVDVIKTEKKGYGLRTQTDLRPNDFIFEYIGEVIGENVFRRRMQQYDEDGI 174

Query: 849 GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEE 908
              Y   L  G  VDATKRG + RF NHSCNPNCY     V  + ++ I+A+R I AGEE
Sbjct: 175 KHFYFMSLTKGEFVDATKRGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERAIQAGEE 234

Query: 909 ITYNY---KFPLEEKKIPCYCGSKKCHGSL 935
           + +NY   ++  E +  PCYCG   C G +
Sbjct: 235 LVFNYNVDRYGAEPQ--PCYCGEANCTGYI 262


>gi|33305503|gb|AAQ02781.1|AF373874_1 Mll protein [Xenopus laevis]
          Length = 84

 Score =  111 bits (277), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 50/84 (59%), Positives = 62/84 (73%), Gaps = 2/84 (2%)

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
           R+DD  VVDAT  G  ARFINHSC PNCY++VI ++GQK I I+A R I  GEE+TY+YK
Sbjct: 1   RIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIYRGEELTYDYK 60

Query: 915 FPLEE--KKIPCYCGSKKCHGSLN 936
           FP+E+   K+ C CG+KKC   LN
Sbjct: 61  FPIEDANNKLACNCGTKKCRKFLN 84


>gi|225562106|gb|EEH10386.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
          Length = 1000

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/145 (39%), Positives = 75/145 (51%), Gaps = 11/145 (7%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQR +  D          +GL A   +    F+ EY+GE+I   +   R   Y++ GI 
Sbjct: 199 RFQRRQYADVSVIKTEKKGYGLRAESDLRPNQFIFEYIGEVINETLFRKRMLSYDEEGIK 258

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   L  G  VDATK+G + RF NHSCNPNCY     V  + ++ I+A+RHI AGEE+
Sbjct: 259 HFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEEL 318

Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHG 933
            +NY          PCYCG   C G
Sbjct: 319 VFNYNVDRYGADPQPCYCGEPNCTG 343


>gi|407926722|gb|EKG19683.1| WW/Rsp5/WWP [Macrophomina phaseolina MS6]
          Length = 972

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 74/132 (56%), Gaps = 1/132 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           R+    +GL A   ++  DFV EY+GE+I  +    R  +Y++ GI   Y   L  G  V
Sbjct: 214 RTDKKGYGLRANTDLKPNDFVYEYIGEVINERAFRGRMVQYDEEGIKHFYFMSLSKGEFV 273

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
           DATK+G + RF NHSCNPNC+     V  + ++ I+A+RHI AGEE+ +NY         
Sbjct: 274 DATKKGNLGRFCNHSCNPNCFVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADP 333

Query: 922 IPCYCGSKKCHG 933
            PCYCG   C G
Sbjct: 334 QPCYCGEANCTG 345


>gi|302836397|ref|XP_002949759.1| histone H3 lys4 methyltransferase [Volvox carteri f. nagariensis]
 gi|300265118|gb|EFJ49311.1| histone H3 lys4 methyltransferase [Volvox carteri f. nagariensis]
          Length = 118

 Score =  111 bits (277), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 51/118 (43%), Positives = 76/118 (64%), Gaps = 5/118 (4%)

Query: 802 QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRL----D 857
           +RS I   G+ A E I A+  ++EY GE +R  ++D+RE RY   G+G  YLF      D
Sbjct: 2   RRSGIAQLGVFAAERIPADTLLMEYHGEAVRTSVADLREKRYSAAGLGC-YLFNPGGLPD 60

Query: 858 DGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           +  V+DAT RG I RF+NHSC PNC ++ + +EGQ++IF+++   +  G E+TY+YK 
Sbjct: 61  EAVVLDATHRGNIMRFVNHSCGPNCVSRTVVIEGQRRIFLFSLHELHPGTELTYDYKL 118


>gi|147846734|emb|CAN80636.1| hypothetical protein VITISV_017995 [Vitis vinifera]
          Length = 278

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/134 (42%), Positives = 82/134 (61%), Gaps = 1/134 (0%)

Query: 803 RSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVV 862
           R++   WGL+A E I+A +FV+EY GE+I    +  R   Y   G+   Y+  L+    +
Sbjct: 120 RAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDVYIIPLNARECI 179

Query: 863 DATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKK 921
           DATK+G +ARFINHSC PNC T   SV G+ ++ I+A R+I+ G E+TY+Y F      K
Sbjct: 180 DATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTYSYNFEWYSXAK 239

Query: 922 IPCYCGSKKCHGSL 935
           + C CG+ +C G L
Sbjct: 240 VRCLCGATRCSGFL 253


>gi|452985541|gb|EME85297.1| hypothetical protein MYCFIDRAFT_83386 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 835

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 57/147 (38%), Positives = 81/147 (55%), Gaps = 15/147 (10%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQR +  +          +GL A   + A DF+ EY+GE+I   +   +  +Y++ GI 
Sbjct: 122 RFQRKQYANVSVIQTEKKGYGLRANTDLRANDFIFEYIGEVIGENVFRRKMQQYDEQGIK 181

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   L  G  VDATK+G + RF NHSCNPNCY     V  + ++ I+A+R I AGEE+
Sbjct: 182 HFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERKIKAGEEL 241

Query: 910 TYNY---KFPLEEKKIPCYCGSKKCHG 933
            +NY   ++  E +  PCYCG   C G
Sbjct: 242 VFNYNVDRYGAEPQ--PCYCGEPNCTG 266


>gi|410902883|ref|XP_003964923.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Takifugu
           rubripes]
          Length = 766

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 56/135 (41%), Positives = 80/135 (59%), Gaps = 2/135 (1%)

Query: 781 AEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRE 840
           ++G      S  +  KK L    S +  WG    EP++  +F+ EY GELI    +D R 
Sbjct: 616 SKGVSCKNCSIQRGLKKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRG 675

Query: 841 CRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAK 900
             Y+K    SS+LF L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AK
Sbjct: 676 RIYDKYM--SSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAK 733

Query: 901 RHIAAGEEITYNYKF 915
           R I  GEE+ ++Y++
Sbjct: 734 RAILQGEELFFDYRY 748


>gi|449300569|gb|EMC96581.1| hypothetical protein BAUCODRAFT_481348 [Baudoinia compniacensis
           UAMH 10762]
          Length = 992

 Score =  110 bits (275), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 76/130 (58%), Gaps = 5/130 (3%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           +GL     ++A DF+ EY+GE+I   +   R  +Y++ GI   Y   L  G  VDATK+G
Sbjct: 222 YGLRTNTNLKANDFIFEYIGEVIGENVFRRRMQQYDEEGIKHFYFMSLTKGEFVDATKKG 281

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNY---KFPLEEKKIPCY 925
            + RF NHSCNPNCY     V  + ++ I+A+R I AGEE+ +NY   ++  E +  PCY
Sbjct: 282 NLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERKIQAGEELVFNYNVDRYGAEPQ--PCY 339

Query: 926 CGSKKCHGSL 935
           CG   C G +
Sbjct: 340 CGEPNCTGYI 349


>gi|313221636|emb|CBY36121.1| unnamed protein product [Oikopleura dioica]
          Length = 207

 Score =  110 bits (275), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 53/114 (46%), Positives = 72/114 (63%), Gaps = 2/114 (1%)

Query: 822 FVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPN 881
           F+IEY+GE+I    S IR     K+G+ + Y+  LD+  ++DA  RG IARFINHSC+PN
Sbjct: 44  FIIEYIGEIISHDESRIRLEESAKIGVTNYYILELDNLRMIDAGPRGNIARFINHSCDPN 103

Query: 882 CYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL--EEKKIPCYCGSKKCHG 933
           C      V+G  +I I++KR I  GEE+T+NY+     +E K  C CGSK C G
Sbjct: 104 CGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQLQQSSDEGKTKCLCGSKNCAG 157


>gi|159490078|ref|XP_001703016.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158270923|gb|EDO96754.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 2983

 Score =  110 bits (275), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 63/176 (35%), Positives = 96/176 (54%), Gaps = 12/176 (6%)

Query: 771  RVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGEL 830
            R   R L A      L +A  +   + R+ F +S IH WG+ A   I  +  V E+ GE+
Sbjct: 2810 RASERRLKAVRWEERLQRA--LATERARITFGKSGIHGWGVFARVDIPQDAVVTEFRGEV 2867

Query: 831  IRLKISDIRECRYEKMGIG------SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYT 884
            +R  ++++RE RY   G          YLF +    V+D+T  G   RF NHSC+P+ YT
Sbjct: 2868 VRPVLAEMRERRYRAAGRAWACAGKDCYLFHVSRELVLDSTHLGHYGRFANHSCSPSLYT 2927

Query: 885  KVISVEGQKKIFIY--AKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
            KV+  EG +++ +   A+  I AG+E+T++Y+F  EE   K+ C CG+  C G+LN
Sbjct: 2928 KVLEFEGGRRVRLAFCARVDIRAGQELTFDYRFKEEEGSAKVACRCGAPNCKGTLN 2983


>gi|406606267|emb|CCH42258.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
          Length = 1074

 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 56/126 (44%), Positives = 75/126 (59%), Gaps = 3/126 (2%)

Query: 810 GLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGG 869
           GL+++    A   V+EY GE+I L   + R     K    S Y   L++ YV+DA ++G 
Sbjct: 259 GLLSIRSFNAGSLVVEYTGEVIHLDEVEHRLNTIYKES-DSYYFLGLEEEYVIDAGQKGS 317

Query: 870 IARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIP--CYCG 927
           +ARF NHSC+PN   +   V G+ +I ++AKR I AGEEITY+Y F   E   P  CYCG
Sbjct: 318 VARFANHSCDPNAEMQKWYVNGEPRIGLFAKRSIEAGEEITYDYNFEWFENGEPQKCYCG 377

Query: 928 SKKCHG 933
           SK CHG
Sbjct: 378 SKNCHG 383


>gi|212535752|ref|XP_002148032.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
 gi|210070431|gb|EEA24521.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
          Length = 900

 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 78/145 (53%), Gaps = 11/145 (7%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQR +  +          +GL A   I+   F+ EY+GE+I       R  +Y+K GI 
Sbjct: 211 RFQRKEYANVTVIKTAKKGFGLRAESDIQPHQFIYEYIGEVINEANFRRRMIQYDKEGIK 270

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   L+ G  VDATK+G +ARF NHSCNPNCY     V  + ++ I+A+R+I AGEE+
Sbjct: 271 HFYFMSLNKGEFVDATKKGNLARFCNHSCNPNCYVDKWVVGEKLRMGIFAERYIQAGEEL 330

Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHG 933
            +NY          PCYCG   C G
Sbjct: 331 VFNYNVDRYGADPQPCYCGEPNCSG 355


>gi|410927554|ref|XP_003977206.1| PREDICTED: histone-lysine N-methyltransferase EZH2-like [Takifugu
           rubripes]
          Length = 760

 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 53/123 (43%), Positives = 76/123 (61%), Gaps = 2/123 (1%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           +  KK L    S +  WG+   EP++  DF+ EY GE+I    +D R   Y+K     S+
Sbjct: 622 RGAKKHLLLAPSDVAGWGIFIKEPVQKNDFISEYCGEIISQDEADRRGKVYDKYMC--SF 679

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           LF L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I  GEE+ ++
Sbjct: 680 LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFD 739

Query: 913 YKF 915
           Y++
Sbjct: 740 YRY 742


>gi|449510125|ref|XP_004176585.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2-like, partial [Taeniopygia guttata]
          Length = 4299

 Score =  110 bits (274), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 60/144 (41%), Positives = 91/144 (63%), Gaps = 3/144 (2%)

Query: 796  KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYE-KMGIGSSYLF 854
            K  +   RS+I   GL A + IE    VIEY+G +IR ++++ RE  YE ++     Y+F
Sbjct: 4156 KNNVYLARSRIQGLGLYAAKDIEKHTMVIEYIGTIIRNEVANRREKIYEEQVRTRGIYMF 4215

Query: 855  RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            R+++ +V+DA   GG AR+INHSC PNC  +V++ + + KI I + R I  GEE+TY+Y+
Sbjct: 4216 RINNEHVIDAPLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 4275

Query: 915  FPLE--EKKIPCYCGSKKCHGSLN 936
            F  E  + KIPC+CG+  C   +N
Sbjct: 4276 FDFEDDQHKIPCHCGAWNCRKWMN 4299


>gi|16605541|emb|CAC86146.1| EZH2 homolog [Tetraodon nigroviridis]
          Length = 759

 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 53/123 (43%), Positives = 76/123 (61%), Gaps = 2/123 (1%)

Query: 793 KARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSY 852
           +  KK L    S +  WG+   EP++  DF+ EY GE+I    +D R   Y+K     S+
Sbjct: 621 RGAKKHLLLAPSDVAGWGIFIKEPVQKNDFISEYCGEIISQDEADRRGKVYDKYMC--SF 678

Query: 853 LFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYN 912
           LF L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I  GEE+ ++
Sbjct: 679 LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFD 738

Query: 913 YKF 915
           Y++
Sbjct: 739 YRY 741


>gi|296808927|ref|XP_002844802.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
 gi|238844285|gb|EEQ33947.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
          Length = 969

 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 79/147 (53%), Gaps = 11/147 (7%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQ+ +  D          +GL A   ++  DF+ EY+GE+I       R  +Y++ GI 
Sbjct: 188 RFQKRQYADVTVIKTEKKGFGLRANTDLQPNDFIFEYIGEVINEPQFRKRMIQYDEEGIK 247

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   L+ G  VDAT++G + RF NHSCNPNCY     V  + ++ I+A+RH+ AGEE+
Sbjct: 248 HFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKLRMGIFAERHVKAGEEL 307

Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHGSL 935
            +NY          PCYCG   C G +
Sbjct: 308 VFNYNVDRYGADPQPCYCGEANCLGYI 334


>gi|432867980|ref|XP_004071355.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           EZH1-like [Oryzias latipes]
          Length = 780

 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    EP++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 645 KKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRGRIYDKYM--SSFLFN 702

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I  GEE+ ++Y++
Sbjct: 703 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAILQGEELFFDYRY 762


>gi|295901334|dbj|BAJ07324.1| enhancer of zeste homolog 1 (Drosophila) [Oryzias latipes]
          Length = 766

 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    EP++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 631 KKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRGRIYDKYM--SSFLFN 688

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I  GEE+ ++Y++
Sbjct: 689 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAILQGEELFFDYRY 748


>gi|449530380|ref|XP_004172173.1| PREDICTED: uncharacterized protein LOC101230765, partial [Cucumis
           sativus]
          Length = 627

 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 64/178 (35%), Positives = 106/178 (59%), Gaps = 6/178 (3%)

Query: 89  GSIEIFWGSYNIVCKMLFDHCMQVMWNAVFGDRVAEYSSAWRKRKLWSGHPKITGPASDY 148
           GSI+ F   + ++C+MLFD+ +QV+WNAV  D VAEYSSAWR+++ WS  P  +  +S Y
Sbjct: 449 GSIDNFREVHAVICQMLFDYSLQVVWNAVSYDTVAEYSSAWRRKRFWSYRPHYSLASSGY 508

Query: 149 KDDRKRMEQAPS-----RHDSSVSDDDCPPGFGMVEIRTENDVQPYHLSLSVPVGENLSK 203
           +D  K++E+ P+     R +SS+              +TEN  +   +SLSVPVG   S+
Sbjct: 509 RDRVKKIEKTPAEASLPRKESSLHGVSSLSVSKFKGAQTENCARSAVISLSVPVGHKSSR 568

Query: 204 QKNLSCNDHLLLDDVKCILDGVENELYLSTKATYTEYVEILVEDEVRKVVSASKGINM 261
             + SC +    +D+K +++ +E EL+ S K +  EY++ ++E+EV    +AS  + +
Sbjct: 569 PTSHSCCER-PKEDLKWMVEYLEKELHSSAKVSMAEYIQDILEEEVISSCNASTDVKL 625


>gi|348509107|ref|XP_003442093.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like
           [Oreochromis niloticus]
          Length = 768

 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    EP++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 633 KKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRGRIYDKYM--SSFLFN 690

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I  GEE+ ++Y++
Sbjct: 691 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAILQGEELFFDYRY 750


>gi|340515729|gb|EGR45981.1| predicted protein [Trichoderma reesei QM6a]
          Length = 933

 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 57/147 (38%), Positives = 80/147 (54%), Gaps = 15/147 (10%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQR +  D          +GL A  P+E  DF+ EY+GE+I       R  +Y++ GI 
Sbjct: 173 RFQRKQYADVTVIKTEKKGFGLRANSPLEPNDFIYEYIGEVINEPTFRRRMLQYDEEGIK 232

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   L+    VDATK+G + RF NHSC+PNC+     V  + ++ I+A R I AGEE+
Sbjct: 233 HFYFMSLNKNEFVDATKKGNLGRFCNHSCSPNCFVDKWVVGDKLRMGIFALRKIQAGEEL 292

Query: 910 TYNY---KFPLEEKKIPCYCGSKKCHG 933
            +NY   ++  E +  PCYCG   C G
Sbjct: 293 VFNYNVDRYGAEPQ--PCYCGEANCVG 317


>gi|320118893|ref|NP_001035072.2| enhancer of zeste 1 [Danio rerio]
          Length = 756

 Score =  109 bits (273), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    EP++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 621 KKHLLLAPSDVAGWGTFIKEPVQKNEFISEYCGELISQDEADRRGRIYDKYM--SSFLFN 678

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I  GEE+ ++Y++
Sbjct: 679 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQQGEELFFDYRY 738


>gi|302811554|ref|XP_002987466.1| hypothetical protein SELMODRAFT_446978 [Selaginella moellendorffii]
 gi|300144872|gb|EFJ11553.1| hypothetical protein SELMODRAFT_446978 [Selaginella moellendorffii]
          Length = 835

 Score =  109 bits (273), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 60/152 (39%), Positives = 86/152 (56%), Gaps = 14/152 (9%)

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           G  AR +  + RN+        LLK  Q      R+   RS +  WG     P+   D++
Sbjct: 666 GPPARGDSYECRNMKL------LLKQQQ------RVLLGRSDVAGWGAFLKTPVNKHDYL 713

Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
            EY GELI  + +D R   Y++    SS+LF L+D YV+DA ++G   +F NHS NPNCY
Sbjct: 714 GEYTGELISHREADKRGKIYDREN--SSFLFNLNDQYVLDACRKGDKLKFANHSPNPNCY 771

Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            KVI V G  ++ I+AK  I+AGEE+ Y+Y++
Sbjct: 772 AKVIMVAGDHRVGIFAKERISAGEELFYDYRY 803


>gi|383316103|ref|YP_005376945.1| SET domain-containing protein [Frateuria aurantia DSM 6220]
 gi|379043207|gb|AFC85263.1| SET domain-containing protein [Frateuria aurantia DSM 6220]
          Length = 180

 Score =  109 bits (273), Expect = 6e-21,   Method: Composition-based stats.
 Identities = 66/158 (41%), Positives = 91/158 (57%), Gaps = 15/158 (9%)

Query: 790 SQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMG-I 848
           S+     +R+R +RS IH  G+ AL  +   D V+ Y GELI    +D R   Y   G  
Sbjct: 8   SRFPPMTQRIRARRSPIHGRGVFALADLAKGDCVVRYRGELISHAQADER---YGDDGES 64

Query: 849 GSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKV-ISVEG---QKKIFIYAKRHIA 904
           G ++LF L++ YVVD  +RG +AR+INHSC PNC   + IS  G   + +I I A R IA
Sbjct: 65  GHTFLFTLNEQYVVDGNRRGNVARWINHSCMPNCQAVIEISAGGDPRRDRILIEAVRDIA 124

Query: 905 AGEEITYNYKFPLE-------EKKIPCYCGSKKCHGSL 935
           AGEEITY+Y   L        ++  PC+CG+ +C G+L
Sbjct: 125 AGEEITYDYGIVLAVPHTAAMKRLWPCHCGAARCTGTL 162


>gi|156374111|ref|XP_001629652.1| predicted protein [Nematostella vectensis]
 gi|156216657|gb|EDO37589.1| predicted protein [Nematostella vectensis]
          Length = 463

 Score =  109 bits (273), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 91/155 (58%), Gaps = 7/155 (4%)

Query: 788 KASQVKARKKRLR----FQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRY 843
           KASQ +  K   R      RS I   GL A   +E    VIEY+G +IR ++++ +E  Y
Sbjct: 310 KASQYRKLKTEWRQNVFLGRSNIQGLGLFANRDMEPGCMVIEYIGSIIRNEVANKKESIY 369

Query: 844 EKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHI 903
           E    G  Y+FR+D   V+DAT  GG AR+INHSC PNC  +V++ E ++KI I + R I
Sbjct: 370 ESQNRGI-YMFRIDSDSVIDATIAGGPARYINHSCMPNCVAEVVTFEKEQKIIIISSRKI 428

Query: 904 AAGEEITYNYKFPLE--EKKIPCYCGSKKCHGSLN 936
             GEE+TY+YKF  E  E KI C CG+  C   +N
Sbjct: 429 EKGEELTYDYKFDFEDDEHKISCLCGAPNCRKWMN 463


>gi|162463310|ref|NP_001105079.1| histone-lysine N-methyltransferase EZ3 [Zea mays]
 gi|33112287|sp|Q8S4P4.1|EZ3_MAIZE RecName: Full=Histone-lysine N-methyltransferase EZ3; AltName:
           Full=Enhancer of zeste protein 3
 gi|20152911|gb|AAM13422.1|AF443598_1 enhancer of zeste-like protein 3 [Zea mays]
          Length = 895

 Score =  109 bits (273), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 76/121 (62%), Gaps = 2/121 (1%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           +++R+   RS +  WG     P+   D++ EY GELI  K +D R   Y++    SS+LF
Sbjct: 745 QQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEADKRGKIYDRAN--SSFLF 802

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            L+D YV+DA ++G   +F NHS NPNCY KV+ V G  ++ IYAK HI A EE+ Y+Y+
Sbjct: 803 DLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGIYAKEHIEASEELFYDYR 862

Query: 915 F 915
           +
Sbjct: 863 Y 863


>gi|302796619|ref|XP_002980071.1| hypothetical protein SELMODRAFT_111860 [Selaginella moellendorffii]
 gi|300152298|gb|EFJ18941.1| hypothetical protein SELMODRAFT_111860 [Selaginella moellendorffii]
          Length = 820

 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 60/152 (39%), Positives = 86/152 (56%), Gaps = 14/152 (9%)

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           G  AR +  + RN+        LLK  Q      R+   RS +  WG     P+   D++
Sbjct: 651 GPPARGDSYECRNMKL------LLKQQQ------RVLLGRSDVAGWGAFLKTPVNKHDYL 698

Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
            EY GELI  + +D R   Y++    SS+LF L+D YV+DA ++G   +F NHS NPNCY
Sbjct: 699 GEYTGELISHREADKRGKIYDREN--SSFLFNLNDQYVLDACRKGDKLKFANHSPNPNCY 756

Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            KVI V G  ++ I+AK  I+AGEE+ Y+Y++
Sbjct: 757 AKVIMVAGDHRVGIFAKERISAGEELFYDYRY 788


>gi|414866465|tpg|DAA45022.1| TPA: putative SET-domain containing protein family [Zea mays]
          Length = 898

 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 76/121 (62%), Gaps = 2/121 (1%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           +++R+   RS +  WG     P+   D++ EY GELI  K +D R   Y++    SS+LF
Sbjct: 748 QQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEADKRGKIYDRAN--SSFLF 805

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            L+D YV+DA ++G   +F NHS NPNCY KV+ V G  ++ IYAK HI A EE+ Y+Y+
Sbjct: 806 DLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGIYAKEHIEASEELFYDYR 865

Query: 915 F 915
           +
Sbjct: 866 Y 866


>gi|67903676|ref|XP_682094.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
 gi|74592565|sp|Q5ASA5.1|SET2_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|40740923|gb|EAA60113.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
 gi|259482964|tpe|CBF77940.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-36 specific (EC
           2.1.1.43)(SET domain-containing protein 2)
           [Source:UniProtKB/Swiss-Prot;Acc:Q5ASA5] [Aspergillus
           nidulans FGSC A4]
          Length = 980

 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 75/145 (51%), Gaps = 11/145 (7%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQR +  +          +GL A E +    F+ EYVGE+I       R  +Y+  GI 
Sbjct: 234 RFQRREYANVAVIKTEKKGYGLRAEEDLRPHQFIFEYVGEVINEGPFHRRMRQYDAEGIK 293

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   L  G  VDATK+G + RF NHSCNPNCY     V  + ++ I+A+RHI AGEE+
Sbjct: 294 HFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIQAGEEL 353

Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHG 933
            +NY          PCYCG   C G
Sbjct: 354 VFNYNVDRYGADPQPCYCGEPNCTG 378


>gi|414866464|tpg|DAA45021.1| TPA: putative SET-domain containing protein family [Zea mays]
          Length = 895

 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 76/121 (62%), Gaps = 2/121 (1%)

Query: 795 RKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLF 854
           +++R+   RS +  WG     P+   D++ EY GELI  K +D R   Y++    SS+LF
Sbjct: 745 QQQRILLGRSDVAGWGAFIKNPVNKNDYLGEYTGELISHKEADKRGKIYDRAN--SSFLF 802

Query: 855 RLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 914
            L+D YV+DA ++G   +F NHS NPNCY KV+ V G  ++ IYAK HI A EE+ Y+Y+
Sbjct: 803 DLNDQYVLDAYRKGDKLKFANHSSNPNCYAKVMLVAGDHRVGIYAKEHIEASEELFYDYR 862

Query: 915 F 915
           +
Sbjct: 863 Y 863


>gi|384253240|gb|EIE26715.1| hypothetical protein COCSUDRAFT_59232 [Coccomyxa subellipsoidea
           C-169]
          Length = 798

 Score =  109 bits (272), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 55/126 (43%), Positives = 79/126 (62%), Gaps = 4/126 (3%)

Query: 789 ASQVKARKKRLRF--QRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKM 846
           A+ + A+KK  RF  +RS I   G+  LE  EA + ++E VGE++R  ++ +RE    + 
Sbjct: 673 AAVIAAKKKMGRFNIRRSDISGLGVFTLEFCEAGELLMECVGEVLRRPVASMREAAAIER 732

Query: 847 GIG--SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIA 904
           GI   + Y+F LD  +VVD T  G IARF+NHSC+PNC    I V+G + IF+ A R +A
Sbjct: 733 GINDVACYVFALDADHVVDTTSSGNIARFVNHSCSPNCVATAIDVKGARHIFLIAARQLA 792

Query: 905 AGEEIT 910
            GEE+T
Sbjct: 793 PGEELT 798


>gi|357443579|ref|XP_003592067.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
 gi|355481115|gb|AES62318.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
          Length = 491

 Score =  109 bits (272), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 77/124 (62%), Gaps = 1/124 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGL+A E I+A  FVIEY GE+I  K +  R   YE  G+  +++  L+    +DAT++G
Sbjct: 108 WGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYEIQGLKDAFIISLNASESIDATRKG 167

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+A  +I  G E+ Y+Y F      K+ C CG
Sbjct: 168 SLARFINHSCQPNCETRKWNVMGEIRVGIFALENIPIGTELAYDYNFEWFGGAKVRCLCG 227

Query: 928 SKKC 931
           + KC
Sbjct: 228 ALKC 231


>gi|336266160|ref|XP_003347849.1| hypothetical protein SMAC_06682 [Sordaria macrospora k-hell]
 gi|380091782|emb|CCC10510.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 907

 Score =  109 bits (272), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 57/145 (39%), Positives = 74/145 (51%), Gaps = 11/145 (7%)

Query: 800 RFQRSKIHD----------WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIG 849
           RFQR +  D          +GL A   ++  DF+ EY+GE+I       R  +Y+K GI 
Sbjct: 175 RFQRKQYADVSVIKTEKKGFGLRANTDLQVNDFIFEYIGEVINEPTFRSRMIKYDKEGIK 234

Query: 850 SSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEI 909
             Y   L     VDATK+G + RF NHSCNPNCY     V  + ++ I+A R I AGEE+
Sbjct: 235 HFYFMSLTKSEFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAGRAIKAGEEL 294

Query: 910 TYNYKFP-LEEKKIPCYCGSKKCHG 933
            +NY          PCYCG   C G
Sbjct: 295 VFNYNVDRYGADPQPCYCGEPNCTG 319


>gi|149391157|gb|ABR25596.1| set domain containing protein [Oryza sativa Indica Group]
          Length = 107

 Score =  109 bits (272), Expect = 8e-21,   Method: Composition-based stats.
 Identities = 51/107 (47%), Positives = 72/107 (67%), Gaps = 6/107 (5%)

Query: 836 SDIRECRYE----KMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEG 891
           +D RE  Y+    +    + Y F++   +++DAT++GGIARFINHSC PNC  KVISV  
Sbjct: 1   ADKREIEYQSGKRQQYKSACYFFKIGKEHIIDATRKGGIARFINHSCQPNCVAKVISVRN 60

Query: 892 QKKIFIYAKRHIAAGEEITYNYKFPLEE--KKIPCYCGSKKCHGSLN 936
           +KK+  +A+RHI  GEEITY+Y F  E+  ++IPC+C S+ C   LN
Sbjct: 61  EKKVVFFAERHINPGEEITYDYHFNREDEGQRIPCFCRSRGCRRYLN 107


>gi|357443507|ref|XP_003592031.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
 gi|355481079|gb|AES62282.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
          Length = 479

 Score =  109 bits (272), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 76/124 (61%), Gaps = 1/124 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGL+A E I+A  FVIEY GE+I  K +  R   YE  G+  +++  L+    +DAT++G
Sbjct: 96  WGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYEIQGLKDAFIISLNASESIDATRKG 155

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+A   I  G E+ Y+Y F      K+ C CG
Sbjct: 156 SLARFINHSCQPNCETRKWNVMGEIRVGIFALEDIPIGTELAYDYNFEWFGGAKVRCLCG 215

Query: 928 SKKC 931
           + KC
Sbjct: 216 ALKC 219


>gi|242011020|ref|XP_002426255.1| protein MLP1, putative [Pediculus humanus corporis]
 gi|212510318|gb|EEB13517.1| protein MLP1, putative [Pediculus humanus corporis]
          Length = 2688

 Score =  109 bits (272), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 77/130 (59%), Gaps = 5/130 (3%)

Query: 809  WGLVALEPIEAEDFVIEYVGELIRLK-ISDIRECRYEKMGIGSSYLFRLDDGYVVDATKR 867
            WG+   +PI+A DF++EYVGE++  K   D     Y  +     Y   LD G V+D  + 
Sbjct: 1958 WGIRTKQPIKAGDFILEYVGEVVSDKEFKDRMASIY--VNDKHHYCLHLDGGLVIDGHRM 2015

Query: 868  GGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPL--EEKKIPCY 925
            GG  RF+NHSCNPNC  +  SV G  ++ ++A R+I A +E+TY+Y F L    +  PCY
Sbjct: 2016 GGDGRFVNHSCNPNCEMQKWSVNGLFRMALFALRNIPAHQELTYDYNFSLFNPAEGQPCY 2075

Query: 926  CGSKKCHGSL 935
            CG+ +C G +
Sbjct: 2076 CGTNECRGVI 2085


>gi|354485080|ref|XP_003504712.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Cricetulus
           griseus]
 gi|344251953|gb|EGW08057.1| Histone-lysine N-methyltransferase EZH1 [Cricetulus griseus]
          Length = 747

 Score =  109 bits (272), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|357443463|ref|XP_003592009.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
 gi|355481057|gb|AES62260.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
          Length = 616

 Score =  109 bits (272), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 76/124 (61%), Gaps = 1/124 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           WGL+A E I+A  FVIEY GE+I  K +  R   YE  G+  +++  L+    +DAT++G
Sbjct: 233 WGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYEIQGLKDAFIISLNASESIDATRKG 292

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            +ARFINHSC PNC T+  +V G+ ++ I+A   I  G E+ Y+Y F      K+ C CG
Sbjct: 293 SLARFINHSCQPNCETRKWNVMGEIRVGIFALEDIPIGTELAYDYNFEWFGGAKVRCLCG 352

Query: 928 SKKC 931
           + KC
Sbjct: 353 ALKC 356


>gi|302826182|ref|XP_002994616.1| hypothetical protein SELMODRAFT_138884 [Selaginella moellendorffii]
 gi|300137307|gb|EFJ04316.1| hypothetical protein SELMODRAFT_138884 [Selaginella moellendorffii]
          Length = 833

 Score =  109 bits (272), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 60/152 (39%), Positives = 86/152 (56%), Gaps = 14/152 (9%)

Query: 764 GLSARTNRVKLRNLLAAAEGAELLKASQVKARKKRLRFQRSKIHDWGLVALEPIEAEDFV 823
           G  AR +  + RN+        LLK  Q      R+   RS +  WG     P+   D++
Sbjct: 664 GPPARGDSYECRNMKL------LLKQQQ------RVLLGRSDVAGWGAFLKTPVNKHDYL 711

Query: 824 IEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCNPNCY 883
            EY GELI  + +D R   Y++    SS+LF L+D YV+DA ++G   +F NHS NPNCY
Sbjct: 712 GEYTGELISHREADKRGKIYDREN--SSFLFNLNDQYVLDACRKGDKLKFANHSPNPNCY 769

Query: 884 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
            KVI V G  ++ I+AK  I+AGEE+ Y+Y++
Sbjct: 770 AKVIMVAGDHRVGIFAKERISAGEELFYDYRY 801


>gi|291406159|ref|XP_002719453.1| PREDICTED: enhancer of zeste homolog 1-like [Oryctolagus cuniculus]
          Length = 747

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|395826329|ref|XP_003786371.1| PREDICTED: histone-lysine N-methyltransferase EZH1 [Otolemur
           garnettii]
          Length = 753

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 618 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 675

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 676 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 735


>gi|325091551|gb|EGC44861.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
          Length = 1000

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/126 (41%), Positives = 69/126 (54%), Gaps = 1/126 (0%)

Query: 809 WGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFRLDDGYVVDATKRG 868
           +GL A   +    F+ EY+GE+I   +   R   Y++ GI   Y   L  G  VDATK+G
Sbjct: 218 YGLRAESDLRPNQFIFEYIGEVINETLFRKRMLSYDEEGIKHFYFMSLSKGEFVDATKKG 277

Query: 869 GIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFP-LEEKKIPCYCG 927
            + RF NHSCNPNCY     V  + ++ I+A+RHI AGEE+ +NY          PCYCG
Sbjct: 278 NLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYCG 337

Query: 928 SKKCHG 933
              C G
Sbjct: 338 EPNCTG 343


>gi|440903225|gb|ELR53912.1| Histone-lysine N-methyltransferase EZH1, partial [Bos grunniens
           mutus]
          Length = 752

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 617 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 674

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 675 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 734


>gi|348562498|ref|XP_003467047.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Cavia
           porcellus]
          Length = 747

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|426238089|ref|XP_004012990.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1 [Ovis
           aries]
 gi|223635231|sp|A7E2Z2.2|EZH1_BOVIN RecName: Full=Histone-lysine N-methyltransferase EZH1; AltName:
           Full=Enhancer of zeste homolog 1
          Length = 747

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|296201474|ref|XP_002748044.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1
           [Callithrix jacchus]
          Length = 753

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 618 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 675

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 676 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 735


>gi|344285102|ref|XP_003414302.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Loxodonta
           africana]
          Length = 747

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|417412559|gb|JAA52658.1| Putative transcriptional repressor ezh1, partial [Desmodus
           rotundus]
          Length = 751

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 616 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 673

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 674 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 733


>gi|410981177|ref|XP_003996949.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1 [Felis
           catus]
          Length = 747

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|281344548|gb|EFB20132.1| hypothetical protein PANDA_011171 [Ailuropoda melanoleuca]
          Length = 749

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 614 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFN 671

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 672 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 731


>gi|426238091|ref|XP_004012991.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 2 [Ovis
           aries]
          Length = 707

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 572 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 629

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 630 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 689


>gi|403304420|ref|XP_003942795.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 747

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|301773590|ref|XP_002922190.1| PREDICTED: histone-lysine N-methyltransferase EZH1-like [Ailuropoda
           melanoleuca]
          Length = 747

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|431890589|gb|ELK01468.1| Histone-lysine N-methyltransferase EZH1 [Pteropus alecto]
          Length = 744

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 609 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 666

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 667 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 726


>gi|410981179|ref|XP_003996950.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 2 [Felis
           catus]
          Length = 677

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 542 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 599

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 600 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 659


>gi|149723756|ref|XP_001493467.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 1 [Equus
           caballus]
          Length = 747

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 669

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 670 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729


>gi|73965665|ref|XP_849127.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 2 [Canis
           lupus familiaris]
          Length = 759

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 624 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 681

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 682 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 741


>gi|403304422|ref|XP_003942796.1| PREDICTED: histone-lysine N-methyltransferase EZH1 isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 677

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 75/120 (62%), Gaps = 2/120 (1%)

Query: 796 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRLKISDIRECRYEKMGIGSSYLFR 855
           KK L    S +  WG    E ++  +F+ EY GELI    +D R   Y+K    SS+LF 
Sbjct: 542 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFN 599

Query: 856 LDDGYVVDATKRGGIARFINHSCNPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKF 915
           L++ +VVDAT++G   RF NHS NPNCY KV+ V G  +I I+AKR I AGEE+ ++Y++
Sbjct: 600 LNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 659


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.130    0.377 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,921,200,602
Number of Sequences: 23463169
Number of extensions: 576743054
Number of successful extensions: 1592983
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3172
Number of HSP's successfully gapped in prelim test: 2562
Number of HSP's that attempted gapping in prelim test: 1572274
Number of HSP's gapped (non-prelim): 13301
length of query: 936
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 784
effective length of database: 8,792,793,679
effective search space: 6893550244336
effective search space used: 6893550244336
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 82 (36.2 bits)