BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 004423
         (754 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449436315|ref|XP_004135938.1| PREDICTED: uncharacterized protein LOC101208296 [Cucumis sativus]
 gi|449488832|ref|XP_004158186.1| PREDICTED: uncharacterized protein LOC101230410 [Cucumis sativus]
          Length = 847

 Score =  968 bits (2502), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 503/766 (65%), Positives = 584/766 (76%), Gaps = 53/766 (6%)

Query: 5   EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKY 64
           EDYARRFESG +  AS +I GEEQGQSN NVMCR+CF GENE  ERAR+MLSCK+CGKKY
Sbjct: 119 EDYARRFESGNL-DASGNIVGEEQGQSNVNVMCRICFFGENESSERARKMLSCKTCGKKY 177

Query: 65  HRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHK 124
           HR+CLK+WAQ+RDLFHWSSW CPSCR CE+CRRTGDPNKFMFC+RCD AYHCYCQHPPHK
Sbjct: 178 HRSCLKSWAQHRDLFHWSSWTCPSCRACEVCRRTGDPNKFMFCKRCDGAYHCYCQHPPHK 237

Query: 125 NVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD 184
           NVSSGPYLCPKHT+CHSCGSNVPGNG SVRWFLGYT CDACGRLFVKGNYCPVCLKVYRD
Sbjct: 238 NVSSGPYLCPKHTRCHSCGSNVPGNGQSVRWFLGYTFCDACGRLFVKGNYCPVCLKVYRD 297

Query: 185 SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRE 244
           SESTPMVCCD+CQRWVHC CD ISDEKYLQFQ+DGNLQY+C  CRGECYQV++LEDAV+E
Sbjct: 298 SESTPMVCCDICQRWVHCHCDSISDEKYLQFQIDGNLQYKCTACRGECYQVKNLEDAVQE 357

Query: 245 LWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGV 304
           +WRR+D AD+DLI +LRAAAGLPT+DEIFSISPYSDDEENGP V+KNEFGRSLKLSLKG 
Sbjct: 358 IWRRRDEADRDLIVNLRAAAGLPTQDEIFSISPYSDDEENGPAVVKNEFGRSLKLSLKGF 417

Query: 305 VDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQSP 364
            DK PKK K++GKK  NKKY ++KG   PL ++ E DQ+FE  +DV   G   G++    
Sbjct: 418 ADKVPKKSKDYGKKSSNKKYAKEKG--TPLANQSELDQNFEVRNDVQQSGFGEGNEKNGG 475

Query: 365 -----KNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTS 418
                 NEGLD  S VAG +SH EG CS++QPG+LKHK+VDEVMVSD++K S+ V+ K S
Sbjct: 476 LLPQNNNEGLDT-SPVAGSLSHNEGTCSVNQPGVLKHKFVDEVMVSDEEKTSKVVQIKAS 534

Query: 419 KPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGI 478
           K   LD+GED GK+ SKSKT K KKLVINLGARKINV  SP+SDASSCQR QDL  SN  
Sbjct: 535 KAQGLDTGEDSGKYASKSKTAKGKKLVINLGARKINVATSPKSDASSCQRGQDLAVSN-- 592

Query: 479 EDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDS 538
                                    G++V++SSQS GLK      +V  FG+VR   SD+
Sbjct: 593 -------------------------GEKVNNSSQSTGLKAGETENSVPSFGKVRFGSSDT 627

Query: 539 NTKVSRGSSADEHE---PEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDR----KQLESR 591
           NT   RG++A   E   P+   V S KRN++ S  AV  +G V+ ++ ++    KQLES 
Sbjct: 628 NTTFGRGNTASGSEVGPPDGTRVFSRKRNMEGSTPAVGSLGGVSTVKEEKVPSGKQLESG 687

Query: 592 PNASRESNDD---TSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSK 648
            +   + +DD   T + QSLP+DSKP L+ KF+KP L+N   Q+S  EEEKSL+KGQRSK
Sbjct: 688 SHICNDGHDDNGQTPLPQSLPRDSKPLLKFKFKKPPLDN---QISCHEEEKSLVKGQRSK 744

Query: 649 RKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSW 708
           RKRPSP  EK  FNE ED  +S+QD+L+    DANWILKKLGKDAIGKRVEV   SD SW
Sbjct: 745 RKRPSPLMEKVPFNEVEDLTRSHQDNLLD---DANWILKKLGKDAIGKRVEVQHPSDKSW 801

Query: 709 HKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRSMS 754
            KGVV D ++GTSTLS+ LDD R KTLELGKQG+R VP KQKRS S
Sbjct: 802 QKGVVRDMIDGTSTLSVALDDGREKTLELGKQGIRLVPLKQKRSKS 847


>gi|224106097|ref|XP_002314042.1| predicted protein [Populus trichocarpa]
 gi|222850450|gb|EEE87997.1| predicted protein [Populus trichocarpa]
          Length = 845

 Score =  918 bits (2372), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 483/762 (63%), Positives = 566/762 (74%), Gaps = 59/762 (7%)

Query: 9   RRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNC 68
           ++ ESG    AS+D  GE+ G       C++CFVG+  G ERAR+ML CKSCGKKYHR+C
Sbjct: 123 KKVESGDT-VASEDTPGEDTGP-----FCQICFVGQTGGSERARKMLPCKSCGKKYHRSC 176

Query: 69  LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSS 128
           LK WA++RDLFHWSSW CPSC+ CE+CR+TGDPNKF+FC+RCD AYHCYCQHPPHKNVSS
Sbjct: 177 LKTWARHRDLFHWSSWTCPSCQTCEVCRKTGDPNKFVFCKRCDGAYHCYCQHPPHKNVSS 236

Query: 129 GPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSEST 188
           GPYLCPKHT+CHSCGS+VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSEST
Sbjct: 237 GPYLCPKHTRCHSCGSSVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSEST 296

Query: 189 PMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRR 248
           PMVCCD+CQRWVHC CDGISDEKYLQFQVDGNLQY+C TCRGECYQV+DL+DA++ELWRR
Sbjct: 297 PMVCCDICQRWVHCHCDGISDEKYLQFQVDGNLQYQCATCRGECYQVKDLKDAIQELWRR 356

Query: 249 KDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKS 308
           +D AD+ LIASLRAAAGLP +++IFSISPYSD + NGP  L+N+F  S+ LSLKG+  KS
Sbjct: 357 RDKADRGLIASLRAAAGLPAQEDIFSISPYSDGDGNGPEALRNDFRHSINLSLKGIGGKS 416

Query: 309 PKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGD----DTQSP 364
           PKK  +HGKK  NKK+P+KKG      SK EP Q     HD+HS  +   D    D++S 
Sbjct: 417 PKKSNDHGKKHWNKKFPKKKGCHAASISKSEPHQ-----HDIHSSVHDMDDCKIYDSESQ 471

Query: 365 KNEGLDIP-SSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV-KFKTSKPHD 422
              G D   S VAGIV+HTEGVCSISQPG+LKHK+VDEVMVSD ++ S V K K++KPHD
Sbjct: 472 AKGGSDKSCSPVAGIVNHTEGVCSISQPGVLKHKFVDEVMVSDGERTSNVFKIKSNKPHD 531

Query: 423 LDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPS 482
           +DSG D  KH  KSK++KAK+LVINLGARKINV++ P+SD  SCQ E DL  SN      
Sbjct: 532 VDSGGDTEKHAGKSKSVKAKRLVINLGARKINVSSPPKSDVQSCQSELDLKASNR----- 586

Query: 483 LQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGL-KIAGRGGNVIKFGRVRQEVSDSNTK 541
                                 D  DHS Q+RGL K A R GN+IKFG+V+ E S+ N K
Sbjct: 587 ----------------------DTADHSGQTRGLIKFARREGNLIKFGKVKAEASNFNPK 624

Query: 542 VSRGSSADEHEP---EHMHVLSGKRNIDRSRAAVSRVG-EVAALRGDR----KQLESRPN 593
              GS +D +E    +H  V S K++++ SRA V   G EV  LR D+    KQ E RP+
Sbjct: 625 SDGGSHSDGYETVPLDHARVSSAKKSLEGSRAVVRPAGGEVPTLRSDKLSLGKQSEVRPD 684

Query: 594 ASRESNDD---TSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRK 650
              ESN D   T +  SLPK+SK  L+LK +KPNLENQ+S +   EEEKS I+GQRSKRK
Sbjct: 685 THTESNGDSGDTPIFHSLPKESKLSLKLKIKKPNLENQSSLIHLHEEEKSNIRGQRSKRK 744

Query: 651 RPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHK 710
           R S   EKT++NEDE    S+ DS M+E   AN ILKKLGKDAIGKRVEVHQ SDNSWHK
Sbjct: 745 RASSLMEKTMYNEDEGMPPSHLDSEMTE---ANRILKKLGKDAIGKRVEVHQPSDNSWHK 801

Query: 711 GVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
           GVV+D VEGTS LS+TLDD  VKTL+LGKQ VR V QKQKRS
Sbjct: 802 GVVSDIVEGTSKLSVTLDDGIVKTLKLGKQAVRIVSQKQKRS 843


>gi|356544287|ref|XP_003540585.1| PREDICTED: uncharacterized protein LOC100815407 [Glycine max]
          Length = 845

 Score =  891 bits (2303), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 467/764 (61%), Positives = 570/764 (74%), Gaps = 34/764 (4%)

Query: 1   MVAAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSC 60
           M+AAE+YARRFESG V     ++ GEEQGQ+N +  CR+C  GENEG E+A++MLSCKSC
Sbjct: 102 MIAAEEYARRFESGDVQNTPGNLTGEEQGQANRS-YCRICKCGENEGSEKAQKMLSCKSC 160

Query: 61  GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
           GKKYHRNCL++W +NRDLFHWSSW CP CRICE CRRTGDP+KFMFC+RCD AYHCYC  
Sbjct: 161 GKKYHRNCLRSWGRNRDLFHWSSWTCPLCRICEACRRTGDPSKFMFCKRCDGAYHCYCLQ 220

Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLK 180
           PPHK+V +GPYLC KH +CHSCGSNVPGNGLSVRWF+ YT CDACGRLF KGNYCPVCLK
Sbjct: 221 PPHKSVCNGPYLCTKHARCHSCGSNVPGNGLSVRWFMAYTNCDACGRLFTKGNYCPVCLK 280

Query: 181 VYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLED 240
           VYRDSESTPMVCCD CQ WVHCQCD IS+EKY QFQVDGNLQY+CPTCRGECYQV++ ED
Sbjct: 281 VYRDSESTPMVCCDTCQLWVHCQCDNISEEKYHQFQVDGNLQYKCPTCRGECYQVKNPED 340

Query: 241 AVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLS 300
           A +E+WRR+++A++DLI+SLRAAAGLPT++EIFSISP+SDDE++GP+ LK+E  RS K S
Sbjct: 341 AAQEIWRRRNIAERDLISSLRAAAGLPTQEEIFSISPFSDDEDSGPLKLKSESARSFKFS 400

Query: 301 LKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDV---HSYGNSF 357
           LK + + SPKK         +KK  +KK  Q  + SK +   S EG+ D+   HS  +  
Sbjct: 401 LKNLANDSPKKKTS------SKKTAKKKNSQSFMTSKIDTHNSCEGHSDIKSLHSLDDDK 454

Query: 358 GDDTQSPKNEGLDIPSSVA-GIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKF 415
            DD QS +NEG D+ SS A G +S TE    I+QPGILK K+VDEVMVSD+++  R V+ 
Sbjct: 455 NDDIQSQRNEGPDVYSSPATGSLSQTEASFPINQPGILKQKFVDEVMVSDEERKPRVVRI 514

Query: 416 KTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTS 475
           K++K H  DS E+ GKH  K++ +K KKLVINLGARKINV +SPRSD+SSCQ++QD  T 
Sbjct: 515 KSNKAHIPDSEEESGKHSLKTQNVKGKKLVINLGARKINVASSPRSDSSSCQKDQDPVTV 574

Query: 476 NGIEDPSLQRMNSKFVLDRHDGSSKL--GDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQ 533
           NG ED S  R   KF LDR D +++   G G +VD S QS+  +++GR GN+IK G+V+ 
Sbjct: 575 NGNEDRSQWRKGDKFALDRQDDTARHIDGKGIKVD-SGQSKFFRVSGREGNLIKLGKVKP 633

Query: 534 EVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPN 593
           ++S+ N    RG+ +D            K +ID     +++VG  A  RG+R  L  +  
Sbjct: 634 DISEFNLTSGRGNMSDGRI---------KHSID---GMINQVGIKATSRGERTYLGRQSE 681

Query: 594 ASRES-----NDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSK 648
            S ++     N++ +   SLPKDSKP LR KF+KP++E+QNS     EEEK  IKGQRSK
Sbjct: 682 GSSDAYETDDNNNRTPSHSLPKDSKPLLRFKFKKPSIESQNS--PHQEEEKMTIKGQRSK 739

Query: 649 RKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSW 708
           RKRPSPF EK  FNE E  +QS+QDS M  IMDANWIL KLG DAIGKRVEVHQ SDNSW
Sbjct: 740 RKRPSPFKEKASFNESEGVSQSHQDSAMDGIMDANWILMKLGNDAIGKRVEVHQTSDNSW 799

Query: 709 HKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
           HKG+VTD VEGTS L + LDD +VKT+EL KQGVRFVPQKQKRS
Sbjct: 800 HKGLVTDVVEGTSKLYVALDDGKVKTVELRKQGVRFVPQKQKRS 843


>gi|356529861|ref|XP_003533505.1| PREDICTED: uncharacterized protein LOC100809429 [Glycine max]
          Length = 820

 Score =  878 bits (2268), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 458/762 (60%), Positives = 555/762 (72%), Gaps = 59/762 (7%)

Query: 1   MVAAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSC 60
           M+AAE+YARRFESG V     ++ G++QGQ+N +  CR+C  GENEG E+A++MLSCKSC
Sbjct: 106 MIAAEEYARRFESGDVQNTPGNVTGDDQGQANRS-YCRICKCGENEGSEKAQKMLSCKSC 164

Query: 61  GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
           GKKYHRNCL++W +NRDLFHWSSW CP CRICE CRRTGDP+KFMFC+RCD AYHCYC  
Sbjct: 165 GKKYHRNCLRSWGRNRDLFHWSSWTCPLCRICEACRRTGDPSKFMFCKRCDGAYHCYCLQ 224

Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLK 180
           PPHK+V +GPYLC KH +CHSCGSNVPGNGLSVRWF+ YT CDACGRLF KGNYCPVCLK
Sbjct: 225 PPHKSVCNGPYLCTKHARCHSCGSNVPGNGLSVRWFMSYTNCDACGRLFTKGNYCPVCLK 284

Query: 181 VYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLED 240
           VYRDSESTPMVCCD CQ WVHCQCD ISDEKY QFQ+DGNLQY+CPTCRGECYQV++ ED
Sbjct: 285 VYRDSESTPMVCCDSCQLWVHCQCDNISDEKYHQFQLDGNLQYKCPTCRGECYQVKNPED 344

Query: 241 AVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLS 300
           A RE+WRR+++A++DLIASLRAAAGLPT++EIFSISP+SDDE++GP+ LK+E  RS K S
Sbjct: 345 AAREIWRRRNIAERDLIASLRAAAGLPTQEEIFSISPFSDDEDSGPLKLKSESARSFKFS 404

Query: 301 LKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDV---HSYGNSF 357
           LK + + SPKK         +KK  +KK  Q+ + SK +   S EG+ D+   HS  +  
Sbjct: 405 LKNLANDSPKKKSS------SKKTAKKKDSQLFMTSKIDTHNSCEGHSDIKSLHSLDDDK 458

Query: 358 GDDTQSPKNEGLDIPSS-VAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKF 415
            DD QS +NEG D+ SS  AG +S TE    I QPGILK K+VDEVMVSD+++  R V+ 
Sbjct: 459 NDDIQSQRNEGPDVYSSPAAGSLSQTEASFPIDQPGILKQKFVDEVMVSDEERKPRVVRI 518

Query: 416 KTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTS 475
           K++K    DS E+ GKH  K++ +K KKLVINLGARKINV +SPRSD SSCQ++QD  T 
Sbjct: 519 KSNKALIPDSEEESGKHSLKTQNVKGKKLVINLGARKINVASSPRSDTSSCQKDQDPVTV 578

Query: 476 NGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEV 535
           N                           G++VD S QS+  +++GR GN+IK G+V+ +V
Sbjct: 579 N---------------------------GNKVD-SGQSKIFRVSGREGNLIKLGKVKPDV 610

Query: 536 SDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNAS 595
           S+ N    RG+ +D            K +ID     +++VG  A  RG+R  L  +   S
Sbjct: 611 SEFNLTSGRGNMSDGRI---------KHSID---GMINQVGIKAPSRGERTYLGKQSEGS 658

Query: 596 RES-----NDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRK 650
            ++     N++ +   SLPKDSKP LR KF+KP++E+QNS  SQ EEEK  IKGQRSKRK
Sbjct: 659 SDAYETDDNNNRTPSHSLPKDSKPLLRFKFKKPSIESQNS--SQQEEEKMTIKGQRSKRK 716

Query: 651 RPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHK 710
           RPSPF EKT FNE E  +QS QDS M  IMDANWIL KLG DAIGKRVEVHQ SDNSWHK
Sbjct: 717 RPSPFKEKTTFNESEGVSQSRQDSAMDGIMDANWILMKLGNDAIGKRVEVHQTSDNSWHK 776

Query: 711 GVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
           GVVTD VEGTS L + LDD +VK +EL KQGVRFVPQKQKRS
Sbjct: 777 GVVTDVVEGTSKLYVALDDGKVKNVELRKQGVRFVPQKQKRS 818


>gi|297736278|emb|CBI24916.3| unnamed protein product [Vitis vinifera]
          Length = 679

 Score =  838 bits (2164), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/554 (74%), Positives = 462/554 (83%), Gaps = 11/554 (1%)

Query: 2   VAAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCG 61
           + AEDYARRFESG +   SKDI GEEQ QSN NVMCR+CF GE EG ERAR+ML C SCG
Sbjct: 109 MVAEDYARRFESGDLVDTSKDIVGEEQSQSNVNVMCRICFFGEMEGSERARKMLPCNSCG 168

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           KKYHR CLK+W+QNRDLFHWSSW CPSCRICE+CRR+GDPNKFMFCRRCD AYHCYCQ P
Sbjct: 169 KKYHRLCLKSWSQNRDLFHWSSWTCPSCRICEVCRRSGDPNKFMFCRRCDDAYHCYCQQP 228

Query: 122 PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 181
           PHKNVSSGPYLCPKHT+CHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV
Sbjct: 229 PHKNVSSGPYLCPKHTRCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 288

Query: 182 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
           YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQY+C TCRGECYQV+DLEDA
Sbjct: 289 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYKCATCRGECYQVKDLEDA 348

Query: 242 VRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSL 301
           V+ELWRR+D AD+DLIASLRA A LPT+DEIFSISPYSDDEENGPV LK+EFGRSLKLSL
Sbjct: 349 VQELWRRRDKADRDLIASLRAKARLPTQDEIFSISPYSDDEENGPVSLKSEFGRSLKLSL 408

Query: 302 KGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDD- 360
           KG VDKSPKK KE+GK+  NKK  +KKG+Q PL SK E  QSFEG+ D   +  S GDD 
Sbjct: 409 KGSVDKSPKKTKEYGKQSSNKKNVKKKGHQTPLISKKESHQSFEGHDDAQPFEYSLGDDK 468

Query: 361 -TQSPKNEGLDIPSS-VAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV-KFKT 417
             Q  +++G  + SS VAG +SHTEG+CSI+QPG+LKHK+VDE+ V+++D+ SRV + K+
Sbjct: 469 NEQPNRSDGRGVFSSPVAGSLSHTEGICSINQPGVLKHKFVDEIAVNNEDRTSRVIQIKS 528

Query: 418 SKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNG 477
           +KPH  D GED GK  SKSKT+K  KLVI+LGAR  NVTNSPRSDASSCQREQDLTTSNG
Sbjct: 529 NKPHGSDVGEDTGKQASKSKTMKGTKLVIHLGARNRNVTNSPRSDASSCQREQDLTTSNG 588

Query: 478 IEDPSLQRMNSKFVLDRHDGSSKLGD--GDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEV 535
            ED S QRM      D+HD  +K GD  GD++D+S Q++G K  GR GN+IK G+VR E 
Sbjct: 589 SEDTSQQRMG-----DKHDRIAKFGDSKGDKIDYSGQAKGSKHGGREGNLIKLGKVRTEP 643

Query: 536 SDSNTKVSRGSSAD 549
           S+ N K  RG+  D
Sbjct: 644 SEMNPKFGRGNKDD 657


>gi|297833588|ref|XP_002884676.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330516|gb|EFH60935.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 778

 Score =  827 bits (2135), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 440/757 (58%), Positives = 518/757 (68%), Gaps = 97/757 (12%)

Query: 3   AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGK 62
           AAEDYARRFESG     S D AGEE   S  N+MCR+CF+GE EG ERARRMLSCK+CGK
Sbjct: 114 AAEDYARRFESGVNDLTSNDHAGEEVVHSGMNIMCRMCFLGEGEGSERARRMLSCKTCGK 173

Query: 63  KYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
           KYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+CRRTGDPNKFMFC+RCDAAYHCYCQHPP
Sbjct: 174 KYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVCRRTGDPNKFMFCKRCDAAYHCYCQHPP 233

Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVY 182
           HKNVSSGPYLCPKHT+CHSC S VPGNGLSVRWFL YTCCDACGRLFVKGNYCPVCLKVY
Sbjct: 234 HKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRWFLSYTCCDACGRLFVKGNYCPVCLKVY 293

Query: 183 RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAV 242
           RDSESTPMVCCD+CQRWVHC CDGISD+KYLQFQVDG LQY+C TCRGECYQV+DL+DAV
Sbjct: 294 RDSESTPMVCCDICQRWVHCHCDGISDDKYLQFQVDGKLQYKCATCRGECYQVKDLQDAV 353

Query: 243 RELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLK 302
           +ELW++KD+ DK+LIASLRAAAGLPT++EIFSI P+SDD+ENGPV      GRSLK S+K
Sbjct: 354 QELWKKKDVVDKELIASLRAAAGLPTDEEIFSIFPFSDDDENGPVS-----GRSLKFSIK 408

Query: 303 GVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQ 362
           G+V+KSPKK KE+GK  L+KK+  KKG    L    EP+   E   +    G    D+  
Sbjct: 409 GLVEKSPKKSKEYGKHSLSKKHASKKGSHTKL----EPELHQEVGSERLRLGGVRIDNVG 464

Query: 363 SPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPH 421
              NE  D+ SSVAGI       CS  +P I+KHK VD+VMV+D++K SR V+ K SKPH
Sbjct: 465 FQINEQSDVNSSVAGI-------CSTHEPKIVKHKRVDDVMVTDEEKPSRIVRIKCSKPH 517

Query: 422 DLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDP 481
           D DS ED  ++  + K++KAKKLVINLGARKINV+ S +S+  S                
Sbjct: 518 DSDS-EDTLRNAGEEKSVKAKKLVINLGARKINVSGSSKSNVVS---------------- 560

Query: 482 SLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTK 541
                     L R    S LG GD+VD + + R LKI+GR      FG+ + E S     
Sbjct: 561 ---------HLSRDKDQSTLG-GDKVDQTGEVRTLKISGR------FGKTQSEGS----- 599

Query: 542 VSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNASRESNDD 601
                                                 A  G   Q  +  +     +D 
Sbjct: 600 -------------------------------------KATFGSITQFPASTSEGNHVDDK 622

Query: 602 TSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEK-SLIKGQRSKRKRPSPFTEKT 659
           TS+  +L K+++P L+ K RKPN  +Q S V +Q E+EK S  KGQRSKRKRPS   +  
Sbjct: 623 TSISPALQKEARPLLKFKLRKPNSGDQTSSVTTQSEDEKLSSAKGQRSKRKRPSSLVDMA 682

Query: 660 LFNEDEDA-AQSNQDSLMS-EIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTV 717
              ED +A   S+QDS  + E+MDANWILKKLGKD+IGKRVEVH  S NSWHKG VTD  
Sbjct: 683 SLKEDGEATTHSHQDSSRNDEMMDANWILKKLGKDSIGKRVEVH-GSQNSWHKGTVTDVS 741

Query: 718 EGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRSMS 754
             TSTLS++LDD  +KT ELGK  VRF+PQKQKRS S
Sbjct: 742 GDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQKRSRS 778


>gi|145338256|ref|NP_187459.2| PHD finger-containing protein [Arabidopsis thaliana]
 gi|110739634|dbj|BAF01725.1| hypothetical protein [Arabidopsis thaliana]
 gi|110741394|dbj|BAF02246.1| hypothetical protein [Arabidopsis thaliana]
 gi|332641110|gb|AEE74631.1| PHD finger-containing protein [Arabidopsis thaliana]
          Length = 779

 Score =  809 bits (2090), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/758 (57%), Positives = 516/758 (68%), Gaps = 98/758 (12%)

Query: 3   AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGK 62
           AAEDYARRFESG     S D AGEE G S  N+MCR+CF+GE EG +RARRMLSCK CGK
Sbjct: 114 AAEDYARRFESGVNDLTSNDHAGEELGHSGMNIMCRMCFLGEGEGSDRARRMLSCKDCGK 173

Query: 63  KYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
           KYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+CRRTGDPNKFMFC+RCDAAYHCYCQHPP
Sbjct: 174 KYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVCRRTGDPNKFMFCKRCDAAYHCYCQHPP 233

Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVY 182
           HKNVSSGPYLCPKHT+CHSC S VPGNGLSVRWFL YTCCDACGRLFVKGNYCPVCLKVY
Sbjct: 234 HKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRWFLSYTCCDACGRLFVKGNYCPVCLKVY 293

Query: 183 RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAV 242
           RDSESTPMVCCD+CQRWVHC CDGISD+KY+QFQVDG LQY+C TCRGECYQV+DL+DAV
Sbjct: 294 RDSESTPMVCCDICQRWVHCHCDGISDDKYMQFQVDGKLQYKCATCRGECYQVKDLQDAV 353

Query: 243 RELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLK 302
           +ELW++KD+ DK+LIASLRAAAGLPTE+EIFSI P+SDDEENGPV      GRSLK S+K
Sbjct: 354 QELWKKKDVVDKELIASLRAAAGLPTEEEIFSIFPFSDDEENGPVS-----GRSLKFSIK 408

Query: 303 GVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQ 362
           G+V+KSPKK KE+G         +K   +   ++K EP+   E   +    G    D+  
Sbjct: 409 GLVEKSPKKSKEYG----KHSSSKKHASKKGSHTKLEPEVHQEIGSERRRLGGVRIDNVG 464

Query: 363 SPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPH 421
              NE  D+ SSVAGI       CS  +P I+KHK VD+VMV+D++K SR V+ K SKPH
Sbjct: 465 FQINEQSDVNSSVAGI-------CSTHEPKIVKHKRVDDVMVTDEEKPSRIVRIKCSKPH 517

Query: 422 DLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDP 481
           D DS ED  ++  + K++KAKKLVINLGARKINV+ S +S+  S                
Sbjct: 518 DSDS-EDTLRNAGEEKSVKAKKLVINLGARKINVSGSSKSNVVS---------------- 560

Query: 482 SLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNT- 540
                     L R    S LG GD+VD + + R LKI+GR      FG+ + E S +   
Sbjct: 561 ---------HLSRDKDQSTLG-GDKVDQTGEVRTLKISGR------FGKTQSEGSKATFG 604

Query: 541 KVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNASRESND 600
            V++  +A   E  H+                                          +D
Sbjct: 605 SVTQFPAASTSEGNHV------------------------------------------DD 622

Query: 601 DTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEK-SLIKGQRSKRKRPSPFTEK 658
            TS+  +L K+++P L+ K RKPN  +Q S V +Q E+EK S  KGQRSKRKRPS   + 
Sbjct: 623 KTSISPALQKEARPLLKFKLRKPNSGDQTSSVTTQSEDEKLSSAKGQRSKRKRPSSLVDM 682

Query: 659 TLFNEDEDA-AQSNQD-SLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDT 716
               ED +A   S+QD S   E+MDANWILKKLGKD+IGKRVEVH  S NSW KG VTD 
Sbjct: 683 ASLKEDGEATTHSHQDNSRNDEMMDANWILKKLGKDSIGKRVEVH-GSQNSWRKGTVTDV 741

Query: 717 VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRSMS 754
              TSTLS++LDD  +KT ELGK  VRF+PQKQKRS S
Sbjct: 742 SGDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQKRSRS 779


>gi|6648214|gb|AAF21212.1|AC013483_36 unknown protein [Arabidopsis thaliana]
          Length = 764

 Score =  773 bits (1996), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/758 (55%), Positives = 502/758 (66%), Gaps = 113/758 (14%)

Query: 3   AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGK 62
           AAEDYARRFESG     S D AGEE G S  N+MCR+CF+GE EG +RARRMLSCK CGK
Sbjct: 114 AAEDYARRFESGVNDLTSNDHAGEELGHSGMNIMCRMCFLGEGEGSDRARRMLSCKDCGK 173

Query: 63  KYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
           KYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+CRRTGDPNKFMFC+RCDAAYHCYCQHPP
Sbjct: 174 KYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVCRRTGDPNKFMFCKRCDAAYHCYCQHPP 233

Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVY 182
           HKNVSSGPYLCPKHT+CHSC S VPGNGLSVRWFL YTCCDACGRLFVKGNYCPVCLKVY
Sbjct: 234 HKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRWFLSYTCCDACGRLFVKGNYCPVCLKVY 293

Query: 183 RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAV 242
           RDSESTPMVCCD+CQRWVHC CDGISD+KY+QFQVDG LQY+C TCRGECYQV+DL+DAV
Sbjct: 294 RDSESTPMVCCDICQRWVHCHCDGISDDKYMQFQVDGKLQYKCATCRGECYQVKDLQDAV 353

Query: 243 RELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLK 302
           +ELW++KD+ DK+LIASLRAAA               DDEENGPV      GRSLK S+K
Sbjct: 354 QELWKKKDVVDKELIASLRAAA---------------DDEENGPVS-----GRSLKFSIK 393

Query: 303 GVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQ 362
           G+V+KSPKK KE+G         +K   +   ++K EP+   E   +    G    D+  
Sbjct: 394 GLVEKSPKKSKEYG----KHSSSKKHASKKGSHTKLEPEVHQEIGSERRRLGGVRIDNVG 449

Query: 363 SPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPH 421
              NE  D+ SSVAGI       CS  +P I+KHK VD+VMV+D++K SR V+ K SKPH
Sbjct: 450 FQINEQSDVNSSVAGI-------CSTHEPKIVKHKRVDDVMVTDEEKPSRIVRIKCSKPH 502

Query: 422 DLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDP 481
           D DS ED  ++  + K++KAKKLVINLGARKINV+ S +S+  S                
Sbjct: 503 DSDS-EDTLRNAGEEKSVKAKKLVINLGARKINVSGSSKSNVVS---------------- 545

Query: 482 SLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSN-T 540
                     L R    S LG GD+VD + + R LKI+GR      FG+ + E S +   
Sbjct: 546 ---------HLSRDKDQSTLG-GDKVDQTGEVRTLKISGR------FGKTQSEGSKATFG 589

Query: 541 KVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNASRESND 600
            V++  +A   E  H+                                          +D
Sbjct: 590 SVTQFPAASTSEGNHV------------------------------------------DD 607

Query: 601 DTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEK-SLIKGQRSKRKRPSPFTEK 658
            TS+  +L K+++P L+ K RKPN  +Q S V +Q E+EK S  KGQRSKRKRPS   + 
Sbjct: 608 KTSISPALQKEARPLLKFKLRKPNSGDQTSSVTTQSEDEKLSSAKGQRSKRKRPSSLVDM 667

Query: 659 TLFNEDEDA-AQSNQD-SLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDT 716
               ED +A   S+QD S   E+MDANWILKKLGKD+IGKRVEVH  S NSW KG VTD 
Sbjct: 668 ASLKEDGEATTHSHQDNSRNDEMMDANWILKKLGKDSIGKRVEVH-GSQNSWRKGTVTDV 726

Query: 717 VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRSMS 754
              TSTLS++LDD  +KT ELGK  VRF+PQKQKRS S
Sbjct: 727 SGDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQKRSRS 764


>gi|413916644|gb|AFW56576.1| RING/FYVE/PHD-type zinc finger family protein [Zea mays]
          Length = 819

 Score =  620 bits (1598), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 356/773 (46%), Positives = 462/773 (59%), Gaps = 83/773 (10%)

Query: 5   EDYARRFESGYVATASKDIAGEE-------QGQSNTNVMCRLCFVGENEGCERARRMLSC 57
           ED ARRFESG  A  S +  G+E       QG +   VMCRLCF GENEG  +A +ML C
Sbjct: 99  EDLARRFESG--AYGSPEAEGDEDEWDREDQGNAAVKVMCRLCFSGENEGSTKAAKMLPC 156

Query: 58  KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCY 117
           K C K+YHRNCLK+W ++RDLFHWSSW CPSCR CE+CRR GDPNK MFC+RCD  YHCY
Sbjct: 157 KLCSKRYHRNCLKSWGEHRDLFHWSSWVCPSCRSCEVCRRPGDPNKLMFCKRCDDPYHCY 216

Query: 118 CQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
           CQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPV
Sbjct: 217 CQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPV 276

Query: 178 CLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRD 237
           CLKVYRDSE  PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C  CRGEC Q+RD
Sbjct: 277 CLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQFQADQNLQYTCAACRGECSQIRD 336

Query: 238 LEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSL 297
            EDA+RELW+R+D+AD +L+ +LRAAA LP+ +++  + P SDDE+ G  VLK+E   +L
Sbjct: 337 TEDAIRELWKRRDVADHELMITLRAAAKLPSLEDVSPLYPNSDDEKLGAYVLKSESRNTL 396

Query: 298 KLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHS 352
           K SLK    K P    E  K        NKK  +KKG Q    +    +   E  HDV S
Sbjct: 397 KFSLKSNSSKPPPDTPEQEKVVFKSSGSNKKPSKKKGGQGNKTNDGHDEIFLERRHDVKS 456

Query: 353 YGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR 412
             +  GD +    ++     +     +S +    + S    LK   +  V  ++ D I +
Sbjct: 457 SNSRLGDQSIDGNHDMSPFKNDDNAYISSS----TRSSEKNLKSPSMKAV-TNNADMIPK 511

Query: 413 VKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDL 472
           VK K SK   L   +D  ++ SK+ T KA KLVI+LG+R    + SP+S+ S+ QREQDL
Sbjct: 512 VKIKGSKVSSLHY-KDGEENTSKADTGKATKLVIHLGSRHKTRSGSPKSELSNYQREQDL 570

Query: 473 TTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVR 532
            + +                           G ++D +SQ +G +   +  +V+K  R  
Sbjct: 571 GSIH---------------------------GRKLDVTSQLKGSRSEVKERSVMKLVR-E 602

Query: 533 QEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRP 592
             V   N+ +    ++ +H        +GK    RS A +S         G+     +RP
Sbjct: 603 TGVQQRNSLLGDLGTSKKH-------ATGK----RSNALIS-----GMENGNETGTRNRP 646

Query: 593 NASRESN----DDTSVLQSLPKDSKPP-LRLKFRKPNLENQNSQVSQPEEEKSLI----- 642
            A ++S+    D+       P + KP  L+LKF++P+ E  N+Q SQPEE  S +     
Sbjct: 647 FAQKQSHSSQVDENQGTADSPDNLKPSLLKLKFKRPHYEQLNTQASQPEEPTSWVSQQED 706

Query: 643 -----KGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKR 697
                KGQRSKRKRPS   EK    +    A+ +  S   E+MDANWIL+KLGKDAIGKR
Sbjct: 707 QFNVAKGQRSKRKRPS--MEKADGLDGTTPAKRHHQSTDDEVMDANWILRKLGKDAIGKR 764

Query: 698 VEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
           +EVH  SD  WH+G+V++ + G  TL I LD+ R + +ELGKQ +R +  + K
Sbjct: 765 IEVHLTSDGKWHQGMVSNVMGG--TLCIQLDNGRSENVELGKQAIRLIASRSK 815


>gi|357151790|ref|XP_003575905.1| PREDICTED: uncharacterized protein LOC100821635 [Brachypodium
           distachyon]
          Length = 809

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 341/772 (44%), Positives = 455/772 (58%), Gaps = 87/772 (11%)

Query: 5   EDYARRFESGY---VATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCG 61
           ED+ARR E  Y    A    D + +E+G +   VMCRLC  GENEG  +A +ML CK C 
Sbjct: 95  EDFARRLEGSYGFPEAEGDDDDSDQEEGNAAIKVMCRLCISGENEGSSKAAKMLPCKLCN 154

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           KKYH+ C+K W ++RDLFHWSSW CPSCR CE+CRR GDPNK MFC+RCD AYHCYCQ P
Sbjct: 155 KKYHKKCVKYWGEHRDLFHWSSWVCPSCRSCEVCRRPGDPNKLMFCKRCDGAYHCYCQQP 214

Query: 122 PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 181
            HKNVS GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPVCLKV
Sbjct: 215 SHKNVSHGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPVCLKV 274

Query: 182 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
           YRDSE  PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C +CRGEC Q+RD EDA
Sbjct: 275 YRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQFQADENLQYTCASCRGECSQIRDAEDA 334

Query: 242 VRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSL 301
           VRELW+R+++ D DL+ SLRAAA LP+ +++   +P SDDE  G  V KN+   +LK S 
Sbjct: 335 VRELWKRRNIVDHDLMVSLRAAAALPSLEDVSPSNPNSDDERLGAFVPKNDGRNTLKFSF 394

Query: 302 KGVVDKSPKKVKEHGKK-------WLNKKYPRKKGYQMPLNSKPEPDQSF-EGYHDVHSY 353
           K    K P  + + G++         NKK+ +KKG Q  + S  +PD+ F E  H+  SY
Sbjct: 395 KSNSSKPP--LDQSGQEKNVPKTSGSNKKHSKKKGNQGNI-SVGDPDEIFLEKRHEAKSY 451

Query: 354 GNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV 413
            N  G   +   N G          + + + V ++S       +  ++   ++ D I +V
Sbjct: 452 SNLGGHTIEG--NHG-------QSTIKNDDSVFTLSAT-----RSSEKGAANNADMIPKV 497

Query: 414 KFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLT 473
           K + SK   L   +    + +KS   K  KLV + G R  + + SP+S+ ++  +EQ+L 
Sbjct: 498 KIRGSKAPSLHFKDVGEVNTAKSDAGKGTKLVFHFGTRHKSGSGSPKSEMTNSHKEQELG 557

Query: 474 TSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQ 533
           + +G                            ++D +SQ +  K   +  +V+K  R   
Sbjct: 558 SLHG---------------------------GKIDVTSQFKSSKSEKKEKSVMKLVR-ET 589

Query: 534 EVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPN 593
            V   N+ +    ++ +H       ++GKR    S A +S + E A   G R +      
Sbjct: 590 GVQQRNSLLGDLGTSKKH-------VTGKR----SNAIISGM-ENAGESGTRSRSFGHKQ 637

Query: 594 ASRESNDDTSVLQSLPKDSKPP------LRLKFRKPNLENQNSQVSQPEE---------E 638
           +      D     SLP ++ P       L+LKF++P+ E  ++QV+QPEE         E
Sbjct: 638 SIPNQLTDNQATASLPVNNSPDSLKPSLLKLKFKRPHFEQPSAQVAQPEETATWASQQEE 697

Query: 639 KSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRV 698
            ++ KGQRSKRKRPS   +K   +E +   + +Q S   E MDA WIL+KLGKDAIGKR+
Sbjct: 698 LNVAKGQRSKRKRPS--MDKMDGSEGKTPGKRHQQSTGDEAMDATWILRKLGKDAIGKRI 755

Query: 699 EVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
           E+   SD  WH+GVV++ + G  TL + LDD   + LELGKQ VR V Q+ K
Sbjct: 756 EIQLPSDGKWHQGVVSNVLSG--TLCVQLDDGSSENLELGKQAVRLVAQRSK 805


>gi|115488844|ref|NP_001066909.1| Os12g0527800 [Oryza sativa Japonica Group]
 gi|77556508|gb|ABA99304.1| PHD-finger family protein, expressed [Oryza sativa Japonica Group]
 gi|113649416|dbj|BAF29928.1| Os12g0527800 [Oryza sativa Japonica Group]
 gi|215717023|dbj|BAG95386.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 688

 Score =  593 bits (1530), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 340/745 (45%), Positives = 445/745 (59%), Gaps = 91/745 (12%)

Query: 36  MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
           MCR+CF GENEG  +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+C
Sbjct: 1   MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVC 60

Query: 96  RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
           RR GDPNK MFC+RCD AYHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 61  RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 120

Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 215
           FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE  PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 180

Query: 216 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 275
           Q D NLQY C  CRGEC Q+RD EDAVRELW+R+D+ D DL+ASLRAAA LP+ +++   
Sbjct: 181 QSDQNLQYTCGACRGECSQIRDTEDAVRELWKRRDVVDHDLMASLRAAAALPSLEDVSPS 240

Query: 276 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGY 330
            P SDDE+ G  V+KN+   +LK S K    K      E  K  +     NKK+ +KKG 
Sbjct: 241 HPNSDDEKLGAYVMKNDGRNTLKFSFKSNSTKPALDSSEQEKNAIKSSGSNKKHSKKKGN 300

Query: 331 QMPLNSKPEPDQSFEGYHDVHSYGNSFGD-------DTQSPKNEGLDIPSSVAGIVSHTE 383
           Q       + +   E  ++  S G S GD       D  S KN   D  + V       E
Sbjct: 301 QNNKTVSEQDEIFLEKRNETKSLG-SLGDQIADVTRDKSSFKN---DADAFVLSSAQSAE 356

Query: 384 GVCSISQPGILKHKYVDEVMVSDDDKISRVKFKTSKP---HDLDSGEDDGKHVSKSKTIK 440
               +       H         + D I +VK K +K    H  D GE++    +KS T K
Sbjct: 357 KALKLQSAKAAAH---------NADMIPKVKIKGTKVPSLHFKDVGEEN---AAKSDTGK 404

Query: 441 AKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSK 500
             KLVI++G+R  + + SP+S+ S+ Q+EQ+L + +G                       
Sbjct: 405 GTKLVIHIGSRHKSRSGSPKSEMSNSQKEQELVSMHG----------------------- 441

Query: 501 LGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLS 560
                +VD +SQ +  +   +  +V+K   VR+     N+ +    ++ +H        +
Sbjct: 442 ----GKVDVTSQFKSSRSEIKEKSVMKL--VRETGVQQNSLLGDLGASKKH-------AT 488

Query: 561 GKRNIDRSRAAVSRVGEVAALRGDRK----QLESRPNASRESNDDTSVLQSLPKDSKPPL 616
           GKR    S A VS + E A+  G R     Q +S  + +    + +  + + P   KP L
Sbjct: 489 GKR----SNAIVSAM-ENASESGTRSRSFGQKQSVNHLTENQGNASFSVNNSPDSLKPSL 543

Query: 617 -RLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 666
            +LKF++P  E  ++Q SQPEE         E ++ KGQRSKRKRPS   +K   +E + 
Sbjct: 544 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 601

Query: 667 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 725
            AA+ ++ S   E MDANWIL+KLGKDAIGKR+EV   SD  WH+GVV++ + G  TL +
Sbjct: 602 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 659

Query: 726 TLDDSRVKTLELGKQGVRFVPQKQK 750
            LD+ R + +ELGK+ +R + Q+ K
Sbjct: 660 QLDNGRSENIELGKRAIRLIAQRSK 684


>gi|414878222|tpg|DAA55353.1| TPA: RING/FYVE/PHD-type zinc finger family protein [Zea mays]
          Length = 818

 Score =  593 bits (1528), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 350/777 (45%), Positives = 460/777 (59%), Gaps = 93/777 (11%)

Query: 5   EDYARRFESGYVAT-----ASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKS 59
           ED ARRFESG   +        +   E+QG +   VMCRLCF GENEG  +A +ML CK 
Sbjct: 100 EDLARRFESGAYGSPEAGEDEDEWDREDQGNAAVKVMCRLCFSGENEGSTKAAKMLPCKL 159

Query: 60  CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQ 119
           C K+YHRNCLK+W ++RDLFHWSSW CPSCR CE+CRR GDPNK MFC+RCD AYHCYCQ
Sbjct: 160 CSKRYHRNCLKSWGEHRDLFHWSSWVCPSCRSCEVCRRPGDPNKLMFCKRCDGAYHCYCQ 219

Query: 120 HPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCL 179
            P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCP+CL
Sbjct: 220 QPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPICL 279

Query: 180 KVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLE 239
           KVYRDSE  PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C  CRGEC Q+RD E
Sbjct: 280 KVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQFQADQNLQYTCAACRGECSQIRDTE 339

Query: 240 DAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKL 299
           DA+RELW+R+D+AD +L+A+LRAAA LP+ +++      SDDE+ G   LKNE   +LK 
Sbjct: 340 DAIRELWKRRDVADHELMATLRAAAALPSLEDVSPPYQNSDDEKLGAYALKNESRNTLKF 399

Query: 300 SLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEP----DQSF-EGYHDVHSYG 354
           SLK    K P    E  K         KK  +       +     D+ F E  H V S  
Sbjct: 400 SLKSNSSKPPPDTPEQEKIVFKSSGSNKKPSKKKSGQANKTVDGHDEIFLERRHAVKSSN 459

Query: 355 NSFGDDT------QSP--KNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSD 406
           +  GD T      +SP   ++ + + SS   +  +      +  P +       + + ++
Sbjct: 460 SCLGDQTINENHDRSPFKNDDNVYVSSSTRSLEKN------LKSPSM-------KAVANN 506

Query: 407 DDKISRVKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSC 466
            D I +VK K SK   L   +D  ++  K+ T KA KLVI+LG+R    + SP+S+ S+ 
Sbjct: 507 ADMIPKVKIKGSKVSSLHY-KDGEENTPKNDTGKATKLVIHLGSRHKTRSGSPKSELSNS 565

Query: 467 QREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVI 526
           QREQDL + +G                            ++D +SQ +  +   +  +V+
Sbjct: 566 QREQDLGSIHG---------------------------GKIDVTSQLKSSRNEVKERSVM 598

Query: 527 KFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRK 586
           K  R    V   N+ +    ++ +H        +GKR    S A +S + E A   G R 
Sbjct: 599 KLVR-DTGVQQRNSLLGDLGTSKKHA-------TGKR----SNALISGM-ENANETGTRN 645

Query: 587 Q--LESRPNASRESNDDTSVLQSLPKDSKPPL-RLKFRKPNLENQNSQVSQPEEEKSLI- 642
           +   + + ++S+  N  T+     P   KP L +LKF++P+ E  N+Q SQPEE  S + 
Sbjct: 646 RSFAQKQSHSSQVENHGTA---DSPDSLKPSLLKLKFKRPHFEQLNTQASQPEEPTSWVS 702

Query: 643 ---------KGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDA 693
                    KGQRSKRKRPS   EK    +    A+ +Q S   E+MDANWIL+KLGKDA
Sbjct: 703 QQEEQLNVAKGQRSKRKRPS--MEKADGLDGITPAKRHQQS-TDEVMDANWILRKLGKDA 759

Query: 694 IGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
           IGKR+EVH  SD  WH+G+V++ + G  TL I LD+ R + +ELGKQ +R +  + K
Sbjct: 760 IGKRIEVHLTSDGKWHQGMVSNVMGG--TLCIRLDNGRSENVELGKQAIRLIASRSK 814


>gi|222617191|gb|EEE53323.1| hypothetical protein OsJ_36320 [Oryza sativa Japonica Group]
          Length = 756

 Score =  593 bits (1528), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 340/744 (45%), Positives = 444/744 (59%), Gaps = 91/744 (12%)

Query: 36  MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
           MCR+CF GENEG  +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+C
Sbjct: 1   MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVC 60

Query: 96  RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
           RR GDPNK MFC+RCD AYHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 61  RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 120

Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 215
           FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE  PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 180

Query: 216 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 275
           Q D NLQY C  CRGEC Q+RD EDAVRELW+R+D+ D DL+ASLRAAA LP+ +++   
Sbjct: 181 QSDQNLQYTCGACRGECSQIRDTEDAVRELWKRRDVVDHDLMASLRAAAALPSLEDVSPS 240

Query: 276 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGY 330
            P SDDE+ G  V+KN+   +LK S K    K      E  K  +     NKK+ +KKG 
Sbjct: 241 HPNSDDEKLGAYVMKNDGRNTLKFSFKSNSTKPALDSSEQEKNAIKSSGSNKKHSKKKGN 300

Query: 331 QMPLNSKPEPDQSFEGYHDVHSYGNSFGD-------DTQSPKNEGLDIPSSVAGIVSHTE 383
           Q       + +   E  ++  S G S GD       D  S KN   D  + V       E
Sbjct: 301 QNNKTVSEQDEIFLEKRNETKSLG-SLGDQIADVTRDKSSFKN---DADAFVLSSAQSAE 356

Query: 384 GVCSISQPGILKHKYVDEVMVSDDDKISRVKFKTSKP---HDLDSGEDDGKHVSKSKTIK 440
               +       H         + D I +VK K +K    H  D GE++    +KS T K
Sbjct: 357 KALKLQSAKAAAH---------NADMIPKVKIKGTKVPSLHFKDVGEEN---AAKSDTGK 404

Query: 441 AKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSK 500
             KLVI++G+R  + + SP+S+ S+ Q+EQ+L + +G                       
Sbjct: 405 GTKLVIHIGSRHKSRSGSPKSEMSNSQKEQELVSMHG----------------------- 441

Query: 501 LGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLS 560
                +VD +SQ +  +   +  +V+K   VR+     N+ +    ++ +H        +
Sbjct: 442 ----GKVDVTSQFKSSRSEIKEKSVMKL--VRETGVQQNSLLGDLGASKKHA-------T 488

Query: 561 GKRNIDRSRAAVSRVGEVAALRGDRK----QLESRPNASRESNDDTSVLQSLPKDSKPPL 616
           GKR    S A VS + E A+  G R     Q +S  + +    + +  + + P   KP L
Sbjct: 489 GKR----SNAIVSAM-ENASESGTRSRSFGQKQSVNHLTENQGNASFSVNNSPDSLKPSL 543

Query: 617 -RLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 666
            +LKF++P  E  ++Q SQPEE         E ++ KGQRSKRKRPS   +K   +E + 
Sbjct: 544 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 601

Query: 667 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 725
            AA+ ++ S   E MDANWIL+KLGKDAIGKR+EV   SD  WH+GVV++ + G  TL +
Sbjct: 602 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 659

Query: 726 TLDDSRVKTLELGKQGVRFVPQKQ 749
            LD+ R + +ELGK+ +R + Q Q
Sbjct: 660 QLDNGRSENIELGKRAIRLIAQSQ 683


>gi|326519042|dbj|BAJ92681.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 816

 Score =  575 bits (1482), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 340/774 (43%), Positives = 440/774 (56%), Gaps = 83/774 (10%)

Query: 5   EDYARRFESGYVATASK----DIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSC 60
           ED+ARR E  Y +  ++    D   E+QG +   VMCRLCF GENEG  +A +ML CK C
Sbjct: 94  EDFARRLEGAYGSPEAEGDEDDSDREDQGNAAVKVMCRLCFSGENEGSSKAAKMLPCKLC 153

Query: 61  GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
            KKYH+ C+KNW ++RDLFHWSSW C SCR CE+CRR GDPNK MFC+RCD AYHCYCQ 
Sbjct: 154 NKKYHKKCVKNWGEHRDLFHWSSWICSSCRSCEVCRRPGDPNKLMFCKRCDGAYHCYCQQ 213

Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLK 180
           P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPVCLK
Sbjct: 214 PSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPVCLK 273

Query: 181 VYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLED 240
           VYRDSE  PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C +CRGEC Q+RD ED
Sbjct: 274 VYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQFQADQNLQYTCASCRGECSQIRDAED 333

Query: 241 AVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLS 300
           AVRELW+R+++ D DL+ SLRAAAGLP+ +++ S  P SDDE  G +VLKN+   +LK S
Sbjct: 334 AVRELWKRRNVVDHDLMISLRAAAGLPSLEDV-SPCPNSDDERLGALVLKNDGRNTLKFS 392

Query: 301 LKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPL---------NSKPEPDQSF-EGYHDV 350
           LK    K P    E       K  P+  G               S  +P++ F E  H+ 
Sbjct: 393 LKSNSSKPPLDQCEQ-----EKNVPKNSGTNKKHSKKKSSQGNKSVADPNEIFLERRHEA 447

Query: 351 HSYGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKI 410
            S  +  GD T    ++     ++    V  +    + S    LK   V +   ++ + I
Sbjct: 448 KSMSSHLGDHTVDVNHDRNSFKNNENVFVLPS----TRSSEKDLKSTSV-KATTNNANTI 502

Query: 411 SRVKFKTSKP---HDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQ 467
            +VK K SK    H  D GE++    +   T K  KLVI+LG R  + + SP+S+ S+  
Sbjct: 503 PKVKIKGSKVPSLHFKDIGEENN---ANGDTGKGTKLVIHLGTRHKSKSGSPKSEMSNSH 559

Query: 468 REQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIK 527
           +EQ+L +++G +        S     +     KL     V  SS    L  + R      
Sbjct: 560 KEQELGSTHGGKTDVTSLFKSSKSSKKEKSVMKLVGETGVQQSSLLGDLGTSKRHA---- 615

Query: 528 FGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQ 587
                           + SSA         ++SG  N + S       G           
Sbjct: 616 --------------TGKRSSA---------LISGMENANESGTRSRSFG----------H 642

Query: 588 LESRPNASRESNDDTSVLQSLPKDSKPP--LRLKFRKPNLENQNSQVSQPEEEKS----- 640
            +S P+   ES    S   +   DS  P  L+LKF++P+LE  + QVSQ EE  +     
Sbjct: 643 KQSIPSQLTESQGTASFAVNNSPDSLKPSLLKLKFKRPHLEQPSLQVSQTEEPATWASQQ 702

Query: 641 ----LIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGK 696
               + KGQRSKRKRPS  T+K   +E    ++ +  S   E MDA WIL+KLG DAIGK
Sbjct: 703 EDLNVAKGQRSKRKRPS--TDKMDGSEGSTPSKRHGQSTGDEAMDATWILRKLGNDAIGK 760

Query: 697 RVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
           R+E+   SD  WH+GVV++ + G   L + LD+   + LELG Q VR + Q+ K
Sbjct: 761 RIEIQLASDGKWHQGVVSNVISG--MLCVQLDNGSSENLELGNQAVRLIAQRLK 812


>gi|224055146|ref|XP_002298424.1| predicted protein [Populus trichocarpa]
 gi|222845682|gb|EEE83229.1| predicted protein [Populus trichocarpa]
          Length = 797

 Score =  521 bits (1341), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 315/535 (58%), Positives = 377/535 (70%), Gaps = 53/535 (9%)

Query: 190 MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRK 249
           +   D C   V       SDEKYLQFQVDGNLQY+C TCRGECYQV+DLEDAV+ELWRR+
Sbjct: 236 IFASDGCTVIVMASEIFCSDEKYLQFQVDGNLQYQCSTCRGECYQVKDLEDAVQELWRRR 295

Query: 250 DMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSP 309
           D AD+ LIASLRAAAGLP +++IFSI+PYSDD+ENGP   +N+FGRS+KLSLKG+V+KSP
Sbjct: 296 DKADRGLIASLRAAAGLPAQEDIFSITPYSDDDENGPAAPRNDFGRSIKLSLKGLVEKSP 355

Query: 310 KKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGD----DTQSPK 365
           KK K+HGKK LNKKYP++KG      SK E  Q     H+ HSY +  GD    DT+S  
Sbjct: 356 KKSKDHGKKHLNKKYPKRKGPHAASFSKTESYQ-----HESHSYEHDSGDEKNNDTESQA 410

Query: 366 NEGLDIPSS-VAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPHDL 423
             GL   SS VAGIV+HTEG+CSI+QPG LKHK+V+EVMVSD ++ S+ VK K++KP DL
Sbjct: 411 KGGLGRCSSPVAGIVNHTEGICSINQPGALKHKFVEEVMVSDGERTSKIVKIKSNKPRDL 470

Query: 424 DSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSL 483
           DSG DD +  SKSK++KAKKLVINLGARKINV++SP+SDA SCQREQDL  SN       
Sbjct: 471 DSG-DDAEKPSKSKSVKAKKLVINLGARKINVSSSPKSDAQSCQREQDLKASN------- 522

Query: 484 QRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGL-KIAGRGGNVIKFGRVRQEVSDSNTKV 542
                               GD VDHS Q RGL K A R GN IKFG+V+ E S  N K 
Sbjct: 523 --------------------GDGVDHSEQKRGLIKFARREGNFIKFGKVKAEASSLNLKS 562

Query: 543 SRGSSADEHEP---EHMHVLSGKRNIDRSRAAVSRVGEVAALRGDR----KQLESRPNAS 595
             G+  D +E    +H  V S KR+++ SRAAV   GEV  LR DR    KQ E+R +  
Sbjct: 563 DGGNHFDAYETTPLDHARVTSSKRSLEGSRAAVGPAGEVPMLRNDRVSLGKQSEARLDTH 622

Query: 596 RESND---DTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRP 652
            ESND   DT +L SLPKDSK  L+LK +KPNLENQ+SQ+   EEEKS  +GQRSKRKR 
Sbjct: 623 TESNDDSGDTPILHSLPKDSKLSLKLKIKKPNLENQSSQILLHEEEKSNTRGQRSKRKRA 682

Query: 653 SPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNS 707
           S F +KT++NEDED ++S+ D   SE+M+ANWILKKLGKDAIGKRVEVHQ SDNS
Sbjct: 683 STFMDKTMYNEDEDMSESHLD---SEMMEANWILKKLGKDAIGKRVEVHQPSDNS 734


>gi|359487302|ref|XP_002274438.2| PREDICTED: uncharacterized protein LOC100249974 [Vitis vinifera]
          Length = 730

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 232/292 (79%), Positives = 251/292 (85%), Gaps = 9/292 (3%)

Query: 2   VAAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCG 61
           + AEDYARRFESG +   SKDI GEEQ QSN NVMCR+CF GE EG ERAR+ML C SCG
Sbjct: 109 MVAEDYARRFESGDLVDTSKDIVGEEQSQSNVNVMCRICFFGEMEGSERARKMLPCNSCG 168

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           KKYHR CLK+W+QNRDLFHWSSW CPSCRICE+CRR+GDPNKFMFCRRCD AYHCYCQ P
Sbjct: 169 KKYHRLCLKSWSQNRDLFHWSSWTCPSCRICEVCRRSGDPNKFMFCRRCDDAYHCYCQQP 228

Query: 122 PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 181
           PHKNVSSGPYLCPKHT+CHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV
Sbjct: 229 PHKNVSSGPYLCPKHTRCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 288

Query: 182 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
           YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQY+C TCRGECYQV+DLEDA
Sbjct: 289 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYKCATCRGECYQVKDLEDA 348

Query: 242 VRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEF 293
           V+ELWRR+D AD+ + +S   A  L   + I SI        N P VLK++F
Sbjct: 349 VQELWRRRDKADRGVFSS-PVAGSLSHTEGICSI--------NQPGVLKHKF 391



 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/392 (58%), Positives = 271/392 (69%), Gaps = 41/392 (10%)

Query: 373 SSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV-KFKTSKPHDLDSGEDDGK 431
           S VAG +SHTEG+CSI+QPG+LKHK+VDE+ V+++D+ SRV + K++KPH  D GED GK
Sbjct: 366 SPVAGSLSHTEGICSINQPGVLKHKFVDEIAVNNEDRTSRVIQIKSNKPHGSDVGEDTGK 425

Query: 432 HVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFV 491
             SKSKT+K  KLVI+LGAR  NVTNSPRSDASSCQREQDLTTSNG              
Sbjct: 426 QASKSKTMKGTKLVIHLGARNRNVTNSPRSDASSCQREQDLTTSNG-------------- 471

Query: 492 LDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEH 551
                        D++D+S Q++G K  GR GN+IK G+VR E S+ N K  RG+  D  
Sbjct: 472 -------------DKIDYSGQAKGSKHGGREGNLIKLGKVRTEPSEMNPKFGRGNKDDGV 518

Query: 552 E---PEHMHVLSGKRNIDRSRAAVSRVGEVAALRGD----RKQLESRPNASRESNDDTS- 603
           E   PE+  VL GKR+I+ S      V EV+  RG+    RK  ESR N   E NDD S 
Sbjct: 519 EAIPPENTRVLLGKRSIEGSTNVAGAVTEVS--RGEKVFSRKHPESRLNMYGEGNDDNSS 576

Query: 604 ---VLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTL 660
              V  SLPKDSKP L+LKF+ P+ ENQ+S     E+EKS +KGQRSKRKRPSPF EKT 
Sbjct: 577 TPSVSHSLPKDSKPLLKLKFKNPSFENQSSWGLPGEDEKSAVKGQRSKRKRPSPFMEKTS 636

Query: 661 FNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGT 720
           F EDED +Q +QD  M +IMDANWILKKLGKDAIGKRVEVHQ SDNSWHKG+V D +EGT
Sbjct: 637 FKEDEDGSQFHQDDSMDQIMDANWILKKLGKDAIGKRVEVHQSSDNSWHKGMVIDFIEGT 696

Query: 721 STLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
           STL +  DD R KTLELGKQ +R + QKQKRS
Sbjct: 697 STLIVKFDDGRAKTLELGKQAIRLISQKQKRS 728


>gi|255553540|ref|XP_002517811.1| protein binding protein, putative [Ricinus communis]
 gi|223543083|gb|EEF44618.1| protein binding protein, putative [Ricinus communis]
          Length = 734

 Score =  513 bits (1321), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 224/267 (83%), Positives = 245/267 (91%), Gaps = 2/267 (0%)

Query: 5   EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKY 64
           EDYARRFESG +A A+KD+AG EQG SN NVMCR+CF+GE EG ERARRMLSCKSCGKKY
Sbjct: 126 EDYARRFESGDMAVATKDVAGHEQGLSNANVMCRMCFLGEAEGSERARRMLSCKSCGKKY 185

Query: 65  HRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHK 124
           HR+CLK+WAQ+RDLFHWSSW CPSCRICEICRRTGDPNKFMFC+RCD AYHCYCQHPPHK
Sbjct: 186 HRSCLKSWAQHRDLFHWSSWTCPSCRICEICRRTGDPNKFMFCKRCDGAYHCYCQHPPHK 245

Query: 125 NVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD 184
           NVSSGPYLCPKHT+CHSCGS+VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD
Sbjct: 246 NVSSGPYLCPKHTRCHSCGSSVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD 305

Query: 185 SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRE 244
           SESTPMVCCD+CQRWVHC CDGISDEKYLQFQVDGNLQY+C TCRGECYQV+D EDAV+E
Sbjct: 306 SESTPMVCCDICQRWVHCSCDGISDEKYLQFQVDGNLQYKCATCRGECYQVKDHEDAVQE 365

Query: 245 LWRRKDMADKDLIASLRAAAGLPTEDE 271
           LWRR+D AD+ + +S  + AG+    E
Sbjct: 366 LWRRRDEADRGVYSS--SIAGVVNHAE 390



 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 234/385 (60%), Positives = 273/385 (70%), Gaps = 38/385 (9%)

Query: 373 SSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGK 431
           SS+AG+V+H EG CS++Q G+LKHKYVDEVMVSD ++ SR V+ K  KPHDLDSG+D  K
Sbjct: 380 SSIAGVVNHAEGNCSVNQTGVLKHKYVDEVMVSDGERTSRIVRLKNKKPHDLDSGDDAEK 439

Query: 432 HVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFV 491
           H  K K++KAKKLVINLGARKINVTNS RSDASSCQR+QD+TT NG              
Sbjct: 440 HAIKFKSVKAKKLVINLGARKINVTNSHRSDASSCQRDQDMTTPNG-------------- 485

Query: 492 LDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEH 551
                        D VDHS Q R LK   R GN IKFG+V+ E S+ N K   GS AD  
Sbjct: 486 -------------DTVDHSVQIRSLKFPRREGNFIKFGKVKNETSNLNPKFQTGSDADGE 532

Query: 552 EPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDR----KQLESRPNASRESNDDTSVLQS 607
           +   + V S KR+ID    AV  V EV  LR D+    KQLE R     ESNDD+    S
Sbjct: 533 K--MVSVSSSKRSIDGCGTAVGPVDEVPTLRSDKVSIGKQLEVRSETHAESNDDSGD-AS 589

Query: 608 LPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDA 667
           LPKDSK  L+LK + PNL NQ S+   PEEEKS I+GQRSKRKRPS F +K+LFNE+ED 
Sbjct: 590 LPKDSKISLKLKIKNPNLLNQYSRKPPPEEEKSSIRGQRSKRKRPSSFMDKSLFNENEDI 649

Query: 668 AQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITL 727
            Q++QDS   E+++A+WILKKLGKDAIGKRVEVHQ SDNSWHKGVV+DTVEGTS +S+TL
Sbjct: 650 TQAHQDS---EMLEASWILKKLGKDAIGKRVEVHQPSDNSWHKGVVSDTVEGTSMISVTL 706

Query: 728 DDSRVKTLELGKQGVRFVPQKQKRS 752
           DDSRVKTL+LGKQ VRFVPQKQKRS
Sbjct: 707 DDSRVKTLQLGKQAVRFVPQKQKRS 731


>gi|297816482|ref|XP_002876124.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297321962|gb|EFH52383.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 678

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 193/260 (74%), Positives = 223/260 (85%), Gaps = 1/260 (0%)

Query: 5   EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKY 64
           EDYA    SG +  + K++ GE+    ++++ C +C++ E    ERA+ MLSCK CGKKY
Sbjct: 112 EDYAVNDGSGDLEVSVKELNGEDHDHHSSSITCHMCYLVEVGKSERAK-MLSCKCCGKKY 170

Query: 65  HRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHK 124
           HRNCLK+WAQ+RDLF+WSSW CPSCRICE C   GDP KFMFC+RCD AYHC CQ P HK
Sbjct: 171 HRNCLKSWAQHRDLFNWSSWACPSCRICEGCGTLGDPKKFMFCKRCDDAYHCDCQQPRHK 230

Query: 125 NVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD 184
           NVSSGPYLCPKHTKC+SCGS VPGNG S+RWFLG+TCCDACGRLFVKGNYCPVCLKVYRD
Sbjct: 231 NVSSGPYLCPKHTKCYSCGSTVPGNGQSLRWFLGHTCCDACGRLFVKGNYCPVCLKVYRD 290

Query: 185 SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRE 244
           SE+TPMVCCD CQRWVHC CDGISDEKY+QFQVDGNLQY+C TCRGECYQV+DLEDAV+E
Sbjct: 291 SEATPMVCCDFCQRWVHCHCDGISDEKYMQFQVDGNLQYKCSTCRGECYQVKDLEDAVQE 350

Query: 245 LWRRKDMADKDLIASLRAAA 264
           +W+RKD+ADKDLIASL+A+A
Sbjct: 351 IWKRKDIADKDLIASLKASA 370



 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 98/225 (43%), Positives = 133/225 (59%), Gaps = 9/225 (4%)

Query: 531 VRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLES 590
           V +  S SN K  +  S +    E    L GK N D  R +    GEV   + +   +  
Sbjct: 458 VSKLTSKSNGKQEKLQSEETFSREQHRSLLGKNN-DEKRGSR---GEVTTSKAEGGFIGR 513

Query: 591 RPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEKSLIKGQRSKR 649
             +   + N  +    S+ KDS+  L+L+ +K N E+Q  +  S   E     KG RSKR
Sbjct: 514 HSDGKGDLNSGSH--DSMQKDSRRLLKLRIKKHNPESQEGETPSIVYERGKSGKGHRSKR 571

Query: 650 KRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWH 709
           KR SP  EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW 
Sbjct: 572 KRASPPAEKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWE 631

Query: 710 K--GVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
           K          GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 632 KGVVSEVGGGGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 676


>gi|218186977|gb|EEC69404.1| hypothetical protein OsI_38556 [Oryza sativa Indica Group]
          Length = 625

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 208/358 (58%), Positives = 243/358 (67%), Gaps = 35/358 (9%)

Query: 36  MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI- 94
           MCR+CF GENEG  +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+ 
Sbjct: 1   MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVL 60

Query: 95  ----------------------------CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                                       CRR GDPNK MFC+RCD AYHCYCQ P HKNV
Sbjct: 61  LDWSLGFDVNLAKTLVCGVTGPTSGSSVCRRPGDPNKLMFCKRCDGAYHCYCQQPSHKNV 120

Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
           + GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE
Sbjct: 121 THGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 180

Query: 187 STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELW 246
             PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C  CRGEC Q+RD EDAVRELW
Sbjct: 181 VIPMVCCDVCEKWVHIECDGISEEKYQQFQSDQNLQYTCGACRGECSQIRDTEDAVRELW 240

Query: 247 RRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVD 306
           +R+D+ D DL+ASLRAAA LP+ +++    P SDDE+ G  V+KN+   +LK S K    
Sbjct: 241 KRRDVVDHDLMASLRAAAALPSLEDVSPSHPNSDDEKLGAYVMKNDGRNTLKFSFKSNST 300

Query: 307 KSPKKVKEHGKKWL-----NKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGD 359
           K      E  K  +     NKK+ +KKG Q       + +   E  ++  S G S GD
Sbjct: 301 KPALDSSEQEKNAIKSSGSNKKHSKKKGNQNNKTVSEQDEIFLEKRNETKSLG-SLGD 357



 Score =  123 bits (309), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 69/144 (47%), Positives = 95/144 (65%), Gaps = 14/144 (9%)

Query: 616 LRLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 666
           L+LKF++P  E  ++Q SQPEE         E ++ KGQRSKRKRPS   +K   +E + 
Sbjct: 449 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 506

Query: 667 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 725
            AA+ ++ S   E MDANWIL+KLGKDAIGKR+EV   SD  WH+GVV++ + G  TL +
Sbjct: 507 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 564

Query: 726 TLDDSRVKTLELGKQGVRFVPQKQ 749
            LD+ R + +ELGK+ +R + Q Q
Sbjct: 565 QLDNGRSENIELGKRAIRLIAQSQ 588


>gi|42565848|ref|NP_190778.2| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
 gi|332645370|gb|AEE78891.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
          Length = 696

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 187/244 (76%), Positives = 213/244 (87%), Gaps = 1/244 (0%)

Query: 21  KDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFH 80
           KD+  E+    + ++ C +C++ E    ERA+ MLSCK CGKKYHRNC+K+WAQ+RDLF+
Sbjct: 146 KDLNEEDHDHHSASITCHMCYLVEVGKSERAK-MLSCKCCGKKYHRNCVKSWAQHRDLFN 204

Query: 81  WSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCH 140
           WSSW CPSCRICE C   GDP KFMFC+RCD AYHC CQHP HKNVSSGPYLCPKHTKC+
Sbjct: 205 WSSWACPSCRICEGCGTLGDPKKFMFCKRCDDAYHCDCQHPRHKNVSSGPYLCPKHTKCY 264

Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWV 200
           SC S VPGNG S+RWFLG+TCCDACGRLFVKGNYCPVCLKVYRDSE+TPMVCCD CQRWV
Sbjct: 265 SCESTVPGNGQSLRWFLGHTCCDACGRLFVKGNYCPVCLKVYRDSEATPMVCCDFCQRWV 324

Query: 201 HCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASL 260
           HCQCDGISDEKY+QFQVDGNLQY+C TCRGE YQV+DLEDAV+E+W+RKDMADKDLIASL
Sbjct: 325 HCQCDGISDEKYMQFQVDGNLQYKCSTCRGESYQVKDLEDAVQEIWKRKDMADKDLIASL 384

Query: 261 RAAA 264
           +A+A
Sbjct: 385 KASA 388



 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 103/225 (45%), Positives = 139/225 (61%), Gaps = 9/225 (4%)

Query: 531 VRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLES 590
           V +  S SN K  +  + +    E    L GK N D  R +    GEV  L+ +   +  
Sbjct: 476 VSKTASKSNGKQEKLQAEETFSREERRSLLGK-NSDEKRGSR---GEVTTLKAEGGFIGR 531

Query: 591 RPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLI-KGQRSKR 649
             +   + N  +    S  KDS+  L+LK +K N E Q S+      E+S   KG RSKR
Sbjct: 532 HSDGKGDLNSGSH--DSSQKDSRRLLKLKIKKHNPEGQESEAPSIVYERSKSGKGHRSKR 589

Query: 650 KRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWH 709
           KR SP  EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW 
Sbjct: 590 KRASPPAEKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWE 649

Query: 710 KGVVTDT--VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
           KGVV++     GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 650 KGVVSEVGGAGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 694


>gi|168045006|ref|XP_001774970.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673717|gb|EDQ60236.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 686

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 162/285 (56%), Positives = 207/285 (72%), Gaps = 3/285 (1%)

Query: 2   VAAEDYARRFESGYVATASKDIAGEEQGQSNT---NVMCRLCFVGENEGCERARRMLSCK 58
            +A  +A R  +G       D    E+G   +    V+C LC  GE  G ++A RMLSC+
Sbjct: 135 ASANGWADRERAGMPVRGGSDEEFNEEGHFESLFPQVVCGLCGCGEAIGSDKAGRMLSCQ 194

Query: 59  SCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
           +C K+YHR C K WA++RDLF+W+SW C SCR+CE+C R+GD NK MFC+RCD AYH  C
Sbjct: 195 ACRKQYHRKCTKYWAEHRDLFNWASWMCGSCRVCEVCLRSGDSNKLMFCKRCDHAYHSSC 254

Query: 119 QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVC 178
            HPP K+V  GP++CPKH +C SC + VPG G+S +WFL Y+ CDACGRLF +G YCP+C
Sbjct: 255 LHPPLKHVPKGPFVCPKHVRCTSCNTTVPGGGVSSKWFLSYSLCDACGRLFTRGKYCPIC 314

Query: 179 LKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDL 238
           LKVYRDSE  PMVCCDVC+ WVHC+CDGISDEKY +FQV+  L+Y+C +CRGECY+V DL
Sbjct: 315 LKVYRDSEPAPMVCCDVCEHWVHCECDGISDEKYQEFQVNSQLRYKCASCRGECYKVADL 374

Query: 239 EDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEE 283
           +DA  E+WRRKD+ D   IA +RAAAGLP+ +EI    P SD+E+
Sbjct: 375 DDAAVEIWRRKDIRDATQIAEIRAAAGLPSPEEILKAYPSSDEED 419


>gi|4678939|emb|CAB41330.1| putative protein [Arabidopsis thaliana]
          Length = 763

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 175/273 (64%), Positives = 205/273 (75%), Gaps = 12/273 (4%)

Query: 3   AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGK 62
           A E YA    SG +  + KD+  E+    + ++ C +C++ E    ERA+ MLSCK CGK
Sbjct: 110 AVEVYAANDCSGDLEVSVKDLNEEDHDHHSASITCHMCYLVEVGKSERAK-MLSCKCCGK 168

Query: 63  KYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
           KYHRNC+K+WAQ+RDLF+WSSW CPSCRICE C   GDP KFMFC+RCD AYHC CQHP 
Sbjct: 169 KYHRNCVKSWAQHRDLFNWSSWACPSCRICEGCGTLGDPKKFMFCKRCDDAYHCDCQHPR 228

Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF------LGYTCCDACGRL---FVKGN 173
           HKNVSSGPYLCPKHTKC+SC S VPGNG S+R+       L    C   G L    V+G 
Sbjct: 229 HKNVSSGPYLCPKHTKCYSCESTVPGNGQSLRYLTFCLVILEIYSCGFWGILVVMLVEGC 288

Query: 174 YC--PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
                + L VYRDSE+TPMVCCD CQRWVHCQCDGISDEKY+QFQVDGNLQY+C TCRGE
Sbjct: 289 LLRGIIVLYVYRDSEATPMVCCDFCQRWVHCQCDGISDEKYMQFQVDGNLQYKCSTCRGE 348

Query: 232 CYQVRDLEDAVRELWRRKDMADKDLIASLRAAA 264
            YQV+DLEDAV+E+W+RKDMADKDLIASL+A+A
Sbjct: 349 SYQVKDLEDAVQEIWKRKDMADKDLIASLKASA 381



 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 103/225 (45%), Positives = 139/225 (61%), Gaps = 9/225 (4%)

Query: 531 VRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLES 590
           V +  S SN K  +  + +    E    L GK N D  R +    GEV  L+ +   +  
Sbjct: 469 VSKTASKSNGKQEKLQAEETFSREERRSLLGK-NSDEKRGSR---GEVTTLKAEGGFIGR 524

Query: 591 RPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLI-KGQRSKR 649
             +   + N  +    S  KDS+  L+LK +K N E Q S+      E+S   KG RSKR
Sbjct: 525 HSDGKGDLNSGSH--DSSQKDSRRLLKLKIKKHNPEGQESEAPSIVYERSKSGKGHRSKR 582

Query: 650 KRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWH 709
           KR SP  EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW 
Sbjct: 583 KRASPPAEKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWE 642

Query: 710 KGVVTDT--VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
           KGVV++     GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 643 KGVVSEVGGAGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 687


>gi|302813786|ref|XP_002988578.1| hypothetical protein SELMODRAFT_447369 [Selaginella moellendorffii]
 gi|300143685|gb|EFJ10374.1| hypothetical protein SELMODRAFT_447369 [Selaginella moellendorffii]
          Length = 774

 Score =  338 bits (868), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 154/287 (53%), Positives = 201/287 (70%), Gaps = 9/287 (3%)

Query: 2   VAAEDYARRFES-----GYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLS 56
           V AED    +++     G V+T S+    EE       + C LC + E E  ++  RML+
Sbjct: 144 VPAEDNQPDYQTPDDGGGEVSTPSQP---EESVAPPEEIFCGLCQLAEAES-KKQERMLT 199

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C+ C +++HR CLK+WA NRDLF+W+SW+C  CR CE C+ TGDPN+ +FC+RCD A+H 
Sbjct: 200 CQGCDRRFHRKCLKDWAGNRDLFNWASWRCLHCRTCEDCKVTGDPNRLLFCKRCDEAHHN 259

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
            C+    K  + GP+LCPKH++CHSCG+ VPG G S RWF  Y  CDACGRLFVK  YCP
Sbjct: 260 NCKQSGAKAPAKGPFLCPKHSQCHSCGTRVPGGGSSSRWFHSYLFCDACGRLFVKDKYCP 319

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVR 236
           +C+KVYR+SE TPMV CD C+ WVHC C+GISDEKY +FQ   NL++ C  CRGEC+Q  
Sbjct: 320 ICMKVYRESEPTPMVLCDGCEHWVHCVCEGISDEKYQEFQTIQNLRFTCAACRGECFQAT 379

Query: 237 DLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEE 283
            +E+AV ELW+RKD AD+D I SLRA+AGLP+E E+  + P SDDE+
Sbjct: 380 SVEEAVVELWKRKDEADRDQIKSLRASAGLPSESEMARLCPSSDDEQ 426



 Score = 76.6 bits (187), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 94/182 (51%), Gaps = 23/182 (12%)

Query: 572 VSRVGEVAALRGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQ 631
           V +V   A  RG   +  +  +  R  +D+       P+  +  L+LK +KP+     ++
Sbjct: 440 VFKVNSSAKARGKSSEEAADSSKKRSRSDNVEP----PETERKTLKLKIKKPS----GTE 491

Query: 632 VSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGK 691
           V    E  +  +GQRSKRKRP+   E+    E  DA +S++D           IL +LG 
Sbjct: 492 VV---EASNTARGQRSKRKRPASSQEE----EVADAVESDEDDTS--------ILHRLGS 536

Query: 692 DAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKR 751
           DA+ KRVEV + SD +W KG +T   +  S  ++  D+   KTL+ GK+ VR + ++++ 
Sbjct: 537 DAVTKRVEVCRSSDKTWLKGTITHVQQRRSQFTVNFDNGDKKTLKYGKEKVRLLGKRERY 596

Query: 752 SM 753
           ++
Sbjct: 597 AI 598


>gi|302795017|ref|XP_002979272.1| hypothetical protein SELMODRAFT_444121 [Selaginella moellendorffii]
 gi|300153040|gb|EFJ19680.1| hypothetical protein SELMODRAFT_444121 [Selaginella moellendorffii]
          Length = 764

 Score =  337 bits (865), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 154/287 (53%), Positives = 200/287 (69%), Gaps = 9/287 (3%)

Query: 2   VAAEDYARRFES-----GYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLS 56
           V AED    +++     G V+T S+    EE       + C LC   E E  ++  RML+
Sbjct: 144 VPAEDNQPDYQTPDDGGGEVSTPSQP---EESVAPPEEIFCGLCQQAEAES-KKQERMLT 199

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C+ C +++HR CLK+WA NRDLF+W+SW+C  CR CE C+ TGDPN+ +FC+RCD A+H 
Sbjct: 200 CQGCDRRFHRKCLKDWAGNRDLFNWASWRCLHCRTCEDCKVTGDPNRLLFCKRCDEAHHN 259

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
            C+    K  + GP+LCPKH++CHSCG+ VPG G S RWF  Y  CDACGRLFVK  YCP
Sbjct: 260 NCKQSGAKAPAKGPFLCPKHSQCHSCGTRVPGGGSSSRWFHSYLFCDACGRLFVKDKYCP 319

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVR 236
           +C+KVYR+SE TPMV CD C+ WVHC C+GISDEKY +FQ   NL++ C  CRGEC+Q  
Sbjct: 320 ICMKVYRESEPTPMVLCDGCEHWVHCVCEGISDEKYQEFQTIQNLRFTCAACRGECFQAT 379

Query: 237 DLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEE 283
            +E+AV ELW+RKD AD+D I SLRA+AGLP+E E+  + P SDDE+
Sbjct: 380 SVEEAVVELWKRKDEADRDQIKSLRASAGLPSESEMARLCPSSDDEQ 426



 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 87/162 (53%), Gaps = 25/162 (15%)

Query: 595 SRESNDDTSVLQSLPKDSKPP------LRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSK 648
           S E   D+S  +S   + +PP      L+LK +KP+     ++V    E  +  +GQRSK
Sbjct: 453 SSEEAADSSKKRSRSDNVEPPETERKTLKLKIKKPS----GTEVV---EASNTARGQRSK 505

Query: 649 RKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSW 708
           RKRP+   E+    E  DA +S++D           IL +LG DA+ KRVEV + SD +W
Sbjct: 506 RKRPASSQEE----EVADAVESDEDDTS--------ILHRLGSDAVTKRVEVCRSSDKTW 553

Query: 709 HKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
            KG +T   +  S  ++  D+   KTL+ GK+ VR + ++++
Sbjct: 554 LKGTITHVQQRRSQFTVNFDNGDKKTLKYGKEKVRLLGKRER 595


>gi|242085692|ref|XP_002443271.1| hypothetical protein SORBIDRAFT_08g016700 [Sorghum bicolor]
 gi|241943964|gb|EES17109.1| hypothetical protein SORBIDRAFT_08g016700 [Sorghum bicolor]
          Length = 531

 Score =  289 bits (740), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 215/600 (35%), Positives = 302/600 (50%), Gaps = 112/600 (18%)

Query: 190 MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRK 249
           MVCCDVC++WVH +CDGISDEKY +FQ D NLQY C  CRGEC Q+RD EDA+RELW+R+
Sbjct: 1   MVCCDVCEKWVHIECDGISDEKYQEFQADQNLQYTCAACRGECSQIRDTEDAIRELWKRR 60

Query: 250 DMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSP 309
           D+ D +L+ +LRAAA LP+ +++    P SDDE+ G  VLKNE   +LK SLK    K P
Sbjct: 61  DVVDHELMVTLRAAAALPSPEDVSPPYPNSDDEKLGAYVLKNESRNTLKFSLKSNSSKPP 120

Query: 310 KKVKEHGKKWL-----------------NKK--------YPRKKGYQMPLNSKPEPDQSF 344
               E  K                    NK           R+   ++P NS+   DQS 
Sbjct: 121 SDTPEQEKIVFKSPGSNKKSSKKKGGQGNKTDDGHDEIFLERRHDVKLP-NSRL-GDQSI 178

Query: 345 EGYHDVHSY---GNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDE 401
           +G HD   +    N++   +     + L  PS  A                         
Sbjct: 179 DGNHDRSPFKNDDNAYISSSTRSSEKSLKSPSKKA------------------------- 213

Query: 402 VMVSDDDKISRVKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRS 461
            + ++ D I +VK K SK   L   +D  ++  K+ T KA KLVI+LG+R    + SP+S
Sbjct: 214 -VPNNADMIPKVKIKGSKVSTLHY-KDGEENTPKNDTGKATKLVIHLGSRHKTRSGSPKS 271

Query: 462 DASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGR 521
           + S+ QREQDL + +G                            +VD +SQ +  +   +
Sbjct: 272 ELSNSQREQDLGSIHG---------------------------GKVDVTSQLKSSRSEVK 304

Query: 522 GGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAAL 581
             +V+K  R        N+ +    ++ +H        +GKR    S A +S + E A  
Sbjct: 305 ERSVMKLVR-ETGAPQRNSLLGDLGTSKKH-------ATGKR----SNALISGM-ENANE 351

Query: 582 RGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPL-RLKFRKPNLENQNSQVSQPEEEKS 640
            G R +  ++        D+       P + KP L +LKF++P+ E  N+Q SQPEE  S
Sbjct: 352 TGSRNRSFAQKQYHSSQVDENQGTADSPDNLKPSLLKLKFKRPHFEQLNTQASQPEEPTS 411

Query: 641 LI----------KGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLG 690
            +          KGQRSKRKRPS   EK    +    A+ +Q S   E+MDANWIL+KLG
Sbjct: 412 WVSQQEEQLNVAKGQRSKRKRPS--MEKADGLDGTTPAKRHQQSTDDEVMDANWILRKLG 469

Query: 691 KDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
           KDAIGKR+EVH  SD  WH+G+V++ + G  TL I LD+ R + +ELGKQ +R +  + K
Sbjct: 470 KDAIGKRIEVHLTSDGKWHQGMVSNVIGG--TLCIQLDNGRSENVELGKQAIRLIASRSK 527


>gi|224106095|ref|XP_002314041.1| predicted protein [Populus trichocarpa]
 gi|222850449|gb|EEE87996.1| predicted protein [Populus trichocarpa]
          Length = 96

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 75/96 (78%), Positives = 82/96 (85%), Gaps = 3/96 (3%)

Query: 657 EKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDT 716
           EKT++NEDE  +QS+ DS   E+M+ANWILKKLG DAIGKRVEVHQ SDNSWHKGVV+D 
Sbjct: 2   EKTMYNEDEGMSQSHLDS---EMMEANWILKKLGYDAIGKRVEVHQPSDNSWHKGVVSDI 58

Query: 717 VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
           VE TS LSITLDD RVKTLELGKQ VRFV QKQKRS
Sbjct: 59  VEDTSMLSITLDDDRVKTLELGKQAVRFVSQKQKRS 94


>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
           [Strongylocentrotus purpuratus]
          Length = 4860

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 108/220 (49%), Gaps = 5/220 (2%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C SCG+ YH +CL +   + D    + W+CP+C+IC+ CR+ GD NK + C  CD  Y
Sbjct: 393 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 451

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P    +    + C     C  CG+  PGNG S RW   YT CD+C +   KG  
Sbjct: 452 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 511

Query: 175 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 231
           CP+C K YR  +    MV C +C R+VH  CD      KY Q +  G    Y+CP CR  
Sbjct: 512 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 571

Query: 232 CYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDE 271
             +  +LE   R     +D       +S R    LP  D+
Sbjct: 572 PNRGLELERR-RSASPFEDGRRSPFASSSRTPPHLPIPDD 610



 Score =  108 bits (271), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C    + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C ++
Sbjct: 787 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 844

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D  + + C  CD +YH YC  PP + V  G + C     C  CGS  PG   +  W   
Sbjct: 845 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 902

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 217
           YT C  C  +     +C  C + YRD+E   ++C C  CQRW H  C+ +  E   +  +
Sbjct: 903 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 955

Query: 218 DGNLQYRCPTCR 229
           D    + C  CR
Sbjct: 956 DKG--FICTLCR 965


>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
           [Strongylocentrotus purpuratus]
          Length = 4856

 Score =  142 bits (359), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 73/188 (38%), Positives = 98/188 (52%), Gaps = 4/188 (2%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C SCG+ YH +CL +   + D    + W+CP+C+IC+ CR+ GD NK + C  CD  Y
Sbjct: 392 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 450

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P    +    + C     C  CG+  PGNG S RW   YT CD+C +   KG  
Sbjct: 451 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 510

Query: 175 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 231
           CP+C K YR  +    MV C +C R+VH  CD      KY Q +  G    Y+CP CR  
Sbjct: 511 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 570

Query: 232 CYQVRDLE 239
             +  +LE
Sbjct: 571 PNRGLELE 578



 Score =  108 bits (271), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C    + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C ++
Sbjct: 767 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 824

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D  + + C  CD +YH YC  PP + V  G + C     C  CGS  PG   +  W   
Sbjct: 825 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 882

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 217
           YT C  C  +     +C  C + YRD+E   ++C C  CQRW H  C+ +  E   +  +
Sbjct: 883 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 935

Query: 218 DGNLQYRCPTCR 229
           D    + C  CR
Sbjct: 936 DKG--FICTLCR 945


>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
           [Strongylocentrotus purpuratus]
          Length = 4873

 Score =  142 bits (359), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 73/188 (38%), Positives = 98/188 (52%), Gaps = 4/188 (2%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C SCG+ YH +CL +   + D    + W+CP+C+IC+ CR+ GD NK + C  CD  Y
Sbjct: 409 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 467

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P    +    + C     C  CG+  PGNG S RW   YT CD+C +   KG  
Sbjct: 468 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 527

Query: 175 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 231
           CP+C K YR  +    MV C +C R+VH  CD      KY Q +  G    Y+CP CR  
Sbjct: 528 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 587

Query: 232 CYQVRDLE 239
             +  +LE
Sbjct: 588 PNRGLELE 595



 Score =  108 bits (271), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C    + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C ++
Sbjct: 784 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 841

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D  + + C  CD +YH YC  PP + V  G + C     C  CGS  PG   +  W   
Sbjct: 842 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 899

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 217
           YT C  C  +     +C  C + YRD+E   ++C C  CQRW H  C+ +  E   +  +
Sbjct: 900 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 952

Query: 218 DGNLQYRCPTCR 229
           D    + C  CR
Sbjct: 953 DKG--FICTLCR 962


>gi|291238977|ref|XP_002739402.1| PREDICTED: rCG56742-like, partial [Saccoglossus kowalevskii]
          Length = 1566

 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 68/178 (38%), Positives = 91/178 (51%), Gaps = 4/178 (2%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C SCG+ YH +CL        +   + W+CP C+IC+ CR+ GD NK + C  CD  Y
Sbjct: 388 LFCTSCGQHYHGSCLDPPVDVNPVVR-AGWQCPECKICQTCRQPGDDNKMLVCDTCDKGY 446

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P  + +    + C     C  CGS  PG+G S RW L Y+ CD+C +   KG  
Sbjct: 447 HTFCLRPVMQTIPKNGWKCKNCRICTDCGSRTPGSGPSSRWHLNYSVCDSCYQQRNKGLC 506

Query: 175 CPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
           CP+C K YR  +    M+ C+ C++WVH  CD   D    Q   D  L   Y C  CR
Sbjct: 507 CPICGKAYRQHTAHNAMIQCESCKKWVHVDCDESIDISVYQQLKDDKLTTIYNCVDCR 564



 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 93/199 (46%), Gaps = 19/199 (9%)

Query: 33  TNVMCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  MC  C  F  + EG     R+L+C  CG+ YH  C+    +   +     W+C  C 
Sbjct: 797 TQDMCVSCGSFGRDAEG-----RLLTCSQCGQCYHPYCVN--IKITKVVLSKGWRCLDCT 849

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C +  D  + + C  CD +YH YC  PP +NV  G + C     C  CG+  P  G
Sbjct: 850 VCEGCGKASDEGRLLLCDDCDISYHTYCLEPPLQNVPKGGWKCKWCVCCTKCGATSP--G 907

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            +  W   YT C  C  L      CPVC K Y++ E   ++ C  C RW+H +CDG  +E
Sbjct: 908 FNSEWQNNYTQCGPCSSLLT----CPVCFKEYKEDEL--IIQCVQCYRWLHAECDGFHNE 961

Query: 211 KYLQFQVDGNLQYRCPTCR 229
             ++   D    Y C  CR
Sbjct: 962 DDIERAADQG--YHCLLCR 978


>gi|405958289|gb|EKC24431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
          Length = 4990

 Score =  135 bits (341), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 72/212 (33%), Positives = 106/212 (50%), Gaps = 16/212 (7%)

Query: 20   SKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCL-KNWAQNRDL 78
            ++ IAGEE         C LC   +  G     + L C SCG  YH  CL  + A + ++
Sbjct: 1820 AETIAGEE-------AFCVLCCQADKIG-----KQLFCTSCGHHYHGGCLHPSVALSPEV 1867

Query: 79   FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTK 138
               + W+CP C++C++CR+ G+ +K + C  CD  YH +C  P    +    + C     
Sbjct: 1868 --RAGWQCPDCKVCQMCRQPGEDSKMLVCDTCDKGYHTFCLKPVMTAIPKNGWKCKNCRV 1925

Query: 139  CHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQR 198
            C  CGS  PG+G S RW L Y+ CD+C +   KG  CP+C K YR      M+ C  C++
Sbjct: 1926 CGDCGSRTPGSGPSSRWHLNYSVCDSCYQQRNKGLSCPLCGKAYRQFTQKAMIQCGTCKK 1985

Query: 199  WVHCQCDGISDEKYL-QFQVDGNLQYRCPTCR 229
             VH +CD   D   L + + +  + Y C  CR
Sbjct: 1986 HVHAECDDAIDNLMLDRVRNEEQVDYMCSVCR 2017



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 58/177 (32%), Positives = 83/177 (46%), Gaps = 12/177 (6%)

Query: 53   RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
            +++ C  CG+ YH  C     +   +     W+C  C +CE C +  D  + + C  CD 
Sbjct: 2160 KLIVCTQCGQCYHPYCAS--VKVTKVILSKGWRCLDCTVCEGCGKPHDEGRLLLCDECDI 2217

Query: 113  AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
            +YH YC  PP   V  G + C     C +CG+  PG G +  W   YT C  C R  +  
Sbjct: 2218 SYHIYCLDPPLDQVPKGTWKCKWCVMCINCGTTTPGFGCN--WQNNYTQCGPC-RSKID- 2273

Query: 173  NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
              CPVC   Y+D E   ++ C  C RW+H  CDG+  E  ++   D    Y+C  CR
Sbjct: 2274 --CPVCRHKYQDDEM--IIQCLQCNRWLHALCDGLRSEDDMERAAD--YDYQCLFCR 2324


>gi|281202543|gb|EFA76745.1| PHD zinc finger-containing protein [Polysphondylium pallidum PN500]
          Length = 604

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 67/209 (32%), Positives = 102/209 (48%), Gaps = 6/209 (2%)

Query: 23  IAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK-NWAQNRDLFHW 81
           +   E  +S T   C +C   +N       ++L C  C + +H  C+         +   
Sbjct: 333 VVPAELKKSTTISTCEVCSHTDNRD-----QLLQCVGCLRSFHGKCINLQTLAIETIKKL 387

Query: 82  SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHS 141
           ++WKC  C++CE C+ T + +K +FC  CD  YH +C +PP +   +G + C     C  
Sbjct: 388 NTWKCTDCKVCEACKDTTNEDKMLFCDVCDRGYHTFCLNPPLERPPTGGWRCSTCVFCIH 447

Query: 142 CGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVH 201
           CG+  PG   +  W   YT C+ C  L  +  YC VC KV +  E +P + C  C RW H
Sbjct: 448 CGTRTPGPQANSAWRGHYTECEQCNVLVAERKYCSVCRKVIKPHEKSPTIQCGYCDRWTH 507

Query: 202 CQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
            QCDG+S     +F+ + N QY+C  CR 
Sbjct: 508 SQCDGMSVSNLEKFKDNPNHQYKCQACRN 536


>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
          Length = 5215

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/225 (32%), Positives = 108/225 (48%), Gaps = 10/225 (4%)

Query: 29  GQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
           GQ++      +C+  +  G      ++ C SCG+ YH  C+   AQ   +   + W+C  
Sbjct: 358 GQASLECENSVCYTCKTLG--DIANLMFCSSCGEHYHGICV-GLAQLPGV--RAGWQCRK 412

Query: 89  CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
           CRIC++CR TGD  K M C +CD  YH  CQ P   ++    + C     C  CGS  PG
Sbjct: 413 CRICQVCRMTGDETKLMTCEQCDKIYHSTCQRPIVTSIPKYGWKCRCCRVCGDCGSRTPG 472

Query: 149 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 208
            GLS RW   YT CD+C +   KG  CP+C + YR      MV C +C+++VH  CD  +
Sbjct: 473 AGLSSRWHAHYTVCDSCYQQRNKGFSCPLCHRAYRAHAHREMVQCTLCRKFVHGTCDPEA 532

Query: 209 DEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMAD 253
           D      + + + +Y        C   ++L      L +R  + D
Sbjct: 533 DLVTYHQRKEAHPEYEY-----VCLMCKNLTQPATLLAKRNSIDD 572



 Score = 98.2 bits (243), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 83/181 (45%), Gaps = 17/181 (9%)

Query: 33  TNVMCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C     ++EGC     ++SC  CG+ YH  C+    +   +     W+C  C 
Sbjct: 702 TQDICVMCGALGTDHEGC-----LISCVQCGQCYHPYCVN--VKITKVVLQKGWRCLDCT 754

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C +  D  + + C  CD +YH YC  PP   V  G + C     C +CG+  P  G
Sbjct: 755 VCEGCGQRNDEARLILCDDCDISYHIYCMDPPLDYVPHGNWKCKWCAICQTCGATDP--G 812

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            +  W      C  CG      N CP C + Y  SE   ++ C  C+RW+H  CD I  E
Sbjct: 813 FNCSWM---NSCSECGPCASHVN-CPSCSEPY--SEGDLIIQCVQCERWLHGTCDSIKTE 866

Query: 211 K 211
           +
Sbjct: 867 E 867


>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
          Length = 5713

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 71/202 (35%), Positives = 99/202 (49%), Gaps = 14/202 (6%)

Query: 32  NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSC 89
           N +V C  C      G      ++ C  CG+ YH +C  L      R     + W+C SC
Sbjct: 375 NGDVTCMQCC-----GMGDVSNLVMCSICGQHYHGSCVGLALLPGVR-----AGWQCASC 424

Query: 90  RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
           R+C++CR+  D +K M C RC+ AYH  C  P   ++    + C     C  CGS  PG 
Sbjct: 425 RVCQVCRQPEDVSKVMLCERCEKAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGA 484

Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
           GLS RW   YT CD+C +   KG  CP+C K YR +    MV C  C+++VH  CD  +D
Sbjct: 485 GLSSRWHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEAD 544

Query: 210 EKYLQF--QVDGNLQYRCPTCR 229
               Q   +V  + +Y C  C+
Sbjct: 545 PLTYQHRKEVKPDYEYVCLHCK 566



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 17/177 (9%)

Query: 33  TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C  +G + EGC     +++C  CG+ YH  C     +   +     W+C  C 
Sbjct: 713 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 765

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C    D  + + C  CD +YH YC  PP   V  G + C     C +CGSN P  G
Sbjct: 766 VCEGCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 823

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
            +  W   YT C  C         C  C + Y + +   ++ C  C+RW+HC CD I
Sbjct: 824 FNSSWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSI 874


>gi|328717947|ref|XP_001943997.2| PREDICTED: hypothetical protein LOC100159693, partial
           [Acyrthosiphon pisum]
          Length = 2904

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 70/208 (33%), Positives = 101/208 (48%), Gaps = 17/208 (8%)

Query: 34  NVMCRLC-FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCR 90
           NV C +C  +G+         ++ C +CG  YH  CL        L     + W+C +CR
Sbjct: 383 NVACGICSTLGD------VSNLMMCTACGSHYHGVCL-----GLALLPGVRAGWQCGNCR 431

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           IC++CR+  +  K M C  CD AYH  C  P    +    + C     C  CGS  PG G
Sbjct: 432 ICQVCRQPAEQTKVMLCEGCDKAYHPGCLRPQVTTIPKIGWKCKCCRVCTDCGSRTPGAG 491

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
           LS RW   YT CD+C +   KG+ CP+C + YR +    MV C  C++++H  CD   + 
Sbjct: 492 LSSRWHAHYTVCDSCYQQRNKGSSCPLCHRAYRAAAHREMVQCISCRKYIHGACD--PEA 549

Query: 211 KYLQFQVDGNL-QYRCPTCRGECYQVRD 237
           +Y+      +  +Y CP C+    Q RD
Sbjct: 550 EYITSHSKSSASEYMCPLCKNAVQQRRD 577



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 53/172 (30%), Positives = 84/172 (48%), Gaps = 10/172 (5%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G ++   ++SC  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 723 ICVMCGSLGTDQEACLISCSQCGQCYHPFCVN--VKVTKVILQKGWRCLDCTVCEGCGQR 780

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D ++ + C  CD +YH YC  P    V  G + C    +C +CGSN P  G +  W   
Sbjct: 781 NDESRLILCDECDISYHIYCTDPKLDYVPRGTWKCKWCAQCLTCGSNDP--GFNCSWLNN 838

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
           YT C  C    +    CP C + Y D++   ++ C  C RW+H +CD I +E
Sbjct: 839 YTECGPCASRSI----CPSCQESYTDNQL--IIKCSQCDRWLHGKCDKIENE 884


>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
          Length = 5622

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 69/202 (34%), Positives = 99/202 (49%), Gaps = 14/202 (6%)

Query: 32  NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSC 89
           N ++ C  C      G      ++ C  CG+ YH +C+        L     + W+C SC
Sbjct: 366 NGDITCMHCC-----GMGDVSNLVMCSICGQHYHGSCV-----GLALLPGVRAGWQCVSC 415

Query: 90  RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
           R+C++CR+  D +K M C RC+ AYH  C  P   ++    + C     C  CGS  PG 
Sbjct: 416 RVCQVCRQPEDVSKVMLCERCEKAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGA 475

Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
           GLS RW   YT CD+C +   KG  CP+C K YR +    MV C  C+++VH  CD  +D
Sbjct: 476 GLSSRWHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEAD 535

Query: 210 EKYLQFQ--VDGNLQYRCPTCR 229
               Q +  V  + +Y C  C+
Sbjct: 536 PLTYQHRKDVKPDYEYVCLHCK 557



 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 81/177 (45%), Gaps = 17/177 (9%)

Query: 33  TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C  +G + EGC     +++C  CG+ YH  C     +   +     W+C  C 
Sbjct: 704 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 756

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C    D  + + C  CD +YH YC  PP   V  G + C     C +CGSN P  G
Sbjct: 757 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 814

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
            +  W   YT C  C         C  C + Y  +E   ++ C  C+RW+HC CD I
Sbjct: 815 FNSSWQKNYTQCGPCA----SHAACISCQETY--TEGDLIIQCIQCERWLHCACDSI 865


>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
          Length = 5619

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 69/202 (34%), Positives = 99/202 (49%), Gaps = 14/202 (6%)

Query: 32  NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSC 89
           N ++ C  C      G      ++ C  CG+ YH +C+        L     + W+C SC
Sbjct: 366 NGDITCMHCC-----GMGDVSNLVMCSICGQHYHGSCV-----GLALLPGVRAGWQCVSC 415

Query: 90  RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
           R+C++CR+  D +K M C RC+ AYH  C  P   ++    + C     C  CGS  PG 
Sbjct: 416 RVCQVCRQPEDVSKVMLCERCEKAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGA 475

Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
           GLS RW   YT CD+C +   KG  CP+C K YR +    MV C  C+++VH  CD  +D
Sbjct: 476 GLSSRWHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEAD 535

Query: 210 EKYLQFQ--VDGNLQYRCPTCR 229
               Q +  V  + +Y C  C+
Sbjct: 536 PLTYQHRKDVKPDYEYVCLHCK 557



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 81/177 (45%), Gaps = 17/177 (9%)

Query: 33  TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C  +G + EGC     +++C  CG+ YH  C     +   +     W+C  C 
Sbjct: 704 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 756

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C    D  + + C  CD +YH YC  PP   V  G + C     C +CGSN P  G
Sbjct: 757 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 814

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
            +  W   YT C  C         C  C + Y  +E   ++ C  C+RW+HC CD I
Sbjct: 815 FNSSWQKNYTQCGPCA----SHTACISCQETY--TEGDLIIQCIQCERWLHCACDSI 865


>gi|320166419|gb|EFW43318.1| mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1858

 Score =  127 bits (318), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 93/211 (44%), Gaps = 20/211 (9%)

Query: 36  MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKN-----WAQNRDLFHWSSWKCPSCR 90
           +CR C  G     E    M  C  C + YH  C+K+     +          SWKC  C 
Sbjct: 578 LCRGC--GTRGTDEETSGMHWCNQCCQPYHDFCVKSSFGDAYESTLKEIAQGSWKCWDCI 635

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GN 149
           +C  C  +      + C  C    H  C   P   V SG +LC +  KC SCG+  P G 
Sbjct: 636 VCTTCNSSFPEETLVVCDNCAVGRHLGCMDIPLAEVPSGRWLCSQCVKCDSCGAQTPRGM 695

Query: 150 GLS-----------VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQ 197
           G +             W   Y+ C  CG L  +GNYC VC KVY D +  TPM+ C+ C 
Sbjct: 696 GKTRLPSSFPSSQPCEWMFDYSLCQPCGLLKARGNYCRVCEKVYEDDDYDTPMISCEQCS 755

Query: 198 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
            W+H  C G+ +E Y  +  D NL + CP+C
Sbjct: 756 MWLHTHCVGMDEETYEMYSNDENLAFTCPSC 786


>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
          Length = 3474

 Score =  125 bits (315), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 75/224 (33%), Positives = 107/224 (47%), Gaps = 25/224 (11%)

Query: 35  VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRIC 92
           V C  C+     G      ++ C  CG+ YH +C  L      R     + W+C SCR+C
Sbjct: 365 VTCVQCY-----GMGDVSNLVMCSVCGQHYHGSCVGLALLPGVR-----AGWQCVSCRVC 414

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
           ++CR+  D +K M C RCD AYH  C  P   ++    + C     C  CGS  PG GLS
Sbjct: 415 QVCRQPEDVSKVMLCERCDKAYHPGCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLS 474

Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
            RW   YT CD+C +   KG  CP+C K YR +    MV C  C+++VH  CD  +D   
Sbjct: 475 SRWHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCHGCKKFVHGTCDPEADPLT 534

Query: 213 LQFQVDG--NLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADK 254
            Q + +   + +Y C  C+           ++  + RRKD  D+
Sbjct: 535 YQQRKEAKPDYEYVCLHCK-----------SIAMVARRKDSIDE 567



 Score =  105 bits (262), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 85/180 (47%), Gaps = 17/180 (9%)

Query: 33  TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C  +G + EGC     +++C  CG+ YH  C     +   +     W+C  C 
Sbjct: 700 TQDICVMCGSIGMDQEGC-----LIACVQCGQCYHPYCAG--VKITKVILQKGWRCLDCT 752

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C    D  + + C  CD +YH YC  PP   V  G + C    +C +CGSN P  G
Sbjct: 753 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAQCQTCGSNDP--G 810

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            +  W   YT C  C         C VC + Y++ +   ++ C  C+RW+HC CD I  E
Sbjct: 811 FNSSWQKSYTQCGPCA----SHTACVVCQEAYQEGDL--IIQCVQCERWLHCGCDSIKSE 864


>gi|242016925|ref|XP_002428945.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
 gi|212513774|gb|EEB16207.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
          Length = 6073

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 74/236 (31%), Positives = 111/236 (47%), Gaps = 22/236 (9%)

Query: 39  LCFVGENEGCE------RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCR 90
           L  +GE   C           +L C SCG  +H +C+        L     + W+C  CR
Sbjct: 370 LLPIGETSQCSTCLSLGNVSNILMCTSCGAHHHGSCV-----GLALLPGVRAGWQCFECR 424

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +C++CR+  +  K M C  CD AYH  C  P   ++    + C     C  CGS  PG+G
Sbjct: 425 VCQVCRQPSEIGKIMLCESCDKAYHPSCLRPIVTSIPKYGWKCKCCRVCSDCGSRTPGSG 484

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
           LS RW   ++ CD+C +   KG  CPVC + YR +    MV C  C+++VH  CD  +D 
Sbjct: 485 LSSRWHNHFSVCDSCYQQRNKGFCCPVCGRAYRAAAHREMVQCIKCRKYVHGSCDNEADI 544

Query: 211 KYLQFQVDGN--LQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAA 264
                + + N   +Y C  C       ++L    R+  +RKD  D+ L+ S  +A+
Sbjct: 545 SVYAARKETNPDYEYICCIC-------KNLNSMGRQGIKRKDSFDEALLESSLSAS 593



 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 94/196 (47%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           +C +C     + EGC     ++SC  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 723 ICVMCGALGTDQEGC-----LISCAQCGQCYHPYCVN--VKVTKVILQKGWRCLDCTVCE 775

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  D ++   C  CD +YH YC  PP   V  G + C     C  CGSN P  G + 
Sbjct: 776 GCGQRNDDSRLTLCDDCDISYHIYCMDPPLDYVPRGVWKCKWCVVCIRCGSNDP--GFNC 833

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W  GYT C  C        +CP CL+ Y + E   ++ C+ C+RW+H  CDGI ++  L
Sbjct: 834 NWMNGYTECGPCA----SHTFCPSCLEPYVEGEL--VIQCEQCERWLHGSCDGIRND--L 885

Query: 214 QFQVDGNLQYRCPTCR 229
             +   + +Y C  CR
Sbjct: 886 DAEKCADEKYTCVLCR 901


>gi|326679526|ref|XP_001919281.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Danio rerio]
          Length = 3915

 Score =  122 bits (307), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 65/191 (34%), Positives = 92/191 (48%), Gaps = 12/191 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C V  + G     R+LSC  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 20  MCVVCGSFGRGAEGRLLSCSQCGQCYHPFCVN--IKITKVVLSKGWRCLECTVCEACGQA 77

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            DP + + C  CD +YH YC  PP +NV  G + C     C  CG+  P  GL   W   
Sbjct: 78  SDPGRLLLCDDCDISYHTYCLDPPLQNVPKGSWKCKWCVLCTHCGATSP--GLRCEWQNN 135

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
           YT C  C  L V    CPVC + YR+ E   ++ C  C RWVH  C G++ ++ ++   D
Sbjct: 136 YTQCGPCASLTV----CPVCTRSYREEEL--ILQCRQCDRWVHGSCQGLNSDEDVENAAD 189

Query: 219 GNLQYRCPTCR 229
               + C  CR
Sbjct: 190 EG--FDCTLCR 198


>gi|291240901|ref|XP_002740354.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like
            [Saccoglossus kowalevskii]
          Length = 4402

 Score =  122 bits (305), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 100/202 (49%), Gaps = 20/202 (9%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCRICEI 94
            +CF+  + G    + ++ C  C + +H  CL    K    N D+     W C  C+ C +
Sbjct: 1052 VCFLCASTG---QQELVYCNVCCEPFHEFCLEEDEKPLDDNTDI-----WCCKRCKFCHV 1103

Query: 95   CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGL 151
            C R  +    + C +C   YH  C  P +    +     ++C K  +C SCG+  PG   
Sbjct: 1104 CGRQQN---LLQCDKCHNTYHAECLGPNYPTKPTKKKKVWICTKCVRCKSCGATTPGQSS 1160

Query: 152  SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDE 210
            S +W   ++ C  CG+LF  GNYCP+C + Y D +  + M+ C  C+ WVH +C+G++DE
Sbjct: 1161 SAQWSHDFSLCQDCGKLFDIGNYCPLCQQCYTDDDYDSKMMQCPCCESWVHAKCEGLTDE 1220

Query: 211  KY-LQFQVDGNLQYRCPTCRGE 231
             Y +  +   ++ Y C  C+ E
Sbjct: 1221 MYQIMCEFPEDIHYTCSKCQPE 1242


>gi|449684588|ref|XP_002166105.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Hydra
           magnipapillata]
          Length = 229

 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 99/195 (50%), Gaps = 18/195 (9%)

Query: 53  RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           ++++C  C    H +CL        +     W+C  C++C  C    D ++ MFC  CD 
Sbjct: 14  KLINCSQCSNGGHPSCLDMNKSLLKVIKGYPWQCMECKVCTECLAPHDEHEMMFCDNCDR 73

Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFLGYT----------- 160
            YH YC     K +  G ++C +  KC SC S  P  +G S RW + +T           
Sbjct: 74  GYHSYCVGV--KEIPKGRWVCNRCGKCCSCLSRQPVSDGGSGRWKMEFTKPTDGSEPEFL 131

Query: 161 --CCDACGRLFVKGNYCPVCLKVYRDSEST--PMVCCDVCQRWVHCQCDGISDEKYLQFQ 216
              C  C  LF KG++CPVCLKVY D +    PMVCCD C RW+H  CDGI +++Y++  
Sbjct: 132 QNHCRKCSILFRKGSFCPVCLKVYCDDDGVVNPMVCCDNCDRWIHTDCDGIDEQRYIELS 191

Query: 217 VDGNLQYRCPTCRGE 231
            D +  Y C  CRGE
Sbjct: 192 KDHHSAYTCVLCRGE 206


>gi|297736277|emb|CBI24915.3| unnamed protein product [Vitis vinifera]
          Length = 75

 Score =  120 bits (302), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 57/73 (78%), Positives = 62/73 (84%)

Query: 680 MDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGK 739
           MDANWILKKLGKDAIGKRVEVHQ SDNSWHKG+V D +EGTSTL +  DD R KTLELGK
Sbjct: 1   MDANWILKKLGKDAIGKRVEVHQSSDNSWHKGMVIDFIEGTSTLIVKFDDGRAKTLELGK 60

Query: 740 QGVRFVPQKQKRS 752
           Q +R + QKQKRS
Sbjct: 61  QAIRLISQKQKRS 73


>gi|427798455|gb|JAA64679.1| Putative phagocytosis engulfment, partial [Rhipicephalus
           pulchellus]
          Length = 951

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)

Query: 41  FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +VG    C+    M+S      C  CG  YH  CL              W+CP C+ C+ 
Sbjct: 18  YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 76

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
           C R GD  + + C  CD A+H YC  P   NV    + C     C  CGS  PG+G S R
Sbjct: 77  CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 136

Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 213
           W + Y+ CD+C +   KG  CP+C K YR  S    M  C VC++++H +CDG    +  
Sbjct: 137 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 192

Query: 214 QFQVDGNLQYRCPTC 228
               DG+  Y CP C
Sbjct: 193 SSPKDGD--YVCPVC 205



 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 66/196 (33%), Positives = 96/196 (48%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           +C +C  F    EG     R+++C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 380 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCE 432

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  D ++ + C  CD +YH YC  PP + V  G + C     C  CG+  PGNG   
Sbjct: 433 GCGQPHDESRLLLCDECDISYHTYCLSPPLETVPQGNWKCRWCVICVKCGATEPGNG--S 490

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
           +W   YT C  C  +      CP+CL  Y+DSE   ++ C  C+RW+H  CD IS E+  
Sbjct: 491 QWQNNYTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEEDA 544

Query: 214 QFQVDGNLQYRCPTCR 229
           +   +    Y CP CR
Sbjct: 545 ERCAE--YGYNCPYCR 558


>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
          Length = 4906

 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)

Query: 36   MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
            MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 942  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 994

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
             C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+  P  GL  
Sbjct: 995  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1052

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
             W   YT C  C  L    + CP+C + YRD E   ++ C  C RW+H  C  ++ E+ +
Sbjct: 1053 EWQNNYTQCAPCASL----STCPICYRTYRDEEL--IIQCRQCDRWMHAICQNLNTEEEV 1106

Query: 214  QFQVDGNLQYRCPTCR 229
            +   D  + + C  CR
Sbjct: 1107 ENIAD--MGFDCTICR 1120



 Score = 97.1 bits (240), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/179 (30%), Positives = 82/179 (45%), Gaps = 21/179 (11%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C +CG+ YH  CL            + W+CP C++C+ C+ +G+ NK + C  CD  Y
Sbjct: 359 LFCTTCGQHYHGMCLDIQVTP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 415

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P   +V +  + C     C  CG+       S +W      CD+C +   + N 
Sbjct: 416 HTFCLQPVMDSVPTNGWKCKNCRVCAECGTRT-----SCQWHHNCLVCDSCYQQ--QDNL 468

Query: 175 -CPVCLKVYRDSESTPMVCCDVCQRWVHCQCD---GISDEKYLQFQVDGNLQYRCPTCR 229
            CP C K+        M+ C +C+RW+H +CD   GI  E  L+        Y C  C+
Sbjct: 469 SCPFCDKLCLQDFQKDMLHCHMCKRWIHMECDRSPGIELESQLK-------DYICTLCK 520


>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
            guttata]
          Length = 4871

 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)

Query: 36   MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
            MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 912  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 964

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
             C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+  P  GL  
Sbjct: 965  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1022

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
             W   YT C  C  L    + CP+C + YRD E   ++ C  C RW+H  C  ++ E+ +
Sbjct: 1023 EWQNNYTQCAPCASL----STCPICYRTYRDEEL--IIQCRQCDRWMHAICQNLNTEEEV 1076

Query: 214  QFQVDGNLQYRCPTCR 229
            +   D  + + C  CR
Sbjct: 1077 ENIAD--MGFDCTICR 1090



 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/176 (30%), Positives = 80/176 (45%), Gaps = 15/176 (8%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C +CG+ YH  CL            + W+CP C++C+ C+ +G+ NK + C  CD  Y
Sbjct: 327 LFCTTCGQHYHGMCLDIQVTP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 383

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P    V +  + C     C  CG+       S +W      CD+C +   + N 
Sbjct: 384 HTFCLQPVMDAVPTNGWKCKNCRVCAECGTRT-----SCQWHHNCLVCDSCYQQ--QDNL 436

Query: 175 -CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            CP C K+        M+ C +C+RW+H  CD  S    L+ Q+     Y C  CR
Sbjct: 437 SCPFCEKLCLQDFQKDMLHCHMCKRWIHIDCDR-SPGSELESQLK---DYICTLCR 488


>gi|241859648|ref|XP_002416243.1| hypothetical protein IscW_ISCW023204 [Ixodes scapularis]
 gi|215510457|gb|EEC19910.1| hypothetical protein IscW_ISCW023204 [Ixodes scapularis]
          Length = 1179

 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 91/195 (46%), Gaps = 15/195 (7%)

Query: 41  FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +VG    C+    M+S      C  CG  YH  CL + A          W+CP C+ C+ 
Sbjct: 18  YVGSQANCQSCEEMVSPSELLFCTLCGAHYHGFCL-DPAVRVTTSTRVGWQCPDCKACQA 76

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
           CRR GD  + + C  CD  +H YC  P   NV    + C     C  CGS  PG+G S R
Sbjct: 77  CRRPGDEARLLTCDICDKGFHVYCVKPVVANVPKHGWKCQNCRVCGDCGSRTPGSGPSSR 136

Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 213
           W + ++ CD+C +   KG  CP+C K YR  S    M  C +C++++H +CD        
Sbjct: 137 WHMNFSVCDSCYQQRNKGVACPLCGKAYRQFSHREDMAQCTMCRKYIHMECDS------- 189

Query: 214 QFQVDGNLQYRCPTC 228
           Q     +  Y CP C
Sbjct: 190 QLANHQDADYVCPVC 204



 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 94/196 (47%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           +C +C  F    EG     R+++C  CG+ YH  C+ N    R +     W+C  C +CE
Sbjct: 287 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCV-NVKVTRMILK-KGWRCLDCTVCE 339

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  D ++ + C  CD +YH YC  PP +NV  G + C     C  CG+  PG G   
Sbjct: 340 GCGQPHDESRLLLCDECDISYHTYCLSPPLENVPQGNWKCRWCVVCLQCGATDPGFG--S 397

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C         CP+CL  Y++++   ++ C  C+RW+H  CD I+ E+  
Sbjct: 398 HWQNNYTQCGPC----ASKTSCPLCLLKYQENDL--VIQCVQCERWMHGFCDQIACEE-- 449

Query: 214 QFQVDGNLQYRCPTCR 229
             +      Y CP CR
Sbjct: 450 DAEKCAEYGYNCPYCR 465


>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
            niloticus]
          Length = 4872

 Score =  119 bits (298), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 89/191 (46%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 991  MCVVCGSFGLGAEGRLLACAQCGQCYHPFCVG--IKITKVVLSKGWRCLECTVCEACGQA 1048

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP +NV    + C     C  CG+  P  GL   W   
Sbjct: 1049 TDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTP--GLRCEWQNN 1106

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CP+CL  Y  SE T +V C  C RW H  C  +  E+ ++   D
Sbjct: 1107 YTLCAPCASL----STCPICLVDY--SEGTIIVQCRQCDRWFHASCQSLHSEEDIEKAAD 1160

Query: 219  GNLQYRCPTCR 229
             +  + C  CR
Sbjct: 1161 SS--FDCTMCR 1169



 Score = 90.1 bits (222), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 73/175 (41%), Gaps = 11/175 (6%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C SCG  YH  CL            + W+CP C++C+ C+  G+  K + C  CD  Y
Sbjct: 357 LFCTSCGLHYHGICLDMAVTP---LRRAGWQCPECKVCQTCKNPGEDTKMLVCDMCDKGY 413

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P    + +  + C     C  CG+   G     +W      C+ C +       
Sbjct: 414 HTFCLQPVIDTLPTNGWRCQNCRVCLQCGTRTGG-----QWHHTSLLCENCVQNQDPALC 468

Query: 175 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           CP+C  +        +V C  C+RW+H +C+    +   Q ++     Y C  CR
Sbjct: 469 CPMCSCILDPEHHKDLVFCHTCKRWLHLECE---RQNSGQAEIHPREDYVCSNCR 520


>gi|358334996|dbj|GAA53428.1| histone-lysine N-methyltransferase MLL3, partial [Clonorchis
           sinensis]
          Length = 3518

 Score =  119 bits (297), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 70/230 (30%), Positives = 103/230 (44%), Gaps = 19/230 (8%)

Query: 8   ARRFESGYVATASKDIAGEEQGQSNTNVMCRL--CFVGENEGCERA------RRMLSCKS 59
           +RR ++   +     +  E+    +T V+CR    FV E + C           +L+C  
Sbjct: 469 SRRTKTRKTSMYESKMPDEKDDHPSTVVLCRADDTFVLEQDMCVACGSFGLDTVLLACAQ 528

Query: 60  CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQ 119
           CG+ YH  C       R +     W+C  C +CE C  T + +  + C  CD +YH YC 
Sbjct: 529 CGQCYHPFCADVPKITRTMLE-KGWRCLDCTVCEGCGGTTNESLLLLCDDCDISYHTYCL 587

Query: 120 HPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCL 179
            PP + V  G + C +   C +CG   P  GL+ +W   Y+ C  C  L      CPVC 
Sbjct: 588 DPPLQEVPKGGWKCSECVVCTNCGQRDP--GLNGKWHANYSMCAPCASLAT----CPVCT 641

Query: 180 KVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
             YR+ E   ++ C +C RW H  CD +  E  L+   D  + Y C  CR
Sbjct: 642 LAYREGEL--LIRCALCSRWSHAGCDQLRTEDELELATD--MGYNCLLCR 687



 Score = 96.3 bits (238), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 62/217 (28%), Positives = 82/217 (37%), Gaps = 42/217 (19%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C  CG  YH +CL+   Q         W+C  C+ C IC  + D NK + C  CD  
Sbjct: 53  LLFCTGCGSHYHGSCLEPSLQPNPTIRIG-WQCAECKACLICNESKDENKMLVCDVCDKG 111

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCG----SNVPGNGLSV----------RWFLGY 159
           +H YC  PP   +    + C +   C  CG    S V G G+ V          +W   Y
Sbjct: 112 FHTYCLRPPVSCIPRNGFKCERCRVCSDCGAGRASTVSGLGVMVEFNNPQLPVIKWHSNY 171

Query: 160 TCCDACGRLFVKGNY-CPVCLKVYRDSESTPMVC----------------CDVCQRWVHC 202
           T CD C     +    CPVC + +R S   P                   C  C+R VH 
Sbjct: 172 TLCDRCFHSRKRPTASCPVCERAWRCSLQVPSYISTQPQTSTHVTWPGRRCTKCRRMVHA 231

Query: 203 QCDGI----------SDEKYLQFQVDGNLQYRCPTCR 229
            CD +          S        + G + Y CP CR
Sbjct: 232 DCDPLQSVATGTASPSSAMSEDTNIAGGIAYSCPVCR 268


>gi|432916836|ref|XP_004079403.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
            latipes]
          Length = 4802

 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 89/191 (46%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 965  MCVVCGSFGLGAEGRLLACAQCGQCYHPFCVG--IKINKVVLSKGWRCLECTVCEACGQA 1022

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP +NV    + C     C  CG+  P  GL   W   
Sbjct: 1023 TDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTP--GLRCEWQSN 1080

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CP+CL  Y  SE T +V C  C RW H  C G+  E  L+   +
Sbjct: 1081 YTQCAPCASL----STCPICLVNY--SEGTVIVQCRQCDRWFHASCQGLHSEDDLEKAAE 1134

Query: 219  GNLQYRCPTCR 229
             +  + C  C+
Sbjct: 1135 NS--FDCTICQ 1143



 Score = 90.5 bits (223), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 75/175 (42%), Gaps = 11/175 (6%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C SCG  YH  CL            + W+CP C++C+ C+  GD  K + C  CD  Y
Sbjct: 357 LFCTSCGLHYHGMCLDMAVTP---LRRAGWQCPECKVCQTCKNHGDDTKMLVCDMCDKGY 413

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P  +++ +  + C     C  CG+   G+     W      C+ C +       
Sbjct: 414 HTFCLQPAMESLPTNGWRCKNCRVCIQCGTRTSGH-----WHHNSLLCENCFQNQDPALC 468

Query: 175 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           C +C  +        ++ C  C+RW+H +C+    +   Q +++    Y C  CR
Sbjct: 469 CSMCSCILDPEHHKDLLFCQTCKRWLHLECE---RQNSGQTEINPREDYVCFNCR 520


>gi|301606681|ref|XP_002932945.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 2 [Xenopus
            (Silurana) tropicalis]
          Length = 3840

 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 99/199 (49%), Gaps = 16/199 (8%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +CF+  + G       + C+ C + +HR CL+   +  +     +W C  C+ C +C R 
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433

Query: 99   GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSNVPGNGL 151
                K +  C +C  +YH  C  P   N  + P      ++C K  +C SCGS  PG G 
Sbjct: 1434 QQATKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKCVRCKSCGSTTPGKGW 1490

Query: 152  SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDE 210
              +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ ++DE
Sbjct: 1491 DAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGKCDRWVHSKCENLTDE 1550

Query: 211  KY-LQFQVDGNLQYRCPTC 228
             Y +   +  ++ Y C  C
Sbjct: 1551 MYEILSNLPESVAYTCINC 1569


>gi|47228227|emb|CAG07622.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 4527

 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 91/201 (45%), Gaps = 14/201 (6%)

Query: 31  SNTNVMCR--LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
           SN N   +  +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  
Sbjct: 661 SNDNFTLKQDMCVVCGSFGLGAEGRLLACAQCGQCYHPYCVG--IKINKVVLSKGWRCLE 718

Query: 89  CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
           C +CE C +  DP + + C  CD +YH YC  PP +NV    + C     C  CG+  P 
Sbjct: 719 CTVCEACGQATDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVTCTQCGATTP- 777

Query: 149 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 208
            GL   W   YT C  C  L      CP+CL  Y  SE T ++ C  C RW H  C  + 
Sbjct: 778 -GLRCEWQKNYTQCAPCASLMT----CPICLVDY--SEGTTILQCRQCDRWFHASCQSLH 830

Query: 209 DEKYLQFQVDGNLQYRCPTCR 229
            E+ ++   +    + C  CR
Sbjct: 831 SEEDVEKAAENG--FNCTMCR 849



 Score = 41.2 bits (95), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           L C SCG+ YH  CL            + W+CP C+IC+ C+
Sbjct: 138 LFCTSCGQHYHGICLDMAVTP---LRRAGWQCPECKICQTCK 176


>gi|358416718|ref|XP_003583467.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
          Length = 2688

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1187 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1245

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +LF
Sbjct: 1246 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1304

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1305 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1364

Query: 228  CRGECY 233
            C G  +
Sbjct: 1365 CAGATH 1370


>gi|359075420|ref|XP_003587289.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
          Length = 2711

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1210 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1268

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +LF
Sbjct: 1269 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1327

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1328 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1387

Query: 228  CRGECY 233
            C G  +
Sbjct: 1388 CAGATH 1393


>gi|440894918|gb|ELR47236.1| Histone-lysine N-methyltransferase MLL4, partial [Bos grunniens
            mutus]
          Length = 2524

 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1066 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1124

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +LF
Sbjct: 1125 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1183

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1184 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1243

Query: 228  CRGECY 233
            C G  +
Sbjct: 1244 CAGATH 1249


>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 3691

 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 69/214 (32%), Positives = 95/214 (44%), Gaps = 22/214 (10%)

Query: 26  EEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
           EE    NT VM            +C V  + G     R+L+C  CG+ YH  C+ N    
Sbjct: 128 EENSMHNTVVMFSTTDHFTLKQDMCVVCGSFGQGAEGRLLACSQCGQCYHPFCV-NVKMT 186

Query: 76  RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           R +     W+C  C +CE C    DP + + C  CD +YH YC  PP   V  G + C  
Sbjct: 187 RVVL-TKGWRCLECTVCEACGEASDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKW 245

Query: 136 HTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
             +C  CGS+ P  G+   W   Y+CC  CG L      CP+C + Y   E   ++ C  
Sbjct: 246 CVRCVQCGSSSP--GVRCDWQDNYSCCGPCGSL----RRCPLCQRPYAHDEL--IMQCQQ 297

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           C RWVH  C  +  E+ ++   D    + C  CR
Sbjct: 298 CDRWVHATCQNLMCEEDVEAAADEG--FDCSLCR 329


>gi|431918577|gb|ELK17795.1| Histone-lysine N-methyltransferase MLL4 [Pteropus alecto]
          Length = 3017

 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1517 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1575

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1576 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1634

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1635 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1694

Query: 228  CRGECY 233
            C G  +
Sbjct: 1695 CAGATH 1700


>gi|403293026|ref|XP_003937525.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
            boliviensis boliviensis]
          Length = 2665

 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 18/200 (9%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1165 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1223

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1224 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1282

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1283 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1342

Query: 228  CRGECYQVRDLEDAVRELWR 247
            C G          AV+  WR
Sbjct: 1343 CAG----------AVQPRWR 1352


>gi|328874899|gb|EGG23264.1| PHD zinc finger-containing protein [Dictyostelium fasciculatum]
          Length = 758

 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 89/184 (48%), Gaps = 3/184 (1%)

Query: 53  RMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCD 111
           +  SC  C + YH  CL+ N      +    +WKC  C++CE+C      +K MFC  CD
Sbjct: 494 QTFSCIGCHRVYHGKCLQLNQLAIDTIKRNGNWKCIDCKLCEVCNEGVHEDKMMFCDVCD 553

Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 171
             YH +C  P      +G + C +   C  CGS   G   S +W   YT C+ C      
Sbjct: 554 KGYHTFCCSPKLDAPPTGGWKCSQCVHCIHCGSRSAGPSSSSKWNANYTVCEVCTPKVQD 613

Query: 172 GNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL-QYRCPTCR 229
             YC VC K+ + S E  P+V C  C RW H  CD I++E   + + + N  QY+CPTC+
Sbjct: 614 KKYCTVCRKIIKSSGEKKPIVQCVYCDRWTHAGCDSITEEFLEKMKENPNYHQYKCPTCK 673

Query: 230 GECY 233
              Y
Sbjct: 674 TGNY 677


>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Ornithorhynchus anatinus]
          Length = 4910

 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)

Query: 36   MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
            MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 962  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1014

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
             C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+  P  GL  
Sbjct: 1015 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1072

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
             W   YT C  C  L    + CPVC + YR+ E   ++ C  C RW+H  C  ++ E+ +
Sbjct: 1073 EWQNNYTQCAPCASL----STCPVCYRNYREEEL--ILQCRQCDRWMHAICQNLNTEEEV 1126

Query: 214  QFQVDGNLQYRCPTCR 229
            +   D  + + C  CR
Sbjct: 1127 ENIAD--IGFDCTMCR 1140



 Score = 99.8 bits (247), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 83/173 (47%), Gaps = 12/173 (6%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+ +G+ NK + C  CD  YH 
Sbjct: 354 CTTCGQHYHGMCLDIAITP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHT 410

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P   +V +  + C     C  CG+       S +W      CD+C +   +   CP
Sbjct: 411 FCLQPVIDSVPTNGWKCKNCRVCAECGTRT-----SAQWHHNCLVCDSCYQQ-QESLSCP 464

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            C K Y       M+ C +C+RW+H +CD  +D + L+ Q+    +Y C  C+
Sbjct: 465 FCGKYYHPDFQKDMLHCHMCKRWIHIECDKPTDTE-LESQL--REEYICMFCK 514


>gi|296233585|ref|XP_002807874.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4 [Callithrix jacchus]
          Length = 2660

 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 18/200 (9%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1160 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1218

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1219 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1277

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1278 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1337

Query: 228  CRGECYQVRDLEDAVRELWR 247
            C G          AV+  WR
Sbjct: 1338 CAG----------AVQPRWR 1347


>gi|115495457|ref|NP_083550.2| histone-lysine N-methyltransferase MLL4 [Mus musculus]
 gi|341940998|sp|O08550.3|MLL4_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
            Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 4
            homolog; AltName: Full=Trithorax homolog 2; AltName:
            Full=WW domain-binding protein 7; Short=WBP-7
          Length = 2713

 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279

Query: 113  AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C  L+
Sbjct: 1280 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 1338

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398

Query: 228  CRG 230
            C G
Sbjct: 1399 CAG 1401


>gi|56744180|dbj|BAD81031.1| mixed lineage leukemia 2 [Mus musculus]
          Length = 2713

 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279

Query: 113  AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C  L+
Sbjct: 1280 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 1338

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398

Query: 228  CRG 230
            C G
Sbjct: 1399 CAG 1401


>gi|410905295|ref|XP_003966127.1| PREDICTED: uncharacterized protein LOC101073293 [Takifugu rubripes]
          Length = 3463

 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 10/195 (5%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RR 97
            +CF+  ++G      ML C+ C + +HR CL+  A+     +  +W C  CR C +C R+
Sbjct: 1732 VCFLCASKG---QHEMLHCQVCCEPFHRFCLEP-AERPSEENKENWCCRRCRFCHVCGRK 1787

Query: 98   TGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
              +    + C RC   YH  C  P  P +N     ++C    +C SCG   PG    + W
Sbjct: 1788 NKNSKPLLECERCQNCYHASCLGPSYPKQNKKRKTWVCVTCIRCKSCGV-TPGKSWDIDW 1846

Query: 156  FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-L 213
                  C  C +LF  GNYCP+C K Y D++  + M+ C  C  WVH +C+ ++DE Y +
Sbjct: 1847 NHEKGLCQDCSKLFEMGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDLTDELYEI 1906

Query: 214  QFQVDGNLQYRCPTC 228
               +  ++ Y C  C
Sbjct: 1907 LSSLPESVVYSCRPC 1921


>gi|402905199|ref|XP_003915410.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Papio
            anubis]
          Length = 2716

 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1216 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1274

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1275 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1333

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1334 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1393

Query: 228  CRG 230
            C G
Sbjct: 1394 CAG 1396


>gi|335289510|ref|XP_003127115.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Sus scrofa]
          Length = 2721

 Score =  115 bits (289), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1280 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1338

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398

Query: 228  CRGECY 233
            C G  +
Sbjct: 1399 CAGATH 1404


>gi|5923931|gb|AAD56420.1|AF186605_1 MLL2 protein [Homo sapiens]
          Length = 2605

 Score =  115 bits (289), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1105 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1163

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1164 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1222

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1223 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1282

Query: 228  CRG 230
            C G
Sbjct: 1283 CAG 1285


>gi|7662046|ref|NP_055542.1| histone-lysine N-methyltransferase MLL4 [Homo sapiens]
 gi|12643900|sp|Q9UMN6.1|MLL4_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
            Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein
            4; AltName: Full=Trithorax homolog 2; AltName: Full=WW
            domain-binding protein 7; Short=WBP-7
 gi|5123787|emb|CAB45385.1| trithorax homologue 2 [Homo sapiens]
          Length = 2715

 Score =  115 bits (289), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1215 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1273

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1274 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1332

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1333 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1392

Query: 228  CRG 230
            C G
Sbjct: 1393 CAG 1395


>gi|71891784|dbj|BAA20763.3| KIAA0304 protein [Homo sapiens]
          Length = 2415

 Score =  115 bits (289), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 915  LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 973

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 974  AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1032

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1033 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1092

Query: 228  CRG 230
            C G
Sbjct: 1093 CAG 1095


>gi|432100936|gb|ELK29286.1| Histone-lysine N-methyltransferase MLL4 [Myotis davidii]
          Length = 2566

 Score =  115 bits (289), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 950  LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1008

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1009 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1067

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1068 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1127

Query: 228  CRGECY 233
            C G  +
Sbjct: 1128 CAGATH 1133


>gi|395846912|ref|XP_003796132.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Otolemur
            garnettii]
          Length = 2714

 Score =  115 bits (289), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1210 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1268

Query: 113  AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1269 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1327

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1328 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1387

Query: 228  CRG 230
            C G
Sbjct: 1388 CAG 1390


>gi|426388428|ref|XP_004060643.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Gorilla
            gorilla gorilla]
          Length = 2536

 Score =  115 bits (288), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1120 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1178

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1179 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1237

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1238 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1297

Query: 228  CRG 230
            C G
Sbjct: 1298 CAG 1300


>gi|332855019|ref|XP_512597.3| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Pan
            troglodytes]
          Length = 2526

 Score =  115 bits (288), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1026 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1084

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1085 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1143

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1144 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1203

Query: 228  CRG 230
            C G
Sbjct: 1204 CAG 1206


>gi|351711122|gb|EHB14041.1| Histone-lysine N-methyltransferase MLL4, partial [Heterocephalus
            glaber]
          Length = 2592

 Score =  115 bits (288), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1072 LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRASKHLLECERCCH 1130

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1131 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1189

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1190 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1249

Query: 228  CRG 230
            C G
Sbjct: 1250 CAG 1252


>gi|397490588|ref|XP_003816282.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4-like [Pan paniscus]
          Length = 2776

 Score =  115 bits (288), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1276 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1334

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1335 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1393

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1394 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCRP 1453

Query: 228  CRG 230
            C G
Sbjct: 1454 CAG 1456


>gi|441627688|ref|XP_003280142.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4-like [Nomascus leucogenys]
          Length = 2433

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1192 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1250

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1251 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1309

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1310 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1369

Query: 228  CRG 230
            C G
Sbjct: 1370 CAG 1372


>gi|444509617|gb|ELV09373.1| Histone-lysine N-methyltransferase MLL4, partial [Tupaia chinensis]
          Length = 2209

 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1067 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1125

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1126 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1184

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1185 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1244

Query: 228  CRG 230
            C G
Sbjct: 1245 CAG 1247


>gi|297276803|ref|XP_001112093.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Macaca
            mulatta]
          Length = 2789

 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1337 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1395

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1396 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCIQLY 1454

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1455 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1514

Query: 228  CRG 230
            C G
Sbjct: 1515 CAG 1517


>gi|33990004|gb|AAH56344.1| Wbp7 protein, partial [Mus musculus]
          Length = 2013

 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
           ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 512 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 570

Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C  L+
Sbjct: 571 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 629

Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
            KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 630 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 689

Query: 228 CRG 230
           C G
Sbjct: 690 CAG 692


>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 4301

 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 84/177 (47%), Gaps = 17/177 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F   +EG     ++L+C  C + YH  C+ +      L     W+C  C +CE
Sbjct: 188 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 240

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
           +C +  DP++ + C  CD +YH YC  PP  NV  G + C     C  CGSN P  G   
Sbjct: 241 MCGKASDPSRLLLCDDCDVSYHTYCLEPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 298

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            W   YT C  C  L      CPVC + + + E   ++ C  C RWVH  C+ +  E
Sbjct: 299 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTE 349


>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Loxodonta africana]
          Length = 4785

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 91/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 905  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 962

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+  P  GL   W   
Sbjct: 963  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRCEWQNN 1020

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1021 YTQCAPCASL----STCPVCYRHYREEDL--ILQCRQCDRWMHAICQNLNTEEEVENVAD 1074

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1075 --IGFDCSMCR 1083



 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 307 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 363

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 364 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 416

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C VC+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 417 FCGKCYHPEFQEDMLHCSVCKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 466


>gi|301771069|ref|XP_002920938.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4-like [Ailuropoda melanoleuca]
          Length = 2611

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1111 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1169

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1170 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1228

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGN+CP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1229 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1288

Query: 228  CRGECY 233
            C G  +
Sbjct: 1289 CAGATH 1294


>gi|301622725|ref|XP_002940678.1| PREDICTED: hypothetical protein LOC100144721 [Xenopus (Silurana)
            tropicalis]
          Length = 2771

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 101/207 (48%), Gaps = 18/207 (8%)

Query: 32   NTNVMCRLCFVGENEGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSC 89
            N  +MC LC         R R ++L C+ C + +HR CL+     R L +   +W C  C
Sbjct: 1229 NPTLMCLLC-------ASRGRHKLLYCQVCCEPFHRFCLEE--SERPLPNQEGTWCCRRC 1279

Query: 90   RICEICRRTGDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGP-YLCPKHTKCHSCGSN 145
            + C +C + G   K +  C  C   YH  C  P  P K   SG  + C    +C SCG  
Sbjct: 1280 KFCNVCGQKGKAKKPLLECELCQTNYHVNCLGPNYPLKAPRSGKGWTCSACIRCRSCGI- 1338

Query: 146  VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQC 204
             PG    +        C  C  L+ KGN+CP+C++ Y +SE  + M+ C  C +W+H +C
Sbjct: 1339 APGKDGDLELTEDSKLCSECSTLYDKGNFCPICIRCYEESEYESKMIQCAKCDKWIHSKC 1398

Query: 205  DGISDEKY-LQFQVDGNLQYRCPTCRG 230
            +G+SDE Y L   +  ++ Y CP C G
Sbjct: 1399 EGLSDEGYELLSNLPDSVVYTCPPCLG 1425


>gi|359318839|ref|XP_003432729.2| PREDICTED: histone-lysine N-methyltransferase MLL4, partial [Canis
            lupus familiaris]
          Length = 2713

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1212 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1270

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 1271 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1329

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGN+CP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 1330 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1389

Query: 228  CRGECY 233
            C G  +
Sbjct: 1390 CAGATH 1395


>gi|149056302|gb|EDM07733.1| rCG63528 [Rattus norvegicus]
          Length = 2270

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
           ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 771 LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 829

Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C  L+
Sbjct: 830 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 888

Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
            KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 889 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 948

Query: 228 CRG 230
           C G
Sbjct: 949 CAG 951


>gi|195431535|ref|XP_002063792.1| GK15714 [Drosophila willistoni]
 gi|194159877|gb|EDW74778.1| GK15714 [Drosophila willistoni]
          Length = 1503

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/283 (29%), Positives = 122/283 (43%), Gaps = 28/283 (9%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 108
           +++ C SCG  +H  C  L N    R     S W C  C  C+ICR+  D N  KF+ C 
Sbjct: 216 KLIMCSSCGDHFHSTCIGLANLPDTR-----SGWCCARCTKCQICRQQ-DSNDIKFVKCE 269

Query: 109 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
           +C   YH  C  P   ++    + C +   C  CGS  PG G S RW   YT CD+C + 
Sbjct: 270 QCQKIYHASCLRPVISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329

Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 226
             KG  CP+C K YR +    MV C  C ++VH  CD  +D      + + N    Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLMAYHKKKEQNPDYDYVCP 389

Query: 227 TCRGECYQV--RDLEDAVRELWRRKDMADKDLIA---SLRAAAGLPTEDEIFSISPYSDD 281
            C+    +     L + +         +D  L+     +    G PT +++ S     ++
Sbjct: 390 NCKTNSTRPPQSQLTETIDMALSESQTSDTQLMVKDIEMDPLEGRPTTNDVIS----EEN 445

Query: 282 EENGP-----VVLKNEFGRSLKLSLK--GVVDKSPKKVKEHGK 317
            +  P     V   N  G+  K  L+  GV+ +  KK    GK
Sbjct: 446 HKQPPGAKKKVCFSNLRGKGTKFMLQRMGVMSQISKKRSTRGK 488



 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 88/191 (46%), Gaps = 12/191 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    M++C  CG+ YH  C  +   ++ +     W+C  C +CE C + 
Sbjct: 532 ICVMCGSLGIESDAVMITCAQCGQCYHPYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 589

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D  + + C  CD +YH YC +PP + V  G + C   T C  CG N      +V   + 
Sbjct: 590 NDEARLLLCDECDISYHIYCVNPPLETVPQGTWKCSFCTMCQKCGRNPTEKSDNVDSNMS 649

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
              C  C         C VC   Y + E   ++ C+ C++W H  CD ++ +  + +  D
Sbjct: 650 E--CPPCA----SQTACSVCTNPYANGEM--IIQCEKCEQWSHFLCDSVNAQLTIDY-YD 700

Query: 219 GNLQYRCPTCR 229
            N+ Y+C  CR
Sbjct: 701 KNI-YKCLKCR 710


>gi|410899461|ref|XP_003963215.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Takifugu
           rubripes]
          Length = 3715

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 84/177 (47%), Gaps = 17/177 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F   +EG     ++L+C  C + YH  C+ +      L     W+C  C +CE
Sbjct: 418 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 470

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
           +C +  DP++ + C  CD +YH YC  PP  NV  G + C     C  CGSN P  G   
Sbjct: 471 MCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 528

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            W   YT C  C  L      CPVC + + + E   ++ C  C RWVH  C+ +  E
Sbjct: 529 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTE 579



 Score = 93.2 bits (230), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 61/115 (53%), Gaps = 9/115 (7%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C  CG+ YH  CL+  A        + W+CP C++C+ CR+ G+ +K + C  CD  
Sbjct: 98  LLFCTGCGQHYHAACLEIGATP---IQRAGWQCPECKVCQTCRKPGEDSKMLVCDACDKG 154

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS---NVPGNGLSVRWFLGYTCCDAC 165
           YH +C  P   ++ + P+ C +   C  CG+    +PG   S +WF  Y  C+AC
Sbjct: 155 YHTFCLQPAMDSLPTDPWKCKRCRVCTDCGARGLELPG---STQWFENYAVCEAC 206



 Score = 40.4 bits (93), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 43/184 (23%), Positives = 63/184 (34%), Gaps = 35/184 (19%)

Query: 38  RLCFVGENEGCERARRMLSCKS--CGKKYHRNCLKNWAQNRDLFHWSSWKCP-------- 87
           RLC     E C+R    + C +  C + YH  C       + +       CP        
Sbjct: 26  RLC-----EHCKRLGATIRCHAEGCSRFYHFPCSAASGSFQSMKQLL-LLCPEHIDKAKE 79

Query: 88  ----SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
               +C  C +C   G+ +  +FC  C   YH  C       +    + CP+   C +C 
Sbjct: 80  LGEEAC--CAVCDSAGELSDLLFCTGCGQHYHAACLEIGATPIQRAGWQCPECKVCQTC- 136

Query: 144 SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 203
              PG    +        CDAC +      Y   CL+   DS  T    C  C+    C 
Sbjct: 137 -RKPGEDSKM------LVCDACDK-----GYHTFCLQPAMDSLPTDPWKCKRCRVCTDCG 184

Query: 204 CDGI 207
             G+
Sbjct: 185 ARGL 188


>gi|344298323|ref|XP_003420843.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL4-like [Loxodonta africana]
          Length = 2200

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 94/186 (50%), Gaps = 8/186 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL   A+     H  +W C  C+ C +C R G   K +  C RC  
Sbjct: 827  LVFCQVCCDPFHPFCLDE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGTKHLLECERCRH 885

Query: 113  AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 886  AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 944

Query: 170  VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
             KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 945  EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1004

Query: 228  CRGECY 233
            C G  +
Sbjct: 1005 CAGATH 1010


>gi|20521928|dbj|BAA96030.2| KIAA1506 protein [Homo sapiens]
          Length = 3310

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 404 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 456

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 457 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 514

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 515 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 568

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 569 ENVAD--IGFDCSMCR 582


>gi|156230798|gb|AAI51838.1| MLL3 protein [Homo sapiens]
          Length = 3314

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 404 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 456

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 457 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 514

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 515 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 568

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 569 ENVAD--IGFDCSMCR 582


>gi|432892259|ref|XP_004075732.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oryzias
            latipes]
          Length = 4536

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 101/210 (48%), Gaps = 23/210 (10%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
            V+C LC    N         + C+ C + +H  CL      R L   + +W C  CR C+
Sbjct: 1675 VLCFLCASSGNV------EFVYCRVCCEPFHLFCLGE--SERPLQEQFENWCCRLCRFCQ 1726

Query: 94   ICRRTGDPNK--FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSN 145
             C R     K   + C +C  +YH  C  P   N  + P      ++C K  +C SCG+ 
Sbjct: 1727 ACGRQHQKAKQQLVECDKCRNSYHPECLGP---NYPTKPTKKKRIWICTKCVRCKSCGAT 1783

Query: 146  VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQC 204
             PG     +W   ++ C  C +LF KGN+CP+C K Y D    + M+ C  C++WVH +C
Sbjct: 1784 KPGKSWDAQWSHDFSMCHDCAKLFAKGNFCPLCDKSYSDDYYDSKMMECARCKQWVHAKC 1843

Query: 205  DGISDEKY-LQFQVDGNLQYRCPTCRGECY 233
            + I+DE + L  ++  N+ Y C  C  EC+
Sbjct: 1844 ENITDEMFELLSKLPENIAYTCMKC-AECH 1872


>gi|26006129|dbj|BAC41407.1| mKIAA0304 protein [Mus musculus]
          Length = 1744

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
           ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 270 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 328

Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C  L+
Sbjct: 329 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 387

Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
            KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 388 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 447

Query: 228 CRG 230
           C G
Sbjct: 448 CAG 450


>gi|327274410|ref|XP_003221970.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Anolis carolinensis]
          Length = 4817

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)

Query: 36   MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
            MC +C  F    EG     R+LSC  CG+ YH  C+ +    + + H   W+C  C +CE
Sbjct: 911  MCVVCGSFGKGAEG-----RLLSCSQCGQCYHPYCV-SIKITKVVLH-KGWRCLECTVCE 963

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
             C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+  P  GL  
Sbjct: 964  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1021

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
             W   YT C  C  L    + CP+C   YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 1022 EWQNNYTQCAPCASL----STCPICCCNYREEDL--ILQCRQCDRWMHTVCQNLNTEEEV 1075

Query: 214  QFQVDGNLQYRCPTCR 229
            +   D    + C  CR
Sbjct: 1076 ESTADNG--FDCTMCR 1089



 Score = 95.5 bits (236), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 99/226 (43%), Gaps = 21/226 (9%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C +CG+ YH  CL            + W+CP C++C+ C+ +G+ NK + C  CD  Y
Sbjct: 332 LFCTTCGQHYHGMCLDIQV---TALKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 388

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P   +V +  + C     C  CG+       S +W      CD+C     K   
Sbjct: 389 HTFCLQPVMDSVPTNGWKCKYCRVCAECGTRT-----SSQWHHNCLMCDSCYNQQEKLP- 442

Query: 175 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL-QYRCPTCRG--- 230
           CP+C K          + C +C+RW+H +CD   +      ++D +L +Y C  C+    
Sbjct: 443 CPLCEKTSSPDGQKDRLYCHLCRRWIHIECDRSPNN-----ELDSHLKEYVCSLCKHSIV 497

Query: 231 --ECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFS 274
             EC    D  +   +L    D    D +    +A G+  ED++ +
Sbjct: 498 EEECALSCDAMETA-QLLPEPDTGCADEMEIEDSAEGVTNEDQVVT 542


>gi|449510125|ref|XP_004176585.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL2-like, partial [Taeniopygia guttata]
          Length = 4299

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 102/223 (45%), Gaps = 17/223 (7%)

Query: 31  SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 730 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 787

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            C +CE+C +  DP++ + C  CD +YH YC  PP + V  G + C     C  CG+  P
Sbjct: 788 ECIVCEVCGKASDPSRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCVQCGAASP 847

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
             G    W   YT C  C  L V    CP C + Y + +   ++ C  C RW+H  CD +
Sbjct: 848 --GFHCEWQNNYTHCAPCASLVV----CPFCREKYVEDDL--LIQCRHCDRWLHAACDSL 899

Query: 208 SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKD 250
             E+ ++   D    + C  C  + Y V+ +     E+ + KD
Sbjct: 900 FTEEEVEQAADEG--FDCSAC--QPYVVKPVPAPSAEMIKAKD 938



 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 53/129 (41%), Gaps = 9/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C     +G    R +  C SCG+ +H  CL      R     S W+CP C++C+  +
Sbjct: 126 CSVC-----DGPGELRDLAFCTSCGQHFHGACLDISLTPR---KRSGWQCPQCKVCQNLQ 177

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
              D +  + C  CD  YH  C  P  + + +  + C     C  CG    G   S +W 
Sbjct: 178 PGQD-SAMLVCETCDKGYHTSCTEPAAQGLPTTSWKCKNCWVCSDCGQRPAGPVSSCQWS 236

Query: 157 LGYTCCDAC 165
            G   C  C
Sbjct: 237 PGSEVCGDC 245


>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
          Length = 4823

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 84/177 (47%), Gaps = 17/177 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F   +EG     ++L+C  C + YH  C+ +      L     W+C  C +CE
Sbjct: 688 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 740

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
           +C +  DP++ + C  CD +YH YC  PP  NV  G + C     C  CGSN P  G   
Sbjct: 741 MCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 798

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            W   YT C  C  L      CPVC + + + E   ++ C  C RWVH  C+ +  E
Sbjct: 799 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTE 849



 Score =  102 bits (254), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 75/154 (48%), Gaps = 13/154 (8%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C  CG+ YH  CL+  A        + W+CP C++C+ CR+ G+ +K + C  CD  
Sbjct: 227 LLFCTGCGQHYHAACLEIGATP---IQRAGWQCPECKVCQTCRKPGEDSKMLVCDACDKG 283

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS---NVPGNGLSVRWFLGYTCCDACGRLFV 170
           YH +C  P   ++ + P+ C +   C  CG+    +PG   S +WF  Y  C+AC     
Sbjct: 284 YHTFCLQPAMDSLPTDPWKCKRCRVCTDCGARGLELPG---STQWFENYAVCEACQHH-- 338

Query: 171 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
           +   C VC K   D     +  C VC R VH  C
Sbjct: 339 RNCTCSVCNK--PDGSVATLQSCSVCHRLVHSGC 370


>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Equus caballus]
          Length = 4910

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 917  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 974

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 975  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1032

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  +S E+ ++   D
Sbjct: 1033 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLSTEEEVENVAD 1086

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1087 --IGFDCSMCR 1095



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 77/153 (50%), Gaps = 10/153 (6%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 319 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 375

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 376 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 428

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
            C K Y       M+ C++C+RWVH +CD  +D
Sbjct: 429 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTD 461


>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Ailuropoda melanoleuca]
          Length = 4927

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 68/223 (30%), Positives = 102/223 (45%), Gaps = 24/223 (10%)

Query: 17   ATASKDIAGEEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHR 66
            A++SKD   EE    NT V+            +C V  + G     R+L+C  CG+ YH 
Sbjct: 941  ASSSKD--EEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGAEGRLLACSQCGQCYHP 998

Query: 67   NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
             C+    +   +     W+C  C +CE C +  DP + + C  CD +YH YC  PP + V
Sbjct: 999  YCVS--IKITKVVLSKGWRCLECTVCEACGKASDPGRLLLCDDCDISYHTYCLAPPLQTV 1056

Query: 127  SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
              G + C     C  CG+     GL   W   YT C  C  L    + CPVC + YR+ +
Sbjct: 1057 PKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREED 1110

Query: 187  STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
               ++ C  C RW+H  C  ++ E+ ++   D  + + C  CR
Sbjct: 1111 L--ILQCRQCDRWMHAVCQNLNTEEEVENIAD--IGFDCSMCR 1149



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 89/184 (48%), Gaps = 15/184 (8%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 374 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 430

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 431 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 483

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQV 235
            C K Y       M+ C +C+RWVH +CD  +D +   QF+ +    Y C  C+    ++
Sbjct: 484 FCGKCYHPELQKDMLHCSMCKRWVHLECDKPADHELDSQFKEE----YICMYCKHIAVEM 539

Query: 236 RDLE 239
             L+
Sbjct: 540 DPLQ 543


>gi|133902336|gb|ABO41859.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 4137

 Score =  113 bits (283), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 93/202 (46%), Gaps = 13/202 (6%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
            V+C LC    N         + C+ C + +H  CL    +  D   W +W C  CR C +
Sbjct: 1559 VVCFLCASSGNV------EFVFCQVCCEPFHLFCLGEAERPHDE-QWENWCCRRCRFCHV 1611

Query: 95   C-RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
            C R+     + + C +C  +YH  C    HP         ++C K  +C SCG+  PG  
Sbjct: 1612 CGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKA 1671

Query: 151  LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
               +W   ++ C  C +   KGN CP+C K Y D +  + M+ C  C RWVH +C+ ++D
Sbjct: 1672 WDAQWSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTD 1731

Query: 210  EK-YLQFQVDGNLQYRCPTCRG 230
            +   L   +  N+ Y C  C G
Sbjct: 1732 DMCELMSSLPENVVYTCTNCTG 1753


>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
          Length = 4780

 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 68/223 (30%), Positives = 102/223 (45%), Gaps = 24/223 (10%)

Query: 17   ATASKDIAGEEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHR 66
            A++SKD   EE    NT V+            +C V  + G     R+L+C  CG+ YH 
Sbjct: 797  ASSSKD--EEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGAEGRLLACSQCGQCYHP 854

Query: 67   NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
             C+    +   +     W+C  C +CE C +  DP + + C  CD +YH YC  PP + V
Sbjct: 855  YCVS--IKITKVVLSKGWRCLECTVCEACGKASDPGRLLLCDDCDISYHTYCLAPPLQTV 912

Query: 127  SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
              G + C     C  CG+     GL   W   YT C  C  L    + CPVC + YR+ +
Sbjct: 913  PKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREED 966

Query: 187  STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
               ++ C  C RW+H  C  ++ E+ ++   D  + + C  CR
Sbjct: 967  L--ILQCRQCDRWMHAVCQNLNTEEEVENIAD--IGFDCSMCR 1005



 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 89/184 (48%), Gaps = 15/184 (8%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 230 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 286

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 287 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 339

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQV 235
            C K Y       M+ C +C+RWVH +CD  +D +   QF+ +    Y C  C+    ++
Sbjct: 340 FCGKCYHPELQKDMLHCSMCKRWVHLECDKPADHELDSQFKEE----YICMYCKHIAVEM 395

Query: 236 RDLE 239
             L+
Sbjct: 396 DPLQ 399


>gi|195028344|ref|XP_001987036.1| GH21693 [Drosophila grimshawi]
 gi|193903036|gb|EDW01903.1| GH21693 [Drosophila grimshawi]
          Length = 1461

 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 72/233 (30%), Positives = 110/233 (47%), Gaps = 18/233 (7%)

Query: 31  SNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCP 87
           S+ NV C  C  +G+        +++ C SCG  +H  C  L N    R     S W C 
Sbjct: 210 SDNNVECLTCSSLGD------LSKLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCA 258

Query: 88  SCRICEICRRT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV 146
            C  C+ICR+   +  KF+ C +C   YH  C  P   ++    + C +   C  CGS  
Sbjct: 259 RCTKCQICRQQEANDIKFVKCEQCQKIYHANCLRPVISSIPKYGWKCNRCRVCTDCGSRT 318

Query: 147 PGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDG 206
           PG G S RW   YT CD+C +   KG  CP+C K YR +    MV C  C ++VH  CD 
Sbjct: 319 PGGGSSSRWHSHYTICDSCYQQRNKGFSCPICQKAYRAAAYKEMVKCSWCHKFVHSTCDE 378

Query: 207 ISD--EKYLQFQVDGNLQYRCPTCR-GECYQVRDLEDAVRELWRRKDMADKDL 256
            +D    + + + + +  Y CP C+ G   ++  LE ++ +    +  + +D+
Sbjct: 379 EADLTAYHKRKEYNPDYDYVCPVCKVGSNIKIEPLERSIADPQASEHFSIRDV 431



 Score = 92.4 bits (228), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    +++C  C + YH  C  +   +R +     W+C  C +CE C + 
Sbjct: 535 ICVMCGSVGVEGDAELITCAQCAQCYHPYC-ASVKHSRGILQ-KGWRCLDCTVCEGCGKK 592

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D  + + C  CD +YH YC +PP + V  G + C   T C  CG N P   ++      
Sbjct: 593 NDEARLLLCDECDISYHIYCVNPPLEQVPRGNWKCSFCTICQKCGRN-PTEKIN-HSDSN 650

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
              C  C       N C +C K Y D E   ++ C+ C++W+H  CD I+ +  + +  D
Sbjct: 651 SPECPPCA----SQNSCSICSKGYSDGEM--IIQCEQCEQWLHFLCDSINSQHTMDY-YD 703

Query: 219 GNLQYRCPTCR 229
            N+ Y+C  CR
Sbjct: 704 HNM-YKCIKCR 713


>gi|41350061|gb|AAS00364.1| unknown [Homo sapiens]
          Length = 2185

 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 20  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 73  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 185 ENVAD--IGFDCSMCR 198


>gi|292621658|ref|XP_002664717.1| PREDICTED: hypothetical protein LOC566825 [Danio rerio]
          Length = 3750

 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 23/206 (11%)

Query: 36   MCRLCFV-GENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCR 90
            +C LC   G++E       ML C+ C + +HR CL    +   +N++     +W C  C+
Sbjct: 1633 VCLLCASKGQHE-------MLFCQVCCEPFHRFCLDPSERPLEENKE-----NWCCRRCK 1680

Query: 91   ICEIC-RRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +C R+  +    + C RC   YH  C  P  P  N    P++C    +C SCG   P
Sbjct: 1681 FCRVCGRKNKESKPLLECERCQNCYHPACLGPNYPKPNKRKKPWVCMTCIRCRSCGV-TP 1739

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 206
            G      W      C  C RLF +GNYC +C K Y D++  + M+ C  C  WVH +C+G
Sbjct: 1740 GKSWDTEWNHDKGLCPDCTRLFDQGNYCTMCFKCYEDNDYDSQMMQCSTCNHWVHAKCEG 1799

Query: 207  ISDEKY-LQFQVDGNLQYRCPTCRGE 231
            ++D+ Y +   +  ++ Y C  C  E
Sbjct: 1800 LTDDLYEILSSLPESVVYSCQPCLKE 1825


>gi|443723098|gb|ELU11679.1| hypothetical protein CAPTEDRAFT_130729, partial [Capitella teleta]
          Length = 625

 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 66/208 (31%), Positives = 104/208 (50%), Gaps = 27/208 (12%)

Query: 41  FVGENEGCERA----------RRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKC 86
            V  N  C RA           +++ C  C + +H  CL    +  A N +     +W C
Sbjct: 4   LVSSNSLCVRAVCYLCGSGGHNQLIYCSVCCEPFHSFCLDKGERPLADNLE-----NWCC 58

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYH----CYCQHPPHKNVSSGPYLCPKHTKCHSC 142
             C+ C +C +    N  + C +C  AYH    C  ++P   + +   ++CPK  KC SC
Sbjct: 59  RKCQFCRVCGKGS--NNLLQCIQCQDAYHPSQSCLQKYPNKPSKNRRIWVCPKCVKCKSC 116

Query: 143 GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVH 201
           G+  PG+     W   ++ C++CG L  KGN+CPVC K Y D +  + M+ C  C+ WVH
Sbjct: 117 GATSPGDSSDATWMYDFSLCNSCGLLMSKGNFCPVCHKCYADDDWDSKMMQCSTCESWVH 176

Query: 202 CQCDGISDEKY-LQFQVDGNLQYRCPTC 228
            +C+G++DE Y +   +  ++QY CP C
Sbjct: 177 AKCEGLTDEMYSIMSYLPEDVQYHCPRC 204


>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
          Length = 4499

 Score =  112 bits (281), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 471 MCVVCGSFGQGEEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 523

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 524 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 581

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 582 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 635

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 636 ENVAD--IGFDCSMCR 649


>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
           boliviensis boliviensis]
          Length = 4029

 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 20  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 73  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 185 ENVAD--IGFDCSMCR 198


>gi|195382495|ref|XP_002049965.1| GJ21880 [Drosophila virilis]
 gi|194144762|gb|EDW61158.1| GJ21880 [Drosophila virilis]
          Length = 1458

 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 97/204 (47%), Gaps = 17/204 (8%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRR--TGDPNKFMFCR 108
           +++ C SCG  +H  C  L N    R     S W C  C  C+ICR+  T D  KF+ C 
Sbjct: 220 KLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCARCTKCQICRQHETNDI-KFIKCE 273

Query: 109 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
           +C   YH  C  P   ++    + C +   C  CGS  PG G S RW   YT CD+C + 
Sbjct: 274 QCQKMYHAMCLRPTISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 333

Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD--EKYLQFQVDGNLQYRCP 226
             KG  CP+C K YR +    MV C  C ++VH  CD  +D    + + + + +  Y CP
Sbjct: 334 RNKGFSCPICQKAYRAAAYKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEYNPDYDYVCP 393

Query: 227 TCR-----GECYQVRDLEDAVREL 245
            C+      +   V  LE A+ ++
Sbjct: 394 ICKTSSTAAQLKAVDPLERAITDM 417



 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 59/195 (30%), Positives = 91/195 (46%), Gaps = 20/195 (10%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +    G E    +++C  CG+ YH  C  +   +R +     W+C  C +CE C + 
Sbjct: 532 ICVMCGTLGIESDSVLITCAQCGQCYHPYC-ASVKHSRGILQ-KGWRCLDCTVCEGCGKK 589

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN----VPGNGLSVR 154
            D  + + C  CD +YH YC  PP + V  G + C   T C  CG N    V  +  S+ 
Sbjct: 590 NDEARLLLCDECDISYHIYCVKPPLETVPHGNWKCSFCTICQKCGRNPTEKVKNSDASLS 649

Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQ 214
                  C  C       N CP+C K Y D E   ++ C+ C++W H  CD I+ +  ++
Sbjct: 650 E------CLPCA----SQNSCPLCRKAYSDGEM--IIQCEQCEQWSHFLCDSINAQYTME 697

Query: 215 FQVDGNLQYRCPTCR 229
           +  D N+ Y+C  CR
Sbjct: 698 Y-YDNNV-YKCMKCR 710


>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
          Length = 4884

 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 959  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1129 --IGFDCSMCR 1137



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518


>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
          Length = 4575

 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 565 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 622

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 623 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 680

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
           YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 681 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 734

Query: 219 GNLQYRCPTCR 229
             + + C  CR
Sbjct: 735 --IGFDCSMCR 743



 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 65/136 (47%), Gaps = 14/136 (10%)

Query: 96  RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
           R++G+ +K + C  CD  YH +C  P  K+V +  + C     C  CG+       S +W
Sbjct: 1   RQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQW 55

Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 215
                 CD C +   + N CP C K Y       M+ C++C+RWVH +CD  +D      
Sbjct: 56  HHNCLICDNCYQQ--QDNLCPFCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH----- 108

Query: 216 QVDGNL--QYRCPTCR 229
           ++D  L  +Y C  C+
Sbjct: 109 ELDPQLKEEYICMYCK 124


>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
          Length = 4025

 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 20  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 73  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 185 ENVAD--IGFDCSMCR 198


>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
          Length = 4911

 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 959  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1129 --IGFDCSMCR 1137



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518


>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
          Length = 4911

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 959  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1129 --IGFDCSMCR 1137



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518


>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
            garnettii]
          Length = 4945

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 996  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1053

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1054 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1111

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1112 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1165

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1166 --IGFDCSMCR 1174



 Score =  108 bits (271), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 56/175 (32%), Positives = 87/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 397 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 453

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       SV+W      CD+C +   + N CP
Sbjct: 454 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SVQWHHNCLICDSCYQQ--EDNLCP 506

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C+VC+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 507 FCGKCYHPELQKDMLHCNVCKRWVHLECDKPTDH-----ELDSQLKEEYICMFCK 556


>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
 gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
            Full=Homologous to ALR protein; AltName: Full=Lysine
            N-methyltransferase 2C; Short=KMT2C; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
          Length = 4911

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 959  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1129 --IGFDCSMCR 1137



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518


>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
            jacchus]
          Length = 4909

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 953  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1010

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1011 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1068

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1069 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1122

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1123 --IGFDCSMCR 1131



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 468

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  CR
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCR 518


>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan
           troglodytes]
          Length = 4026

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 20  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 73  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 185 ENVAD--IGFDCSMCR 198


>gi|301606679|ref|XP_002932944.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 1 [Xenopus
            (Silurana) tropicalis]
          Length = 3855

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 62/214 (28%), Positives = 99/214 (46%), Gaps = 31/214 (14%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +CF+  + G       + C+ C + +HR CL+   +  +     +W C  C+ C +C R 
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433

Query: 99   GDPNK----------------FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKH 136
                K                 + C +C  +YH  C  P   N  + P      ++C K 
Sbjct: 1434 QQATKESIGRQNTISDMSLKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKC 1490

Query: 137  TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDV 195
             +C SCGS  PG G   +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  
Sbjct: 1491 VRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGK 1550

Query: 196  CQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 228
            C RWVH +C+ ++DE Y +   +  ++ Y C  C
Sbjct: 1551 CDRWVHSKCENLTDEMYEILSNLPESVAYTCINC 1584


>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
          Length = 4884

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36   MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
            MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 957  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1009

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
             C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 1010 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 1067

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
             W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 1068 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 1121

Query: 214  QFQVDGNLQYRCPTCR 229
            +   D  + + C  CR
Sbjct: 1122 ENVAD--IGFDCSMCR 1135



 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/184 (30%), Positives = 90/184 (48%), Gaps = 15/184 (8%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 468

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD-EKYLQFQVDGNLQYRCPTCRGECYQV 235
            C K Y       M+ C++C+RWVH +CD  +D E   QF+ +    Y C  C+    ++
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDHELESQFKEE----YICMYCKHLAAEM 524

Query: 236 RDLE 239
             L+
Sbjct: 525 DPLQ 528


>gi|328773887|gb|EGF83924.1| hypothetical protein BATDEDRAFT_84646 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 828

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 96/207 (46%), Gaps = 34/207 (16%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLF---HWSSWKCPSCRICEICRRTGDPNKFMFCRRC 110
           +L+C  CG K+H  C++   +++ L        W+C +C++C +C   GD +K +FC  C
Sbjct: 585 LLNCTQCGTKHHPRCIE--FEDKVLITKVMTFDWRCSNCKLCTVCNNAGDDDKLLFCDTC 642

Query: 111 DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN--------------VPGNGLSVRWF 156
           D  YH YC +PP + +  G +LC +   C SC                 V    LS++  
Sbjct: 643 DRGYHMYCLNPPLEVLPEGSWLCSECAVCKSCKKRPEKQEGTEDMWRHVVIPPSLSLQEI 702

Query: 157 ----------LG-YTC--CDACGRLFVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHC 202
                     LG Y C  C  C   F    +CP+C+ VY  DS+   MVCCD C RWVH 
Sbjct: 703 QIKPPPATSALGTYLCTYCTDCYDHFEADRFCPLCIHVYSEDSDDLAMVCCDECDRWVHV 762

Query: 203 QCDG-ISDEKYLQFQVDGNLQYRCPTC 228
            CD  ++D+ Y +        + C  C
Sbjct: 763 GCDPELTDDVYQKLVEQEEPAFTCALC 789


>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
            mulatta]
          Length = 4785

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 875  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 932

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 933  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 990

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 991  YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1044

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1045 --IGFDCSMCR 1053



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 275 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 331

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 332 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 384

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 385 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 434


>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
          Length = 4032

 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 10  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 62

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 63  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATC--AGLRC 120

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 121 EWQNNYTQCAPCASL----SACPVCFRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEV 174

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 175 ESVAD--IGFDCSMCR 188


>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
            leucogenys]
          Length = 4856

 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 891  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 948

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 949  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1006

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1007 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1060

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1061 --IGFDCSMCR 1069



 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 348 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 400

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 401 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 450


>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
          Length = 4912

 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 959  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1129 --IGFDCSMCR 1137



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 469 FCGKYYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518


>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
          Length = 4916

 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 906  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 963

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 964  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1021

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1022 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1075

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1076 --IGFDCSMCR 1084



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 306 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 362

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 363 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 415

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 416 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 465


>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
            norvegicus]
          Length = 4930

 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36   MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
            MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 953  MCVVCGSFGQGEEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1005

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
             C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 1006 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 1063

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
             W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 1064 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 1117

Query: 214  QFQVDGNLQYRCPTCR 229
            +   D  + + C  CR
Sbjct: 1118 ENVAD--IGFDCSMCR 1131



 Score =  109 bits (272), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ CR++G+ +K + C  CD  YH 
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHT 414

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCNQQ--QDNLCP 467

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D+     ++D  L  +Y C  C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEEYICMYCK 517


>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
            norvegicus]
          Length = 4931

 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36   MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
            MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 954  MCVVCGSFGQGEEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1006

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
             C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 1007 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 1064

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
             W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 1065 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 1118

Query: 214  QFQVDGNLQYRCPTCR 229
            +   D  + + C  CR
Sbjct: 1119 ENVAD--IGFDCSMCR 1132



 Score =  109 bits (272), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ CR++G+ +K + C  CD  YH 
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHT 414

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCNQQ--QDNLCP 467

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D+     ++D  L  +Y C  C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEEYICMYCK 517


>gi|195122760|ref|XP_002005879.1| GI18846 [Drosophila mojavensis]
 gi|193910947|gb|EDW09814.1| GI18846 [Drosophila mojavensis]
          Length = 1465

 Score =  112 bits (280), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 87/293 (29%), Positives = 123/293 (41%), Gaps = 27/293 (9%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
           +++ C SCG  +H  C  L N    R     S W C  C  C+ICR+   +  KF+ C +
Sbjct: 223 KLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCARCTKCQICRQQEANDIKFIKCEQ 277

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           C   YH  C  P   ++    + C +   C  CGS  PG G S RW   YT CD+C +  
Sbjct: 278 CQKIYHATCLRPVISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 337

Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD--EKYLQFQVDGNLQYRCPT 227
            KG  CP+C K YR +    MV C  C ++VH  CD  +D    +   + + +  Y CP 
Sbjct: 338 NKGFSCPICQKAYRAAAYKEMVKCSCCHKFVHSTCDEEADLTAYHKTKEYNPDYDYVCPI 397

Query: 228 CRGECYQVRDLEDAVRELWRRKDMADKDLI----ASLRAAAGLPTEDEIFSISPYSDDEE 283
           C+      + ++   R L    DM   +        +    G P  D + S  P      
Sbjct: 398 CKSNSAGAKIVDPLERSL---ADMQSSEQFNIKHVEIEPLNGKPCMD-VNSEDPLKLPNT 453

Query: 284 NGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNS 336
              V L N  G+  K  L  +       + + GKK  N    R KG Q+ L S
Sbjct: 454 KKKVCLTNIRGKGGKFVLHRM-----GAISQLGKKRSN----RGKGRQLVLQS 497



 Score = 94.0 bits (232), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 90/192 (46%), Gaps = 12/192 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +    G E    +++C  CG+ YH  C  +   +R +     W+C  C +CE C + 
Sbjct: 533 ICVMCGTLGIESDAVLITCAQCGQCYHPYC-ASVKHSRGMLQ-KGWRCLDCTVCEGCGKK 590

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D  + + C  CD +YH YC +PP + V  G + C   T C  CG N P   ++      
Sbjct: 591 NDEARLLLCDECDISYHIYCVNPPLETVPHGNWKCSFCTICQKCGRN-PTEKVNFNEPSA 649

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
             C     +     N C VC K Y  +E   ++ C+ C++W H  CD ++ +  +++  D
Sbjct: 650 PECLPCASQ-----NNCFVCKKSY--TEGDMIIQCEQCEQWSHFLCDSVNTQLTMEY-YD 701

Query: 219 GNLQYRCPTCRG 230
            N+ Y+C  CR 
Sbjct: 702 NNV-YKCMKCRA 712


>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
            domestica]
          Length = 4862

 Score =  112 bits (279), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 89/191 (46%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 978  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1035

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+  PG      W   
Sbjct: 1036 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGP--RCEWQNN 1093

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L +    CPVC + YR+ +   ++ C  C RW+H  C  ++ E  ++   D
Sbjct: 1094 YTQCAPCASLSI----CPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEDEVENVAD 1147

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1148 --IGFDCTMCR 1156



 Score = 99.0 bits (245), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 18/176 (10%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+ +G+ +K + C  CD  YH 
Sbjct: 368 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHT 424

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY-C 175
           +C  P   +V +  + C     C  CG+       S +W      CD+C +   + N  C
Sbjct: 425 FCLQPIMDSVPTNGWKCKNCRICAECGTRT-----SSQWHHNCLVCDSCYQ--PQDNLSC 477

Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
           P C K Y+      M+ C +C+RW+H +CD  +D      ++D  L  +Y C  C+
Sbjct: 478 PFCGKCYQPDLQKDMLHCHMCKRWIHIECDKPADT-----ELDSQLKEEYVCMYCK 528


>gi|427798099|gb|JAA64501.1| Putative phagocytosis engulfment, partial [Rhipicephalus
           pulchellus]
          Length = 926

 Score =  112 bits (279), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)

Query: 41  FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +VG    C+    M+S      C  CG  YH  CL              W+CP C+ C+ 
Sbjct: 298 YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 356

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
           C R GD  + + C  CD A+H YC  P   NV    + C     C  CGS  PG+G S R
Sbjct: 357 CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 416

Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 213
           W + Y+ CD+C +   KG  CP+C K YR  S    M  C VC++++H +CDG    +  
Sbjct: 417 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 472

Query: 214 QFQVDGNLQYRCPTC 228
               DG+  Y CP C
Sbjct: 473 SSPKDGD--YVCPVC 485



 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 66/196 (33%), Positives = 95/196 (48%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           +C +C  F    EG     R+++C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 660 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCE 712

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  D ++ + C  CD +YH YC  PP + V  G + C     C  CG+  PGNG   
Sbjct: 713 GCGQPHDESRLLLCDECDISYHTYCLSPPLETVPQGNWKCRWCVICVKCGATEPGNG--S 770

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
           +W   YT C  C  +      CP+CL  Y+DSE   ++ C  C+RW+H  CD IS E+  
Sbjct: 771 QWQNNYTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEE-- 822

Query: 214 QFQVDGNLQYRCPTCR 229
             +      Y CP CR
Sbjct: 823 DAERCAEYGYNCPYCR 838


>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
            porcellus]
          Length = 4878

 Score =  112 bits (279), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 910  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 967

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 968  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1025

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1026 YTQCAPCASL----SSCPVCYRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1079

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1080 --IGFDCSMCR 1088



 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 315 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 371

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CDAC +   + N CP
Sbjct: 372 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHSCLVCDACYQQ--QDNLCP 424

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 425 FCGKCYHPELQKDMLHCNICKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 474


>gi|195489371|ref|XP_002092710.1| GE14338 [Drosophila yakuba]
 gi|194178811|gb|EDW92422.1| GE14338 [Drosophila yakuba]
          Length = 1481

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 122/278 (43%), Gaps = 21/278 (7%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
           +++ C +CG  +H  C  L N    R     S W C  C  C+ICR+   +  K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           C   YH  C  P    +    + C +   C  CGS  PG G S RW   YT CD+C +  
Sbjct: 272 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331

Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 227
            KG  CP+C K YR +    MV C  C ++VH  CD  +D      + + N    Y CP 
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391

Query: 228 CRGECYQVRDLEDAVREL-WRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGP 286
           C+         + A+  +     D + + L  SL+     P E +  ++ P SD+    P
Sbjct: 392 CKSNSSGPGSSQQAIDSIVLSAMDSSSEQL--SLKEIELDPLEGKP-TMDPSSDELHKLP 448

Query: 287 -----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 317
                V L +  GRS K  L   GV+ +  KK    GK
Sbjct: 449 TGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 486



 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 91/204 (44%), Gaps = 14/204 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    M++C  CG+ YH  C      +R +     W+C  C +CE C + 
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN-VPGNGLSVRWFL 157
            D  + + C  CD +YH YC +PP + V +G + C   T C  CG N    N       L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKNEFGESNML 647

Query: 158 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
               C +C       + CPVC   Y + E   ++ C+ C+ W H  CD ++ +  +    
Sbjct: 648 E---CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDTVNAQLTID-HY 697

Query: 218 DGNLQYRCPTCRGECYQVRDLEDA 241
           D N+ Y+C  CR        L D+
Sbjct: 698 DNNV-YKCFKCRCSTRSTNSLTDS 720


>gi|194885797|ref|XP_001976493.1| GG22900 [Drosophila erecta]
 gi|190659680|gb|EDV56893.1| GG22900 [Drosophila erecta]
          Length = 1481

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 89/288 (30%), Positives = 121/288 (42%), Gaps = 41/288 (14%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
           +++ C +CG  +H  C  L N    R     S W C  C  C+ICR+   +  K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           C   YH  C  P    +    + C +   C  CGS  PG G S RW   YT CD+C +  
Sbjct: 272 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331

Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 227
            KG  CP+C K YR +    MV C  C ++VH  CD  +D      + + N    Y CP 
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391

Query: 228 CR------GECYQVRD-----LEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSIS 276
           C+      G   Q  D       D++ E    K++        L    G PT D      
Sbjct: 392 CKSNSSGPGSSQQAIDSIVLSAMDSLSEQLSLKEI-------ELDPLEGKPTMD------ 438

Query: 277 PYSDDEENGP-----VVLKNEFGRSLKLSLK--GVVDKSPKKVKEHGK 317
           P SD+    P     V L +  GRS K  L   GV+ +  KK    GK
Sbjct: 439 PSSDELHKLPTGKKKVCLTSVRGRSGKFVLHRIGVMSQINKKRSTRGK 486



 Score = 93.2 bits (230), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 91/210 (43%), Gaps = 26/210 (12%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    M++C  CG+ YH  C      +R +     W+C  C +CE C + 
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 151
            D  + + C  CD +YH YC +PP + V +G + C   T C  CG N          N L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 647

Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 211
                     C +C       + CPVC   Y + E   ++ C+ C+ W H  CD ++ + 
Sbjct: 648 E---------CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDTVNAQL 692

Query: 212 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
            +    D N+ Y+C  CR        L DA
Sbjct: 693 TID-HYDNNV-YKCFKCRCSTRSTNSLTDA 720


>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3 [Canis lupus familiaris]
          Length = 4874

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 924  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 981

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 982  SDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1039

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1040 YTQCAPCASL----SSCPVCCRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1093

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1094 --IGFDCSMCR 1102



 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 87/185 (47%), Gaps = 16/185 (8%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+ TG     +FC  C   Y+ 
Sbjct: 327 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKETGKNTFVLFCFTCSLNYNP 383

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFLGYTCCDACGRLFVKGNYC 175
           +C  P  + V +  +     T+C +C   V  G   S +W      CD+C +   + N C
Sbjct: 384 FCVSPLVRIVPTNLF-----TQCRNCRICVECGTRSSSQWHHNCLVCDSCYQQ--QDNLC 436

Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQ 234
           P C K Y       M+ C++C+RWVH +CD  +D +   QF+ +    Y C  C+    +
Sbjct: 437 PFCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDHELDSQFKEE----YICMYCKHIAAE 492

Query: 235 VRDLE 239
           +  L+
Sbjct: 493 MDPLQ 497


>gi|432909101|ref|XP_004078112.1| PREDICTED: uncharacterized protein LOC101174945 [Oryzias latipes]
          Length = 3692

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 96/195 (49%), Gaps = 10/195 (5%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +CF+  ++G      ML C+ C + +HR CL+  A+     +  +W C  C+ C +C + 
Sbjct: 1730 VCFLCASKG---QHEMLHCQVCCEPFHRFCLEP-AERPSEENKENWCCRRCKFCHVCGKK 1785

Query: 99   GDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
                K +  C RC   YH  C  P  P  N     ++C +  +C SCG   PG    + W
Sbjct: 1786 NQLTKPLLECERCQNCYHASCLGPNYPKLNKKRKAWVCMRCIRCKSCGV-TPGKSWEIDW 1844

Query: 156  FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-L 213
                  C  C +L+ +GNYCP+C K Y D++  + M+ C  C  WVH +C+ ++DE Y +
Sbjct: 1845 NHDKGLCPDCSKLYEQGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDLTDELYEI 1904

Query: 214  QFQVDGNLQYRCPTC 228
               +  ++ Y C  C
Sbjct: 1905 LSSLPESVVYSCRPC 1919


>gi|427791139|gb|JAA61021.1| Putative phagocytosis engulfment, partial [Rhipicephalus
           pulchellus]
          Length = 741

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)

Query: 41  FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +VG    C+    M+S      C  CG  YH  CL              W+CP C+ C+ 
Sbjct: 178 YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 236

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
           C R GD  + + C  CD A+H YC  P   NV    + C     C  CGS  PG+G S R
Sbjct: 237 CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 296

Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 213
           W + Y+ CD+C +   KG  CP+C K YR  S    M  C VC++++H +CDG    +  
Sbjct: 297 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 352

Query: 214 QFQVDGNLQYRCPTC 228
               DG+  Y CP C
Sbjct: 353 SSPKDGD--YVCPVC 365



 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/196 (30%), Positives = 81/196 (41%), Gaps = 50/196 (25%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           +C +C  F    EG     R+++C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 539 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCE 591

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C   G PN    CR C                            C  CG+  PGNG   
Sbjct: 592 GC---GQPN--WKCRWC--------------------------VICVKCGATEPGNG--S 618

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
           +W   YT C  C  +      CP+CL  Y+DSE   ++ C  C+RW+H  CD IS E+  
Sbjct: 619 QWQNNYTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEE-- 670

Query: 214 QFQVDGNLQYRCPTCR 229
             +      Y CP CR
Sbjct: 671 DAERCAEYGYNCPYCR 686


>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
          Length = 4532

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 530 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 582

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL  
Sbjct: 583 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 640

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
            W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ +
Sbjct: 641 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 694

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 695 ENVAD--IGFDCSMCR 708



 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 37/60 (61%), Gaps = 2/60 (3%)

Query: 82  SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK--HTKC 139
           + W+CP C++C+ C+++G+ +K + C  CD  YH +C  P  K+V +  + C +  H +C
Sbjct: 13  AGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKRWVHLEC 72


>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
            [Oryctolagus cuniculus]
          Length = 4865

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 910  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKISKVVLSKGWRCLECTVCEACGKA 967

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 968  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1025

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1026 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1079

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1080 --IGFDCSMCR 1088



 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 313 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 369

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 370 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 422

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 423 FCGKWYHPELQKDMLHCNMCKRWVHLECDKPTDN-----ELDSQLKEEYICMYCK 472


>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
            harrisii]
          Length = 4951

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 89/191 (46%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 1010 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1067

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+  PG      W   
Sbjct: 1068 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGP--RCEWQNN 1125

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E  ++   D
Sbjct: 1126 YTQCAPCASL----STCPVCSRNYREEDL--ILQCRQCDRWMHAVCQNLNTEDEVENVAD 1179

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1180 --IGFDCTMCR 1188



 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 86/185 (46%), Gaps = 20/185 (10%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+ +G+ +K + C  CD  YH 
Sbjct: 403 CTTCGQHYHGMCLDIAVT---ALKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHT 459

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P   +V +  + C     C  CG+       S +W      CD C +       CP
Sbjct: 460 FCLQPVMDSVPTNGWKCKNCRICAECGTRT-----SSQWHHNCLVCDNCYQP-QDNTACP 513

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCRGECYQ 234
            C K Y+      M+ C +C+RW+H +CD  +D      ++D  L+  Y C  C+    Q
Sbjct: 514 FCGKCYQPDFQKDMLHCQMCKRWIHIECDKPADT-----ELDSQLKEDYVCMCCK----Q 564

Query: 235 VRDLE 239
           + DLE
Sbjct: 565 LGDLE 569


>gi|225380774|gb|ACN88688.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 4219

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/200 (30%), Positives = 92/200 (46%), Gaps = 13/200 (6%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
            V+C LC    N         + C+ C + +H  CL    +  D   W +W C  CR C +
Sbjct: 1626 VVCFLCASSGNV------EFVFCQVCCEPFHLFCLGEAERPHD-EQWENWCCRRCRFCHV 1678

Query: 95   C-RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
            C R+     + + C +C  +YH  C    HP         ++C K  +C SCG+  PG  
Sbjct: 1679 CGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKA 1738

Query: 151  LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
               +W   ++ C  C +   KGN CP+C K Y D +  + M+ C  C RWVH +C+ ++D
Sbjct: 1739 WDAQWSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTD 1798

Query: 210  EK-YLQFQVDGNLQYRCPTC 228
            +   L   +  N+ Y C  C
Sbjct: 1799 DMCELMSSLPENVVYTCTNC 1818


>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
          Length = 4904

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 953  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1010

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1011 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1068

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1069 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1122

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1123 --IGFDCSMCR 1131



 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D+     ++D  L+  Y C  C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517


>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
            homolog
          Length = 4903

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 952  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1009

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 1010 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1067

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1068 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1121

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1122 --IGFDCSMCR 1130



 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D+     ++D  L+  Y C  C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517


>gi|195347259|ref|XP_002040171.1| GM16061 [Drosophila sechellia]
 gi|194135520|gb|EDW57036.1| GM16061 [Drosophila sechellia]
          Length = 1476

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 85/278 (30%), Positives = 121/278 (43%), Gaps = 21/278 (7%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
           +++ C +CG  +H  C  L N    R     S W C  C  C+ICR+   +  K++ C +
Sbjct: 211 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 265

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           C   YH  C  P    +    + C +   C  CGS  PG G S RW   YT CD+C +  
Sbjct: 266 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 325

Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 227
            KG  CP+C K YR +    MV C  C ++VH  CD  +D      + + N    Y CP 
Sbjct: 326 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 385

Query: 228 CRGECYQVRDLEDAVREL-WRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGP 286
           C+         +  +  +     D + + L  SL+     P E +  S+ P SD+    P
Sbjct: 386 CKSNSSGPGSSQQTIDSIVLSAMDSSSEQL--SLKEIELDPLEGKP-SMDPSSDELHKLP 442

Query: 287 -----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 317
                V L +  GRS K  L   GV+ +  KK    GK
Sbjct: 443 AGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 480



 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 92/210 (43%), Gaps = 26/210 (12%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    M++C  CG+ YH  C      +R +     W+C  C +CE C + 
Sbjct: 524 ICVMCGSLGIESDSAMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 581

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 151
            D  + + C  CD +YH YC +PP + V +G + C   T C  CG N          N L
Sbjct: 582 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 641

Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 211
                     C +C       + CPVC   Y + E+  ++ C+ C+ W H  CD ++ + 
Sbjct: 642 E---------CPSC----TSQSSCPVCKVSYSNGET--IIQCEHCELWAHFHCDTVNAQL 686

Query: 212 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
            +    D N+ Y+C  CR        L DA
Sbjct: 687 TID-HYDNNV-YKCFKCRCSTRSTNSLADA 714


>gi|166796317|gb|AAI59185.1| mll4 protein [Xenopus (Silurana) tropicalis]
          Length = 1622

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 101/207 (48%), Gaps = 18/207 (8%)

Query: 32  NTNVMCRLCFVGENEGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSC 89
           N  +MC LC         R R ++L C+ C + +HR CL+     R L +   +W C  C
Sbjct: 32  NPTLMCLLC-------ASRGRHKLLYCQVCCEPFHRFCLEE--SERPLPNQEGTWCCRRC 82

Query: 90  RICEICRRTGDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGP-YLCPKHTKCHSCGSN 145
           + C +C + G   K +  C  C   YH  C  P  P K   SG  + C    +C SCG  
Sbjct: 83  KFCNVCGQKGKAKKPLLECELCQTNYHVNCLGPNYPLKAPRSGKGWTCSACIRCRSCGI- 141

Query: 146 VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQC 204
            PG    +        C  C  L+ KGN+CP+C++ Y +SE  + M+ C  C +W+H +C
Sbjct: 142 APGKDGDLELTEDSKLCSECSTLYDKGNFCPICIRCYEESEYESKMIQCAKCDKWIHSKC 201

Query: 205 DGISDEKY-LQFQVDGNLQYRCPTCRG 230
           +G+SDE Y L   +  ++ Y CP C G
Sbjct: 202 EGLSDEGYELLSNLPDSVVYTCPPCLG 228


>gi|4336749|gb|AAD17932.1| myeloid/lymphoid leukemia 2 [Homo sapiens]
          Length = 1010

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 96/183 (52%), Gaps = 8/183 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
           ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 553 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 611

Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           AYH  C  P +   ++     ++C    +C SCG+  PG    V+W   Y+ C  C +L+
Sbjct: 612 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVQWSGDYSLCPRCTQLY 670

Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
            KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 671 EKGNYCPICTRCYEDNDYESKMMQCAQCDYWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 730

Query: 228 CRG 230
           C G
Sbjct: 731 CAG 733


>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
            griseus]
          Length = 4871

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 935  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKISKVVLSKGWRCLECTVCEACGKA 992

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 993  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1050

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CP+C + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 1051 YTQCAPCASL----SSCPICCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1104

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1105 --IGFDCSMCR 1113



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 104/224 (46%), Gaps = 22/224 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 342 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 398

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 399 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCYQQ--QDNLCP 451

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVR 236
            C K Y       M+ C++C+RWVH +CD  +D + L  Q+  +  Y C  C+    ++ 
Sbjct: 452 FCGKCYNPEFQKDMLYCNMCKRWVHLECDKPTDHE-LDSQIKED--YICMYCKHLGAEID 508

Query: 237 DLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSD 280
            L     EL   +   D         ++G+  EDE+  + P ++
Sbjct: 509 SLHPG-NELEMPELPTD--------YSSGMEIEDEVLFLDPTAN 543


>gi|161611540|gb|AAI55711.1| mll protein [Xenopus (Silurana) tropicalis]
          Length = 2316

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/214 (28%), Positives = 99/214 (46%), Gaps = 31/214 (14%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +CF+  + G       + C+ C + +HR CL+   +  +     +W C  C+ C +C R 
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433

Query: 99   GDPNK----------------FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKH 136
                K                 + C +C  +YH  C  P   N  + P      ++C K 
Sbjct: 1434 QQATKESIGRQNTISDMSLKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKC 1490

Query: 137  TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDV 195
             +C SCGS  PG G   +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  
Sbjct: 1491 VRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGK 1550

Query: 196  CQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 228
            C RWVH +C+ ++DE Y +   +  ++ Y C  C
Sbjct: 1551 CDRWVHSKCENLTDEMYEILSNLPESVAYTCINC 1584


>gi|24762433|ref|NP_611847.2| lost PHDs of trr [Drosophila melanogaster]
 gi|21626677|gb|AAF47094.2| lost PHDs of trr [Drosophila melanogaster]
 gi|85861118|gb|ABC86508.1| HL01030p [Drosophila melanogaster]
          Length = 1482

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 89/288 (30%), Positives = 120/288 (41%), Gaps = 41/288 (14%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
           +++ C +CG  +H  C  L N    R     S W C  C  C+ICR+   +  K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           C   YH  C  P    +    + C +   C  CGS  PG G S RW   YT CD+C +  
Sbjct: 272 CQKTYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331

Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 227
            KG  CP+C K YR +    MV C  C ++VH  CD  +D      + + N    Y CP 
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391

Query: 228 CR------GECYQVRD-----LEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSIS 276
           C+      G   Q  D       D+  E    K++        L    G PT D      
Sbjct: 392 CKSNSSGPGSSQQTIDSIVLSAMDSSSEQLSLKEI-------ELDPLEGKPTMD------ 438

Query: 277 PYSDDEENGP-----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 317
           P SD+    P     V L +  GRS K  L   GV+ +  KK    GK
Sbjct: 439 PSSDELHKLPTGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 486



 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 91/210 (43%), Gaps = 26/210 (12%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    M++C  CG+ YH  C      +R +     W+C  C +CE C + 
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 151
            D  + + C  CD +YH YC +PP + V +G + C   T C  CG N          N L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 647

Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 211
                     C +C       + CPVC   Y + E   ++ C+ C+ W H  CD ++ + 
Sbjct: 648 E---------CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDSVNAQL 692

Query: 212 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
            +    D N+ Y+C  CR        L DA
Sbjct: 693 TID-HYDNNV-YKCFKCRCSTRSTNSLADA 720


>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Oreochromis
            niloticus]
          Length = 4907

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 93/201 (46%), Gaps = 15/201 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   ++  +C V  + G     ++L+C  C + YH  C+ +      L     W+C 
Sbjct: 836  SNTDKFVLLQDMCVVCGSFGKGVEGQLLACAQCAQCYHPYCVNSKITKTML--RKGWRCL 893

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CGSN P
Sbjct: 894  ECIVCEVCGKASDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSP 953

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CPVC + + + E   ++ C  C RWVH  C+ +
Sbjct: 954  --GFHCEWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESL 1005

Query: 208  SDEKYLQFQVDGNLQYRCPTC 228
              E  ++   D    + C +C
Sbjct: 1006 YTEDEVEQASDEG--FACTSC 1024



 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/154 (33%), Positives = 77/154 (50%), Gaps = 13/154 (8%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C  CG  YH  CL+  A        + W+CP C++C+ CR+ G+ +K + C  CD  
Sbjct: 230 LLFCTGCGLHYHAACLEIGATP---IQRAGWQCPECKVCQTCRQPGEDSKMLVCDACDKG 286

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN---VPGNGLSVRWFLGYTCCDACGRLFV 170
           YH +C  P   ++ S P+ C +   C  CG     +PG   S +WF  Y  C+ C     
Sbjct: 287 YHTFCLQPAMDSLPSDPWKCRRCRVCMVCGVRGLVLPG---SAQWFDNYAVCEGCQHH-- 341

Query: 171 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
           + + C VC K    S +  + CC +C RWVH +C
Sbjct: 342 RSSICCVCSKAANPSVA--LQCCSMCHRWVHSEC 373


>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
          Length = 3396

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 845  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 902

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
             DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 903  TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 960

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
            YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 961  YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1014

Query: 219  GNLQYRCPTCR 229
              + + C  CR
Sbjct: 1015 --IGFDCSMCR 1023



 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 290 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 346

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 347 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 399

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D+     ++D  L+  Y C  C+
Sbjct: 400 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 449


>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
            mutus]
          Length = 4905

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/224 (29%), Positives = 99/224 (44%), Gaps = 22/224 (9%)

Query: 16   VATASKDIAGEEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYH 65
             A AS +   EE    NT V+            +C V  + G     R+L+C  CG+ YH
Sbjct: 865  AADASSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGAEGRLLACSQCGQCYH 924

Query: 66   RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKN 125
              C+    +   +     W+C  C +CE C +  DP + + C  CD +YH YC  PP + 
Sbjct: 925  PYCVS--IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQT 982

Query: 126  VSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS 185
            V  G + C     C  CG+    +G    W   YT C  C  L    + CPVC + YR+ 
Sbjct: 983  VPKGGWKCKWCVWCRHCGAT--SSGPRCEWQNNYTQCAPCASL----SSCPVCCRNYREE 1036

Query: 186  ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            +   ++ C  C RW+H  C   S E+ ++   D  + + C  CR
Sbjct: 1037 DL--ILQCRQCDRWMHAVCQNFSTEEEVENVAD--IGFDCSLCR 1076



 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/184 (30%), Positives = 90/184 (48%), Gaps = 15/184 (8%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 305 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 361

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 362 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QENLCP 414

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD-EKYLQFQVDGNLQYRCPTCRGECYQV 235
            C K Y       M+ C++C+RWVH +CD  +D E  LQ + +    Y C  C+    ++
Sbjct: 415 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPADHEPDLQLREE----YICTYCKHLAAEM 470

Query: 236 RDLE 239
             L+
Sbjct: 471 GPLQ 474


>gi|327280514|ref|XP_003224997.1| PREDICTED: hypothetical protein LOC100556600 [Anolis carolinensis]
          Length = 2812

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 66/208 (31%), Positives = 98/208 (47%), Gaps = 20/208 (9%)

Query: 32   NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
             T ++C LC        +   +++ C+ C   +H  CL++  Q        SW C  C+ 
Sbjct: 1231 TTQLVCLLC------ASKGFHQLVFCQVCCDPFHVFCLEDDEQPLP-EQEESWCCRRCKF 1283

Query: 92   CEICRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGPY------LCPKHTKCHSCGS 144
            C +C R    +K +  C RC   YH  C  P   N  + P+      +C    +C SCG+
Sbjct: 1284 CHVCGRKNKASKQLLECERCRNCYHLACLGP---NYPTKPFRKRKNWVCSACIRCKSCGT 1340

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQ 203
              PG      W   Y+ C AC  L  KGNYCP+CL  Y D++  + M+ C  C  WVH +
Sbjct: 1341 -APGKNWDTEWSNDYSLCSACSVLHDKGNYCPICLHCYEDNDYESKMMQCAKCDHWVHAK 1399

Query: 204  CDGISDEKY-LQFQVDGNLQYRCPTCRG 230
            C+G+SDE Y +   +  ++ Y C  C G
Sbjct: 1400 CEGLSDEGYEILSNLPESVVYACRPCCG 1427


>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
          Length = 4897

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 92/201 (45%), Gaps = 15/201 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   ++  +C V  + G     ++L+C  C + YH  C+ +      L     W+C 
Sbjct: 817  SNTDKFVLLQDMCVVCGSFGKGVEGQLLACAQCAQCYHPYCVNSKITKTML--RKGWRCL 874

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C    DP++ + C  CD +YH YC  PP   V  G + C     C  CGSN P
Sbjct: 875  ECIVCEVCGEASDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSP 934

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CPVC + + + E   ++ C  C RWVH  C+ +
Sbjct: 935  --GFHCEWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESL 986

Query: 208  SDEKYLQFQVDGNLQYRCPTC 228
              E  ++   D    + C +C
Sbjct: 987  YTEDEVEQASDEG--FACTSC 1005



 Score =  103 bits (256), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 79/159 (49%), Gaps = 13/159 (8%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C  CG  YH  CL   A        + W+CP C++C+ CR+ G+ +K + C  C+  
Sbjct: 229 LLFCTGCGLHYHATCLDTGATP---ILRAGWQCPECKVCQTCRQPGEDSKMLVCDSCEKG 285

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN---VPGNGLSVRWFLGYTCCDACGRLFV 170
            H +C  P   +V S  + C     C  CG +   +PG   + +WF  YT C+ C     
Sbjct: 286 CHTFCLQPAMDSVPSDRWKCRSCRVCMECGVHGLVLPG---TAQWFESYTLCEGCQHH-- 340

Query: 171 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
           + + C VC K   D+ S  + CC +C RW+H +C  +++
Sbjct: 341 RSSICCVCSKP--DNPSVSLQCCSLCHRWMHSECSSLTE 377


>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
            latipes]
          Length = 4455

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 92/215 (42%), Gaps = 22/215 (10%)

Query: 26   EEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
            EE    NT VM            +C V  + G     R+L+C  CG+ YH  C+ N    
Sbjct: 998  EENSMHNTVVMFSTSDHFTLKQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCV-NIKIT 1056

Query: 76   RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
            R +     W+C  C +CE C    DP + + C  CD +YH YC  PP   V  G + C  
Sbjct: 1057 RVIL-TKGWRCLECTVCEACGDASDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKW 1115

Query: 136  HTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
               C  CGS  P  G+   W   Y+ C  C  L    + CP C + Y  +E   ++ C  
Sbjct: 1116 CVWCVQCGSTSP--GVHSDWQRNYSLCGPCCSL----SRCPACQQAY--AEDDLILQCQQ 1167

Query: 196  CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
            C RWVH  C G+  E  ++   D    + C  C+ 
Sbjct: 1168 CDRWVHATCQGLCTEDEVEVAADEG--FDCSLCKA 1200



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 45/101 (44%), Gaps = 7/101 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           CRLC    + G      +L C  CG  YH +CL        L     W+CP CR+C  C 
Sbjct: 464 CRLCAGSGDSGG-----LLMCSCCGSCYHGSCLDPPVTPSPLSRVG-WQCPQCRVCRSCS 517

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHT 137
             GD +  + C RCD AYH +C  PP  +     + C   T
Sbjct: 518 LQGD-SGVLLCARCDKAYHAHCLTPPLDDAPHAAWTCKAET 557


>gi|312384476|gb|EFR29199.1| hypothetical protein AND_02074 [Anopheles darlingi]
          Length = 2401

 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 68/213 (31%), Positives = 98/213 (46%), Gaps = 12/213 (5%)

Query: 34  NVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           ++ CR C    + G      ++ C  CG  YH  C+   AQ   +   + W+C SC+ C+
Sbjct: 625 DINCRQCSALGDVG-----NLIICSLCGDHYHGTCV-GLAQLPGV--RTGWQCNSCKKCQ 676

Query: 94  ICRRT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
           ICR       + + C  CD  YH  C  P   ++    + C     C  CG+  PG G S
Sbjct: 677 ICRVPDSSEGRSVACELCDKIYHASCLRPIMTSIPKFGWKCRCCRVCSDCGARTPGAGAS 736

Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
            RW   YT CD+C +   KG  CP+C + YR +    MV C VC ++VH  CD  +D   
Sbjct: 737 SRWHSHYTVCDSCYQQRNKGFSCPICHRAYRAAAYREMVKCSVCSKFVHSTCDPDADLTV 796

Query: 213 LQFQVDGN--LQYRCPTCRGECYQVRDLEDAVR 243
              + + N   +Y C  C+   +  R L  AVR
Sbjct: 797 YNGRKEANPDYEYLCTPCKAAIHSGR-LVAAVR 828



 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 66/224 (29%), Positives = 96/224 (42%), Gaps = 22/224 (9%)

Query: 16   VATASKDIAGEEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYH 65
            +A++S     +E G  N  V+C           +C +    G ++   +++C  CG+ YH
Sbjct: 985  LASSSIKTTEDEPGGENRLVLCSAKDKFVLTQDICVMCGAIGTDQEGCLIACTQCGQCYH 1044

Query: 66   RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKN 125
              C     +   +     W+C  C ICE C +  D  + + C  CD +YH YC  PP + 
Sbjct: 1045 PYCTN--VKVTKVILQKGWRCLDCTICEGCGQRNDEARLILCDDCDISYHIYCMDPPLEQ 1102

Query: 126  VSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS 185
            V  G + C     C  CG+N P  G +  W   YT C  C         CP C   Y D 
Sbjct: 1103 VPQGTWKCKWCAICQKCGTNSP--GFNSGWMNSYTECGPC----ASQTNCPSCNDGYADG 1156

Query: 186  ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            E   ++ C  C+RW+HC CD I +E   +        Y C  CR
Sbjct: 1157 EL--IIQCHQCERWLHCACDQIKNEAEAERCA--EEAYNCLICR 1196


>gi|194754301|ref|XP_001959434.1| GF12873 [Drosophila ananassae]
 gi|190620732|gb|EDV36256.1| GF12873 [Drosophila ananassae]
          Length = 1486

 Score =  109 bits (273), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 68/206 (33%), Positives = 97/206 (47%), Gaps = 19/206 (9%)

Query: 31  SNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCP 87
           S+ NV C  C  +G+        +++ C +CG  +H  C  L N    R     S W C 
Sbjct: 200 SDNNVECLTCSSLGD------LSKLIMCSTCGDHFHSTCVGLANLPDTR-----SGWNCA 248

Query: 88  SCRICEICRRTGDPN--KFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN 145
            C  C+ICR   D N  K++ C +C   YH  C  P    +    + C +   C  CGS 
Sbjct: 249 RCTKCQICR-VQDSNDLKYVKCEQCQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSR 307

Query: 146 VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCD 205
            PG G S RW   YT CD+C +   KG  CP+C K YR +    MV C  C ++VH  CD
Sbjct: 308 TPGGGSSSRWHSHYTICDSCYQQRNKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCD 367

Query: 206 GISD--EKYLQFQVDGNLQYRCPTCR 229
             +D    + + +++ +  Y CP C+
Sbjct: 368 EEADLTAYHKKKELNPDYDYVCPNCK 393



 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 87/202 (43%), Gaps = 34/202 (16%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    M++C  CG+ YH  C      +R +     W+C  C +CE C + 
Sbjct: 530 ICVMCGSLGIESDSAMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 147
            D  + + C  CD +YH YC +PP + V +G + C   T C  CG           SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNMP 647

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
                         C  C       + C VC   Y + E   ++ C+ C+ W H  CD +
Sbjct: 648 E-------------CPPCA----SQSACNVCKSAYANGEM--IIQCEHCELWSHFLCDTV 688

Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
           + +  +    D N+ Y+C  CR
Sbjct: 689 NAQLTID-HYDSNI-YKCLKCR 708


>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
          Length = 5261

 Score =  109 bits (273), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 93/202 (46%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 961  SNTDKFVLMQDMCVVCGSFGRGSEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1018

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1019 ECIVCEVCGKASDPSRLLLCDDCDISYHTYCLDPPLNTVPKGGWKCKWCVCCVQCGAVSP 1078

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  CD +
Sbjct: 1079 --GFHCEWQNNYTHCAPCASLVT----CPICQVKYVEEDL--LIQCQHCERWMHAVCDNL 1130

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E+ ++   D    + C +C+
Sbjct: 1131 FTEEEVEQAADEG--FDCTSCQ 1150



 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 48/94 (51%), Gaps = 5/94 (5%)

Query: 40  CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 99
           C V +  G    R +L C SCG  YH  CL+     R     S W+C  C++C+ CR +G
Sbjct: 234 CMVCDAPG--ELRDLLFCTSCGLHYHGTCLEITVTPRK---RSGWQCHECKVCQTCRLSG 288

Query: 100 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           + ++ + C  C+  YH YC  P  ++V +  + C
Sbjct: 289 EDSRMLVCEACEKCYHTYCLKPAIESVPADSWKC 322


>gi|402865478|ref|XP_003896948.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Papio
           anubis]
          Length = 1431

 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 695 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 752

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 753 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 810

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
           YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C  ++ E+ ++   D
Sbjct: 811 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 864

Query: 219 GNLQYRCPTCR 229
             + + C  CR
Sbjct: 865 --IGFDCSMCR 873



 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 95  CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 151

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 152 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 204

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 205 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 254


>gi|195149375|ref|XP_002015633.1| GL11176 [Drosophila persimilis]
 gi|194109480|gb|EDW31523.1| GL11176 [Drosophila persimilis]
          Length = 1486

 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 63/183 (34%), Positives = 86/183 (46%), Gaps = 12/183 (6%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 108
           +++ C +CG  +H  C  L N    R     S W C  C  C+ICR+  D N  K++ C 
Sbjct: 216 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQ-DSNDLKYVKCE 269

Query: 109 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
           +C   YH  C  P    +    + C +   C  CGS  PG G S RW   YT CD+C + 
Sbjct: 270 QCQKIYHASCFRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329

Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 226
             KG  CP+C K YR +    MV C  C ++VH  CD  +D      + + N    Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEQNPDYDYICP 389

Query: 227 TCR 229
            C+
Sbjct: 390 NCK 392



 Score = 90.5 bits (223), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 90/202 (44%), Gaps = 34/202 (16%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    M++C  CG+ YH  C  +   ++ +     W+C  C +CE C + 
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHSYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 587

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 147
            D  + + C  CD +YH YC +PP + V SG + C   T C  CG           SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPSGNWKCSFCTLCQKCGLNPTEKSDYGDSNMP 647

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
                         C +C       + C VC   Y   E   ++ C+ C+ W H  CD I
Sbjct: 648 E-------------CPSC----TSQSSCSVCRNPYSTGEM--IIQCETCELWSHFLCDSI 688

Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
           + +  +++  D N+ Y+C  CR
Sbjct: 689 NVQLTIEY-YDQNV-YKCLKCR 708


>gi|198456152|ref|XP_001360232.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
 gi|198135513|gb|EAL24806.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
          Length = 1486

 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 63/183 (34%), Positives = 86/183 (46%), Gaps = 12/183 (6%)

Query: 53  RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 108
           +++ C +CG  +H  C  L N    R     S W C  C  C+ICR+  D N  K++ C 
Sbjct: 216 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQ-DSNDLKYVKCE 269

Query: 109 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
           +C   YH  C  P    +    + C +   C  CGS  PG G S RW   YT CD+C + 
Sbjct: 270 QCQKIYHASCFRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329

Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 226
             KG  CP+C K YR +    MV C  C ++VH  CD  +D      + + N    Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEQNPDYDYICP 389

Query: 227 TCR 229
            C+
Sbjct: 390 NCK 392



 Score = 90.1 bits (222), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 90/202 (44%), Gaps = 34/202 (16%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G E    M++C  CG+ YH  C  +   ++ +     W+C  C +CE C + 
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHSYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 587

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 147
            D  + + C  CD +YH YC +PP + V SG + C   T C  CG           SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPSGNWKCSFCTLCQKCGLNPTEKSDYGDSNMP 647

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
                         C +C       + C VC   Y   E   ++ C+ C+ W H  CD I
Sbjct: 648 E-------------CPSC----TSQSSCSVCRNPYSTGEM--IIQCETCELWSHFLCDSI 688

Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
           + +  +++  D N+ Y+C  CR
Sbjct: 689 NVQLTIEY-YDQNV-YKCLKCR 708


>gi|348530100|ref|XP_003452549.1| PREDICTED: hypothetical protein LOC100689867 [Oreochromis niloticus]
          Length = 2557

 Score =  109 bits (272), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 68/216 (31%), Positives = 102/216 (47%), Gaps = 16/216 (7%)

Query: 54   MLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCRICEIC-RRTGDPNKFMFCR 108
            M+ C+ C + +H  CL    +   +N++     +W C  C+ C +C RR+      + CR
Sbjct: 1183 MIFCQICCEPFHSFCLSPEERPLEENKE-----NWFCRRCKFCHVCGRRSKSTKPVLQCR 1237

Query: 109  RCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACG 166
            RC  +YH  C  P  P    SS P++C    +C SCG   PG    + W      C  C 
Sbjct: 1238 RCQTSYHPSCLGPTYPKPMNSSVPWVCMTCIRCKSCGVT-PGKTWDLTWNHEQDLCPDCT 1296

Query: 167  RLFVKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYR 224
             L  KGN+C +C K Y DS + + M+ C  C  W+H +C+GIS+E + L     G + + 
Sbjct: 1297 SLHKKGNFCTICHKCYEDSIQPSQMLQCSQCSHWIHYRCEGISEELFGLLTSQPGRVDFT 1356

Query: 225  CPTCRGECYQVRDL-EDAVRELWRRKDMADKDLIAS 259
            C  C         L E+  R L  R +    DL++S
Sbjct: 1357 CSPCSQHQTSHSILKEELQRRLTARVEEVLTDLLSS 1392


>gi|392350034|ref|XP_003750554.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Rattus
            norvegicus]
          Length = 3894

 Score =  109 bits (272), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 99/197 (50%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1362 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1416

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1417 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1476

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+G+SDE Y
Sbjct: 1477 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSDEMY 1536

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1537 EILSNLPESVAYTCVNC 1553


>gi|392341954|ref|XP_003754471.1| PREDICTED: histone-lysine N-methyltransferase MLL [Rattus norvegicus]
          Length = 3987

 Score =  109 bits (272), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 99/197 (50%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1455 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1509

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1510 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1569

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+G+SDE Y
Sbjct: 1570 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSDEMY 1629

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1630 EILSNLPESVAYTCVNC 1646


>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
           (Silurana) tropicalis]
          Length = 6019

 Score =  108 bits (271), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 93/202 (46%), Gaps = 15/202 (7%)

Query: 31  SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 798 SNTDKFVLMQDMCVVCGSFGRGSEGHLLACSQCSQCYHPYCVNSRITKVMLL--KGWRCV 855

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 856 ECIVCEVCGKATDPSRLLLCDDCDISYHTYCLDPPLHTVPKGGWKCRWCVSCMQCGAVTP 915

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
             G    W   YT C  C  L      CPVC   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 916 --GFRSEWQNNYTHCAPCASLV----SCPVCHLKYLEGDL--LIQCRHCERWLHAVCENL 967

Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
             E+ ++   D    + C +C+
Sbjct: 968 FTEEEVEQAADEG--FDCSSCQ 987



 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 3/125 (2%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C SCG  YH  CL+           S W+CP C++C+ CR+ G+    + C  CD  
Sbjct: 233 LLFCTSCGLHYHGTCLEITVSP---LKRSGWQCPECKVCQTCRQPGEDTMMLVCDACDKG 289

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
           YH +C  P  + + +  + C     C  CGS         +W+  Y+ C  C     +  
Sbjct: 290 YHTFCLKPAIECLPTDSWKCKTCRVCRICGSRTAHMEPGSQWYDNYSVCSKCQEKRNRAE 349

Query: 174 YCPVC 178
            C +C
Sbjct: 350 TCVLC 354


>gi|321469512|gb|EFX80492.1| hypothetical protein DAPPUDRAFT_318677 [Daphnia pulex]
          Length = 1953

 Score =  108 bits (271), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 71/214 (33%), Positives = 98/214 (45%), Gaps = 24/214 (11%)

Query: 29  GQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
           G S+ N  C +C    N         L C +CGK YH +C+            ++W+C  
Sbjct: 449 GPSDAN--CAVCHSPSN-----VANQLFCVTCGKHYHGSCV---GLGSSPGVRTAWQCNE 498

Query: 89  CRICEICRR------TG-----DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHT 137
           C++C  CR       TG     D  K + C  CD  YH  C  P   N+    + C    
Sbjct: 499 CKVCITCRTPVAQQGTGAEAVTDRTKMLVCDTCDKNYHPSCVRPLISNIPKLGWKCKNCR 558

Query: 138 KCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
            C  CGS  PG+G S RW   +T CD+C +   KG  CP+C K YR S+   M  C  C+
Sbjct: 559 VCGDCGSRTPGSGPSSRWHACFTVCDSCYQQRNKGVSCPMCGKAYRHSQRE-MSQCTRCR 617

Query: 198 RWVHCQCDGISDEKYLQFQVDGN--LQYRCPTCR 229
           ++VH  CD  +D   +Q + D N   +Y CP C+
Sbjct: 618 KYVHSGCDPEADRTLVQRKKDMNSDYEYLCPPCK 651



 Score =  102 bits (255), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 85/192 (44%), Gaps = 14/192 (7%)

Query: 40   CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 99
            C +  + G ++  R++SC  CG+ YH  C     +   +     W+C  C +CE C    
Sbjct: 856  CAMCGSFGLDQEGRLISCAQCGQCYHPFCAN--VKVTKVILQKGWRCLDCTVCEGCGERH 913

Query: 100  DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF--L 157
            D  + + C  CD +YH YC  PP   V  G + C     C  CGSN P  GL+  W    
Sbjct: 914  DEARLLLCDECDISYHIYCMEPPLDYVPQGNWKCKWCAVCQVCGSNEP--GLNANWTHQA 971

Query: 158  GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
              + C  C  L      CP C   Y + E   ++ C  C +W+H  CD I +E+  +F  
Sbjct: 972  NGSLCGPCASL----RQCPSCSSSYNEGEL--IIQCQQCAQWLHAACDLIRNEREAEFCA 1025

Query: 218  DGNLQYRCPTCR 229
            +    Y C  CR
Sbjct: 1026 EDG--YTCVLCR 1035


>gi|170050214|ref|XP_001859681.1| set domain protein [Culex quinquefasciatus]
 gi|167871729|gb|EDS35112.1| set domain protein [Culex quinquefasciatus]
          Length = 2934

 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 92/200 (46%), Gaps = 21/200 (10%)

Query: 33  TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C  +G + EGC     +++C  CG+ YH  C     +   +     W+C  C 
Sbjct: 627 TQDICVMCGAIGTDQEGC-----LIACTQCGQCYHPYCTN--VKVTKVILQKGWRCLDCT 679

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           ICE C +  D  + + C  CD +YH YC  PP + V  G + C     C  CG+N P  G
Sbjct: 680 ICEGCGQRNDEGRLILCDDCDISYHTYCMDPPLEQVPQGNWKCKWCAICLKCGTNDP--G 737

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            +  W   Y+ C  C         CP C + Y D E   ++ C+ C+RW+HC CD I  E
Sbjct: 738 YNCAWLNNYSECGPCASQV----SCPCCGEGYADGEL--IIQCNQCERWLHCGCDQIKSE 791

Query: 211 KYLQ-FQVDGNLQYRCPTCR 229
              +    DG   Y C  CR
Sbjct: 792 NEAERCAEDG---YNCLLCR 808



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 67/229 (29%), Positives = 101/229 (44%), Gaps = 17/229 (7%)

Query: 34  NVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           ++ CR C    + G      ++ C  CG  YH  C+   AQ   +   S W+C SC+ C+
Sbjct: 288 DINCRQCSALGDVG-----NLMMCSICGDHYHGTCV-GLAQLPGV--RSGWQCGSCKKCQ 339

Query: 94  ICRRT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
           ICR       + + C +CD  YH  C  P   ++    + C     C  CGS  PG G S
Sbjct: 340 ICRVPDSSEGRTVGCEQCDKIYHASCLRPIMTSIPKYGWKCRCCRICSDCGSRTPGAGAS 399

Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
            RW   ++ CD+C +   KG  CP+C + YR +    MV C  C ++VH  CD  +D   
Sbjct: 400 SRWHAHFSVCDSCYQQRNKGFSCPICHRAYRAAAHREMVKCSGCNKFVHSTCDAEADLSV 459

Query: 213 LQFQVDGN--LQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIAS 259
              + + N   +Y C  C+   +  R          RR    D D +++
Sbjct: 460 YHAKKETNPDYEYLCSPCKTAIHSGR------MAAMRRNSSVDDDSMSA 502


>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3 [Ovis aries]
          Length = 4922

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 90/196 (45%), Gaps = 19/196 (9%)

Query: 36   MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
            MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 923  MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 975

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
             C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     G   
Sbjct: 976  ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGPRG 1033

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
             W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  C   S E+ +
Sbjct: 1034 EWQNNYTQCAPCASL----SACPVCHRNYREEDL--ILQCRQCDRWMHAVCQNFSTEEEV 1087

Query: 214  QFQVDGNLQYRCPTCR 229
            +   D  + + C  CR
Sbjct: 1088 ENVAD--IGFDCSLCR 1101



 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 78/155 (50%), Gaps = 10/155 (6%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 330 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 386

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 387 FCLQPVMKSVPTNGWRCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QENLCP 439

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 211
            C K Y       M+ C++C+RWVH +CD  +D +
Sbjct: 440 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPADHE 474


>gi|224083075|ref|XP_002188579.1| PREDICTED: histone-lysine N-methyltransferase MLL [Taeniopygia
            guttata]
          Length = 3849

 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/196 (30%), Positives = 99/196 (50%), Gaps = 10/196 (5%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +CF+  + G       + C+ C + +H+ CL+   + ++     +W C  C+ C +C R 
Sbjct: 1297 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEESERPQE-DQLENWCCRRCKFCHVCGRQ 1352

Query: 99   GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVR 154
                K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   +
Sbjct: 1353 HQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQ 1412

Query: 155  WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY- 212
            W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y 
Sbjct: 1413 WSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYE 1472

Query: 213  LQFQVDGNLQYRCPTC 228
            +   +  ++ Y C  C
Sbjct: 1473 ILSNLPESVAYTCINC 1488


>gi|119578438|gb|EAW58034.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_a [Homo
           sapiens]
          Length = 4539

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31  SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 370 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 427

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 428 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 487

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
             G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 488 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 539

Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
             E  ++   D    + C +C+
Sbjct: 540 FTEDDVEQAADEG--FDCVSCQ 559


>gi|66811728|ref|XP_640043.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
 gi|60468063|gb|EAL66073.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
          Length = 795

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/190 (33%), Positives = 88/190 (46%), Gaps = 27/190 (14%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCRICEICRRTGDPNKFMFCRRCD 111
           +++C SC KKYH  CL    +  D +    + WKC  C+ CE+C  +G   K +FC  CD
Sbjct: 578 LITCSSCSKKYHAKCLNLHQKCIDKYREDPTQWKCTDCKSCELCDDSGHDEKMLFCDVCD 637

Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS-VRWFLGYTCCDACGRLFV 170
             YH +C  PP      G + C     C  C S V  N L+ ++W   YTCCD+C   F 
Sbjct: 638 KGYHTFCLTPPLSQTPEGGWRCNDCAFCIHCYSRVDKNSLNKIKWKENYTCCDSC---FS 694

Query: 171 KG-----NYCPVCLKVYRD--SESTPMVCCDVCQRWVHCQC-----DGISDEKYLQFQVD 218
           KG      YCP+C    +D   E   +  C  C + VH  C     D + +E ++     
Sbjct: 695 KGFSEKSKYCPICSHSIKDEGEEEDSITTCQYCHKSVHDHCDQNIIDNLENEHFI----- 749

Query: 219 GNLQYRCPTC 228
               Y+CP C
Sbjct: 750 ----YKCPNC 755


>gi|326933334|ref|XP_003212761.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Meleagris gallopavo]
          Length = 3851

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1310 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1364

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1365 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1424

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1425 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1484

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1485 EILSNLPESVAYTCINC 1501


>gi|431901376|gb|ELK08402.1| Histone-lysine N-methyltransferase MLL2 [Pteropus alecto]
          Length = 5640

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1369 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1426

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1427 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1486

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1487 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHASCESL 1538

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1539 FTEDDVEQAADEG--FDCVSCQ 1558



 Score = 87.0 bits (214), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 61/131 (46%), Gaps = 8/131 (6%)

Query: 35  VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           V C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ 
Sbjct: 246 VRCAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQA 297

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
           CR+ G+ +K + C  CD  YH +C  PP + +    + C     C +CG+       +  
Sbjct: 298 CRKPGNDSKMLVCETCDKGYHTFCLKPPMEELPPHSWKCKACRVCRACGAGSAELNPNSE 357

Query: 155 WFLGYTCCDAC 165
           WF  Y+ C  C
Sbjct: 358 WFENYSLCHRC 368


>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
            harrisii]
          Length = 5047

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 908  SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 965

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 966  ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAVSP 1025

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1026 --GFHCEWQNSYTHCGPCASLVT----CPACRAPYVEEDL--LIQCRHCERWMHAGCESL 1077

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E+ ++   D    + C +C+
Sbjct: 1078 FTEEEVEQAADEG--FDCASCQ 1097



 Score = 83.2 bits (204), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 67/141 (47%), Gaps = 8/141 (5%)

Query: 27  EQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           E+ +   +  C +C     +G    R +L C SCG+ YH  CL      R     + W+C
Sbjct: 218 EEAEHLEDAHCVVC-----DGLGELRDLLFCTSCGQHYHGACLDTALTARK---RAGWQC 269

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV 146
           P C++C+ CR+ G+ +  + C  CD  YH +C  P  +++    + C     C +CG+  
Sbjct: 270 PDCKVCQTCRQPGEDSMMLVCEACDKGYHTFCLKPAIQSLPPDSWKCKTCRVCRACGACP 329

Query: 147 PGNGLSVRWFLGYTCCDACGR 167
                + +W+  Y+ C+ C R
Sbjct: 330 AELDPNCQWYENYSLCERCQR 350


>gi|313471390|sp|Q6PDK2.2|MLL2_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
            Full=Lysine N-methyltransferase 2D; Short=KMT2D
          Length = 5588

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1324 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1381

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1382 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1441

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CPVC   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1442 --GFHCEWQNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESL 1493

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1494 FTEDEVEQAADEG--FDCVSCQ 1513



 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C     EG  +   +L C SCG  YH  CL      R     +SW+CP C++C+ CR
Sbjct: 229 CAVC-----EGPGQLCDLLFCTSCGHHYHGACLDTALTARK---RASWQCPECKVCQSCR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP +++ +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 341 ENYSLCHRC 349


>gi|359718904|ref|NP_001028448.3| histone-lysine N-methyltransferase MLL2 [Mus musculus]
          Length = 5588

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1324 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1381

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1382 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1441

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CPVC   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1442 --GFHCEWQNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESL 1493

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1494 FTEDEVEQAADEG--FDCVSCQ 1513



 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C     EG  +   +L C SCG  YH  CL      R     +SW+CP C++C+ CR
Sbjct: 229 CAVC-----EGPGQLCDLLFCTSCGHHYHGACLDTALTARK---RASWQCPECKVCQSCR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP +++ +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 341 ENYSLCHRC 349


>gi|432105765|gb|ELK31956.1| Histone-lysine N-methyltransferase MLL, partial [Myotis davidii]
          Length = 3463

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +HR CL+     R L     +W C  C+ C +C R
Sbjct: 914  VCFLCASSG---HVEFVYCQVCCEPFHRFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 968

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 969  QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1028

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1029 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1088

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1089 EILSSLPESVAYTCVNC 1105


>gi|417414196|gb|JAA53397.1| Putative histone-lysine n-methyltransferase mll, partial [Desmodus
            rotundus]
          Length = 3966

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1403 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1457

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1458 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1517

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1518 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1577

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1578 EILSNLPESVAYTCVNC 1594


>gi|363742545|ref|XP_417896.3| PREDICTED: histone-lysine N-methyltransferase MLL [Gallus gallus]
          Length = 3871

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1314 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1368

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1369 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1428

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1429 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1488

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1489 EILSNLPESVAYTCINC 1505


>gi|392341685|ref|XP_001062568.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Rattus norvegicus]
          Length = 5543

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1324 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1381

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1382 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1441

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1442 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1493

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1494 FTEDEVEQAADEG--FDCVSCQ 1513



 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP +++ +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340

Query: 157 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
             Y+ C  C +  V+G+       V   +E  P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369


>gi|281343718|gb|EFB19302.1| hypothetical protein PANDA_017001 [Ailuropoda melanoleuca]
          Length = 4932

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31  SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 806 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 863

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 864 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 923

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
             G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 924 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYMEEDL--LIQCRHCERWMHAGCESL 975

Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
             E  ++   D    + C +C+
Sbjct: 976 FTEDDVEQAADEG--FDCVSCQ 995


>gi|392355921|ref|XP_002729900.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Rattus
            norvegicus]
          Length = 5543

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1324 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1381

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1382 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1441

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1442 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1493

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1494 FTEDEVEQAADEG--FDCVSCQ 1513



 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP +++ +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340

Query: 157 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
             Y+ C  C +  V+G+       V   +E  P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369


>gi|334330381|ref|XP_001380704.2| PREDICTED: histone-lysine N-methyltransferase MLL [Monodelphis
            domestica]
          Length = 3960

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1421 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1475

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1476 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1535

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1536 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1595

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1596 EILSNLPESVAYTCVNC 1612


>gi|355752689|gb|EHH56809.1| hypothetical protein EGM_06289 [Macaca fascicularis]
          Length = 3844

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1309 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1363

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1364 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1423

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1424 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1483

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1484 EILSNLPESVAYTCVNC 1500


>gi|348573849|ref|XP_003472703.1| PREDICTED: histone-lysine N-methyltransferase MLL-like, partial
           [Cavia porcellus]
          Length = 2799

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
           +CF+  + G       + C+ C + +HR CL+     R L     +W C  C+ C +C R
Sbjct: 269 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 323

Query: 98  TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 324 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 383

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
           +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 384 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 443

Query: 213 -LQFQVDGNLQYRCPTC 228
            +   +  ++ Y C  C
Sbjct: 444 EILSNLPESVAYTCINC 460


>gi|449267369|gb|EMC78314.1| Histone-lysine N-methyltransferase HRX, partial [Columba livia]
          Length = 3786

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1280 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1334

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1335 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1394

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1395 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1454

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1455 EILSNLPESVAYTCINC 1471


>gi|410972021|ref|XP_003992459.1| PREDICTED: histone-lysine N-methyltransferase MLL [Felis catus]
          Length = 3554

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1023 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1077

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1078 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1137

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1138 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1197

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1198 EILSNLPESVAYTCVNC 1214


>gi|184394|gb|AAA58669.1| HRX [Homo sapiens]
          Length = 3969

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624


>gi|355567103|gb|EHH23482.1| hypothetical protein EGK_06957, partial [Macaca mulatta]
          Length = 3824

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1288 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1342

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1343 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1402

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1403 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1462

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1463 EILSNLPESVAYTCVNC 1479


>gi|344254289|gb|EGW10393.1| Histone-lysine N-methyltransferase MLL2 [Cricetulus griseus]
          Length = 4002

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1738 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1795

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1796 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1855

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CPVC   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1856 --GFHCEWQNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESL 1907

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1908 FTEDEVEQAADEG--FDCVSCQ 1927



 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 67/149 (44%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C+ C+ CR
Sbjct: 642 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTAR---KRAGWQCPECKECQACR 693

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG        +  WF
Sbjct: 694 KPGNDSKMLVCETCDKGYHTFCLKPPIEELPAHSWKCMTCRVCRACGVGSAELNPNSEWF 753

Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
             Y+ C  C     G+ F  V G   PVC
Sbjct: 754 ENYSLCHRCHKAQGGQPFISVAGQRLPVC 782


>gi|431908264|gb|ELK11862.1| Histone-lysine N-methyltransferase HRX [Pteropus alecto]
          Length = 3459

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 957  VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1011

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1012 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1071

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1072 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1131

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1132 EILSNLPESVAYTCVNC 1148


>gi|344293012|ref|XP_003418218.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Loxodonta africana]
          Length = 3962

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1429 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1483

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1484 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1543

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1544 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1603

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1604 EILSNLPESVAYTCVNC 1620


>gi|426244626|ref|XP_004016122.1| PREDICTED: histone-lysine N-methyltransferase MLL [Ovis aries]
          Length = 3710

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1178 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1232

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1233 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1292

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1293 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1352

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1353 EILSNLPESVAYTCVNC 1369


>gi|114640631|ref|XP_508792.2| PREDICTED: histone-lysine N-methyltransferase MLL [Pan troglodytes]
          Length = 3969

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624


>gi|157109809|ref|XP_001650834.1| set domain protein [Aedes aegypti]
 gi|108878936|gb|EAT43161.1| AAEL005378-PA [Aedes aegypti]
          Length = 1458

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 64/214 (29%), Positives = 96/214 (44%), Gaps = 22/214 (10%)

Query: 26  EEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
           EE G  N  V+C           +C +    G ++   +++C  CG+ YH  C     + 
Sbjct: 733 EEPGGDNRLVLCSAKDKFVLTQDICVMCGAIGTDQEGCLIACTQCGQCYHPYCTN--VKV 790

Query: 76  RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
             +     W+C  C ICE C +  D  + + C  CD +YH YC  PP ++V  G + C  
Sbjct: 791 TKVILQKGWRCLDCTICEGCGQRNDEGRLILCDDCDISYHIYCMDPPLEHVPQGNWKCKW 850

Query: 136 HTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
              C  CGS+ PG+  +  W   Y+ C  C         CPVC + Y + E   ++ C+ 
Sbjct: 851 CAICLKCGSSNPGH--NSNWLNNYSECGPCASQV----NCPVCAEGYVEGEL--IIQCNT 902

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           C+RW+HC CD I  E   +   +    Y C  CR
Sbjct: 903 CERWLHCGCDQIKTENDAERCAEEG--YNCTLCR 934



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 67/232 (28%), Positives = 102/232 (43%), Gaps = 16/232 (6%)

Query: 34  NVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           ++ CR C      G      ++ C  CG  YH  C+   AQ   +   + W+C SC+ C+
Sbjct: 414 DINCRQC-----SGLGDVGNLMMCSICGDHYHGKCV-GLAQLPGV--RAGWQCSSCKKCQ 465

Query: 94  ICRRT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
           ICR       + + C +CD  YH  C  P   ++    + C     C  CGS  PG G S
Sbjct: 466 ICRVPDSSEGRTVGCEQCDKIYHASCLRPVMTSIPKYGWKCKCCRVCSDCGSRTPGAGAS 525

Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
            RW   YT CD+C +   KG  CP+C + YR +    MV C  C ++VH  CD  ++   
Sbjct: 526 SRWHAHYTVCDSCYQQRNKGFSCPICHRAYRAAAHREMVKCSGCNKFVHSTCDPEAELTV 585

Query: 213 LQFQVDGN--LQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRA 262
              + + N   +Y C  C+   +  R        + R   + D  + AS+ +
Sbjct: 586 YHAKKENNPDYEYLCNPCKASLHTGR-----FSAMRRTSSIDDDSMSASMES 632


>gi|403263194|ref|XP_003923935.1| PREDICTED: histone-lysine N-methyltransferase MLL [Saimiri
            boliviensis boliviensis]
          Length = 3985

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1451 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1505

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1506 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1565

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1566 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1625

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1626 EILSNLPESVAYTCVNC 1642


>gi|395744200|ref|XP_002823221.2| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 3 [Pongo
            abelii]
          Length = 5293

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1093 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1150

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1151 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1210

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1211 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1262

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1263 FTEDDVEQAADEG--FDCVSCQ 1282



 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG        +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340

Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
             Y+ C  C +         V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPISSVAEQHTPVC 369


>gi|56550039|ref|NP_005924.2| histone-lysine N-methyltransferase MLL isoform 2 precursor [Homo
            sapiens]
 gi|146345435|sp|Q03164.5|MLL1_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
            Full=ALL-1; AltName: Full=CXXC-type zinc finger protein
            7; AltName: Full=Lysine N-methyltransferase 2A;
            Short=KMT2A; AltName: Full=Trithorax-like protein;
            AltName: Full=Zinc finger protein HRX; Contains: RecName:
            Full=MLL cleavage product N320; AltName: Full=N-terminal
            cleavage product of 320 kDa; Short=p320; Contains:
            RecName: Full=MLL cleavage product C180; AltName:
            Full=C-terminal cleavage product of 180 kDa; Short=p180
 gi|34305635|gb|AAQ63624.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila) [Homo sapiens]
          Length = 3969

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624


>gi|119587784|gb|EAW67380.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_a [Homo sapiens]
          Length = 3969

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624


>gi|402895434|ref|XP_003910832.1| PREDICTED: histone-lysine N-methyltransferase MLL [Papio anubis]
          Length = 3968

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1606

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623


>gi|2358287|gb|AAC51735.1| ALR [Homo sapiens]
          Length = 4957

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 13/183 (7%)

Query: 31  SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 788 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 845

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 846 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 905

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
             G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 906 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 957

Query: 208 SDE 210
             E
Sbjct: 958 FTE 960


>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
          Length = 2526

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 91/192 (47%), Gaps = 14/192 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           LC V  + G      M+SC +C + YH  C+    +         W+C  C ICE C   
Sbjct: 442 LCLVCGSIGKGPEASMVSCANCSQTYHTYCVTLHDKMNSAILGRGWRCLDCTICEGCGNG 501

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFL 157
           GD  K + C  CD +YH YC  PP ++V SGP+ C   ++C  C      GN L+ +   
Sbjct: 502 GDEEKLLLCDECDVSYHVYCMKPPLESVPSGPWRCHWCSRCRRCNHKATSGNDLTPKGL- 560

Query: 158 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
               C +C  L V    CP C + Y+ ++   ++ C +C++W H  C+ +  E+ L+   
Sbjct: 561 ----CHSCASLQV----CPCCNRGYQINDK--IIRCSLCKKWQHGACENLHTEEQLEQAA 610

Query: 218 DGNLQYRCPTCR 229
              +  RC +CR
Sbjct: 611 QNRM--RCASCR 620


>gi|395841650|ref|XP_003793647.1| PREDICTED: uncharacterized protein LOC100944849 [Otolemur garnettii]
          Length = 5488

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1329 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1386

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1387 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1446

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1447 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1498

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1499 FTEDDVEQAADEG--FDCVSCQ 1518



 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 62/129 (48%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CTVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG++      +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRICRTCGASSAELNPNSEWF 340

Query: 157 LGYTCCDAC 165
             ++ C  C
Sbjct: 341 ENFSLCHRC 349


>gi|350583914|ref|XP_003481621.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Sus scrofa]
          Length = 5154

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 947  SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1004

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1005 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1064

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1065 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1116

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1117 FTEDDVEQAADEG--FDCVSCQ 1136


>gi|345799715|ref|XP_536554.3| PREDICTED: histone-lysine N-methyltransferase MLL [Canis lupus
            familiaris]
          Length = 3829

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1300 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1354

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1355 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1414

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1415 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1474

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1475 EILSNLPESVAYTCVNC 1491


>gi|301785015|ref|XP_002927929.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Ailuropoda
            melanoleuca]
          Length = 3981

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1450 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1504

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1505 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1564

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1565 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1624

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1625 EILSNLPESVAYTCVNC 1641


>gi|426372409|ref|XP_004053116.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 2 [Gorilla
            gorilla gorilla]
          Length = 5284

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1093 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1150

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1151 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1210

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1211 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1262

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1263 FTEDDVEQAADEG--FDCVSCQ 1282



 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG        +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 341 ENYSLCHRC 349


>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
            sapiens]
          Length = 5265

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1096 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1153

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1154 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1213

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1214 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1265

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1266 FTEDDVEQAADEG--FDCVSCQ 1285



 Score = 86.3 bits (212), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
             Y+ C  C     G+    V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369


>gi|395848655|ref|XP_003796965.1| PREDICTED: histone-lysine N-methyltransferase MLL [Otolemur
            garnettii]
          Length = 4062

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1528 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1582

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1583 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1642

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1643 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1702

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1703 EILSNLPESVAYTCVNC 1719


>gi|119587787|gb|EAW67383.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_d [Homo sapiens]
          Length = 4002

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1466 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1520

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1521 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1580

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1581 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1640

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1641 EILSNLPESVAYTCVNC 1657


>gi|397510996|ref|XP_003846168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Pan paniscus]
          Length = 5373

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1200 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1257

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1258 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1317

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1318 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1369

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1370 FTEDDVEQAADEG--FDCVSCQ 1389



 Score = 86.3 bits (212), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
             Y+ C  C +         V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPIRSVAEQHTPVC 369


>gi|403297007|ref|XP_003939383.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Saimiri
            boliviensis boliviensis]
          Length = 5498

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1386 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1443

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1444 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1503

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1504 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1555

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1556 FTEDDVEQAADEG--FDCVSCQ 1575



 Score = 86.3 bits (212), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
             Y+ C  C +         V   + PVC
Sbjct: 341 ENYSLCHHCHKAQGGQPLSSVAEQHTPVC 369


>gi|242005679|ref|XP_002423690.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
            corporis]
 gi|212506866|gb|EEB10952.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
            corporis]
          Length = 3311

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 70/230 (30%), Positives = 105/230 (45%), Gaps = 41/230 (17%)

Query: 12   ESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKN 71
            E+G+    S+D +           +C LC      G     +++ C SC + YH  C+  
Sbjct: 906  ETGFCLIGSEDFS--------VRALCFLC------GSSGQEKLIHCASCCEPYHEFCIDE 951

Query: 72   WAQNRDLFHWS-SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC--------QHPP 122
                     W   W CP C +C  C +T      + C +CD +YH  C         H P
Sbjct: 952  AQLKLQNNTWKFDWVCPRCTVCFTCGKTSGQQ--LKCVKCDNSYHIECVDRVGGRLLHSP 1009

Query: 123  HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKV 181
             +     P++C    +C SC      NG+ V  F+G    C AC  L  KGN+CP+C + 
Sbjct: 1010 DR-----PWVCSICLRCKSC------NGVDVSVFVGNLPLCRACFVLRQKGNFCPLCQRC 1058

Query: 182  YRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
            Y D +  + M+ C  C+ WVH +C+G+SDEKY  L F +  +++Y C  C
Sbjct: 1059 YNDDDYDSKMMECGQCKCWVHAKCEGLSDEKYQVLSF-LPESVEYVCRMC 1107


>gi|426372407|ref|XP_004053115.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Gorilla
            gorilla gorilla]
          Length = 5550

 Score =  107 bits (266), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1359 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1416

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1417 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1476

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1477 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1528

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1529 FTEDDVEQAADEG--FDCVSCQ 1548



 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG        +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340

Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
             Y+ C  C +         V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPIRSVAEQHTPVC 369


>gi|160333334|ref|NP_001103749.1| histone-lysine N-methyltransferase MLL [Danio rerio]
 gi|158714185|gb|ABW79914.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 4218

 Score =  107 bits (266), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 91/200 (45%), Gaps = 13/200 (6%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
            V+C LC    N         + C+   + +H  CL    +  D   W +W C  CR C +
Sbjct: 1626 VVCFLCASSGNV------EFVFCQVRCEPFHLFCLGEAERPHD-EQWENWCCRRCRFCHV 1678

Query: 95   C-RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
            C R+     + + C +C  +YH  C    HP         ++C K  +C SCG+  PG  
Sbjct: 1679 CGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKA 1738

Query: 151  LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
               +W   ++ C  C +   KGN CP+C K Y D +  + M+ C  C RWVH +C+ ++D
Sbjct: 1739 WDAQWSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTD 1798

Query: 210  EK-YLQFQVDGNLQYRCPTC 228
            +   L   +  N+ Y C  C
Sbjct: 1799 DMCELMSSLPENVVYTCTNC 1818


>gi|332208875|ref|XP_003253537.1| PREDICTED: histone-lysine N-methyltransferase MLL [Nomascus
            leucogenys]
          Length = 3968

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1606

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623


>gi|297691727|ref|XP_002823219.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Pongo
            abelii]
          Length = 5559

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1359 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1416

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1417 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1476

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1477 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1528

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1529 FTEDDVEQAADEG--FDCVSCQ 1548



 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG        +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340

Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
             Y+ C  C +         V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPISSVAEQHTPVC 369


>gi|349603659|gb|AEP99439.1| Histone-lysine N-methyltransferase MLL3-like protein, partial
           [Equus caballus]
          Length = 452

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 89/191 (46%), Gaps = 12/191 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 62  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 119

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL   W   
Sbjct: 120 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 177

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
           YT C  C  L    + CPVC + YR  E   ++ C  C RW+H  C  +S E+ ++   D
Sbjct: 178 YTQCAPCASL----SSCPVCYRNYR--EEDLILQCRQCDRWMHAVCQNLSTEEEVENVAD 231

Query: 219 GNLQYRCPTCR 229
             + + C  CR
Sbjct: 232 --IGFDCSMCR 240


>gi|124486682|ref|NP_001074518.1| histone-lysine N-methyltransferase MLL [Mus musculus]
          Length = 3963

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSDEMY 1606

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623


>gi|397498815|ref|XP_003820170.1| PREDICTED: histone-lysine N-methyltransferase MLL [Pan paniscus]
          Length = 4202

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1666 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1720

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1721 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1780

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1781 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1840

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1841 EILSNLPESVAYTCVNC 1857


>gi|301783643|ref|XP_002927255.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Ailuropoda
            melanoleuca]
          Length = 5483

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1368 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1425

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1426 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1485

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1486 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYMEEDL--LIQCRHCERWMHAGCESL 1537

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1538 FTEDDVEQAADEG--FDCVSCQ 1557



 Score = 85.9 bits (211), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
             Y+ C  C     G+L   V     PVC
Sbjct: 341 ENYSLCHRCHKAQGGQLVSSVAEQQPPVC 369


>gi|241687917|ref|XP_002401627.1| mll protein, putative [Ixodes scapularis]
 gi|215504524|gb|EEC14018.1| mll protein, putative [Ixodes scapularis]
          Length = 1259

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 74/237 (31%), Positives = 106/237 (44%), Gaps = 30/237 (12%)

Query: 1   MVAAEDY--ARRFESGYVATASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSC 57
           +   E+Y  A+   +G+   +S+  A           +C LC   GE E       +L C
Sbjct: 777 LFGEEEYRRAQIIATGFALVSSRRFA--------VQAVCFLCASAGEEE-------LLFC 821

Query: 58  KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCY 117
             C + YH  CL      + L    SW CP C+ C  C      ++ + C +C   YH  
Sbjct: 822 TVCCEPYHWFCLDPEEAPQGLDK-ESWCCPRCQTCIACGHRSSVSQLLRCSKCQQTYHTD 880

Query: 118 CQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           C  P + +  S     +LC K  +C SCG++   +     W    + C  C  L  KGNY
Sbjct: 881 CLGPGYPSKPSRKKKIWLCVKCIRCKSCGTSSKQSA----WNFDLSLCQDCMLLREKGNY 936

Query: 175 CPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
           CP+C K Y D +  + MV C  CQ+W+H +CDGIS+E Y  L    +  L Y C  C
Sbjct: 937 CPLCEKCYEDDDYESMMVQCSQCQKWIHARCDGISEELYQVLSLLPETEL-YLCRIC 992


>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
 gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
            Full=ALL1-related protein; AltName: Full=Lysine
            N-methyltransferase 2D; Short=KMT2D; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 2
 gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
            sapiens]
          Length = 5537

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1368 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1425

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1426 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1485

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1486 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1537

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1538 FTEDDVEQAADEG--FDCVSCQ 1557



 Score = 86.3 bits (212), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
             Y+ C  C     G+    V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369


>gi|348580193|ref|XP_003475863.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Cavia
            porcellus]
          Length = 5577

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1368 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1425

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1426 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1485

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1486 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1537

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1538 FTEDDVEQAADEG--FDCVSCQ 1557



 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 59/129 (45%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C     M  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELC----NMFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C  CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRVCGAGSSELNPNSEWF 340

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 341 ENYSLCHRC 349


>gi|297262270|ref|XP_001099471.2| PREDICTED: histone-lysine N-methyltransferase MLL2 [Macaca mulatta]
          Length = 5505

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1315 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1372

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1373 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1432

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1433 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1484

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1485 FTEDDVEQAADEG--FDCISCQ 1504



 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 77/175 (44%), Gaps = 18/175 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDACGRL-------FVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHCQ 203
             Y+ C  C R         V   + PVC +    +S  TP    D    +V CQ
Sbjct: 341 ENYSLCHRCHRAQGGQPVSSVAEQHTPVCSRFSPPESGDTPTDEPDAL--YVACQ 393


>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
          Length = 5262

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 13/183 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1093 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1150

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1151 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1210

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1211 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1262

Query: 208  SDE 210
              E
Sbjct: 1263 FTE 1265



 Score = 86.3 bits (212), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
             Y+ C  C     G+    V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369


>gi|410964289|ref|XP_003988688.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Felis catus]
          Length = 5559

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1353 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1410

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1411 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1470

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1471 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1522

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1523 FTEDDVEQAADEG--FDCVSCQ 1542



 Score = 85.9 bits (211), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 341 ENYSLCHRC 349


>gi|402885854|ref|XP_003919662.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Papio anubis]
          Length = 5547

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1341 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1398

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1399 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1458

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1459 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1510

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1511 FTEDDVEQXADEG--FDCISCQ 1530



 Score = 86.7 bits (213), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 62/131 (47%), Gaps = 8/131 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDACGR 167
             Y+ C  C R
Sbjct: 341 ENYSLCHRCHR 351


>gi|332206905|ref|XP_003252537.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Nomascus leucogenys]
          Length = 5407

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1380 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1437

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1438 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1497

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1498 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1549

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1550 FTEDDVEQAADEG--FDCVSCQ 1569



 Score = 85.9 bits (211), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
             Y+ C  C +         V   + PVC
Sbjct: 341 ENYSVCHRCHKAQGGQPVSSVAEQHTPVC 369


>gi|2160396|dbj|BAA03407.1| MLL [Homo sapiens]
          Length = 1909

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624


>gi|355564192|gb|EHH20692.1| hypothetical protein EGK_03605 [Macaca mulatta]
          Length = 5538

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1367 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1424

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1425 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1484

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1485 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1536

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1537 FTEDDVEQAADEG--FDCISCQ 1556



 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 77/175 (44%), Gaps = 18/175 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDACGRL-------FVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHCQ 203
             Y+ C  C R         V   + PVC +    +S  TP    D    +V CQ
Sbjct: 341 ENYSLCHRCHRAQGGQPVSSVAEQHTPVCSRFSPPESGDTPTDEPDAL--YVACQ 393


>gi|440904942|gb|ELR55394.1| Histone-lysine N-methyltransferase MLL, partial [Bos grunniens mutus]
          Length = 3846

 Score =  106 bits (265), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1314 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1368

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1369 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1428

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1429 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1488

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1489 EILSNLPESVAYTCVNC 1505


>gi|345792161|ref|XP_543684.3| PREDICTED: histone-lysine N-methyltransferase MLL2 [Canis lupus
            familiaris]
          Length = 5552

 Score =  106 bits (265), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1350 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1407

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1408 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1467

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1468 --GFHCEWQNSYTHCGPCASLVT----CPICHTPYVEEDL--LIQCRHCERWMHAGCESL 1519

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1520 FTEDDVEQAADEG--FDCVSCQ 1539



 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 69/149 (46%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
             Y+ C  C     G+L   V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQLIGSVAEQHPPVC 369


>gi|390467630|ref|XP_002807137.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Callithrix jacchus]
          Length = 5289

 Score =  106 bits (265), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1353 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1410

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1411 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1470

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1471 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1522

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1523 FTEDDVEQAADEG--FDCVSCQ 1542



 Score = 85.5 bits (210), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNSNSEWF 340

Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
             Y+ C  C     G+    V   + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPLSSVAEQHTPVC 369


>gi|297482744|ref|XP_002693122.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Bos
            taurus]
 gi|296480196|tpg|DAA22311.1| TPA: myeloid/lymphoid or mixed-lineage leukemia-like [Bos taurus]
          Length = 3821

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1289 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1343

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1344 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1403

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1404 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1463

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1464 EILSNLPESVAYTCVNC 1480


>gi|328776663|ref|XP_394941.4| PREDICTED: hypothetical protein LOC411466 [Apis mellifera]
          Length = 4678

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 57/180 (31%), Positives = 82/180 (45%), Gaps = 17/180 (9%)

Query: 33  TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C  +G + EGC     +++C  CG+ YH  C     +   +     W+C  C 
Sbjct: 359 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 411

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C    D  + + C  CD +YH YC  PP   V  G + C     C +CGSN P  G
Sbjct: 412 VCEGCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 469

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            +  W   YT C  C         C  C + Y + +   ++ C  C+RW+HC CD I  E
Sbjct: 470 FNSSWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSIKSE 523


>gi|426226681|ref|XP_004007467.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL2 [Ovis aries]
          Length = 5387

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1425 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1482

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1483 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1542

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1543 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1594

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1595 FTEDDVEQAADEG--FDCVSCQ 1614



 Score = 85.9 bits (211), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 341 ENYSLCHRC 349


>gi|194666944|ref|XP_583302.4| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
 gi|297474553|ref|XP_002687353.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
 gi|296487853|tpg|DAA29966.1| TPA: myeloid/lymphoid or mixed-lineage leukemia 2-like [Bos taurus]
          Length = 5503

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1331 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1388

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1389 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1448

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1449 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1500

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1501 FTEDDVEQAADEG--FDCVSCQ 1520



 Score = 85.9 bits (211), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 341 ENYSLCHRC 349


>gi|86129850|gb|ABC86577.1| myeloid/lymphoid or mixed-lineage leukemia protein [Danio rerio]
          Length = 1154

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 60/200 (30%), Positives = 92/200 (46%), Gaps = 13/200 (6%)

Query: 35  VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           V+C LC    N         + C+ C + +H  CL    +  D   W +W C  CR C +
Sbjct: 142 VVCFLCASSGNV------EFVFCQVCCEPFHLFCLGEAERPHDE-QWENWCCRRCRFCHV 194

Query: 95  C-RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           C R+     + + C +C  +YH  C    HP         ++C K  +C SCG+  PG  
Sbjct: 195 CGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKA 254

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
              +W   ++ C  C +   KGN CP+C K Y D +  + M+ C  C RWVH +C+ ++D
Sbjct: 255 WDAQWSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTD 314

Query: 210 EK-YLQFQVDGNLQYRCPTC 228
           +   L   +  N+ Y C  C
Sbjct: 315 DMCELMSSLPENVVYTCTNC 334


>gi|119587786|gb|EAW67382.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
           Drosophila), isoform CRA_c [Homo sapiens]
          Length = 3130

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
           +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 594 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 648

Query: 98  TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 649 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 708

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
           +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 709 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 768

Query: 213 -LQFQVDGNLQYRCPTC 228
            +   +  ++ Y C  C
Sbjct: 769 EILSNLPESVAYTCVNC 785


>gi|297458806|ref|XP_585092.4| PREDICTED: histone-lysine N-methyltransferase MLL [Bos taurus]
          Length = 3826

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1294 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1348

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1349 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1408

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 1409 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1468

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1469 EILSNLPESVAYTCVNC 1485


>gi|384494147|gb|EIE84638.1| hypothetical protein RO3G_09348 [Rhizopus delemar RA 99-880]
          Length = 690

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 88/207 (42%), Gaps = 20/207 (9%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSS--WKCPSCRICEICRRTGDPNKFMFCRRCD 111
           ++ C  C +KYH  C  N    + +    S  W CP C++C +CR  GD +  M C  CD
Sbjct: 435 LVKCSRCTRKYHPVC-ANLTTPKQVVGAESYPWLCPECKVCFVCRTAGDESTLMICDGCD 493

Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN------------VPGNGLSVRWFLGY 159
             +H  C  P   ++  G +LC    KCH C                P +          
Sbjct: 494 RGWHTGCCTPKVDHIPEGEWLCQLCAKCHGCNERGMKDESQYTHVAAPKSDKCKYPVYLA 553

Query: 160 TCCDACGRLFVKGNYCPVCLKVY----RDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQ 214
           T CD C   F +  +CPVCLK Y     D E   MV CD C  WVH +CD  ++ E+Y  
Sbjct: 554 TYCDKCVIDFKEDRFCPVCLKTYSDEENDEEDNEMVACDTCDHWVHTRCDESLTPERYQM 613

Query: 215 FQVDGNLQYRCPTCRGECYQVRDLEDA 241
              D + +Y CP C        D E A
Sbjct: 614 LCDDESAKYSCPMCEDRIKSTVDTEAA 640


>gi|62088596|dbj|BAD92745.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
           Drosophila) variant [Homo sapiens]
          Length = 2880

 Score =  106 bits (264), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
           +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 344 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 398

Query: 98  TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 399 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 458

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
           +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 459 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 518

Query: 213 -LQFQVDGNLQYRCPTC 228
            +   +  ++ Y C  C
Sbjct: 519 EILSNLPESVAYTCVNC 535


>gi|256081465|ref|XP_002576990.1| myst-related protein [Schistosoma mansoni]
 gi|353229452|emb|CCD75623.1| myst-related protein [Schistosoma mansoni]
          Length = 1074

 Score =  106 bits (264), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 83/176 (47%), Gaps = 11/176 (6%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L+C  CG+ YH  C +     R +     W+C  C +CE C  T + +  + C  C+ +
Sbjct: 784 LLACSQCGQCYHSFCAEVPKITRTMIE-KGWRCLDCTVCEGCGGTSNESLLLLCDDCNIS 842

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
           +H YC  PP K V  G + C     C +CG   P  GL+ +W   Y+ C  C  L     
Sbjct: 843 FHTYCLDPPLKEVPKGGWKCTDCVICTNCGQKDP--GLNGKWHANYSVCAPCASL----T 896

Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            CP+C   YR+ E   +V C +C RW H  CD +  E  L+   D  L Y C  CR
Sbjct: 897 TCPICNLAYREEEL--LVRCALCTRWAHANCDQLRTEDELEIATD--LGYNCLLCR 948



 Score = 89.7 bits (221), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 101/243 (41%), Gaps = 44/243 (18%)

Query: 26  EEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           E    + T++ C LC   E+ G      +L C  CG  YH +CL+   Q         W+
Sbjct: 302 EALASNCTSIQCGLC---ESPG--DLTELLFCTGCGSHYHASCLEPPLQPSPTIRIG-WQ 355

Query: 86  CPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-- 143
           C  C+ C IC  + D NK + C  CD  YH YC  PP  ++    + C +   C  CG  
Sbjct: 356 CAECKTCLICNESKDENKMLVCDVCDKGYHTYCLKPPVSSIPKNGFRCERCRVCSDCGGG 415

Query: 144 --SNVPG--------NGL--SVRWFLGYTCCDACGRLFVKGN-YCPVCLKV--------- 181
             S + G        N L  +VRW   YT CD C     + N  CPVC +          
Sbjct: 416 RSSTLSGLEGPVAFNNQLNPNVRWHSNYTLCDRCFHAHKRPNSCCPVCERAWRCSLPVPE 475

Query: 182 --YRDSESTPMVC----CDVCQRWVHCQCDGISDEKYL------QFQVDG--NLQYRCPT 227
             YR+  ST +V     C  C+R VH +CD  S++  +        +  G     Y CP 
Sbjct: 476 NFYRNPNSTFLVWPGSRCSQCRRMVHAECDPSSNQATVSPLSAASEETSGICGTNYVCPV 535

Query: 228 CRG 230
           CR 
Sbjct: 536 CRA 538


>gi|350588548|ref|XP_003357368.2| PREDICTED: histone-lysine N-methyltransferase MLL [Sus scrofa]
          Length = 2525

 Score =  106 bits (264), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 9/183 (4%)

Query: 53  RMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRC 110
           + + C+ C + +H+ CL+     R L     +W C  C+ C +C R     K +  C +C
Sbjct: 2   KFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKC 59

Query: 111 DAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
             +YH  C  P +    +     ++C K  +C SCGS  PG G   +W   ++ C  C +
Sbjct: 60  RNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAK 119

Query: 168 LFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRC 225
           LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y +   +  ++ Y C
Sbjct: 120 LFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPESVAYTC 179

Query: 226 PTC 228
             C
Sbjct: 180 VNC 182


>gi|351695440|gb|EHA98358.1| Histone-lysine N-methyltransferase MLL3 [Heterocephalus glaber]
          Length = 4724

 Score =  105 bits (263), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 337 CTTCGQHYHGMCLDIVVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 393

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 394 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLICDTCYQQ--QDNLCP 446

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 447 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 496


>gi|345490044|ref|XP_001603865.2| PREDICTED: LOW QUALITY PROTEIN: hypothetical protein LOC100120205
           [Nasonia vitripennis]
          Length = 5138

 Score =  105 bits (263), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 81/174 (46%), Gaps = 17/174 (9%)

Query: 36  MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           +C +C  +G + EGC     +++C  CG+ YH  C     +   +     W+C  C +CE
Sbjct: 475 ICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCTVCE 527

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C    D  + + C  CD +YH YC  PP + V  G + C    +C +CG+N P  G + 
Sbjct: 528 GCGERNDEGRLILCDDCDISYHIYCTDPPLECVPQGTWKCKWCAQCQTCGANDP--GFNS 585

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
            W   YT C  C         C  C + Y   E   ++ C  C+RW+HC CD I
Sbjct: 586 NWQKNYTQCGPCS----SHTACAACNESY--GEGDLIIQCVQCERWLHCMCDAI 633


>gi|327288610|ref|XP_003229019.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Anolis carolinensis]
          Length = 3817

 Score =  105 bits (263), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 60/196 (30%), Positives = 99/196 (50%), Gaps = 10/196 (5%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            +CF+  + G       + C+ C + +H+ CL++  +  +     +W C  C+ C +C R 
Sbjct: 1336 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEDGERPLE-DQLENWCCRRCKFCHVCGRQ 1391

Query: 99   GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVR 154
                K +  C +C  +YH  C  P +    +     ++C K  +C SCG+  PG G   +
Sbjct: 1392 HQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGATTPGKGWDAQ 1451

Query: 155  WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY- 212
            W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y 
Sbjct: 1452 WSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYE 1511

Query: 213  LQFQVDGNLQYRCPTC 228
            L   +  ++ Y C  C
Sbjct: 1512 LLSNLPESVAYTCINC 1527


>gi|432114496|gb|ELK36344.1| Histone-lysine N-methyltransferase MLL2 [Myotis davidii]
          Length = 3462

 Score =  105 bits (263), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 1355 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1412

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1413 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1472

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1473 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1524

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1525 FTEDDVEQAADEG--FDCVSCQ 1544



 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 261 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTAR---KRAGWQCPECKVCQACR 312

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C SCG+       +  WF
Sbjct: 313 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRSCGAGSAELNPNSEWF 372

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 373 ENYSLCYRC 381


>gi|410046801|ref|XP_003313790.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Pan
            troglodytes]
          Length = 2476

 Score =  105 bits (263), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)

Query: 31   SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
            SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 965  SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1022

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 1023 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1082

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
              G    W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ +
Sbjct: 1083 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1134

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              E  ++   D    + C +C+
Sbjct: 1135 FTEDDVEQAADEG--FDCVSCQ 1154


>gi|440792783|gb|ELR13991.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
           Neff]
          Length = 506

 Score =  105 bits (263), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 92/197 (46%), Gaps = 16/197 (8%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           CR+C   + E  E A  ++ C  C  +YH +CL+   +N        W+C  C+ CE C+
Sbjct: 199 CRMCL--KEESAEGA--LIRCTECKDQYHPDCLELKKENIPKMMSFGWRCMHCKKCETCK 254

Query: 97  RTGDPNKFMFCRRC-DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
            TGD  K    R   D  +H +C  PP K    G + C +  +C SCG    G   S RW
Sbjct: 255 DTGDEEKARAARAFHDMGFHTFCLSPPLKRPPIGGWFCRECVECKSCGGKTAGKAKSCRW 314

Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVC--CDVCQRWVHCQCDGISDEKY 212
             GYT C+ C + +    YCPVC  VY+D ++  P +   C  C+  VH  CDG      
Sbjct: 315 HRGYTMCEMCYKRYKHNKYCPVCTLVYQDRDARNPALLRSCVSCRHCVHAGCDG------ 368

Query: 213 LQFQVDGNLQYRCPTCR 229
             F       Y+CP CR
Sbjct: 369 -NF-AGVTSPYQCPPCR 383


>gi|410911878|ref|XP_003969417.1| PREDICTED: uncharacterized protein LOC101064190 [Takifugu rubripes]
          Length = 2720

 Score =  105 bits (262), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 60/181 (33%), Positives = 85/181 (46%), Gaps = 8/181 (4%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCD 111
            M+ C+ C + +H  CL    + R L  +  +W C  C+ C +C RR+ +    + CRRC 
Sbjct: 996  MIFCQICCEPFHSFCLS--PEERPLKDNKENWCCRRCKFCHVCGRRSKNTKPVLQCRRCQ 1053

Query: 112  AAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
             +YH  C  P  P       P++C    +C SCG   PG    + W      C  C  L 
Sbjct: 1054 TSYHPACLGPTYPKPMNCKIPWVCMTCIRCKSCGV-TPGKSWDLAWNHDEDLCPDCTLLH 1112

Query: 170  VKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
             KGN+C +C K Y D+ + T M+ C  C  W+H  C+GISDE Y          + C  C
Sbjct: 1113 NKGNFCTICHKCYDDNMQHTEMIQCSACNHWIHYSCEGISDELYGLVSNQREDSFTCQPC 1172

Query: 229  R 229
            R
Sbjct: 1173 R 1173


>gi|428181743|gb|EKX50606.1| hypothetical protein GUITHDRAFT_60438, partial [Guillardia theta
           CCMP2712]
          Length = 149

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 85/155 (54%), Gaps = 10/155 (6%)

Query: 54  MLSCKSCGKKYHRNC----LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
            L C+ CG  +H+ C    LK   + R++     W+CP+CRICE+C+   + ++ + C  
Sbjct: 1   FLFCRDCGDSFHKYCFDLTLKIPPEKRNM-----WRCPACRICEVCKGEENWDEMLCCDE 55

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           CD  +H YC  PP K + +  + C +  +C SCGS  PG   S RW   YT C +C   +
Sbjct: 56  CDRGFHIYCLRPPLKQIPAEGWRCSECVRCLSCGSKTPGPKGSDRWRKDYTLCSSCWVEY 115

Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
            K NYCP+C KV   S+   MV CD CQ WVH  C
Sbjct: 116 EKKNYCPIC-KVVTSSKDIKMVNCDSCQMWVHVTC 149


>gi|256081467|ref|XP_002576991.1| myst-related protein [Schistosoma mansoni]
 gi|353229451|emb|CCD75622.1| myst-related protein [Schistosoma mansoni]
          Length = 914

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 83/176 (47%), Gaps = 11/176 (6%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L+C  CG+ YH  C +     R +     W+C  C +CE C  T + +  + C  C+ +
Sbjct: 624 LLACSQCGQCYHSFCAEVPKITRTMIE-KGWRCLDCTVCEGCGGTSNESLLLLCDDCNIS 682

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
           +H YC  PP K V  G + C     C +CG   P  GL+ +W   Y+ C  C  L     
Sbjct: 683 FHTYCLDPPLKEVPKGGWKCTDCVICTNCGQKDP--GLNGKWHANYSVCAPCASLTT--- 737

Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            CP+C   YR+ E   +V C +C RW H  CD +  E  L+   D  L Y C  CR
Sbjct: 738 -CPICNLAYREEEL--LVRCALCTRWAHANCDQLRTEDELEIATD--LGYNCLLCR 788



 Score = 88.6 bits (218), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 101/243 (41%), Gaps = 44/243 (18%)

Query: 26  EEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           E    + T++ C LC   E+ G      +L C  CG  YH +CL+   Q         W+
Sbjct: 142 EALASNCTSIQCGLC---ESPG--DLTELLFCTGCGSHYHASCLEPPLQPSPTIRIG-WQ 195

Query: 86  CPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-- 143
           C  C+ C IC  + D NK + C  CD  YH YC  PP  ++    + C +   C  CG  
Sbjct: 196 CAECKTCLICNESKDENKMLVCDVCDKGYHTYCLKPPVSSIPKNGFRCERCRVCSDCGGG 255

Query: 144 --SNVPG--------NGL--SVRWFLGYTCCDACGRLFVKGN-YCPVCLKV--------- 181
             S + G        N L  +VRW   YT CD C     + N  CPVC +          
Sbjct: 256 RSSTLSGLEGPVAFNNQLNPNVRWHSNYTLCDRCFHAHKRPNSCCPVCERAWRCSLPVPE 315

Query: 182 --YRDSESTPMVC----CDVCQRWVHCQCDGISDEKYL------QFQVDG--NLQYRCPT 227
             YR+  ST +V     C  C+R VH +CD  S++  +        +  G     Y CP 
Sbjct: 316 NFYRNPNSTFLVWPGSRCSQCRRMVHAECDPSSNQATVSPLSAASEETSGICGTNYVCPV 375

Query: 228 CRG 230
           CR 
Sbjct: 376 CRA 378


>gi|149041498|gb|EDL95339.1| myeloid/lymphoid or mixed-lineage leukemia (mapped) [Rattus
            norvegicus]
          Length = 3725

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 98/200 (49%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1190 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1244

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1245 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1304

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+G+S  + 
Sbjct: 1305 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSGTED 1364

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1365 EMYEILSNLPESVAYTCVNC 1384


>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
          Length = 4785

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 87/174 (50%), Gaps = 15/174 (8%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 348 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 400

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCR 229
            C K Y       M+ C++C+RW+H +CD  +D++   QF+     +Y C  C+
Sbjct: 401 FCGKCYNPELQKDMLHCNMCKRWIHLECDKPTDQELDSQFRE----EYICTYCK 450



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 28/50 (56%)

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           W+C  C +CE C +  DP + + C  CD +YH YC  PP + V  G + C
Sbjct: 822 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKC 871


>gi|297682047|ref|XP_002818745.1| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Pongo
           abelii]
          Length = 1215

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/177 (31%), Positives = 83/177 (46%), Gaps = 12/177 (6%)

Query: 53  RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           R+L+C  C + YH  C+    +   +     W+C  C +CE C +  DP + + C  CD 
Sbjct: 635 RLLACSQCCQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDI 692

Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
           +YH YC  PP + V  G + C     C  CG+     GL   W   YT C  C  L    
Sbjct: 693 SYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL---- 746

Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           + CPVC + YR  E   ++ C  C RW+H  C  ++ E+ ++   D  + + C  CR
Sbjct: 747 SSCPVCYRNYR--EEDLILQCRQCDRWMHAVCQNLNTEEEVENVAD--IGFDCSMCR 799



 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 21  CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 77

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD+C +   + N CP
Sbjct: 78  FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 130

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 131 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTD-----HELDTQLKEEYICMYCK 180


>gi|383848022|ref|XP_003699651.1| PREDICTED: uncharacterized protein LOC100881339 [Megachile
           rotundata]
          Length = 4805

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 17/177 (9%)

Query: 33  TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C  +G + EGC     +++C  CG+ YH  C     +   +     W+C  C 
Sbjct: 472 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 524

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C    D  + + C  CD +YH YC  PP   V  G + C     C +CGSN P  G
Sbjct: 525 VCEGCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 582

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
            +  W   YT C  C         C  C + Y + +   ++ C  C+RW+HC CD I
Sbjct: 583 FNSSWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSI 633


>gi|347968475|ref|XP_563394.4| AGAP002741-PA [Anopheles gambiae str. PEST]
 gi|333467986|gb|EAL40845.4| AGAP002741-PA [Anopheles gambiae str. PEST]
          Length = 4925

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 72/244 (29%), Positives = 106/244 (43%), Gaps = 53/244 (21%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKN--------------------------- 71
            LCF+  + G E    ML C  C + YH+ C+K+                           
Sbjct: 1713 LCFLCGSAGLES---MLFCVCCCEPYHQYCVKDEYNLRTGTGTGLDDTGNMSLLDVTLGA 1769

Query: 72   ---WAQNRDLFHWSSWKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHK-NV 126
                 Q + L    +W CP C +C  C   TG   K   C++C   YH  C     + + 
Sbjct: 1770 SPQQQQEQLLIARYNWMCPRCTVCFSCNMATGAKVK---CQKCAKHYHTTCLGTSKRLHG 1826

Query: 127  SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 185
            +  P +C    +C SCG+       +V  F+G    C  C RL  KGNYCP+C K Y D+
Sbjct: 1827 ADRPLICAACLRCKSCGTT------NVTKFIGNLPMCTPCFRLRQKGNYCPLCQKCYEDN 1880

Query: 186  E-STPMVCCDVCQRWVHCQCDGISDEKYLQFQV-DGNLQYRCPTCRGECYQVRDLEDAVR 243
            +    M+ C  C+RWVH +C+G++DE+Y    V   N+++ C  C       +   D+  
Sbjct: 1881 DFDLKMMECGDCRRWVHARCEGLTDEQYNMLSVLPENIEFVCKKC------AKHSSDSTA 1934

Query: 244  ELWR 247
             LWR
Sbjct: 1935 HLWR 1938


>gi|688443|gb|AAA62593.1| All-1 protein, partial [Mus musculus]
          Length = 3866

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/204 (30%), Positives = 98/204 (48%), Gaps = 18/204 (8%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
            V+C LC   E+         + C+ C + +H+ CL+     R L     +W C  C+ C 
Sbjct: 1331 VVCFLCSSSEHV------EFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCH 1382

Query: 94   ICRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGN 149
            +C R     K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG 
Sbjct: 1383 VCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGK 1442

Query: 150  GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
            G   +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S
Sbjct: 1443 GWDAQWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLS 1502

Query: 209  DEKYLQFQVDGNL----QYRCPTC 228
              +   +++  NL     Y C  C
Sbjct: 1503 GTEDEMYEILSNLPESVAYTCVNC 1526


>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3, partial [Pan paniscus]
          Length = 4810

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 275 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 331

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 332 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 384

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 385 FCGKYYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 434



 Score = 99.0 bits (245), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 71/146 (48%), Gaps = 10/146 (6%)

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
           W+C  C +CE C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG
Sbjct: 849 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLXPPLQTVPKGGWKCKWCVWCRHCG 908

Query: 144 SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 203
           +     GL   W   YT C  C  L    + CPVC + YR+ +   ++ C  C RW+H  
Sbjct: 909 AT--SAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAV 960

Query: 204 CDGISDEKYLQFQVDGNLQYRCPTCR 229
           C  ++ E+ ++   D  + + C  CR
Sbjct: 961 CQNLNTEEEVENVAD--IGFDCSMCR 984


>gi|627837|pir||A48205 All-1 protein +GTE form - mouse (fragment)
          Length = 3869

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/204 (30%), Positives = 98/204 (48%), Gaps = 18/204 (8%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
            V+C LC   E+         + C+ C + +H+ CL+     R L     +W C  C+ C 
Sbjct: 1334 VVCFLCSSSEHV------EFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCH 1385

Query: 94   ICRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGN 149
            +C R     K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG 
Sbjct: 1386 VCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGK 1445

Query: 150  GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
            G   +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S
Sbjct: 1446 GWDAQWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLS 1505

Query: 209  DEKYLQFQVDGNL----QYRCPTC 228
              +   +++  NL     Y C  C
Sbjct: 1506 GTEDEMYEILSNLPESVAYTCVNC 1529


>gi|354496911|ref|XP_003510567.1| PREDICTED: histone-lysine N-methyltransferase MLL [Cricetulus
            griseus]
          Length = 3907

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 97/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1370 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1424

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1425 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1484

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ + DE Y
Sbjct: 1485 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLLDEMY 1544

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1545 EILSNLPESVAYTCVNC 1561


>gi|449672214|ref|XP_002156610.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Hydra
           magnipapillata]
          Length = 686

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 59/187 (31%), Positives = 89/187 (47%), Gaps = 13/187 (6%)

Query: 48  CERARRM---LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 104
           C+++  M   L C SCG+ +H  C+        +     W+C  C++C+ C++ GD  K 
Sbjct: 246 CQKSDNMQSQLFCTSCGRHFHSYCVDMNIPITPVVRMG-WQCSFCKVCQGCKQPGDEEKM 304

Query: 105 MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDA 164
           + C +CD  YH YC +PP   V    + C    KC  CGS+ PG+G S RW   ++ CD 
Sbjct: 305 LCCDQCDKGYHIYCLNPPISVVPKSVWKCVSCRKCSDCGSSKPGSGPSCRWHNNFSLCDR 364

Query: 165 CGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYR 224
           C +   KG  CP+C +  R   +   + C  C + VH +C            +D   +Y 
Sbjct: 365 CYQQRKKGQSCPICKRAVRLFNNGDAIQCKKCFKCVHGECHS---------PLDDGAEYI 415

Query: 225 CPTCRGE 231
           CP C  E
Sbjct: 416 CPDCIEE 422


>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
           gorilla]
          Length = 4782

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 264 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 320

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 321 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 373

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D      ++D  L  +Y C  C+
Sbjct: 374 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 423



 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 49/154 (31%), Positives = 72/154 (46%), Gaps = 12/154 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLK-----NWAQNRDLFH-WSSWKCPSCRIC 92
            +C V  + G     R+L+C  CG+ YH  C+       + + R+    +  W+C  C +C
Sbjct: 864  MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKGNTGFCEFRNQNERFKGWRCLECTVC 923

Query: 93   EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
            E C +  DP + + C  CD +YH YC  PP + V  G + C     C  CG+     GL 
Sbjct: 924  EACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLR 981

Query: 153  VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
              W   YT C  C  L    + CPVC + YR+ +
Sbjct: 982  CEWQNNYTQCAPCASL----SSCPVCYRNYREED 1011


>gi|344249614|gb|EGW05718.1| Histone-lysine N-methyltransferase HRX [Cricetulus griseus]
          Length = 3512

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 97/197 (49%), Gaps = 12/197 (6%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1164 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1218

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1219 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1278

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ + DE Y
Sbjct: 1279 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLLDEMY 1338

Query: 213  -LQFQVDGNLQYRCPTC 228
             +   +  ++ Y C  C
Sbjct: 1339 EILSNLPESVAYTCVNC 1355


>gi|322792929|gb|EFZ16759.1| hypothetical protein SINV_09310 [Solenopsis invicta]
          Length = 549

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 72/222 (32%), Positives = 102/222 (45%), Gaps = 37/222 (16%)

Query: 54  MLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF------- 104
           ++ C  CG+ YH +C  L      R     + W+C SCR+C++CR+  D +K        
Sbjct: 223 LVMCSVCGQHYHGSCVGLALLPGVR-----AGWQCVSCRVCQVCRQPEDVSKINVLQNIK 277

Query: 105 ----------MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
                     M C RCD AYH  C  P   ++    + C     C  CGS  PG GLS R
Sbjct: 278 NIHSKSYTHVMLCERCDKAYHPGCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLSSR 337

Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQ 214
           W   YT CD+C +   KG  CP+C K YR +    MV C  C+++VH  CD  +D    Q
Sbjct: 338 WHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCHGCKKFVHGTCDPEADPLTYQ 397

Query: 215 FQVDG--NLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADK 254
            + D   + +Y C  C+           ++  + RRKD  D+
Sbjct: 398 QRKDAKPDYEYVCLHCK-----------SIAMVARRKDSIDE 428


>gi|357617693|gb|EHJ70933.1| hypothetical protein KGM_14791 [Danaus plexippus]
          Length = 4460

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 98/206 (47%), Gaps = 17/206 (8%)

Query: 32  NTNVMCRLC-FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
             ++ CR C  +G+         +++C +CG  YH  C+   AQ   +   + W C SCR
Sbjct: 371 TADIDCRTCRTIGD------MANLMTCVTCGGHYHGTCV-GLAQLPGV--RAGWSCRSCR 421

Query: 91  ICEICRRTGDPNKF-----MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN 145
           +C++CR             + C  CD  YH  C  P    V    + C     C  CG+ 
Sbjct: 422 VCQVCRGEAGGGAGGEARAVACEHCDKLYHAACLRPVMATVPKYGWKCKCCRVCSDCGAR 481

Query: 146 VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCD 205
            PG G S RW   YT CD+C +   KG+ CP+C + YR +    M+ C  C+R+VH  CD
Sbjct: 482 SPGAGPSSRWHAHYTVCDSCYQQRNKGSCCPLCRRAYRAAAYRDMIRCSACRRYVHGMCD 541

Query: 206 GISD-EKYLQFQ-VDGNLQYRCPTCR 229
             ++ + Y Q +  + + +Y CP C+
Sbjct: 542 PEAEPQNYKQKKGENSSYEYTCPICK 567



 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 89/200 (44%), Gaps = 22/200 (11%)

Query: 33  TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           T  +C +C  VG ++EGC     +++C  CG+ YH  C+    +   +     W+C  C 
Sbjct: 713 TQDLCVMCGAVGTDSEGC-----LIACSQCGQTYHPYCVN--IKVSQVIVSLGWRCLDCT 765

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C   GD    + C  CD A+H YC  P    V  G + C +  +C  CG+      
Sbjct: 766 VCEGCGSRGDEPLLVLCDDCDTAWHTYCARPALAEVPRGAWRCGRCRRCLVCGTRD---- 821

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
            +  W   YT C  C  L +    C VC + Y D E   ++ C  C RW+H  CD I  E
Sbjct: 822 -TALWCDNYTECAPCASLVM----CCVCSEPYSDGEL--IIQCTACSRWLHAACDSIRSE 874

Query: 211 KYLQFQVDGNLQYRCPTCRG 230
              +        Y+C  CRG
Sbjct: 875 ADAETCCRAG--YKCTWCRG 892


>gi|328716144|ref|XP_001947369.2| PREDICTED: hypothetical protein LOC100162709 isoform 1 [Acyrthosiphon
            pisum]
          Length = 1495

 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/182 (29%), Positives = 79/182 (43%), Gaps = 17/182 (9%)

Query: 37   CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C+LC    +         ++ C  C   YH  CL    +        +W+C  C+ C  C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
            +   D +K +FC  CD  YH YC     + V  G + C +   C SCG + PG G S +W
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPGPGDS-KW 1398

Query: 156  FLGY-------------TCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHC 202
            F  +             T C  C R + KG++CP C++ Y       M  C+ C R++H 
Sbjct: 1399 FYEFKKTEKTGSKVYCRTLCAPCSRSWKKGHFCPNCMRCYPIKNVERMTQCNSCDRYLHS 1458

Query: 203  QC 204
            +C
Sbjct: 1459 EC 1460


>gi|14626491|gb|AAK70213.1| MLL3-like protein [Mus musculus]
          Length = 420

 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 84/177 (47%), Gaps = 12/177 (6%)

Query: 53  RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           R+L+C  CG+ YH  C+    +   +     W+C  C +CE C +  DP + + C  CD 
Sbjct: 10  RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDI 67

Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
           +YH YC  PP + V  G + C     C  CG+     GL   W   YT C  C  L    
Sbjct: 68  SYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL---- 121

Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           + CPVC + YR  E   ++ C  C RW+H  C  ++ E+ ++   D  + + C  CR
Sbjct: 122 SSCPVCCRNYR--EEDLILQCRQCDRWMHAVCQNLNTEEEVENVAD--IGFDCSMCR 174


>gi|348526824|ref|XP_003450919.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oreochromis
            niloticus]
          Length = 4517

 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 90/203 (44%), Gaps = 20/203 (9%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
            V+C LC    N         + C+ C + +H  CL      R L   + +W C  CR C+
Sbjct: 1645 VLCFLCASSGNV------EFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRYCQ 1696

Query: 94   ICRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSNV 146
             C R     K +  C +C  +YH  C  P   N  + P      ++C K  +C SCG+  
Sbjct: 1697 ACGRQHQKTKQLLECDKCHNSYHPECLGP---NYPTRPTKKKRIWVCTKCVRCKSCGTTK 1753

Query: 147  PGNGLSVRWFLGYTCCDACGRLFVKGNYCPV-CLKVYRDSESTPMVCCDVCQRWVHCQCD 205
            PG     +W   ++ C  C +LF KGN+CP+       D   + M+ C  C  WVH +C+
Sbjct: 1754 PGKSWDAQWSHDFSMCHDCAKLFAKGNFCPLCDKCYDDDDYDSKMMLCGRCNHWVHAKCE 1813

Query: 206  GISDEKYLQFQVDGNLQYRCPTC 228
             ++DE Y       ++ Y C  C
Sbjct: 1814 NLTDEMYELLSKPESVAYTCTKC 1836


>gi|339244153|ref|XP_003378002.1| putative PHD finger protein [Trichinella spiralis]
 gi|316973126|gb|EFV56753.1| putative PHD finger protein [Trichinella spiralis]
          Length = 864

 Score =  103 bits (256), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/201 (30%), Positives = 94/201 (46%), Gaps = 17/201 (8%)

Query: 32  NTNVMCR-LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           ++ V CR +C V  + G  +   M++C  CG+ YH  C  N   N  + H   W+C  C 
Sbjct: 208 DSYVFCRDMCVVCGSFGRGQEGHMVACTQCGQCYHTYC-ANVTLNSVIVH-RGWRCLDCT 265

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE C    D    + C  CD +YH YC  PP  ++  G + C   + C  CG+  P NG
Sbjct: 266 VCEGCGTGDDEQHLLLCDECDVSYHMYCLDPPLDSIPQGAWRCKWCSTCQFCGAT-PPNG 324

Query: 151 L--SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 208
           +  S++       C  C  L+     C  C   Y++ +   ++ CD+C RW H  C+G+ 
Sbjct: 325 MLDSIK---NLRACFKCASLYS----CCFCHLQYKEEDM--IILCDICHRWSHANCNGLC 375

Query: 209 DEKYLQFQVDGNLQYRCPTCR 229
            E  L+  +D    + C  CR
Sbjct: 376 AEDILKKGLDAG--FICVYCR 394


>gi|395520196|ref|XP_003764223.1| PREDICTED: histone-lysine N-methyltransferase MLL [Sarcophilus
            harrisii]
          Length = 3995

 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1446 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1500

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1501 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1560

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1561 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1620

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1621 EMYEILSNLPESVAYTCVNC 1640


>gi|308199413|ref|NP_001184033.1| histone-lysine N-methyltransferase MLL isoform 1 precursor [Homo
            sapiens]
          Length = 3972

 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1607

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1608 EMYEILSNLPESVAYTCVNC 1627


>gi|119587788|gb|EAW67384.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_e [Homo sapiens]
          Length = 3972

 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1607

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1608 EMYEILSNLPESVAYTCVNC 1627


>gi|297269329|ref|XP_001093874.2| PREDICTED: histone-lysine N-methyltransferase MLL [Macaca mulatta]
          Length = 3986

 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1447 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1501

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1502 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1561

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1562 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1621

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1622 EMYEILSNLPESVAYTCVNC 1641


>gi|390469747|ref|XP_002754504.2| PREDICTED: histone-lysine N-methyltransferase MLL [Callithrix
            jacchus]
          Length = 3994

 Score =  102 bits (255), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1467 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1521

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1522 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1581

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1582 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1641

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1642 EMYEILSNLPESVAYTCVNC 1661


>gi|1490271|emb|CAA93625.1| ALL-1 protein [Homo sapiens]
          Length = 4005

 Score =  102 bits (255), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1466 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1520

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1521 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1580

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1581 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1640

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1641 EMYEILSNLPESVAYTCVNC 1660


>gi|395743560|ref|XP_002822597.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL [Pongo abelii]
          Length = 4012

 Score =  102 bits (255), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1473 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1527

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1528 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1587

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1588 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1647

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1648 EMYEILSNLPESVAYTCVNC 1667


>gi|341940997|sp|P55200.3|MLL1_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
            Full=ALL-1; AltName: Full=Zinc finger protein HRX;
            Contains: RecName: Full=MLL cleavage product N320;
            AltName: Full=N-terminal cleavage product of 320 kDa;
            Short=p320; Contains: RecName: Full=MLL cleavage product
            C180; AltName: Full=C-terminal cleavage product of 180
            kDa; Short=p180
          Length = 3966

 Score =  102 bits (255), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 1606

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1607 EMYEILSNLPESVAYTCVNC 1626


>gi|148693675|gb|EDL25622.1| mCG1547 [Mus musculus]
          Length = 3706

 Score =  102 bits (255), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
            +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 1172 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1226

Query: 98   TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                 K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 1227 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1286

Query: 154  RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
            +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 1287 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 1346

Query: 213  LQFQVDGNL----QYRCPTC 228
              +++  NL     Y C  C
Sbjct: 1347 EMYEILSNLPESVAYTCVNC 1366


>gi|395526071|ref|XP_003765195.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like
           [Sarcophilus harrisii]
          Length = 1005

 Score =  102 bits (253), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 54/164 (32%), Positives = 85/164 (51%), Gaps = 7/164 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
           ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G   K +  C RC  
Sbjct: 584 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRSTKHLLECERCRH 642

Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           AYH  C  P +   ++     ++C    +C SCG+  PG      W    + C +C +L+
Sbjct: 643 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGA-APGKNWDSEWSGDCSLCPSCTQLY 701

Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 212
            KGN+CP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y
Sbjct: 702 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEGY 745


>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
           (Silurana) tropicalis]
          Length = 5215

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/177 (31%), Positives = 81/177 (45%), Gaps = 16/177 (9%)

Query: 55  LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
           L C +CG+ YH  CL            + W+CP C++C+ C+ +GD N+ + C  CD  Y
Sbjct: 682 LFCTTCGQHYHGMCLDIAVTP---LKRAGWQCPDCKVCQNCKHSGDDNQMLVCDTCDKGY 738

Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
           H +C  P   +V +  + C     C  CG+       S  W L    CD C +  V    
Sbjct: 739 HTFCLQPVMDSVPTNGWKCKNCRICTECGTRT-----SSLWHLNCLLCDPCFQQQVSLP- 792

Query: 175 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
           CP+C K  +      M+ C VC+RW+H  C     EK  +  +D  L+  Y C  C+
Sbjct: 793 CPICDKPLQPELQKDMLHCHVCKRWIHLDC-----EKCTENDIDDQLKEDYACTLCK 844



 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 68/146 (46%), Gaps = 10/146 (6%)

Query: 84   WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
            W+C  C +CE C +  DP + + C  CD +YH +C  PP + V  G + C     C +C 
Sbjct: 1113 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTFCLDPPLQTVPKGGWKCKWCVSCTNCK 1172

Query: 144  SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 203
            +  P  GL   W   YT C  C  L    + CPVC + Y + E   ++ C  C RW H  
Sbjct: 1173 AITP--GLRCEWQNNYTQCAPCASL----SACPVCCQNYIEEEL--ILQCRQCIRWSHAS 1224

Query: 204  CDGISDEKYLQFQVDGNLQYRCPTCR 229
            C  ++ E  ++   D    + C  C+
Sbjct: 1225 CQNLNTEAEVELAADSG--FDCAACK 1248


>gi|553800|gb|AAA92511.1| trithorax, partial [Homo sapiens]
          Length = 1012

 Score =  101 bits (252), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
           +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 117 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 171

Query: 98  TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 172 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 231

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
           +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 232 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 291

Query: 213 -LQFQVDGNLQYRCPTC 228
            +   +  ++ Y C  C
Sbjct: 292 EILSNLPESVAYTCVNC 308


>gi|122937787|gb|ABM68621.1| AAEL000054-PA [Aedes aegypti]
          Length = 3489

 Score =  101 bits (252), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 117/262 (44%), Gaps = 48/262 (18%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLK---NWAQ------NRDLFHWSS------ 83
            LCF+  + G +    +L C  C + YH+ C+K   N  Q      N  L   +S      
Sbjct: 857  LCFLCGSSGLDE---LLFCVCCCEPYHQYCVKDEYNIRQVSLDDTNVSLLELTSTTMNAG 913

Query: 84   ------------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 129
                        W CP C +C  C   TG   K   C++C   YH  C     + + +  
Sbjct: 914  SSPQQQALNRFNWMCPRCTVCYTCNMATGSKVK---CQKCGKNYHTTCLGTSKRLLGADR 970

Query: 130  PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 187
            P +C    KC SC +       +V  F+G    C  C RL  KGN+CP+C + Y D++  
Sbjct: 971  PLICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEDNDFD 1024

Query: 188  TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLEDAVRE 244
              M+ C  C+RWVH +C+G++DE+Y +   +  N+++ C  C    EC  V    DAV  
Sbjct: 1025 LKMMECGDCKRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNECANV--WRDAVAA 1082

Query: 245  LWRRKDMADKDLIASLRAAAGL 266
             ++   ++   L++  R A  L
Sbjct: 1083 EFKAGLLSVVKLLSKSRQACAL 1104


>gi|403362853|gb|EJY81162.1| PHD zinc finger-containing protein [Oxytricha trifallax]
          Length = 473

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 82/183 (44%), Gaps = 12/183 (6%)

Query: 53  RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           + L C  C K YH  C +    N +L     W+C  C+ C+ C +  D +K + C  CD 
Sbjct: 273 KSLKCFRCLKMYHSTCHQP-PLNTELVKRFQWECSDCKTCKNCNQNNDEDKIIICDMCDK 331

Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV-----PGNGLSVRWFLG-YTCCDACG 166
           A H +C +PP   + S  + C     C SC   +        GL   W  G Y  C  C 
Sbjct: 332 AVHIHCLNPPLFQIPSHNWFCKDCVNCLSCDKELGPISQKSQGL---WCEGIYRMCKDCN 388

Query: 167 RLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCP 226
               +GN+C VC K Y    +   + CD CQ W+H  CDG   EK  + + D   +Y CP
Sbjct: 389 YQLQQGNFCKVCRKSYSQDSNEDFIQCDECQDWIHAACDGFDSEKLAKMKDDE--KYSCP 446

Query: 227 TCR 229
            C+
Sbjct: 447 ICK 449


>gi|380807935|gb|AFE75843.1| histone-lysine N-methyltransferase MLL4, partial [Macaca mulatta]
          Length = 314

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
           ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 104 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 162

Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +L+
Sbjct: 163 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 221

Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
            KGNYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 222 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 281

Query: 228 CRG 230
           C G
Sbjct: 282 CAG 284


>gi|74189196|dbj|BAC35712.2| unnamed protein product [Mus musculus]
          Length = 814

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C  CG+       S +W      CD C +   + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
            C K Y       M+ C++C+RWVH +CD  +D+     ++D  L+  Y C  C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517


>gi|348563138|ref|XP_003467365.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Cavia
            porcellus]
          Length = 2692

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/180 (31%), Positives = 86/180 (47%), Gaps = 24/180 (13%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
            ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 1215 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCLK 1273

Query: 113  AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
                               +C    +C SCG+  PG    V W   Y+ C  C +L+ KG
Sbjct: 1274 -------------------ICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLYEKG 1313

Query: 173  NYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRG 230
            NYCP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  C G
Sbjct: 1314 NYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGPCAG 1373


>gi|157103255|ref|XP_001647894.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
 gi|108884726|gb|EAT48951.1| AAEL000054-PA, partial [Aedes aegypti]
          Length = 3069

 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 117/262 (44%), Gaps = 48/262 (18%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLK---NWAQ------NRDLFHWSS------ 83
           LCF+  + G +    +L C  C + YH+ C+K   N  Q      N  L   +S      
Sbjct: 656 LCFLCGSSGLDE---LLFCVCCCEPYHQYCVKDEYNIRQVSLDDTNVSLLELTSTTMNAG 712

Query: 84  ------------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 129
                       W CP C +C  C   TG   K   C++C   YH  C     + + +  
Sbjct: 713 SSPQQQALNRFNWMCPRCTVCYTCNMATGSKVK---CQKCGKNYHTTCLGTSKRLLGADR 769

Query: 130 PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 187
           P +C    KC SC +       +V  F+G    C  C RL  KGN+CP+C + Y D++  
Sbjct: 770 PLICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEDNDFD 823

Query: 188 TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLEDAVRE 244
             M+ C  C+RWVH +C+G++DE+Y +   +  N+++ C  C    EC  V    DAV  
Sbjct: 824 LKMMECGDCKRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNECANV--WRDAVAA 881

Query: 245 LWRRKDMADKDLIASLRAAAGL 266
            ++   ++   L++  R A  L
Sbjct: 882 EFKAGLLSVVKLLSKSRQACAL 903


>gi|324499811|gb|ADY39929.1| Histone-lysine N-methyltransferase trr [Ascaris suum]
          Length = 2347

 Score =  100 bits (249), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 55/191 (28%), Positives = 87/191 (45%), Gaps = 11/191 (5%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C +  + G +    M+SC +C + YH  C+    +         W+C  C +CE C   
Sbjct: 376 VCLICGSIGNDIEGTMVSCATCAQSYHTFCVGLHDKLNSTVVKRGWRCLDCTVCEGCGDG 435

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D +  + C  CD +YH YC  PP + +  G + C     C  C + +P NG   +   G
Sbjct: 436 RDESNLLLCDECDISYHIYCLDPPLECIPHGSWRCKWCATCRRCSAQIP-NGTDTQRMEG 494

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
              C+ C  L      CP CL++Y   E   ++ C  C RW+H +C+ I  E+ L+   +
Sbjct: 495 L--CETCYSL----RKCPKCLRLYEIGEH--IIKCQHCSRWLHGKCEEICGEEMLEAAAE 546

Query: 219 GNLQYRCPTCR 229
               +RC  CR
Sbjct: 547 NG--FRCSLCR 555


>gi|156353194|ref|XP_001622959.1| predicted protein [Nematostella vectensis]
 gi|156209597|gb|EDO30859.1| predicted protein [Nematostella vectensis]
          Length = 634

 Score =  100 bits (249), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 78/163 (47%), Gaps = 12/163 (7%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M  CK C + +H  CL     + D     SW C SC  C +C   G  +K + C +C   
Sbjct: 1   MFFCKVCSEPFHGFCLDEEPIDED-----SWCCDSCSTCVVC---GQQDKLLMCDKCQRG 52

Query: 114 YHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 170
           YH  C  P +  V  G    ++C +  +C  CGS   G      W   +T C  CG  + 
Sbjct: 53  YHVDCLGPSYPVVPEGSEDTWICGRCAQCKLCGSKSAGEDPEAVWMHEFTHCYDCGTAWD 112

Query: 171 KGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 212
            GNYCP+C K Y D++  + M+ C+ CQ WVH  C  I+ ++Y
Sbjct: 113 NGNYCPICEKCYSDNDFDSKMMHCNDCQHWVHASCQNINPDEY 155


>gi|170058059|ref|XP_001864757.1| trithorax [Culex quinquefasciatus]
 gi|167877298|gb|EDS40681.1| trithorax [Culex quinquefasciatus]
          Length = 3165

 Score =  100 bits (248), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 72/259 (27%), Positives = 113/259 (43%), Gaps = 43/259 (16%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNW---------AQNRDLFHWSS------ 83
           LCF+  + G +    ML C  C + YH+ C+K+            N  L   +S      
Sbjct: 721 LCFLCGSSGLDE---MLFCVCCCEPYHQYCVKDEYNIRHASLDETNISLLELTSTTIVNS 777

Query: 84  -----------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG-P 130
                      W CP C +C  C   TG   K   C++C   YH  C     + + +  P
Sbjct: 778 SPAQQALNRFNWMCPRCTVCYTCNMATGTKVK---CQKCCKNYHTTCLGTSKRLLGADRP 834

Query: 131 YLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-ST 188
            +C    KC SC +       +V  F+G    C  C RL  KGN+CP+C + Y +++   
Sbjct: 835 MICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEENDFDL 888

Query: 189 PMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWR 247
            M+ C  CQRWVH +C+G++DE+Y +   +  N+++ C  C           DAV   ++
Sbjct: 889 KMMECGDCQRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNESANVWRDAVAAEFK 948

Query: 248 RKDMADKDLIASLRAAAGL 266
              ++   L++  R A  L
Sbjct: 949 AGLLSVVKLLSKSRQACAL 967


>gi|340710026|ref|XP_003393599.1| PREDICTED: hypothetical protein LOC100646252 [Bombus terrestris]
          Length = 3530

 Score = 99.8 bits (247), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 70/234 (29%), Positives = 105/234 (44%), Gaps = 33/234 (14%)

Query: 6    DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
            D+  +++   V      + G E  Q     +C LC      G      ++ C+ C + YH
Sbjct: 880  DFWEQYDPTEVGAKGFALIGSELFQ--IPAICYLC------GSAGKEPLIHCQCCCEPYH 931

Query: 66   RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPH 123
              CL+   W    +     +W CP C IC+ C     P   + C RC  ++H  C     
Sbjct: 932  AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLS--K 983

Query: 124  KNVSSG------PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
              VS+       PY+C    KC SCGS     G++V        C  C +L  +GNYCP+
Sbjct: 984  SGVSARLYSPERPYVCQNCVKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNYCPL 1038

Query: 178  CLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
            C + Y +++  T M+ C  C  WVH  C+GISDE+Y  L +  D  +++ C  C
Sbjct: 1039 CQRCYNENDFDTKMMECSECSYWVHAYCEGISDERYQILSYLPD-TIEFTCSQC 1091


>gi|312371947|gb|EFR20005.1| hypothetical protein AND_20789 [Anopheles darlingi]
          Length = 4717

 Score = 99.4 bits (246), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 75/263 (28%), Positives = 112/263 (42%), Gaps = 47/263 (17%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLK--------NWAQNRDLFHWS-------- 82
            LCF+  + G E    ML C  C + YH+ C+K        N     D  + S        
Sbjct: 1583 LCFLCGSAGLED---MLFCVCCCEPYHQYCVKDEYNLRAGNGGALDDTLNVSLLDVTLGA 1639

Query: 83   --------------SWKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHK-NV 126
                          +W CP C +C  C   TG   K   C++C   YH  C     + + 
Sbjct: 1640 SPQEQQQQLLLGRFNWMCPRCTVCFSCNMATGSKVK---CQKCAKYYHTTCLGTSKRLHG 1696

Query: 127  SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 185
            +  P +C    +C SC +       +V  F+G    C  C RL  KGNYCP+C K Y D+
Sbjct: 1697 ADRPLICADCLRCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNYCPLCQKCYEDN 1750

Query: 186  E-STPMVCCDVCQRWVHCQCDGISDEKYLQFQV-DGNLQYRCPTCRGECYQVRDLEDAVR 243
            +    M+ C  C+RWVH +C+G++DE+Y    V   N+++ C  C           DAV 
Sbjct: 1751 DFDLKMMECGDCRRWVHARCEGLTDEQYNMLSVLPENIEFICKKCGKHNETANMWRDAVA 1810

Query: 244  ELWRRKDMADKDLIASLRAAAGL 266
              ++   ++   L++  R A  L
Sbjct: 1811 AEFKAGLLSVVKLLSKSRQACAL 1833


>gi|350413847|ref|XP_003490133.1| PREDICTED: hypothetical protein LOC100748492 [Bombus impatiens]
          Length = 3522

 Score = 99.4 bits (246), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 105/237 (44%), Gaps = 39/237 (16%)

Query: 6    DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
            D+  +++   V      + G E  Q     +C LC      G      ++ C+ C + YH
Sbjct: 880  DFWEQYDPTEVGAKGFALIGSELFQ--IPAICYLC------GSAGREPLIHCQCCCEPYH 931

Query: 66   RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQ---- 119
              CL+   W    +     +W CP C IC+ C     P   + C RC  ++H  C     
Sbjct: 932  AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLSKSG 985

Query: 120  -----HPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
                 + P +     PY+C    KC SCGS     G++V        C  C +L  +GNY
Sbjct: 986  VSARLYSPER-----PYVCQNCVKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNY 1035

Query: 175  CPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
            CP+C + Y +++  T M+ C  C  WVH  C+GISDE+Y  L +  D  +++ C  C
Sbjct: 1036 CPLCQRCYNENDFDTKMMECSECSYWVHAYCEGISDERYQILSYLPD-TIEFTCSQC 1091


>gi|383861703|ref|XP_003706324.1| PREDICTED: uncharacterized protein LOC100882965 [Megachile rotundata]
          Length = 3434

 Score = 99.4 bits (246), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 68/234 (29%), Positives = 106/234 (45%), Gaps = 33/234 (14%)

Query: 6    DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
            D+  +++   V      + G E      + +C LC      G      ++ C+ C + YH
Sbjct: 877  DFWEQYDPAEVGAKGFALIGSELFH--ISAICYLC------GSAGKEPLIHCQCCCEPYH 928

Query: 66   RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPH 123
              CL+   W    +     +W CP C IC+ C     P   + C RC  ++H  C     
Sbjct: 929  AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLS--K 980

Query: 124  KNVSSG------PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
              VS+       PY+C    KC SCGS     G++V        C  C +L  +GNYCP+
Sbjct: 981  SGVSARLYSPERPYVCQSCVKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNYCPL 1035

Query: 178  CLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
            C + Y +++  T M+ C  C  WVH +C+G+SDE+Y  L +  D  +++ C  C
Sbjct: 1036 CQRCYNENDFDTKMMECSECSYWVHARCEGLSDERYQILSYLPDS-IEFTCSQC 1088


>gi|328778088|ref|XP_392252.4| PREDICTED: histone-lysine N-methyltransferase trithorax [Apis
           mellifera]
          Length = 3195

 Score = 99.0 bits (245), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 96/204 (47%), Gaps = 31/204 (15%)

Query: 36  MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 93
           +C LC      G      ++ C+ C + YH  CL+   W    +     +W CP C IC+
Sbjct: 780 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 829

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 147
            C     P   + C RC  ++H  C       VS+       PY+C    KC SCGS   
Sbjct: 830 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGVSARLYSPERPYVCQSCVKCKSCGSE-- 883

Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 206
             G++V        C  C +L  +GNYCP+C + Y +++  T M+ C  C  WVH QC+G
Sbjct: 884 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSYWVHAQCEG 940

Query: 207 ISDEKY--LQFQVDGNLQYRCPTC 228
           +SDE+Y  L +  D  +++ C  C
Sbjct: 941 LSDERYQILSYLPD-TIEFTCSQC 963


>gi|47221226|emb|CAG13162.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 3783

 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/202 (28%), Positives = 91/202 (45%), Gaps = 16/202 (7%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
            V+C LC    N         + C+ C + +H  CL      R L   + +W C  CR C+
Sbjct: 1448 VLCFLCASSGNV------EFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQ 1499

Query: 94   ICRRTGDPNK--FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPG 148
             C R     K   + C +C  +YH  C  P H    +     ++C    +C  CG+  PG
Sbjct: 1500 ACGRQHQKTKQQLLECDKCRNSYHPECLGPSHPTRPTKKKRVWVCNNCVRCKCCGATKPG 1559

Query: 149  NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGI 207
                 +W   ++ C  C +LF K N+C +C K Y D E+   M+ C  C   VH +C+ +
Sbjct: 1560 KSWDAQWSHDFSMCHDCAKLFAKRNFCHICTKCYEDDEADAKMIECGRCHHRVHAKCEKL 1619

Query: 208  SDEKY-LQFQVDGNLQYRCPTC 228
            +D+ Y L  ++  ++ Y C  C
Sbjct: 1620 TDDMYELLSKLPESVAYTCTKC 1641


>gi|431895734|gb|ELK05153.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
          Length = 921

 Score = 97.8 bits (242), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 79/156 (50%), Gaps = 10/156 (6%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 294 CTTCGQHYHGMCLDVAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 350

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
           +C  P  K+V +  + C     C +CG+       S +W      CD+C +   + N CP
Sbjct: 351 FCLQPVMKSVPTNGWKCKNCRICVACGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 403

Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
            C K Y       M+ C++C+RWVH +CD  +D + 
Sbjct: 404 FCGKSYHPELQKDMLHCNMCKRWVHLECDKPADHEL 439


>gi|121483956|gb|ABM54289.1| MLL [Pan paniscus]
          Length = 523

 Score = 97.8 bits (242), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 91/180 (50%), Gaps = 11/180 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
           +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 71  VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 125

Query: 98  TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                K  + C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 126 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 185

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
           +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y
Sbjct: 186 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 245


>gi|402592532|gb|EJW86460.1| hypothetical protein WUBG_02629, partial [Wuchereria bancrofti]
          Length = 2207

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 53/191 (27%), Positives = 88/191 (46%), Gaps = 14/191 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           LC +  + G +    M++C +C + YH  C+    +         W+C  C +CE C   
Sbjct: 268 LCLICGSIGKDAEGTMVTCVTCSQSYHTYCVGLHDKLNSTIVRRGWRCLDCTVCEGCGDG 327

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D +  + C  CD +YH YC  PP + +  GP+ C   + C  CG+ +  N    + F  
Sbjct: 328 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRCGNQI-FNVTDNQNF-- 384

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
              C+ C  L      CP CL++Y   ++  ++ C  C RW+H +C+ +  E+   F+  
Sbjct: 385 ---CETCFTL----RKCPKCLRLYEIGDN--IIKCQHCARWLHGKCEELYGEE--MFETA 433

Query: 219 GNLQYRCPTCR 229
               +RC  CR
Sbjct: 434 SENGFRCSLCR 444


>gi|410910074|ref|XP_003968515.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Takifugu rubripes]
          Length = 4478

 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/201 (29%), Positives = 90/201 (44%), Gaps = 15/201 (7%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
            V+C LC    N         + C+ C + +H  CL      R L   + +W C  CR C+
Sbjct: 1668 VLCFLCASSGNV------EFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQ 1719

Query: 94   ICRRTGDPNKFMF-CRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
             C R     K +  C +C  +YH  C    HP         ++C K  +C  CG+  PG 
Sbjct: 1720 ACGRQHQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCNKCVRCKCCGATKPGK 1779

Query: 150  GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
                +W   ++ C  C +LF K N C +C K Y D E+   MV C  C   VH +C+ ++
Sbjct: 1780 SWDAQWSHDFSMCHDCAKLFAKRNICVLCNKCYEDDEADGKMVECGRCHHRVHAKCEKLT 1839

Query: 209  DEKY-LQFQVDGNLQYRCPTC 228
            D+ Y L  ++  ++ Y C  C
Sbjct: 1840 DDMYELLSKLPESVAYTCTKC 1860


>gi|170581736|ref|XP_001895813.1| F/Y-rich N-terminus family protein [Brugia malayi]
 gi|158597106|gb|EDP35332.1| F/Y-rich N-terminus family protein [Brugia malayi]
          Length = 2144

 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/194 (27%), Positives = 86/194 (44%), Gaps = 20/194 (10%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           LC +  + G      M++C +C + YH  C+    +         W+C  C +CE C   
Sbjct: 166 LCLICGSIGKGAEGTMVTCVTCSQSYHTYCVGLHDKLNSTIVRRGWRCLDCTVCEGCGDG 225

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG---SNVPGNGLSVRW 155
            D +  + C  CD +YH YC  PP + +  GP+ C   + C  CG   SN+  N      
Sbjct: 226 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRCGNQISNITDN------ 279

Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 215
                 C+ C  L      CP CL++Y   ++  ++ C  C RW+H +C+ +  E+   F
Sbjct: 280 ---QNFCETCFTL----RKCPKCLRLYEIGDN--IIKCQHCARWLHGKCEELYGEE--MF 328

Query: 216 QVDGNLQYRCPTCR 229
           +      +RC  CR
Sbjct: 329 ETASENGFRCSLCR 342


>gi|3309543|gb|AAC41377.1| MLL [Takifugu rubripes]
          Length = 4498

 Score = 97.1 bits (240), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 90/202 (44%), Gaps = 16/202 (7%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
            V+C LC    N         + C+ C + +H  CL      R L   + +W C  CR C+
Sbjct: 1668 VLCFLCASSGNV------EFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQ 1719

Query: 94   ICRRTGDPNK--FMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
             C R     K   + C +C  +YH  C    HP         ++C K  +C  CG+  PG
Sbjct: 1720 ACGRQHQKTKQQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCNKCVRCKCCGATKPG 1779

Query: 149  NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGI 207
                 +W   ++ C  C +LF K N C +C K Y D E+   MV C  C   VH +C+ +
Sbjct: 1780 KSWDAQWSHDFSMCHDCAKLFAKRNICVLCNKCYEDDEADGKMVECGRCHHRVHAKCEKL 1839

Query: 208  SDEKY-LQFQVDGNLQYRCPTC 228
            +D+ Y L  ++  ++ Y C  C
Sbjct: 1840 TDDMYELLSKLPESVAYTCTKC 1861


>gi|341878859|gb|EGT34794.1| CBN-SET-16 protein [Caenorhabditis brenneri]
          Length = 2498

 Score = 96.7 bits (239), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 93/204 (45%), Gaps = 17/204 (8%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M++C +C + YH  C+    +         W+C  C ICE C + GD    M C  CD  
Sbjct: 447 MIACLNCAQTYHTYCVLLHEKINSAIMTHGWRCLDCTICEGCGKGGDDKNLMLCDECDVP 506

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV-PGNGLSVRWFLGYTCCDACGRLFVKG 172
           YH YC  PP + V +G + C   ++C  C   V  G+ L+ R       C  C  L    
Sbjct: 507 YHTYCLKPPIEKVPTGSWRCQWCSRCRRCNHKVSSGSELTARGL-----CHPCDSL---- 557

Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI-SDEKYLQFQVDGNLQYRCPTCRGE 231
            +C  C + Y+ ++   ++ C  C +W H +C+G+ +DE+  Q  ++   + RC  CR  
Sbjct: 558 QHCARCRQGYQLNDK--LIRCSQCNKWEHGRCEGLYTDEQLEQAALN---RMRCAACRPN 612

Query: 232 CYQVRDLEDAVRELWRRKDMADKD 255
             +   L D V  +W      DKD
Sbjct: 613 RIRNNGLSD-VDTVWCDSIALDKD 635


>gi|307180358|gb|EFN68384.1| Histone-lysine N-methyltransferase trithorax [Camponotus
           floridanus]
          Length = 3218

 Score = 96.7 bits (239), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 69/234 (29%), Positives = 106/234 (45%), Gaps = 33/234 (14%)

Query: 6   DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
           D+  +++   V      + G E    +   +C LC      G      ++ C+ C + YH
Sbjct: 751 DFWEQYDPSEVGAKGFALIGSELF--HIPAICYLC------GSAGKEPLIHCQCCCEPYH 802

Query: 66  RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPH 123
             CL+   W    +     +W CP C IC+ C     P   + C RC  ++H  C     
Sbjct: 803 AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLS--K 854

Query: 124 KNVSSG------PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
             VSS       PY+C    KC SCGS     G++V        C  C +L  +GNYCP+
Sbjct: 855 SGVSSRLYNPDRPYVCQSCIKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNYCPL 909

Query: 178 CLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
           C + Y +++  T M+ C  C  WVH +C+G+SDE+Y  L +  D  +++ C  C
Sbjct: 910 CQRCYNENDFDTKMMECSECSCWVHARCEGLSDERYQILSYLPDS-IEFTCSQC 962


>gi|392897209|ref|NP_499819.3| Protein SET-16 [Caenorhabditis elegans]
 gi|316891988|emb|CAB03348.3| Protein SET-16 [Caenorhabditis elegans]
          Length = 2475

 Score = 96.3 bits (238), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 65/235 (27%), Positives = 99/235 (42%), Gaps = 27/235 (11%)

Query: 33  TNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS 82
           T ++CR          +C V  + G      M++C +C + YH  C+    +        
Sbjct: 411 TAIVCRVNDEFLQKACMCLVCGSIGKGPEGSMVACSNCAQTYHTYCVTLHDKLNSAVVGR 470

Query: 83  SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 142
            W+C  C +CE C   GD    + C  CD +YH YC  P    +  GP+ C   ++C  C
Sbjct: 471 GWRCLDCTVCEGCGTGGDEANLLLCDECDVSYHIYCMKPLLDKIPQGPWRCQWCSRCRRC 530

Query: 143 GSNVP-GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVH 201
                 GN L+ +       C  C  L      CP C + Y+ +E   ++ C  C +W H
Sbjct: 531 NHKAASGNDLTSQGL-----CFPCASL----RKCPRCERNYQLNEK--LIRCSQCSKWQH 579

Query: 202 CQCDGI-SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 255
             C+G+ +DE+  Q  +D   + RC  CR +  Q     D V  +W      DKD
Sbjct: 580 GACEGLYTDEQLEQAAID---RMRCSACRPKRVQPSGFSD-VDTVWCDYVALDKD 630


>gi|312071355|ref|XP_003138570.1| F/Y-rich family protein [Loa loa]
          Length = 1597

 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 14/199 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           LC +  + G +    M++C +C + YH  C+    +         W+C  C +CE C   
Sbjct: 273 LCLICGSIGKDVEGTMVTCVTCSQSYHTYCVGLHDKLNSTLIKRGWRCLDCTVCEGCGDG 332

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D +  + C  CD +YH YC  PP + +  GP+ C   + C  C SN   N    + F  
Sbjct: 333 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRC-SNQISNIADNQHF-- 389

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
              C+ C  L      CP CL+ Y   +S  ++ C  C RW+H +C+ +  ++   F+  
Sbjct: 390 ---CETCFTL----RRCPKCLRFYEIGDS--IIKCQHCARWLHGKCEELYGDE--MFETA 438

Query: 219 GNLQYRCPTCRGECYQVRD 237
               +RC  CR +   V D
Sbjct: 439 SENGFRCSLCRPQGNVVGD 457


>gi|393908177|gb|EJD74941.1| F/Y-rich family protein [Loa loa]
          Length = 2288

 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 86/191 (45%), Gaps = 14/191 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           LC +  + G +    M++C +C + YH  C+    +         W+C  C +CE C   
Sbjct: 296 LCLICGSIGKDVEGTMVTCVTCSQSYHTYCVGLHDKLNSTLIKRGWRCLDCTVCEGCGDG 355

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D +  + C  CD +YH YC  PP + +  GP+ C   + C  C SN   N    + F  
Sbjct: 356 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRC-SNQISNIADNQHF-- 412

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
              C+ C  L      CP CL+ Y   +S  ++ C  C RW+H +C+ +  ++   F+  
Sbjct: 413 ---CETCFTL----RRCPKCLRFYEIGDS--IIKCQHCARWLHGKCEELYGDE--MFETA 461

Query: 219 GNLQYRCPTCR 229
               +RC  CR
Sbjct: 462 SENGFRCSLCR 472


>gi|432880997|ref|XP_004073754.1| PREDICTED: uncharacterized protein LOC101157226 [Oryzias latipes]
          Length = 2812

 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/182 (31%), Positives = 85/182 (46%), Gaps = 7/182 (3%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCDA 112
            M+ C+ C + +H  CL      ++     +W C  C+ C +C RR+      + CRRC  
Sbjct: 1050 MIFCQICCEPFHSFCLTPEECPQE-DSKENWCCRRCKFCHVCGRRSKSAKPVLQCRRCQT 1108

Query: 113  AYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 170
             YH  C  P  P     S P++C    +C SCG   PG    + W      C  C  L  
Sbjct: 1109 CYHPSCLGPTYPKPVNCSLPWVCMTCIRCKSCGVT-PGKTWDLAWNHEQDLCPECTILNK 1167

Query: 171  KGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEK-YLQFQVDGNLQYRCPTC 228
            KG++C VC K Y D S    M+ C  C+ W+H +C+G+S+E   L   +  +  + C  C
Sbjct: 1168 KGHFCTVCQKCYEDGSRPLQMIQCSECRHWIHPKCEGLSEELCGLMSTLPDSGGFTCTPC 1227

Query: 229  RG 230
            RG
Sbjct: 1228 RG 1229


>gi|332025910|gb|EGI66066.1| Histone-lysine N-methyltransferase trithorax [Acromyrmex echinatior]
          Length = 3452

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/216 (30%), Positives = 97/216 (44%), Gaps = 30/216 (13%)

Query: 6    DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
            D+  +++   V      + G E        +C LC      G      ++ C+ C + YH
Sbjct: 875  DFWEQYDPSEVGAKGFALIGSELFH--IPAICYLC------GSAGKEPLIHCQCCCEPYH 926

Query: 66   RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPH 123
              CL+   W    +     +W CP C IC+ C     P   + C RC  ++H  C     
Sbjct: 927  AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLS--K 978

Query: 124  KNVSSG------PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
              VSS       PY+C    KC SCGS     G++V        C  C +L  +GNYCP+
Sbjct: 979  SGVSSRLYSPDRPYVCQSCIKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNYCPL 1033

Query: 178  CLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 212
            C + Y +++  T M+ C  C  WVH +C+G+SDE+Y
Sbjct: 1034 CQRCYNENDFDTKMMECSECSCWVHARCEGLSDERY 1069


>gi|348533938|ref|XP_003454461.1| PREDICTED: hypothetical protein LOC100700132 [Oreochromis niloticus]
          Length = 2924

 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/142 (32%), Positives = 69/142 (48%), Gaps = 5/142 (3%)

Query: 91   ICEICRRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPG 148
            +C +C   G     + C RC   YH  C  P  P +N     ++C   T+C SCG   PG
Sbjct: 1411 VCFLCASKGQHEPLLECERCQNCYHASCLGPNYPKQNKKRKAWVCMTCTRCKSCGV-TPG 1469

Query: 149  NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGI 207
                  W      C  C +L+ +GNYCP+C K Y D++  + M+ C  C  WVH +C+ +
Sbjct: 1470 KSWDTEWNHDKGLCPDCSKLYDQGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDL 1529

Query: 208  SDEKY-LQFQVDGNLQYRCPTC 228
            +DE Y +   +  ++ Y C  C
Sbjct: 1530 TDELYEILSSLPESVVYSCRPC 1551


>gi|355732606|gb|AES10757.1| histone-lysine N-methyltransferase MLL4-like protein [Mustela
           putorius furo]
          Length = 212

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
           ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 11  LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 69

Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C +LF
Sbjct: 70  AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 128

Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
            KGN+CP+C + Y D++  + M+ C  C  WVH +C+G+SDE Y +   +  ++ Y C  
Sbjct: 129 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 188

Query: 228 CRG 230
           C G
Sbjct: 189 CAG 191


>gi|60360484|dbj|BAD90486.1| mKIAA4050 protein [Mus musculus]
          Length = 762

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
           +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 26  VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 80

Query: 98  TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                K  + C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 81  QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 140

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
           +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S  + 
Sbjct: 141 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 200

Query: 213 LQFQVDGNL----QYRCPTC 228
             +++  NL     Y C  C
Sbjct: 201 EMYEILSNLPESVAYTCVNC 220


>gi|326430870|gb|EGD76440.1| mixed-lineage leukemia protein [Salpingoeca sp. ATCC 50818]
          Length = 2027

 Score = 93.6 bits (231), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 68/264 (25%), Positives = 103/264 (39%), Gaps = 24/264 (9%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHW----SSWKCPSCRICEICRRTGDPNKFMFCRR 109
            + C+ C + YHR C  +     D   W      W C  C+ C  C +    +  + C +
Sbjct: 641 FVHCRVCCQPYHRFCTGDSRLQSDE-RWRDAAEQWMCIDCQTCCACGQAEPRSTLLTCAK 699

Query: 110 CDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
           C    H +C     P        Y+C     C  CGS  PG      W   +  C+ C  
Sbjct: 700 CTRHIHSHCAGDGVPQGLKRDDVYMCSACVVCTKCGSTSPGEFKGSTWHCQFELCEECHG 759

Query: 168 LFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQC--DGISDEKYLQFQVDG--NLQ 222
             +KGN CP+C K+Y D +  TPM CC+ C+RW+H  C          L + + G   + 
Sbjct: 760 QTLKGNICPMCDKLYSDDDFDTPMFCCEKCERWLHATCVDPAFKTNMELYYLISGVQTID 819

Query: 223 YRCPTC----RGECYQVRDLEDAVRELWRRK------DMADKDLIASLRAAAGLPTEDEI 272
           Y C  C    +G       L  A +  ++ +      DM  ++   +L A +  P  D  
Sbjct: 820 YHCQDCVRLSKGVLPDFAALWKATQAAFQERLVGLVEDMKQQEGYTALAAVSPAPVRDHT 879

Query: 273 F--SISPYSDDEENGPVVLKNEFG 294
              + +  SDD    P    +E G
Sbjct: 880 SDDTRTRASDDTRTRPSTGTDEDG 903


>gi|195064789|ref|XP_001996640.1| GH19675 [Drosophila grimshawi]
 gi|193892772|gb|EDV91638.1| GH19675 [Drosophila grimshawi]
          Length = 3837

 Score = 93.6 bits (231), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 66/245 (26%), Positives = 105/245 (42%), Gaps = 48/245 (19%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNC----------------------------LK 70
            LCF+  + G +    ++ C  C + YH+ C                            + 
Sbjct: 1253 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFDDTLMNSLLETSSISAIT 1309

Query: 71   NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 129
            N A  + L    +W CP C +C  C  +      + C++C   YH  C     + + +  
Sbjct: 1310 NTALTQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGADR 1366

Query: 130  PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 187
            P +C    KC SC +        V  F+G    C AC +L  KGN+CP+C K Y D++  
Sbjct: 1367 PLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDDNDFD 1420

Query: 188  TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELW 246
              M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C   C    D+     E W
Sbjct: 1421 LKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRC----DVSRNKAEEW 1476

Query: 247  RRKDM 251
            R+  M
Sbjct: 1477 RQAVM 1481


>gi|1042097|gb|AAB34770.1| trx Zinc-finger region homolog [Homo sapiens]
          Length = 366

 Score = 93.2 bits (230), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 92/183 (50%), Gaps = 9/183 (4%)

Query: 53  RMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRC 110
           + + C+ C + +H+ CL+     R L     +W C  C+ C +C R     K +  C +C
Sbjct: 156 KFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKC 213

Query: 111 DAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
             +YH  C  P +    +     ++C K  +C SCGS  PG G   +W   ++ C  C +
Sbjct: 214 RNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAK 273

Query: 168 LFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRC 225
           LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y +   +   + Y C
Sbjct: 274 LFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPECVAYTC 333

Query: 226 PTC 228
             C
Sbjct: 334 VNC 336


>gi|322792358|gb|EFZ16342.1| hypothetical protein SINV_07789 [Solenopsis invicta]
          Length = 3272

 Score = 93.2 bits (230), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 59/186 (31%), Positives = 88/186 (47%), Gaps = 28/186 (15%)

Query: 36   MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 93
            +C LC      G      ++ C+ C + YH  CL+   W    +     +W CP C IC+
Sbjct: 856  ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 905

Query: 94   ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 147
             C     P   + C RC  ++H  C       +SS       PY+C    +C SCGS   
Sbjct: 906  SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGISSRLYSPDRPYVCQSCIRCKSCGSE-- 959

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 206
              G++V        C  C +L  +GNYCP+C + Y +++  T M+ C  C  WVH +C+G
Sbjct: 960  --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSCWVHARCEG 1016

Query: 207  ISDEKY 212
            +SDE+Y
Sbjct: 1017 LSDERY 1022


>gi|195392284|ref|XP_002054789.1| trx [Drosophila virilis]
 gi|194152875|gb|EDW68309.1| trx [Drosophila virilis]
          Length = 3822

 Score = 93.2 bits (230), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 114/268 (42%), Gaps = 52/268 (19%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 69
            LCF+  + G +    ++ C  C + YH+ C+                             
Sbjct: 1273 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMNSLLETSNNACAI 1329

Query: 70   ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                N A N+ L    +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 1330 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1386

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C AC +L  KGN+CP+C K Y D
Sbjct: 1387 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1440

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLED-- 240
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C   C   R+  D  
Sbjct: 1441 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRNKADEW 1500

Query: 241  --AVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1501 RQAVMEEFKSSLYSVLKLLSKSRQACAL 1528


>gi|10720313|sp|Q24742.1|TRX_DROVI RecName: Full=Histone-lysine N-methyltransferase trithorax
 gi|899254|emb|CAA90349.1| predicted trithorax protein [Drosophila virilis]
          Length = 3828

 Score = 92.8 bits (229), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 114/268 (42%), Gaps = 52/268 (19%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 69
            LCF+  + G +    ++ C  C + YH+ C+                             
Sbjct: 1253 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMTSLLETSNNACAI 1309

Query: 70   ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                N A N+ L    +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 1310 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1366

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C AC +L  KGN+CP+C K Y D
Sbjct: 1367 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1420

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLED-- 240
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C   C   R+  D  
Sbjct: 1421 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRNKADEW 1480

Query: 241  --AVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1481 RQAVMEEFKSSLYSVLKLLSKSRQACAL 1508


>gi|195446233|ref|XP_002070689.1| GK10889 [Drosophila willistoni]
 gi|194166774|gb|EDW81675.1| GK10889 [Drosophila willistoni]
          Length = 3189

 Score = 92.8 bits (229), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 116/267 (43%), Gaps = 53/267 (19%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1155 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLDTSVN 1208

Query: 83   ---------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVS 127
                           +W CP C +C  C  +      + C++C   YH  C     + + 
Sbjct: 1209 PSTCAPSMNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHATCLGTSKRLLG 1266

Query: 128  SG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 185
            +  P +C    KC SC +        V  F+G    C AC +L  KGNYCP+C K Y D+
Sbjct: 1267 ADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTACFKLRKKGNYCPICQKCYDDN 1320

Query: 186  E-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE-- 239
            +    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E   ++  E  
Sbjct: 1321 DFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSHIKADEWR 1380

Query: 240  DAVRELWRRKDMADKDLIASLRAAAGL 266
             AV E ++    +   L++  R A  L
Sbjct: 1381 QAVMEEFKSSLYSVLKLLSKSRQACAL 1407


>gi|195109821|ref|XP_001999480.1| GI24532 [Drosophila mojavensis]
 gi|193916074|gb|EDW14941.1| GI24532 [Drosophila mojavensis]
          Length = 3756

 Score = 92.4 bits (228), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 105/249 (42%), Gaps = 52/249 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 69
            LCF+  + G +    ++ C  C + YH+ C+                             
Sbjct: 1234 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMNSLLDTSSNACAI 1290

Query: 70   ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                N A N+ L    +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 1291 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1347

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C AC +L  KGN+CP+C K Y D
Sbjct: 1348 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1401

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAV 242
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C   C   R   D  
Sbjct: 1402 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRQKADE- 1460

Query: 243  RELWRRKDM 251
               WR+  M
Sbjct: 1461 ---WRQAVM 1466


>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
          Length = 4387

 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 70/200 (35%), Positives = 93/200 (46%), Gaps = 26/200 (13%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
             + +LCF+  + G E+   ML C SC + YH  C +            SW C  C  C  
Sbjct: 855  TLAKLCFLCGSAGREK---MLVCSSCCEWYHVWCAEEAG------GGGSWTCARCVWCAA 905

Query: 95   CRRTGDPNKFMFCRRCDAAYHCYC--QHPP-HKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
            C R       + CR C   YH  C    PP H+  S  P +C    KC SC SN      
Sbjct: 906  CARPA---ARLRCRSCARPYHAACLPSAPPDHR--SDWPQICSSCLKCKSCDSN------ 954

Query: 152  SVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
             V  F+G    C  C +L  KGNYCP+C   YRD++  + M+ C  C RWVH  C+G+S 
Sbjct: 955  RVNKFVGSLPFCRPCFKLRQKGNYCPLCQACYRDNDFDSKMMECGWCARWVHASCEGLSG 1014

Query: 210  EKY-LQFQVDGNLQYRCPTC 228
            E Y L   +  +++Y C  C
Sbjct: 1015 EGYQLLSALPPSIEYICCKC 1034


>gi|47219426|emb|CAG10790.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1776

 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 53/160 (33%), Positives = 76/160 (47%), Gaps = 8/160 (5%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCD 111
           M+ C+ C + +H  CL    + R L  +  +W C  C+ C +C RR+ +    + CRRC 
Sbjct: 184 MIFCQICCEPFHSFCL--LPEERPLKDNKENWCCRRCKFCHVCGRRSKNTKPVLQCRRCQ 241

Query: 112 AAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
            +YH  C  P  P       P++C    +C SCG   PG    + W      C  C  L 
Sbjct: 242 TSYHPACLGPTYPKPMNCKIPWVCMTCIRCKSCGV-TPGKTWDLAWNHDEDLCPDCTLLH 300

Query: 170 VKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGIS 208
            KGN+C +C K Y D+     M+ C  C  W+H  C+GIS
Sbjct: 301 KKGNFCTICHKCYDDNMRHAEMIQCSACNHWIHYSCEGIS 340


>gi|167526642|ref|XP_001747654.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773758|gb|EDQ87394.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1547

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 51/158 (32%), Positives = 77/158 (48%), Gaps = 13/158 (8%)

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHS 141
           W+C +C+ C++C R       + C  C    H  C     P   +++  ++C    +C  
Sbjct: 633 WQCKNCQTCDVCTRIEPTQHLLSCDVCGVHRHAACASAATPSYMLATQRWVCTDCVQCEH 692

Query: 142 CG-SNVPGNGLSVR------WFLGYTCCDACGRLFVKGNYCPVCLKVYR-DSESTPMVCC 193
           CG ++V G+    +      W   +  C  CG   ++GN+CPVC K YR D     MV C
Sbjct: 693 CGATDVRGHRPDPKLREEPTWQCDFRLCFDCGLNKLRGNFCPVCGKTYRGDDYDVKMVGC 752

Query: 194 DVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTCR 229
           D C RW+H +CD I + +Y  L F V  ++ Y CP CR
Sbjct: 753 DRCDRWLHAECDDIDEARYHLLTF-VPSSMSYFCPDCR 789


>gi|349803513|gb|AEQ17229.1| putative trx zinc-finger region [Pipa carvalhoi]
          Length = 506

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 76/160 (47%), Gaps = 29/160 (18%)

Query: 85  KCPSCRI---CEICRRTGDPNKF-----------MFCRRCDAAYHCYCQHPPHKNVSSGP 130
           +CP C++   C +C    D  KF           M C +C             +N     
Sbjct: 312 QCPGCQVPDDCGVCTNCLDKPKFGGRNIKKQCCKMECNKC-------------RNKKKRV 358

Query: 131 YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STP 189
           ++C K  +C SCGS  PG G   +W   ++ C  C +LF KGN+CP+C K Y D +  + 
Sbjct: 359 WICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESK 418

Query: 190 MVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 228
           M+ C  C RWVH +C+ ++DE Y +   +  ++ Y C  C
Sbjct: 419 MMQCGKCDRWVHSKCESLTDEMYEILSNLPESVAYTCINC 458


>gi|444725290|gb|ELW65863.1| Histone-lysine N-methyltransferase MLL [Tupaia chinensis]
          Length = 3806

 Score = 88.6 bits (218), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)

Query: 103  KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
            + + C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   +W   +
Sbjct: 1399 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 1458

Query: 160  TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
            + C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y +   +
Sbjct: 1459 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 1518

Query: 218  DGNLQYRCPTC 228
              ++ Y C  C
Sbjct: 1519 PESVAYTCVNC 1529


>gi|426370676|ref|XP_004052287.1| PREDICTED: histone-lysine N-methyltransferase MLL [Gorilla gorilla
            gorilla]
          Length = 3837

 Score = 88.6 bits (218), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)

Query: 103  KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
            + + C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   +W   +
Sbjct: 1363 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 1422

Query: 160  TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
            + C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y +   +
Sbjct: 1423 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 1482

Query: 218  DGNLQYRCPTC 228
              ++ Y C  C
Sbjct: 1483 PESVAYTCVNC 1493


>gi|351705860|gb|EHB08779.1| Histone-lysine N-methyltransferase HRX [Heterocephalus glaber]
          Length = 3899

 Score = 88.6 bits (218), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)

Query: 103  KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
            + + C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   +W   +
Sbjct: 1437 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 1496

Query: 160  TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
            + C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y +   +
Sbjct: 1497 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 1556

Query: 218  DGNLQYRCPTC 228
              ++ Y C  C
Sbjct: 1557 PESVAYTCINC 1567


>gi|308497100|ref|XP_003110737.1| CRE-SET-16 protein [Caenorhabditis remanei]
 gi|308242617|gb|EFO86569.1| CRE-SET-16 protein [Caenorhabditis remanei]
          Length = 2509

 Score = 88.6 bits (218), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 96/225 (42%), Gaps = 37/225 (16%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCL--------------------KNWAQNRDL 78
           LC V  + G      M++C +C + YH  C+                    + +  N  +
Sbjct: 437 LCLVCGSIGKGVEASMVACSNCAQTYHTYCISLHDKVGSQMIIKMLSTIIFQKFQLNSAV 496

Query: 79  FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTK 138
                W+C  C  CE C   GD  K + C  CD +YH YC  PP + +  GP+ C   ++
Sbjct: 497 IT-RGWRCLDCTFCEGCGAGGDEEKLLLCEECDVSYHMYCIKPPLEAIPKGPWRCQWCSR 555

Query: 139 CHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
           C  C   +  GN L+ +       C +C  L      C  C + Y+ +E   ++ C  C 
Sbjct: 556 CRRCNHKSTSGNDLTSKGL-----CHSCQSL----QSCSRCNRGYQLNEK--IIKCSQCS 604

Query: 198 RWVHCQCDGI-SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
           +W H  C+G+ +DE+  Q  ++   + RC +CR    Q+    +A
Sbjct: 605 KWHHGFCEGLHTDEQLEQAALN---RMRCTSCRPNRAQINGFSEA 646


>gi|195329040|ref|XP_002031219.1| GM25861 [Drosophila sechellia]
 gi|194120162|gb|EDW42205.1| GM25861 [Drosophila sechellia]
          Length = 3603

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321

Query: 83   ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                            +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E 
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493

Query: 240  -DAVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521


>gi|899268|emb|CAA58584.1| ALL-1 protein [Homo sapiens]
          Length = 395

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/176 (31%), Positives = 88/176 (50%), Gaps = 11/176 (6%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
           +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 222 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 276

Query: 98  TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
                K +  C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   
Sbjct: 277 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 336

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
           +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S
Sbjct: 337 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLS 392


>gi|195501651|ref|XP_002097884.1| GE26459 [Drosophila yakuba]
 gi|194183985|gb|EDW97596.1| GE26459 [Drosophila yakuba]
          Length = 2853

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1283 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1336

Query: 83   ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                            +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 1337 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1394

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C + Y D
Sbjct: 1395 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1448

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E 
Sbjct: 1449 NDFDLKMMECGDCAQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1508

Query: 240  -DAVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1509 RQAVMEEFKASLYSVLKLLSKSRQACAL 1536


>gi|194900731|ref|XP_001979909.1| GG21380 [Drosophila erecta]
 gi|190651612|gb|EDV48867.1| GG21380 [Drosophila erecta]
          Length = 3741

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 116/269 (43%), Gaps = 55/269 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1281 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1334

Query: 83   -----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKN 125
                             +W CP C +C  C  +      + C++C   YH  C     + 
Sbjct: 1335 ASTGPSASSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRL 1392

Query: 126  VSSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYR 183
            + +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C + Y 
Sbjct: 1393 LGADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYD 1446

Query: 184  DSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE 239
            D++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E
Sbjct: 1447 DNDFDLKMMECGDCAQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSRIKAEE 1506

Query: 240  --DAVRELWRRKDMADKDLIASLRAAAGL 266
               AV E ++    +   L++  R A  L
Sbjct: 1507 WRQAVMEEFKASLYSVLKLLSKSRQACAL 1535


>gi|469800|emb|CAA83516.1| predicted trithorax protein [Drosophila melanogaster]
 gi|1052593|emb|CAA90513.1| trithorax protein trxII [Drosophila melanogaster]
 gi|1311653|gb|AAB35873.1| large trx isoform=trithorax gene product large isoform {alternatively
            spliced, exon II-containing isoform} [Drosophila,
            embryos, Peptide, 3726 aa]
          Length = 3726

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321

Query: 83   ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                            +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E 
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493

Query: 240  -DAVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521


>gi|17136556|ref|NP_476769.1| trithorax, isoform D [Drosophila melanogaster]
 gi|19550184|ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster]
 gi|290457684|sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName:
            Full=Lysine N-methyltransferase 2A
 gi|10726522|gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster]
 gi|23171244|gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster]
          Length = 3726

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321

Query: 83   ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                            +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E 
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493

Query: 240  -DAVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521


>gi|326676507|ref|XP_689347.4| PREDICTED: hypothetical protein LOC560856 [Danio rerio]
          Length = 1761

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 98/204 (48%), Gaps = 24/204 (11%)

Query: 36   MCRLCFV-GENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCR 90
            +C LC   G++E       M+ C+ C + +H  CL    +   +N++     +W C  C+
Sbjct: 1098 VCLLCASKGQHE-------MVYCQMCCEPFHHFCLPPDDRPKEENKE-----NWCCRRCK 1145

Query: 91   ICEIC-RRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVP 147
             C +C R++      + C+RC   YH  C  P  P     +  ++C    +C SCG   P
Sbjct: 1146 FCHVCGRKSKQTKPVLQCKRCMYCYHPSCLGPTYPKPVKMNTSWVCMLCIRCKSCGVT-P 1204

Query: 148  GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 206
            G      W      C  C +L  KG++C VCLK Y + E  + M+ C  C  WVH +C+G
Sbjct: 1205 GKSSDTSWNHELELCPDCNKLHSKGDFCTVCLKCYEEHEFDSSMMQCARCAHWVHPKCEG 1264

Query: 207  ISDEKY-LQFQVDG-NLQYRCPTC 228
            ++D+ + +  ++ G +L + C  C
Sbjct: 1265 LTDDLHEILCRLRGKSLVFSCAPC 1288


>gi|158818|gb|AAA29025.1| zinc-binding protein [Drosophila melanogaster]
          Length = 3759

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321

Query: 83   ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                            +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E 
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493

Query: 240  -DAVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521


>gi|198452207|ref|XP_002137435.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198131831|gb|EDY67993.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 3779

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/261 (26%), Positives = 112/261 (42%), Gaps = 54/261 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1191 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLETSLS 1244

Query: 83   -------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
                         +W CP C +C  C  +      + C++C   YH  C     + + + 
Sbjct: 1245 GAHSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGAD 1302

Query: 130  -PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE- 186
             P +C    KC SC +        V  F+G    C  C +L  KGNYCP+C K Y D++ 
Sbjct: 1303 RPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNYCPICQKCYDDNDF 1356

Query: 187  STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVREL 245
               M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C       RD      + 
Sbjct: 1357 DLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKRCAR-----RDSSRLKADE 1411

Query: 246  WRRKDMADKDLIASLRAAAGL 266
            WR+  M  ++  ASL +   L
Sbjct: 1412 WRQAVM--EEFKASLYSVLKL 1430


>gi|194764639|ref|XP_001964436.1| GF23177 [Drosophila ananassae]
 gi|190614708|gb|EDV30232.1| GF23177 [Drosophila ananassae]
          Length = 3708

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/267 (25%), Positives = 117/267 (43%), Gaps = 53/267 (19%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 1266 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEETTLMGSLLETSVT 1319

Query: 83   ---------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVS 127
                           +W CP C +C  C  +      + C++C   YH  C     + + 
Sbjct: 1320 AVGSSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLG 1377

Query: 128  SG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 185
            +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C K Y D+
Sbjct: 1378 ADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTNCFKLRKKGNFCPICQKCYDDN 1431

Query: 186  E-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE-- 239
            +    M+ C  C++WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E  
Sbjct: 1432 DFDLKMMECGDCRQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKMKAEEWR 1491

Query: 240  DAVRELWRRKDMADKDLIASLRAAAGL 266
             AV E ++    +   L++  R A  L
Sbjct: 1492 QAVMEEFKVSLYSVLKLLSKSRQACAL 1518


>gi|350400841|ref|XP_003485981.1| PREDICTED: hypothetical protein LOC100740971 [Bombus impatiens]
          Length = 2805

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 77/188 (40%), Gaps = 23/188 (12%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
            C++C    N+   +   ++ C +C    H +C+             +W+C  C+ C  C 
Sbjct: 2585 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2643

Query: 97   RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
               D +K +FC  CD  YH YC     + V  G + C +   C +CGS  PG   S R  
Sbjct: 2644 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCVNCGSREPGGINSDRNS 2701

Query: 155  ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVY---RDSESTPMVCCDVC 196
               W   Y            T C  C +L+ KG YCP C + +   R      +V C  C
Sbjct: 2702 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEVNLVHCSAC 2761

Query: 197  QRWVHCQC 204
             +++H  C
Sbjct: 2762 DKYLHLGC 2769


>gi|17136558|ref|NP_476770.1| trithorax, isoform B [Drosophila melanogaster]
 gi|19550181|ref|NP_599108.1| trithorax, isoform C [Drosophila melanogaster]
 gi|62472551|ref|NP_001014621.1| trithorax, isoform E [Drosophila melanogaster]
 gi|23171245|gb|AAN13600.1| trithorax, isoform B [Drosophila melanogaster]
 gi|23171246|gb|AAN13601.1| trithorax, isoform C [Drosophila melanogaster]
 gi|61679333|gb|AAX52951.1| trithorax, isoform E [Drosophila melanogaster]
          Length = 3358

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 900  LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 953

Query: 83   ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                            +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 954  ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1011

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C + Y D
Sbjct: 1012 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1065

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E 
Sbjct: 1066 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1125

Query: 240  -DAVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1126 RQAVMEEFKASLYSVLKLLSKSRQACAL 1153


>gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster]
 gi|1052594|emb|CAA90514.1| trithorax protein trxI [Drosophila melanogaster]
          Length = 3358

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 900  LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 953

Query: 83   ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
                            +W CP C +C  C  +      + C++C   YH  C     + +
Sbjct: 954  ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1011

Query: 127  SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
             +  P +C    KC SC +        V  F+G    C  C +L  KGN+CP+C + Y D
Sbjct: 1012 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1065

Query: 185  SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
            ++    M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C  R E  +++  E 
Sbjct: 1066 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1125

Query: 240  -DAVRELWRRKDMADKDLIASLRAAAGL 266
              AV E ++    +   L++  R A  L
Sbjct: 1126 RQAVMEEFKASLYSVLKLLSKSRQACAL 1153


>gi|307199377|gb|EFN80002.1| Supporter of activation of yellow protein [Harpegnathos saltator]
          Length = 1532

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/193 (26%), Positives = 79/193 (40%), Gaps = 23/193 (11%)

Query: 32   NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
            +  + C++C    N+   +   ++ C +C    H +C+             +W+C  C+ 
Sbjct: 1306 DVKLKCKMCLKVLNKH-NKTEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKT 1364

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
            C  C    D +K +FC  CD  YH YC     + V  G + C +   C +CGS  PG   
Sbjct: 1365 CAQCHDPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREPGGAN 1422

Query: 152  SVR-----WFLGY------------TCCDACGRLFVKGNYCPVCLKVY---RDSESTPMV 191
            S R     W   Y            T C  C +L+ KG YCP C + +   R      +V
Sbjct: 1423 SDRNSVAQWQHEYKKGEKNTRVYVSTLCIPCSKLWRKGRYCPHCSRCHTAQRLDLEPNLV 1482

Query: 192  CCDVCQRWVHCQC 204
             C  C +++H  C
Sbjct: 1483 HCSACDKYLHLDC 1495


>gi|383863051|ref|XP_003706996.1| PREDICTED: uncharacterized protein LOC100875915 [Megachile rotundata]
          Length = 3343

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
            C++C    N+   +   ++ C +C    H +C+             +W+C  C+ C  C 
Sbjct: 3123 CKMCLKTLNKHG-KVEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 3181

Query: 97   RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
               D +K +FC  CD  YH YC     + V  G + C +   C +C S  PG   S R  
Sbjct: 3182 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCASKEPGGINSDRNS 3239

Query: 155  ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 195
               W   Y            T C  C +L+ KG YCP C + +     D E+  +V C  
Sbjct: 3240 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 3298

Query: 196  CQRWVHCQC 204
            C +++H  C
Sbjct: 3299 CDKYLHLDC 3307


>gi|390178053|ref|XP_003736554.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388859306|gb|EIM52627.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 3474

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 69/261 (26%), Positives = 112/261 (42%), Gaps = 54/261 (20%)

Query: 39   LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
            LCF+  + G +    ++ C  C + YH+ C+++     +L H S                
Sbjct: 886  LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLETSLS 939

Query: 83   -------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
                         +W CP C +C  C  +      + C++C   YH  C     + + + 
Sbjct: 940  GAHSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGAD 997

Query: 130  -PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE- 186
             P +C    KC SC +        V  F+G    C  C +L  KGNYCP+C K Y D++ 
Sbjct: 998  RPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNYCPICQKCYDDNDF 1051

Query: 187  STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVREL 245
               M+ C  C +WVH +C+G+SDE+Y L   +  ++++ C  C       RD      + 
Sbjct: 1052 DLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKRCAR-----RDSSRLKADE 1106

Query: 246  WRRKDMADKDLIASLRAAAGL 266
            WR+  M  ++  ASL +   L
Sbjct: 1107 WRQAVM--EEFKASLYSVLKL 1125


>gi|307177781|gb|EFN66778.1| Supporter of activation of yellow protein [Camponotus floridanus]
          Length = 3066

 Score = 86.7 bits (213), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 79/189 (41%), Gaps = 25/189 (13%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
            C++C    N+   +   ++ C +C    H +C+             +W+C  C+ C  C 
Sbjct: 2845 CKMCLKVLNKH-NKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2903

Query: 97   RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
               D +K +FC  CD  YH YC     + V  G + C +   C +CGS   G   S R  
Sbjct: 2904 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREAGGANSDRNS 2961

Query: 155  ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 195
               W   Y            T C  C +L+ KG YCP C + +     D E+  +V C  
Sbjct: 2962 VAQWQHEYKKGEKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAQRLDLEAN-LVHCSA 3020

Query: 196  CQRWVHCQC 204
            C +++H +C
Sbjct: 3021 CDKYLHLEC 3029


>gi|332019339|gb|EGI59845.1| PHD finger protein 10 [Acromyrmex echinatior]
          Length = 1472

 Score = 86.3 bits (212), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 52/194 (26%), Positives = 82/194 (42%), Gaps = 25/194 (12%)

Query: 32   NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
            +T + C++C    N+   +   ++ C +C    H +C+             +W+C  C+ 
Sbjct: 1247 DTKLKCKMCLKVLNKH-NKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKT 1305

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
            C  C    D +K +FC  CD  YH YC     + V  G + C +   C +CGS   G   
Sbjct: 1306 CVQCHDPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREAGGAN 1363

Query: 152  SVR-----WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPM 190
            S R     W   Y            T C  C +L+ KG YCP C + +     D E+  +
Sbjct: 1364 SDRNSVAQWQHEYKKGEKNTRIYVSTLCVPCSKLWRKGRYCPHCSRCHTAQRLDLEAN-L 1422

Query: 191  VCCDVCQRWVHCQC 204
            V C  C +++H +C
Sbjct: 1423 VHCSACDKYLHLEC 1436


>gi|11072099|gb|AAG26335.2| MLL protein [Homo sapiens]
          Length = 380

 Score = 85.9 bits (211), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 95/200 (47%), Gaps = 15/200 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
           +CF+  + G       + C+ C + +H+ CL+     R L     +W C  C+ C +C R
Sbjct: 123 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 177

Query: 98  TGDPNKFMF-CRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
                K +  C +C  +YH  C    +P         ++C K  +C SCGS  PG G   
Sbjct: 178 QHRATKQLLECNKCRNSYHLECLGPTYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 237

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
           +W   ++ C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C  +S  + 
Sbjct: 238 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCGNLSGTED 297

Query: 213 LQFQVDGNL----QYRCPTC 228
             +++  NL     Y C  C
Sbjct: 298 EMYEILSNLPESVAYTCVNC 317


>gi|357614029|gb|EHJ68865.1| putative PHD finger protein 10 [Danaus plexippus]
          Length = 2413

 Score = 85.5 bits (210), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 81/192 (42%), Gaps = 23/192 (11%)

Query: 53   RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
            R L C +C  K H +C++            SW+C  C+ C  C R  D +K +FC  CD 
Sbjct: 2163 RFLVCSNCNAKLHPSCVELGPDTIRKCREYSWQCAECKTCCACSRPADDDKMLFCDLCDR 2222

Query: 113  AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR--W------------FLG 158
             +H YC       V +G + C + + C SCGS   G        W               
Sbjct: 2223 GFHIYCVG--LHTVPNGRWHCVECSVCKSCGSRSAGGPGGGPGDWHHQTRRGPGGHKLYS 2280

Query: 159  YTCCDACGRLFVKGNYCPVCLKVYRDSEST-PMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
            ++ C  C R +  G YCP+C + +   + T  +V C +C R +H +C     +   + +V
Sbjct: 2281 HSLCTPCARAYRIGRYCPLCDRSFIGPKGTMQLVICKLCDRQLHQEC---VKQTTSRLKV 2337

Query: 218  DGNLQYRCPTCR 229
               L Y C  CR
Sbjct: 2338 ---LDYTCGECR 2346


>gi|405951732|gb|EKC19620.1| Histone-lysine N-methyltransferase MLL4 [Crassostrea gigas]
          Length = 4493

 Score = 85.5 bits (210), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 43/140 (30%), Positives = 65/140 (46%), Gaps = 13/140 (9%)

Query: 35   VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
            V+C LC      G      ML C  C + +H  CL+   +  ++ H  +W C  C+ C++
Sbjct: 944  VICFLC------GSAGKHEMLYCNVCCEPFHEFCLEEEERPHEI-HSDNWCCKKCQSCQV 996

Query: 95   CRRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
            C   G  N  + C +C   YH  C    +P   +     ++C K  KC SCG+  PG+GL
Sbjct: 997  C---GRQNNLLQCDKCQNTYHPECLGPNYPTKPSKKKNIWICTKCVKCKSCGATTPGSGL 1053

Query: 152  SVRWFLGYTCCDACGRLFVK 171
            +  W   +  C  CG+L  K
Sbjct: 1054 AATWMYDFQLCYECGQLMDK 1073


>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
            occidentalis]
          Length = 2686

 Score = 85.5 bits (210), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 62/216 (28%), Positives = 90/216 (41%), Gaps = 28/216 (12%)

Query: 5    EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGK 62
            ED  R    G     S       Q Q +  V+C +C   V  N         + C  C +
Sbjct: 844  EDITRILNIGLPVIVSN------QLQLSNVVLCFVCASSVKHNPA-------VYCAMCCE 890

Query: 63   KYHRNCLKNWAQNRDL--FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
             YH  C+    Q RD+   +  +W CP C+ C  C +    N+ + C  C   YH  C  
Sbjct: 891  PYHTFCV----QIRDVENINTMNWLCPKCQACFECGKRDKRNQMLKCNSCREVYHTECIP 946

Query: 121  PPHKNVSS---GPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
            P + +  S     + C K  +C SC +N     +   W      C  C  L  +GNYCP+
Sbjct: 947  PVYPSKPSKRKNIWTCIKCIRCKSCETNA---RVMSGWNFDLQLCTECVSLRQRGNYCPL 1003

Query: 178  CLKVY-RDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
            C K Y  D     M+ C  C++WVH  C+ ++ E+Y
Sbjct: 1004 CEKCYDADDYDINMIECARCRKWVHASCEELTAEEY 1039


>gi|256078227|ref|XP_002575398.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
 gi|353230389|emb|CCD76560.1| putative mixed-lineage leukemia protein, mll [Schistosoma mansoni]
          Length = 3002

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 10/162 (6%)

Query: 53   RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
            ++L C SC + +H  C++   + R   H+    C +C  C++CRR   P   + C RC  
Sbjct: 965  QLLFCVSCAEPFHFYCVERQFRPRRKEHFV---CRNCTTCKVCRR---PASDLRCIRCST 1018

Query: 113  AYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 170
             YH  C    PP K    G + CP+ + C  CG           W      C AC +   
Sbjct: 1019 GYHPECLADFPPAKTSQRGCWTCPECSVCLHCGVRA-CKPAETTWSYETNKCAACSQAES 1077

Query: 171  KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
            KG+ CP C + Y  + +  M+ CD CQ W+H  C  ++ ++Y
Sbjct: 1078 KGDVCPECDRAYLPT-TKQMIQCDSCQLWMHRTCTKLTVDEY 1118


>gi|328785896|ref|XP_003250672.1| PREDICTED: hypothetical protein LOC725681 [Apis mellifera]
          Length = 2891

 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
            C++C    N+   +   ++ C +C    H +C+             +W+C  C+ C  C 
Sbjct: 2671 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2729

Query: 97   RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
               D +K +FC  CD  YH YC     + V  G + C +   C +C S  PG   S R  
Sbjct: 2730 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCSSREPGGINSDRNS 2787

Query: 155  ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 195
               W   Y            T C  C +L+ KG YCP C + +     D E+  +V C  
Sbjct: 2788 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 2846

Query: 196  CQRWVHCQC 204
            C +++H  C
Sbjct: 2847 CDKYLHLGC 2855


>gi|380029720|ref|XP_003698514.1| PREDICTED: uncharacterized protein LOC100870597 [Apis florea]
          Length = 3312

 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
            C++C    N+   +   ++ C +C    H +C+             +W+C  C+ C  C 
Sbjct: 3092 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 3150

Query: 97   RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
               D +K +FC  CD  YH YC     + V  G + C +   C +C S  PG   S R  
Sbjct: 3151 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCSSREPGGINSDRNS 3208

Query: 155  ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 195
               W   Y            T C  C +L+ KG YCP C + +     D E+  +V C  
Sbjct: 3209 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 3267

Query: 196  CQRWVHCQC 204
            C +++H  C
Sbjct: 3268 CDKYLHLGC 3276


>gi|444515379|gb|ELV10878.1| Histone-lysine N-methyltransferase MLL2 [Tupaia chinensis]
          Length = 3975

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 314 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 365

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP + + +  + C     C +CG        +  WF
Sbjct: 366 KPGNDSKMLVCEMCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELDPNSEWF 425

Query: 157 LGYTCCDAC 165
             Y+ C  C
Sbjct: 426 ENYSLCHRC 434



 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 59/141 (41%), Gaps = 17/141 (12%)

Query: 89   CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
            C +C    R  +    + C +C   YH YC +     VSSG   C     C  CG+  PG
Sbjct: 1038 CVVCGSFGRGAE-GHLLACSQCSQCYHPYCVN---SKVSSGLKRC---VSCMQCGAASPG 1090

Query: 149  NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 208
                  W   YT C  C  L      CP+C   Y + +   ++ C  C+RW+H  C+ + 
Sbjct: 1091 --FHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLF 1142

Query: 209  DEKYLQFQVDGNLQYRCPTCR 229
             E  ++   D    + C +C+
Sbjct: 1143 TEDDVEQAADEG--FDCVSCQ 1161


>gi|124111214|gb|ABM91995.1| MLL [Pan troglodytes]
          Length = 390

 Score = 83.2 bits (204), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)

Query: 103 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
           + + C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   +W   +
Sbjct: 1   QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 60

Query: 160 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
           + C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y +   +
Sbjct: 61  SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 120

Query: 218 DGNLQYRCPTC 228
             ++ Y C  C
Sbjct: 121 PESVAYTCVNC 131


>gi|326431279|gb|EGD76849.1| hypothetical protein PTSG_12693 [Salpingoeca sp. ATCC 50818]
          Length = 3557

 Score = 82.8 bits (203), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 55/183 (30%), Positives = 77/183 (42%), Gaps = 17/183 (9%)

Query: 54  MLSCKSCGKKYHRNCLKN------WAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFM 105
           ML C +C +++H  C         +A N +    + W C  C  C  C  T        M
Sbjct: 783 MLLCMACARRFHPRCAGMQPSSPLFAANEESVR-TCWLCADCTTCSRCDSTASARSRARM 841

Query: 106 FCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDAC 165
            CR C   +H  C   P +  S   Y C    +C  C      N +S  W   +  C  C
Sbjct: 842 ACRVCGKEFHPGCAGLPKRPPS---YCCEDCRRCLDCHRT--ANEVS-SWSATFDYCTPC 895

Query: 166 GRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRC 225
            +L  +GN CPVC   YRD ++  MV C+ C  WVH +C+G+ D    +     +  Y C
Sbjct: 896 SQLRKRGNVCPVCKVSYRD-DTPDMVLCETCDTWVHAECEGLDDAGLRRLGATDD-PYHC 953

Query: 226 PTC 228
            TC
Sbjct: 954 STC 956



 Score = 66.2 bits (160), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 86/213 (40%), Gaps = 33/213 (15%)

Query: 36   MCRLCF---VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN--RDLFHW--SSWKCPS 88
            MC LC    VG ++       M+ C +C + +H  CL +      R  +H+  S ++CP 
Sbjct: 1038 MCTLCHSAGVGGDDDL-----MVFCDTCCEAFHLGCLLDTVPTTYRHAWHYDFSKFRCPR 1092

Query: 89   CRICEICRRTGDPN------------KFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKH 136
            C  C+ C R   P+            + + C RC    H  C  P H ++ +  ++CP+ 
Sbjct: 1093 CLQCQCCARALAPDLVIGRIINMDTAEHVVCSRCQTICHTECIPPSHPSMDTKHWVCPEC 1152

Query: 137  TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDV 195
             KC  C           R+ +  T       L     +CPVC   +    S   +  C+ 
Sbjct: 1153 IKCSMCRRWQAARSSDWRYCMNCT-------LAQDTQHCPVCKDEFEIKLSLQYILQCNA 1205

Query: 196  CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
            C+  VH  CD +   +  + +   N  Y CPTC
Sbjct: 1206 CKGLVHDHCDPLIKSRRTRKKYIEN-GYECPTC 1237


>gi|395755470|ref|XP_003779950.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like, partial
           [Pongo abelii]
          Length = 225

 Score = 82.4 bits (202), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 71/147 (48%), Gaps = 10/147 (6%)

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC- 142
           W+C  C +CE C +  DP + + C  CD +YH YC  PP + V  G + C  +++C  C 
Sbjct: 10  WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWYSRCVWCR 69

Query: 143 GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHC 202
                  GL   W   YT C  C  L    + CPVC + YR+     ++ C  C RW+H 
Sbjct: 70  HCGATSAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREDL---ILQCRQCDRWMHA 122

Query: 203 QCDGISDEKYLQFQVDGNLQYRCPTCR 229
            C  ++ E+ ++   D  + + C  CR
Sbjct: 123 VCQNLNTEEEVENVAD--IGFDCSMCR 147


>gi|403360488|gb|EJY79922.1| Histone-lysine N-methyltransferase [Oxytricha trifallax]
          Length = 2438

 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 74/163 (45%), Gaps = 17/163 (10%)

Query: 36   MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            MC LC    N         LSC  CG+ +H  CL+   +++   +   WKC +C+ CEIC
Sbjct: 1208 MCYLCGSFGN-----GEDFLSCTLCGESFHTYCLQ-LPEDQVSKYQQYWKCLNCKFCEIC 1261

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS--------NVP 147
                     ++C  CD A+H +C  P  K++ +  + C +  KC  CG+        N  
Sbjct: 1262 ASATQEAFLLYCDVCDKAFHSFCLKPQLKSIPNCQWKCQECFKCQQCGTKEFFSAKDNEE 1321

Query: 148  GNGLSVRWF---LGYTCCDACGRLFVKGNYCPVCLKVYRDSES 187
               L V  F     ++ C  CG+   K ++C +C K   DS+S
Sbjct: 1322 KRNLEVTDFEFSQNFSFCYQCGKNEYKKSFCKICQKKTEDSDS 1364



 Score = 79.0 bits (193), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 60/221 (27%), Positives = 92/221 (41%), Gaps = 32/221 (14%)

Query: 55  LSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           L C +C K YH  CLK  N   N  L   + W+C  C  C  C    + +  + C++C+A
Sbjct: 579 LKCSNCSKNYHLQCLKLPNIHSNLGL-RETDWRCQDCIRCTNCLSLRNRDSMLICQKCNA 637

Query: 113 AYHCYC------QHPP----HKNVSSGP-------------YLCPKHTKCHSCGSNVPGN 149
            +H  C      Q  P     K++  G              Y C +  +C +CGS   G 
Sbjct: 638 GFHYDCLDQSVKQGVPSISNEKDLKLGIETRKTTYSQLNQFYKCEQCVECENCGSKEAGE 697

Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS----ESTPMVCCDVCQRWVHCQCD 205
             + +W   Y  C  C +      +C VC K + D+    +    + C  C   VH  CD
Sbjct: 698 KRNNKWSKDYKLCTTCNKKRSNKEFCLVCEKFWPDTKEKQDQLQTIQCVQCLMCVHQDCD 757

Query: 206 GI-SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVREL 245
            I  +   LQ   DG+L+Y C  CR    ++R + + + +L
Sbjct: 758 RIFKNPNVLQQFSDGSLKYNCQKCRQNV-RLRFISEIIEKL 797


>gi|326436266|gb|EGD81836.1| hypothetical protein PTSG_02551 [Salpingoeca sp. ATCC 50818]
          Length = 830

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/167 (27%), Positives = 79/167 (47%), Gaps = 21/167 (12%)

Query: 76  RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           +D+   ++W   +CR+ E C  T D    +FC  CD  +H YC      ++  G ++C  
Sbjct: 658 KDIMTGTAW---NCRVFEKCGSTKDEKDILFCDECDRGFHTYCTG--LTSLPRGRWICSH 712

Query: 136 HTKCHSCGSNVPGNGLSVRW------------FLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + C  C    P +  + +W            FL  T CDAC + +  G++CP+CL+++ 
Sbjct: 713 CSVCDGCNFK-PDDPSTYKWSHFTDRRDNQRRFL-KTFCDACFKKWNTGDFCPICLELF- 769

Query: 184 DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
              S  ++ C +C R VH  C+ ++ E    F+  G  ++ C  C G
Sbjct: 770 -EPSADLLECSLCSRLVHKDCEDLTPEDEETFKAQGWGRFACSICSG 815


>gi|449679774|ref|XP_002159618.2| PREDICTED: uncharacterized protein LOC100210554 [Hydra
           magnipapillata]
          Length = 1172

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 89/211 (42%), Gaps = 43/211 (20%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M+ C  C + +H  CL     + +     +W C  C+ C +C   G     + C  C   
Sbjct: 1   MIFCSLCCEPFHTFCL-----DVEPVFKKNWYCDRCKYCTVC---GMKESLLMCDICHDC 52

Query: 114 YHCYCQHPPHKNVSSG----------------------PYLCPKH----TKCHSC----- 142
           YH  C  P ++  + G                       Y+   +     +C  C     
Sbjct: 53  YHAECLGPFYQCETEGEDEIWFRWNWLQEKDYNDEHYSAYITKINKAGAVQCTICNKDFY 112

Query: 143 -GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR-DSESTPMVCCDVCQRWV 200
            GS V  N  S RW   +T C+ C   + KGNYCPVCL +Y  D ++  M+ C  C+ WV
Sbjct: 113 YGSEVTMNK-SSRWMQNFTMCEKCDFNWKKGNYCPVCLVLYNDDDDALKMMFCTKCEAWV 171

Query: 201 HCQCDGISDEKY-LQFQVDGNLQYRCPTCRG 230
           H +C+GI+++ Y +   +  ++ Y C  C G
Sbjct: 172 HMECEGITEDDYEILADLPDDVPYLCKLCTG 202


>gi|68070837|ref|XP_677332.1| SET-domain protein [Plasmodium berghei strain ANKA]
 gi|56497407|emb|CAH97187.1| SET-domain protein, putative [Plasmodium berghei]
          Length = 1439

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 10/159 (6%)

Query: 80   HWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
            H+  + C  C  C  C       ++T +   ++ C+ C+   H  C +P   ++    + 
Sbjct: 969  HYKKYICKECYRCIYCCESIYNYKQTPNVANYVICKSCNMVAHGSCCYPNVPDIYLFNWK 1028

Query: 133  CPKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 191
            C    KC  C  SN+     +  W L   CC  C + + K N+C +C + Y+  +S   V
Sbjct: 1029 CDDCLKCSKCDYSNLCFINYN-EWELHLDCCINCYKEYEKKNFCIICNEKYKVDDSNKWV 1087

Query: 192  CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
             CDVC+ W+H  CD   D       +  ++ Y+CPTCR 
Sbjct: 1088 ECDVCKFWIHLSCDKDEDRNIETLAI-KHINYKCPTCRS 1125


>gi|330804473|ref|XP_003290219.1| hypothetical protein DICPUDRAFT_15851 [Dictyostelium purpureum]
 gi|325079683|gb|EGC33272.1| hypothetical protein DICPUDRAFT_15851 [Dictyostelium purpureum]
          Length = 630

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 85/192 (44%), Gaps = 31/192 (16%)

Query: 54  MLSCKSCGKKYHRNCLK------NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFC 107
           ++ C  C KK+H  CL       +  +N  L    +WKC  C+ CE+C+   D +K + C
Sbjct: 416 LIKCSECQKKFHPQCLGLHQTCVDSIRNNTL----AWKCTDCKNCEVCQNDVDESKIIIC 471

Query: 108 RRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN--GLSVRWFLGYTCCDAC 165
             CD  +H YC +PP  +  SG + C     C  C    P N    +++W   YT CD+C
Sbjct: 472 DVCDKGFHTYCLNPPLSSPPSGGWRCSNCVFCTHCYIR-PENSENKNIKWKDNYTSCDSC 530

Query: 166 GRLFVKG-----NYCPVCLKVYRD----SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQ 216
              F KG      YC +C    +      +ST    C  C + VH  CD I  +      
Sbjct: 531 ---FSKGFSDKSKYCSICRHSLKSEEEEEDSTTQ--CTYCGKLVHDDCDSIITDNL---- 581

Query: 217 VDGNLQYRCPTC 228
            + +  Y+CP C
Sbjct: 582 ENEHFIYKCPGC 593


>gi|196016261|ref|XP_002117984.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
 gi|190579457|gb|EDV19552.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
          Length = 183

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 10/146 (6%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +++C  CG+ YH  C+        +   + W+C  C +CE C + GD ++ + C  CD +
Sbjct: 46  LIACFQCGQSYHHYCVSAKLTRSVIV--NGWRCLDCAVCEGCGKAGDEDRLLLCDECDIS 103

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
           YH YC +P    V  G + C +   CH CGSN P  G++ +W   YT C  C    V   
Sbjct: 104 YHTYCLNPQLDKVPEGEWKCHRCVSCHDCGSNFP--GINCQWTSNYTQCGPCASTSV--- 158

Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRW 199
            CP C K Y + +   +V C+ C R+
Sbjct: 159 -CPKCNKKYVNDD--IIVQCNNCDRY 181


>gi|313212234|emb|CBY36242.1| unnamed protein product [Oikopleura dioica]
          Length = 906

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 75/176 (42%), Gaps = 14/176 (7%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C +CG   H  CL            + W+C +C+IC+ CR++ D  + + C  CD  
Sbjct: 174 LLFCTTCGAHSHARCLNEGIIVTGEVR-AGWQCYTCKICQQCRKSDDDAQMIICETCDKG 232

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
           +H YC +P   +V    + C     C  C     GN + + W   Y+ C  C        
Sbjct: 233 WHTYCLNPVMDSVPKDGWSCTNCRNCIEC-----GNKIHLEWQNDYSTCPVCW----SKQ 283

Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            C VC K Y++ +   ++ C  C RW H   + +  E       + N  ++C  CR
Sbjct: 284 ACSVCNKDYKEDDV--LLKCSECLRWQHALHEQVYSESDADSMAEQN--FKCKLCR 335


>gi|149032108|gb|EDL87020.1| rCG50635 [Rattus norvegicus]
          Length = 609

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
           C +C  G  E C+    +  C SCG  YH  CL      R     + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQACR 280

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           + G+ +K + C  CD  YH +C  PP +++ +  + C     C +CG+       +  WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340

Query: 157 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
             Y+ C  C +  V+G+       V   +E  P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369


>gi|332839578|ref|XP_003313789.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Pan
           troglodytes]
          Length = 429

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 63/137 (45%), Gaps = 10/137 (7%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +  C SCG  YH  CL      R     + W+CP C++C+ CR+ G+ +K + C  CD  
Sbjct: 293 LFFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQACRKPGNDSKMLVCETCDKG 349

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL----- 168
           YH +C  PP + + +  + C     C +CG+       +  WF  Y+ C  C +      
Sbjct: 350 YHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWFENYSLCHRCHKAQGGQP 409

Query: 169 --FVKGNYCPVCLKVYR 183
              V   + PVC + +R
Sbjct: 410 IRSVAEQHTPVCSRAWR 426


>gi|82596471|ref|XP_726275.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23481616|gb|EAA17840.1| Bromodomain, putative [Plasmodium yoelii yoelii]
          Length = 4805

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 44/158 (27%), Positives = 71/158 (44%), Gaps = 10/158 (6%)

Query: 80   HWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
            H+  + C  C  C  C       ++T +   ++ C+ C+   H  C  P   ++    + 
Sbjct: 1337 HYKKYICKECYRCIYCCESIYNYKQTPNVANYVICKSCNMVAHGSCCFPNVPDIYLFNWK 1396

Query: 133  CPKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 191
            C    KC  C  SN+     +  W L   CC  C + + K N+C +C + Y+  +S   V
Sbjct: 1397 CDDCLKCSKCDYSNLCFINYN-EWELHLDCCINCYKEYEKKNFCIICNEKYKVDDSNKWV 1455

Query: 192  CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
             CDVC+ W+H  CD   D       +  ++ Y+CPTCR
Sbjct: 1456 ECDVCKFWIHLSCDKDEDRNIETLAIK-HINYKCPTCR 1492


>gi|148672214|gb|EDL04161.1| mCG145001 [Mus musculus]
          Length = 630

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 61/126 (48%), Gaps = 5/126 (3%)

Query: 40  CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 99
           C V E  G  +   +L C SCG  YH  CL      R     + W+CP C++C+ CR+ G
Sbjct: 250 CAVCEGPG--QLCDLLFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQSCRKPG 304

Query: 100 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
           + +K + C  CD  YH +C  PP +++ +  + C     C +CG+       +  WF  Y
Sbjct: 305 NDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWFENY 364

Query: 160 TCCDAC 165
           + C  C
Sbjct: 365 SLCHRC 370


>gi|388854432|emb|CCF52016.1| related to histone acetyltransferase 3 (myst) [Ustilago hordei]
          Length = 1215

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 7/104 (6%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN----RDLFHWSSWKCPSCR 90
           +C  C    E+   +  + ++SC  CG   H +CLK W +N    R    +  W+C  C+
Sbjct: 83  VCAFCLRTAEHPKGDTPKLLVSCHECGSSGHPSCLK-WGRNPTKVRQALSYD-WRCIECK 140

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
            CE+CR  GD  + MFC +CD  +H YC  PP      G + CP
Sbjct: 141 KCEVCRDKGDDAQLMFCDKCDRGWHLYCLSPPLSKPPKGQWHCP 184


>gi|328716146|ref|XP_003245847.1| PREDICTED: hypothetical protein LOC100162709 isoform 2 [Acyrthosiphon
            pisum]
          Length = 1426

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 62/148 (41%), Gaps = 4/148 (2%)

Query: 37   CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C+LC    +         ++ C  C   YH  CL    +        +W+C  C+ C  C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
            +   D +K +FC  CD  YH YC     + V  G + C +   C SCG + PG G S +W
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPGPGDS-KW 1398

Query: 156  FLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            F  +   +  G        C  C ++++
Sbjct: 1399 FYEFKKTEKTGSKVYCRTLCAPCSRMHQ 1426


>gi|156374109|ref|XP_001629651.1| predicted protein [Nematostella vectensis]
 gi|156216656|gb|EDO37588.1| predicted protein [Nematostella vectensis]
          Length = 251

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 78/177 (44%), Gaps = 12/177 (6%)

Query: 53  RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           +++ C  CG+ +H  C+    +   +     W+C  C +CE C +  D  + + C  CD 
Sbjct: 15  QLIVCSQCGQCFHPYCVG--VKVNKMILSKGWRCLDCTLCEGCGKGSDEARLLLCDSCDI 72

Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
           +YH YC  PP + V  G + C     C  CG+   G      W   YT C  C       
Sbjct: 73  SYHTYCLDPPLEKVPPGGWKCKWCVSCDDCGATSAGT--QCEWQSNYTQCGPC----ASK 126

Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
             CPVC   Y  ++   M+ C  C RW+H  CDG+  E+ +    D    Y+C  CR
Sbjct: 127 TSCPVCNIKYNLNDL--MIQCLHCDRWLHGSCDGLMTEEEVDRAAD--YGYQCLYCR 179


>gi|74150118|dbj|BAE24369.1| unnamed protein product [Mus musculus]
          Length = 742

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 58/121 (47%), Gaps = 5/121 (4%)

Query: 31  SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           SNT+   +M  +C V  + G      +L+C  C + YH  C+ +      L     W+C 
Sbjct: 622 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 679

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            C +CE+C +  DP++ + C  CD +YH YC  PP   V  G + C     C  CG+  P
Sbjct: 680 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 739

Query: 148 G 148
           G
Sbjct: 740 G 740


>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
          Length = 4664

 Score = 76.3 bits (186), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 50/196 (25%), Positives = 76/196 (38%), Gaps = 59/196 (30%)

Query: 36  MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           MC +C  F    EG     R+L+C  CG+ YH  C+    +   +     W+C  C +CE
Sbjct: 735 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 787

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
            C +  DP + + C  CD +YH YC  PP + V  G                        
Sbjct: 788 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKG------------------------ 823

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
               G+ C                  K YR+ +   ++ C  C RW+H  C  +S E+ +
Sbjct: 824 ----GWKC------------------KCYREEDL--ILQCRQCDRWMHAVCQNLSTEEEV 859

Query: 214 QFQVDGNLQYRCPTCR 229
           +   D  + + C  CR
Sbjct: 860 ENVAD--IGFDCSMCR 873


>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
 gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
 gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
          Length = 6753

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 46/178 (25%), Positives = 78/178 (43%), Gaps = 8/178 (4%)

Query: 58   KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRC 110
            KS  KKY R      +       +  + C  C  C  C       ++T +   ++ C+ C
Sbjct: 1640 KSKNKKYRRCINYIPSVEHSDITYKKFICKDCYRCIYCCESIYDYKQTPNVANYVICKNC 1699

Query: 111  DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 170
            +   H  C  P   ++    + C    KC+ C  +         W L   CC  C + + 
Sbjct: 1700 NMVAHGSCCFPNVPDIYLFNWKCDDCLKCNKCNYSNLCYINYNEWELHLDCCINCYKEYE 1759

Query: 171  KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
            K N+C +C + Y + +S   V CDVC+ W+H  CD  ++ + ++   + N+ Y+CPTC
Sbjct: 1760 KKNFCIMCNEKYDEDDSKKWVQCDVCKFWIHLSCDK-NESRNIETLSNKNIDYKCPTC 1816


>gi|358252877|dbj|GAA50314.1| histone-lysine N-methyltransferase MLL4 [Clonorchis sinensis]
          Length = 1769

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/180 (27%), Positives = 77/180 (42%), Gaps = 27/180 (15%)

Query: 53  RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           ++L C SC + +H  C++   + R   H+       CR C  C+    P   + C RC A
Sbjct: 497 QLLFCVSCAEPFHFYCVERQFRPRRKDHFI------CRNCTECKECHSPAADLRCIRCSA 550

Query: 113 AYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGS--------------NVPGNGLSVRWF 156
            YH  C   + P ++V  G ++CP    C  CGS                P  G   R  
Sbjct: 551 GYHPSCLSDYAPAQSVHRGNWVCPHCASCVHCGSKPLHKRQEAGTETGKPPNTGGCSRST 610

Query: 157 LGYTC----CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
             ++     C AC     +G+ CP C + Y  + +  M+ CD CQ W+H  C  ++ ++Y
Sbjct: 611 TAWSSEPNKCAACCSAEARGDICPECDRAYLPT-TKQMIQCDTCQLWMHRTCTKLTADEY 669


>gi|319411664|emb|CBQ73708.1| related to histone acetyltransferase 3 (myst) [Sporisorium
           reilianum SRZ2]
          Length = 1223

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 9/105 (8%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN-----RDLFHWSSWKCPSC 89
           +C  C    +    +  + ++SC  CG   H +CL+ W +N     + L +   W+C  C
Sbjct: 87  LCAFCLQTADRPKGDTPKLLISCYECGSSGHPSCLR-WGRNPTKVGKALSY--DWRCIEC 143

Query: 90  RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
           + CEICR  GD  + MFC RCD  +H YC  PP      G + CP
Sbjct: 144 KKCEICRDKGDDAQLMFCDRCDRGWHLYCLSPPLLKPPKGQWHCP 188


>gi|221057732|ref|XP_002261374.1| SET-domain protein [Plasmodium knowlesi strain H]
 gi|194247379|emb|CAQ40779.1| SET-domain protein, putative [Plasmodium knowlesi strain H]
          Length = 6442

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/157 (26%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 81   WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +  + C  C  C  C       ++T +   ++ C+ C+   H  C  P   ++    + C
Sbjct: 1582 YKKYVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1641

Query: 134  PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
             +  KC+ C  SN+     +  W     CC  C + + K N+C +C + Y   +S   V 
Sbjct: 1642 DECLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1700

Query: 193  CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            CDVC+ W+H  CD   +       +   + Y+CPTCR
Sbjct: 1701 CDVCKFWIHLSCDKNENRNIETLSIKS-INYKCPTCR 1736


>gi|443696184|gb|ELT96955.1| hypothetical protein CAPTEDRAFT_106026 [Capitella teleta]
          Length = 319

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 37/95 (38%), Positives = 48/95 (50%), Gaps = 3/95 (3%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           LC V +  GC      L C SCG+ YH NCL    Q   +   S W+CP C+IC+ CR+ 
Sbjct: 219 LCIVCDVPGC--ISDQLFCTSCGQHYHGNCLDPPVQVNPVVR-SGWQCPECKICQTCRQP 275

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           GD NK + C  CD  YH +C  P    +    + C
Sbjct: 276 GDDNKMLVCDTCDKGYHIFCLRPVMTTIPKNGWKC 310


>gi|20977848|gb|AAM33377.1|AF492830_1 CDK6/MLL fusion protein [Homo sapiens]
          Length = 182

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)

Query: 103 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
           + + C +C  +YH  C  P +    +     ++C K  +C SCGS  PG G   +W   +
Sbjct: 49  QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 108

Query: 160 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
           + C  C +LF KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +SDE Y +   +
Sbjct: 109 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 168

Query: 218 DGNLQYRCPTC 228
             ++ Y C  C
Sbjct: 169 PESVAYTCVNC 179


>gi|240952194|ref|XP_002399348.1| zinc finger protein, putative [Ixodes scapularis]
 gi|215490554|gb|EEC00197.1| zinc finger protein, putative [Ixodes scapularis]
          Length = 379

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 60/125 (48%), Gaps = 5/125 (4%)

Query: 36  MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
           +C++C     +  +    ++SC  CGK  H  CL    +         W+C  C++C IC
Sbjct: 232 VCKVC--NNVDASKEGEELISCSECGKVGHVTCLDILPEMAVAIKSYRWQCMECKMCNIC 289

Query: 96  RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG-NGLSVR 154
             T +  K MFC RCD  YH +C     K+V +G ++C    +C +CG   PG  G  V+
Sbjct: 290 MATDNEEKMMFCDRCDRGYHSFCVG--MKSVPAGRWICRLCGRCATCGVASPGPEGPRVQ 347

Query: 155 WFLGY 159
           W   Y
Sbjct: 348 WHHEY 352


>gi|389584530|dbj|GAB67262.1| SET domain containing protein [Plasmodium cynomolgi strain B]
          Length = 5788

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 73/162 (45%), Gaps = 10/162 (6%)

Query: 81   WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +  + C  C  C  C       ++T +   ++ C+ C+   H  C  P   ++    + C
Sbjct: 1113 YKKYVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1172

Query: 134  PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
                KC+ C  SN+     +  W     CC  C + + K N+C +C + Y   +S   V 
Sbjct: 1173 DDCLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1231

Query: 193  CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQ 234
            CDVC+ W+H  CD  ++ + ++     ++ Y+CPTCR   + 
Sbjct: 1232 CDVCKFWIHLSCDK-NESRNIETLSIKSINYKCPTCRSGSFH 1272


>gi|443696185|gb|ELT96956.1| hypothetical protein CAPTEDRAFT_106029, partial [Capitella teleta]
          Length = 175

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 70/148 (47%), Gaps = 8/148 (5%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           LC    + G     R++ C  CG+ YH  C  N   +R +     W+C  C +CE C R 
Sbjct: 26  LCVSCGSLGANEESRLIVCSQCGQCYHPYC-ANVKLSRIILE-KGWRCLDCTVCEGCGRP 83

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
            D ++ + C  CD +YH YC  PP ++V  G + C     C +CG+  PG   +  W   
Sbjct: 84  HDESRLILCDECDISYHIYCLDPPLESVPRGTWKCKWCAICVTCGTTAPGTNCA--WQNN 141

Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSE 186
           Y+ C  C    +    CP+C + Y++ +
Sbjct: 142 YSQCGPCYSTVM----CPLCYRSYKEDD 165


>gi|320163082|gb|EFW39981.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 660

 Score = 72.8 bits (177), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 74/190 (38%), Gaps = 23/190 (12%)

Query: 56  SCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYH 115
           +C SC    H +C        +      W+C +C+ C +C  T +  + + C  CD  YH
Sbjct: 196 ACASCNNGGHVSCFHMTKLMGETVQTYGWECSNCKSCAMCNSTENETEMLLCDVCDRGYH 255

Query: 116 CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL-------SVRWFLGYTC------- 161
             C     K +  G ++C     C +C S + G  L        V W   Y         
Sbjct: 256 IQCID--LKTMPLGRWVCSLCNACTNCHSKLAGAILPRKEMDRPVDWLTFYATDVQSMKF 313

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C A    + +  YC +CLKV    E   +  CD C R  H  CD       L  +  G  
Sbjct: 314 CSA-RSSWSQHEYCAICLKVNHPGEKLRLYKCDGCLRQTHPSCDK------LYKRPRGRT 366

Query: 222 QYRCPTCRGE 231
           +Y CP CRG+
Sbjct: 367 RYFCPICRGD 376


>gi|156101223|ref|XP_001616305.1| SET domain containing protein [Plasmodium vivax Sal-1]
 gi|148805179|gb|EDL46578.1| SET domain containing protein [Plasmodium vivax]
          Length = 6587

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 69/162 (42%), Gaps = 10/162 (6%)

Query: 81   WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +  + C  C  C  C       ++T +   ++ C+ C+   H  C  P   ++    + C
Sbjct: 1629 YKKFVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1688

Query: 134  PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
                KC+ C  SN+     +  W     CC  C + + K N+C +C + Y   +S   V 
Sbjct: 1689 DDCLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1747

Query: 193  CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQ 234
            CDVC+ W+H  CD   +       +   + Y+CPTCR   + 
Sbjct: 1748 CDVCKFWIHLSCDKNENRNIETLSIKS-INYKCPTCRSGSFH 1788


>gi|428178234|gb|EKX47110.1| hypothetical protein GUITHDRAFT_137718 [Guillardia theta CCMP2712]
          Length = 1125

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 46/94 (48%), Gaps = 1/94 (1%)

Query: 137 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDV 195
            KC+SCG+  PG      W   YT C  CG L  K  YC VC KV ++ E+   +  C V
Sbjct: 715 VKCNSCGAKTPGKKPMDAWKQAYTQCSRCGELHAKKRYCSVCDKVLKEIETEDQILKCSV 774

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           C  WVH +CD + +   +   +  +  Y CP  R
Sbjct: 775 CDLWVHGRCDDLDELSLVGMNLLSSEHYVCPKHR 808


>gi|340719315|ref|XP_003398100.1| PREDICTED: hypothetical protein LOC100644567 [Bombus terrestris]
          Length = 2857

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/153 (27%), Positives = 63/153 (41%), Gaps = 9/153 (5%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
            C++C    N+   +   ++ C +C    H +C+             +W+C  C+ C  C 
Sbjct: 2697 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2755

Query: 97   RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
               D +K +FC  CD  YH YC     + V  G + C +   C +CGS  PG   S R  
Sbjct: 2756 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCVNCGSREPGGINSDRNS 2813

Query: 155  ---WFLGYTCCDACGRLFVKGNYCPVCLKVYRD 184
               W   Y   D   R++V    C  C K+  D
Sbjct: 2814 VAQWQHEYKKGDKNTRVYV-STLCVPCSKLRGD 2845


>gi|449670407|ref|XP_004207258.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
           [Hydra magnipapillata]
          Length = 491

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 42/81 (51%), Gaps = 1/81 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           +L CK CG K H +CL   A+  +      SW+C  C+ C IC  TGDP+  +FC  CD 
Sbjct: 209 LLICKDCGNKAHPSCLSYSAELVEQIRSDGSWQCIDCKACIICEGTGDPDTLLFCDACDK 268

Query: 113 AYHCYCQHPPHKNVSSGPYLC 133
            YH  C  P    + SG + C
Sbjct: 269 GYHMNCHEPKLTQMPSGKWAC 289


>gi|390350878|ref|XP_788653.3| PREDICTED: zinc finger protein DPF3-like [Strongylocentrotus
           purpuratus]
          Length = 418

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 57/126 (45%), Gaps = 8/126 (6%)

Query: 16  VATASKDIAGEEQ------GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRN 67
           V +AS+D  GE        G   TN  C  C     EN+  +    ++SC  CG+  H  
Sbjct: 288 VTSASEDSLGEVSKSSKGGGGPETNNYCDFCLGDATENKKTQTPEDLISCSDCGRSGHPT 347

Query: 68  CLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVS 127
           CL+             W+C  C+ C +C  + + ++ +FC  CD  YH YC +PP +   
Sbjct: 348 CLQFTDTMIQKVKGYRWQCIECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLNPPMQAPP 407

Query: 128 SGPYLC 133
            G ++C
Sbjct: 408 EGSWIC 413


>gi|149056303|gb|EDM07734.1| rCG53696 [Rattus norvegicus]
          Length = 169

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 69/137 (50%), Gaps = 6/137 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
           ++ C+ C   +H  CL+  A+     H  +W C  C+ C +C R G  +K +  C RC  
Sbjct: 29  LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 87

Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
           AYH  C  P +   ++     ++C    +C SCG+  PG    V W   Y+ C  C  L+
Sbjct: 88  AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 146

Query: 170 VKGNYCPVCLKVYRDSE 186
            KGNYCP+C + Y D++
Sbjct: 147 EKGNYCPICTRCYEDND 163


>gi|355702665|gb|AES02007.1| myeloid/lymphoid or mixed-lineage leukemia [Mustela putorius furo]
          Length = 167

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 53/161 (32%), Positives = 82/161 (50%), Gaps = 8/161 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRCD 111
            + C+ C + +H+ CL+     R L     +W C  C+ C +C R     K +  C +C 
Sbjct: 9   FVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKCR 66

Query: 112 AAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
            +YH  C  P +    +     ++C K  +C SCGS  PG G   +W   ++ C  C +L
Sbjct: 67  NSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKL 126

Query: 169 FVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
           F KGN+CP+C K Y D +  + M+ C  C RWVH +C+ +S
Sbjct: 127 FAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLS 167


>gi|321261507|ref|XP_003195473.1| histone acetyltransferase [Cryptococcus gattii WM276]
 gi|317461946|gb|ADV23686.1| Histone acetyltransferase, putative [Cryptococcus gattii WM276]
          Length = 947

 Score = 70.1 bits (170), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           M+SC +CG+  H  CL       R       W C  C++CE C   GD ++ MFC  CD 
Sbjct: 41  MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKMCEQCEIKGDDSRLMFCDTCDR 100

Query: 113 AYHCYCQHPPHKNVSSGPYLCPK 135
            +H YC +PP      G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123


>gi|156374107|ref|XP_001629650.1| predicted protein [Nematostella vectensis]
 gi|156216655|gb|EDO37587.1| predicted protein [Nematostella vectensis]
          Length = 265

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 1/80 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           ML C SCG+ YH  CL    +   L     W+CP C++C+ CR+ GD NK + C  CD  
Sbjct: 187 MLFCTSCGRHYHGRCLDPAVEITSLVR-MGWQCPDCKVCQGCRQPGDDNKMLVCDVCDRG 245

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           YH +C  PP   +    + C
Sbjct: 246 YHTFCLDPPMTTIPKTGWKC 265


>gi|405122036|gb|AFR96804.1| Myst4 protein [Cryptococcus neoformans var. grubii H99]
          Length = 943

 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           M+SC +CG+  H  CL       R       W C  C+ CE C   GD ++ MFC  CD 
Sbjct: 41  MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCEIKGDDSRLMFCDTCDR 100

Query: 113 AYHCYCQHPPHKNVSSGPYLCPK 135
            +H YC +PP      G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123


>gi|328716148|ref|XP_003245848.1| PREDICTED: hypothetical protein LOC100162709 isoform 3 [Acyrthosiphon
            pisum]
          Length = 1397

 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 48/113 (42%), Gaps = 3/113 (2%)

Query: 37   CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C+LC    +         ++ C  C   YH  CL    +        +W+C  C+ C  C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
            +   D +K +FC  CD  YH YC     + V  G + C +   C SCG + PG
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPG 1392


>gi|58269200|ref|XP_571756.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|57227992|gb|AAW44449.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 940

 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           M+SC +CG+  H  CL       R       W C  C+ CE C   GD ++ MFC  CD 
Sbjct: 41  MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCAIKGDDSRLMFCDTCDR 100

Query: 113 AYHCYCQHPPHKNVSSGPYLCPK 135
            +H YC +PP      G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123


>gi|134114447|ref|XP_774152.1| hypothetical protein CNBG4520 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256785|gb|EAL19505.1| hypothetical protein CNBG4520 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 940

 Score = 69.3 bits (168), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           M+SC +CG+  H  CL       R       W C  C+ CE C   GD ++ MFC  CD 
Sbjct: 41  MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCAIKGDDSRLMFCDTCDR 100

Query: 113 AYHCYCQHPPHKNVSSGPYLCPK 135
            +H YC +PP      G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123


>gi|71018437|ref|XP_759449.1| hypothetical protein UM03302.1 [Ustilago maydis 521]
 gi|46099056|gb|EAK84289.1| hypothetical protein UM03302.1 [Ustilago maydis 521]
          Length = 1283

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 4/84 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWS---SWKCPSCRICEICRRTGDPNKFMFCRRC 110
           ++SC  CG   H +CLK W +     H +   +W+C  C+ CE+C   GD  + MFC RC
Sbjct: 215 LISCYECGSSGHPSCLK-WGRKSTKVHKALSYNWRCIECKKCEVCDDKGDDAQLMFCDRC 273

Query: 111 DAAYHCYCQHPPHKNVSSGPYLCP 134
           D  +H YC  P       G + CP
Sbjct: 274 DRGWHLYCLTPALSKPPKGQWHCP 297


>gi|308801407|ref|XP_003078017.1| trithorax-like (ISS) [Ostreococcus tauri]
 gi|116056468|emb|CAL52757.1| trithorax-like (ISS) [Ostreococcus tauri]
          Length = 2007

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 53/93 (56%), Gaps = 6/93 (6%)

Query: 139 CHSCGSNVPGN-GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
           C SCG  V G+ GL  +   G   C  C +L+ +G +CPVC +V++ +   PMV CD C 
Sbjct: 180 CGSCG--VTGDEGLKSKNAQGR--CSLCQKLYKEGQFCPVCDRVWQWATGDPMVGCDRCD 235

Query: 198 RWVHCQCDGISDEKYLQFQVDG-NLQYRCPTCR 229
            W+H +CD ++ E   + + DG  L Y CP CR
Sbjct: 236 MWIHRECDALAAEVLDREENDGEELAYECPKCR 268


>gi|299749795|ref|XP_002911422.1| histone acetyltransferase mst2 [Coprinopsis cinerea okayama7#130]
 gi|298408603|gb|EFI27928.1| histone acetyltransferase mst2 [Coprinopsis cinerea okayama7#130]
          Length = 2272

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)

Query: 45  NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 104
           N   E+  +M  C  CG+  H  C++  A   D+     W+C  C+ICE+C R GD  +F
Sbjct: 649 NVRTEQPEQMTHCIECGRSGHPTCMQ-LAHIGDVIRSYPWRCIECKICEVCSRKGDDVRF 707

Query: 105 --------------------MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
                               MFC  CD  +H YC +PP      G + CP+      C  
Sbjct: 708 VQLDLLGQTTDNPLLFQEKMMFCDSCDRGWHMYCLNPPMDETPPGKWSCPQ------CSP 761

Query: 145 NVPGNGLSV 153
             P  G+ +
Sbjct: 762 LFPEQGIPM 770


>gi|47225576|emb|CAG12059.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1216

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 2/83 (2%)

Query: 132  LCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPM 190
            +C    +C SCG   PG    + W      C  C +LF  GNYCP+C K Y D++  + M
Sbjct: 1035 VCMTCIRCKSCGV-TPGKSWDIEWNHEKGLCQDCSKLFEMGNYCPICFKCYEDNDYDSQM 1093

Query: 191  VCCDVCQRWVHCQCDGISDEKYL 213
            + C  C  WVH +C+ +++  ++
Sbjct: 1094 MQCGTCNHWVHAKCEDLTEGSHV 1116


>gi|390332559|ref|XP_003723528.1| PREDICTED: uncharacterized protein LOC580929 isoform 1
           [Strongylocentrotus purpuratus]
          Length = 3300

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 35  VMCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           ++CR C  G  E C R  +   +LSC  CG   H +CLK  +  ++    S W+C  C+ 
Sbjct: 208 LVCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKT 265

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGN 149
           C IC + G   + + C  C+  YH  C  PP K +  G + C P  T K    G   PG 
Sbjct: 266 CAICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGR 324

Query: 150 GLSVRWFLG 158
             + R F G
Sbjct: 325 PGTNRLFRG 333


>gi|390332561|ref|XP_003723529.1| PREDICTED: uncharacterized protein LOC580929 isoform 2
           [Strongylocentrotus purpuratus]
          Length = 3278

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 35  VMCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           ++CR C  G  E C R  +   +LSC  CG   H +CLK  +  ++    S W+C  C+ 
Sbjct: 208 LVCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKT 265

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGN 149
           C IC + G   + + C  C+  YH  C  PP K +  G + C P  T K    G   PG 
Sbjct: 266 CAICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGR 324

Query: 150 GLSVRWFLG 158
             + R F G
Sbjct: 325 PGTNRLFRG 333


>gi|118394814|ref|XP_001029767.1| SET domain containing protein [Tetrahymena thermophila]
 gi|89284034|gb|EAR82104.1| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 2437

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 49/179 (27%), Positives = 75/179 (41%), Gaps = 19/179 (10%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR-RTGDPNKFMFCRRCDAAYH 115
           C  CG  YH  C+     NR+      + C +C  C IC  +  D N  + C  C   +H
Sbjct: 603 CFYCGAYYHTQCILQEEDNRN----QHFACDTCEPCSICHGKITDDN--ITCCECKTHFH 656

Query: 116 CYCQHPPHKNVSSGP-----YLCPKHTKCHSCGSNVPG--NGLSVRWFLGYTCCDACGRL 168
             C      ++++       + C    +C  C + +    NG S +    Y  C+ C   
Sbjct: 657 KKCGFSIAYDMNTSDKQIMRWYCESCVQCCICNNKLSDFQNGYSFKDDQIY--CNDCNEQ 714

Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI--SDEKYLQFQVDGNLQYRC 225
             K  YCP+C K +    +  MV C  C  W+H  CD I   D+ Y +++ +   QYRC
Sbjct: 715 LQKKEYCPICKKFWSQETNKDMVQC-TCAMWIHRACDPILKDDKLYDEYKNNLRQQYRC 772



 Score = 56.6 bits (135), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 48/230 (20%), Positives = 86/230 (37%), Gaps = 65/230 (28%)

Query: 52   RRMLSCKSCGKKYHRNCL-------------KNWAQNRDLFHWSSWKCPSCRICEICRRT 98
            + +L C SC + YH  CL             +    NR+      W CP C++C++C + 
Sbjct: 1131 KSLLFCTSCFESYHPYCLMIPGRQEYFKEKMERAMNNRE------WNCPKCQVCKVCSKG 1184

Query: 99   GDPNKFMFCRRCDAAYHCYCQHPP----HKNVSSGPYLCPKHTKCHSCGSNVPGNG---- 150
             +  K +FCR+CDA  H  C+       +++ +   + C     C  C S    +     
Sbjct: 1185 PNITKNLFCRKCDAMVHFECEFKDVQVWNESKNELYWQCSDCFNCAKCSSKSLIDESDKQ 1244

Query: 151  --LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTP------------------- 189
              +++ +   ++ C  CG L     +C  C K  +   + P                   
Sbjct: 1245 LMINLDFTDNFSLCYKCGFLDAYYRFCKFCNKYCKKIPTLPKPANQQPLDDLFQEKSVRL 1304

Query: 190  ----------MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
                      +  C +CQ+  H +C   +  +Y+Q       Q+ C TC+
Sbjct: 1305 KYLDQYLEEVLYQCKLCQQVYHKKCFEKAYPEYIQ-------QFFCYTCK 1347


>gi|409078326|gb|EKM78689.1| hypothetical protein AGABI1DRAFT_107193 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1494

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 1/81 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M++C  CG+  H +C++  ++  D+     WKC  C+ CE+C   GD  + +FC  CD  
Sbjct: 76  MVTCSECGRSGHPSCME-LSKIGDMIRTYPWKCIECKNCELCGDKGDDERILFCDGCDRG 134

Query: 114 YHCYCQHPPHKNVSSGPYLCP 134
           +H  C  PP   +  G + CP
Sbjct: 135 WHFDCMQPPINELPEGEWYCP 155


>gi|340376023|ref|XP_003386533.1| PREDICTED: zinc finger protein ubi-d4-like [Amphimedon
           queenslandica]
          Length = 402

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 49/102 (48%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C   +  N+   +  RM+SC  CG+  H +CL+       +     W+C  C+ 
Sbjct: 298 NPYCDFCLGDDTLNQKSGKPERMVSCADCGRSGHPSCLQFSPSLAAVVLTYRWQCIECKS 357

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C ++ + ++ +FC  CD  YH YC  PP K    G + C
Sbjct: 358 CSLCGKSDNDDQLLFCDDCDRGYHMYCLKPPMKEAPEGSWSC 399


>gi|196002948|ref|XP_002111341.1| hypothetical protein TRIADDRAFT_55230 [Trichoplax adhaerens]
 gi|190585240|gb|EDV25308.1| hypothetical protein TRIADDRAFT_55230 [Trichoplax adhaerens]
          Length = 879

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 40/80 (50%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +LSC  CG   H +CLK   +         W+C  C+ C  C+  G+P+  +FC  CD  
Sbjct: 207 LLSCVDCGNSGHPSCLKYSPELTSRVKTEPWQCIECKTCSYCQNAGNPDNLLFCDACDKG 266

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           +H  C  PP   + SG ++C
Sbjct: 267 FHMECLSPPLTGMPSGRWVC 286


>gi|390332557|ref|XP_786050.3| PREDICTED: uncharacterized protein LOC580929 isoform 3
           [Strongylocentrotus purpuratus]
          Length = 2843

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 35  VMCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           ++CR C  G  E C R  +   +LSC  CG   H +CLK  +  ++    S W+C  C+ 
Sbjct: 208 LVCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKT 265

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGN 149
           C IC + G   + + C  C+  YH  C  PP K +  G + C P  T K    G   PG 
Sbjct: 266 CAICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGR 324

Query: 150 GLSVRWFLG 158
             + R F G
Sbjct: 325 PGTNRLFRG 333


>gi|332030886|gb|EGI70522.1| Zinc finger protein ubi-d4 [Acromyrmex echinatior]
          Length = 527

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 2/118 (1%)

Query: 18  TASKDIAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
           +A K I G +   +  +  C  C     EN+    +  ++SC  CG+  H  CL+  A  
Sbjct: 399 SAEKKIGGGKSSAAQPSPYCDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANM 458

Query: 76  RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                   W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 459 IVSVRKYRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516


>gi|299115811|emb|CBN74374.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 3157

 Score = 67.0 bits (162), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 74/195 (37%), Gaps = 47/195 (24%)

Query: 53   RMLSCKSCGKKYHRNCLKNWAQNRDLFHWS----SWKCPSCRICEIC--------RRTGD 100
            + L C +C   YH  CL      R++   +     W+C  C+ C+ C        RR GD
Sbjct: 1085 QTLVCSTCLMHYHPGCLDPPMTPREIASHAHSKEEWRCDYCQTCQGCGKGNEDEMRRGGD 1144

Query: 101  PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 160
            P   + C      +H +   P  +    GP    K                   W     
Sbjct: 1145 P---LVCHERGKVHHVHLDAP--EGCDPGPRAQEKI------------------WL---- 1177

Query: 161  CCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKYLQFQVDG 219
             C  C   + + NYCP C   Y + ++    + C  C+ WVH  C+G+S  +Y Q  VDG
Sbjct: 1178 -CSPCLDKYKERNYCPKCGVTYDEHDNMVQAIGCAACEFWVHASCEGLSTAEY-QMLVDG 1235

Query: 220  -----NLQYRCPTCR 229
                   +Y CP CR
Sbjct: 1236 KDSWFGAEYLCPVCR 1250



 Score = 44.7 bits (104), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 91   ICEICRRTG--DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 142
            +CE+C  T   D +  + C  CD AYH YC  P       G ++C +   C +C
Sbjct: 1411 VCEVCTETAKSDESLLLMCELCDRAYHTYCLTPSTDKPPEGTWICGQCISCTTC 1464


>gi|340381940|ref|XP_003389479.1| PREDICTED: hypothetical protein LOC100636799 [Amphimedon
           queenslandica]
          Length = 469

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 78/188 (41%), Gaps = 37/188 (19%)

Query: 107 CRRCDAAYH--CYCQHPPHKNVSSG-PYLCPKHTKCHSCGSNVPGN-------------G 150
           C  C+  YH  C   H       +   + C   TKC SC SN+P                
Sbjct: 260 CCECNEWYHEDCLLGHTSRPRTDNKRVWKCVTCTKCISCHSNMPNKIDDSGRSLLSSLLA 319

Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
              +W  GY  C  C  L  KGN CPVC + Y D++  + MV C  C  WVH  C+ ++D
Sbjct: 320 TPTQWSDGYCYCSDCIVLKAKGNSCPVCGECYLDNDFDSKMVQCSQCDNWVHSHCENMTD 379

Query: 210 EKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTE 269
           E+Y                      + DL D+V  + R     ++ +I+S++ A G    
Sbjct: 380 EEY--------------------EILSDLPDSVEYVCRLCIAYNRTMISSIKTALGPIWV 419

Query: 270 DEIFSISP 277
           ++I  ++P
Sbjct: 420 NDINVLTP 427


>gi|196010657|ref|XP_002115193.1| hypothetical protein TRIADDRAFT_28636 [Trichoplax adhaerens]
 gi|190582576|gb|EDV22649.1| hypothetical protein TRIADDRAFT_28636 [Trichoplax adhaerens]
          Length = 167

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 2/92 (2%)

Query: 33  TNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
           +N  C  C   V  N+   ++ R++SC  CG+  H +CLK      D     +W+C  C+
Sbjct: 66  SNRYCDFCLGDVNLNKKTGQSERLISCADCGRSGHPSCLKFTPSLTDTVLMYAWQCIECK 125

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
            C IC  + + +K +FC  CD  YH YC  PP
Sbjct: 126 SCSICGTSDNDDKLLFCDDCDRGYHMYCLSPP 157


>gi|209877148|ref|XP_002140016.1| SET domain-containing protein [Cryptosporidium muris RN66]
 gi|209555622|gb|EEA05667.1| SET domain-containing protein [Cryptosporidium muris RN66]
          Length = 2678

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 7/130 (5%)

Query: 104 FMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTC 161
           ++ C  C   +H  C +P  P        + C     C  CG           W   +T 
Sbjct: 697 YVLCNICHVGFHGSCSNPFIPPLIAEKKYFKCSTCCYCSHCGYRDHNFMDYAAWDTTFTS 756

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDG--ISDEKYLQFQVDG 219
           C  C R F KG YC +C +++  S     + CD C+ W+H +CD   I+  + L+     
Sbjct: 757 CIRCCRGFEKGQYCAICRQIWSSSWDGDWLQCDTCRFWIHTECDTNLINSIEVLK---SP 813

Query: 220 NLQYRCPTCR 229
           ++ Y CP CR
Sbjct: 814 SVSYHCPVCR 823


>gi|390349864|ref|XP_003727298.1| PREDICTED: uncharacterized protein LOC100893490 [Strongylocentrotus
           purpuratus]
          Length = 631

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 40/80 (50%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L CK C  K H +C+K   Q  +    S W+C  C+ C +C   GD +  +FC  CD  
Sbjct: 248 LLVCKDCTIKVHPSCMKYSRQLAERSRLSPWQCIDCKTCHVCNDAGDADTLLFCDSCDKG 307

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           YH  C +P  +    G ++C
Sbjct: 308 YHMACHNPKVEEKPLGRWVC 327


>gi|340381804|ref|XP_003389411.1| PREDICTED: hypothetical protein LOC100638610 [Amphimedon
           queenslandica]
          Length = 2366

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 36/135 (26%), Positives = 57/135 (42%), Gaps = 8/135 (5%)

Query: 8   ARRFESG-------YVATASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKS 59
           AR  E G        V  A K +      ++  N +C  C    EN   ++  ++LSC  
Sbjct: 162 ARMVEKGDLEKKGRLVKVAVKILEPFPSPKARPNSICSFCLGTEENNRDKQYEQLLSCHE 221

Query: 60  CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQ 119
           CG   H +CLK   +  +      W C  C+ C  C  + + +  + C  CD  +H  C 
Sbjct: 222 CGNSGHPSCLKYSKELVEFITAEPWLCLECKKCIYCNASANADDLLICDACDKGFHMVCL 281

Query: 120 HPPHKNVSSGPYLCP 134
            PP  ++  G ++CP
Sbjct: 282 DPPISSLPEGRWVCP 296


>gi|356554670|ref|XP_003545667.1| PREDICTED: uncharacterized protein LOC100810450 [Glycine max]
          Length = 832

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           K YH +CL +  Q +   H   W CPSC IC++C    D NK + C  CD AYH YC  P
Sbjct: 683 KYYHVSCLSS-KQLKSYGH--CWYCPSC-ICQVCLTDKDDNKIVLCDACDHAYHVYCMKP 738

Query: 122 PHKNVSSGPYLCPK 135
           P  ++  G + C K
Sbjct: 739 PQNSIPKGKWFCIK 752


>gi|426199317|gb|EKV49242.1| hypothetical protein AGABI2DRAFT_177299 [Agaricus bisporus var.
           bisporus H97]
          Length = 1474

 Score = 65.9 bits (159), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 1/81 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M++C  CG+  H +C++  ++  ++     WKC  C+ CE+C   GD  + +FC  CD  
Sbjct: 76  MVTCSECGRSGHPSCME-LSKIGEMIRTYPWKCIECKNCELCGDKGDDERILFCDGCDRG 134

Query: 114 YHCYCQHPPHKNVSSGPYLCP 134
           +H  C  PP   +  G + CP
Sbjct: 135 WHFDCMQPPINELPEGEWYCP 155


>gi|157128953|ref|XP_001661565.1| requim, req/dpf2 [Aedes aegypti]
 gi|108872440|gb|EAT36665.1| AAEL011279-PA [Aedes aegypti]
          Length = 433

 Score = 65.9 bits (159), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 53/123 (43%), Gaps = 2/123 (1%)

Query: 13  SGYVATASKDIAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLK 70
           S   + A   +   E+G++  +  C  C     EN+       ++SC  CG+  H  CL+
Sbjct: 301 SNASSNAPSPVPPIEKGRAIPSPYCDFCLGDARENKKTLEPEELVSCSDCGRSGHPTCLQ 360

Query: 71  NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP 130
             A          W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP      G 
Sbjct: 361 FTANMIISVRKYRWQCIECKYCTICGTSDNDDQLLFCDDCDRGYHMYCLSPPLLTPPEGS 420

Query: 131 YLC 133
           + C
Sbjct: 421 WSC 423


>gi|170059917|ref|XP_001865571.1| zinc-finger protein DPF3 [Culex quinquefasciatus]
 gi|167878516|gb|EDS41899.1| zinc-finger protein DPF3 [Culex quinquefasciatus]
          Length = 450

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 51/109 (46%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           E+G++  +  C  C     EN+    A  ++SC  CG+  H  CL+  A          W
Sbjct: 332 EKGRAVPSPYCDFCLGDARENKKTFEAEELVSCSDCGRSGHPTCLQFTANMIISVRKYRW 391

Query: 85  KCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C +C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 392 QCIECKYCTMCGTSDNDDQLLFCDDCDRGYHMYCLSPPLISPPEGSWSC 440


>gi|325192337|emb|CCA26782.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
          Length = 2128

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 85/206 (41%), Gaps = 26/206 (12%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
            ++T     LC V  + G   A  +L C  C +  H  C+ +     + F  +++ CP+C 
Sbjct: 1162 AHTYAWLDLCVVCASAG--SASSLLFCMDCAQTVHTFCVLSDTSAWNKF--NAFHCPNCL 1217

Query: 91   ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
             C +C  T      + C +C    H  C  PP  + S+    C +  +C  C +      
Sbjct: 1218 QCRVCDET---EGVLACPKCGKGAHGACLDPPIDDPSA--IYCGECVECKHCQTPGTPRT 1272

Query: 151  LSVRWFLGYTCCD-----ACGRLFVK--GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 203
             S+   L   CC         R F K   + CP+CL+     E+   + CD C+RW H  
Sbjct: 1273 YSLVPDLCLQCCSRQERWKSARQFTKPLADKCPICLETCNVKEA---IHCDACERWTHST 1329

Query: 204  CDGISDEKYLQFQVDGNLQYRCPTCR 229
            CD ++         D +  Y CP+CR
Sbjct: 1330 CDPMAFR-------DKDALYVCPSCR 1348



 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 60/140 (42%), Gaps = 29/140 (20%)

Query: 91  ICEICRR-TGDPNKFMF-CRRCDAAYHCYCQHPPHKNV--------------SSGPYLCP 134
           IC  CR  + D N+ +  C+RC+  +H  C  PP+ +                S P+ C 
Sbjct: 740 ICSGCRLPSMDSNQSLLSCQRCNKRFHATCTDPPYVDTITVWDPHDHSELDHISLPFTCA 799

Query: 135 KHTKCHSCGSNVPGNGLSVRWFLGYTC---CDACGRLFVKGNYCPVCLKVYRDSES-TP- 189
               C  C  N   N   VRW L  T    C AC  L+    +C +C   YR  E+ TP 
Sbjct: 800 DCDVCAGCNQNRTENPW-VRWRLPLTIVSLCGACELLYRSDQFCSIC---YRALEALTPR 855

Query: 190 ----MVCCDVCQRWVHCQCD 205
               ++ C  C+ +VH +C+
Sbjct: 856 KELSLLSCSSCRHFVHPECE 875


>gi|449543136|gb|EMD34113.1| hypothetical protein CERSUDRAFT_141605, partial [Ceriporiopsis
           subvermispora B]
          Length = 260

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 39/80 (48%), Gaps = 1/80 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M+SC  CG+  H  CL + A   D+     WKC  C+ CEIC    D N+ MFC  CD  
Sbjct: 1   MVSCAECGRSAHPTCL-DLADIGDVMRSYDWKCMECKNCEICHSKEDDNRMMFCDFCDRG 59

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           +H  C  PP      G + C
Sbjct: 60  WHMDCLDPPLSEAPPGKWHC 79


>gi|260818085|ref|XP_002603915.1| hypothetical protein BRAFLDRAFT_131253 [Branchiostoma floridae]
 gi|229289239|gb|EEN59926.1| hypothetical protein BRAFLDRAFT_131253 [Branchiostoma floridae]
          Length = 479

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 51/119 (42%), Gaps = 5/119 (4%)

Query: 18  TASKDIAGEEQGQSNTNVMCRLCFVGENEGCERA---RRMLSCKSCGKKYHRNCLKNWAQ 74
           T   D   E   +S ++  C  C +     C R+     +L CK C  K H +C++  A 
Sbjct: 211 THGPDFEQEPLVKSPSDPRCDYCLLTSE--CNRSGEEEDLLICKDCNAKAHPSCMRYSAD 268

Query: 75  NRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                  S W+C  C+ C IC  +GD    +FC  CD  YH  C  P   +   G ++C
Sbjct: 269 LARRSRMSPWQCIDCKTCYICDDSGDAETLLFCDACDKGYHMACHEPAVTHKPLGKWVC 327


>gi|443897765|dbj|GAC75104.1| hypothetical protein PANT_14d00040 [Pseudozyma antarctica T-34]
          Length = 1176

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 47/104 (45%), Gaps = 7/104 (6%)

Query: 36  MCRLCFVGENEGCERARRML-SCKSCGKKYHRNCLKNWAQN----RDLFHWSSWKCPSCR 90
           +C  C    +   E   ++L SC  CG   H  CL+ W +     R    +  W+C  C+
Sbjct: 91  LCAFCQQPADRPKENTPKLLISCFECGSSGHPACLR-WGRKPTKVRSALSYE-WRCIECK 148

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
            CEIC   GD  + MFC  CD  +H YC  PP      G + CP
Sbjct: 149 KCEICCDKGDDAQLMFCDGCDRGWHLYCLSPPLAKPPKGQWQCP 192


>gi|66359990|ref|XP_627173.1| multidomain chromatinic protein with the following architecture: 3x
           PHD-bromo-3xPHD-SET domain and associated cysteine
           cluster at the C-terminus [Cryptosporidium parvum Iowa
           II]
 gi|46228588|gb|EAK89458.1| multidomain chromatinic protein with the following architecture: 3x
           PHD-bromo-3xPHD-SET domain and associated cysteine
           cluster at the C-terminus [Cryptosporidium parvum Iowa
           II]
          Length = 2244

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 55/130 (42%), Gaps = 3/130 (2%)

Query: 103 KFMFCRRCDAAYHCYC--QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 160
           +F+ C  C   YH  C     P        + C    KC  CG    G      W   ++
Sbjct: 538 EFVVCGTCGICYHGSCGNSFVPPLLFGGNNFNCSNCCKCIHCGYRDNGFMDYASWDSTFS 597

Query: 161 CCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN 220
            C  C + F +G +C +C K++  S     + CD+C+ WVH  CD   +E  ++F  +  
Sbjct: 598 SCIRCCKGFERGQFCSICRKIWTSSWEGEWLQCDICKFWVHYDCDKDLNEP-IEFYSNVK 656

Query: 221 LQYRCPTCRG 230
             Y CP CR 
Sbjct: 657 NLYNCPACRS 666


>gi|391344898|ref|XP_003746731.1| PREDICTED: zinc finger protein DPF3-like [Metaseiulus occidentalis]
          Length = 470

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 47/101 (46%), Gaps = 2/101 (1%)

Query: 35  VMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           + C  C     EN+  ++   M++C  CG+  H  CL+      +      W+C  C+ C
Sbjct: 348 LYCDFCLGTESENKKTKQPEEMVTCADCGRSGHPTCLQFTDVMTNNVKKYRWQCIECKTC 407

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +C  + + ++ +FC  CD  YH YC  PP K    G + C
Sbjct: 408 TLCGTSENDDQMLFCDDCDRGYHMYCLSPPLKEPPEGSWSC 448


>gi|196007802|ref|XP_002113767.1| hypothetical protein TRIADDRAFT_26973 [Trichoplax adhaerens]
 gi|190584171|gb|EDV24241.1| hypothetical protein TRIADDRAFT_26973 [Trichoplax adhaerens]
          Length = 443

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 64/141 (45%), Gaps = 9/141 (6%)

Query: 8   ARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYH 65
           +R   S ++  A   +A  +   +N+   C +CF  E  N+  E    ++ C  CG   H
Sbjct: 298 SRNSSSDHIGNARGKLAANKTVSTNS---CGVCFASEYVNDLGE-IEELIKCSQCGSLTH 353

Query: 66  RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKN 125
             CL+   +   +     W+C  C+ C  C    D +K MFC RCD  YH +C      +
Sbjct: 354 PTCLELTPEMVKVIQTYHWQCMDCKTCTACSDPYDEDKMMFCDRCDRGYHTFCVG--LDS 411

Query: 126 VSSGPYLCPKHTKCHSCGSNV 146
           + SG ++CP  T+ HS   N 
Sbjct: 412 IPSGNWICPSCTQ-HSGNKNA 431


>gi|307169876|gb|EFN62385.1| Zinc finger protein ubi-d4 A [Camponotus floridanus]
          Length = 528

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 50/112 (44%), Gaps = 2/112 (1%)

Query: 24  AGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW 81
           AG +   +  +  C  C     EN+    +  ++SC  CG+  H  CL+  A        
Sbjct: 406 AGGKSSAAQPSPYCDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRK 465

Query: 82  SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 466 YRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 517


>gi|118344068|ref|NP_001071860.1| zinc finger protein [Ciona intestinalis]
 gi|70571572|dbj|BAE06775.1| zinc finger protein [Ciona intestinalis]
          Length = 399

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 10/110 (9%)

Query: 30  QSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS----S 83
           Q+ +N  C  C     EN+    +  ++SC  CG+  H  CL    Q  D+   +    S
Sbjct: 273 QTQSNNYCDFCLGDADENKKTGESEELVSCSDCGRSGHPTCL----QFTDIMTMNVKKYS 328

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           W+C  C+ C +C  + +  + +FC  CD  YH YC  P  +N   G ++C
Sbjct: 329 WQCIECKSCHVCGTSDNDEQLLFCDDCDRGYHMYCLQPRMENPPEGSWIC 378


>gi|281205681|gb|EFA79870.1| hypothetical protein PPL_06690 [Polysphondylium pallidum PN500]
          Length = 486

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 11/109 (10%)

Query: 132 LCPKHTKCHSCGSNVPGNGLSVRWFLGYT---CCDACGRLFVKGNYCPVCLKVYRDSEST 188
           + P+  +C  CGS  PG G + +W  G +    C++CG   +K   CP+C K+Y  ++ +
Sbjct: 200 MVPRVNRC-VCGSTTPGKGPTCKWRKGPSGEVLCNSCGLQNMKKPKCPLCGKMYNKNKDS 258

Query: 189 PMVC-------CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
              C       CD C +WV  +CDGI D           L Y CP CR 
Sbjct: 259 STECDSDEWIRCDDCSQWVMTECDGIKDISLYDDTQPNPLHYSCPKCRN 307


>gi|195132645|ref|XP_002010753.1| GI21713 [Drosophila mojavensis]
 gi|193907541|gb|EDW06408.1| GI21713 [Drosophila mojavensis]
          Length = 2287

 Score = 64.3 bits (155), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 2/98 (2%)

Query: 50   RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
            R    + C SC +K H +C++   +        +W+C  C+ C  CR +  P K ++C +
Sbjct: 1971 RPEAFIRCYSCRRKVHPSCIEMPQRMVGRVRNYNWQCAECKCCIKCRSSQQPGKMLYCEQ 2030

Query: 110  CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            CD  YH YC     K V  G + C + + C  CG+  P
Sbjct: 2031 CDRGYHIYCLG--IKTVPEGRWSCERCSICMRCGATRP 2066


>gi|198431091|ref|XP_002124209.1| PREDICTED: similar to monocytic leukemia zinc finger protein [Ciona
           intestinalis]
          Length = 2554

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 45/99 (45%), Gaps = 1/99 (1%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C+   E     +   +LSC  CG   H  C+K  +        S W+C  C+ C +
Sbjct: 257 ICSYCYGTAECNKTGKQEELLSCADCGSSGHPICMKLSSDLVPKIRGSRWQCIECKSCRV 316

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C   G+ +  +FC  CD  +H  C +PP   +  G ++C
Sbjct: 317 CGSKGNADNLLFCDSCDRGFHMECCNPPLLKMPKGSFIC 355


>gi|327259541|ref|XP_003214595.1| PREDICTED: zinc finger protein DPF3-like [Anolis carolinensis]
          Length = 398

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 49/107 (45%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G   +N  C  C  G N  +   R   ++SC  CG+  H  CL+  A   +      W+C
Sbjct: 274 GAIISNNYCDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTANMTEAVKTYQWQC 333

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 334 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLNPPVSEPPEGSWSC 380


>gi|427778555|gb|JAA54729.1| Putative d4 [Rhipicephalus pulchellus]
          Length = 487

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 2/105 (1%)

Query: 31  SNTNVMCRLCFV--GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
           +N +  C  C    GEN+   +   ++SC  CG+  H +CL+             W+C  
Sbjct: 346 ANPSPYCDFCLGDNGENKKTRQPEELVSCSDCGRSAHPSCLQFTPNMTVSVKKYRWQCIE 405

Query: 89  CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 406 CKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLQPPLSEPPEGLWSC 450


>gi|194762684|ref|XP_001963464.1| GF20276 [Drosophila ananassae]
 gi|190629123|gb|EDV44540.1| GF20276 [Drosophila ananassae]
          Length = 2062

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 4/129 (3%)

Query: 25   GEEQGQSNTNVMCRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
            G E  Q++  + C +C   ++    +     + C SC ++ H +C+    +        +
Sbjct: 1720 GRESSQASRLLNCGVCLRSQHRNARDMPEVFIRCYSCRRRVHPSCIDMPQRMVGRVRNYN 1779

Query: 84   WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
            W+C  C+ C  CR    P K +FC +CD  YH YC     + V  G + C +   C  CG
Sbjct: 1780 WQCSGCKCCIKCRSNQRPGKMLFCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCMRCG 1837

Query: 144  SNVPGNGLS 152
            +  P  GLS
Sbjct: 1838 ATRP-EGLS 1845


>gi|156384761|ref|XP_001633301.1| predicted protein [Nematostella vectensis]
 gi|156220369|gb|EDO41238.1| predicted protein [Nematostella vectensis]
          Length = 315

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C   V EN+   R   +LSC  CG+  H +CL+   +         W+C  C+ 
Sbjct: 189 NSYCDFCLGDVSENKKSGRPEELLSCSDCGRSGHPSCLQFTPKLTYNVKKYRWQCIECKS 248

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C  + + ++ +FC  CD  YH YC +PP      G ++C
Sbjct: 249 CTLCGTSDNDDQLLFCDDCDRGYHMYCLNPPMDKPPEGHWMC 290


>gi|269785119|ref|NP_001161515.1| Cer-d4 protein [Saccoglossus kowalevskii]
 gi|268054007|gb|ACY92490.1| Cer-d4 [Saccoglossus kowalevskii]
          Length = 392

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 48/103 (46%), Gaps = 2/103 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C     EN+    +  ++SC  CG+  H +CL+   +         W+C
Sbjct: 286 GIAAPNNYCDFCLGDASENKKTGVSEELISCSDCGRSGHPSCLQFTTKMTSNVKKYRWQC 345

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
             C+ C +C  + + ++ +FC  CD  YH YC +PP  +   G
Sbjct: 346 IECKSCHLCGTSDNDDQLLFCDDCDRGYHMYCLNPPMSHPPEG 388


>gi|156353196|ref|XP_001622960.1| predicted protein [Nematostella vectensis]
 gi|156354354|ref|XP_001623361.1| predicted protein [Nematostella vectensis]
 gi|156209598|gb|EDO30860.1| predicted protein [Nematostella vectensis]
 gi|156210052|gb|EDO31261.1| predicted protein [Nematostella vectensis]
          Length = 132

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 4/113 (3%)

Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 160
            + C +C   YH  C  P +  V  G    ++C +  +C  CGS   G      W   +T
Sbjct: 1   LLMCDKCQRGYHVDCLGPSYPVVPEGSEDTWICGRCAQCKLCGSKSAGEDPEAVWMHEFT 60

Query: 161 CCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 212
            C  CG  +  GNYCP+C K Y D++  + M+ C+ CQ WVH  C  I+ ++Y
Sbjct: 61  HCYDCGTAWDNGNYCPICEKCYSDNDFDSKMMHCNDCQHWVHASCQNINPDEY 113


>gi|71895933|ref|NP_001025643.1| PHD finger protein 10 [Xenopus (Silurana) tropicalis]
 gi|60550967|gb|AAH91612.1| PHD finger protein 10 [Xenopus (Silurana) tropicalis]
          Length = 430

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 11/141 (7%)

Query: 10  RFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNC 68
           R E    +  SK ++G  + +SN N +C +C  G++   + R+ R++ C  C    H +C
Sbjct: 286 RKEGSKRSVLSKSVSGY-KPKSNPNAICGICLKGKDANKKGRSERLIHCSQCDNSGHPSC 344

Query: 69  LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSS 128
           L    +   +     W+C  C+ C IC +     + MFC  CD  YH +C       + S
Sbjct: 345 LDMSPELVTVIKKYPWQCMECKTCIICGQPHHEEEMMFCDTCDRGYHTFCVG--LGALPS 402

Query: 129 GPYLCPKHTKCHSCGSNVPGN 149
           G ++C        C  NVP  
Sbjct: 403 GRWIC-------DCCQNVPST 416


>gi|84997431|ref|XP_953437.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65304433|emb|CAI76812.1| hypothetical protein, conserved [Theileria annulata]
          Length = 3595

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 58/134 (43%), Gaps = 12/134 (8%)

Query: 105  MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 163
            + C  C  + H  C +P   N+     + C   T+C SCG         + W L +  C 
Sbjct: 1141 VVCVSCSISAHRSCCYPMVPNLLFIESWKCDYCTQCISCGYRDHNTADYLNWGLFFFFCL 1200

Query: 164  ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD--- 218
             C  L  + NYC +C KV+   D  S   V C+ C+ W+H +CD ++     +   D   
Sbjct: 1201 KCWELLERSNYCGICYKVWTHFDGSSQKWVQCEGCKLWIHIECDDLA-----RLITDCPS 1255

Query: 219  -GNLQYRCPTCRGE 231
              N  YRC  CR E
Sbjct: 1256 SRNQNYRCCICRSE 1269


>gi|290973625|ref|XP_002669548.1| predicted protein [Naegleria gruberi]
 gi|284083097|gb|EFC36804.1| predicted protein [Naegleria gruberi]
          Length = 370

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 53/100 (53%), Gaps = 9/100 (9%)

Query: 40  CFV--GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRR 97
           C+V    NEG ++ + ML+CK+CG K H  C +N   +  L +  SW+C  C+ CE+C+ 
Sbjct: 32  CYVCHKTNEGTDK-QTMLTCKTCGLKVHAGCYEN--THFSLKYADSWECVDCKKCEVCKD 88

Query: 98  TG--DPNKFMFCRRCDAAYHCYCQHPPHKNVSSG--PYLC 133
               + N+ + C RCD  YH  C     + V  G  P+ C
Sbjct: 89  ANYREGNEILMCNRCDKGYHQLCCSEKFRTVPEGDKPWFC 128


>gi|195345773|ref|XP_002039443.1| GM22724 [Drosophila sechellia]
 gi|194134669|gb|EDW56185.1| GM22724 [Drosophila sechellia]
          Length = 1889

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 3/112 (2%)

Query: 37   CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C +C   ++    +     + C +C K+ H +C+   A+        +W+C  C+ C  C
Sbjct: 1568 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPARMVGRVRNYNWQCAGCKCCIKC 1627

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            R +  P K +FC +CD  YH YC     K V  G + C +   C  CG+  P
Sbjct: 1628 RSSQRPGKMLFCEQCDRGYHIYCLG--LKTVPDGRWSCERCCFCVRCGATKP 1677


>gi|405964745|gb|EKC30194.1| Histone acetyltransferase MYST4 [Crassostrea gigas]
          Length = 387

 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 40/80 (50%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C+ C  K H +C+   A        S W+C  C+ C +C+ +GDP+  +FC  CD  
Sbjct: 163 ILVCQDCNAKAHPSCMGYNAILARRTLESPWQCIDCKTCTVCQDSGDPDTMLFCDACDKG 222

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           YH  C  P  ++   G + C
Sbjct: 223 YHMTCHEPAIEDKPQGKWEC 242


>gi|194864252|ref|XP_001970846.1| GG10866 [Drosophila erecta]
 gi|190662713|gb|EDV59905.1| GG10866 [Drosophila erecta]
          Length = 497

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 53/120 (44%), Gaps = 2/120 (1%)

Query: 16  VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
            ATA+K     E+  +  +  C  C     EN+       ++SC  CG+  H +CL+  A
Sbjct: 367 FATANKVKQRVERDIAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTA 426

Query: 74  QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                     W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 427 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLMTPPEGSWSC 486


>gi|37362278|gb|AAQ91267.1| requiem, apoptosis response zinc finger gene [Danio rerio]
          Length = 368

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H +CL+  A          W+C
Sbjct: 240 GLALPNDYCDFCLGDSSMNQKTGQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQC 299

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 300 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLSPPMSDPPEGSWSC 346


>gi|412986144|emb|CCO17344.1| predicted protein [Bathycoccus prasinos]
          Length = 1990

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 138 KCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
           +C SCG +     L  + F     C  C ++  +G YCP C KV+  +   PM+ CD C+
Sbjct: 223 QCASCGVSCARKELDAKQF-----CLLCAKMHGEGQYCPCCGKVWHYANCGPMIQCDTCE 277

Query: 198 RWVHCQCDGISDEKYLQFQVDG--------NLQYRCPTC 228
            WVH  CD  + E  L+ + D          + Y CPTC
Sbjct: 278 MWVHDLCDATAAE-ILKKEADALKEGREEEEIPYNCPTC 315


>gi|156380495|ref|XP_001631804.1| predicted protein [Nematostella vectensis]
 gi|156218850|gb|EDO39741.1| predicted protein [Nematostella vectensis]
          Length = 273

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 3/102 (2%)

Query: 30  QSNTNVMCRLCFVGENEGCERA--RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           ++  N +C  C +G  E  +      ++SC  CG   H +CLK             W+C 
Sbjct: 172 KATPNPLCGFC-LGPAESNKEGDYEELISCADCGNSGHPSCLKYSPALTARVQSEPWQCI 230

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
            C+ C +CR  GD +  +FC  CD  +H  C  PP   + +G
Sbjct: 231 ECKTCSVCRDAGDADNLLFCDMCDRGFHMECLDPPMSEMPTG 272


>gi|71029596|ref|XP_764441.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351395|gb|EAN32158.1| hypothetical protein, conserved [Theileria parva]
          Length = 3588

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 59/134 (44%), Gaps = 12/134 (8%)

Query: 105  MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 163
            + C  C  + H  C +P   N+     + C   T+C SCG         + W L +  C 
Sbjct: 1176 VVCVSCSTSAHRSCCYPMVPNLLFIESWKCDYCTQCISCGYRDITCADYLNWGLFFFFCL 1235

Query: 164  ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD--- 218
             C  L  + NYC +C KV+   D+ S   V C+ C+ W+H +CD ++     +   D   
Sbjct: 1236 KCWELLERSNYCGICYKVWTNFDTSSQKWVQCEGCKLWIHIECDDLA-----RLITDCPS 1290

Query: 219  -GNLQYRCPTCRGE 231
              N  YRC  CR E
Sbjct: 1291 SRNQNYRCLICRSE 1304


>gi|320167672|gb|EFW44571.1| MYST histone acetyltransferase 2 [Capsaspora owczarzaki ATCC 30864]
          Length = 570

 Score = 63.2 bits (152), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 46/102 (45%), Gaps = 3/102 (2%)

Query: 34  NVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           + +C LC   ++  C     ++ C +C    H +CL              W+C +C+ C 
Sbjct: 100 DAVCALC---QSATCAARDSLIMCSNCSDCAHPSCLNLTKAAAAKVKTYPWRCSNCKTCS 156

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           +C + G   K MFC  CD   H +C  PP K+ S   + CP+
Sbjct: 157 VCDKAGHEKKMMFCITCDRGTHSFCAQPPMKDPSEVAWSCPE 198


>gi|62177137|ref|NP_997861.2| D4, zinc and double PHD fingers family 2, like [Danio rerio]
 gi|62026699|gb|AAH92130.1| D4, zinc and double PHD fingers family 2, like [Danio rerio]
 gi|182892074|gb|AAI65789.1| Dpf2l protein [Danio rerio]
          Length = 405

 Score = 63.2 bits (152), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H +CL+  A          W+C
Sbjct: 277 GLALPNDYCDFCLGDSSMNQKTGQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQC 336

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 337 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLSPPMSDPPEGSWSC 383


>gi|291221144|ref|XP_002730583.1| PREDICTED: d4-like [Saccoglossus kowalevskii]
          Length = 493

 Score = 63.2 bits (152), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 7/107 (6%)

Query: 33  TNVMCRLCFVGENEGCER---ARRMLSCKSCGKKYHRNCLKNWAQNRDL-FHWSSWKCPS 88
           T+  C  C +     C R      +L CK C  K H +C+ N+ Q   +    S W+C  
Sbjct: 198 TDTKCDYCLLTAE--CNRNGKTEDLLVCKDCNAKAHPSCM-NYTQELAVRARMSPWQCFD 254

Query: 89  CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           C+ C +C  +GD +  +FC  CD  YH  C  P      +G ++C K
Sbjct: 255 CKTCCVCGDSGDADNLLFCDACDKGYHMACHTPQILRKPTGKWMCIK 301


>gi|405965654|gb|EKC31016.1| Histone acetyltransferase MYST4 [Crassostrea gigas]
          Length = 2037

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 1/100 (1%)

Query: 35  VMCRLCFVGENEGCERARR-MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           ++C  C   EN+  +     ++SC  CG   H +CLK   +  +      W+C  C+ C 
Sbjct: 207 LICCFCLGDENKNRDGVPEDLISCAECGNSGHPSCLKFSPELTETVKKLRWQCIDCKTCS 266

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            C+++G  +  +FC  CD  +H  C  PP      G + C
Sbjct: 267 FCQKSGREDNMLFCDLCDRGFHMECCDPPLSKAPKGKWKC 306


>gi|312377713|gb|EFR24474.1| hypothetical protein AND_10892 [Anopheles darlingi]
          Length = 539

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           E+ ++  +  C  C     EN+       ++SC  CG+  H +CL+  A          W
Sbjct: 420 EKSRAVPSPYCDFCLGDARENKKTLEPEELVSCSDCGRSGHPSCLQFTANMIISVRKYRW 479

Query: 85  KCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C IC  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 480 QCIECKYCTICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVSPPEGSWSC 528


>gi|45382857|ref|NP_989970.1| zinc finger protein DPF3 [Gallus gallus]
 gi|18202301|sp|P58270.1|DPF3_CHICK RecName: Full=Zinc finger protein DPF3; AltName: Full=Zinc finger
           protein cer-d4
 gi|14010362|gb|AAK51968.1|AF362754_1 cer-d4 [Gallus gallus]
          Length = 427

 Score = 62.8 bits (151), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C  C+ 
Sbjct: 308 NNYCDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKS 367

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 368 CSLCGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 409


>gi|449502401|ref|XP_004174505.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein DPF3
           [Taeniopygia guttata]
          Length = 392

 Score = 62.8 bits (151), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C  C+ 
Sbjct: 273 NNYCDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKS 332

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 333 CSLCGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 374


>gi|345480756|ref|XP_001605917.2| PREDICTED: zinc finger protein DPF3 [Nasonia vitripennis]
          Length = 551

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 2/113 (1%)

Query: 23  IAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFH 80
           + G++   +  +  C  C     EN+    +  ++SC  CG+  H  CL+  A       
Sbjct: 428 MDGKKAKPAQPSPYCDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVR 487

Query: 81  WSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
              W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 488 KYRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLTSPPEGSWSC 540


>gi|195048475|ref|XP_001992534.1| GH24152 [Drosophila grimshawi]
 gi|193893375|gb|EDV92241.1| GH24152 [Drosophila grimshawi]
          Length = 2464

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 2/93 (2%)

Query: 55   LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
            + C SC ++ H +C++   +        +W+C  C+ C  CR +  P K ++C +CD  Y
Sbjct: 2085 IRCYSCRRRVHPSCIEMPQRMVGRVRNYNWQCAECKCCIKCRSSQQPGKMLYCEQCDRGY 2144

Query: 115  HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            H YC     K V  G + C + + C  CG+  P
Sbjct: 2145 HIYCLG--VKTVPEGRWSCERCSICMRCGATRP 2175


>gi|355733003|gb|AES10880.1| myeloid/lymphoid or mixed-lineage leukemia 3 isoform 2 [Mustela
           putorius furo]
          Length = 102

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 2/95 (2%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
           +C V  + G     R+L+C  CG+ YH  C+    +   +     W+C  C +CE C + 
Sbjct: 8   MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 65

Query: 99  GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            DP + + C  CD +YH YC  PP + V  G + C
Sbjct: 66  SDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKC 100


>gi|256070756|ref|XP_002571708.1| requim req/dpf2 [Schistosoma mansoni]
 gi|350646387|emb|CCD58946.1| requim, req/dpf2, putative [Schistosoma mansoni]
          Length = 450

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C   E  N+   R+  ML C  CG+  H +CL+         H   W+C  C+ C +
Sbjct: 331 CDFCLGDESLNKKTGRSEDMLRCSDCGRFAHFSCLQFTPNMITSVHTYRWQCIECKTCWL 390

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + +  + +FC  CD  YH YC +PP      G + C
Sbjct: 391 CGTSENDEQMLFCDDCDRGYHMYCLNPPLSEPPEGSWSC 429


>gi|91094021|ref|XP_967377.1| PREDICTED: similar to d4 CG2682-PB [Tribolium castaneum]
          Length = 525

 Score = 62.4 bits (150), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+       ++SC  CG+  H +CL      +      SW+C  C+ C +
Sbjct: 418 CDFCLGDSRENKKTGVMEELVSCSDCGRSGHPSCLLFTENMKISVKKYSWQCIECKCCSV 477

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 478 CGNSDNDDQLLFCDDCDRGYHMYCLSPPLTDPPEGSWSC 516


>gi|162287269|ref|NP_001104639.1| zinc finger protein DPF3 [Danio rerio]
 gi|215275221|sp|A9LMC0.1|DPF3_DANRE RecName: Full=Zinc finger protein DPF3
 gi|159906528|gb|ABX10892.1| D4 zinc and double PHD fingers family 3 protein [Danio rerio]
          Length = 391

 Score = 62.4 bits (150), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C    G N    +A  ++SC  CG+  H +CL+             W+C  C+ 
Sbjct: 273 NDYCDFCLGDSGSNRKTGQAEELVSCSDCGRSGHPSCLQFTDNMMQAVRTYQWQCIECKS 332

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 333 CSLCGTSENDDQLLFCDDCDRGYHMYCLKPPMTQPPEGSWSC 374


>gi|348523632|ref|XP_003449327.1| PREDICTED: zinc finger protein neuro-d4-like [Oreochromis
           niloticus]
          Length = 381

 Score = 62.4 bits (150), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 53/124 (42%), Gaps = 6/124 (4%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 260 GSVIANGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 317

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTKCHSCGS 144
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +  K  + G 
Sbjct: 318 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSCHLCLRQLKEKASGI 377

Query: 145 NVPG 148
             PG
Sbjct: 378 EDPG 381


>gi|357436505|ref|XP_003588528.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
 gi|355477576|gb|AES58779.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
          Length = 973

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 47/94 (50%), Gaps = 6/94 (6%)

Query: 139 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
           C  CG ++P N       L   G   C  C RL    +YC +C KV+  S+S   V CD 
Sbjct: 368 CDECGLDLPFNMSKKTKDLTPGGQLLCKTCARLMKSKHYCGICKKVWNQSDSGSWVRCDG 427

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           C+ WVH +CD IS    + F+  G+  Y CP C+
Sbjct: 428 CKVWVHAECDKISS---ILFKNLGSTDYFCPACK 458


>gi|14010360|gb|AAK51967.1|AF362753_1 cer-d4 [Gallus gallus]
          Length = 378

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C  C+ 
Sbjct: 259 NNYCDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKS 318

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 319 CSLCGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 360


>gi|353523840|ref|NP_001088070.2| PHD finger protein 10 [Xenopus laevis]
 gi|296439270|sp|Q63ZP1.2|PHF10_XENLA RecName: Full=PHD finger protein 10
          Length = 506

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 2/110 (1%)

Query: 10  RFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNC 68
           R E    +  SK ++G  + +S  N +C +C  G++   + R+ R++ C  C    H +C
Sbjct: 362 RKEGSKRSVLSKSVSGY-KPKSIPNAICGICLKGKDANKKGRSERLIHCSQCDNSGHPSC 420

Query: 69  LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
           L   A+   +     W+C  C+ C IC +     + MFC  CD  YH +C
Sbjct: 421 LDMSAELVAVIKKYPWQCMECKTCIICGQPHHEEEMMFCDTCDRGYHTFC 470


>gi|26336851|dbj|BAC32109.1| unnamed protein product [Mus musculus]
          Length = 440

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414

Query: 117 YCQHPPHKNVSSGPYLC 133
           +C  P  K+V +  + C
Sbjct: 415 FCLQPVMKSVPTNGWKC 431


>gi|301769749|ref|XP_002920299.1| PREDICTED: zinc finger protein DPF3-like [Ailuropoda melanoleuca]
          Length = 633

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 509 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 568

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 569 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 615


>gi|443694019|gb|ELT95254.1| hypothetical protein CAPTEDRAFT_227914 [Capitella teleta]
          Length = 675

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 61/133 (45%), Gaps = 13/133 (9%)

Query: 8   ARRFESG----YVATASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGK 62
           AR+  SG    +  T S D AG     S+    C +C    +     +  +M++C  C  
Sbjct: 532 ARKKVSGSHTEWAPTISSDSAG-----SDVVPKCAVCGGTSQRNKDNQEEQMVTCIKCHT 586

Query: 63  KYH-RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
             H   CL+  A+   + H  SW+C  C++C  C   GD  K MFC  CD  +H +C   
Sbjct: 587 PAHPSTCLELSAEMIPIIHTYSWQCMDCKMCAKCNDAGDEEKMMFCDHCDRGFHTFCLG- 645

Query: 122 PHKNVSSGPYLCP 134
             + + +G ++CP
Sbjct: 646 -LRVIPTGRWVCP 657


>gi|380014950|ref|XP_003691477.1| PREDICTED: zinc finger protein DPF3-like [Apis florea]
          Length = 527

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+    +  ++SC  CG+  H  CL+  A          W+C  C+ C I
Sbjct: 418 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 477

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 478 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516


>gi|350413485|ref|XP_003490006.1| PREDICTED: zinc finger protein DPF3-like [Bombus impatiens]
          Length = 468

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+    +  ++SC  CG+  H  CL+  A          W+C  C+ C I
Sbjct: 359 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 418

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 419 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 457


>gi|393908955|gb|EJD75261.1| hypothetical protein LOAG_17567 [Loa loa]
          Length = 371

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 49/104 (47%), Gaps = 2/104 (1%)

Query: 32  NTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC 89
           + + +C LC     +N+   +  +++SC  CG+  H +CLK             W+C  C
Sbjct: 248 DVSTVCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIEC 307

Query: 90  RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           + C IC  + + ++ +FC  CD  +H YC  PP      G + C
Sbjct: 308 KSCAICGFSDNDDQLLFCDDCDRGFHLYCLRPPLSQAPEGEWSC 351


>gi|388580883|gb|EIM21195.1| hypothetical protein WALSEDRAFT_38994 [Wallemia sebi CBS 633.66]
          Length = 808

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 6/84 (7%)

Query: 54  MLSCKSCGKKYHRNCL---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRC 110
           M++C  C    H  CL       QN   ++WS   C  C+ CE+C+  G+ +  +FC  C
Sbjct: 1   MITCGQCQSSAHPTCLHFTPTLTQNTLSYNWS---CVDCKGCEVCKNKGNEDDIIFCDLC 57

Query: 111 DAAYHCYCQHPPHKNVSSGPYLCP 134
           D  +H +C +PP     +G + CP
Sbjct: 58  DRGWHMHCLNPPMNEPPAGDFACP 81


>gi|198455671|ref|XP_001357517.2| GA15428 [Drosophila pseudoobscura pseudoobscura]
 gi|198135345|gb|EAL24641.2| GA15428 [Drosophila pseudoobscura pseudoobscura]
          Length = 507

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 42/90 (46%)

Query: 44  ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
           EN+       ++SC  CG+  H +CL+  A          W+C  C+ C IC  + + ++
Sbjct: 407 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 466

Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +FC  CD  YH YC  PP      G + C
Sbjct: 467 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 496


>gi|325181008|emb|CCA15418.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 639

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 30/59 (50%)

Query: 172 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
           G YCPVC +VY D ++   VCCD C+ WVH  CD       L      N  Y CP C G
Sbjct: 580 GQYCPVCNQVYEDDDAASFVCCDSCEMWVHSACDTDLTPAKLATLAGTNETYICPLCGG 638


>gi|195476347|ref|XP_002086095.1| GE11247 [Drosophila yakuba]
 gi|194185954|gb|EDW99565.1| GE11247 [Drosophila yakuba]
          Length = 497

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 42/90 (46%)

Query: 44  ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
           EN+       ++SC  CG+  H +CL+  A          W+C  C+ C IC  + + ++
Sbjct: 397 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 456

Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +FC  CD  YH YC  PP      G + C
Sbjct: 457 LLFCDDCDRGYHMYCLSPPLMTPPEGSWSC 486


>gi|18032212|gb|AAL56647.1|AF217500_1 histone acetyltransferase MOZ2 [Homo sapiens]
          Length = 2072

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++A  +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|392564180|gb|EIW57358.1| hypothetical protein TRAVEDRAFT_125931 [Trametes versicolor
           FP-101664 SS1]
          Length = 270

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 42/94 (44%), Gaps = 4/94 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M+SC  CG+  H +C+       D      W+C +C+ C +CRR G+    + C  CD  
Sbjct: 1   MVSCVDCGRSGHPSCM-GLDNMGDAMRGYDWQCATCKSCSVCRRKGNEASMLICDHCDRG 59

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
           +H  C  PP +    G + CP    C   G   P
Sbjct: 60  WHMSCFDPPFRAPPEGTWHCP---SCPRVGETFP 90


>gi|307195046|gb|EFN77104.1| Zinc finger protein ubi-d4 A [Harpegnathos saltator]
          Length = 534

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+    +  ++SC  CG+  H  CL+  A          W+C  C+ C I
Sbjct: 425 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 484

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 485 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 523


>gi|52354758|gb|AAH82869.1| LOC494765 protein [Xenopus laevis]
          Length = 417

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 2/110 (1%)

Query: 10  RFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNC 68
           R E    +  SK ++G  + +S  N +C +C  G++   + R+ R++ C  C    H +C
Sbjct: 273 RKEGSKRSVLSKSVSGY-KPKSIPNAICGICLKGKDANKKGRSERLIHCSQCDNSGHPSC 331

Query: 69  LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
           L   A+   +     W+C  C+ C IC +     + MFC  CD  YH +C
Sbjct: 332 LDMSAELVAVIKKYPWQCMECKTCIICGQPHHEEEMMFCDTCDRGYHTFC 381


>gi|145344711|ref|XP_001416870.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577096|gb|ABO95163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1782

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 5/94 (5%)

Query: 138 KCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
           +C SCG     +  S+R       C  C +L  +G +CPVC +V+  S    MV CD C+
Sbjct: 211 RCASCGVT---DDESLRSKNSNGLCALCRKLHKEGQFCPVCDRVWHWSAGDAMVGCDRCE 267

Query: 198 RWVHCQCDGISDEKYLQFQ--VDGNLQYRCPTCR 229
            W+H +CD ++ E   + Q   D ++ Y CP CR
Sbjct: 268 MWIHRECDAVAAEVLDREQNGEDEDIPYACPVCR 301


>gi|332842752|ref|XP_001140541.2| PREDICTED: zinc finger protein DPF3 [Pan troglodytes]
          Length = 367

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 243 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 302

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 303 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 349


>gi|340717364|ref|XP_003397154.1| PREDICTED: zinc finger protein DPF3-like [Bombus terrestris]
          Length = 527

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+    +  ++SC  CG+  H  CL+  A          W+C  C+ C I
Sbjct: 418 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 477

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 478 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516


>gi|328701324|ref|XP_001945217.2| PREDICTED: zinc finger protein ubi-d4-like isoform 1 [Acyrthosiphon
           pisum]
          Length = 521

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 52/121 (42%), Gaps = 5/121 (4%)

Query: 14  GYVATASKDIAGEEQGQSNTNVMCRLCFVGE-NEGCERARRMLSCKSCGKKYHRNCLKNW 72
           GY  T S    G     +N +  C  C   E +    +   ++SC  CG+  H  CL+  
Sbjct: 395 GYTKTKS----GGNNKSANPSPYCDFCLGDEVSNKSGQPEILISCSDCGRSGHPTCLQFT 450

Query: 73  AQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
           A          W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP  +   G + 
Sbjct: 451 ANMIISVGKYRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHVYCLTPPLTSPPEGCWS 510

Query: 133 C 133
           C
Sbjct: 511 C 511


>gi|344273540|ref|XP_003408579.1| PREDICTED: zinc finger protein DPF3-like [Loxodonta africana]
          Length = 427

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 303 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 362

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 363 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 409


>gi|354468677|ref|XP_003496778.1| PREDICTED: histone acetyltransferase MYST4 isoform 1 [Cricetulus
           griseus]
          Length = 1756

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++A  +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|195148883|ref|XP_002015392.1| GL11042 [Drosophila persimilis]
 gi|194109239|gb|EDW31282.1| GL11042 [Drosophila persimilis]
          Length = 567

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 42/90 (46%)

Query: 44  ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
           EN+       ++SC  CG+  H +CL+  A          W+C  C+ C IC  + + ++
Sbjct: 467 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 526

Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +FC  CD  YH YC  PP      G + C
Sbjct: 527 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 556


>gi|354468679|ref|XP_003496779.1| PREDICTED: histone acetyltransferase MYST4 isoform 2 [Cricetulus
           griseus]
          Length = 2047

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++A  +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|328778645|ref|XP_395098.4| PREDICTED: zinc finger protein DPF3-like [Apis mellifera]
          Length = 533

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+    +  ++SC  CG+  H  CL+  A          W+C  C+ C I
Sbjct: 424 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 483

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 484 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 522


>gi|363735536|ref|XP_421609.3| PREDICTED: histone acetyltransferase KAT6B [Gallus gallus]
          Length = 2025

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 60/131 (45%), Gaps = 5/131 (3%)

Query: 8   ARRFESGYVAT---ASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKK 63
           A ++ SG+ A+    S     ++Q +++   +C  C    E+   ++   +LSC  CG  
Sbjct: 184 APKYSSGFPASLPPVSLLPHEKDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSS 243

Query: 64  YHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPP 122
            H +CLK   +         W+C  C+ C  CR  G + +  +FC  CD  +H  C  PP
Sbjct: 244 GHPSCLKFCPELTSNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPP 303

Query: 123 HKNVSSGPYLC 133
              +  G ++C
Sbjct: 304 LSRMPKGMWIC 314


>gi|148670787|gb|EDL02734.1| D4, zinc and double PHD fingers, family 3, isoform CRA_a [Mus
           musculus]
          Length = 381

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 252 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 311

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 312 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 358


>gi|221120366|ref|XP_002164134.1| PREDICTED: histone acetyltransferase KAT6B-like [Hydra
           magnipapillata]
          Length = 832

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 50/111 (45%), Gaps = 9/111 (8%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L C  CG   H +C++   +         W+C  C+ C IC+  G+    +FC  CD  
Sbjct: 214 LLVCDECGNSGHPSCMQYSKELTARVRQEPWQCMECKKCNICKDQGEAANLLFCDACDKG 273

Query: 114 YHCYCQHPPHKNVSSGPYLC-----PKHTKCHSCGSNVPGNGL----SVRW 155
           YH  C  PP  ++  G ++C      ++ K     S+VP + L     V+W
Sbjct: 274 YHMACLDPPLDDMPIGTWICDNCLSERNGKRRRISSSVPASLLLSTPDVKW 324


>gi|158292808|ref|XP_314129.4| AGAP005225-PA [Anopheles gambiae str. PEST]
 gi|157017167|gb|EAA09477.4| AGAP005225-PA [Anopheles gambiae str. PEST]
          Length = 468

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           E+ ++  +  C  C     EN+       ++SC  CG+  H +CL+  A          W
Sbjct: 349 EKSRAVPSPYCDFCLGDARENKKTFEPEELVSCSDCGRSGHPSCLQFTANMIISVRKYRW 408

Query: 85  KCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C IC  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 409 QCIECKYCTICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVSPPEGSWSC 457


>gi|395857461|ref|XP_003801110.1| PREDICTED: zinc finger protein DPF3 [Otolemur garnettii]
          Length = 489

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 365 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 424

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 425 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 471


>gi|328701326|ref|XP_003241562.1| PREDICTED: zinc finger protein ubi-d4-like isoform 2 [Acyrthosiphon
           pisum]
          Length = 458

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 52/121 (42%), Gaps = 5/121 (4%)

Query: 14  GYVATASKDIAGEEQGQSNTNVMCRLCFVGE-NEGCERARRMLSCKSCGKKYHRNCLKNW 72
           GY  T S    G     +N +  C  C   E +    +   ++SC  CG+  H  CL+  
Sbjct: 332 GYTKTKS----GGNNKSANPSPYCDFCLGDEVSNKSGQPEILISCSDCGRSGHPTCLQFT 387

Query: 73  AQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
           A          W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP  +   G + 
Sbjct: 388 ANMIISVGKYRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHVYCLTPPLTSPPEGCWS 447

Query: 133 C 133
           C
Sbjct: 448 C 448


>gi|321460287|gb|EFX71331.1| hypothetical protein DAPPUDRAFT_308933 [Daphnia pulex]
          Length = 501

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 48/110 (43%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
           E + ++  +  C  C     EN+       ++SC  CG+  H +CL+  A          
Sbjct: 382 EMKARAPPSPYCDFCLGDATENKKSGHPEELVSCADCGRSGHPSCLQFTANMIISVKQYR 441

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           W+C  C+ C +C  + +  + +FC  CD  YH YC  PP      G + C
Sbjct: 442 WQCIECKCCSLCGNSDNDEQLLFCDDCDRGYHMYCLKPPLSEPPEGSWSC 491


>gi|308235954|ref|NP_001184101.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
           tropicalis]
          Length = 388

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +A  ++SC  CG+  H +CL+             W+C
Sbjct: 264 GLALPNNYCDFCLGDSKINKKTNQAEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 323

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 324 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSC 370


>gi|194758284|ref|XP_001961392.1| GF11022 [Drosophila ananassae]
 gi|190622690|gb|EDV38214.1| GF11022 [Drosophila ananassae]
          Length = 812

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 2/88 (2%)

Query: 37  CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+       ++SC  CG+  H +CL+  A          W+C  C+ C I
Sbjct: 389 CDFCLGDQRENKKTSMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSI 448

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
           C  + + ++ +FC  CD  YH YC  PP
Sbjct: 449 CGTSDNDDQLLFCDDCDRGYHMYCLSPP 476


>gi|356503907|ref|XP_003520741.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 985

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 7/90 (7%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C +L     YC +C K++  S+    VCCD C  WVH +CD IS + +   +   N 
Sbjct: 359 CKPCAKLIKSKQYCGICKKIWHHSDGGNWVCCDGCNVWVHAECDKISSKHFKDLE---NT 415

Query: 222 QYRCPTCRGECYQVRDLEDAVRELWRRKDM 251
            Y CP C+G+     + E +  + ++ K+M
Sbjct: 416 DYYCPDCKGK----FNCESSTSQTYKSKNM 441


>gi|402876619|ref|XP_003902055.1| PREDICTED: zinc finger protein DPF3-like, partial [Papio anubis]
          Length = 277

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C  C+ 
Sbjct: 158 NNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKS 217

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 218 CILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 259


>gi|432891023|ref|XP_004075510.1| PREDICTED: zinc finger protein neuro-d4-like isoform 1 [Oryzias
           latipes]
          Length = 381

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 260 GSVIANGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 317

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 318 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 363


>gi|410059939|ref|XP_003318982.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Pan
           troglodytes]
          Length = 365

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347

Query: 117 YCQHPPHKNVSSGPYLC 133
           +C  P  K+V +  + C
Sbjct: 348 FCLQPVMKSVPTNGWKC 364


>gi|19921648|ref|NP_610163.1| d4, isoform A [Drosophila melanogaster]
 gi|16417832|gb|AAL18868.1|AF427473_1 dd4 protein [Drosophila melanogaster]
 gi|16198077|gb|AAL13829.1| LD29238p [Drosophila melanogaster]
 gi|21626860|gb|AAF57340.2| d4, isoform A [Drosophila melanogaster]
 gi|220942560|gb|ACL83823.1| d4-PA [synthetic construct]
 gi|220952536|gb|ACL88811.1| d4-PA [synthetic construct]
          Length = 497

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 42/90 (46%)

Query: 44  ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
           EN+       ++SC  CG+  H +CL+  A          W+C  C+ C IC  + + ++
Sbjct: 397 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 456

Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +FC  CD  YH YC  PP      G + C
Sbjct: 457 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 486


>gi|270003134|gb|EEZ99581.1| hypothetical protein TcasGA2_TC001567 [Tribolium castaneum]
          Length = 481

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+       ++SC  CG+  H +CL      +      SW+C  C+ C +
Sbjct: 374 CDFCLGDSRENKKTGVMEELVSCSDCGRSGHPSCLLFTENMKISVKKYSWQCIECKCCSV 433

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 434 CGNSDNDDQLLFCDDCDRGYHMYCLSPPLTDPPEGSWSC 472


>gi|405972707|gb|EKC37461.1| Zinc finger protein ubi-d4 [Crassostrea gigas]
          Length = 591

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 4/118 (3%)

Query: 9   RRFESGYVATASKDIAGE--EQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKY 64
           +R +SGY+    ++++           N  C  C     EN+   +   ++SC  CG+  
Sbjct: 334 KRLDSGYLVEKYEELSHSCILMYMVMANNYCDFCLGDSEENKKSNQPEELVSCSDCGRSG 393

Query: 65  HRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
           H  CL+  A          W+C  C+ C +C  + + ++ +FC  CD  YH YC +PP
Sbjct: 394 HPTCLQFTANMIISVKKYPWQCIECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLNPP 451



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 51/112 (45%), Gaps = 2/112 (1%)

Query: 24  AGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW 81
           +G ++   + N  C  C     EN+   +   ++SC  CG+  H  CL+  A        
Sbjct: 463 SGMDKRDISANNYCDFCLGDSEENKKSNQPEELVSCSDCGRSGHPTCLQFTANMIISVKK 522

Query: 82  SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             W+C  C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 523 YPWQCIECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLNPPLSEPPEGNWSC 574


>gi|442622301|ref|NP_001260707.1| d4, isoform D [Drosophila melanogaster]
 gi|440214084|gb|AGB93242.1| d4, isoform D [Drosophila melanogaster]
          Length = 496

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 52/120 (43%), Gaps = 2/120 (1%)

Query: 16  VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
            AT +K     E+  +  +  C  C     EN+       ++SC  CG+  H +CL+  A
Sbjct: 366 FATGNKVKQRVERDIAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTA 425

Query: 74  QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                     W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 426 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 485


>gi|24585823|ref|NP_724404.1| d4, isoform C [Drosophila melanogaster]
 gi|7302246|gb|AAF57339.1| d4, isoform C [Drosophila melanogaster]
          Length = 495

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 52/120 (43%), Gaps = 2/120 (1%)

Query: 16  VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
            AT +K     E+  +  +  C  C     EN+       ++SC  CG+  H +CL+  A
Sbjct: 365 FATGNKVKQRVERDIAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTA 424

Query: 74  QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                     W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 425 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 484


>gi|357607405|gb|EHJ65481.1| putative requim, req/dpf2 [Danaus plexippus]
          Length = 513

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 46/106 (43%), Gaps = 2/106 (1%)

Query: 30  QSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           Q + +  C  C     EN+      R++SC  CG+  H  CL+             W+C 
Sbjct: 400 QVSPSPYCDFCLGDDRENKKTGTPERLVSCSDCGRSGHPTCLQFTVNMIVSVRKYRWQCI 459

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 460 ECKCCSVCGTSDNDDQLLFCDDCDRGYHMYCLAPPLDAPPEGSWSC 505


>gi|383856201|ref|XP_003703598.1| PREDICTED: zinc finger protein ubi-d4-like [Megachile rotundata]
          Length = 559

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)

Query: 37  CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+    +  ++SC  CG+  H  CL+  A          W+C  C+ C I
Sbjct: 450 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 509

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP  +   G + C
Sbjct: 510 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 548


>gi|403276505|ref|XP_003929938.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
           boliviensis boliviensis]
          Length = 371

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347

Query: 117 YCQHPPHKNVSSGPYLC 133
           +C  P  K+V +  + C
Sbjct: 348 FCLQPVMKSVPTNGWKC 364


>gi|395503977|ref|XP_003756337.1| PREDICTED: zinc finger protein DPF3 [Sarcophilus harrisii]
          Length = 375

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G   N+   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 251 GTVIPNNYCDFCLGGSSMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 310

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 311 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 357


>gi|345803644|ref|XP_854603.2| PREDICTED: zinc finger protein DPF3 [Canis lupus familiaris]
          Length = 322

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 198 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 257

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 258 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 304


>gi|426234251|ref|XP_004011110.1| PREDICTED: zinc finger protein DPF3 [Ovis aries]
          Length = 408

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 284 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 343

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 344 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 390


>gi|25396154|pir||A88925 protein F33E11.3 [imported] - Caenorhabditis elegans
          Length = 1192

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 46/103 (44%), Gaps = 2/103 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M+ C +C   YH  C++   +   L     W C  CR+C IC +    ++ +FC RCD  
Sbjct: 437 MICCATCKIAYHPQCIEMPERMAALVKTYEWSCVDCRLCSICNKPEKEDEIVFCDRCDRG 496

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
           +H YC     K +  G ++C  +    +   N   +  +V  F
Sbjct: 497 FHTYCVG--LKKLPQGTWICDTYCAIENMKFNRRASAAAVGGF 537


>gi|432891025|ref|XP_004075511.1| PREDICTED: zinc finger protein neuro-d4-like isoform 2 [Oryzias
           latipes]
          Length = 371

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 250 GSVIANGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 307

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 308 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 353


>gi|390407656|ref|NP_001254554.1| zinc finger protein DPF3 isoform 1 [Mus musculus]
 gi|215274004|sp|P58269.2|DPF3_MOUSE RecName: Full=Zinc finger protein DPF3; AltName:
           Full=BRG1-associated factor 45C; Short=BAF45C; AltName:
           Full=Zinc finger protein cer-d4
 gi|26332973|dbj|BAC30204.1| unnamed protein product [Mus musculus]
          Length = 378

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 254 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 313

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 314 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360


>gi|410962581|ref|XP_003987847.1| PREDICTED: zinc finger protein DPF3 [Felis catus]
          Length = 411

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 287 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 346

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 347 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 393


>gi|341879787|gb|EGT35722.1| hypothetical protein CAEBREN_06378 [Caenorhabditis brenneri]
          Length = 375

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 2/81 (2%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           ++SC  CG+  H +C+ N+ QN   + + S W+C  C+ C IC  + + +K +FC  CD 
Sbjct: 279 LISCHDCGRSGHPSCM-NFNQNVTKIINRSGWQCLECKSCTICGTSENDDKLLFCDDCDR 337

Query: 113 AYHCYCQHPPHKNVSSGPYLC 133
            YH YC  P  +      Y C
Sbjct: 338 GYHLYCLRPALEKAPDDEYSC 358


>gi|268571913|ref|XP_002641182.1| Hypothetical protein CBG09043 [Caenorhabditis briggsae]
          Length = 373

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 40/80 (50%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           ++SC  CG+  H +C+        +   S W+C  C+ C IC  + + +K +FC  CD  
Sbjct: 277 LVSCHDCGRSGHPSCMSFNQNVTMIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDRG 336

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           YH YC +PP +      Y C
Sbjct: 337 YHLYCLNPPLEKAPDDEYSC 356


>gi|126282822|ref|XP_001375927.1| PREDICTED: zinc finger protein DPF3-like [Monodelphis domestica]
          Length = 384

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G   N+   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 260 GTVIPNNYCDFCLGGSSMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 319

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 320 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 366


>gi|296215435|ref|XP_002754118.1| PREDICTED: zinc finger protein DPF3 isoform 1 [Callithrix jacchus]
 gi|332229063|ref|XP_003263707.1| PREDICTED: zinc finger protein DPF3 isoform 2 [Nomascus leucogenys]
 gi|397507377|ref|XP_003824173.1| PREDICTED: zinc finger protein DPF3 isoform 3 [Pan paniscus]
 gi|215274167|sp|Q92784.3|DPF3_HUMAN RecName: Full=Zinc finger protein DPF3; AltName:
           Full=BRG1-associated factor 45C; Short=BAF45C; AltName:
           Full=Zinc finger protein cer-d4
 gi|60459281|gb|AAX20019.1| DPF3 [Homo sapiens]
          Length = 378

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 254 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 313

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 314 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360


>gi|297695458|ref|XP_002824959.1| PREDICTED: zinc finger protein DPF3 isoform 2 [Pongo abelii]
          Length = 378

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 254 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 313

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 314 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360


>gi|116008472|ref|NP_724405.2| d4, isoform B [Drosophila melanogaster]
 gi|113194576|gb|AAM68376.2| d4, isoform B [Drosophila melanogaster]
          Length = 339

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 42/90 (46%)

Query: 44  ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
           EN+       ++SC  CG+  H +CL+  A          W+C  C+ C IC  + + ++
Sbjct: 239 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 298

Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +FC  CD  YH YC  PP      G + C
Sbjct: 299 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 328


>gi|403223612|dbj|BAM41742.1| predicted protein [Theileria orientalis strain Shintoku]
          Length = 4555

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 59/133 (44%), Gaps = 12/133 (9%)

Query: 105  MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 163
            + C  C+ A H  C +P   N+     + C   T+C SCG         + W L +  C 
Sbjct: 1805 VVCVSCNVAAHRGCCNPVVPNLLFIESWKCDSCTQCISCGYRDTTCIEYLNWGLFFFFCL 1864

Query: 164  ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD--- 218
             C  L  + NYC VC KV+   DS +   V C+ C+ W+H +CD ++     +   D   
Sbjct: 1865 KCWELLERSNYCGVCYKVWTNFDSNTQKWVQCEGCKLWIHVECDDLA-----RIITDCPS 1919

Query: 219  -GNLQYRCPTCRG 230
              +  YRC  CR 
Sbjct: 1920 SKSQNYRCCVCRS 1932


>gi|354488945|ref|XP_003506626.1| PREDICTED: zinc finger protein DPF3-like, partial [Cricetulus
           griseus]
          Length = 367

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 243 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 302

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 303 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 349


>gi|324499809|gb|ADY39928.1| Chromodomain-helicase-DNA-binding protein 3 [Ascaris suum]
          Length = 1844

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 51/127 (40%), Gaps = 22/127 (17%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C K YH  CL    +     HWS   CPSC                
Sbjct: 260 EVCQQGGEIILCDTCPKAYHMVCLDPDMEEAPEGHWS---CPSCEAAGIPQKDEEEEKKV 316

Query: 93  ----EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
               E CR   D    + C  C ++YH YC +PP   V  G + CP+   C     N P 
Sbjct: 317 ATNMEYCRVCKDVGWLLCCDTCPSSYHAYCMNPPLTEVPEGEWSCPR-CLCPEP-KNRPE 374

Query: 149 NGLSVRW 155
             LS RW
Sbjct: 375 KVLSWRW 381



 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/113 (24%), Positives = 42/113 (37%), Gaps = 25/113 (22%)

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
           CE+C++ G+    + C  C  AYH  C  P  +    G + CP    C + G  +P    
Sbjct: 259 CEVCQQGGE---IILCDTCPKAYHMVCLDPDMEEAPEGHWSCP---SCEAAG--IPQKD- 309

Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
                      +   ++     YC VC  V        ++CCD C    H  C
Sbjct: 310 ----------EEEEKKVATNMEYCRVCKDV------GWLLCCDTCPSSYHAYC 346


>gi|25148780|ref|NP_498281.2| Protein DPFF-1, isoform a [Caenorhabditis elegans]
 gi|22096399|sp|Q09477.2|YP99_CAEEL RecName: Full=Uncharacterized zinc finger protein C28H8.9
 gi|351058508|emb|CCD65970.1| Protein DPFF-1, isoform a [Caenorhabditis elegans]
          Length = 372

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 2/81 (2%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
           ++SC  CG+  H +CL N+ QN   +   S W+C  C+ C IC  + + +K +FC  CD 
Sbjct: 276 LVSCHDCGRSGHPSCL-NFNQNVTKIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDR 334

Query: 113 AYHCYCQHPPHKNVSSGPYLC 133
            YH YC  P  +      Y C
Sbjct: 335 GYHLYCLTPALEKAPDDEYSC 355


>gi|417410666|gb|JAA51801.1| Putative phd zn-finger protein, partial [Desmodus rotundus]
          Length = 433

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 2/100 (2%)

Query: 20  SKDIAGEEQGQSNTNVMCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDL 78
           SK + G  + ++  N +C +C  G+  G   +A  ++ C  CG   H +CL    +   +
Sbjct: 299 SKSVPGY-KPKAPPNALCGICLKGKESGRRGKAESLIHCSQCGNSGHPSCLDMTTELVSM 357

Query: 79  FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
                W+C  C+ C +C +     + MFC  CD  YH +C
Sbjct: 358 IKTYPWQCMECKTCIVCGQPHHEEEMMFCDVCDRGYHTFC 397


>gi|355778710|gb|EHH63746.1| hypothetical protein EGM_16777, partial [Macaca fascicularis]
          Length = 363

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 244 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 303

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 304 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350


>gi|355693412|gb|EHH28015.1| hypothetical protein EGK_18348, partial [Macaca mulatta]
          Length = 368

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 244 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 303

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 304 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350


>gi|338719785|ref|XP_001489567.3| PREDICTED: zinc finger protein DPF3-like [Equus caballus]
          Length = 415

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 291 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 350

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 351 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 397


>gi|344241713|gb|EGV97816.1| Histone acetyltransferase MYST4 [Cricetulus griseus]
          Length = 709

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++A  +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|148235385|ref|NP_001079117.1| zinc finger protein ubi-d4 A [Xenopus laevis]
 gi|18203564|sp|Q9W638.1|REQUA_XENLA RecName: Full=Zinc finger protein ubi-d4 A; AltName: Full=Apoptosis
           response zinc finger protein A; AltName: Full=Protein
           requiem A; Short=xReq A
 gi|4808462|dbj|BAA77574.1| Requiem protein [Xenopus laevis]
          Length = 388

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H +CL+  A          W+C
Sbjct: 264 GIALPNNYCDFCLGDSKINKKTNQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQC 323

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 324 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLVPPVAEPPEGSWSC 370


>gi|440907612|gb|ELR57740.1| Zinc finger protein DPF3, partial [Bos grunniens mutus]
          Length = 368

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 244 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 303

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 304 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350


>gi|348573139|ref|XP_003472349.1| PREDICTED: zinc finger protein DPF3-like [Cavia porcellus]
          Length = 369

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 245 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 304

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 305 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 351


>gi|255075355|ref|XP_002501352.1| set domain protein [Micromonas sp. RCC299]
 gi|226516616|gb|ACO62610.1| set domain protein [Micromonas sp. RCC299]
          Length = 2166

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 39/69 (56%), Gaps = 2/69 (2%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN- 220
           C  C +L  +G YCPVC +V++ +    MV CD C  WVHC CD  +    ++ Q  G+ 
Sbjct: 226 CRLCAKLHREGQYCPVCDRVWQWANCPAMVGCDSCDFWVHCACDEPA-RTVMEAQERGDE 284

Query: 221 LQYRCPTCR 229
           + Y CP CR
Sbjct: 285 VDYHCPRCR 293


>gi|410916367|ref|XP_003971658.1| PREDICTED: zinc finger protein DPF3-like [Takifugu rubripes]
          Length = 391

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 44/102 (43%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C   +  N    +A  ++SC  CG+  H  CL+             W+C  C+ 
Sbjct: 273 NDYCDFCLGDQDSNRKTGQAEELVSCSDCGRSGHPTCLQFTDNMMQAVRTYQWQCIECKS 332

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 333 CSICGTSENDDQLLFCDDCDRGYHMYCLKPPMTQPPEGSWSC 374


>gi|195027395|ref|XP_001986568.1| GH21439 [Drosophila grimshawi]
 gi|193902568|gb|EDW01435.1| GH21439 [Drosophila grimshawi]
          Length = 504

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 2/123 (1%)

Query: 13  SGYVATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLK 70
           S  +A+  K+    E+  +  +  C  C     EN+       ++SC  CG+  H +CL+
Sbjct: 371 SNPLASPKKNKLRAERDVAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQ 430

Query: 71  NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP 130
                        W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP      G 
Sbjct: 431 FTPNMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGS 490

Query: 131 YLC 133
           + C
Sbjct: 491 WSC 493


>gi|4808460|dbj|BAA77573.1| Requiem protein [Xenopus laevis]
          Length = 290

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 166 GLALPNNYCDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 225

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 226 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 272


>gi|351714578|gb|EHB17497.1| Histone acetyltransferase MYST4 [Heterocephalus glaber]
          Length = 2108

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 3/133 (2%)

Query: 3   AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCG 61
           AA  Y   F S  +   S     ++Q +++   +C  C    E+   ++   +LSC  CG
Sbjct: 185 AAPQYPSAFPSS-LPPVSLLPHEKDQPRADPIPICSFCLGTKESNREKKPEELLSCADCG 243

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQH 120
              H +CLK   +         W+C  C+ C  CR  G + +  +FC  CD  +H  C  
Sbjct: 244 SSGHPSCLKFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD 303

Query: 121 PPHKNVSSGPYLC 133
           PP   +  G ++C
Sbjct: 304 PPLSRMPKGMWIC 316


>gi|195567753|ref|XP_002107423.1| GD15571 [Drosophila simulans]
 gi|194204830|gb|EDX18406.1| GD15571 [Drosophila simulans]
          Length = 770

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 44/94 (46%), Gaps = 2/94 (2%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
            + C +C K+ H +C+    +        +W+C  C+ C  CR +  P K +FC +CD  
Sbjct: 478 FIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKCRSSQRPGKMLFCEQCDRG 537

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
           YH YC     K V  G + C +   C  CG+  P
Sbjct: 538 YHIYCLG--LKTVPDGRWSCERCCFCVRCGATKP 569


>gi|403418283|emb|CCM04983.1| predicted protein [Fibroporia radiculosa]
          Length = 1278

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 42  VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP 101
           VGE E       M+SC  CG+  H +CLK   +  +      W C  C+ CEICR+    
Sbjct: 130 VGEPE------LMVSCAECGRSGHPSCLK-LVEMSETIRLYPWICSECKNCEICRKKEGE 182

Query: 102 NKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
           N+ + C  CD  +H  C  PP   +  G
Sbjct: 183 NRMIMCDFCDRGWHMDCLQPPLVEMPPG 210


>gi|357463899|ref|XP_003602231.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
 gi|355491279|gb|AES72482.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
          Length = 1053

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 47/99 (47%), Gaps = 12/99 (12%)

Query: 139 CHSCGSNVPGN------GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
           C +CG  +P        GL+     G   C  C RL    +YC +C KV   S+S   V 
Sbjct: 389 CEACGLALPYKMSKKIKGLTPN---GQLLCKTCTRLTKSKHYCGICKKVSNHSDSGSWVR 445

Query: 193 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
           CD C+ WVH +CD IS   +   +      Y CPTCRG+
Sbjct: 446 CDGCKVWVHAECDKISSNHFKDLE---TTDYFCPTCRGK 481


>gi|355755784|gb|EHH59531.1| hypothetical protein EGM_09668 [Macaca fascicularis]
          Length = 340

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 219 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 276

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 277 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 330


>gi|260837382|ref|XP_002613683.1| hypothetical protein BRAFLDRAFT_250354 [Branchiostoma floridae]
 gi|229299071|gb|EEN69692.1| hypothetical protein BRAFLDRAFT_250354 [Branchiostoma floridae]
          Length = 809

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 45/101 (44%), Gaps = 5/101 (4%)

Query: 36  MCRLCFVGENEGCER---ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           +C  C +G  E C R   A  +LSC  CG   H +CLK   Q         W+C  C+ C
Sbjct: 191 VCSFC-LGTAE-CNRDGQAEELLSCADCGNSGHPSCLKYSPQLTAKVRSMRWQCIDCKTC 248

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C    D +  +FC  CD  +H  C +PP   +  G + C
Sbjct: 249 TACENKNDLDNILFCDACDRGFHMKCCNPPLTKMPKGNWEC 289


>gi|194226394|ref|XP_001914899.1| PREDICTED: histone acetyltransferase MYST3 [Equus caballus]
          Length = 2012

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK  A+         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSAELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|432924374|ref|XP_004080595.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase
           KAT6B-like [Oryzias latipes]
          Length = 2014

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E+   +RA  +LSC  CG   H +CLK             W+C  C+ C  
Sbjct: 214 ICSFCLGTKESNRDKRAEELLSCADCGSSGHPSCLKFSPDLTSNVKKLRWQCIECKTCSS 273

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G +  + +FC  CD  +H  C  PP   +  G ++C
Sbjct: 274 CRIQGKNAEEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313


>gi|317419460|emb|CBN81497.1| Histone acetyltransferase MYST4 [Dicentrarchus labrax]
          Length = 2149

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E+   +R   +LSC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + ++ +FC  CD  +H  C  PP   +  G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313


>gi|301780976|ref|XP_002925892.1| PREDICTED: zinc finger protein neuro-d4-like isoform 1 [Ailuropoda
           melanoleuca]
 gi|410983100|ref|XP_003997881.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Felis catus]
          Length = 414

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 293 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 350

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 351 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404


>gi|205830430|ref|NP_001128627.1| zinc finger protein neuro-d4 isoform a [Homo sapiens]
 gi|297704629|ref|XP_002829197.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Pongo abelii]
 gi|395847017|ref|XP_003796183.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Otolemur
           garnettii]
 gi|402905395|ref|XP_003915505.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Papio anubis]
 gi|387540168|gb|AFJ70711.1| zinc finger protein neuro-d4 isoform a [Macaca mulatta]
          Length = 414

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 293 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 350

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 351 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404


>gi|432100457|gb|ELK29089.1| Histone acetyltransferase MYST4 [Myotis davidii]
          Length = 2022

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|429329891|gb|AFZ81650.1| hypothetical protein BEWA_010670 [Babesia equi]
          Length = 3609

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 58/131 (44%), Gaps = 12/131 (9%)

Query: 107  CRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDAC 165
            C  C+ + H  C +PP  N+     + C   ++C SCG         + W L +  C  C
Sbjct: 1097 CVSCNLSAHRNCCNPPVPNLLFIESWKCDWCSQCISCGYRDTNGTEYLHWGLFFLLCLKC 1156

Query: 166  GRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD----G 219
                 K N+C VC KV+   DS +   V C+ C+ W+H +CD ++     Q   D     
Sbjct: 1157 WESLEKNNFCGVCYKVWTNYDSVTQKWVQCEGCKLWIHVECDDLA-----QIITDCPSSR 1211

Query: 220  NLQYRCPTCRG 230
            +  YRC  CR 
Sbjct: 1212 SQNYRCKVCRS 1222


>gi|6002696|gb|AAF00100.1|AF119231_1 histone acetyltransferase MORF beta [Homo sapiens]
          Length = 2073

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|449505049|ref|XP_002192975.2| PREDICTED: histone acetyltransferase KAT6B [Taeniopygia guttata]
          Length = 1842

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|348506563|ref|XP_003440828.1| PREDICTED: zinc finger protein DPF3-like [Oreochromis niloticus]
          Length = 391

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 44/102 (43%), Gaps = 2/102 (1%)

Query: 34  NVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           N  C  C   +  N    +A  ++SC  CG+  H  CL+             W+C  C+ 
Sbjct: 273 NDYCDFCLGDQDSNRKTGQAEELVSCSDCGRSGHPTCLQFTENMMQAVRTYQWQCIECKS 332

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 333 CSLCGTSENDDQLLFCDDCDRGYHMYCLKPPMTQPPEGSWSC 374


>gi|431909715|gb|ELK12873.1| Zinc finger protein neuro-d4 [Pteropus alecto]
          Length = 425

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 304 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 361

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 362 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 415


>gi|297459745|ref|XP_001254780.2| PREDICTED: zinc finger protein DPF3 [Bos taurus]
 gi|297479887|ref|XP_002691098.1| PREDICTED: zinc finger protein DPF3 [Bos taurus]
 gi|296483064|tpg|DAA25179.1| TPA: Zinc finger protein DPF3-like [Bos taurus]
          Length = 474

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 350 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 409

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 410 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 456


>gi|197091705|gb|ACH42085.1| PHD zinc finger protein 10 [Crassostrea gigas]
          Length = 367

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 10/119 (8%)

Query: 36  MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
           +C +C  G+N   +    ++ C  C K+ H  CL    +   +     W+C  C+ C  C
Sbjct: 224 VCSICTQGQNSEVKGESDLVVCSECNKEGHPGCLDLTNEMVTVIKTYPWQCMDCKTCVEC 283

Query: 96  RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
               D +K MFC  CD  YH +C     K++ +G      H KC SC    P N  +V+
Sbjct: 284 MDPYDEDKMMFCDLCDRGYHTFCVG--LKSIPTG------HWKCKSCKG--PENPQTVQ 332


>gi|301773210|ref|XP_002922022.1| PREDICTED: histone acetyltransferase MYST4-like [Ailuropoda
           melanoleuca]
 gi|281342250|gb|EFB17834.1| hypothetical protein PANDA_010953 [Ailuropoda melanoleuca]
          Length = 2063

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|187957110|gb|AAI50619.1| MYST histone acetyltransferase (monocytic leukemia) 4 [Homo
           sapiens]
          Length = 2073

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|100816397|ref|NP_036462.2| histone acetyltransferase KAT6B isoform 1 [Homo sapiens]
 gi|143811424|sp|Q8WYB5.3|KAT6B_HUMAN RecName: Full=Histone acetyltransferase KAT6B; AltName:
           Full=Histone acetyltransferase MOZ2; AltName: Full=MOZ,
           YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4;
           AltName: Full=Monocytic leukemia zinc finger
           protein-related factor
 gi|119574944|gb|EAW54559.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
           CRA_c [Homo sapiens]
          Length = 2073

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|397483738|ref|XP_003813054.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Pan paniscus]
          Length = 2075

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|355782819|gb|EHH64740.1| hypothetical protein EGM_18047 [Macaca fascicularis]
          Length = 2069

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|355562477|gb|EHH19071.1| hypothetical protein EGK_19714 [Macaca mulatta]
          Length = 2077

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|297301091|ref|XP_002805720.1| PREDICTED: histone acetyltransferase MYST4-like isoform 1 [Macaca
           mulatta]
          Length = 1893

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|195382643|ref|XP_002050039.1| GJ20409 [Drosophila virilis]
 gi|194144836|gb|EDW61232.1| GJ20409 [Drosophila virilis]
          Length = 490

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 52/120 (43%), Gaps = 2/120 (1%)

Query: 16  VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
           +A+  K+    E+  +  +  C  C     EN+       ++SC  CG+  H +CL+   
Sbjct: 360 LASPKKNKLRAEREVAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTP 419

Query: 74  QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                     W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 420 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 479


>gi|344243487|gb|EGV99590.1| Zinc finger protein DPF3 [Cricetulus griseus]
          Length = 408

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 44/96 (45%), Gaps = 2/96 (2%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 243 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 302

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
             C+ C +C  + + ++ +FC  CD  YH YC +PP
Sbjct: 303 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPP 338


>gi|332834457|ref|XP_003312688.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
           [Pan troglodytes]
          Length = 2070

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|402880388|ref|XP_003903785.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Papio anubis]
          Length = 2070

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|341942234|sp|Q8BRB7.3|KAT6B_MOUSE RecName: Full=Histone acetyltransferase KAT6B; AltName: Full=MOZ,
           YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4;
           AltName: Full=Protein querkopf
          Length = 1872

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|195479715|ref|XP_002100999.1| GE17369 [Drosophila yakuba]
 gi|194188523|gb|EDX02107.1| GE17369 [Drosophila yakuba]
          Length = 2002

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)

Query: 37   CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C +C   ++    +     + C +C K+ H +C+    +        +W+C  C+ C  C
Sbjct: 1696 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKC 1755

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            R +  P K ++C +CD  YH YC     + V  G + C +   C  CG+  P
Sbjct: 1756 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCMRCGATKP 1805


>gi|73953062|ref|XP_536397.2| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Canis lupus
           familiaris]
          Length = 2090

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|348508657|ref|XP_003441870.1| PREDICTED: histone acetyltransferase MYST4 [Oreochromis niloticus]
          Length = 2141

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E+   +R   +LSC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + ++ +FC  CD  +H  C  PP   +  G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313


>gi|426365193|ref|XP_004049670.1| PREDICTED: histone acetyltransferase KAT6B [Gorilla gorilla
           gorilla]
          Length = 2072

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|397483742|ref|XP_003813056.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Pan paniscus]
          Length = 1892

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|403293045|ref|XP_003937534.1| PREDICTED: zinc finger protein neuro-d4 [Saimiri boliviensis
           boliviensis]
          Length = 293

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 172 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 229

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 230 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 283


>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
            finger protein HRX) (ALL-1) (Trithorax-like protein)
            (Lysine N-methyltransferase 2A) (CXXC-type zinc finger
            protein 7) [Ciona intestinalis]
          Length = 3406

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 74/200 (37%), Gaps = 64/200 (32%)

Query: 35   VMCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW---SSWKCPSC 89
             +C +C  F       E A  ++ C  C + YH  C +    N D+      ++W C  C
Sbjct: 1022 TVCLICGSFSSAKSISEGA--LVHCACCCEPYHPFCAEPDFLNTDVLAQMKRNTWICRKC 1079

Query: 90   RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
            + C +C   G P   + CRRC  +Y                     H++C   G + P N
Sbjct: 1080 QCCHVC---GHPKNLLVCRRCKKSY---------------------HSEC--LGPSYPTN 1113

Query: 150  GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
              S                               D ES  M+ C  C+RWVH +C+ +S 
Sbjct: 1114 CYS-----------------------------DEDYESR-MIQCSGCKRWVHSKCESLSV 1143

Query: 210  EKY-LQFQVDGNLQYRCPTC 228
            E Y L   +  N+ Y+CP C
Sbjct: 1144 EMYTLLAHMPNNVTYKCPDC 1163


>gi|374349205|ref|NP_001243397.1| histone acetyltransferase KAT6B isoform 2 [Homo sapiens]
 gi|119574942|gb|EAW54557.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
           CRA_a [Homo sapiens]
          Length = 1890

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|6002694|gb|AAF00099.1|AF119230_1 histone acetyltransferase MORF alpha [Homo sapiens]
          Length = 1890

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|426255810|ref|XP_004021541.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Ovis aries]
          Length = 1869

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|332244078|ref|XP_003271198.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
           [Nomascus leucogenys]
          Length = 2055

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 186 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 245

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 246 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 295


>gi|317419461|emb|CBN81498.1| Histone acetyltransferase MYST4 [Dicentrarchus labrax]
          Length = 1996

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E+   +R   +LSC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + ++ +FC  CD  +H  C  PP   +  G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313


>gi|110556652|ref|NP_059507.2| histone acetyltransferase KAT6B [Mus musculus]
 gi|327365366|ref|NP_001192170.1| histone acetyltransferase KAT6B [Mus musculus]
 gi|148669523|gb|EDL01470.1| mCG123147, isoform CRA_a [Mus musculus]
          Length = 1763

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|6716789|gb|AAF26744.1|AF222800_1 histone acetyltransferase querkopf [Mus musculus]
          Length = 1763

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|74184716|dbj|BAE27963.1| unnamed protein product [Mus musculus]
          Length = 1763

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|6002686|gb|AAF00095.1| histone acetyltransferase MORF [Homo sapiens]
 gi|20521021|dbj|BAA20837.2| KIAA0383 [Homo sapiens]
 gi|152012887|gb|AAI50271.1| MYST4 protein [Homo sapiens]
 gi|168267336|dbj|BAG09724.1| MYST histone acetyltransferase 4 [synthetic construct]
          Length = 1781

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|338716911|ref|XP_003363544.1| PREDICTED: histone acetyltransferase MYST4 isoform 3 [Equus
           caballus]
          Length = 1878

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|410975403|ref|XP_003994122.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
           [Felis catus]
          Length = 2078

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|390472131|ref|XP_002807481.2| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
           [Callithrix jacchus]
          Length = 2066

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|291404131|ref|XP_002718449.1| PREDICTED: MYST histone acetyltransferase (monocytic leukemia) 4
           isoform 2 [Oryctolagus cuniculus]
          Length = 1774

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|291404129|ref|XP_002718448.1| PREDICTED: MYST histone acetyltransferase (monocytic leukemia) 4
           isoform 1 [Oryctolagus cuniculus]
          Length = 2065

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|241640825|ref|XP_002409304.1| requim, req/dpf2, putative [Ixodes scapularis]
 gi|215501331|gb|EEC10825.1| requim, req/dpf2, putative [Ixodes scapularis]
          Length = 379

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 2/105 (1%)

Query: 31  SNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
           +N +  C  C    GEN+   +   ++SC  CG+  H +CL+             W+C  
Sbjct: 256 ANPSPYCDFCLGDNGENKKTRQPEELVSCSDCGRSAHPSCLQFTPNMTVSVKKYRWQCIE 315

Query: 89  CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 316 CKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLTPPLSEPPEGLWSC 360


>gi|194893051|ref|XP_001977800.1| GG18040 [Drosophila erecta]
 gi|190649449|gb|EDV46727.1| GG18040 [Drosophila erecta]
          Length = 1982

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/137 (26%), Positives = 58/137 (42%), Gaps = 4/137 (2%)

Query: 25   GEEQGQSNTNVMCRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
            G E   S     C +C   ++    +     + C +C K+ H +C+    +        +
Sbjct: 1671 GNEFSSSTRLSSCGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCIDMPQRMVGRVRNYN 1730

Query: 84   WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
            W+C  C+ C  CR +  P K ++C +CD  YH YC     + V  G + C +   C  CG
Sbjct: 1731 WQCAGCKCCIKCRSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCIRCG 1788

Query: 144  SNVPGNGLSVRWFLGYT 160
            +  P  GL+    L  T
Sbjct: 1789 ATKP-EGLAHVAALSQT 1804


>gi|395820446|ref|XP_003783577.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Otolemur
           garnettii]
          Length = 2062

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|297467918|ref|XP_872746.3| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Bos taurus]
 gi|297491533|ref|XP_002698931.1| PREDICTED: histone acetyltransferase KAT6B [Bos taurus]
 gi|296472060|tpg|DAA14175.1| TPA: MYST histone acetyltransferase (monocytic leukemia) 4 [Bos
           taurus]
          Length = 2054

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|403298004|ref|XP_003939830.1| PREDICTED: histone acetyltransferase KAT6B [Saimiri boliviensis
           boliviensis]
          Length = 2051

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|402880392|ref|XP_003903787.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Papio anubis]
          Length = 1887

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|397483740|ref|XP_003813055.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Pan paniscus]
          Length = 1783

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|395820450|ref|XP_003783579.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Otolemur
           garnettii]
          Length = 1880

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|440893247|gb|ELR46092.1| Histone acetyltransferase MYST4 [Bos grunniens mutus]
          Length = 2054

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|68565919|sp|Q8WML3.1|KAT6B_MACFA RecName: Full=Histone acetyltransferase KAT6B; AltName: Full=MOZ,
           YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4
 gi|17025966|dbj|BAB72094.1| histone acetyltransferase MORF [Macaca fascicularis]
          Length = 1784

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|395501560|ref|XP_003755161.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Sarcophilus
           harrisii]
          Length = 1862

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|374349207|ref|NP_001243398.1| histone acetyltransferase KAT6B isoform 3 [Homo sapiens]
 gi|119574943|gb|EAW54558.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
           CRA_b [Homo sapiens]
          Length = 1781

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|410053854|ref|XP_001162157.3| PREDICTED: zinc finger protein neuro-d4 [Pan troglodytes]
          Length = 519

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 398 GTVIPNGYCDFCLGGSKKTGCPED--LISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 455

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 456 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 509


>gi|194042830|ref|XP_001928984.1| PREDICTED: histone acetyltransferase MYST4 [Sus scrofa]
          Length = 2065

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|147904561|ref|NP_001081346.1| zinc finger protein ubi-d4 B [Xenopus laevis]
 gi|47682308|gb|AAH70839.1| LOC397786 protein [Xenopus laevis]
 gi|52078456|gb|AAH82478.1| LOC397786 protein [Xenopus laevis]
          Length = 387

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 263 GLALPNNYCDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 322

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 323 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 369


>gi|417406752|gb|JAA50020.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
          Length = 1807

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|380798715|gb|AFE71233.1| zinc finger protein neuro-d4 isoform a, partial [Macaca mulatta]
          Length = 407

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 286 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 343

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 344 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 397


>gi|341888296|gb|EGT44231.1| hypothetical protein CAEBREN_09890 [Caenorhabditis brenneri]
 gi|341902393|gb|EGT58328.1| hypothetical protein CAEBREN_07005 [Caenorhabditis brenneri]
          Length = 454

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 3/89 (3%)

Query: 46  EGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 104
           + CER    M+ C +C   YH +C++   +   +     W C  CR+C +C + G+ N+ 
Sbjct: 339 DSCERVGGEMVCCATCNIAYHPHCIEMPERMAMIVKTYEWSCVDCRVCSVCHKPGEENEV 398

Query: 105 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +FC RCD  +H  C     K+   G ++C
Sbjct: 399 VFCDRCDRGFHNSCVG--LKSTPIGSWIC 425


>gi|149689991|ref|XP_001504001.1| PREDICTED: histone acetyltransferase MYST4 isoform 1 [Equus
           caballus]
          Length = 2061

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|348576162|ref|XP_003473856.1| PREDICTED: histone acetyltransferase MYST4-like isoform 1 [Cavia
           porcellus]
          Length = 2053

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|45383492|ref|NP_989662.1| zinc finger protein ubi-d4 [Gallus gallus]
 gi|18202299|sp|P58268.1|REQU_CHICK RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
           response zinc finger protein; AltName: Full=Protein
           requiem
 gi|14010356|gb|AAK51965.1|AF362751_1 requiem [Gallus gallus]
          Length = 405

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 279 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 338

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 339 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPPMSEPPEGSWSC 385


>gi|402880390|ref|XP_003903786.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Papio anubis]
          Length = 1778

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|119577164|gb|EAW56760.1| D4, zinc and double PHD fingers family 1, isoform CRA_a [Homo
           sapiens]
          Length = 425

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 304 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 361

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 362 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 415


>gi|417406735|gb|JAA50012.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
          Length = 1778

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|395820448|ref|XP_003783578.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Otolemur
           garnettii]
          Length = 1771

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|395501558|ref|XP_003755160.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Sarcophilus
           harrisii]
          Length = 2045

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|426255806|ref|XP_004021539.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Ovis aries]
          Length = 2052

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|297301093|ref|XP_002805721.1| PREDICTED: histone acetyltransferase MYST4-like isoform 2 [Macaca
           mulatta]
          Length = 1784

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|21391996|gb|AAM48352.1| LD10526p [Drosophila melanogaster]
          Length = 1843

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)

Query: 37   CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C +C   ++    +     + C +C K+ H +C+    +        +W+C  C+ C  C
Sbjct: 1532 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1591

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            R +  P K ++C +CD  YH YC     + V  G + C +   C  CG+  P
Sbjct: 1592 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1641


>gi|356570970|ref|XP_003553655.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 985

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C +L     YC +C K++  S+    VCCD C  WVH +CD IS + +   +   N 
Sbjct: 358 CKPCAKLIKSRQYCGICKKIWHHSDGGNWVCCDGCNVWVHAECDKISSKLFKDLE---NA 414

Query: 222 QYRCPTCRGECYQVRDLEDAVRELWRRKDMA 252
            Y CP C+G+     + E +  + ++ K+++
Sbjct: 415 DYYCPDCKGK----FNYESSTSQTYKSKNIS 441


>gi|126272817|ref|XP_001366112.1| PREDICTED: histone acetyltransferase MYST4 [Monodelphis domestica]
          Length = 2045

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E+   ++   +LSC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|324512595|gb|ADY45214.1| Zinc finger protein [Ascaris suum]
          Length = 382

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 49/104 (47%), Gaps = 2/104 (1%)

Query: 32  NTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC 89
           + +++C LC     EN+      +++SC  CG+  H +CLK         +   W+C  C
Sbjct: 260 DVSLICDLCLGDRNENKKTSLPEQLVSCHDCGRSGHPSCLKFTENMITSTNKYGWQCIEC 319

Query: 90  RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           + C IC  + + ++ +FC  CD  +H YC  P       G + C
Sbjct: 320 KSCAICGTSDNDDQLLFCDDCDRGFHLYCLRPRLATAPEGEWSC 363


>gi|297298207|ref|XP_002805197.1| PREDICTED: hypothetical protein LOC694878 [Macaca mulatta]
          Length = 472

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G    N  C  C  G N  +   R   ++SC  CG+  H  CL+      +      W+C
Sbjct: 348 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 407

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 408 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 454


>gi|302141752|emb|CBI18955.3| unnamed protein product [Vitis vinifera]
          Length = 795

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 11/117 (9%)

Query: 22  DIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKS--C-GKKYHRNCLKNWAQNRDL 78
           ++  ++Q ++    +C++C        E    +L C    C  K YH++CL +       
Sbjct: 626 EVGIQQQKETKYFQLCKIC----GSDMEFGEHLLECGHPFCPNKYYHKSCLTS---TELR 678

Query: 79  FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
            +   W CPSC +C  C    D  K + C  CD AYH YC +PP  ++  G + C K
Sbjct: 679 MYGPCWYCPSC-LCRACLTDRDDEKIILCDGCDHAYHIYCMNPPRTSIPRGKWFCRK 734


>gi|426255808|ref|XP_004021540.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Ovis aries]
          Length = 1760

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|45550083|ref|NP_608334.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
 gi|442616987|ref|NP_001259718.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
 gi|442616993|ref|NP_001259721.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
 gi|62901062|sp|Q9VWF2.3|SAYP_DROME RecName: Full=Supporter of activation of yellow protein; AltName:
            Full=Protein enhancer of yellow 3
 gi|45447061|gb|AAF48990.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
 gi|257153436|gb|ACV44475.1| LD27440p [Drosophila melanogaster]
 gi|440216955|gb|AGB95558.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
 gi|440216958|gb|AGB95561.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
          Length = 2006

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)

Query: 37   CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C +C   ++    +     + C +C K+ H +C+    +        +W+C  C+ C  C
Sbjct: 1695 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1754

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            R +  P K ++C +CD  YH YC     + V  G + C +   C  CG+  P
Sbjct: 1755 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1804


>gi|395501556|ref|XP_003755159.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Sarcophilus
           harrisii]
          Length = 1753

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|338716908|ref|XP_003363543.1| PREDICTED: histone acetyltransferase MYST4 isoform 2 [Equus
           caballus]
          Length = 1769

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|442616989|ref|NP_001259719.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
 gi|440216956|gb|AGB95559.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
          Length = 2012

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)

Query: 37   CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C +C   ++    +     + C +C K+ H +C+    +        +W+C  C+ C  C
Sbjct: 1701 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1760

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            R +  P K ++C +CD  YH YC     + V  G + C +   C  CG+  P
Sbjct: 1761 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1810


>gi|395741628|ref|XP_002820847.2| PREDICTED: histone acetyltransferase KAT6B, partial [Pongo abelii]
          Length = 1870

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E+   ++   +LSC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 8   ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 67

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 68  CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 107


>gi|442616991|ref|NP_001259720.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
 gi|440216957|gb|AGB95560.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
          Length = 2011

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)

Query: 37   CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C +C   ++    +     + C +C K+ H +C+    +        +W+C  C+ C  C
Sbjct: 1700 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1759

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            R +  P K ++C +CD  YH YC     + V  G + C +   C  CG+  P
Sbjct: 1760 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1809


>gi|431904096|gb|ELK09518.1| Histone acetyltransferase MYST4 [Pteropus alecto]
          Length = 1926

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNRDKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|344274631|ref|XP_003409118.1| PREDICTED: histone acetyltransferase MYST4 [Loxodonta africana]
          Length = 1878

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|56790323|ref|NP_001007153.1| D4, zinc and double PHD fingers family 2 [Danio rerio]
 gi|54035542|gb|AAH83281.1| D4, zinc and double PHD fingers family 2 [Danio rerio]
 gi|182890270|gb|AAI65788.1| Dpf2 protein [Danio rerio]
          Length = 400

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H +CL+             W+C
Sbjct: 272 GLALPNNYCDFCLGDSSLNQKTGQSEELVSCSDCGRSGHPSCLQFTPIMMAAVKTYRWQC 331

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 332 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPMSVPPEGSWSC 378


>gi|426235049|ref|XP_004011503.1| PREDICTED: PHD finger protein 10 [Ovis aries]
          Length = 451

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 41/86 (47%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  CG   H +CL   A+   +     W+C  C+ C
Sbjct: 330 NALCGICLKGKESSKRGKAEPLVHCSQCGNSGHPSCLDMPAELVSMIKTYPWQCMECKAC 389

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            +C +     + MFC  CD  YH +C
Sbjct: 390 IVCGQPHHEEEMMFCDVCDRGYHTFC 415


>gi|348518782|ref|XP_003446910.1| PREDICTED: zinc finger protein ubi-d4-like [Oreochromis niloticus]
          Length = 399

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N    ++  ++SC  CG+  H +CL+             W+C
Sbjct: 271 GLALPNNYCDFCLGDSKTNHKTGQSEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 330

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 331 IECKCCNMCGTSENDDQLLFCDDCDRGYHMYCLNPPMSEPPEGSWSC 377


>gi|13431818|sp|Q9QX66.2|DPF1_MOUSE RecName: Full=Zinc finger protein neuro-d4; AltName:
           Full=BRG1-associated factor 45B; Short=BAF45B; AltName:
           Full=D4, zinc and double PHD fingers family 1
          Length = 387

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 266 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 323

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 324 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377


>gi|18203563|sp|Q9W636.2|REQUB_XENLA RecName: Full=Zinc finger protein ubi-d4 B; AltName: Full=Apoptosis
           response zinc finger protein B; AltName: Full=Protein
           requiem B; Short=xReq B
          Length = 366

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 242 GLALPNNYCDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 301

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 302 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 348


>gi|335289596|ref|XP_003355927.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Sus scrofa]
 gi|115527907|gb|AAI25154.1| DPF1 protein [Homo sapiens]
 gi|119577167|gb|EAW56763.1| D4, zinc and double PHD fingers family 1, isoform CRA_d [Homo
           sapiens]
 gi|208966118|dbj|BAG73073.1| D4, zinc and double PHD fingers family 1 [synthetic construct]
          Length = 387

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 266 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 323

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 324 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377


>gi|347800720|ref|NP_001099199.2| zinc finger protein neuro-d4 [Rattus norvegicus]
          Length = 387

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 266 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 323

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 324 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377


>gi|327276821|ref|XP_003223166.1| PREDICTED: histone acetyltransferase MYST4-like [Anolis
           carolinensis]
          Length = 2024

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E+   ++   +LSC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 214 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCLELTTNVKALRWQCIECKTCSA 273

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 274 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 313


>gi|7305309|ref|NP_038902.1| zinc finger protein neuro-d4 [Mus musculus]
 gi|6649546|gb|AAF21455.1|U48238_1 zinc finger protein neuro-d4 [Mus musculus]
 gi|30481687|gb|AAH52348.1| D4, zinc and double PHD fingers family 1 [Mus musculus]
 gi|148692120|gb|EDL24067.1| neuronal d4 domain family member [Mus musculus]
          Length = 388

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 267 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 324

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 325 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378


>gi|348576164|ref|XP_003473857.1| PREDICTED: histone acetyltransferase MYST4-like isoform 2 [Cavia
           porcellus]
          Length = 1762

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|205830434|ref|NP_001128628.1| zinc finger protein neuro-d4 isoform c [Homo sapiens]
 gi|395751088|ref|XP_002829195.2| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Pongo abelii]
 gi|395751090|ref|XP_003779218.1| PREDICTED: zinc finger protein neuro-d4 [Pongo abelii]
 gi|395847019|ref|XP_003796184.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Otolemur
           garnettii]
 gi|395847023|ref|XP_003796186.1| PREDICTED: zinc finger protein neuro-d4 isoform 4 [Otolemur
           garnettii]
 gi|402905397|ref|XP_003915506.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Papio anubis]
 gi|402905399|ref|XP_003915507.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Papio anubis]
 gi|410983102|ref|XP_003997882.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Felis catus]
 gi|193787694|dbj|BAG52900.1| unnamed protein product [Homo sapiens]
          Length = 332

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 211 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 268

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 269 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 322


>gi|432897021|ref|XP_004076387.1| PREDICTED: zinc finger protein ubi-d4-like [Oryzias latipes]
          Length = 399

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N    ++  ++SC  CG+  H +CL+             W+C
Sbjct: 271 GLALPNNYCDFCLGDSKTNHKTGQSEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 330

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 331 IECKCCNMCGTSENDDQLLFCDDCDRGYHMYCLNPPMSEPPEGSWSC 377


>gi|126010798|gb|AAI33649.1| DPF1 protein [Bos taurus]
          Length = 388

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 267 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 324

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 325 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378


>gi|116003927|ref|NP_001070323.1| zinc finger protein neuro-d4 [Bos taurus]
 gi|115305354|gb|AAI23606.1| D4, zinc and double PHD fingers family 1 [Bos taurus]
          Length = 387

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 266 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 323

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 324 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377


>gi|432877939|ref|XP_004073268.1| PREDICTED: zinc finger protein ubi-d4-like [Oryzias latipes]
          Length = 407

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H  CL+             W+C
Sbjct: 278 GLALPNDYCDFCLGDSTLNQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQC 337

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 338 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLTPPMTEPPEGSWSC 384


>gi|189526911|ref|XP_697383.3| PREDICTED: hypothetical protein LOC568932 [Danio rerio]
          Length = 2011

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   +R   +LSC  CG   H +CLK  A          W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSADLTANVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  C+  G + ++ +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSSCQIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|149056396|gb|EDM07827.1| neuronal d4 domain family member, isoform CRA_b [Rattus norvegicus]
 gi|166796914|gb|AAI59416.1| D4, zinc and double PHD fingers family 1 [Rattus norvegicus]
          Length = 332

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 211 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 268

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 269 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 322


>gi|345305893|ref|XP_001506182.2| PREDICTED: histone acetyltransferase MYST4 isoform 1
           [Ornithorhynchus anatinus]
          Length = 2066

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 49/109 (44%), Gaps = 2/109 (1%)

Query: 27  EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
           +Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265

Query: 86  CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  C+ C  CR  G +    +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 CIECKTCSACRIQGKNAENMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|302843023|ref|XP_002953054.1| hypothetical protein VOLCADRAFT_105771 [Volvox carteri f.
            nagariensis]
 gi|300261765|gb|EFJ45976.1| hypothetical protein VOLCADRAFT_105771 [Volvox carteri f.
            nagariensis]
          Length = 2579

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 60/134 (44%), Gaps = 16/134 (11%)

Query: 14   GYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
            G  A    D AG  +G + + +  R+C   +      +   +S +SCG++   N + N  
Sbjct: 1226 GLGARLESDKAGIPEG-AQSGLRRRVCVGAQQHPPWDSAEEMSIRSCGQRSRCNIIHNLL 1284

Query: 74   QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
              R LF      CP  R+C +     D N+ + C  CD  YHCYC  PP   V +G + C
Sbjct: 1285 --RLLF-----SCP--RVCWLDE---DKNRILLCDGCDGEYHCYCVEPPLLEVPAGAWFC 1332

Query: 134  PKHTKCHSCGSNVP 147
            P    C + G  +P
Sbjct: 1333 P---SCTARGLGLP 1343


>gi|260830577|ref|XP_002610237.1| hypothetical protein BRAFLDRAFT_245816 [Branchiostoma floridae]
 gi|229295601|gb|EEN66247.1| hypothetical protein BRAFLDRAFT_245816 [Branchiostoma floridae]
          Length = 354

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 48/112 (42%), Gaps = 5/112 (4%)

Query: 23  IAGEEQGQSNTNVMCRLCFVGENEGCERA--RRMLSCKSCGKKYHRNCLKNWAQNRDLFH 80
           +  +  G SN    C  C   E E  +      ++SC  CG+  H  CL+      +   
Sbjct: 244 VPAKGSGPSN---YCDFCLGDEKENKKSGVPEELISCSDCGRSGHPTCLQFTPHMTESVK 300

Query: 81  WSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
              W+C  C+ C +C  + + ++ +FC  CD  YH YC  PP  +   G  L
Sbjct: 301 KYRWQCIECKSCHLCGTSENDDQLLFCDDCDRGYHMYCLSPPLSSPPEGKTL 352


>gi|348514482|ref|XP_003444769.1| PREDICTED: zinc finger protein ubi-d4-like [Oreochromis niloticus]
          Length = 405

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H  CL+             W+C
Sbjct: 276 GLALPNDYCDFCLGDSTLNQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQC 335

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 336 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLTPPMTEPPEGSWSC 382


>gi|270013653|gb|EFA10101.1| hypothetical protein TcasGA2_TC012280 [Tribolium castaneum]
          Length = 559

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 57/131 (43%), Gaps = 18/131 (13%)

Query: 24  AGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
           +G EQG      +C +C V    G     RM++C+ C    H +CL        ++  ++
Sbjct: 283 SGREQGG-----VCSVCHVQNKMGPND--RMVACRECT---HYSCLNGDDMMLRMYPDNT 332

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP-PHKNVSSGP--YLC-----PK 135
           W+CP C+ C IC  T D      C  C  AYH  C  P  H+     P  +LC     P+
Sbjct: 333 WQCPHCKTCVICFETSDAGYLTVCAVCADAYHAGCHQPRIHEKFIKPPAKWLCINCEMPE 392

Query: 136 HTKCHSCGSNV 146
             K +   SN+
Sbjct: 393 ELKINEIQSNI 403


>gi|410895483|ref|XP_003961229.1| PREDICTED: histone acetyltransferase KAT6B-like [Takifugu rubripes]
          Length = 2123

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E+   ++   +LSC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 214 ICSFCLGTKESNRDKQPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + ++ +FC  CD  +H  C +PP   +  G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCNPPLSRMPKGTWIC 313


>gi|308497276|ref|XP_003110825.1| hypothetical protein CRE_04507 [Caenorhabditis remanei]
 gi|308242705|gb|EFO86657.1| hypothetical protein CRE_04507 [Caenorhabditis remanei]
          Length = 372

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 38/80 (47%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           ++SC  CG+  H +CL        +   S W+C  C+ C IC  + + +K +FC  CD  
Sbjct: 276 LVSCHDCGRSGHPSCLSFNENVTKIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDRG 335

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           YH YC  P  +      Y C
Sbjct: 336 YHLYCLRPALEKAPDDEYSC 355


>gi|195425644|ref|XP_002061104.1| GK10624 [Drosophila willistoni]
 gi|194157189|gb|EDW72090.1| GK10624 [Drosophila willistoni]
          Length = 515

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 43/99 (43%), Gaps = 2/99 (2%)

Query: 37  CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           C  C     EN+       ++SC  CG+  H +CL+             W+C  C+ C I
Sbjct: 406 CDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTPNMIISVKRYRWQCIECKYCSI 465

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 466 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 504


>gi|440894955|gb|ELR47273.1| Zinc finger protein neuro-d4, partial [Bos grunniens mutus]
          Length = 383

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 262 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 319

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 320 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 373


>gi|37998957|dbj|BAD00088.1| chimeric MOZ-ASXH2 fusion protein [Homo sapiens]
          Length = 2228

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|115533182|ref|NP_001041113.1| Protein PHF-10, isoform b [Caenorhabditis elegans]
 gi|351062484|emb|CCD70456.1| Protein PHF-10, isoform b [Caenorhabditis elegans]
          Length = 447

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 2/80 (2%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M+ C +C   YH  C++   +   L     W C  CR+C IC +    ++ +FC RCD  
Sbjct: 345 MICCATCKIAYHPQCIEMPERMAALVKTYEWSCVDCRLCSICNKPEKEDEIVFCDRCDRG 404

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           +H YC     K +  G ++C
Sbjct: 405 FHTYCVG--LKKLPQGTWIC 422


>gi|195122592|ref|XP_002005795.1| GI20660 [Drosophila mojavensis]
 gi|193910863|gb|EDW09730.1| GI20660 [Drosophila mojavensis]
          Length = 492

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 51/120 (42%), Gaps = 2/120 (1%)

Query: 16  VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
           +A+  K     E+  +  +  C  C     EN+       ++SC  CG+  H +CL+   
Sbjct: 362 LASPKKPKLRTEREVAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTP 421

Query: 74  QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                     W+C  C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 422 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 481


>gi|410913627|ref|XP_003970290.1| PREDICTED: zinc finger protein ubi-d4-like [Takifugu rubripes]
          Length = 407

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H  CL+             W+C
Sbjct: 278 GLALPNDYCDFCLGDSTLNQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQC 337

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 338 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLSPPMTEPPEGSWSC 384


>gi|358338934|dbj|GAA39327.2| zinc finger protein ubi-d4, partial [Clonorchis sinensis]
          Length = 331

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 44/105 (41%), Gaps = 2/105 (1%)

Query: 31  SNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
           S+T   C  C      N+   +   ML C  CG+  H  CL+  A          W+C  
Sbjct: 165 SDTEYACDFCLGDARLNKKTGQPEDMLRCSDCGRCAHFTCLQFTANMVSSVRTYRWQCIE 224

Query: 89  CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+ C +C  + +  + +FC  CD  YH YC  PP      G + C
Sbjct: 225 CKTCWLCGTSENDEQMLFCDDCDRGYHMYCLSPPLSEPPEGSWSC 269


>gi|324512933|gb|ADY45341.1| PHD finger protein 10 [Ascaris suum]
          Length = 481

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 54/126 (42%), Gaps = 25/126 (19%)

Query: 30  QSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCP 87
           QS     C +C      G   + +MLSC +C  K H +C  L     N  L +   W C 
Sbjct: 344 QSGNENRCSIC------GVTNSAQMLSCATCSTKVHPDCAGLPERVVNVALNYM--WSCI 395

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPK-------HT 137
            C+ C +C +  + +  MFC RCD  YH +C     PP     +G ++C          +
Sbjct: 396 ECKKCTVCEKPDNEDAMMFCDRCDRGYHTFCVGLSAPP-----TGTWVCTNFCADQTIQS 450

Query: 138 KCHSCG 143
           KC+ C 
Sbjct: 451 KCNKCS 456


>gi|242000110|ref|XP_002434698.1| chromodomain helicase DNA binding protein, putative [Ixodes
           scapularis]
 gi|215498028|gb|EEC07522.1| chromodomain helicase DNA binding protein, putative [Ixodes
           scapularis]
          Length = 1882

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 14/101 (13%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
           E C++   ++ C +C + YH  CL+   +      WS   CP C              E 
Sbjct: 421 EVCQQGGEIILCDTCPRAYHLVCLEPELEEPPEGKWS---CPHCEGEGIQEQEEDEHMEF 477

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           CR   D  + + C  C AA+H +C +PP KNV +G + CP+
Sbjct: 478 CRVCKDGGELLCCDSCPAAFHTFCLNPPLKNVPTGKWNCPR 518


>gi|410915434|ref|XP_003971192.1| PREDICTED: zinc finger protein ubi-d4-like [Takifugu rubripes]
          Length = 398

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N    ++  ++SC  CG+  H +CL+             W+C
Sbjct: 270 GIALPNNYCDFCLGDSKTNHKTGQSEELVSCSDCGRSGHPSCLQFTPIMMAAVKTYRWQC 329

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 330 IECKCCNMCGTSENDDQLLFCDDCDRGYHMYCLNPPMSEPPEGSWSC 376


>gi|33869836|gb|AAH21191.1| DPF1 protein, partial [Homo sapiens]
          Length = 414

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 293 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 350

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 351 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404


>gi|301765980|ref|XP_002918393.1| PREDICTED: histone acetyltransferase MYST3-like [Ailuropoda
           melanoleuca]
          Length = 1702

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|356551207|ref|XP_003543969.1| PREDICTED: uncharacterized protein LOC100786712 [Glycine max]
          Length = 525

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           K YH  CL   +  +   + + W CPSC IC++C    D +K + C  CD AYH YC  P
Sbjct: 376 KYYHVRCL---SSKQLKSYGNCWYCPSC-ICQVCLTDKDDDKIVLCDGCDHAYHIYCMKP 431

Query: 122 PHKNVSSGPYLCPK 135
           P  ++  G + C K
Sbjct: 432 PQNSIPKGKWFCIK 445


>gi|356522510|ref|XP_003529889.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
           max]
          Length = 989

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 161 CCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN 220
           CC  C +L     YC +C +++  S+    VCCD C  WVH +CD IS + +   +   N
Sbjct: 360 CCKYCSKLRKSKQYCGICKRIWHHSDGGNWVCCDGCNVWVHAECDKISSKVFKDLE---N 416

Query: 221 LQYRCPTCRGE 231
             Y CP C+G+
Sbjct: 417 TDYYCPDCKGK 427


>gi|442623365|ref|NP_001260899.1| toutatis, isoform G [Drosophila melanogaster]
 gi|440214304|gb|AGB93432.1| toutatis, isoform G [Drosophila melanogaster]
          Length = 3094

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
            SN+   C+ C  GENE      ++L C  C K YH  C K    N     W  ++C    
Sbjct: 2598 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2652

Query: 87   PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
             + R C +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S
Sbjct: 2653 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2706

Query: 145  NVP 147
              P
Sbjct: 2707 RAP 2709



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
            S + C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC 
Sbjct: 2600 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2659

Query: 141  SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
             CG + P     + +      CD C R +    Y P  LKV R
Sbjct: 2660 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2696


>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
          Length = 2227

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 14/101 (13%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
           E C++   ++ C +C K YH  CL    ++     WS   CP+C              E 
Sbjct: 460 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 516

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           CR   D  + + C  C +AYH +C +PP  ++  G + CP+
Sbjct: 517 CRICKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGDWRCPR 557


>gi|194883931|ref|XP_001976050.1| GG22641 [Drosophila erecta]
 gi|190659237|gb|EDV56450.1| GG22641 [Drosophila erecta]
          Length = 3148

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
            SN+   C+ C  GENE      ++L C  C K YH  C K    N     W  ++C    
Sbjct: 2644 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2698

Query: 87   PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
             + R C +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S
Sbjct: 2699 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2752

Query: 145  NVP 147
              P
Sbjct: 2753 RAP 2755



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
            S + C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC 
Sbjct: 2646 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2705

Query: 141  SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
             CG + P     + +      CD C R +    Y P  LKV R
Sbjct: 2706 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2742


>gi|161076540|ref|NP_001097270.1| toutatis, isoform E [Drosophila melanogaster]
 gi|157400285|gb|ABV53763.1| toutatis, isoform E [Drosophila melanogaster]
          Length = 3131

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
            SN+   C+ C  GENE      ++L C  C K YH  C K    N     W  ++C    
Sbjct: 2635 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2689

Query: 87   PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
             + R C +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S
Sbjct: 2690 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2743

Query: 145  NVP 147
              P
Sbjct: 2744 RAP 2746



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
            S + C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC 
Sbjct: 2637 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2696

Query: 141  SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
             CG + P     + +      CD C R +    Y P  LKV R
Sbjct: 2697 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2733


>gi|195485690|ref|XP_002091194.1| GE13512 [Drosophila yakuba]
 gi|194177295|gb|EDW90906.1| GE13512 [Drosophila yakuba]
          Length = 3129

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
            SN+   C+ C  GENE      ++L C  C K YH  C K    N     W  ++C    
Sbjct: 2630 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2684

Query: 87   PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
             + R C +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S
Sbjct: 2685 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2738

Query: 145  NVP 147
              P
Sbjct: 2739 RAP 2741



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
            S + C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC 
Sbjct: 2632 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2691

Query: 141  SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
             CG + P     + +      CD C R +    Y P  LKV R
Sbjct: 2692 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2728


>gi|442623363|ref|NP_001260898.1| toutatis, isoform F [Drosophila melanogaster]
 gi|440214303|gb|AGB93431.1| toutatis, isoform F [Drosophila melanogaster]
          Length = 3058

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
            SN+   C+ C  GENE      ++L C  C K YH  C K    N     W  ++C    
Sbjct: 2562 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2616

Query: 87   PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
             + R C +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S
Sbjct: 2617 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2670

Query: 145  NVP 147
              P
Sbjct: 2671 RAP 2673



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
            S + C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC 
Sbjct: 2564 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2623

Query: 141  SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
             CG + P     + +      CD C R +    Y P  LKV R
Sbjct: 2624 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2660


>gi|417406854|gb|JAA50068.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
          Length = 2010

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|443693752|gb|ELT95039.1| hypothetical protein CAPTEDRAFT_19718 [Capitella teleta]
          Length = 421

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 48/106 (45%), Gaps = 2/106 (1%)

Query: 30  QSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           +S  +  C  C     EN+       ++SC  CG+  H +CL+  A          W+C 
Sbjct: 299 RSKASRYCDFCLGDDIENKKTGMKEELVSCADCGRSGHPSCLQFTANMIISVKKYPWQCI 358

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            C+ C +C  + + ++ +FC  CD  YH YC +PP      G + C
Sbjct: 359 ECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLNPPLSEPPEGSWSC 404


>gi|195399353|ref|XP_002058285.1| GJ15575 [Drosophila virilis]
 gi|194150709|gb|EDW66393.1| GJ15575 [Drosophila virilis]
          Length = 2162

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 6/112 (5%)

Query: 40   CFVGENEGCERARRM----LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
            C V +      AR M    + C SC ++ H +C++   +        +W+C  C+ C  C
Sbjct: 1896 CGVCQRTQHRNARNMPEAFIRCYSCRRRVHPSCIEMPHRMVGRVRNYNWQCADCKCCIKC 1955

Query: 96   RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
              +  P K ++C +CD  YH YC     K V  G + C + + C  CG+  P
Sbjct: 1956 GSSQQPGKMLYCEQCDRGYHIYCLG--VKTVPEGRWSCERCSICMRCGATRP 2005


>gi|380792697|gb|AFE68224.1| histone acetyltransferase KAT6B, partial [Macaca mulatta]
          Length = 1077

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|392333207|ref|XP_003752828.1| PREDICTED: histone acetyltransferase KAT6B-like [Rattus norvegicus]
          Length = 1855

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   ++SC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELVSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|6648954|gb|AAF21305.1|AF108133_1 neuro-d4 [Mus musculus]
          Length = 323

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 202 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAPVRTYRWQCI 259

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 260 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 313


>gi|148232579|ref|NP_001090745.1| D4, zinc and double PHD fingers family 1 [Xenopus (Silurana)
           tropicalis]
 gi|120537304|gb|AAI29026.1| dpf1 protein [Xenopus (Silurana) tropicalis]
          Length = 383

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 262 GSVIANGYCDFCLGGAKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 319

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 320 ECKSCILCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 365


>gi|354482358|ref|XP_003503365.1| PREDICTED: histone acetyltransferase MYST3 [Cricetulus griseus]
          Length = 1992

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSTELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|47225244|emb|CAG09744.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 393

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H  CL+             W+C
Sbjct: 267 GLALPNDYCDFCLGDSTLNQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQC 326

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 327 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLSPPMTEPPEGSWSC 373


>gi|395535344|ref|XP_003769687.1| PREDICTED: PHD finger protein 10 [Sarcophilus harrisii]
          Length = 598

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N MC +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 477 NAMCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSIIKTYPWQCMECKTC 536

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 537 IICGQPHHEEEMMFCDVCDRGYHTFC 562


>gi|12642598|gb|AAK00302.1|AF314193_1 Toutatis [Drosophila melanogaster]
          Length = 3109

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
            SN+   C+ C  GENE      ++L C  C K YH  C K    N     W  ++C    
Sbjct: 2584 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2638

Query: 87   PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
             + R C +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S
Sbjct: 2639 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2692

Query: 145  NVP 147
              P
Sbjct: 2693 RAP 2695



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
            S + C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC 
Sbjct: 2586 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2645

Query: 141  SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
             CG + P     + +      CD C R +    Y P  LKV R
Sbjct: 2646 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2682


>gi|334312613|ref|XP_001373063.2| PREDICTED: histone acetyltransferase MYST3 [Monodelphis domestica]
          Length = 1951

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|355697899|gb|EHH28447.1| Histone acetyltransferase MYST3 [Macaca mulatta]
          Length = 2099

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 304 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 363

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 364 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 403


>gi|345781619|ref|XP_003432152.1| PREDICTED: histone acetyltransferase KAT6A [Canis lupus familiaris]
          Length = 2017

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|432099934|gb|ELK28828.1| Histone acetyltransferase MYST3 [Myotis davidii]
          Length = 1861

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|426359475|ref|XP_004046999.1| PREDICTED: histone acetyltransferase KAT6A [Gorilla gorilla
           gorilla]
          Length = 2005

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|426256598|ref|XP_004023426.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
           [Ovis aries]
          Length = 1931

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|119918267|ref|XP_874495.2| PREDICTED: histone acetyltransferase KAT6A [Bos taurus]
 gi|297491293|ref|XP_002698753.1| PREDICTED: histone acetyltransferase KAT6A [Bos taurus]
 gi|296472344|tpg|DAA14459.1| TPA: MYST histone acetyltransferase (monocytic leukemia) 3 [Bos
           taurus]
          Length = 2018

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|150378463|ref|NP_001092882.1| histone acetyltransferase KAT6A [Homo sapiens]
 gi|150378493|ref|NP_006757.2| histone acetyltransferase KAT6A [Homo sapiens]
 gi|150378543|ref|NP_001092883.1| histone acetyltransferase KAT6A [Homo sapiens]
 gi|215274095|sp|Q92794.2|KAT6A_HUMAN RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
           YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
           AltName: Full=Monocytic leukemia zinc finger protein;
           AltName: Full=Runt-related transcription factor-binding
           protein 2; AltName: Full=Zinc finger protein 220
 gi|119583643|gb|EAW63239.1| MYST histone acetyltransferase (monocytic leukemia) 3, isoform
           CRA_a [Homo sapiens]
 gi|208965270|dbj|BAG72649.1| MYST histone acetyltransferase (monocytic leukemia) 3 [synthetic
           construct]
 gi|225000792|gb|AAI72379.1| MYST histone acetyltransferase (monocytic leukemia) 3 [synthetic
           construct]
          Length = 2004

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|297682771|ref|XP_002819083.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
           [Pongo abelii]
          Length = 2010

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|403303692|ref|XP_003942458.1| PREDICTED: histone acetyltransferase KAT6A [Saimiri boliviensis
           boliviensis]
          Length = 1968

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|395507492|ref|XP_003758058.1| PREDICTED: histone acetyltransferase KAT6A [Sarcophilus harrisii]
          Length = 1993

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|149410700|ref|XP_001509833.1| PREDICTED: histone acetyltransferase MYST3-like isoform 2
           [Ornithorhynchus anatinus]
          Length = 2003

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308


>gi|118101408|ref|XP_424402.2| PREDICTED: histone acetyltransferase KAT6A [Gallus gallus]
          Length = 1981

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308


>gi|348562787|ref|XP_003467190.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4-like
           [Cavia porcellus]
          Length = 358

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 237 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 294

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 295 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 348


>gi|1517914|gb|AAC50662.1| monocytic leukaemia zinc finger protein [Homo sapiens]
          Length = 2004

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|348557736|ref|XP_003464675.1| PREDICTED: histone acetyltransferase MYST3 [Cavia porcellus]
          Length = 2016

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 207 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 266

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 267 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 306


>gi|440901050|gb|ELR52053.1| Histone acetyltransferase MYST3 [Bos grunniens mutus]
          Length = 1923

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|397505596|ref|XP_003846081.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
           [Pan paniscus]
          Length = 2002

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|339522307|gb|AEJ84318.1| D4 zinc and double PHD fingers family 2 [Capra hircus]
          Length = 391

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 48/116 (41%), Gaps = 8/116 (6%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 142
             C+ C IC  + + ++ +FC  CD  YH YC  P       G +      +CH C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDACDRGYHMYCLTPSMSEPPEGSW------RCHLC 374


>gi|332241000|ref|XP_003269676.1| PREDICTED: histone acetyltransferase KAT6A [Nomascus leucogenys]
          Length = 2004

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|149470805|ref|XP_001506848.1| PREDICTED: zinc finger protein ubi-d4-like [Ornithorhynchus
           anatinus]
          Length = 425

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 44/103 (42%), Gaps = 2/103 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 263 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 322

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G
Sbjct: 323 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPPMSEPPEG 365


>gi|114619920|ref|XP_001140373.1| PREDICTED: histone acetyltransferase KAT6A isoform 1 [Pan
           troglodytes]
 gi|332826020|ref|XP_003311745.1| PREDICTED: histone acetyltransferase KAT6A [Pan troglodytes]
          Length = 2002

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|395857485|ref|XP_003801122.1| PREDICTED: histone acetyltransferase KAT6A [Otolemur garnettii]
          Length = 2002

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|326932701|ref|XP_003212452.1| PREDICTED: histone acetyltransferase MYST3-like [Meleagris
           gallopavo]
          Length = 1981

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308


>gi|74197305|dbj|BAC32253.2| unnamed protein product [Mus musculus]
          Length = 933

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|402878102|ref|XP_003902742.1| PREDICTED: histone acetyltransferase KAT6A [Papio anubis]
          Length = 2010

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|431902227|gb|ELK08728.1| Histone acetyltransferase MYST3 [Pteropus alecto]
          Length = 1731

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|312081277|ref|XP_003142959.1| hypothetical protein LOAG_07378 [Loa loa]
          Length = 147

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 52/116 (44%), Gaps = 2/116 (1%)

Query: 20  SKDIAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRD 77
           +  IA       + + +C LC     +N+   +  +++SC  CG+  H +CLK       
Sbjct: 12  AASIAALLSPSIDVSTVCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLT 71

Query: 78  LFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                 W+C  C+ C IC  + + ++ +FC  CD  +H YC  PP      G + C
Sbjct: 72  STGKYGWQCIECKSCAICGFSDNDDQLLFCDDCDRGFHLYCLRPPLSQAPEGEWSC 127


>gi|224120768|ref|XP_002318412.1| SET domain protein [Populus trichocarpa]
 gi|222859085|gb|EEE96632.1| SET domain protein [Populus trichocarpa]
          Length = 1078

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 6/96 (6%)

Query: 139 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
           C  CG+++P      +      G   C  C RL    ++C +C KV+  S+S   V CD 
Sbjct: 407 CEGCGTSLPLKPAKKIKGTSPGGQLLCKTCARLTKSKHFCGICKKVWNHSDSGSWVRCDG 466

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
           C+ WVH +CD IS  +   F+  G   Y CP C+ +
Sbjct: 467 CKVWVHAECDKISSNR---FKDLGGTDYYCPACKAK 499


>gi|118344120|ref|NP_001071881.1| zinc finger protein [Ciona intestinalis]
 gi|70571741|dbj|BAE06812.1| zinc finger protein [Ciona intestinalis]
          Length = 667

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 3/100 (3%)

Query: 35  VMCRLCFV-GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
           ++C +C   G +     A  ++ C  C    H +CL+   +   +    +W+C  C+ C 
Sbjct: 399 IVCGICSKDGSSNKKGEAEELIKCSQCDNHGHPSCLEMSVEQVSVIETYNWQCMECKTCT 458

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           IC      +  MFC RCD  YH +C     + + SG + C
Sbjct: 459 ICSMPHREDLMMFCDRCDRGYHTFCVS--LRAIPSGVWAC 496


>gi|226498206|ref|NP_001147779.1| LOC100281389 [Zea mays]
 gi|195613724|gb|ACG28692.1| PHD-finger family protein [Zea mays]
 gi|219885501|gb|ACL53125.1| unknown [Zea mays]
 gi|413921539|gb|AFW61471.1| PHD-finger family [Zea mays]
          Length = 558

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 50/105 (47%), Gaps = 15/105 (14%)

Query: 36  MCRLCFVGENEGCERARRMLSCKS--CGKK-YHRNCLKNWA----QNRDLFHWSSWKCPS 88
           +C+LC   E+E     ++ + C    C  K YH  CLK       + R+L     W CPS
Sbjct: 419 LCKLCGTCEDEN----KKFVVCGHGYCSFKFYHALCLKESQIASEKQRNL---KCWYCPS 471

Query: 89  CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C +C  C +  D  K + C  CD AYH YC  PP  +V  G + C
Sbjct: 472 C-LCRRCFKNKDDEKIVLCDGCDEAYHTYCMDPPRSSVPRGKWFC 515


>gi|195333469|ref|XP_002033414.1| GM20421 [Drosophila sechellia]
 gi|194125384|gb|EDW47427.1| GM20421 [Drosophila sechellia]
          Length = 2123

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 1633 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 1687

Query: 93   EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S  P
Sbjct: 1688 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 1738



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
            C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 1633 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 1692

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + P     + +      CD C R +    Y P  LKV R
Sbjct: 1693 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 1725


>gi|449488248|ref|XP_004176107.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
           [Taeniopygia guttata]
          Length = 2010

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308


>gi|296222095|ref|XP_002757039.1| PREDICTED: histone acetyltransferase KAT6A [Callithrix jacchus]
          Length = 2003

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|161076538|ref|NP_523701.3| toutatis, isoform A [Drosophila melanogaster]
 gi|157400284|gb|AAF58638.3| toutatis, isoform A [Drosophila melanogaster]
          Length = 2999

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 2509 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2563

Query: 93   EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S  P
Sbjct: 2564 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2614



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
            C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 2509 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2568

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + P     + +      CD C R +    Y P  LKV R
Sbjct: 2569 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2601


>gi|344281584|ref|XP_003412558.1| PREDICTED: histone acetyltransferase MYST3 [Loxodonta africana]
          Length = 2011

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    ++   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|224082358|ref|XP_002306661.1| predicted protein [Populus trichocarpa]
 gi|222856110|gb|EEE93657.1| predicted protein [Populus trichocarpa]
          Length = 524

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 37/73 (50%), Gaps = 4/73 (5%)

Query: 61  GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
           GK YH  CL N    + + +   W CPSC +C  C    D +K + C  CD AYH YC  
Sbjct: 377 GKYYHVRCLTN---RQLILYGPRWYCPSC-LCRGCLTDKDDDKIVLCDGCDHAYHLYCMI 432

Query: 121 PPHKNVSSGPYLC 133
           PP  +V  G + C
Sbjct: 433 PPRISVPKGKWFC 445


>gi|124487239|ref|NP_001074618.1| histone acetyltransferase KAT6A [Mus musculus]
 gi|148700926|gb|EDL32873.1| mCG13090 [Mus musculus]
          Length = 2003

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|168045004|ref|XP_001774969.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673716|gb|EDQ60235.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 293

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 77/159 (48%), Gaps = 22/159 (13%)

Query: 604 VLQSLPKDSKPP---LRLKFRK-PNLE------NQNSQVSQPEEEKSLIKGQRSKRKRPS 653
           +L S P    P    LRLK +K P  E      +++++ S+ + E ++   Q SK+KR +
Sbjct: 142 LLHSSPGSDSPVTRRLRLKIKKQPAREVVIPTTSKDNKASEHDGELAIKAHQTSKKKRTA 201

Query: 654 PFTEKTLFNEDEDAAQSNQDSLM--SEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKG 711
                       +A  S Q  L       D +WIL++LG DAI KRVEV    DN W+KG
Sbjct: 202 --------TPQRNAEGSTQRRLAIGDATDDDSWILQRLGTDAISKRVEVFWPIDNIWYKG 253

Query: 712 VVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
            +      ++   +  DD   +TLE GK+ VR +   +K
Sbjct: 254 TIVAVF--STQFCVDYDDGDQETLEFGKEKVRLLSTTKK 290


>gi|158300661|ref|XP_320523.4| AGAP012009-PA [Anopheles gambiae str. PEST]
 gi|157013268|gb|EAA00692.4| AGAP012009-PA [Anopheles gambiae str. PEST]
          Length = 2037

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 14/101 (13%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
           E C++   ++ C +C K YH  CL    ++     WS   CP+C              E 
Sbjct: 417 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 473

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           CR   D  + + C  C +AYH +C +PP  ++  G + CP+
Sbjct: 474 CRVCKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGEWRCPR 514


>gi|68565903|sp|Q8BZ21.2|KAT6A_MOUSE RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
           YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
           AltName: Full=Monocytic leukemia zinc finger homolog;
           AltName: Full=Monocytic leukemia zinc finger protein
          Length = 2003

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
           echinatior]
          Length = 1304

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/202 (27%), Positives = 77/202 (38%), Gaps = 55/202 (27%)

Query: 25  GEEQGQSNTNVMCRLCFVGENEGC----ERARRMLSC--KSCGKKYHRNCLKNWAQNRDL 78
           G+   Q +    C  C  G    C    ER    + C   +CGK YH NCL +W Q+   
Sbjct: 583 GKSDEQEDETFKCIDCLSGVAPACFLCNEREGDRIRCIVPACGKHYHSNCLLSWPQS--- 639

Query: 79  FHWSSWK--CPSCRICEIC-------RRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNV-- 126
            HW   +  CP   +C  C       +R+  PN+ M  C RC ++YH      P  +V  
Sbjct: 640 -HWQGGRLTCPY-HVCHTCSSDNPQDKRSRAPNEKMARCVRCPSSYHASTLCLPAGSVIL 697

Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
           ++   +CPKH K        P   L+  W      C  C R                   
Sbjct: 698 TANQIICPKHYK-------APHPPLNAAW------CFLCTR------------------- 725

Query: 187 STPMVCCDVCQRWVHCQCDGIS 208
              ++CCD C    H +C GI+
Sbjct: 726 GGSLICCDTCPTSFHLECLGIN 747


>gi|195150317|ref|XP_002016101.1| GL10676 [Drosophila persimilis]
 gi|194109948|gb|EDW31991.1| GL10676 [Drosophila persimilis]
          Length = 3244

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 50/123 (40%), Gaps = 17/123 (13%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
            SN+   C+ C  GENE      ++L C  C K YH  C K    N     W  ++C    
Sbjct: 2867 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2921

Query: 87   PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
             + R C +C   R     K ++C  C  AYH  C  PP   V  G +       CH C +
Sbjct: 2922 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIA 2975

Query: 145  NVP 147
              P
Sbjct: 2976 RAP 2978



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
            S + C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC 
Sbjct: 2869 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2928

Query: 141  SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
             CG + P     + +      CD C R +    Y P  LKV R
Sbjct: 2929 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2965


>gi|327286450|ref|XP_003227943.1| PREDICTED: histone acetyltransferase MYST3-like [Anolis
           carolinensis]
          Length = 2017

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308


>gi|338710071|ref|XP_001916345.2| PREDICTED: zinc finger protein neuro-d4-like [Equus caballus]
          Length = 205

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 84  GTVIPNGYCDFCLGGSKKTGCPE--DLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 141

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G +   LC +H K
Sbjct: 142 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 195


>gi|148669525|gb|EDL01472.1| mCG123147, isoform CRA_c [Mus musculus]
          Length = 938

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 216 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 275

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 276 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 325


>gi|194752946|ref|XP_001958780.1| GF12391 [Drosophila ananassae]
 gi|190620078|gb|EDV35602.1| GF12391 [Drosophila ananassae]
          Length = 3047

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 2542 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2596

Query: 93   EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S  P
Sbjct: 2597 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2647



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
            C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 2542 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2601

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + P     + +      CD C R +    Y P  LKV R
Sbjct: 2602 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2634


>gi|195582482|ref|XP_002081057.1| GD25895 [Drosophila simulans]
 gi|194193066|gb|EDX06642.1| GD25895 [Drosophila simulans]
          Length = 2944

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 2454 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2508

Query: 93   EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S  P
Sbjct: 2509 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2559



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
            C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 2454 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2513

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + P     + +      CD C R +    Y P  LKV R
Sbjct: 2514 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2546


>gi|170036699|ref|XP_001846200.1| chromodomain helicase-DNA-binding protein 3 [Culex
           quinquefasciatus]
 gi|167879513|gb|EDS42896.1| chromodomain helicase-DNA-binding protein 3 [Culex
           quinquefasciatus]
          Length = 1982

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 46/102 (45%), Gaps = 15/102 (14%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI------------CE 93
           E C++   ++ C +C K YH  CL+   ++     WS   CP+C               E
Sbjct: 403 EVCQQGGEIILCDTCPKAYHLVCLEPELEDTPEGKWS---CPTCEADGGVAEDDDDEHQE 459

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
            CR   D  + + C  C +AYH +C  PP  ++  G + CP+
Sbjct: 460 FCRICKDGGELLCCDMCPSAYHTFCLTPPLDDIPDGDWRCPR 501


>gi|63146269|gb|AAH95974.1| Myst4 protein, partial [Mus musculus]
          Length = 828

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|255544976|ref|XP_002513549.1| trithorax, putative [Ricinus communis]
 gi|223547457|gb|EEF48952.1| trithorax, putative [Ricinus communis]
          Length = 1018

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)

Query: 139 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
           C  CG ++P      +      G   C  C +L    +YC +C K++  S+S   V CD 
Sbjct: 351 CEGCGVSLPFKLSKKMKSSITGGQFLCKTCAKLTKLKHYCGICKKIWNHSDSGSWVRCDG 410

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
           C+ WVH +CD IS+ +   F+  G   Y CP C+ +
Sbjct: 411 CKVWVHAECDKISNSR---FKDLGATDYYCPACKAK 443


>gi|56001099|dbj|BAD72833.1| monocytic leukemia zinc finger protein [Rattus norvegicus]
          Length = 1991

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 201 ICSFCLGTKEQNREKKPEDLISCADCGNSGHPSCLKFSPELTVRVRALRWQCIECKTCSS 260

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 261 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 300


>gi|213972547|ref|NP_001094040.1| histone acetyltransferase KAT6A [Rattus norvegicus]
 gi|68565633|sp|Q5TKR9.2|KAT6A_RAT RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
           YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
           AltName: Full=Monocytic leukemia zinc finger homolog;
           AltName: Full=Monocytic leukemia zinc finger protein
 gi|149057780|gb|EDM09023.1| MYST histone acetyltransferase (monocytic leukemia) 3 [Rattus
           norvegicus]
          Length = 1998

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEDLISCADCGNSGHPSCLKFSPELTVRVRALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|351707582|gb|EHB10501.1| Histone acetyltransferase MYST3 [Heterocephalus glaber]
          Length = 2068

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTIRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLIRMPKGMWIC 307


>gi|195027235|ref|XP_001986489.1| GH20497 [Drosophila grimshawi]
 gi|193902489|gb|EDW01356.1| GH20497 [Drosophila grimshawi]
          Length = 3415

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 50/123 (40%), Gaps = 17/123 (13%)

Query: 31   SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
            SN+   C+ C  GENE      ++L C  C K YH  C K    N     W  ++C    
Sbjct: 2871 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2925

Query: 87   PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
             + R C +C   R     K ++C  C  AYH  C  PP   V  G +       CH C +
Sbjct: 2926 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIT 2979

Query: 145  NVP 147
              P
Sbjct: 2980 RAP 2982



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)

Query: 88   SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
            S + C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC 
Sbjct: 2873 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2932

Query: 141  SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
             CG + P     + +      CD C R +    Y P  LKV R
Sbjct: 2933 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2969


>gi|357631309|gb|EHJ78887.1| hypothetical protein KGM_12125 [Danaus plexippus]
          Length = 501

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/100 (26%), Positives = 47/100 (47%), Gaps = 2/100 (2%)

Query: 22  DIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW 81
           ++  +E     T+ +C +C + +  G     R++ C+ C  K H +CL++ +        
Sbjct: 257 EMNNDEPLSHETSGVCTVCLIQKTRGSND--RLVECRDCNNKAHLSCLQSGSGILKPRPD 314

Query: 82  SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           ++W+CP C+ C +C  T D      C  C  +YH  C  P
Sbjct: 315 NTWQCPHCKTCVVCCETNDAGILTVCSICSDSYHALCHTP 354


>gi|326923554|ref|XP_003208000.1| PREDICTED: histone acetyltransferase MYST4-like [Meleagris
           gallopavo]
          Length = 2028

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 61/134 (45%), Gaps = 8/134 (5%)

Query: 8   ARRFESGYVAT---ASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCG-- 61
           A ++ SG+ A+    S     ++Q +++   +C  C    E+   ++   +LSC  CG  
Sbjct: 184 APKYSSGFPASLPPVSLLPHEKDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSS 243

Query: 62  -KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQ 119
            K  H +CLK   +         W+C  C+ C  CR  G + +  +FC  CD  +H  C 
Sbjct: 244 GKLEHPSCLKFCPELTSNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECC 303

Query: 120 HPPHKNVSSGPYLC 133
            PP   +  G ++C
Sbjct: 304 DPPLSRMPKGMWIC 317


>gi|195352984|ref|XP_002042990.1| GM16309 [Drosophila sechellia]
 gi|194127055|gb|EDW49098.1| GM16309 [Drosophila sechellia]
          Length = 1418

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 86/229 (37%), Gaps = 75/229 (32%)

Query: 25   GEEQGQSNTNVMCRLCFVGENEGC-----------------ERAR-------RMLSCKS- 59
             +E G   T+++C  C VGE EGC                 E A        ++L+C   
Sbjct: 835  AKEVGAVGTSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDAPSHTPIEDKLLTCSQP 894

Query: 60   -CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRR 109
             CGK++H +C K W Q     H  S +CP   +C  C  + DP         +K   C R
Sbjct: 895  VCGKRFHTSCCKYWPQASSSKH--SARCPR-HVCHTC-VSNDPSGRFQQLGSSKLAKCVR 950

Query: 110  CDAAYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
            C A YH   +C     + +++   +CP+H        N+      V     Y C      
Sbjct: 951  CPATYHQDSHCIPAGTQMLNATNIICPRH--------NIAKADAHVNVLWCYIC------ 996

Query: 168  LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
              VKG                 +VCC+ C   VH  C  I   ++E Y+
Sbjct: 997  --VKGGE---------------LVCCETCPIAVHAHCRNIPIKTNENYI 1028


>gi|154757359|gb|AAI51762.1| MYST4 protein [Bos taurus]
          Length = 349

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 204 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 263

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 264 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 313


>gi|334324322|ref|XP_001381625.2| PREDICTED: PHD finger protein 10-like [Monodelphis domestica]
          Length = 662

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N MC +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 541 NAMCGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSIIKTYPWQCMECKTC 600

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 601 IICGQPHHEEEMMFCDVCDRGYHTFC 626


>gi|327272024|ref|XP_003220786.1| PREDICTED: PHD finger protein 10-like [Anolis carolinensis]
          Length = 489

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 2/102 (1%)

Query: 18  TASKDIAGEEQGQSNTNVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNR 76
           + SK + G  + ++  N +C +C  G E+    +A  ++ C  C    H +CL   A+  
Sbjct: 353 SVSKSVPGY-KPKAFPNAICGICLKGKESNKKGKAEALIHCSQCENSGHPSCLDMSAELV 411

Query: 77  DLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
            +     W+C  C+ C IC +     + MFC  CD  YH +C
Sbjct: 412 AIIKTYPWQCMECKTCIICGQPHHEEEMMFCDLCDRGYHTFC 453


>gi|327276253|ref|XP_003222884.1| PREDICTED: zinc finger protein neuro-d4-like [Anolis carolinensis]
          Length = 388

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 267 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 324

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 325 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSC 370


>gi|29387208|gb|AAH48199.1| MYST4 protein, partial [Homo sapiens]
 gi|33874216|gb|AAH14143.1| MYST4 protein, partial [Homo sapiens]
          Length = 325

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314


>gi|297299302|ref|XP_001094798.2| PREDICTED: histone acetyltransferase MYST3 [Macaca mulatta]
          Length = 1905

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 73/191 (38%), Gaps = 18/191 (9%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
           CR  G + +  +FC  CD  +H  C  PP   +  G  L        +C   +   G+  
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGNTLA-----VEACSGEL---GVEA 319

Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQR----WVHCQCDGISD 209
            W +   C  A     VKG  C      +R       V   +C++    W+     GI  
Sbjct: 320 GWHMS-ACVGAFRGAKVKGG-CGQRGTSHRSHSGQTCVVVVMCKQLYWNWISTFTSGIRL 377

Query: 210 EKYLQFQVDGN 220
           E+  Q   DGN
Sbjct: 378 EE--QLAADGN 386


>gi|356518627|ref|XP_003527980.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
           max]
          Length = 1067

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 6/96 (6%)

Query: 139 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
           C +CG ++P   L         G   C  C RL    +YC +C KV+  S+S   V CD 
Sbjct: 403 CEACGLSLPYKMLKKTKDSSPGGQFLCRTCARLTKSKHYCGICKKVWNHSDSGSWVRCDG 462

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
           C+ WVH +CD IS   +   +      Y CPTC+ +
Sbjct: 463 CKVWVHAECDKISSNLFKNLE---GTDYYCPTCKAK 495


>gi|392353369|ref|XP_003751480.1| PREDICTED: histone acetyltransferase KAT6B-like [Rattus norvegicus]
          Length = 1640

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 51/110 (46%), Gaps = 2/110 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   ++SC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELVSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315


>gi|291409041|ref|XP_002720841.1| PREDICTED: MYST histone acetyltransferase 2-like [Oryctolagus
           cuniculus]
          Length = 1806

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|359492251|ref|XP_002284634.2| PREDICTED: uncharacterized protein LOC100247132 [Vitis vinifera]
          Length = 386

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 11/117 (9%)

Query: 22  DIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKS--C-GKKYHRNCLKNWAQNRDL 78
           ++  ++Q ++    +C++C        E    +L C    C  K YH++CL +       
Sbjct: 217 EVGIQQQKETKYFQLCKIC----GSDMEFGEHLLECGHPFCPNKYYHKSCLTSTELR--- 269

Query: 79  FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
            +   W CPSC +C  C    D  K + C  CD AYH YC +PP  ++  G + C K
Sbjct: 270 MYGPCWYCPSC-LCRACLTDRDDEKIILCDGCDHAYHIYCMNPPRTSIPRGKWFCRK 325


>gi|4808456|dbj|BAA77571.1| Requiem protein [Xenopus laevis]
          Length = 198

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 74  GLALPNNYCDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 133

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 134 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 180


>gi|4808454|dbj|BAA77570.1| Requiem protein [Xenopus laevis]
          Length = 386

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 46/107 (42%), Gaps = 4/107 (3%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  ++SC  CG+  H +CL+  A          W+C
Sbjct: 264 GIALPNNYCDFCLGDSKINKKTNQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQC 323

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  +   N  +FC  CD  YH YC  PP      G + C
Sbjct: 324 IECKCCNICGTSE--NDLLFCDDCDRGYHMYCLVPPVAEPPEGSWSC 368


>gi|55726215|emb|CAH89880.1| hypothetical protein [Pongo abelii]
          Length = 1275

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|410912120|ref|XP_003969538.1| PREDICTED: PHD finger protein 10-like [Takifugu rubripes]
          Length = 493

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 3/87 (3%)

Query: 34  NVMCRLCFVGENEGCERAR--RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           + +C +C  G  E  +R R   ++ C  C    H +CL    +   +    SW+C  C+ 
Sbjct: 368 SAICGICQKG-REANKRGRPEALIHCSQCDNSGHPSCLDMSGELVSVIQTYSWQCMECKT 426

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYC 118
           C +C++    ++ MFC +CD  YH +C
Sbjct: 427 CTVCQQPHHEDEMMFCDKCDRGYHTFC 453


>gi|148745647|gb|AAI42660.1| MYST3 protein [Homo sapiens]
          Length = 1149

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|355779658|gb|EHH64134.1| Histone acetyltransferase MYST3 [Macaca fascicularis]
          Length = 2276

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 444 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 503

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 504 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 543


>gi|198457110|ref|XP_001360553.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
 gi|198135863|gb|EAL25128.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
          Length = 3214

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 2720 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2774

Query: 93   EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +C   R     K ++C  C  AYH  C  PP   V  G +       CH C +  P
Sbjct: 2775 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIARAP 2825



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
            C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 2720 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2779

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + P     + +      CD C R +    Y P  LKV R
Sbjct: 2780 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2812


>gi|449455758|ref|XP_004145618.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Cucumis
           sativus]
          Length = 1073

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 3/70 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C +C RL    +YC +C K++  S+S   V CD C+ WVH +CD IS      F+  G+ 
Sbjct: 434 CKSCTRLTNSKHYCGICKKIWNHSDSGSWVRCDGCKVWVHAECDKISSN---LFKDLGST 490

Query: 222 QYRCPTCRGE 231
            Y CPTC+ +
Sbjct: 491 DYFCPTCKAK 500


>gi|321470558|gb|EFX81534.1| hypothetical protein DAPPUDRAFT_347174 [Daphnia pulex]
          Length = 1890

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 45/108 (41%), Gaps = 12/108 (11%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  G+ E      ++L C  C K YH  C +    N     W  ++C       R C
Sbjct: 1611 CQFCHSGDKED-----QLLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQRNC 1665

Query: 93   EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCH 140
             +C + G+    + C +C  AYH  C  PP   V  G +LC     CH
Sbjct: 1666 IVCGKPGNKTISVLCDQCPKAYHIECLQPPLAKVPRGKWLC---VLCH 1710



 Score = 46.2 bits (108), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 39/100 (39%), Gaps = 17/100 (17%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK-------HTKCHSCGS 144
            C+ C      ++ + C  CD  YH YC  PP  N+  G + C +          C  CG 
Sbjct: 1611 CQFCHSGDKEDQLLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQRNCIVCGK 1670

Query: 145  NVPGN-GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
              PGN  +SV        CD C + +      P   KV R
Sbjct: 1671 --PGNKTISV-------LCDQCPKAYHIECLQPPLAKVPR 1701


>gi|195436452|ref|XP_002066182.1| GK22224 [Drosophila willistoni]
 gi|194162267|gb|EDW77168.1| GK22224 [Drosophila willistoni]
          Length = 3148

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 2657 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2711

Query: 93   EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +C   R     K ++C  C  AYH  C  PP   V  G +       CH C +  P
Sbjct: 2712 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2762



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
            C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 2657 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2716

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + P     + +      CD C R +    Y P  LKV R
Sbjct: 2717 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2749


>gi|74228562|dbj|BAE25366.1| unnamed protein product [Mus musculus]
          Length = 1291

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|344306723|ref|XP_003422034.1| PREDICTED: PHD finger protein 10-like [Loxodonta africana]
          Length = 533

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 412 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTVELVSMIKTYPWQCMECKTC 471

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            +C +    ++ MFC  CD  YH +C
Sbjct: 472 IVCGQPHHEDEMMFCDVCDRGYHTFC 497


>gi|255564717|ref|XP_002523353.1| trithorax, putative [Ricinus communis]
 gi|223537441|gb|EEF39069.1| trithorax, putative [Ricinus communis]
          Length = 1057

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 42/93 (45%), Gaps = 5/93 (5%)

Query: 139 CHSCGSNVPGNGLS--VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVC 196
           C SC   +P   +      F     C  C +L     YC +C K++  S+    VCCD C
Sbjct: 399 CDSCNLILPCKTIKRKASVFQTELICKHCAKLRKSKQYCGICKKIWHHSDGGNWVCCDGC 458

Query: 197 QRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
             WVH +CD IS + +   +   N  Y CP CR
Sbjct: 459 NVWVHAECDNISRKLFKDLE---NFDYYCPDCR 488


>gi|345316943|ref|XP_001509649.2| PREDICTED: PHD finger protein 10, partial [Ornithorhynchus
           anatinus]
          Length = 468

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 347 NAICGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTTELVSMIKTYPWQCMECKTC 406

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 407 IICGQPHHEEEMMFCDVCDRGYHTFC 432


>gi|6850865|emb|CAB71104.1| putative protein [Arabidopsis thaliana]
          Length = 902

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C +L     YC +C +++  S+    VCCD C  WVH +CD I++E++ + + +   
Sbjct: 338 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHNN-- 395

Query: 222 QYRCPTCR 229
            Y CP C+
Sbjct: 396 -YYCPDCK 402


>gi|358336360|dbj|GAA54889.1| histone acetyltransferase MYST3 [Clonorchis sinensis]
          Length = 1190

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 30/68 (44%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
            L CK CG + H  CL  W +  +    S W+C  C+ C +C+        + C  CD  
Sbjct: 808 FLICKDCGLRAHPTCLDYWPELTERARQSPWQCTDCKTCTVCQNKQITTDLLVCDACDKG 867

Query: 114 YHCYCQHP 121
           +H  C  P
Sbjct: 868 FHIECHVP 875


>gi|195447676|ref|XP_002071320.1| GK25190 [Drosophila willistoni]
 gi|194167405|gb|EDW82306.1| GK25190 [Drosophila willistoni]
          Length = 2262

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 11/113 (9%)

Query: 14   GYVATASKDIAGEEQGQSNTNVM----CRLCFVGENEGCERARRM----LSCKSCGKKYH 65
            G  + AS D   +++ Q+N+ +     C +C   ++     AR M    + C SC +  H
Sbjct: 1842 GASSVASSDNEQQQRPQNNSRLSHGSNCGVCLRNQHRN---ARNMPEAFIRCYSCRRNVH 1898

Query: 66   RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
             +C++   +        +W+C  C+ C  CRR     K ++C +CD  YH YC
Sbjct: 1899 PSCIEMPQRMLGRVRNYNWQCAECKCCIKCRRRQKEGKMLYCEQCDRGYHIYC 1951


>gi|297679669|ref|XP_002817646.1| PREDICTED: PHD finger protein 10 [Pongo abelii]
          Length = 498

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 377 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 436

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 437 IICGQPHHEEEMMFCDMCDRGYHTFC 462


>gi|345778329|ref|XP_532272.3| PREDICTED: PHD finger protein 10 isoform 1 [Canis lupus familiaris]
          Length = 410

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374


>gi|338722861|ref|XP_001499514.3| PREDICTED: PHD finger protein 10 [Equus caballus]
          Length = 451

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E     +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 330 NALCGICLKGKETNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 390 IICGQPHHEEEMMFCDVCDRGYHTFC 415


>gi|195123885|ref|XP_002006432.1| GI21040 [Drosophila mojavensis]
 gi|193911500|gb|EDW10367.1| GI21040 [Drosophila mojavensis]
          Length = 2976

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 2620 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2674

Query: 93   EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +C   R     K ++C  C  AYH  C  PP   V  G +       CH C +  P
Sbjct: 2675 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2725



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
            C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 2620 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2679

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + P     + +      CD C R +    Y P  LKV R
Sbjct: 2680 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2712


>gi|432852854|ref|XP_004067418.1| PREDICTED: PHD finger protein 10-like [Oryzias latipes]
          Length = 442

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    R   ++ C  C    H +CL   ++   +     W+C  C+ C
Sbjct: 317 NAICGICQKGKESNKKGRPEALIHCSQCDNSGHPSCLDMSSELVSVIQTYRWQCMECKTC 376

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            +C++    ++ MFC +CD  YH +C
Sbjct: 377 TVCQQPHHEDEMMFCDKCDRGYHTFC 402


>gi|402868771|ref|XP_003898462.1| PREDICTED: PHD finger protein 10 [Papio anubis]
          Length = 498

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 2/103 (1%)

Query: 17  ATASKDIAGEEQGQSNTNVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
           +  SK + G  + ++  N +C +C  G E+    +A  ++ C  C    H +CL    + 
Sbjct: 361 SVLSKSVPGY-KPKAIPNALCGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVEL 419

Query: 76  RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
             +     W+C  C+ C IC +     + MFC  CD  YH +C
Sbjct: 420 VSMIKTYPWQCMECKTCIICGQPHHEEEMMFCDVCDRGYHTFC 462


>gi|357130254|ref|XP_003566765.1| PREDICTED: uncharacterized protein LOC100821699 [Brachypodium
           distachyon]
          Length = 918

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 55/124 (44%), Gaps = 9/124 (7%)

Query: 14  GYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSC--KSCGKKY-HRNCLK 70
             V + ++   GEE         C++C   E++     +R L C    C  KY H  CLK
Sbjct: 739 AIVISCTEPAEGEELPNIVVGGSCKMCGTPEDDD----KRFLICGHSHCPYKYYHIRCLK 794

Query: 71  N-WAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
           +    ++       W CPSC +C +C   GD  + + C  CD AYH YC  P   +V  G
Sbjct: 795 SKQIASKVQRDKPCWYCPSC-LCRVCLSDGDDEQTILCDGCDEAYHLYCMTPRRTSVPKG 853

Query: 130 PYLC 133
            + C
Sbjct: 854 KWYC 857


>gi|296439269|sp|Q4V7A6.2|PHF10_RAT RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
           factor 45a; Short=BAF45a
          Length = 497

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 376 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 435

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 436 IICGQPHHEEEMMFCDVCDRGYHTFC 461


>gi|154152087|ref|NP_001093826.1| zinc finger protein ubi-d4 [Bos taurus]
 gi|296218741|ref|XP_002755572.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Callithrix
           jacchus]
 gi|426252022|ref|XP_004019718.1| PREDICTED: zinc finger protein ubi-d4 [Ovis aries]
 gi|118582243|gb|ABL07500.1| zinc-finger protein ubi-d4 [Capra hircus]
 gi|151557067|gb|AAI49970.1| DPF2 protein [Bos taurus]
 gi|152941218|gb|ABS45046.1| D4, zinc and double PHD fingers family 2 [Bos taurus]
 gi|296471617|tpg|DAA13732.1| TPA: D4, zinc and double PHD fingers family 2 [Bos taurus]
 gi|417400089|gb|JAA47013.1| Putative transcription factor requiem/neuro-d4 [Desmodus rotundus]
          Length = 391

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|145332921|ref|NP_001078326.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
 gi|332646730|gb|AEE80251.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
          Length = 982

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C +L     YC +C +++  S+    VCCD C  WVH +CD I++E++ + + +   
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408

Query: 222 QYRCPTCR 229
            Y CP C+
Sbjct: 409 NYYCPDCK 416


>gi|291414421|ref|XP_002723450.1| PREDICTED: D4, zinc and double PHD fingers family 2-like
           [Oryctolagus cuniculus]
          Length = 388

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 262 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 321

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 322 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 368


>gi|308488788|ref|XP_003106588.1| hypothetical protein CRE_15947 [Caenorhabditis remanei]
 gi|308253938|gb|EFO97890.1| hypothetical protein CRE_15947 [Caenorhabditis remanei]
          Length = 452

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 4/92 (4%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M+ C  C   YH  C++   +   L     W C  CR+C IC +    N+ +FC +CD  
Sbjct: 352 MICCSVCQIVYHPRCIEMPDRMAALVRTYEWSCVDCRVCSICNKPEKENEIVFCDKCDRG 411

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN 145
           +H +C     K++  G ++C   T C     N
Sbjct: 412 FHTFCVG--LKSLPRGTWIC--DTYCSETNRN 439


>gi|341942257|sp|Q9D8M7.4|PHF10_MOUSE RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
           factor 45a; Short=BAF45a
          Length = 497

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 376 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 435

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 436 IICGQPHHEEEMMFCDVCDRGYHTFC 461


>gi|390470774|ref|XP_003734353.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Callithrix
           jacchus]
          Length = 405

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 279 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 338

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 339 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 385


>gi|74208866|dbj|BAE21185.1| unnamed protein product [Mus musculus]
          Length = 1148

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|54020946|ref|NP_001005717.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
           tropicalis]
 gi|49522323|gb|AAH75306.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
           tropicalis]
          Length = 428

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 41/90 (45%)

Query: 44  ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
           EN+       M+SC  CG+  H +CL+             W+C  C+ C +C  + + ++
Sbjct: 321 ENKKTGSKEEMVSCADCGRSGHPSCLQFSPNMIISVKKYPWQCIECKSCGLCGTSDNDDQ 380

Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +FC  CD  YH YC  PP      G + C
Sbjct: 381 LLFCDDCDRGYHMYCLKPPLSEPPEGSWSC 410


>gi|443725765|gb|ELU13216.1| hypothetical protein CAPTEDRAFT_167868 [Capitella teleta]
          Length = 236

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 52/122 (42%), Gaps = 16/122 (13%)

Query: 35  VMCRLCFVGENEGCER---ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           V+C  C   +   C R      +L C  C  K H +C+   +        S W+C  C+ 
Sbjct: 14  VLCDYCL--QTASCNRKGAQEDLLICTDCQAKAHPSCMDYSSDLARRARRSPWQCIDCKT 71

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-----------PKHTKCH 140
           C +C   GDP+  +FC  CD  YH  C  P  ++  +G ++C           P+ T C 
Sbjct: 72  CCLCEDAGDPDAMLFCDACDKGYHMSCHSPVIEDKPTGKWVCSRCCQEIEADAPETTFCG 131

Query: 141 SC 142
           SC
Sbjct: 132 SC 133


>gi|356518577|ref|XP_003527955.1| PREDICTED: uncharacterized protein LOC100795906 [Glycine max]
          Length = 646

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 4/74 (5%)

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           K YH  CL     N+   +   W CPSC +C +C    D ++ + C  CD AYH YC  P
Sbjct: 499 KYYHVRCL---TINQLKSYGHCWYCPSC-LCRVCLTDQDDDRIVLCDGCDHAYHIYCMKP 554

Query: 122 PHKNVSSGPYLCPK 135
           P  ++  G + C K
Sbjct: 555 PRTSIPRGNWFCRK 568


>gi|195382825|ref|XP_002050129.1| GJ21968 [Drosophila virilis]
 gi|194144926|gb|EDW61322.1| GJ21968 [Drosophila virilis]
          Length = 3086

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
            C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 2585 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2639

Query: 93   EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +C   R     K ++C  C  AYH  C  PP   V  G +       CH C +  P
Sbjct: 2640 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2690



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
            C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 2585 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2644

Query: 145  NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
            + P     + +      CD C R +    Y P  LKV R
Sbjct: 2645 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2677


>gi|12841710|dbj|BAB25323.1| unnamed protein product [Mus musculus]
          Length = 410

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTVELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374


>gi|395544822|ref|XP_003774305.1| PREDICTED: zinc finger protein ubi-d4 [Sarcophilus harrisii]
          Length = 423

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 297 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 356

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 357 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 403


>gi|297267436|ref|XP_002808108.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein ubi-d4-like
           [Macaca mulatta]
          Length = 391

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|145339751|ref|NP_191733.3| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
 gi|259016183|sp|Q9M364.2|ATX3_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX3; AltName:
           Full=Protein SET DOMAIN GROUP 14; AltName:
           Full=Trithorax-homolog protein 3; Short=TRX-homolog
           protein 3
 gi|225898735|dbj|BAH30498.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646729|gb|AEE80250.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
          Length = 1018

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C +L     YC +C +++  S+    VCCD C  WVH +CD I++E++ + + +   
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408

Query: 222 QYRCPTCR 229
            Y CP C+
Sbjct: 409 NYYCPDCK 416


>gi|110742931|dbj|BAE99361.1| trithorax 3 [Arabidopsis thaliana]
          Length = 1018

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C +L     YC +C +++  S+    VCCD C  WVH +CD I++E++ + + +   
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408

Query: 222 QYRCPTCR 229
            Y CP C+
Sbjct: 409 NYYCPDCK 416


>gi|431910278|gb|ELK13351.1| Zinc finger protein ubi-d4 [Pteropus alecto]
          Length = 391

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|194328734|ref|NP_060758.2| PHD finger protein 10 isoform a [Homo sapiens]
 gi|296439276|sp|Q8WUB8.3|PHF10_HUMAN RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
           factor 45a; Short=BAF45a; AltName: Full=XAP135
 gi|119567827|gb|EAW47442.1| PHD finger protein 10, isoform CRA_a [Homo sapiens]
          Length = 498

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 377 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 436

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 437 IICGQPHHEEEMMFCDMCDRGYHTFC 462


>gi|431904613|gb|ELK09995.1| PHD finger protein 10 [Pteropus alecto]
          Length = 451

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 330 NALCGICLKGKESNKKGKAESLIHCSQCDSSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 390 IICGQPHHEEEMMFCDVCDRGYHTFC 415


>gi|224132822|ref|XP_002321418.1| SET domain protein [Populus trichocarpa]
 gi|222868414|gb|EEF05545.1| SET domain protein [Populus trichocarpa]
          Length = 1070

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 158 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
           G   C  C RL    ++C +C KV+  S+S     CD C+ W+H +CD IS      F+ 
Sbjct: 427 GQFLCKKCARLTKSKHFCGICKKVWNHSDSGSWARCDGCKVWIHAECDRISSN---HFKD 483

Query: 218 DGNLQYRCPTCRGE 231
            G + Y CPTC+ +
Sbjct: 484 LGGIDYYCPTCKAK 497


>gi|224066781|ref|XP_002302212.1| predicted protein [Populus trichocarpa]
 gi|222843938|gb|EEE81485.1| predicted protein [Populus trichocarpa]
          Length = 604

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 55/124 (44%), Gaps = 8/124 (6%)

Query: 11  FESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSC-GKKYHRNCL 69
           FE     +A+ ++    +G  N    C++C  G   G     ++     C GK YH  CL
Sbjct: 410 FEEASNFSANNEVKLSSEGTGNV-CTCKIC--GSPVGNGEKIKICDHSECPGKYYHVRCL 466

Query: 70  KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
               Q     H   W CPSC +C +C    D +K + C  CD AYH YC  PP  +V  G
Sbjct: 467 TT-RQIDSCGH--RWYCPSC-LCRVCITDRDDDKIVLCDGCDHAYHLYCMIPPRISVPKG 522

Query: 130 PYLC 133
            + C
Sbjct: 523 KWFC 526


>gi|403165473|ref|XP_003325473.2| hypothetical protein PGTG_07306 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375165738|gb|EFP81054.2| hypothetical protein PGTG_07306 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 1108

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 2/96 (2%)

Query: 54  MLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK-FMFCRRCD 111
           M+SC  CG+  H +C++ N    +       W C  CR C  C + GD ++  + C  CD
Sbjct: 294 MVSCWECGQSGHFSCMELNNLTIKSHAKSYPWLCLECRRCHGCDKKGDDDQNMLLCAVCD 353

Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
             +H  C +PP + V SG + CP   +   C   +P
Sbjct: 354 RGWHGECLNPPLRTVPSGDFTCPFDHQSTQCIPPLP 389


>gi|194328736|ref|NP_579866.2| PHD finger protein 10 isoform b [Homo sapiens]
          Length = 496

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 375 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 434

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 435 IICGQPHHEEEMMFCDMCDRGYHTFC 460


>gi|255522851|ref|NP_077212.3| PHD finger protein 10 [Mus musculus]
          Length = 497

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 376 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 435

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 436 IICGQPHHEEEMMFCDVCDRGYHTFC 461


>gi|444724505|gb|ELW65108.1| Zinc finger protein ubi-d4 [Tupaia chinensis]
          Length = 412

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 286 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 345

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 346 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 392


>gi|443690042|gb|ELT92280.1| hypothetical protein CAPTEDRAFT_224752 [Capitella teleta]
          Length = 1892

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 1/99 (1%)

Query: 36  MCRLCFVGENEGCERA-RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   +++  +    +++SC  CG   H +CLK      +      W+C  C+ C +
Sbjct: 210 LCSFCLGADDKNRDGVPEQLISCADCGNCGHPSCLKFSDSLVERVGHMRWQCIECKKCSL 269

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C  TG  +  +FC  CD   H  C  PP  +   G ++C
Sbjct: 270 CGETGKEDNMLFCDACDRGIHMECCIPPLTSAPEGKWVC 308


>gi|387542916|gb|AFJ72085.1| PHD finger protein 10 isoform a [Macaca mulatta]
          Length = 498

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 377 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 436

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 437 IICGQPHHEEEMMFCDVCDRGYHTFC 462


>gi|67078518|ref|NP_001019918.1| PHD finger protein 10 [Rattus norvegicus]
 gi|66910931|gb|AAH98049.1| PHD finger protein 10 [Rattus norvegicus]
          Length = 410

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374


>gi|197098008|ref|NP_001127678.1| zinc finger protein ubi-d4 [Pongo abelii]
 gi|332250195|ref|XP_003274239.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Nomascus
           leucogenys]
 gi|397516926|ref|XP_003828672.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Pan paniscus]
 gi|402892869|ref|XP_003909629.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Papio anubis]
 gi|403293484|ref|XP_003937746.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Saimiri
           boliviensis boliviensis]
 gi|426369137|ref|XP_004051553.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Gorilla gorilla
           gorilla]
 gi|56403615|emb|CAI29608.1| hypothetical protein [Pongo abelii]
          Length = 405

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 279 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 338

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 339 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 385


>gi|30584805|gb|AAP36655.1| Homo sapiens requiem, apoptosis response zinc finger gene
           [synthetic construct]
 gi|61370771|gb|AAX43549.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
          Length = 392

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|149047117|gb|EDL99837.1| PHD finger protein 10 [Rattus norvegicus]
          Length = 449

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 328 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 387

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 388 IICGQPHHEEEMMFCDVCDRGYHTFC 413


>gi|440907399|gb|ELR57553.1| Zinc finger protein ubi-d4, partial [Bos grunniens mutus]
          Length = 380

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 254 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 313

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 314 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 360


>gi|336370765|gb|EGN99105.1| hypothetical protein SERLA73DRAFT_160636 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 1506

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 28/51 (54%)

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
           WKC  C+ CE+CR  GD  + +FC  CD  +H  C  PP +    G + CP
Sbjct: 9   WKCLECKNCEVCREKGDDERILFCDFCDRGWHMDCLQPPLQESPPGKWHCP 59


>gi|5454004|ref|NP_006259.1| zinc finger protein ubi-d4 [Homo sapiens]
 gi|350534556|ref|NP_001233580.1| zinc finger protein ubi-d4 [Pan troglodytes]
 gi|73983120|ref|XP_866588.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Canis lupus
           familiaris]
 gi|332250193|ref|XP_003274238.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Nomascus
           leucogenys]
 gi|397516924|ref|XP_003828671.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Pan paniscus]
 gi|402892867|ref|XP_003909628.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Papio anubis]
 gi|403293482|ref|XP_003937745.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|410974412|ref|XP_003993641.1| PREDICTED: zinc finger protein ubi-d4 [Felis catus]
 gi|426369135|ref|XP_004051552.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Gorilla gorilla
           gorilla]
 gi|2842711|sp|Q92785.2|REQU_HUMAN RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
           response zinc finger protein; AltName:
           Full=BRG1-associated factor 45D; Short=BAF45D; AltName:
           Full=D4, zinc and double PHD fingers family 2; AltName:
           Full=Protein requiem
 gi|2121234|gb|AAB58307.1| requiem homolog [Homo sapiens]
 gi|2529705|gb|AAB81203.1| requiem [Homo sapiens]
 gi|15928853|gb|AAH14889.1| D4, zinc and double PHD fingers family 2 [Homo sapiens]
 gi|28144169|gb|AAO26041.1| requiem, apoptosis response zinc finger gene [Homo sapiens]
 gi|30582275|gb|AAP35364.1| requiem, apoptosis response zinc finger gene [Homo sapiens]
 gi|61361059|gb|AAX41982.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
 gi|61361064|gb|AAX41983.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
 gi|119594781|gb|EAW74375.1| D4, zinc and double PHD fingers family 2, isoform CRA_a [Homo
           sapiens]
 gi|119594782|gb|EAW74376.1| D4, zinc and double PHD fingers family 2, isoform CRA_a [Homo
           sapiens]
 gi|123983164|gb|ABM83323.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
 gi|123997873|gb|ABM86538.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
 gi|158257320|dbj|BAF84633.1| unnamed protein product [Homo sapiens]
 gi|208967739|dbj|BAG72515.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
 gi|343962233|dbj|BAK62704.1| zinc-finger protein ubi-d4 [Pan troglodytes]
 gi|355566318|gb|EHH22697.1| Protein requiem [Macaca mulatta]
 gi|355751970|gb|EHH56090.1| Protein requiem [Macaca fascicularis]
 gi|380815318|gb|AFE79533.1| zinc finger protein ubi-d4 [Macaca mulatta]
 gi|410218232|gb|JAA06335.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
 gi|410249592|gb|JAA12763.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
 gi|410288496|gb|JAA22848.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
 gi|410336195|gb|JAA37044.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
          Length = 391

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|380792751|gb|AFE68251.1| PHD finger protein 10 isoform a, partial [Macaca mulatta]
          Length = 480

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 377 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 436

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 437 IICGQPHHEEEMMFCDVCDRGYHTFC 462


>gi|311247329|ref|XP_003122585.1| PREDICTED: zinc finger protein ubi-d4 [Sus scrofa]
          Length = 391

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|149725409|ref|XP_001492666.1| PREDICTED: zinc finger protein ubi-d4 [Equus caballus]
          Length = 391

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|26331782|dbj|BAC29621.1| unnamed protein product [Mus musculus]
          Length = 1010

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|432091134|gb|ELK24346.1| Zinc finger protein ubi-d4 [Myotis davidii]
          Length = 405

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 279 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 338

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 339 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 385


>gi|380792753|gb|AFE68252.1| PHD finger protein 10 isoform b, partial [Macaca mulatta]
          Length = 478

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 375 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 434

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 435 IICGQPHHEEEMMFCDVCDRGYHTFC 460


>gi|356573885|ref|XP_003555086.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
           max]
          Length = 1003

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 3/68 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C RL    +YC +C K++  S+S   V CD C+ WVH +CD IS      F+  G  
Sbjct: 364 CKTCARLTKSKHYCGICKKIWNYSDSGSWVRCDGCKVWVHAECDKISSN---LFKNLGGS 420

Query: 222 QYRCPTCR 229
            Y CPTC+
Sbjct: 421 DYFCPTCK 428


>gi|410960397|ref|XP_004001392.1| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 10 [Felis catus]
          Length = 440

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 319 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 378

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            +C +     + MFC  CD  YH +C
Sbjct: 379 IVCGQPHHEEEMMFCDVCDRGYHTFC 404


>gi|12805463|gb|AAH02206.1| PHD finger protein 10 [Mus musculus]
 gi|148688526|gb|EDL20473.1| PHD finger protein 10, isoform CRA_b [Mus musculus]
          Length = 408

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 287 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 346

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 347 IICGQPHHEEEMMFCDVCDRGYHTFC 372


>gi|395852334|ref|XP_003798694.1| PREDICTED: zinc finger protein ubi-d4 [Otolemur garnettii]
          Length = 391

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|356507582|ref|XP_003522543.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
           max]
          Length = 1035

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 45/96 (46%), Gaps = 6/96 (6%)

Query: 139 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
           C +CG ++P   L         G   C  C RL    +YC +C KV+  S+S   V CD 
Sbjct: 370 CEACGLSLPYKMLKKTKDSSPGGQFLCKTCARLTKSKHYCGICKKVWNHSDSGSWVRCDG 429

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
           C+ WVH +CD I    +   +      Y CPTC+ +
Sbjct: 430 CKVWVHAECDKICSNLFKNLE---GTDYYCPTCKAK 462


>gi|332825485|ref|XP_518861.3| PREDICTED: PHD finger protein 10 [Pan troglodytes]
          Length = 451

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 330 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 390 IICGQPHHEEEMMFCDMCDRGYHTFC 415


>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
 gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
          Length = 1983

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 14/101 (13%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
           E C++   ++ C +C K YH  CL    ++     WS   CP+C              E 
Sbjct: 382 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 438

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           CR   D  + + C  C +AYH +C  PP  ++  G + CP+
Sbjct: 439 CRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPR 479


>gi|148688525|gb|EDL20472.1| PHD finger protein 10, isoform CRA_a [Mus musculus]
          Length = 469

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 348 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 407

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 408 IICGQPHHEEEMMFCDVCDRGYHTFC 433


>gi|301121094|ref|XP_002908274.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262103305|gb|EEY61357.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 634

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 9/72 (12%)

Query: 168 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDG-------ISD--EKYLQFQVD 218
           L  +G YCPVC +VY D +    VCCD C+ WVH  CD        + D  E  +    +
Sbjct: 562 LRAQGQYCPVCNEVYEDDDQNTFVCCDSCELWVHGACDPSLTPYVVLLDMMESIIAAMAN 621

Query: 219 GNLQYRCPTCRG 230
              +Y CP C G
Sbjct: 622 TEDKYICPLCAG 633


>gi|119567828|gb|EAW47443.1| PHD finger protein 10, isoform CRA_b [Homo sapiens]
          Length = 449

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 328 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 387

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 388 IICGQPHHEEEMMFCDMCDRGYHTFC 413


>gi|332263993|ref|XP_003281033.1| PREDICTED: PHD finger protein 10 [Nomascus leucogenys]
          Length = 451

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 330 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 390 IICGQPHHEEEMMFCDMCDRGYHTFC 415


>gi|355569151|gb|EHH25368.1| hypothetical protein EGK_21333 [Macaca mulatta]
          Length = 410

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374


>gi|281351742|gb|EFB27326.1| hypothetical protein PANDA_014721 [Ailuropoda melanoleuca]
          Length = 397

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 276 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 335

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            +C +     + MFC  CD  YH +C
Sbjct: 336 IVCGQPHHEEEMMFCDVCDRGYHTFC 361


>gi|7023354|dbj|BAA91934.1| unnamed protein product [Homo sapiens]
 gi|48146663|emb|CAG33554.1| PHF10 [Homo sapiens]
 gi|82571445|gb|AAI10324.1| PHD finger protein 10 [Homo sapiens]
 gi|261858284|dbj|BAI45664.1| PHD finger protein 10 [synthetic construct]
          Length = 410

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDMCDRGYHTFC 374


>gi|358349267|ref|XP_003638660.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
 gi|355504595|gb|AES85798.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
          Length = 1149

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 44/98 (44%), Gaps = 14/98 (14%)

Query: 139 CHSCGSNVPGNGLS-------VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 191
           C SCG  +P   +           FL    C  C +L     YC +C K++  S+    V
Sbjct: 283 CASCGLMLPCKTMKKVKDSSHAPQFL----CKHCVKLRKSKQYCGICKKIWHHSDGGNWV 338

Query: 192 CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           CCD C  WVH +CD IS E +   +   N  Y CP C+
Sbjct: 339 CCDGCNVWVHAECDKISTEHFKDLE---NTDYYCPDCK 373


>gi|348561411|ref|XP_003466506.1| PREDICTED: PHD finger protein 10-like [Cavia porcellus]
          Length = 614

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 493 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMSVELVSMIKTYPWQCMECKTC 552

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 553 IICGQPHHEEEMMFCDVCDRGYHTFC 578


>gi|74219112|dbj|BAE26697.1| unnamed protein product [Mus musculus]
          Length = 391

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTRQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|344295884|ref|XP_003419640.1| PREDICTED: zinc finger protein ubi-d4 [Loxodonta africana]
          Length = 391

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|237839305|ref|XP_002368950.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
 gi|211966614|gb|EEB01810.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
          Length = 551

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 59/142 (41%), Gaps = 8/142 (5%)

Query: 5   EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARR----MLSCKSC 60
           E+ A R +  + A  ++ +   E+G+       R         C R  R    +L C  C
Sbjct: 297 EELAARGKVFFAACGAEPLRTGEEGEDADE---RFVLSCRQRSCRRDPRDFKDLLVCFRC 353

Query: 61  GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
            + +H +C  +   N +L     W C  C+ CE C+   +  + + C  CD AYH  C  
Sbjct: 354 RQSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRAYHMDCME 412

Query: 121 PPHKNVSSGPYLCPKHTKCHSC 142
           PP + V  G + C    +C  C
Sbjct: 413 PPVEEVPDGTWFCADCGRCACC 434


>gi|426355212|ref|XP_004045024.1| PREDICTED: PHD finger protein 10 [Gorilla gorilla gorilla]
          Length = 451

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 330 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELISMIKTYPWQCMECKTC 389

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 390 IICGQPHHEEEMMFCDMCDRGYHTFC 415


>gi|72015501|ref|XP_785947.1| PREDICTED: uncharacterized protein LOC580820 [Strongylocentrotus
           purpuratus]
          Length = 1065

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 45/105 (42%), Gaps = 9/105 (8%)

Query: 34  NVMCRLCFVGENEGCERA-RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
             +C LC        +     ++ C  C    H +CL+   +         W+C  C+ C
Sbjct: 845 TAICGLCLKDRRSNTKGVPENLVHCSQCDNSGHPSCLEMNDELVATIKTYPWQCMECKTC 904

Query: 93  EICRRTGDP---NKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
             C   GDP   +K MFC +CD  YH +C      ++ +G +LCP
Sbjct: 905 SQC---GDPTHEDKMMFCDKCDRGYHTFCVG--LTDIPTGNWLCP 944


>gi|18088065|gb|AAH20954.1| PHD finger protein 10 [Homo sapiens]
 gi|123981058|gb|ABM82358.1| PHD finger protein 10 [synthetic construct]
 gi|123995863|gb|ABM85533.1| PHD finger protein 10 [synthetic construct]
          Length = 408

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 287 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 346

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 347 IICGQPHHEEEMMFCDMCDRGYHTFC 372


>gi|148744463|gb|AAI42960.1| MYST3 protein [Homo sapiens]
          Length = 815

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|390462274|ref|XP_002747237.2| PREDICTED: PHD finger protein 10 [Callithrix jacchus]
          Length = 451

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 330 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 390 IICGQPHHEEEMMFCDVCDRGYHTFC 415


>gi|291240495|ref|XP_002740154.1| PREDICTED: PHD finger protein 10-like [Saccoglossus kowalevskii]
          Length = 459

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 58/122 (47%), Gaps = 15/122 (12%)

Query: 19  ASKDIAGEEQGQSNTNVMCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRD 77
           A K   GE + +   + +C +C  G++   ++ + +++ C  C    H  CL+    N  
Sbjct: 322 AKKTTNGEYKPKDIPDAVCGICLKGKDSSKKKFSEQLVHCSQCDNSGHPTCLQ---MNDS 378

Query: 78  LFH---WSSWKCPSCRICEICRRTGDP---NKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
           L H      W+C  C+ C +C   GDP   +K MFC  CD  +H +C     K++ +G +
Sbjct: 379 LVHVIKTYPWQCMECKTCTLC---GDPTHEDKMMFCDDCDRGHHTFCVG--LKSIPTGQW 433

Query: 132 LC 133
            C
Sbjct: 434 TC 435


>gi|26333367|dbj|BAC30401.1| unnamed protein product [Mus musculus]
          Length = 461

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|351701967|gb|EHB04886.1| Zinc finger protein ubi-d4 [Heterocephalus glaber]
          Length = 601

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 475 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 534

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 535 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 581


>gi|242014022|ref|XP_002427697.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212512132|gb|EEB14959.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 639

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 3/109 (2%)

Query: 36  MCRLC-FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C++C F  + +      R  SC  CGKK H  C+++  Q    F  S W+C  C+ C  
Sbjct: 390 LCKICSFNIDFKSSRNLDRWTSCHFCGKKAHVTCIQDTEQ-WTRFKLSKWQCRDCKNCST 448

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
           C+        + C  CD AYH  C +   K+ S+  + C K ++  S G
Sbjct: 449 CKNKFSDGDLIVCGLCDDAYHLTCAN-VKKSKSNQKWFCNKCSRVFSDG 496


>gi|328875267|gb|EGG23632.1| hypothetical protein DFA_05766 [Dictyostelium fasciculatum]
          Length = 1603

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 30/132 (22%), Positives = 56/132 (42%), Gaps = 15/132 (11%)

Query: 36  MCRLCFVGENEGCE----RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
           +C++C  G+             ++ C  CG+ +H  C+    +   +    +WKC  C+ 
Sbjct: 857 LCKVCLSGDVPSVVGKSFVPSTLICCVDCGEVFHTFCIGLPEEVASVIDRLTWKCADCKC 916

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP----YLCPKHTKCHSCG--SN 145
           C +C    + +  + C RCD  +H YC       + + P    ++CP  +K  S G   +
Sbjct: 917 CSVCMALDNEDLLLICDRCDLGFHTYC-----AGLDALPEEDDWVCPSCSKIQSNGDEQD 971

Query: 146 VPGNGLSVRWFL 157
           V    +  +W L
Sbjct: 972 VKVETVKEKWLL 983


>gi|426388550|ref|XP_004060697.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4
           [Gorilla gorilla gorilla]
          Length = 388

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 49/114 (42%), Gaps = 6/114 (5%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 267 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 324

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
            C+ C +C  + + ++ +FC   D  YH YC  PP      G +   LC +H K
Sbjct: 325 ECKSCSLCGTSENDDQLLFCDDSDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378


>gi|355749042|gb|EHH53525.1| hypothetical protein EGM_14185 [Macaca fascicularis]
          Length = 410

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374


>gi|13487236|gb|AAK27451.1|AF338735_1 hypothetical PHD zinc finger protein XAP135 [Homo sapiens]
          Length = 410

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDMCDRGYHTFC 374


>gi|26344145|dbj|BAC35729.1| unnamed protein product [Mus musculus]
          Length = 435

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|403305903|ref|XP_003943488.1| PREDICTED: PHD finger protein 10 [Saimiri boliviensis boliviensis]
          Length = 410

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374


>gi|159164819|pdb|2YSM|A Chain A, Solution Structure Of The First And Second Phd Domain From
           MyeloidLYMPHOID OR MIXED-Lineage Leukemia Protein 3
           Homolog
          Length = 111

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)

Query: 57  CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
           C +CG+ YH  CL            + W+CP C++C+ C+++G+ +K + C  CD  YH 
Sbjct: 25  CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 81

Query: 117 YCQHPPHKNVSSGPYLC 133
           +C  P  K+V +  + C
Sbjct: 82  FCLQPVMKSVPTNGWKC 98


>gi|347966735|ref|XP_001689318.2| AGAP001877-PA [Anopheles gambiae str. PEST]
 gi|333469922|gb|EDO63223.2| AGAP001877-PA [Anopheles gambiae str. PEST]
          Length = 2382

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 51/115 (44%), Gaps = 5/115 (4%)

Query: 36   MCRLCFVGENEG-CERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
            +C +C   EN+    +    + C  C +K H +C+   +          W+C  C++C  
Sbjct: 2153 LCAVCMGPENKNKYSKPELFVRCTRCRRKAHPSCIGMSSVMYKRVQQYKWQCSECKLCMK 2212

Query: 95   CRR--TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            C R      +K ++C +CD  YH  C+    +N+  G + C   T C  CG+  P
Sbjct: 2213 CNRQPAAIDSKMVYCDQCDRGYHLACKG--LRNLPEGRWHCNICTICGLCGAQTP 2265


>gi|291190717|ref|NP_001167047.1| Zinc finger protein ubi-d4 [Salmo salar]
 gi|223647844|gb|ACN10680.1| Zinc finger protein ubi-d4 [Salmo salar]
          Length = 402

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   ++  + SC  CG+  H +CL+             W+C
Sbjct: 274 GLAIPNNYCDFCLGDSALNQKTGQSEELQSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 333

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 334 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPAMAEPPEGSWSC 380


>gi|432875795|ref|XP_004072911.1| PREDICTED: histone acetyltransferase KAT6A-like [Oryzias latipes]
          Length = 1964

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    ++   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 236 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTARVKALWWQCIECKTCSS 295

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 296 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 335


>gi|84000081|ref|NP_001033141.1| PHD finger protein 10 [Bos taurus]
 gi|122136994|sp|Q2T9V9.1|PHF10_BOVIN RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
           factor 45a; Short=BAF45a
 gi|83405479|gb|AAI11244.1| PHD finger protein 10 [Bos taurus]
 gi|296483819|tpg|DAA25934.1| TPA: PHD finger protein 10 [Bos taurus]
          Length = 410

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 289 NALCGICLKGKESSRRGKAEPLVHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374


>gi|440893739|gb|ELR46406.1| PHD finger protein 10, partial [Bos grunniens mutus]
          Length = 469

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 348 NALCGICLKGKESSRRGKAEPLVHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 407

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 408 IICGQPHHEEEMMFCDVCDRGYHTFC 433


>gi|13938144|gb|AAH07188.1| Dpf2 protein, partial [Mus musculus]
          Length = 351

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 225 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 284

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 285 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 331


>gi|328785548|ref|XP_003250614.1| PREDICTED: hypothetical protein LOC100576266 [Apis mellifera]
          Length = 659

 Score = 55.8 bits (133), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 7/102 (6%)

Query: 23  IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
           I+ E Q  S T +    CR C +   E  E    +++C+ C  + H +C+ +  +     
Sbjct: 409 ISQESQDSSKTTIKDGPCRQCSLCAKEKQEN---LVACRDCTVRAHPSCIYS-PEEMIQK 464

Query: 80  HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
             S+W+C  C+ C IC  T D      C  CD AYH YC  P
Sbjct: 465 AGSNWQCERCKSCTICCETSDAGPLATCFTCDEAYHYYCHTP 506


>gi|380029159|ref|XP_003698249.1| PREDICTED: uncharacterized protein LOC100865213 [Apis florea]
          Length = 659

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 7/102 (6%)

Query: 23  IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
           I+ E Q  S T +    CR C +   E  E    +++C+ C  + H +C+ +  +     
Sbjct: 409 ISQESQDSSKTTIKDGPCRQCSLCAKEKQEN---LVACRDCTVRAHPSCIYS-PEEMIQK 464

Query: 80  HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
             S+W+C  C+ C IC  T D      C  CD AYH YC  P
Sbjct: 465 AGSNWQCERCKSCTICCETSDAGPLATCFTCDEAYHYYCHTP 506


>gi|354505054|ref|XP_003514587.1| PREDICTED: zinc finger protein ubi-d4-like [Cricetulus griseus]
 gi|344258641|gb|EGW14745.1| Zinc finger protein ubi-d4 [Cricetulus griseus]
          Length = 391

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|427797535|gb|JAA64219.1| Putative histone acetyltransferase myst family, partial
           [Rhipicephalus pulchellus]
          Length = 2019

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 40/95 (42%), Gaps = 7/95 (7%)

Query: 51  ARRMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
           A  +LSC SC    H +CLK N      L     W+C  CR+C  C    +    + C  
Sbjct: 248 AEELLSCHSCTLSAHPSCLKHNKELALVLLSSRKWQCSQCRMCSRCGNKKEGEHLLCCEV 307

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
           CD+ +H  C  PP      G +      KC SC S
Sbjct: 308 CDSHFHLRCLKPPLLKAPKGSW------KCTSCSS 336


>gi|187469691|gb|AAI66788.1| Dpf2 protein [Rattus norvegicus]
          Length = 390

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 264 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 323

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 324 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 370


>gi|6755314|ref|NP_035392.1| zinc finger protein ubi-d4 [Mus musculus]
 gi|2500148|sp|Q61103.1|REQU_MOUSE RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
           response zinc finger protein; AltName:
           Full=BRG1-associated factor 45D; Short=BAF45D; AltName:
           Full=D4, zinc and double PHD fingers family 2; AltName:
           Full=Protein requiem
 gi|1167972|gb|AAC52783.1| ubi-d4 [Mus musculus]
 gi|12836275|dbj|BAB23583.1| unnamed protein product [Mus musculus]
 gi|15215228|gb|AAH12709.1| D4, zinc and double PHD fingers family 2 [Mus musculus]
 gi|74184334|dbj|BAE25702.1| unnamed protein product [Mus musculus]
 gi|74201274|dbj|BAE26098.1| unnamed protein product [Mus musculus]
 gi|74201435|dbj|BAE26153.1| unnamed protein product [Mus musculus]
 gi|74206142|dbj|BAE23543.1| unnamed protein product [Mus musculus]
 gi|148701237|gb|EDL33184.1| D4, zinc and double PHD fingers family 2 [Mus musculus]
          Length = 391

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|195503632|ref|XP_002098733.1| GE10528 [Drosophila yakuba]
 gi|194184834|gb|EDW98445.1| GE10528 [Drosophila yakuba]
          Length = 1441

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/221 (25%), Positives = 83/221 (37%), Gaps = 75/221 (33%)

Query: 33   TNVMCRLCFVGENEGC-----------------ERAR-------RMLSCKS--CGKKYHR 66
            T+++C  C VGE EGC                 E A        ++L+C    CGK++H 
Sbjct: 856  TSLVCHECNVGEPEGCVICHQVESPAVPSTPMKEEAPSHIPIEDKLLTCSQPLCGKRFHT 915

Query: 67   NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRRCDAAYH-- 115
            +C K W Q     H  S +CP   +C  C  + DP         +K   C RC A YH  
Sbjct: 916  SCCKYWPQANSSKH--SARCPR-HVCHTC-VSDDPSGKFQQLGSSKLAKCVRCPATYHQD 971

Query: 116  CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYC 175
             +C     + +++   +CP+H        N+      V     Y C        VKG   
Sbjct: 972  SHCIPAGTQMLNATHIICPRH--------NIAKADAHVNVLWCYIC--------VKGGE- 1014

Query: 176  PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
                          +VCC+ C   VH  C  I   ++E Y+
Sbjct: 1015 --------------LVCCETCPIAVHAHCRNIPIKTNENYI 1041


>gi|327291384|ref|XP_003230401.1| PREDICTED: zinc finger protein ubi-d4-like, partial [Anolis
           carolinensis]
          Length = 178

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 52  GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 111

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C IC  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 112 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPPMSEPPEGSWSC 158


>gi|427797307|gb|JAA64105.1| Putative histone acetyltransferase myst family, partial
           [Rhipicephalus pulchellus]
          Length = 2011

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 40/95 (42%), Gaps = 7/95 (7%)

Query: 51  ARRMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
           A  +LSC SC    H +CLK N      L     W+C  CR+C  C    +    + C  
Sbjct: 248 AEELLSCHSCTLSAHPSCLKHNKELALVLLSSRKWQCSQCRMCSRCGNKKEGEHLLCCEV 307

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
           CD+ +H  C  PP      G +      KC SC S
Sbjct: 308 CDSHFHLRCLKPPLLKAPKGSW------KCTSCSS 336


>gi|291414252|ref|XP_002723376.1| PREDICTED: PHD finger protein 10 [Oryctolagus cuniculus]
          Length = 626

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 505 NAICGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMSTELVSMIKTYPWQCMECKTC 564

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            +C +     + MFC  CD  YH +C
Sbjct: 565 IVCGQPHHEEEMMFCDVCDRGYHTFC 590


>gi|157817959|ref|NP_001101986.1| zinc finger protein ubi-d4 [Rattus norvegicus]
 gi|149062118|gb|EDM12541.1| D4, zinc and double PHD fingers family 2 (predicted) [Rattus
           norvegicus]
          Length = 391

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|326915596|ref|XP_003204100.1| PREDICTED: PHD finger protein 10-like, partial [Meleagris
           gallopavo]
          Length = 361

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 240 NAICGICLKGKESNKKGKAEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTC 299

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 300 IICGQPHHEEEMMFCDVCDRGYHTFC 325


>gi|410922269|ref|XP_003974605.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase
           KAT6A-like [Takifugu rubripes]
          Length = 2234

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 43/100 (43%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    +R   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 229 ICSFCLGTKEQNRDKRPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 288

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+  G +    +FC  CD  +H  C  PP   +  G ++C
Sbjct: 289 CQDQGKNAENMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328


>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
           humanus corporis]
 gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
           humanus corporis]
          Length = 1999

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 43/101 (42%), Gaps = 14/101 (13%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
           E C++   ++ C +C + YH  CL    +      WS   CP C              E 
Sbjct: 355 EVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGKWS---CPHCEAEGTQEQDDDEHNEF 411

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 412 CRLCKDGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPR 452


>gi|301090958|ref|XP_002895674.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
            T30-4]
 gi|262097084|gb|EEY55136.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
            T30-4]
          Length = 2943

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 58/262 (22%), Positives = 86/262 (32%), Gaps = 88/262 (33%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNR----DLFHWSSWKCPSCRICEICRRTG---------- 99
             + C  CG+ +H  C+   +  R    D    + W+CP+C++CEIC + G          
Sbjct: 1593 FIFCVDCGEGFHSFCVSGMSAARLEDSDQLR-AYWRCPNCKMCEICGQPGAVCGAESSAR 1651

Query: 100  ---------DPNK-------------FMFCRRCDAAYHCYCQHPPHK---------NVSS 128
                      PN               + C  CD  +H  C  P  K           SS
Sbjct: 1652 VPATAGNVDSPNTDTETSLELAKTESLLLCGHCDRGFHGSCLVPAIKLPLNPKKRDGTSS 1711

Query: 129  GPYL-CPKHTKCHSCGSN--------VPGNGLSV-------------------------- 153
             P + C     C +C S+         P + L                            
Sbjct: 1712 SPVIYCASCVSCVNCKSSREYLDSDVAPNDHLDALERTYSYEQDKCLHCHNRKEREVQAL 1771

Query: 154  ---RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI--- 207
                  L     DA  R       CP+C + + D++   ++ CD C+RWVH  CD +   
Sbjct: 1772 RERTRLLTEVWMDAARRSKKDAEKCPLCRRKW-DADLEELMQCDACERWVHPPCDDLLKK 1830

Query: 208  SDEKYLQFQVDGNLQYRCPTCR 229
              ++Y     D N  Y C  CR
Sbjct: 1831 EPKRYQTLVSDPNAVYVCAACR 1852



 Score = 46.2 bits (108), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 33/138 (23%), Positives = 47/138 (34%), Gaps = 29/138 (21%)

Query: 97   RTGDPNK--FMFCRRCDAAYHCYCQHPPHKNVS--------------SGPYLCPKHTKCH 140
            R GD  +   + C +CD  +H  C  PPH  +S                P++C   T C 
Sbjct: 1063 RRGDAGEEELLACAQCDNQFHATCCDPPHAPLSLVSPDDGDVLVADLKTPFVCSDCTSCA 1122

Query: 141  SC----GSNVPGNGLSVRW------FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE---S 187
             C             S RW            C  C   +    +C VC  V  D +   S
Sbjct: 1123 GCRCRKSDEARAEEPSPRWSQWRLPLQTAALCTTCIPYYKANRFCGVCNLVLDDEQLATS 1182

Query: 188  TPMVCCDVCQRWVHCQCD 205
              ++ C  C  W+H  C+
Sbjct: 1183 VDLLTCATCHHWIHADCE 1200


>gi|168001639|ref|XP_001753522.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695401|gb|EDQ81745.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 428

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 48/99 (48%), Gaps = 22/99 (22%)

Query: 148 GNGLSVRWFL----------GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPM-----VC 192
           G G + RW +            T C+AC   F +G YCP C+++YR+ +         V 
Sbjct: 72  GKGGTCRWHVRNYGSQKDPKHVTLCNACKINFDQGKYCPFCVQIYREKDPDSFDGKEWVG 131

Query: 193 CD--VCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           CD   C+RWVH +C+ IS       QVD    Y CP+CR
Sbjct: 132 CDNRTCRRWVHVECE-ISGGN----QVDSTAFYLCPSCR 165


>gi|221507890|gb|EEE33477.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
           VEG]
          Length = 546

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 59/142 (41%), Gaps = 8/142 (5%)

Query: 5   EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARR----MLSCKSC 60
           E+ A R +  + A  ++ +   E+G+       R         C R  R    +L C  C
Sbjct: 292 EELAARGKVFFSACGAEPLRTGEEGEDADE---RFVLSCRQRSCRRDPRDFKDLLVCFRC 348

Query: 61  GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
            + +H +C  +   N +L     W C  C+ CE C+   +  + + C  CD AYH  C  
Sbjct: 349 RQSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRAYHMDCME 407

Query: 121 PPHKNVSSGPYLCPKHTKCHSC 142
           PP + V  G + C    +C  C
Sbjct: 408 PPVEEVPDGTWFCADCGRCACC 429


>gi|6648956|gb|AAF21306.1|AF108134_1 ubi-d4/requiem [Mus musculus]
          Length = 380

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 254 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 313

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 314 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 360


>gi|328716042|ref|XP_003245819.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
           [Acyrthosiphon pisum]
          Length = 2002

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 45/104 (43%), Gaps = 20/104 (19%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 91
           E C++   ++ C +C + YH  CL    ++     WS   CP C                
Sbjct: 378 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCESEGGQEQEEDEHQEF 434

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           C +C+  G+    + C  C AAYH +C  PP  +V  G + CP+
Sbjct: 435 CRVCKDGGE---LLCCDSCPAAYHTFCLSPPITDVPDGDWKCPR 475


>gi|47227720|emb|CAG09717.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 2476

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    +R   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 441 ICSFCLGTKEQNRDKRPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 500

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 501 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 540


>gi|348564966|ref|XP_003468275.1| PREDICTED: zinc finger protein ubi-d4-like [Cavia porcellus]
          Length = 391

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371


>gi|348522233|ref|XP_003448630.1| PREDICTED: histone acetyltransferase MYST3-like [Oreochromis
           niloticus]
          Length = 2258

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    ++   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 243 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTARVKALWWQCIECKTCSS 302

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 303 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 342


>gi|26389386|dbj|BAC25728.1| unnamed protein product [Mus musculus]
          Length = 803

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)

Query: 36  MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C   + +  E+    ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           CR  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307


>gi|344257401|gb|EGW13505.1| PHD finger protein 10 [Cricetulus griseus]
          Length = 329

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 208 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSIIKTYPWQCMECKTC 267

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 268 IICGQPHHEEEMMFCDVCDRGYHTFC 293


>gi|405951463|gb|EKC19373.1| Bromodomain adjacent to zinc finger domain protein 2B [Crassostrea
            gigas]
          Length = 2317

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/114 (25%), Positives = 43/114 (37%), Gaps = 34/114 (29%)

Query: 91   ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
            +C++CRR  +  + + C  CD  YH YC  P   N+  G +       C+ C S   G  
Sbjct: 2033 LCQLCRRDDNEAQLLLCDGCDQGYHTYCFKPKMDNIPDGDWY------CYECISKATGE- 2085

Query: 151  LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
                      CC  CG+   +                  +V CD+C R +H  C
Sbjct: 2086 ---------PCCVVCGKRMGR------------------IVECDLCPRAIHLDC 2112



 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 51/123 (41%), Gaps = 24/123 (19%)

Query: 30   QSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC--- 86
            +S   V+C+LC   +NE      ++L C  C + YH  C K    N     W  ++C   
Sbjct: 2027 KSIMKVLCQLCRRDDNEA-----QLLLCDGCDQGYHTYCFKPKMDNIPDGDWYCYECISK 2081

Query: 87   ----PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 142
                P C +C   +R G   + + C  C  A H  C +PP   +       P+   C +C
Sbjct: 2082 ATGEPCCVVC--GKRMG---RIVECDLCPRAIHLDCLNPPLPRM-------PRKWVCPAC 2129

Query: 143  GSN 145
             +N
Sbjct: 2130 TAN 2132


>gi|221483410|gb|EEE21729.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
           GT1]
          Length = 556

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/239 (23%), Positives = 89/239 (37%), Gaps = 37/239 (15%)

Query: 5   EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARR----MLSCKSC 60
           E+ A R +  + A  ++ +   E+G+       R         C R  R    +L C  C
Sbjct: 302 EELAARGKVFFAACGAEPLRTGEEGEDADE---RFVLSCRQRSCRRDPRDFKDLLVCFRC 358

Query: 61  GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
              +H +C  +   N +L     W C  C+ CE C+   +  + + C  CD AYH  C  
Sbjct: 359 RHSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRAYHMDCME 417

Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL--------SVRWF---------------- 156
           PP + V  G + C    +C  C   +    +        S+R                  
Sbjct: 418 PPVEEVPDGTWFCADCGRCACCDRRLSDEKILDPHSCVGSMRRLCFDCKERHRRGKRSRL 477

Query: 157 --LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTP---MVCCDVCQRWVHCQCDGISDE 210
             LG +  DA      + + C VC+K     E  P    V CD+C++ VH  C  +  E
Sbjct: 478 SRLGSSQGDAGTHSAKRTSLCDVCVKSLCACEGKPPKMRVACDLCKQVVHADCARLPQE 536


>gi|328872173|gb|EGG20540.1| hypothetical protein DFA_00401 [Dictyostelium fasciculatum]
          Length = 436

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 51/114 (44%), Gaps = 8/114 (7%)

Query: 142 CGSNVPGNGLSVRWFLG---YTCCDACGRLFVKGNYCPVCLKVYRDSE----STPMVCCD 194
           CGS  PG G + +W  G      C++CG   +K   C +C  VY   E    S   + CD
Sbjct: 278 CGSTTPGKGPTCKWRKGPNGEVLCNSCGLQNMKKPKCLLCGIVYNSKEAMASSISWIRCD 337

Query: 195 VCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWR 247
            C++WV  +CD G+ D           L Y CP CR +  + +   +  R L +
Sbjct: 338 DCKQWVMSKCDSGMGDISLYDDSNPNPLHYSCPKCRTDPSKPKTTRNNHRSLLK 391


>gi|170595283|ref|XP_001902318.1| Hypothetical C28H8.9 in chromosome III [Brugia malayi]
 gi|158590068|gb|EDP28834.1| Hypothetical C28H8.9 in chromosome III, putative [Brugia malayi]
          Length = 149

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 49/104 (47%), Gaps = 2/104 (1%)

Query: 32  NTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC 89
           + + +C LC     +N+   +  +++SC  CG+  H +CLK             W+C  C
Sbjct: 26  DVSTVCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIEC 85

Query: 90  RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           + C IC  + + ++ +FC  CD  +H YC  PP      G + C
Sbjct: 86  KSCAICGFSDNDDQLLFCDDCDRGFHLYCLRPPLPQAPEGEWSC 129


>gi|1083466|pir||A55302 probable transcription factor requiem - mouse
 gi|606661|gb|AAA64637.1| Requiem [Mus musculus]
          Length = 371

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 29  GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           G +  N  C  C      N+   +   ++SC  CG+  H +CL+             W+C
Sbjct: 245 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 304

Query: 87  PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
             C+ C +C  + + ++ +FC  CD  YH YC  P       G + C
Sbjct: 305 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 351


>gi|402591828|gb|EJW85757.1| Dpf2 protein [Wuchereria bancrofti]
          Length = 149

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 49/104 (47%), Gaps = 2/104 (1%)

Query: 32  NTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC 89
           + + +C LC     +N+   +  +++SC  CG+  H +CLK             W+C  C
Sbjct: 26  DVSTVCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIEC 85

Query: 90  RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           + C IC  + + ++ +FC  CD  +H YC  PP      G + C
Sbjct: 86  KSCAICGFSDNDDQLLFCDDCDRGFHLYCLRPPLPQAPEGEWSC 129


>gi|350396306|ref|XP_003484507.1| PREDICTED: hypothetical protein LOC100744391 [Bombus impatiens]
          Length = 658

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 7/102 (6%)

Query: 23  IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
           I  E Q  S T V    CR C +   E  E    +++C+ C  + H +C+ +  +     
Sbjct: 408 IPQESQDSSKTTVKDGPCRQCSLCAKEKQET---LVACRDCTVRAHPSCIYS-PEEMIQK 463

Query: 80  HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
             S+W+C  C+ C IC  T D      C  CD AYH YC  P
Sbjct: 464 AGSNWQCERCKSCTICCETSDAGPLATCFTCDDAYHYYCHTP 505


>gi|301779690|ref|XP_002925264.1| PREDICTED: PHD finger protein 10-like [Ailuropoda melanoleuca]
          Length = 636

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 515 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 574

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            +C +     + MFC  CD  YH +C
Sbjct: 575 IVCGQPHHEEEMMFCDVCDRGYHTFC 600


>gi|290992402|ref|XP_002678823.1| predicted protein [Naegleria gruberi]
 gi|284092437|gb|EFC46079.1| predicted protein [Naegleria gruberi]
          Length = 457

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 5/88 (5%)

Query: 28  QGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           +G+  T  +C++C        E+  + + C SC   +H  C     ++ +  H  +W C 
Sbjct: 305 RGRPRTRKLCKIC-----NSFEQETKFIQCLSCNSYFHTFCYTPNLEHLEQVHKDNWLCS 359

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYH 115
            C+IC  CR+  +    +FC  CD  +H
Sbjct: 360 DCKICLKCRKGPNEGTLVFCDYCDCGFH 387


>gi|290976199|ref|XP_002670828.1| predicted protein [Naegleria gruberi]
 gi|284084391|gb|EFC38084.1| predicted protein [Naegleria gruberi]
          Length = 349

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 42/74 (56%), Gaps = 10/74 (13%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSES----TPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
           C+ACG  + KG++C  C ++Y++S++     P + CD C RWVH  C     E+   F++
Sbjct: 36  CNACGLHYKKGHFCIYCNQIYKESDADDKEEPWIGCDSCHRWVHQNC-----ERQNGFEI 90

Query: 218 DGNLQYRCPTCRGE 231
             N  Y CP CR +
Sbjct: 91  KPN-GYLCPCCRNQ 103


>gi|340722214|ref|XP_003399503.1| PREDICTED: hypothetical protein LOC100648836 [Bombus terrestris]
          Length = 659

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 7/102 (6%)

Query: 23  IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
           I  E Q  S T V    CR C +   E  E    +++C+ C  + H +C+ +  +     
Sbjct: 409 IPQESQDSSKTTVKDGPCRQCSLCAKEKQET---LVACRDCTVRAHPSCIYS-PEEMVQK 464

Query: 80  HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
             S+W+C  C+ C IC  T D      C  CD AYH YC  P
Sbjct: 465 AGSNWQCERCKSCTICCETSDAGPLATCFTCDDAYHYYCHTP 506


>gi|169146772|emb|CAQ13473.1| novel protein similar to human D4, zinc and double PHD fingers
           family 1 (DPF1) [Danio rerio]
          Length = 127

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 6   GSVIPNGYCDFCLGGSKKTGCPE--DLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 63

Query: 88  SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            C+ C +C  + + ++ +FC  CD  YH YC  PP      G + C
Sbjct: 64  ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 109


>gi|118402055|ref|XP_001033347.1| PHD-finger family protein [Tetrahymena thermophila]
 gi|89287695|gb|EAR85684.1| PHD-finger family protein [Tetrahymena thermophila SB210]
          Length = 510

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 6/121 (4%)

Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 171
           + ++ YC  P  KN+SS P    + TK  S         L  +     T CD C + + +
Sbjct: 165 SCFNQYC--PKDKNISSVPSPQQESTKKQSVNGQYKQAKLE-KTGEKVTFCDLCCQRYQE 221

Query: 172 GNYCPVCLKVYRDS---ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
            ++C  C +VY D    +    + CD C RW H  C+    +K        N+QY CP C
Sbjct: 222 KHFCYYCQQVYFDDYNIDDKEWILCDTCDRWCHLHCEEEKIKKAFSSSQSENVQYDCPRC 281

Query: 229 R 229
           R
Sbjct: 282 R 282


>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
          Length = 1093

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 73/202 (36%), Gaps = 55/202 (27%)

Query: 25  GEEQGQSNTNVMCRLCFVGENEGC----ERARRMLSC--KSCGKKYHRNCLKNWAQNRDL 78
           G+   Q +    C  C  G    C    ER    + C   +CGK YH  CL  W Q+   
Sbjct: 529 GKSDEQEDETFKCIDCLSGVAPACFLCNEREGDRIRCIVPACGKHYHSKCLIPWPQS--- 585

Query: 79  FHWSSWK--CPSCRICEIC-------RRTGDPN-KFMFCRRCDAAYHC--YCQHPPHKNV 126
            HW   +  CP   +C  C        R+  PN K   C RC ++YH    C     + +
Sbjct: 586 -HWQGGRLTCPY-HVCHTCSSDNPQNNRSRAPNEKVAKCVRCPSSYHASALCLPAGSEIL 643

Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
           ++   +CPKH K        P   L+  W      C  C R                   
Sbjct: 644 TASQIICPKHYK-------APHPPLNAAW------CFLCTR------------------- 671

Query: 187 STPMVCCDVCQRWVHCQCDGIS 208
              ++CCD C    H +C GI+
Sbjct: 672 GGSLICCDTCPTSFHLECLGIN 693


>gi|118344196|ref|NP_001071920.1| zinc finger protein [Ciona intestinalis]
 gi|92081536|dbj|BAE93315.1| zinc finger protein [Ciona intestinalis]
          Length = 257

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 34/80 (42%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           ML CK C  K H +C+K  +          W+C  C+ C  C    D    +FC  CD A
Sbjct: 36  MLFCKDCDAKAHPSCMKYSSTLAAQALSYPWQCVECKTCSSCFTARDGASILFCDGCDKA 95

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           YH  C  P       G +LC
Sbjct: 96  YHMLCHEPEVITKPEGKWLC 115


>gi|390367174|ref|XP_003731194.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 2202

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/102 (25%), Positives = 46/102 (45%), Gaps = 15/102 (14%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR------------ICE 93
           E C++   ++ C +C K +H  CL    +      WS   CP+C               E
Sbjct: 352 EVCQQGGEIILCDTCPKAFHLVCLDPELETAPEGKWS---CPNCEGEGIPEPEPADEHME 408

Query: 94  ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
            CR   D  + + C +C ++YH +C +PP + +    ++CP+
Sbjct: 409 FCRVCHDGGELLCCEQCPSSYHIFCLNPPLRKIPDDDWVCPR 450


>gi|62896783|dbj|BAD96332.1| PHD finger protein 10 isoform a variant [Homo sapiens]
          Length = 410

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +         W+C  C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSTIKTYPWQCMECKTC 348

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 349 IICGQPHHEEEMMFCDMCDRGYHTFC 374


>gi|302144034|emb|CBI23139.3| unnamed protein product [Vitis vinifera]
          Length = 1018

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 10/120 (8%)

Query: 139 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
           C  CG  +P      + V    G   C  C RL     YC +C K+   S+S   V CD 
Sbjct: 352 CDGCGLRIPLKSTKKMKVLTPKGRFLCKTCDRLLKSKQYCGICKKMQNQSDSGTWVRCDG 411

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 255
           C+ WVH +C  IS +    F+  G   Y CP C+ +     + E +  E W+ K   +K+
Sbjct: 412 CKVWVHAECGKISSK---LFKNLGATDYYCPACKAK----FNFELSDSERWQPKVKCNKN 464


>gi|298708138|emb|CBJ30479.1| myst-related protein [Ectocarpus siliculosus]
          Length = 1620

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 15/101 (14%)

Query: 47  GC---ERARRMLSCKS--CGKKYHRNCLKNWAQNRDLFHWSSWKCPSC-----RI-CEIC 95
           GC   +R   +L C    CG +YH  CL           W  W CP C     R+ C +C
Sbjct: 560 GCMQNDRPTEILQCDGPMCGLEYHYGCLDPPLDKVPSSKW--WYCPDCVRTDNRVGCRVC 617

Query: 96  RRTGDPNKFMFC--RRCDAAYHCYCQHPPHKNVSSGPYLCP 134
           +   D +K + C    C+  +H YC  PP K V  G + CP
Sbjct: 618 KVDVDYDKLLKCDGPGCELEWHTYCLKPPVKTVPKGDFFCP 658


>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
 gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
          Length = 1982

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480


>gi|74183205|dbj|BAE22542.1| unnamed protein product [Mus musculus]
          Length = 331

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 2/106 (1%)

Query: 26  EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           ++Q +++   +C  C    E+   ++   +LSC  CG   H +CLK   +         W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265

Query: 85  KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
           +C  C+ C  CR  G + +  +FC  CD  +H  C  PP   +  G
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKG 311


>gi|320167424|gb|EFW44323.1| jumonji [Capsaspora owczarzaki ATCC 30864]
          Length = 2147

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 25/44 (56%)

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
           +CE+C R  D +K + C  CD  YH YC HPP   V  G + CP
Sbjct: 388 VCEVCLRPDDESKIILCDSCDHGYHVYCLHPPLPRVPDGDWYCP 431


>gi|302393562|ref|NP_001116784.3| histone acetyltransferase MYST3 [Danio rerio]
          Length = 2247

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    ++   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 230 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 289

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 290 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 329


>gi|190339720|gb|AAI63677.1| MYST histone acetyltransferase (monocytic leukemia) 3 [Danio rerio]
          Length = 2246

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    ++   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 229 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 288

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 289 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328


>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
          Length = 1982

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480


>gi|297821052|ref|XP_002878409.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324247|gb|EFH54668.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1002

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 3/68 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C +L     YC +C +++  S+    VCCD C  WVH  CD I++E++ + + +   
Sbjct: 355 CKHCSKLRKFNQYCGICKRIWHPSDDGDWVCCDGCNVWVHAGCDNITNERFKELEHN--- 411

Query: 222 QYRCPTCR 229
            Y CP C+
Sbjct: 412 NYYCPDCK 419


>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
 gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
 gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
           homolog; AltName: Full=ATP-dependent helicase Mi-2;
           Short=dMi-2
 gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
 gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
          Length = 1982

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480


>gi|293332508|ref|NP_001169841.1| uncharacterized protein LOC100383733 [Zea mays]
 gi|224031939|gb|ACN35045.1| unknown [Zea mays]
 gi|413941582|gb|AFW74231.1| hypothetical protein ZEAMMB73_231911 [Zea mays]
          Length = 555

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 9/104 (8%)

Query: 36  MCRLCFVGENEGCERARRMLSCKS--CGKK-YHRNCLKN-WAQNRDLFHWSSWKCPSCRI 91
           +C+ C   E+E     R+ + C    C  K YH  CLK     +    +   W CPSC +
Sbjct: 417 LCKNCGTCEDED----RKFMVCGHGLCSFKFYHVLCLKERQIASEKQKNLKCWYCPSC-L 471

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           C  C +  D  K + C  CD AYH YC  PP ++V  G + C +
Sbjct: 472 CRRCFKDKDDEKIVLCDGCDEAYHIYCMDPPCESVPRGKWFCTR 515


>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
 gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
          Length = 2013

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 376 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 475


>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
 gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
          Length = 1921

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 372 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 471


>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
 gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
          Length = 1983

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 382 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 481


>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
           [Crassostrea gigas]
          Length = 2123

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 49/121 (40%), Gaps = 16/121 (13%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR-----------ICEI 94
           E C++   ++ C +C + YH  C     +      WS   CP C              E 
Sbjct: 329 EVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWS---CPHCEGEGIKEQEEDDHMEF 385

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
           CR   D  + + C  C +AYH +C +PP K +  G + CP+   C      V    L+ R
Sbjct: 386 CRVCKDGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPR-CSCEPLKGRV-AKILTWR 443

Query: 155 W 155
           W
Sbjct: 444 W 444


>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
 gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
          Length = 1973

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 372 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 471


>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
 gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
          Length = 2012

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 376 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 475


>gi|359490859|ref|XP_002268525.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
           vinifera]
          Length = 1094

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 10/120 (8%)

Query: 139 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
           C  CG  +P      + V    G   C  C RL     YC +C K+   S+S   V CD 
Sbjct: 422 CDGCGLRIPLKSTKKMKVLTPKGRFLCKTCDRLLKSKQYCGICKKMQNQSDSGTWVRCDG 481

Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 255
           C+ WVH +C  IS +    F+  G   Y CP C+ +     + E +  E W+ K   +K+
Sbjct: 482 CKVWVHAECGKISSK---LFKNLGATDYYCPACKAK----FNFELSDSERWQPKVKCNKN 534


>gi|15292405|gb|AAK93471.1| LP06732p [Drosophila melanogaster]
 gi|220947368|gb|ACL86227.1| tou-PB [synthetic construct]
 gi|220956830|gb|ACL90958.1| tou-PB [synthetic construct]
          Length = 683

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)

Query: 37  CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
           C+ C  GENE      ++L C  C K YH  C K    N     W  ++C     + R C
Sbjct: 193 CQFCTSGENE-----DKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 247

Query: 93  EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            +C   R     K ++C  C  AYH  C  PP   V  G +       CH C S  P
Sbjct: 248 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 298



 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
           C+ C    + +K + C  CD  YH YC  P   N+  G + C           KC  CG 
Sbjct: 193 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 252

Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
           + P     + +      CD C R +    Y P  LKV R
Sbjct: 253 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 285


>gi|383850174|ref|XP_003700672.1| PREDICTED: uncharacterized protein LOC100875893 [Megachile
           rotundata]
          Length = 659

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 7/102 (6%)

Query: 23  IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
           ++ E Q  S + V    CR C +   E  E    +++C+ C  + H +C+ +  +     
Sbjct: 409 VSQEPQDSSKSAVKDGPCRQCSLCAKEKQET---LVACRDCTVRAHPSCIYS-PEEMIQK 464

Query: 80  HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
             SSW+C  C+ C IC  T D      C  CD AYH YC  P
Sbjct: 465 AGSSWQCERCKSCTICCETSDAGPLATCFTCDEAYHYYCHTP 506


>gi|348680969|gb|EGZ20785.1| hypothetical protein PHYSODRAFT_298767 [Phytophthora sojae]
          Length = 606

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 168 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPT 227
           L   G YCPVC +VY D + +  VCCD C+ WVH  CD  S   Y+  ++D        T
Sbjct: 539 LRALGQYCPVCSEVYEDDDQSTFVCCDSCELWVHGACDP-SLTPYVAVKIDRATTLISNT 597

Query: 228 C 228
           C
Sbjct: 598 C 598


>gi|294948371|ref|XP_002785717.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
           ATCC 50983]
 gi|239899765|gb|EER17513.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
           ATCC 50983]
          Length = 1340

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 48/103 (46%), Gaps = 18/103 (17%)

Query: 139 CHSC--GSN---VPGNGLS---VRWFL---GYTCC-DACGRLFVKGNYCPVCLKVYRDSE 186
           CH C  G N    P  G +   +R  L     TC  D C R      YCPVCL+ +    
Sbjct: 123 CHGCILGLNQRGCPSQGPASPCIRMLLTVDSLTCIKDLCDR------YCPVCLRAWSTVW 176

Query: 187 STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
              MV CD C+ WVH +CD  + ++      D +++Y CP CR
Sbjct: 177 CDDMVQCDGCEFWVHAKCDNFTCKEQFTELTDKDVKYFCPICR 219


>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
 gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
          Length = 2023

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 387 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 443

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 444 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 486


>gi|194907101|ref|XP_001981487.1| GG12082 [Drosophila erecta]
 gi|190656125|gb|EDV53357.1| GG12082 [Drosophila erecta]
          Length = 1441

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/239 (25%), Positives = 89/239 (37%), Gaps = 81/239 (33%)

Query: 15   YVATASKDIAGEEQGQSNTNVMCRLCFVGENEGC-----------------ERAR----- 52
            +V   +K +AG       T+++C  C VGE EGC                 E A      
Sbjct: 844  HVTADAKVVAG------GTSLVCHECNVGETEGCVICHQVESPAVPSTPMKEEAPSHSPI 897

Query: 53   --RMLSCKS--CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP------- 101
              ++L+C    CGK++H +C K W Q        S +CP   +C  C  + DP       
Sbjct: 898  EDKLLTCSQPLCGKRFHTSCCKYWPQANS--SKLSARCPR-HVCHTC-VSDDPSGKFQQL 953

Query: 102  --NKFMFCRRCDAAYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFL 157
              +K   C RC A YH   +C     + +++   +CP+H        N+      V    
Sbjct: 954  GSSKLAKCVRCPATYHQDSHCIPAGTQMLNATHIICPRH--------NIAKADAHVNVLW 1005

Query: 158  GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
             Y C        VKG                 +VCC+ C   VH  C  I   ++E Y+
Sbjct: 1006 CYIC--------VKGGE---------------LVCCETCPIAVHAHCRNIPIKTNENYI 1041


>gi|190337311|gb|AAI63678.1| Myst3 protein [Danio rerio]
          Length = 2247

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    ++   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 230 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 289

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 290 CQDQGKNADNMLFCDSCDRGFHMECCDPPLMRMPKGMWIC 329


>gi|345493038|ref|XP_003426985.1| PREDICTED: hypothetical protein LOC100678755 isoform 1 [Nasonia
           vitripennis]
 gi|345493040|ref|XP_003426986.1| PREDICTED: hypothetical protein LOC100678755 isoform 2 [Nasonia
           vitripennis]
          Length = 728

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 51/120 (42%), Gaps = 14/120 (11%)

Query: 24  AGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
           A  ++   + N  C LC   + E       + +C+ C  + H +C+    +  +  H +S
Sbjct: 486 AALKESPKDPNRHCSLCSKDKQEA------LTACRDCTVRAHPSCIYTPEEIMNKTH-TS 538

Query: 84  WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
           W+C  C+ C +C  T +    + C  CD A+H  C H P   VS   +       CH C 
Sbjct: 539 WQCERCKTCVVCYETSEAGPLVACYSCDDAFHYTC-HTPRIPVSKAKW------NCHECS 591


>gi|449496893|ref|XP_002189921.2| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 10 [Taeniopygia
           guttata]
          Length = 426

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +   ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 305 NAICGICLKGKESNKKGKPEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTC 364

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 365 IICGQPHHEEEMMFCDVCDRGYHTFC 390


>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
 gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
          Length = 1971

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 366 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 422

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 423 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 465


>gi|328871667|gb|EGG20037.1| hypothetical protein DFA_07153 [Dictyostelium fasciculatum]
          Length = 1433

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/113 (22%), Positives = 53/113 (46%), Gaps = 18/113 (15%)

Query: 37  CRLCFVGENEGCERA----------RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
           C +CF+  + G ++             +++C +C + +H++C+ +   + +  + S W C
Sbjct: 716 CVICFLSTSGGNKKFYHQKTKKNQNTTLVTCFACERSFHQDCITDQPNSNN--NNSEWYC 773

Query: 87  P-----SCRI-CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
                 +C++ C +C++    + F+ C +C   YH YC  P    V   P+ C
Sbjct: 774 SIDCSMTCQVRCNVCQKGDHEDSFVLCDKCSDGYHIYCLSPQLSEVPYDPWEC 826


>gi|294868766|ref|XP_002765684.1| bromodomain-containing protein, putative [Perkinsus marinus ATCC
           50983]
 gi|239865763|gb|EEQ98401.1| bromodomain-containing protein, putative [Perkinsus marinus ATCC
           50983]
          Length = 1071

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 32/56 (57%)

Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           YCPVCL+ +       MV CD C+ WVH +CD  + ++      D +++Y CP CR
Sbjct: 353 YCPVCLRAWSTVWCDDMVQCDGCEFWVHAKCDNFTCKEQFTELTDKDVKYFCPICR 408


>gi|229594235|ref|XP_001024908.3| PHD-finger family protein [Tetrahymena thermophila]
 gi|225566987|gb|EAS04663.3| PHD-finger family protein [Tetrahymena thermophila SB210]
          Length = 425

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 1/90 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           +L CK+C K +H  C  +    + +     W C  C++C  C +    N+ + C  CD  
Sbjct: 279 ILVCKNCNKSFHAECC-DPPLEKGIVSKYDWFCTECKLCIACNKNTKENELLMCDCCDRP 337

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
           +H  C  P   ++  G + C    KC  CG
Sbjct: 338 FHMSCLEPARTDIPEGRWFCKDCEKCPCCG 367


>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
 gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
          Length = 1992

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 368 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 424

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 425 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 467


>gi|413950797|gb|AFW83446.1| hypothetical protein ZEAMMB73_198866 [Zea mays]
          Length = 870

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 7/99 (7%)

Query: 133 CPKHTKCHSCGSNVPGNGLS-VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 191
           C +  +C SCG+  P    + + + +    C  C R+     YC +CLK  +       V
Sbjct: 372 CRRVLQCESCGNCFPNKDTNKMVYVMEQLACRLCARILALKKYCGICLKNLQHKYGGRRV 431

Query: 192 CCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 229
           CC  C+ WVH +CD   S+ K LQ     + +Y CP CR
Sbjct: 432 CCHGCESWVHAECDENCSNLKDLQ-----DKKYHCPYCR 465


>gi|323450933|gb|EGB06812.1| hypothetical protein AURANDRAFT_28864 [Aureococcus anophagefferens]
          Length = 266

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 2/97 (2%)

Query: 49  ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC--RRTGDPNKFMF 106
           +R   +L C  CG+  H  C      +      ++W+CP+C++CE+C   +  D ++ ++
Sbjct: 66  KRGGELLFCVDCGEACHAMCASTPIDSMSDAARATWRCPNCKVCELCGESKVDDESRLLY 125

Query: 107 CRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
           C  CD AYH  C  P       G ++C     C  CG
Sbjct: 126 CDLCDKAYHLDCVTPKLDVAPPGRWICGLCVTCRHCG 162


>gi|3540206|gb|AAC34356.1| Hypothetical protein [Arabidopsis thaliana]
          Length = 1250

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 36/72 (50%), Gaps = 4/72 (5%)

Query: 62   KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
            K YH  CL +    +   H   W C SC +C  C    D +K + C  CD AYH YC  P
Sbjct: 1107 KYYHIRCLTS---RQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 1162

Query: 122  PHKNVSSGPYLC 133
            P ++V +G + C
Sbjct: 1163 PCESVPNGEWFC 1174


>gi|449445828|ref|XP_004140674.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
           sativus]
 gi|449487413|ref|XP_004157614.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
           sativus]
          Length = 1055

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 3/68 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
           C  C +L     YC VC K++  S+    VCCD C  WVH +CD IS +    F+   + 
Sbjct: 415 CKHCHKLRQSKQYCGVCKKIWHHSDGGNWVCCDGCNVWVHAECDKISSK---LFKDLAHS 471

Query: 222 QYRCPTCR 229
           +Y CP C+
Sbjct: 472 EYYCPDCK 479


>gi|449278058|gb|EMC86025.1| PHD finger protein 10 [Columba livia]
          Length = 414

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +   ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 293 NAICGICLKGKESNKKGKPEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTC 352

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            IC +     + MFC  CD  YH +C
Sbjct: 353 IICGQPHHEEEMMFCDVCDRGYHTFC 378


>gi|297842501|ref|XP_002889132.1| hypothetical protein ARALYDRAFT_339887 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297334973|gb|EFH65391.1| hypothetical protein ARALYDRAFT_339887 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1160

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)

Query: 62   KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
            K YH  CL      +   H   W C SC +C  C    D +K + C  CD AYH YC  P
Sbjct: 1017 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 1072

Query: 122  PHKNVSSGPYLC 133
            P ++V +G + C
Sbjct: 1073 PCESVPNGEWFC 1084


>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
          Length = 2221

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 43/108 (39%), Gaps = 17/108 (15%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 89
            C+ C  G+NE      ++L C  C K YH  C K   +N        W C  C       
Sbjct: 1937 CQFCHSGDNED-----KLLLCDGCDKGYHTYCFKPKMEN---IPEGDWYCHECMNKATGE 1988

Query: 90   RICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
            R C +C  + +    + + C  C  AYH  C HP    V  G + C K
Sbjct: 1989 RNCIVCGKKSSTSGTRLILCELCPRAYHTDCIHPIMHKVPRGKWYCSK 2036



 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/113 (23%), Positives = 40/113 (35%), Gaps = 30/113 (26%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
            C+ C    + +K + C  CD  YH YC  P  +N+  G +       CH C +   G   
Sbjct: 1937 CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWY------CHECMNKATGE-- 1988

Query: 152  SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
                      C  CG+                 +  T ++ C++C R  H  C
Sbjct: 1989 --------RNCIVCGK--------------KSSTSGTRLILCELCPRAYHTDC 2019


>gi|195174305|ref|XP_002027919.1| GL27102 [Drosophila persimilis]
 gi|194115608|gb|EDW37651.1| GL27102 [Drosophila persimilis]
          Length = 2142

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 2/94 (2%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
             + C SC ++ H +C+    +        +W+C  C+ C  C+ +  P K ++C +CD  
Sbjct: 1837 FIRCYSCRQRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKCKSSQRPGKMLYCEQCDRG 1896

Query: 114  YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            YH YC     K V    + C +   C  CG+  P
Sbjct: 1897 YHIYCLG--LKTVPDERWSCERCCICMRCGAVKP 1928


>gi|189239425|ref|XP_001814901.1| PREDICTED: similar to Toutatis [Tribolium castaneum]
          Length = 2075

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 43/108 (39%), Gaps = 17/108 (15%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 89
            C+ C  G+NE      ++L C  C K YH  C K   +N        W C  C       
Sbjct: 1791 CQFCHSGDNED-----KLLLCDGCDKGYHTYCFKPKMEN---IPEGDWYCHECMNKATGE 1842

Query: 90   RICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
            R C +C  + +    + + C  C  AYH  C HP    V  G + C K
Sbjct: 1843 RNCIVCGKKSSTSGTRLILCELCPRAYHTDCIHPIMHKVPRGKWYCSK 1890



 Score = 48.1 bits (113), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 26/113 (23%), Positives = 40/113 (35%), Gaps = 30/113 (26%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
            C+ C    + +K + C  CD  YH YC  P  +N+  G +       CH C +   G   
Sbjct: 1791 CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWY------CHECMNKATGE-- 1842

Query: 152  SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
                      C  CG+                 +  T ++ C++C R  H  C
Sbjct: 1843 --------RNCIVCGK--------------KSSTSGTRLILCELCPRAYHTDC 1873


>gi|198471111|ref|XP_002133666.1| GA22685 [Drosophila pseudoobscura pseudoobscura]
 gi|198145773|gb|EDY72293.1| GA22685 [Drosophila pseudoobscura pseudoobscura]
          Length = 2132

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 2/94 (2%)

Query: 54   MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
             + C SC ++ H +C+    +        +W+C  C+ C  C+ +  P K ++C +CD  
Sbjct: 1827 FIRCYSCRQRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKCKSSQRPGKMLYCEQCDRG 1886

Query: 114  YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
            YH YC     K V    + C +   C  CG+  P
Sbjct: 1887 YHIYCLG--LKTVPDERWSCERCCICMRCGAVKP 1918


>gi|17569817|ref|NP_510140.1| Protein CHD-3 [Caenorhabditis elegans]
 gi|6165993|sp|Q22516.2|CHD3_CAEEL RecName: Full=Chromodomain-helicase-DNA-binding protein 3 homolog;
           Short=CHD-3
 gi|3879819|emb|CAA91810.1| Protein CHD-3 [Caenorhabditis elegans]
 gi|11095331|gb|AAG29837.1| CHD-3 [Caenorhabditis elegans]
          Length = 1787

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/109 (24%), Positives = 43/109 (39%), Gaps = 25/109 (22%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------- 90
           E C +   ++ C +C + YH  C+    +N +      W CP C                
Sbjct: 269 EVCNQDGELMLCDTCTRAYHVACID---ENMEQPPEGDWSCPHCEEHGPDVLIVEEEPAK 325

Query: 91  ----ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
                C IC+ T +    + C  C ++YH YC  PP   +  G + CP+
Sbjct: 326 ANMDYCRICKETSN---ILLCDTCPSSYHAYCIDPPLTEIPEGEWSCPR 371


>gi|255558536|ref|XP_002520293.1| DNA binding protein, putative [Ricinus communis]
 gi|223540512|gb|EEF42079.1| DNA binding protein, putative [Ricinus communis]
          Length = 510

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           K YH  CL N   N    +   W CPSC +C  C    D ++ + C  CD AYH YC  P
Sbjct: 378 KYYHVRCLTN---NLLKSYGPRWYCPSC-LCRTCFVDRDDDQIVLCDGCDHAYHMYCMSP 433

Query: 122 PHKNVSSGPYLC 133
           P  ++  G + C
Sbjct: 434 PRTSIPRGKWFC 445


>gi|209882276|ref|XP_002142575.1| PHD-finger domain-containing protein [Cryptosporidium muris RN66]
 gi|209558181|gb|EEA08226.1| PHD-finger domain-containing protein [Cryptosporidium muris RN66]
          Length = 305

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 44/96 (45%), Gaps = 1/96 (1%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
            + C  C K YH  C     +   + + S W C  C  C +CR++G  +  + C  C+ A
Sbjct: 132 FIECSICKKSYHLTCCDPIIEKVSI-NNSKWICSDCNGCIVCRKSGREDYQVLCDVCNRA 190

Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
           +H YC +P   +V  G ++C     C  C  N+  N
Sbjct: 191 FHIYCLYPTLDSVPQGIWICDDCYVCAFCQGNIKYN 226


>gi|47118082|gb|AAT11171.1| monocytic leukemia zinc finger protein [Danio rerio]
          Length = 2246

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)

Query: 36  MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C  C    E    ++   ++SC  CG   H +CLK   +         W+C  C+ C  
Sbjct: 229 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKSCSS 288

Query: 95  CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C+  G + +  +FC  CD  +H  C  PP   +  G ++C
Sbjct: 289 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328


>gi|313233623|emb|CBY09794.1| unnamed protein product [Oikopleura dioica]
          Length = 326

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 43/98 (43%), Gaps = 1/98 (1%)

Query: 37  CRLCFVGENEGCERARRMLSCKS-CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
           C LC  G +   E    ++ C   C +  H  C+   A         +W+C  C+ C  C
Sbjct: 225 CDLCSNGPDTSSEDMSMLVKCSGPCKRLTHPYCVNLPANIVKNVSTYAWECQDCKHCSKC 284

Query: 96  RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
               + +K +FC  CD   H YC +PP KN  SG + C
Sbjct: 285 GLDENDDKLLFCDDCDRGVHLYCLNPPLKNAPSGRWTC 322


>gi|303286287|ref|XP_003062433.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455950|gb|EEH53252.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 450

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 37/69 (53%), Gaps = 3/69 (4%)

Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV--CQRWVHCQCDGISDEKYLQFQVDG 219
           C  C +L  +G +CP C KV++ +    MV CD   C+ WVH  CD  + E  +    + 
Sbjct: 85  CALCAKLHKEGQFCPACDKVWQWANCPAMVGCDAPGCEFWVHASCDARAKE-VMDAPENE 143

Query: 220 NLQYRCPTC 228
           +++Y CP C
Sbjct: 144 DIEYHCPRC 152


>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
          Length = 1084

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
           E C++   ++ C +C + YH  CL+      D      W CP C                
Sbjct: 382 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           E CR   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 481


>gi|156392562|ref|XP_001636117.1| predicted protein [Nematostella vectensis]
 gi|156223217|gb|EDO44054.1| predicted protein [Nematostella vectensis]
          Length = 229

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 26/43 (60%)

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
           C++CRR GD  K + C  CD  +H YC  PP K++  G + CP
Sbjct: 1   CKLCRRKGDAEKMLLCDACDRGHHMYCLKPPIKHIPEGNWFCP 43


>gi|363731649|ref|XP_003641005.1| PREDICTED: PHD finger protein 10-like [Gallus gallus]
          Length = 534

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 34  NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
           N +C +C  G E+    +A  ++ C  C    H +CL    +   +     W+C  C+ C
Sbjct: 413 NAICGICLKGKESNKKGKAEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTC 472

Query: 93  EICRRTGDPNKFMFCRRCDAAYHCYC 118
            +C +     + MFC  CD  YH +C
Sbjct: 473 IMCGQPHHEEEMMFCDVCDRGYHTFC 498


>gi|195432472|ref|XP_002064247.1| GK19801 [Drosophila willistoni]
 gi|194160332|gb|EDW75233.1| GK19801 [Drosophila willistoni]
          Length = 478

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 48/121 (39%), Gaps = 6/121 (4%)

Query: 26  EEQGQSNTNVMCRLCFVGE-NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           E+Q Q      C  C   E   G       ++CKSC +K H  CL    +N  +     +
Sbjct: 131 EDQQQQRNMDTCVKCSKSEPKRGSGHKSNFITCKSCMQKCHFACLPLNFENLTMAR-KKY 189

Query: 85  KCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
           KC  CR C  C   G     + C  C   YH  C +PP     +   L  +  KCH C +
Sbjct: 190 KCEKCRYCSYCNSKGKEILIILCSSCVDGYHFECHNPPL----NASILDDREWKCHKCDT 245

Query: 145 N 145
           N
Sbjct: 246 N 246


>gi|339246939|ref|XP_003375103.1| zinc finger protein ubi-d4 A [Trichinella spiralis]
 gi|316971607|gb|EFV55360.1| zinc finger protein ubi-d4 A [Trichinella spiralis]
          Length = 694

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 6/100 (6%)

Query: 26  EEQGQSNTNVMCRLC----FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW 81
           + QG    +  C LC    F  +  GC  A ++++C  CG+  H  CL+  +        
Sbjct: 139 QRQGMVELSSTCDLCQGDIFENKRTGC--AEQLINCSDCGRAGHPYCLQFSSNMIISTKK 196

Query: 82  SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
             W+C  C+ C IC  +    + +FC  CD  +H YC  P
Sbjct: 197 YGWQCIECKSCAICGTSEHDEQLLFCDDCDRGFHMYCLTP 236


>gi|357135761|ref|XP_003569477.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like
           [Brachypodium distachyon]
          Length = 1037

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 45/94 (47%), Gaps = 7/94 (7%)

Query: 138 KCHSCGSNVPG-NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVC 196
           +C SCG+  P  +   + + +    C  C R+     YC +CLK ++       VCC  C
Sbjct: 381 QCESCGNCFPNKDSNKMVYVMEQLACKHCARILRSKEYCGICLKSWQHKCGRRWVCCHGC 440

Query: 197 QRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 229
           + W+H +CD   SD K LQ     +  Y CP CR
Sbjct: 441 ESWIHAECDKKCSDLKDLQ-----DKSYFCPYCR 469


>gi|300508320|pdb|2KWJ|A Chain A, Solution Structures Of The Double Phd Fingers Of Human
           Transcriptional Protein Dpf3 Bound To A Histone Peptide
           Containing Acetylation At Lysine 14
 gi|300508322|pdb|2KWK|A Chain A, Solution Structures Of The Double Phd Fingers Of Human
           Transcriptional Protein Dpf3b Bound To A H3 Peptide Wild
           Type
 gi|300508324|pdb|2KWN|A Chain A, Solution Structure Of The Double Phd (Plant Homeodomain)
           Fingers Of Human Transcriptional Protein Dpf3b Bound To
           A Histone H4 Peptide Containing Acetylation At Lysine 16
 gi|300508326|pdb|2KWO|A Chain A, Solution Structure Of The Double Phd (Plant Homeodomain)
           Fingers Of Human Transcriptional Protein Dpf3b Bound To
           A Histone H4 Peptide Containing N-Terminal Acetylation
           At Serine 1
          Length = 114

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 40/84 (47%)

Query: 50  RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
           R   ++SC  CG+  H  CL+      +      W+C  C+ C +C  + + ++ +FC  
Sbjct: 19  RPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCILCGTSENDDQLLFCDD 78

Query: 110 CDAAYHCYCQHPPHKNVSSGPYLC 133
           CD  YH YC +PP      G + C
Sbjct: 79  CDRGYHMYCLNPPVAEPPEGSWSC 102


>gi|42563280|ref|NP_177849.2| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
 gi|95147302|gb|ABF57286.1| At1g77250 [Arabidopsis thaliana]
 gi|332197833|gb|AEE35954.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
          Length = 522

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           K YH  CL      +   H   W C SC +C  C    D +K + C  CD AYH YC  P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434

Query: 122 PHKNVSSGPYLC 133
           P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446


>gi|51969394|dbj|BAD43389.1| unnamed protein product [Arabidopsis thaliana]
 gi|51969560|dbj|BAD43472.1| unnamed protein product [Arabidopsis thaliana]
 gi|51969870|dbj|BAD43627.1| unnamed protein product [Arabidopsis thaliana]
          Length = 522

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           K YH  CL      +   H   W C SC +C  C    D +K + C  CD AYH YC  P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434

Query: 122 PHKNVSSGPYLC 133
           P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446


>gi|198437529|ref|XP_002126456.1| PREDICTED: similar to Bromodomain adjacent to zinc finger domain
            protein 1A (ATP-utilizing chromatin assembly and
            remodeling factor 1) (hACF1) (ATP-dependent
            chromatin-remodeling protein) (Williams syndrome
            transcription factor-related chromatin-remodeling fa...
            [Ciona intestinalis]
          Length = 1458

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 24/43 (55%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
            C ICRR GD  K + C  CD  +H YC  P  K V SG + CP
Sbjct: 1178 CRICRRKGDGEKMLLCDNCDRGHHMYCLRPALKIVPSGDWFCP 1220


>gi|390478964|ref|XP_002762152.2| PREDICTED: zinc finger protein neuro-d4 [Callithrix jacchus]
          Length = 364

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 50/124 (40%), Gaps = 16/124 (12%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLK---NWAQNRDLFHWSSW 84
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+   N       + W   
Sbjct: 233 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 290

Query: 85  KCPSCRICEICRRTG-------DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
           +C SC +C      G         ++ +FC  CD  YH YC  PP      G +   LC 
Sbjct: 291 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 350

Query: 135 KHTK 138
           +H K
Sbjct: 351 RHLK 354


>gi|51969444|dbj|BAD43414.1| unnamed protein product [Arabidopsis thaliana]
          Length = 522

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)

Query: 62  KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
           K YH  CL      +   H   W C SC +C  C    D +K + C  CD AYH YC  P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434

Query: 122 PHKNVSSGPYLC 133
           P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446


>gi|45382851|ref|NP_989971.1| zinc finger protein neuro-d4 [Gallus gallus]
 gi|18202298|sp|P58267.1|DPF1_CHICK RecName: Full=Zinc finger protein neuro-d4; AltName: Full=D4, zinc
           and double PHD fingers family 1
 gi|14010358|gb|AAK51966.1|AF362752_1 neuro-d4 [Gallus gallus]
          Length = 380

 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 36/80 (45%)

Query: 54  MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
           M++C  CG+  H +CL+             W+C  C+ C +C    +  + +FC  CD  
Sbjct: 283 MIACADCGRAGHPSCLQFTLAMAAAARSYRWQCIECKNCSLCGSAENDEQLLFCDDCDRG 342

Query: 114 YHCYCQHPPHKNVSSGPYLC 133
           YH YC  PP      G + C
Sbjct: 343 YHMYCISPPVAEPPEGTWSC 362


>gi|442621474|ref|NP_001263029.1| Mes-4, isoform B [Drosophila melanogaster]
 gi|440217972|gb|AGB96409.1| Mes-4, isoform B [Drosophila melanogaster]
          Length = 1423

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 55/221 (24%), Positives = 81/221 (36%), Gaps = 75/221 (33%)

Query: 33   TNVMCRLCFVGENEGCERAR------------------------RMLSCKS--CGKKYHR 66
            T+++C  C VGE EGC                            ++L+C    CGK++H 
Sbjct: 848  TSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFHT 907

Query: 67   NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRRCDAAYHCY 117
            +C K W Q     H  S +CP   +C  C  + DP         +K   C RC A YH  
Sbjct: 908  SCCKYWPQASSSKH--SARCPR-HVCHTC-VSDDPSGKFQQLGSSKLAKCVRCPATYHQL 963

Query: 118  CQHPPH--KNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYC 175
             +  P   + +++   +CP+H        N+      V     Y C        VKG   
Sbjct: 964  SKCIPAGTQMLNTTNIICPRH--------NIAKADAHVNVLWCYIC--------VKGGE- 1006

Query: 176  PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
                          +VCC+ C   VH  C  I   ++E Y+
Sbjct: 1007 --------------LVCCETCPIAVHAHCRNIPIKTNESYI 1033


>gi|348508478|ref|XP_003441781.1| PREDICTED: tyrosine-protein kinase BAZ1B-like [Oreochromis niloticus]
          Length = 1521

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 6/63 (9%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
            C++CRR GD  K + C  C+ A+H +C  P    V +G +LCP      +C   V   G 
Sbjct: 1202 CKVCRRKGDDEKLILCDECNKAFHLFCLRPALYRVPNGEWLCP------ACQPTVARRGS 1255

Query: 152  SVR 154
             VR
Sbjct: 1256 RVR 1258


>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
          Length = 1969

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 20/104 (19%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 91
           E C++   ++ C +C + YH  CL    ++     WS   CP C                
Sbjct: 380 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCENEGPAEQDDDEHQEF 436

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           C IC+  G+    + C  C +AYH +C +PP   +  G + CP+
Sbjct: 437 CRICKDGGE---LLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPR 477


>gi|24650756|ref|NP_733239.1| Mes-4, isoform A [Drosophila melanogaster]
 gi|29427833|sp|Q8MT36.2|MES4_DROME RecName: Full=Probable histone-lysine N-methyltransferase Mes-4;
            AltName: Full=Maternal-effect sterile 4 homolog
 gi|23172478|gb|AAF56762.2| Mes-4, isoform A [Drosophila melanogaster]
 gi|94400569|gb|ABF17912.1| FI01019p [Drosophila melanogaster]
          Length = 1427

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 55/221 (24%), Positives = 81/221 (36%), Gaps = 75/221 (33%)

Query: 33   TNVMCRLCFVGENEGCERAR------------------------RMLSCKS--CGKKYHR 66
            T+++C  C VGE EGC                            ++L+C    CGK++H 
Sbjct: 852  TSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFHT 911

Query: 67   NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRRCDAAYHCY 117
            +C K W Q     H  S +CP   +C  C  + DP         +K   C RC A YH  
Sbjct: 912  SCCKYWPQASSSKH--SARCPR-HVCHTC-VSDDPSGKFQQLGSSKLAKCVRCPATYHQL 967

Query: 118  CQHPPH--KNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYC 175
             +  P   + +++   +CP+H        N+      V     Y C        VKG   
Sbjct: 968  SKCIPAGTQMLNTTNIICPRH--------NIAKADAHVNVLWCYIC--------VKGGE- 1010

Query: 176  PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
                          +VCC+ C   VH  C  I   ++E Y+
Sbjct: 1011 --------------LVCCETCPIAVHAHCRNIPIKTNESYI 1037


>gi|196014713|ref|XP_002117215.1| hypothetical protein TRIADDRAFT_61269 [Trichoplax adhaerens]
 gi|190580180|gb|EDV20265.1| hypothetical protein TRIADDRAFT_61269 [Trichoplax adhaerens]
          Length = 1478

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 24/43 (55%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
            C ICRR GD    + C  CD  +H YC  PP  ++ +G + CP
Sbjct: 1141 CRICRRKGDAELMLLCDECDRGHHTYCLRPPLNSIPAGNWYCP 1183


>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
           [Tribolium castaneum]
          Length = 1966

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 20/104 (19%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 91
           E C++   ++ C +C + YH  CL    ++     WS   CP C                
Sbjct: 377 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCENEGPAEQDDDEHQEF 433

Query: 92  CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           C IC+  G+    + C  C +AYH +C +PP   +  G + CP+
Sbjct: 434 CRICKDGGE---LLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPR 474


>gi|218188776|gb|EEC71203.1| hypothetical protein OsI_03117 [Oryza sativa Indica Group]
          Length = 1012

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 7/111 (6%)

Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-WFLGYTCCDACGRLFVKGNYCPVCL 179
           P   N +  P    +  +C SCG+  P    S+  + +    C  C ++     YC VCL
Sbjct: 369 PMDGNTTGQPARYKRALQCESCGNCFPNKDPSMMVYVMEQLACRQCAKILRSKEYCGVCL 428

Query: 180 KVYRDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 229
           K ++       VCC  C+ WVH +CD   S+ K L+     +  Y CP CR
Sbjct: 429 KSWQHKCGGRWVCCHGCESWVHAECDKKCSNLKDLR-----DNSYFCPYCR 474


>gi|345481883|ref|XP_001605650.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
           [Nasonia vitripennis]
          Length = 2009

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 13/100 (13%)

Query: 46  EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR----------ICEIC 95
           E C++   ++ C +C + YH  CL+   +      WS   CP C             E C
Sbjct: 373 EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS---CPHCENDGALEDDDEHMEFC 429

Query: 96  RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
           R   D  + + C  C +AYH +C +PP   +  G + CP+
Sbjct: 430 RVCKDGGELLCCDSCTSAYHTHCLNPPLTEIPDGDWKCPR 469


>gi|297596335|ref|NP_001042415.2| Os01g0218900 [Oryza sativa Japonica Group]
 gi|56784089|dbj|BAD81418.1| unknown protein [Oryza sativa Japonica Group]
 gi|222617993|gb|EEE54125.1| hypothetical protein OsJ_00898 [Oryza sativa Japonica Group]
 gi|255673003|dbj|BAF04329.2| Os01g0218900 [Oryza sativa Japonica Group]
          Length = 405

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 5/96 (5%)

Query: 135 KHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCC 193
           K   C  CG+ +P    S +   G    C  C +L     YC +C K++  ++    VCC
Sbjct: 306 KSPGCDICGNRLPCKIASKKKQAGERLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCC 365

Query: 194 DVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           D CQ WVH +C    D+  ++ +   N  Y CP C+
Sbjct: 366 DECQIWVHVEC----DQTCIKMEDLENADYFCPDCK 397


>gi|256070387|ref|XP_002571524.1| zinc finger protein [Schistosoma mansoni]
          Length = 1690

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 25/43 (58%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
            C ICRR  D +  + C  C+ A+H YC  PP K V +G + CP
Sbjct: 1253 CRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCP 1295


>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
          Length = 1279

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 65/158 (41%), Gaps = 12/158 (7%)

Query: 39  LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP--SCRICEICR 96
           LCF  E  G     +  S ++CGKKYHR C+ N    R     +S+KCP   C  C   +
Sbjct: 726 LCFACEQPGGLEGLQTCSVRNCGKKYHRACISN--NPRAALKDNSFKCPLHKCANCTYPQ 783

Query: 97  RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKC--HSCGSNVPGNGLSVR 154
            +  P   + C RC  AYH  C      + ++   LCPKH     H+  +     G   R
Sbjct: 784 ASTYP--LVRCIRCPIAYHTCCVPAGCLHENAIYLLCPKHQPVEKHAKSNICLACGDGGR 841

Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
            F    CCD C   + +     V       SE +P  C
Sbjct: 842 LF----CCDTCPAAYHQECLKDVLALTGTPSEDSPWYC 875


>gi|301780978|ref|XP_002925893.1| PREDICTED: zinc finger protein neuro-d4-like isoform 2 [Ailuropoda
           melanoleuca]
 gi|410983104|ref|XP_003997883.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Felis catus]
          Length = 380

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 16/124 (12%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 249 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 306

Query: 88  SCRICEICRRTGD----------PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
            C+ C +C  + +           ++ +FC  CD  YH YC  PP      G +   LC 
Sbjct: 307 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 366

Query: 135 KHTK 138
           +H K
Sbjct: 367 RHLK 370


>gi|222618974|gb|EEE55106.1| hypothetical protein OsJ_02868 [Oryza sativa Japonica Group]
          Length = 1032

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 7/115 (6%)

Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-WFLGYTCCDACGRLFVKGNYC 175
           + + P   N +  P    +  +C SCG+  P    S+  + +    C  C ++     YC
Sbjct: 355 FFEVPMDGNTTGQPARYKRALQCESCGNCFPNKDPSMMVYVMEQLACRQCAKILRSKEYC 414

Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 229
            VCLK ++       VCC  C+ WVH +CD   S+ K L+     +  Y CP CR
Sbjct: 415 GVCLKSWQHKCGGRWVCCHGCESWVHAECDKKCSNLKDLR-----DNSYFCPYCR 464


>gi|205830432|ref|NP_004638.2| zinc finger protein neuro-d4 isoform b [Homo sapiens]
 gi|297704631|ref|XP_002829198.1| PREDICTED: zinc finger protein neuro-d4 isoform 4 [Pongo abelii]
 gi|395847021|ref|XP_003796185.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Otolemur
           garnettii]
 gi|402905403|ref|XP_003915509.1| PREDICTED: zinc finger protein neuro-d4 isoform 5 [Papio anubis]
 gi|313104100|sp|Q92782.2|DPF1_HUMAN RecName: Full=Zinc finger protein neuro-d4; AltName:
           Full=BRG1-associated factor 45B; Short=BAF45B; AltName:
           Full=D4, zinc and double PHD fingers family 1
          Length = 380

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 16/124 (12%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 249 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 306

Query: 88  SCRICEICRRTGD----------PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
            C+ C +C  + +           ++ +FC  CD  YH YC  PP      G +   LC 
Sbjct: 307 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 366

Query: 135 KHTK 138
           +H K
Sbjct: 367 RHLK 370


>gi|357616639|gb|EHJ70297.1| hypothetical protein KGM_09919 [Danaus plexippus]
          Length = 1569

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 47/119 (39%), Gaps = 19/119 (15%)

Query: 37   CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 89
            C+ C  G+NE      ++L C  C K YH  C K   +      W  W+C +        
Sbjct: 1256 CQFCLSGDNED-----QLLLCDGCDKGYHTYCFKPRMEKIPDGDWYCWECVNKARGGSRE 1310

Query: 90   RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
            R+C +C       + + C  C  AYH  C +PP   +  G + C +      C S  P 
Sbjct: 1311 RVCIVCGGAAR-GRALPCALCVRAYHLDCHYPPLTKMPRGKWYCSQ------CASRAPA 1362


>gi|350645335|emb|CCD59958.1| zinc finger protein, putative [Schistosoma mansoni]
          Length = 1690

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 25/43 (58%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
            C ICRR  D +  + C  C+ A+H YC  PP K V +G + CP
Sbjct: 1253 CRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCP 1295


>gi|405959089|gb|EKC25157.1| Bromodomain adjacent to zinc finger domain protein 1A [Crassostrea
            gigas]
          Length = 1488

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 25/43 (58%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
            C ICRR GD  + + C +CD  +H YC  P  K+V  G + CP
Sbjct: 1132 CRICRRKGDAEQMLLCDKCDRGHHMYCLKPRLKHVPKGDWFCP 1174


>gi|332206683|ref|XP_003252426.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4
           [Nomascus leucogenys]
          Length = 459

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 50/124 (40%), Gaps = 16/124 (12%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLK---NWAQNRDLFHWSSW 84
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+   N       + W   
Sbjct: 328 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 385

Query: 85  KCPSCRICEICRRTG-------DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
           +C SC +C      G         ++ +FC  CD  YH YC  PP      G +   LC 
Sbjct: 386 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 445

Query: 135 KHTK 138
           +H K
Sbjct: 446 RHLK 449


>gi|145534470|ref|XP_001452979.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420679|emb|CAK85582.1| unnamed protein product [Paramecium tetraurelia]
          Length = 286

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 40/69 (57%), Gaps = 4/69 (5%)

Query: 162 CDACGRLFVKGNYCPVCLKVY--RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDG 219
           CD C +L+ KGN+C  C +VY   D E+   V CD CQ+W H  C+  +  + +Q + + 
Sbjct: 87  CDKCSKLYNKGNFCDFCEQVYGSYDDEAV-WVQCDSCQKWNHIVCEQKNRNQNIQIEFET 145

Query: 220 NLQYRCPTC 228
           + QY C TC
Sbjct: 146 S-QYHCLTC 153


>gi|392575621|gb|EIW68754.1| hypothetical protein TREMEDRAFT_63213 [Tremella mesenterica DSM
           1558]
          Length = 2086

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 32/61 (52%), Gaps = 6/61 (9%)

Query: 91  ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
           +CE+CR  G P+K + C +CD  YH YC  PP K + +      +   C SC    PGNG
Sbjct: 470 VCEVCRSGGAPDKMLLCDKCDCGYHIYCLDPPLKGLPAY-----EEWYCTSCLLG-PGNG 523

Query: 151 L 151
            
Sbjct: 524 F 524


>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Nasonia vitripennis]
          Length = 1382

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 83/220 (37%), Gaps = 52/220 (23%)

Query: 5   EDYARRFESGYVATASKDIAGEEQGQSNT-NVMCRLCFVGENEGC----ER--ARRMLSC 57
           E+ A + +   +   SK+I  +E+   +  N  C  C  G    C    ER   R   S 
Sbjct: 619 ENNAYKEDLKEIKAKSKEIQEKEEAAFDYDNFKCIDCLSGVAPPCFVCHERDGERTKCSI 678

Query: 58  KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--------KFMFCRR 109
            +CGK YH +CLK+W Q +  +      CP   IC  C      N        KF  C +
Sbjct: 679 LACGKHYHPDCLKSWPQCQ--WQGGRLTCPH-HICHTCASDNPQNSHPRSAGEKFAKCVK 735

Query: 110 CDAAYHCYCQHPPHKN--VSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
           C + YH      P  +  ++    +CPKH K     S+ P                    
Sbjct: 736 CPSTYHASISCLPAGSTILTGSQIVCPKHYK----SSHPP-------------------- 771

Query: 168 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
             V   +C +C      +E   ++CCD C    H +C GI
Sbjct: 772 --VNATWCFLC------TEGGSLICCDTCPTSFHLECLGI 803


>gi|307199491|gb|EFN80104.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
           [Harpegnathos saltator]
          Length = 1948

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)

Query: 25  GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           GEE  Q++    C        E C++   ++ C +C + YH  CL+   +      WS  
Sbjct: 359 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 408

Query: 85  KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
            CP C                E CR   D  + + C  C +AYH +C +PP   +  G +
Sbjct: 409 -CPHCEGEGITGAADDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 467

Query: 132 LCPK 135
            CP+
Sbjct: 468 KCPR 471


>gi|344298355|ref|XP_003420859.1| PREDICTED: zinc finger protein neuro-d4-like [Loxodonta africana]
          Length = 264

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 50/124 (40%), Gaps = 16/124 (12%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLK---NWAQNRDLFHWSSW 84
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+   N       + W   
Sbjct: 133 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 190

Query: 85  KCPSCRICEICRRTGDP-------NKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
           +C SC +C      G         ++ +FC  CD  YH YC  PP      G +   LC 
Sbjct: 191 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 250

Query: 135 KHTK 138
           +H K
Sbjct: 251 QHLK 254


>gi|218187759|gb|EEC70186.1| hypothetical protein OsI_00918 [Oryza sativa Indica Group]
          Length = 405

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 5/96 (5%)

Query: 135 KHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCC 193
           K   C  CG+ +P    S +   G    C  C +L     YC +C K++  ++    VCC
Sbjct: 306 KLPGCDICGNRLPCKIASKKKQAGERLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCC 365

Query: 194 DVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
           D CQ WVH +C    D+  ++ +   N  Y CP C+
Sbjct: 366 DECQIWVHVEC----DQTCIKMEDLENADYFCPDCK 397


>gi|115435312|ref|NP_001042414.1| Os01g0218800 [Oryza sativa Japonica Group]
 gi|56784088|dbj|BAD81417.1| putative trithorax 3 [Oryza sativa Japonica Group]
 gi|113531945|dbj|BAF04328.1| Os01g0218800 [Oryza sativa Japonica Group]
          Length = 991

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 5/92 (5%)

Query: 139 CHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
           C SCG+ VP      +   G    C  C +L     YC +C K++  ++    VCCD CQ
Sbjct: 342 CDSCGNRVPPKIAKKKKQAGEQLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCCDECQ 401

Query: 198 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            WVH +CD       +  +   N  Y CP C+
Sbjct: 402 IWVHVECDLTC----INMEDLENADYFCPDCK 429


>gi|218188422|gb|EEC70849.1| hypothetical protein OsI_02356 [Oryza sativa Indica Group]
          Length = 1226

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 11/103 (10%)

Query: 36   MCRLCFVGENEGCERARRMLSC---KSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCR 90
            +C++C   E    E+ +R L C       K YH +CLK    A ++ L     W CPSC 
Sbjct: 1074 LCKMCGNPE----EKDKRFLVCGHTHCLYKYYHISCLKATQIASDKQLDK-PCWYCPSC- 1127

Query: 91   ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
            +C +C    D +  + C  CD AYH YC  P   ++  G + C
Sbjct: 1128 LCRVCHSDRDDDLTILCDGCDEAYHLYCITPRRTSIPKGKWYC 1170


>gi|380020464|ref|XP_003694103.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
           protein Mi-2 homolog [Apis florea]
          Length = 1964

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)

Query: 25  GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           GEE  Q++    C        E C++   ++ C +C + YH  CL+   +      WS  
Sbjct: 359 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 408

Query: 85  KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
            CP C                E CR   D  + + C  C +AYH +C +PP   +  G +
Sbjct: 409 -CPHCEGEGIAGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 467

Query: 132 LCPK 135
            CP+
Sbjct: 468 KCPR 471


>gi|110777198|ref|XP_624414.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
           [Apis mellifera]
          Length = 1966

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)

Query: 25  GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           GEE  Q++    C        E C++   ++ C +C + YH  CL+   +      WS  
Sbjct: 360 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 409

Query: 85  KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
            CP C                E CR   D  + + C  C +AYH +C +PP   +  G +
Sbjct: 410 -CPHCEGEGIAGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 468

Query: 132 LCPK 135
            CP+
Sbjct: 469 KCPR 472


>gi|383851360|ref|XP_003701201.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
           [Megachile rotundata]
          Length = 1967

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)

Query: 25  GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           GEE  Q++    C        E C++   ++ C +C + YH  CL+   +      WS  
Sbjct: 361 GEEGIQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 410

Query: 85  KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
            CP C                E CR   D  + + C  C +AYH +C +PP   +  G +
Sbjct: 411 -CPHCEGEGITGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 469

Query: 132 LCPK 135
            CP+
Sbjct: 470 KCPR 473


>gi|355703500|gb|EHH29991.1| hypothetical protein EGK_10556 [Macaca mulatta]
          Length = 385

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 16/124 (12%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 254 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 311

Query: 88  SCRICEICRRTGD----------PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
            C+ C +C  + +           ++ +FC  CD  YH YC  PP      G +   LC 
Sbjct: 312 ECKSCSLCGTSENDGASLAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 371

Query: 135 KHTK 138
           +H K
Sbjct: 372 RHLK 375


>gi|348534080|ref|XP_003454531.1| PREDICTED: PHD finger protein 10-like [Oreochromis niloticus]
          Length = 491

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/99 (23%), Positives = 46/99 (46%), Gaps = 3/99 (3%)

Query: 36  MCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
           +C +C  G+    + +   ++ C  C    H +CL    +   +     W+C  C+ C +
Sbjct: 368 ICGICQKGKEANKKGKPEALIHCSECENSGHPSCLDMSEELVSMIQTYRWQCMECKTCTV 427

Query: 95  CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
           C++    ++ MFC  CD  YH +C      ++ +G ++C
Sbjct: 428 CQQPHHEDEMMFCDMCDRGYHTFCVG--MDSIPTGLWIC 464


>gi|218187758|gb|EEC70185.1| hypothetical protein OsI_00917 [Oryza sativa Indica Group]
          Length = 991

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 5/92 (5%)

Query: 139 CHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
           C SCG+ VP      +   G    C  C +L     YC +C K++  ++    VCCD CQ
Sbjct: 342 CDSCGNRVPPKIAKKKKQAGEQLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCCDECQ 401

Query: 198 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
            WVH +CD       +  +   N  Y CP C+
Sbjct: 402 IWVHVECDLTC----INMEDLENADYFCPDCK 429


>gi|449444070|ref|XP_004139798.1| PREDICTED: uncharacterized protein LOC101205573 [Cucumis sativus]
          Length = 574

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 6/75 (8%)

Query: 62  KKYHRNCL-KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
           K YH  CL K   ++ D    + W CPSC +C  C    D +K + C  CD  +H YC  
Sbjct: 434 KCYHTRCLTKKQLKSYD----ACWYCPSC-LCRACLINQDDDKIVLCDGCDHGFHIYCMR 488

Query: 121 PPHKNVSSGPYLCPK 135
           PP   +  G + C K
Sbjct: 489 PPLAAIPKGKWFCSK 503


>gi|340726708|ref|XP_003401695.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
           protein Mi-2 homolog [Bombus terrestris]
          Length = 1974

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)

Query: 25  GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           GEE  Q++    C        E C++   ++ C +C + YH  CL+   +      WS  
Sbjct: 369 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 418

Query: 85  KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
            CP C                E CR   D  + + C  C +AYH +C +PP   +  G +
Sbjct: 419 -CPHCEGEGIAGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 477

Query: 132 LCPK 135
            CP+
Sbjct: 478 KCPR 481


>gi|350424653|ref|XP_003493868.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
           [Bombus impatiens]
          Length = 1965

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)

Query: 25  GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
           GEE  Q++    C        E C++   ++ C +C + YH  CL+   +      WS  
Sbjct: 360 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 409

Query: 85  KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
            CP C                E CR   D  + + C  C +AYH +C +PP   +  G +
Sbjct: 410 -CPHCEGEGIAGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 468

Query: 132 LCPK 135
            CP+
Sbjct: 469 KCPR 472


>gi|335289600|ref|XP_003355929.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Sus scrofa]
 gi|119577165|gb|EAW56761.1| D4, zinc and double PHD fingers family 1, isoform CRA_b [Homo
           sapiens]
          Length = 353

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 16/124 (12%)

Query: 29  GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
           G    N  C  C  G  + GC     ++SC  CG+  H +CL+             W+C 
Sbjct: 222 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 279

Query: 88  SCRICEICRRTGD----------PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
            C+ C +C  + +           ++ +FC  CD  YH YC  PP      G +   LC 
Sbjct: 280 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 339

Query: 135 KHTK 138
           +H K
Sbjct: 340 RHLK 343


>gi|432892838|ref|XP_004075862.1| PREDICTED: tyrosine-protein kinase BAZ1B-like [Oryzias latipes]
          Length = 1572

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 6/63 (9%)

Query: 92   CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
            C++CRR GD  K + C  C+ A+H +C  P    + +G +LCP      +C   V   G 
Sbjct: 1183 CKVCRRKGDDEKLILCDECNKAFHLFCLRPALYRIPTGEWLCP------ACQPTVARRGS 1236

Query: 152  SVR 154
             +R
Sbjct: 1237 RLR 1239


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.132    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,315,885,204
Number of Sequences: 23463169
Number of extensions: 545413839
Number of successful extensions: 1520511
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2132
Number of HSP's successfully gapped in prelim test: 2700
Number of HSP's that attempted gapping in prelim test: 1501520
Number of HSP's gapped (non-prelim): 16224
length of query: 754
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 603
effective length of database: 8,816,256,848
effective search space: 5316202879344
effective search space used: 5316202879344
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 81 (35.8 bits)