BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004423
(754 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449436315|ref|XP_004135938.1| PREDICTED: uncharacterized protein LOC101208296 [Cucumis sativus]
gi|449488832|ref|XP_004158186.1| PREDICTED: uncharacterized protein LOC101230410 [Cucumis sativus]
Length = 847
Score = 968 bits (2502), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 503/766 (65%), Positives = 584/766 (76%), Gaps = 53/766 (6%)
Query: 5 EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKY 64
EDYARRFESG + AS +I GEEQGQSN NVMCR+CF GENE ERAR+MLSCK+CGKKY
Sbjct: 119 EDYARRFESGNL-DASGNIVGEEQGQSNVNVMCRICFFGENESSERARKMLSCKTCGKKY 177
Query: 65 HRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHK 124
HR+CLK+WAQ+RDLFHWSSW CPSCR CE+CRRTGDPNKFMFC+RCD AYHCYCQHPPHK
Sbjct: 178 HRSCLKSWAQHRDLFHWSSWTCPSCRACEVCRRTGDPNKFMFCKRCDGAYHCYCQHPPHK 237
Query: 125 NVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD 184
NVSSGPYLCPKHT+CHSCGSNVPGNG SVRWFLGYT CDACGRLFVKGNYCPVCLKVYRD
Sbjct: 238 NVSSGPYLCPKHTRCHSCGSNVPGNGQSVRWFLGYTFCDACGRLFVKGNYCPVCLKVYRD 297
Query: 185 SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRE 244
SESTPMVCCD+CQRWVHC CD ISDEKYLQFQ+DGNLQY+C CRGECYQV++LEDAV+E
Sbjct: 298 SESTPMVCCDICQRWVHCHCDSISDEKYLQFQIDGNLQYKCTACRGECYQVKNLEDAVQE 357
Query: 245 LWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGV 304
+WRR+D AD+DLI +LRAAAGLPT+DEIFSISPYSDDEENGP V+KNEFGRSLKLSLKG
Sbjct: 358 IWRRRDEADRDLIVNLRAAAGLPTQDEIFSISPYSDDEENGPAVVKNEFGRSLKLSLKGF 417
Query: 305 VDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQSP 364
DK PKK K++GKK NKKY ++KG PL ++ E DQ+FE +DV G G++
Sbjct: 418 ADKVPKKSKDYGKKSSNKKYAKEKG--TPLANQSELDQNFEVRNDVQQSGFGEGNEKNGG 475
Query: 365 -----KNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTS 418
NEGLD S VAG +SH EG CS++QPG+LKHK+VDEVMVSD++K S+ V+ K S
Sbjct: 476 LLPQNNNEGLDT-SPVAGSLSHNEGTCSVNQPGVLKHKFVDEVMVSDEEKTSKVVQIKAS 534
Query: 419 KPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGI 478
K LD+GED GK+ SKSKT K KKLVINLGARKINV SP+SDASSCQR QDL SN
Sbjct: 535 KAQGLDTGEDSGKYASKSKTAKGKKLVINLGARKINVATSPKSDASSCQRGQDLAVSN-- 592
Query: 479 EDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDS 538
G++V++SSQS GLK +V FG+VR SD+
Sbjct: 593 -------------------------GEKVNNSSQSTGLKAGETENSVPSFGKVRFGSSDT 627
Query: 539 NTKVSRGSSADEHE---PEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDR----KQLESR 591
NT RG++A E P+ V S KRN++ S AV +G V+ ++ ++ KQLES
Sbjct: 628 NTTFGRGNTASGSEVGPPDGTRVFSRKRNMEGSTPAVGSLGGVSTVKEEKVPSGKQLESG 687
Query: 592 PNASRESNDD---TSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSK 648
+ + +DD T + QSLP+DSKP L+ KF+KP L+N Q+S EEEKSL+KGQRSK
Sbjct: 688 SHICNDGHDDNGQTPLPQSLPRDSKPLLKFKFKKPPLDN---QISCHEEEKSLVKGQRSK 744
Query: 649 RKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSW 708
RKRPSP EK FNE ED +S+QD+L+ DANWILKKLGKDAIGKRVEV SD SW
Sbjct: 745 RKRPSPLMEKVPFNEVEDLTRSHQDNLLD---DANWILKKLGKDAIGKRVEVQHPSDKSW 801
Query: 709 HKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRSMS 754
KGVV D ++GTSTLS+ LDD R KTLELGKQG+R VP KQKRS S
Sbjct: 802 QKGVVRDMIDGTSTLSVALDDGREKTLELGKQGIRLVPLKQKRSKS 847
>gi|224106097|ref|XP_002314042.1| predicted protein [Populus trichocarpa]
gi|222850450|gb|EEE87997.1| predicted protein [Populus trichocarpa]
Length = 845
Score = 918 bits (2372), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/762 (63%), Positives = 566/762 (74%), Gaps = 59/762 (7%)
Query: 9 RRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNC 68
++ ESG AS+D GE+ G C++CFVG+ G ERAR+ML CKSCGKKYHR+C
Sbjct: 123 KKVESGDT-VASEDTPGEDTGP-----FCQICFVGQTGGSERARKMLPCKSCGKKYHRSC 176
Query: 69 LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSS 128
LK WA++RDLFHWSSW CPSC+ CE+CR+TGDPNKF+FC+RCD AYHCYCQHPPHKNVSS
Sbjct: 177 LKTWARHRDLFHWSSWTCPSCQTCEVCRKTGDPNKFVFCKRCDGAYHCYCQHPPHKNVSS 236
Query: 129 GPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSEST 188
GPYLCPKHT+CHSCGS+VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSEST
Sbjct: 237 GPYLCPKHTRCHSCGSSVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSEST 296
Query: 189 PMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRR 248
PMVCCD+CQRWVHC CDGISDEKYLQFQVDGNLQY+C TCRGECYQV+DL+DA++ELWRR
Sbjct: 297 PMVCCDICQRWVHCHCDGISDEKYLQFQVDGNLQYQCATCRGECYQVKDLKDAIQELWRR 356
Query: 249 KDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKS 308
+D AD+ LIASLRAAAGLP +++IFSISPYSD + NGP L+N+F S+ LSLKG+ KS
Sbjct: 357 RDKADRGLIASLRAAAGLPAQEDIFSISPYSDGDGNGPEALRNDFRHSINLSLKGIGGKS 416
Query: 309 PKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGD----DTQSP 364
PKK +HGKK NKK+P+KKG SK EP Q HD+HS + D D++S
Sbjct: 417 PKKSNDHGKKHWNKKFPKKKGCHAASISKSEPHQ-----HDIHSSVHDMDDCKIYDSESQ 471
Query: 365 KNEGLDIP-SSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV-KFKTSKPHD 422
G D S VAGIV+HTEGVCSISQPG+LKHK+VDEVMVSD ++ S V K K++KPHD
Sbjct: 472 AKGGSDKSCSPVAGIVNHTEGVCSISQPGVLKHKFVDEVMVSDGERTSNVFKIKSNKPHD 531
Query: 423 LDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPS 482
+DSG D KH KSK++KAK+LVINLGARKINV++ P+SD SCQ E DL SN
Sbjct: 532 VDSGGDTEKHAGKSKSVKAKRLVINLGARKINVSSPPKSDVQSCQSELDLKASNR----- 586
Query: 483 LQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGL-KIAGRGGNVIKFGRVRQEVSDSNTK 541
D DHS Q+RGL K A R GN+IKFG+V+ E S+ N K
Sbjct: 587 ----------------------DTADHSGQTRGLIKFARREGNLIKFGKVKAEASNFNPK 624
Query: 542 VSRGSSADEHEP---EHMHVLSGKRNIDRSRAAVSRVG-EVAALRGDR----KQLESRPN 593
GS +D +E +H V S K++++ SRA V G EV LR D+ KQ E RP+
Sbjct: 625 SDGGSHSDGYETVPLDHARVSSAKKSLEGSRAVVRPAGGEVPTLRSDKLSLGKQSEVRPD 684
Query: 594 ASRESNDD---TSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRK 650
ESN D T + SLPK+SK L+LK +KPNLENQ+S + EEEKS I+GQRSKRK
Sbjct: 685 THTESNGDSGDTPIFHSLPKESKLSLKLKIKKPNLENQSSLIHLHEEEKSNIRGQRSKRK 744
Query: 651 RPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHK 710
R S EKT++NEDE S+ DS M+E AN ILKKLGKDAIGKRVEVHQ SDNSWHK
Sbjct: 745 RASSLMEKTMYNEDEGMPPSHLDSEMTE---ANRILKKLGKDAIGKRVEVHQPSDNSWHK 801
Query: 711 GVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
GVV+D VEGTS LS+TLDD VKTL+LGKQ VR V QKQKRS
Sbjct: 802 GVVSDIVEGTSKLSVTLDDGIVKTLKLGKQAVRIVSQKQKRS 843
>gi|356544287|ref|XP_003540585.1| PREDICTED: uncharacterized protein LOC100815407 [Glycine max]
Length = 845
Score = 891 bits (2303), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/764 (61%), Positives = 570/764 (74%), Gaps = 34/764 (4%)
Query: 1 MVAAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSC 60
M+AAE+YARRFESG V ++ GEEQGQ+N + CR+C GENEG E+A++MLSCKSC
Sbjct: 102 MIAAEEYARRFESGDVQNTPGNLTGEEQGQANRS-YCRICKCGENEGSEKAQKMLSCKSC 160
Query: 61 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
GKKYHRNCL++W +NRDLFHWSSW CP CRICE CRRTGDP+KFMFC+RCD AYHCYC
Sbjct: 161 GKKYHRNCLRSWGRNRDLFHWSSWTCPLCRICEACRRTGDPSKFMFCKRCDGAYHCYCLQ 220
Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLK 180
PPHK+V +GPYLC KH +CHSCGSNVPGNGLSVRWF+ YT CDACGRLF KGNYCPVCLK
Sbjct: 221 PPHKSVCNGPYLCTKHARCHSCGSNVPGNGLSVRWFMAYTNCDACGRLFTKGNYCPVCLK 280
Query: 181 VYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLED 240
VYRDSESTPMVCCD CQ WVHCQCD IS+EKY QFQVDGNLQY+CPTCRGECYQV++ ED
Sbjct: 281 VYRDSESTPMVCCDTCQLWVHCQCDNISEEKYHQFQVDGNLQYKCPTCRGECYQVKNPED 340
Query: 241 AVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLS 300
A +E+WRR+++A++DLI+SLRAAAGLPT++EIFSISP+SDDE++GP+ LK+E RS K S
Sbjct: 341 AAQEIWRRRNIAERDLISSLRAAAGLPTQEEIFSISPFSDDEDSGPLKLKSESARSFKFS 400
Query: 301 LKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDV---HSYGNSF 357
LK + + SPKK +KK +KK Q + SK + S EG+ D+ HS +
Sbjct: 401 LKNLANDSPKKKTS------SKKTAKKKNSQSFMTSKIDTHNSCEGHSDIKSLHSLDDDK 454
Query: 358 GDDTQSPKNEGLDIPSSVA-GIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKF 415
DD QS +NEG D+ SS A G +S TE I+QPGILK K+VDEVMVSD+++ R V+
Sbjct: 455 NDDIQSQRNEGPDVYSSPATGSLSQTEASFPINQPGILKQKFVDEVMVSDEERKPRVVRI 514
Query: 416 KTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTS 475
K++K H DS E+ GKH K++ +K KKLVINLGARKINV +SPRSD+SSCQ++QD T
Sbjct: 515 KSNKAHIPDSEEESGKHSLKTQNVKGKKLVINLGARKINVASSPRSDSSSCQKDQDPVTV 574
Query: 476 NGIEDPSLQRMNSKFVLDRHDGSSKL--GDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQ 533
NG ED S R KF LDR D +++ G G +VD S QS+ +++GR GN+IK G+V+
Sbjct: 575 NGNEDRSQWRKGDKFALDRQDDTARHIDGKGIKVD-SGQSKFFRVSGREGNLIKLGKVKP 633
Query: 534 EVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPN 593
++S+ N RG+ +D K +ID +++VG A RG+R L +
Sbjct: 634 DISEFNLTSGRGNMSDGRI---------KHSID---GMINQVGIKATSRGERTYLGRQSE 681
Query: 594 ASRES-----NDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSK 648
S ++ N++ + SLPKDSKP LR KF+KP++E+QNS EEEK IKGQRSK
Sbjct: 682 GSSDAYETDDNNNRTPSHSLPKDSKPLLRFKFKKPSIESQNS--PHQEEEKMTIKGQRSK 739
Query: 649 RKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSW 708
RKRPSPF EK FNE E +QS+QDS M IMDANWIL KLG DAIGKRVEVHQ SDNSW
Sbjct: 740 RKRPSPFKEKASFNESEGVSQSHQDSAMDGIMDANWILMKLGNDAIGKRVEVHQTSDNSW 799
Query: 709 HKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
HKG+VTD VEGTS L + LDD +VKT+EL KQGVRFVPQKQKRS
Sbjct: 800 HKGLVTDVVEGTSKLYVALDDGKVKTVELRKQGVRFVPQKQKRS 843
>gi|356529861|ref|XP_003533505.1| PREDICTED: uncharacterized protein LOC100809429 [Glycine max]
Length = 820
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/762 (60%), Positives = 555/762 (72%), Gaps = 59/762 (7%)
Query: 1 MVAAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSC 60
M+AAE+YARRFESG V ++ G++QGQ+N + CR+C GENEG E+A++MLSCKSC
Sbjct: 106 MIAAEEYARRFESGDVQNTPGNVTGDDQGQANRS-YCRICKCGENEGSEKAQKMLSCKSC 164
Query: 61 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
GKKYHRNCL++W +NRDLFHWSSW CP CRICE CRRTGDP+KFMFC+RCD AYHCYC
Sbjct: 165 GKKYHRNCLRSWGRNRDLFHWSSWTCPLCRICEACRRTGDPSKFMFCKRCDGAYHCYCLQ 224
Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLK 180
PPHK+V +GPYLC KH +CHSCGSNVPGNGLSVRWF+ YT CDACGRLF KGNYCPVCLK
Sbjct: 225 PPHKSVCNGPYLCTKHARCHSCGSNVPGNGLSVRWFMSYTNCDACGRLFTKGNYCPVCLK 284
Query: 181 VYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLED 240
VYRDSESTPMVCCD CQ WVHCQCD ISDEKY QFQ+DGNLQY+CPTCRGECYQV++ ED
Sbjct: 285 VYRDSESTPMVCCDSCQLWVHCQCDNISDEKYHQFQLDGNLQYKCPTCRGECYQVKNPED 344
Query: 241 AVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLS 300
A RE+WRR+++A++DLIASLRAAAGLPT++EIFSISP+SDDE++GP+ LK+E RS K S
Sbjct: 345 AAREIWRRRNIAERDLIASLRAAAGLPTQEEIFSISPFSDDEDSGPLKLKSESARSFKFS 404
Query: 301 LKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDV---HSYGNSF 357
LK + + SPKK +KK +KK Q+ + SK + S EG+ D+ HS +
Sbjct: 405 LKNLANDSPKKKSS------SKKTAKKKDSQLFMTSKIDTHNSCEGHSDIKSLHSLDDDK 458
Query: 358 GDDTQSPKNEGLDIPSS-VAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKF 415
DD QS +NEG D+ SS AG +S TE I QPGILK K+VDEVMVSD+++ R V+
Sbjct: 459 NDDIQSQRNEGPDVYSSPAAGSLSQTEASFPIDQPGILKQKFVDEVMVSDEERKPRVVRI 518
Query: 416 KTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTS 475
K++K DS E+ GKH K++ +K KKLVINLGARKINV +SPRSD SSCQ++QD T
Sbjct: 519 KSNKALIPDSEEESGKHSLKTQNVKGKKLVINLGARKINVASSPRSDTSSCQKDQDPVTV 578
Query: 476 NGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEV 535
N G++VD S QS+ +++GR GN+IK G+V+ +V
Sbjct: 579 N---------------------------GNKVD-SGQSKIFRVSGREGNLIKLGKVKPDV 610
Query: 536 SDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNAS 595
S+ N RG+ +D K +ID +++VG A RG+R L + S
Sbjct: 611 SEFNLTSGRGNMSDGRI---------KHSID---GMINQVGIKAPSRGERTYLGKQSEGS 658
Query: 596 RES-----NDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRK 650
++ N++ + SLPKDSKP LR KF+KP++E+QNS SQ EEEK IKGQRSKRK
Sbjct: 659 SDAYETDDNNNRTPSHSLPKDSKPLLRFKFKKPSIESQNS--SQQEEEKMTIKGQRSKRK 716
Query: 651 RPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHK 710
RPSPF EKT FNE E +QS QDS M IMDANWIL KLG DAIGKRVEVHQ SDNSWHK
Sbjct: 717 RPSPFKEKTTFNESEGVSQSRQDSAMDGIMDANWILMKLGNDAIGKRVEVHQTSDNSWHK 776
Query: 711 GVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
GVVTD VEGTS L + LDD +VK +EL KQGVRFVPQKQKRS
Sbjct: 777 GVVTDVVEGTSKLYVALDDGKVKNVELRKQGVRFVPQKQKRS 818
>gi|297736278|emb|CBI24916.3| unnamed protein product [Vitis vinifera]
Length = 679
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/554 (74%), Positives = 462/554 (83%), Gaps = 11/554 (1%)
Query: 2 VAAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCG 61
+ AEDYARRFESG + SKDI GEEQ QSN NVMCR+CF GE EG ERAR+ML C SCG
Sbjct: 109 MVAEDYARRFESGDLVDTSKDIVGEEQSQSNVNVMCRICFFGEMEGSERARKMLPCNSCG 168
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
KKYHR CLK+W+QNRDLFHWSSW CPSCRICE+CRR+GDPNKFMFCRRCD AYHCYCQ P
Sbjct: 169 KKYHRLCLKSWSQNRDLFHWSSWTCPSCRICEVCRRSGDPNKFMFCRRCDDAYHCYCQQP 228
Query: 122 PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 181
PHKNVSSGPYLCPKHT+CHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV
Sbjct: 229 PHKNVSSGPYLCPKHTRCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 288
Query: 182 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQY+C TCRGECYQV+DLEDA
Sbjct: 289 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYKCATCRGECYQVKDLEDA 348
Query: 242 VRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSL 301
V+ELWRR+D AD+DLIASLRA A LPT+DEIFSISPYSDDEENGPV LK+EFGRSLKLSL
Sbjct: 349 VQELWRRRDKADRDLIASLRAKARLPTQDEIFSISPYSDDEENGPVSLKSEFGRSLKLSL 408
Query: 302 KGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDD- 360
KG VDKSPKK KE+GK+ NKK +KKG+Q PL SK E QSFEG+ D + S GDD
Sbjct: 409 KGSVDKSPKKTKEYGKQSSNKKNVKKKGHQTPLISKKESHQSFEGHDDAQPFEYSLGDDK 468
Query: 361 -TQSPKNEGLDIPSS-VAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV-KFKT 417
Q +++G + SS VAG +SHTEG+CSI+QPG+LKHK+VDE+ V+++D+ SRV + K+
Sbjct: 469 NEQPNRSDGRGVFSSPVAGSLSHTEGICSINQPGVLKHKFVDEIAVNNEDRTSRVIQIKS 528
Query: 418 SKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNG 477
+KPH D GED GK SKSKT+K KLVI+LGAR NVTNSPRSDASSCQREQDLTTSNG
Sbjct: 529 NKPHGSDVGEDTGKQASKSKTMKGTKLVIHLGARNRNVTNSPRSDASSCQREQDLTTSNG 588
Query: 478 IEDPSLQRMNSKFVLDRHDGSSKLGD--GDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEV 535
ED S QRM D+HD +K GD GD++D+S Q++G K GR GN+IK G+VR E
Sbjct: 589 SEDTSQQRMG-----DKHDRIAKFGDSKGDKIDYSGQAKGSKHGGREGNLIKLGKVRTEP 643
Query: 536 SDSNTKVSRGSSAD 549
S+ N K RG+ D
Sbjct: 644 SEMNPKFGRGNKDD 657
>gi|297833588|ref|XP_002884676.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297330516|gb|EFH60935.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 778
Score = 827 bits (2135), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/757 (58%), Positives = 518/757 (68%), Gaps = 97/757 (12%)
Query: 3 AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGK 62
AAEDYARRFESG S D AGEE S N+MCR+CF+GE EG ERARRMLSCK+CGK
Sbjct: 114 AAEDYARRFESGVNDLTSNDHAGEEVVHSGMNIMCRMCFLGEGEGSERARRMLSCKTCGK 173
Query: 63 KYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
KYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+CRRTGDPNKFMFC+RCDAAYHCYCQHPP
Sbjct: 174 KYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVCRRTGDPNKFMFCKRCDAAYHCYCQHPP 233
Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVY 182
HKNVSSGPYLCPKHT+CHSC S VPGNGLSVRWFL YTCCDACGRLFVKGNYCPVCLKVY
Sbjct: 234 HKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRWFLSYTCCDACGRLFVKGNYCPVCLKVY 293
Query: 183 RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAV 242
RDSESTPMVCCD+CQRWVHC CDGISD+KYLQFQVDG LQY+C TCRGECYQV+DL+DAV
Sbjct: 294 RDSESTPMVCCDICQRWVHCHCDGISDDKYLQFQVDGKLQYKCATCRGECYQVKDLQDAV 353
Query: 243 RELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLK 302
+ELW++KD+ DK+LIASLRAAAGLPT++EIFSI P+SDD+ENGPV GRSLK S+K
Sbjct: 354 QELWKKKDVVDKELIASLRAAAGLPTDEEIFSIFPFSDDDENGPVS-----GRSLKFSIK 408
Query: 303 GVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQ 362
G+V+KSPKK KE+GK L+KK+ KKG L EP+ E + G D+
Sbjct: 409 GLVEKSPKKSKEYGKHSLSKKHASKKGSHTKL----EPELHQEVGSERLRLGGVRIDNVG 464
Query: 363 SPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPH 421
NE D+ SSVAGI CS +P I+KHK VD+VMV+D++K SR V+ K SKPH
Sbjct: 465 FQINEQSDVNSSVAGI-------CSTHEPKIVKHKRVDDVMVTDEEKPSRIVRIKCSKPH 517
Query: 422 DLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDP 481
D DS ED ++ + K++KAKKLVINLGARKINV+ S +S+ S
Sbjct: 518 DSDS-EDTLRNAGEEKSVKAKKLVINLGARKINVSGSSKSNVVS---------------- 560
Query: 482 SLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTK 541
L R S LG GD+VD + + R LKI+GR FG+ + E S
Sbjct: 561 ---------HLSRDKDQSTLG-GDKVDQTGEVRTLKISGR------FGKTQSEGS----- 599
Query: 542 VSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNASRESNDD 601
A G Q + + +D
Sbjct: 600 -------------------------------------KATFGSITQFPASTSEGNHVDDK 622
Query: 602 TSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEK-SLIKGQRSKRKRPSPFTEKT 659
TS+ +L K+++P L+ K RKPN +Q S V +Q E+EK S KGQRSKRKRPS +
Sbjct: 623 TSISPALQKEARPLLKFKLRKPNSGDQTSSVTTQSEDEKLSSAKGQRSKRKRPSSLVDMA 682
Query: 660 LFNEDEDA-AQSNQDSLMS-EIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTV 717
ED +A S+QDS + E+MDANWILKKLGKD+IGKRVEVH S NSWHKG VTD
Sbjct: 683 SLKEDGEATTHSHQDSSRNDEMMDANWILKKLGKDSIGKRVEVH-GSQNSWHKGTVTDVS 741
Query: 718 EGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRSMS 754
TSTLS++LDD +KT ELGK VRF+PQKQKRS S
Sbjct: 742 GDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQKRSRS 778
>gi|145338256|ref|NP_187459.2| PHD finger-containing protein [Arabidopsis thaliana]
gi|110739634|dbj|BAF01725.1| hypothetical protein [Arabidopsis thaliana]
gi|110741394|dbj|BAF02246.1| hypothetical protein [Arabidopsis thaliana]
gi|332641110|gb|AEE74631.1| PHD finger-containing protein [Arabidopsis thaliana]
Length = 779
Score = 809 bits (2090), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/758 (57%), Positives = 516/758 (68%), Gaps = 98/758 (12%)
Query: 3 AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGK 62
AAEDYARRFESG S D AGEE G S N+MCR+CF+GE EG +RARRMLSCK CGK
Sbjct: 114 AAEDYARRFESGVNDLTSNDHAGEELGHSGMNIMCRMCFLGEGEGSDRARRMLSCKDCGK 173
Query: 63 KYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
KYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+CRRTGDPNKFMFC+RCDAAYHCYCQHPP
Sbjct: 174 KYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVCRRTGDPNKFMFCKRCDAAYHCYCQHPP 233
Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVY 182
HKNVSSGPYLCPKHT+CHSC S VPGNGLSVRWFL YTCCDACGRLFVKGNYCPVCLKVY
Sbjct: 234 HKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRWFLSYTCCDACGRLFVKGNYCPVCLKVY 293
Query: 183 RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAV 242
RDSESTPMVCCD+CQRWVHC CDGISD+KY+QFQVDG LQY+C TCRGECYQV+DL+DAV
Sbjct: 294 RDSESTPMVCCDICQRWVHCHCDGISDDKYMQFQVDGKLQYKCATCRGECYQVKDLQDAV 353
Query: 243 RELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLK 302
+ELW++KD+ DK+LIASLRAAAGLPTE+EIFSI P+SDDEENGPV GRSLK S+K
Sbjct: 354 QELWKKKDVVDKELIASLRAAAGLPTEEEIFSIFPFSDDEENGPVS-----GRSLKFSIK 408
Query: 303 GVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQ 362
G+V+KSPKK KE+G +K + ++K EP+ E + G D+
Sbjct: 409 GLVEKSPKKSKEYG----KHSSSKKHASKKGSHTKLEPEVHQEIGSERRRLGGVRIDNVG 464
Query: 363 SPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPH 421
NE D+ SSVAGI CS +P I+KHK VD+VMV+D++K SR V+ K SKPH
Sbjct: 465 FQINEQSDVNSSVAGI-------CSTHEPKIVKHKRVDDVMVTDEEKPSRIVRIKCSKPH 517
Query: 422 DLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDP 481
D DS ED ++ + K++KAKKLVINLGARKINV+ S +S+ S
Sbjct: 518 DSDS-EDTLRNAGEEKSVKAKKLVINLGARKINVSGSSKSNVVS---------------- 560
Query: 482 SLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNT- 540
L R S LG GD+VD + + R LKI+GR FG+ + E S +
Sbjct: 561 ---------HLSRDKDQSTLG-GDKVDQTGEVRTLKISGR------FGKTQSEGSKATFG 604
Query: 541 KVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNASRESND 600
V++ +A E H+ +D
Sbjct: 605 SVTQFPAASTSEGNHV------------------------------------------DD 622
Query: 601 DTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEK-SLIKGQRSKRKRPSPFTEK 658
TS+ +L K+++P L+ K RKPN +Q S V +Q E+EK S KGQRSKRKRPS +
Sbjct: 623 KTSISPALQKEARPLLKFKLRKPNSGDQTSSVTTQSEDEKLSSAKGQRSKRKRPSSLVDM 682
Query: 659 TLFNEDEDA-AQSNQD-SLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDT 716
ED +A S+QD S E+MDANWILKKLGKD+IGKRVEVH S NSW KG VTD
Sbjct: 683 ASLKEDGEATTHSHQDNSRNDEMMDANWILKKLGKDSIGKRVEVH-GSQNSWRKGTVTDV 741
Query: 717 VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRSMS 754
TSTLS++LDD +KT ELGK VRF+PQKQKRS S
Sbjct: 742 SGDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQKRSRS 779
>gi|6648214|gb|AAF21212.1|AC013483_36 unknown protein [Arabidopsis thaliana]
Length = 764
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/758 (55%), Positives = 502/758 (66%), Gaps = 113/758 (14%)
Query: 3 AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGK 62
AAEDYARRFESG S D AGEE G S N+MCR+CF+GE EG +RARRMLSCK CGK
Sbjct: 114 AAEDYARRFESGVNDLTSNDHAGEELGHSGMNIMCRMCFLGEGEGSDRARRMLSCKDCGK 173
Query: 63 KYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
KYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+CRRTGDPNKFMFC+RCDAAYHCYCQHPP
Sbjct: 174 KYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVCRRTGDPNKFMFCKRCDAAYHCYCQHPP 233
Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVY 182
HKNVSSGPYLCPKHT+CHSC S VPGNGLSVRWFL YTCCDACGRLFVKGNYCPVCLKVY
Sbjct: 234 HKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRWFLSYTCCDACGRLFVKGNYCPVCLKVY 293
Query: 183 RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAV 242
RDSESTPMVCCD+CQRWVHC CDGISD+KY+QFQVDG LQY+C TCRGECYQV+DL+DAV
Sbjct: 294 RDSESTPMVCCDICQRWVHCHCDGISDDKYMQFQVDGKLQYKCATCRGECYQVKDLQDAV 353
Query: 243 RELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLK 302
+ELW++KD+ DK+LIASLRAAA DDEENGPV GRSLK S+K
Sbjct: 354 QELWKKKDVVDKELIASLRAAA---------------DDEENGPVS-----GRSLKFSIK 393
Query: 303 GVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQ 362
G+V+KSPKK KE+G +K + ++K EP+ E + G D+
Sbjct: 394 GLVEKSPKKSKEYG----KHSSSKKHASKKGSHTKLEPEVHQEIGSERRRLGGVRIDNVG 449
Query: 363 SPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPH 421
NE D+ SSVAGI CS +P I+KHK VD+VMV+D++K SR V+ K SKPH
Sbjct: 450 FQINEQSDVNSSVAGI-------CSTHEPKIVKHKRVDDVMVTDEEKPSRIVRIKCSKPH 502
Query: 422 DLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDP 481
D DS ED ++ + K++KAKKLVINLGARKINV+ S +S+ S
Sbjct: 503 DSDS-EDTLRNAGEEKSVKAKKLVINLGARKINVSGSSKSNVVS---------------- 545
Query: 482 SLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSN-T 540
L R S LG GD+VD + + R LKI+GR FG+ + E S +
Sbjct: 546 ---------HLSRDKDQSTLG-GDKVDQTGEVRTLKISGR------FGKTQSEGSKATFG 589
Query: 541 KVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNASRESND 600
V++ +A E H+ +D
Sbjct: 590 SVTQFPAASTSEGNHV------------------------------------------DD 607
Query: 601 DTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEK-SLIKGQRSKRKRPSPFTEK 658
TS+ +L K+++P L+ K RKPN +Q S V +Q E+EK S KGQRSKRKRPS +
Sbjct: 608 KTSISPALQKEARPLLKFKLRKPNSGDQTSSVTTQSEDEKLSSAKGQRSKRKRPSSLVDM 667
Query: 659 TLFNEDEDA-AQSNQD-SLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDT 716
ED +A S+QD S E+MDANWILKKLGKD+IGKRVEVH S NSW KG VTD
Sbjct: 668 ASLKEDGEATTHSHQDNSRNDEMMDANWILKKLGKDSIGKRVEVH-GSQNSWRKGTVTDV 726
Query: 717 VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRSMS 754
TSTLS++LDD +KT ELGK VRF+PQKQKRS S
Sbjct: 727 SGDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQKRSRS 764
>gi|413916644|gb|AFW56576.1| RING/FYVE/PHD-type zinc finger family protein [Zea mays]
Length = 819
Score = 620 bits (1598), Expect = e-174, Method: Compositional matrix adjust.
Identities = 356/773 (46%), Positives = 462/773 (59%), Gaps = 83/773 (10%)
Query: 5 EDYARRFESGYVATASKDIAGEE-------QGQSNTNVMCRLCFVGENEGCERARRMLSC 57
ED ARRFESG A S + G+E QG + VMCRLCF GENEG +A +ML C
Sbjct: 99 EDLARRFESG--AYGSPEAEGDEDEWDREDQGNAAVKVMCRLCFSGENEGSTKAAKMLPC 156
Query: 58 KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCY 117
K C K+YHRNCLK+W ++RDLFHWSSW CPSCR CE+CRR GDPNK MFC+RCD YHCY
Sbjct: 157 KLCSKRYHRNCLKSWGEHRDLFHWSSWVCPSCRSCEVCRRPGDPNKLMFCKRCDDPYHCY 216
Query: 118 CQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
CQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPV
Sbjct: 217 CQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPV 276
Query: 178 CLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRD 237
CLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C CRGEC Q+RD
Sbjct: 277 CLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQFQADQNLQYTCAACRGECSQIRD 336
Query: 238 LEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSL 297
EDA+RELW+R+D+AD +L+ +LRAAA LP+ +++ + P SDDE+ G VLK+E +L
Sbjct: 337 TEDAIRELWKRRDVADHELMITLRAAAKLPSLEDVSPLYPNSDDEKLGAYVLKSESRNTL 396
Query: 298 KLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHS 352
K SLK K P E K NKK +KKG Q + + E HDV S
Sbjct: 397 KFSLKSNSSKPPPDTPEQEKVVFKSSGSNKKPSKKKGGQGNKTNDGHDEIFLERRHDVKS 456
Query: 353 YGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR 412
+ GD + ++ + +S + + S LK + V ++ D I +
Sbjct: 457 SNSRLGDQSIDGNHDMSPFKNDDNAYISSS----TRSSEKNLKSPSMKAV-TNNADMIPK 511
Query: 413 VKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDL 472
VK K SK L +D ++ SK+ T KA KLVI+LG+R + SP+S+ S+ QREQDL
Sbjct: 512 VKIKGSKVSSLHY-KDGEENTSKADTGKATKLVIHLGSRHKTRSGSPKSELSNYQREQDL 570
Query: 473 TTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVR 532
+ + G ++D +SQ +G + + +V+K R
Sbjct: 571 GSIH---------------------------GRKLDVTSQLKGSRSEVKERSVMKLVR-E 602
Query: 533 QEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRP 592
V N+ + ++ +H +GK RS A +S G+ +RP
Sbjct: 603 TGVQQRNSLLGDLGTSKKH-------ATGK----RSNALIS-----GMENGNETGTRNRP 646
Query: 593 NASRESN----DDTSVLQSLPKDSKPP-LRLKFRKPNLENQNSQVSQPEEEKSLI----- 642
A ++S+ D+ P + KP L+LKF++P+ E N+Q SQPEE S +
Sbjct: 647 FAQKQSHSSQVDENQGTADSPDNLKPSLLKLKFKRPHYEQLNTQASQPEEPTSWVSQQED 706
Query: 643 -----KGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKR 697
KGQRSKRKRPS EK + A+ + S E+MDANWIL+KLGKDAIGKR
Sbjct: 707 QFNVAKGQRSKRKRPS--MEKADGLDGTTPAKRHHQSTDDEVMDANWILRKLGKDAIGKR 764
Query: 698 VEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
+EVH SD WH+G+V++ + G TL I LD+ R + +ELGKQ +R + + K
Sbjct: 765 IEVHLTSDGKWHQGMVSNVMGG--TLCIQLDNGRSENVELGKQAIRLIASRSK 815
>gi|357151790|ref|XP_003575905.1| PREDICTED: uncharacterized protein LOC100821635 [Brachypodium
distachyon]
Length = 809
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 341/772 (44%), Positives = 455/772 (58%), Gaps = 87/772 (11%)
Query: 5 EDYARRFESGY---VATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCG 61
ED+ARR E Y A D + +E+G + VMCRLC GENEG +A +ML CK C
Sbjct: 95 EDFARRLEGSYGFPEAEGDDDDSDQEEGNAAIKVMCRLCISGENEGSSKAAKMLPCKLCN 154
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
KKYH+ C+K W ++RDLFHWSSW CPSCR CE+CRR GDPNK MFC+RCD AYHCYCQ P
Sbjct: 155 KKYHKKCVKYWGEHRDLFHWSSWVCPSCRSCEVCRRPGDPNKLMFCKRCDGAYHCYCQQP 214
Query: 122 PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 181
HKNVS GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPVCLKV
Sbjct: 215 SHKNVSHGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPVCLKV 274
Query: 182 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
YRDSE PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C +CRGEC Q+RD EDA
Sbjct: 275 YRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQFQADENLQYTCASCRGECSQIRDAEDA 334
Query: 242 VRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSL 301
VRELW+R+++ D DL+ SLRAAA LP+ +++ +P SDDE G V KN+ +LK S
Sbjct: 335 VRELWKRRNIVDHDLMVSLRAAAALPSLEDVSPSNPNSDDERLGAFVPKNDGRNTLKFSF 394
Query: 302 KGVVDKSPKKVKEHGKK-------WLNKKYPRKKGYQMPLNSKPEPDQSF-EGYHDVHSY 353
K K P + + G++ NKK+ +KKG Q + S +PD+ F E H+ SY
Sbjct: 395 KSNSSKPP--LDQSGQEKNVPKTSGSNKKHSKKKGNQGNI-SVGDPDEIFLEKRHEAKSY 451
Query: 354 GNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV 413
N G + N G + + + V ++S + ++ ++ D I +V
Sbjct: 452 SNLGGHTIEG--NHG-------QSTIKNDDSVFTLSAT-----RSSEKGAANNADMIPKV 497
Query: 414 KFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLT 473
K + SK L + + +KS K KLV + G R + + SP+S+ ++ +EQ+L
Sbjct: 498 KIRGSKAPSLHFKDVGEVNTAKSDAGKGTKLVFHFGTRHKSGSGSPKSEMTNSHKEQELG 557
Query: 474 TSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQ 533
+ +G ++D +SQ + K + +V+K R
Sbjct: 558 SLHG---------------------------GKIDVTSQFKSSKSEKKEKSVMKLVR-ET 589
Query: 534 EVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPN 593
V N+ + ++ +H ++GKR S A +S + E A G R +
Sbjct: 590 GVQQRNSLLGDLGTSKKH-------VTGKR----SNAIISGM-ENAGESGTRSRSFGHKQ 637
Query: 594 ASRESNDDTSVLQSLPKDSKPP------LRLKFRKPNLENQNSQVSQPEE---------E 638
+ D SLP ++ P L+LKF++P+ E ++QV+QPEE E
Sbjct: 638 SIPNQLTDNQATASLPVNNSPDSLKPSLLKLKFKRPHFEQPSAQVAQPEETATWASQQEE 697
Query: 639 KSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRV 698
++ KGQRSKRKRPS +K +E + + +Q S E MDA WIL+KLGKDAIGKR+
Sbjct: 698 LNVAKGQRSKRKRPS--MDKMDGSEGKTPGKRHQQSTGDEAMDATWILRKLGKDAIGKRI 755
Query: 699 EVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
E+ SD WH+GVV++ + G TL + LDD + LELGKQ VR V Q+ K
Sbjct: 756 EIQLPSDGKWHQGVVSNVLSG--TLCVQLDDGSSENLELGKQAVRLVAQRSK 805
>gi|115488844|ref|NP_001066909.1| Os12g0527800 [Oryza sativa Japonica Group]
gi|77556508|gb|ABA99304.1| PHD-finger family protein, expressed [Oryza sativa Japonica Group]
gi|113649416|dbj|BAF29928.1| Os12g0527800 [Oryza sativa Japonica Group]
gi|215717023|dbj|BAG95386.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 688
Score = 593 bits (1530), Expect = e-167, Method: Compositional matrix adjust.
Identities = 340/745 (45%), Positives = 445/745 (59%), Gaps = 91/745 (12%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
MCR+CF GENEG +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+C
Sbjct: 1 MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVC 60
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
RR GDPNK MFC+RCD AYHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 61 RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 120
Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 215
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 180
Query: 216 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 275
Q D NLQY C CRGEC Q+RD EDAVRELW+R+D+ D DL+ASLRAAA LP+ +++
Sbjct: 181 QSDQNLQYTCGACRGECSQIRDTEDAVRELWKRRDVVDHDLMASLRAAAALPSLEDVSPS 240
Query: 276 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGY 330
P SDDE+ G V+KN+ +LK S K K E K + NKK+ +KKG
Sbjct: 241 HPNSDDEKLGAYVMKNDGRNTLKFSFKSNSTKPALDSSEQEKNAIKSSGSNKKHSKKKGN 300
Query: 331 QMPLNSKPEPDQSFEGYHDVHSYGNSFGD-------DTQSPKNEGLDIPSSVAGIVSHTE 383
Q + + E ++ S G S GD D S KN D + V E
Sbjct: 301 QNNKTVSEQDEIFLEKRNETKSLG-SLGDQIADVTRDKSSFKN---DADAFVLSSAQSAE 356
Query: 384 GVCSISQPGILKHKYVDEVMVSDDDKISRVKFKTSKP---HDLDSGEDDGKHVSKSKTIK 440
+ H + D I +VK K +K H D GE++ +KS T K
Sbjct: 357 KALKLQSAKAAAH---------NADMIPKVKIKGTKVPSLHFKDVGEEN---AAKSDTGK 404
Query: 441 AKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSK 500
KLVI++G+R + + SP+S+ S+ Q+EQ+L + +G
Sbjct: 405 GTKLVIHIGSRHKSRSGSPKSEMSNSQKEQELVSMHG----------------------- 441
Query: 501 LGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLS 560
+VD +SQ + + + +V+K VR+ N+ + ++ +H +
Sbjct: 442 ----GKVDVTSQFKSSRSEIKEKSVMKL--VRETGVQQNSLLGDLGASKKH-------AT 488
Query: 561 GKRNIDRSRAAVSRVGEVAALRGDRK----QLESRPNASRESNDDTSVLQSLPKDSKPPL 616
GKR S A VS + E A+ G R Q +S + + + + + + P KP L
Sbjct: 489 GKR----SNAIVSAM-ENASESGTRSRSFGQKQSVNHLTENQGNASFSVNNSPDSLKPSL 543
Query: 617 -RLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 666
+LKF++P E ++Q SQPEE E ++ KGQRSKRKRPS +K +E +
Sbjct: 544 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 601
Query: 667 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 725
AA+ ++ S E MDANWIL+KLGKDAIGKR+EV SD WH+GVV++ + G TL +
Sbjct: 602 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 659
Query: 726 TLDDSRVKTLELGKQGVRFVPQKQK 750
LD+ R + +ELGK+ +R + Q+ K
Sbjct: 660 QLDNGRSENIELGKRAIRLIAQRSK 684
>gi|414878222|tpg|DAA55353.1| TPA: RING/FYVE/PHD-type zinc finger family protein [Zea mays]
Length = 818
Score = 593 bits (1528), Expect = e-166, Method: Compositional matrix adjust.
Identities = 350/777 (45%), Positives = 460/777 (59%), Gaps = 93/777 (11%)
Query: 5 EDYARRFESGYVAT-----ASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKS 59
ED ARRFESG + + E+QG + VMCRLCF GENEG +A +ML CK
Sbjct: 100 EDLARRFESGAYGSPEAGEDEDEWDREDQGNAAVKVMCRLCFSGENEGSTKAAKMLPCKL 159
Query: 60 CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQ 119
C K+YHRNCLK+W ++RDLFHWSSW CPSCR CE+CRR GDPNK MFC+RCD AYHCYCQ
Sbjct: 160 CSKRYHRNCLKSWGEHRDLFHWSSWVCPSCRSCEVCRRPGDPNKLMFCKRCDGAYHCYCQ 219
Query: 120 HPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCL 179
P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCP+CL
Sbjct: 220 QPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPICL 279
Query: 180 KVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLE 239
KVYRDSE PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C CRGEC Q+RD E
Sbjct: 280 KVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQFQADQNLQYTCAACRGECSQIRDTE 339
Query: 240 DAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKL 299
DA+RELW+R+D+AD +L+A+LRAAA LP+ +++ SDDE+ G LKNE +LK
Sbjct: 340 DAIRELWKRRDVADHELMATLRAAAALPSLEDVSPPYQNSDDEKLGAYALKNESRNTLKF 399
Query: 300 SLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNSKPEP----DQSF-EGYHDVHSYG 354
SLK K P E K KK + + D+ F E H V S
Sbjct: 400 SLKSNSSKPPPDTPEQEKIVFKSSGSNKKPSKKKSGQANKTVDGHDEIFLERRHAVKSSN 459
Query: 355 NSFGDDT------QSP--KNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSD 406
+ GD T +SP ++ + + SS + + + P + + + ++
Sbjct: 460 SCLGDQTINENHDRSPFKNDDNVYVSSSTRSLEKN------LKSPSM-------KAVANN 506
Query: 407 DDKISRVKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSC 466
D I +VK K SK L +D ++ K+ T KA KLVI+LG+R + SP+S+ S+
Sbjct: 507 ADMIPKVKIKGSKVSSLHY-KDGEENTPKNDTGKATKLVIHLGSRHKTRSGSPKSELSNS 565
Query: 467 QREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVI 526
QREQDL + +G ++D +SQ + + + +V+
Sbjct: 566 QREQDLGSIHG---------------------------GKIDVTSQLKSSRNEVKERSVM 598
Query: 527 KFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRK 586
K R V N+ + ++ +H +GKR S A +S + E A G R
Sbjct: 599 KLVR-DTGVQQRNSLLGDLGTSKKHA-------TGKR----SNALISGM-ENANETGTRN 645
Query: 587 Q--LESRPNASRESNDDTSVLQSLPKDSKPPL-RLKFRKPNLENQNSQVSQPEEEKSLI- 642
+ + + ++S+ N T+ P KP L +LKF++P+ E N+Q SQPEE S +
Sbjct: 646 RSFAQKQSHSSQVENHGTA---DSPDSLKPSLLKLKFKRPHFEQLNTQASQPEEPTSWVS 702
Query: 643 ---------KGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDA 693
KGQRSKRKRPS EK + A+ +Q S E+MDANWIL+KLGKDA
Sbjct: 703 QQEEQLNVAKGQRSKRKRPS--MEKADGLDGITPAKRHQQS-TDEVMDANWILRKLGKDA 759
Query: 694 IGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
IGKR+EVH SD WH+G+V++ + G TL I LD+ R + +ELGKQ +R + + K
Sbjct: 760 IGKRIEVHLTSDGKWHQGMVSNVMGG--TLCIRLDNGRSENVELGKQAIRLIASRSK 814
>gi|222617191|gb|EEE53323.1| hypothetical protein OsJ_36320 [Oryza sativa Japonica Group]
Length = 756
Score = 593 bits (1528), Expect = e-166, Method: Compositional matrix adjust.
Identities = 340/744 (45%), Positives = 444/744 (59%), Gaps = 91/744 (12%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
MCR+CF GENEG +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+C
Sbjct: 1 MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVC 60
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
RR GDPNK MFC+RCD AYHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 61 RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 120
Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 215
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 180
Query: 216 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 275
Q D NLQY C CRGEC Q+RD EDAVRELW+R+D+ D DL+ASLRAAA LP+ +++
Sbjct: 181 QSDQNLQYTCGACRGECSQIRDTEDAVRELWKRRDVVDHDLMASLRAAAALPSLEDVSPS 240
Query: 276 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGY 330
P SDDE+ G V+KN+ +LK S K K E K + NKK+ +KKG
Sbjct: 241 HPNSDDEKLGAYVMKNDGRNTLKFSFKSNSTKPALDSSEQEKNAIKSSGSNKKHSKKKGN 300
Query: 331 QMPLNSKPEPDQSFEGYHDVHSYGNSFGD-------DTQSPKNEGLDIPSSVAGIVSHTE 383
Q + + E ++ S G S GD D S KN D + V E
Sbjct: 301 QNNKTVSEQDEIFLEKRNETKSLG-SLGDQIADVTRDKSSFKN---DADAFVLSSAQSAE 356
Query: 384 GVCSISQPGILKHKYVDEVMVSDDDKISRVKFKTSKP---HDLDSGEDDGKHVSKSKTIK 440
+ H + D I +VK K +K H D GE++ +KS T K
Sbjct: 357 KALKLQSAKAAAH---------NADMIPKVKIKGTKVPSLHFKDVGEEN---AAKSDTGK 404
Query: 441 AKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSK 500
KLVI++G+R + + SP+S+ S+ Q+EQ+L + +G
Sbjct: 405 GTKLVIHIGSRHKSRSGSPKSEMSNSQKEQELVSMHG----------------------- 441
Query: 501 LGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLS 560
+VD +SQ + + + +V+K VR+ N+ + ++ +H +
Sbjct: 442 ----GKVDVTSQFKSSRSEIKEKSVMKL--VRETGVQQNSLLGDLGASKKHA-------T 488
Query: 561 GKRNIDRSRAAVSRVGEVAALRGDRK----QLESRPNASRESNDDTSVLQSLPKDSKPPL 616
GKR S A VS + E A+ G R Q +S + + + + + + P KP L
Sbjct: 489 GKR----SNAIVSAM-ENASESGTRSRSFGQKQSVNHLTENQGNASFSVNNSPDSLKPSL 543
Query: 617 -RLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 666
+LKF++P E ++Q SQPEE E ++ KGQRSKRKRPS +K +E +
Sbjct: 544 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 601
Query: 667 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 725
AA+ ++ S E MDANWIL+KLGKDAIGKR+EV SD WH+GVV++ + G TL +
Sbjct: 602 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 659
Query: 726 TLDDSRVKTLELGKQGVRFVPQKQ 749
LD+ R + +ELGK+ +R + Q Q
Sbjct: 660 QLDNGRSENIELGKRAIRLIAQSQ 683
>gi|326519042|dbj|BAJ92681.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 816
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 340/774 (43%), Positives = 440/774 (56%), Gaps = 83/774 (10%)
Query: 5 EDYARRFESGYVATASK----DIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSC 60
ED+ARR E Y + ++ D E+QG + VMCRLCF GENEG +A +ML CK C
Sbjct: 94 EDFARRLEGAYGSPEAEGDEDDSDREDQGNAAVKVMCRLCFSGENEGSSKAAKMLPCKLC 153
Query: 61 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
KKYH+ C+KNW ++RDLFHWSSW C SCR CE+CRR GDPNK MFC+RCD AYHCYCQ
Sbjct: 154 NKKYHKKCVKNWGEHRDLFHWSSWICSSCRSCEVCRRPGDPNKLMFCKRCDGAYHCYCQQ 213
Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLK 180
P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPVCLK
Sbjct: 214 PSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPVCLK 273
Query: 181 VYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLED 240
VYRDSE PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C +CRGEC Q+RD ED
Sbjct: 274 VYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQFQADQNLQYTCASCRGECSQIRDAED 333
Query: 241 AVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLS 300
AVRELW+R+++ D DL+ SLRAAAGLP+ +++ S P SDDE G +VLKN+ +LK S
Sbjct: 334 AVRELWKRRNVVDHDLMISLRAAAGLPSLEDV-SPCPNSDDERLGALVLKNDGRNTLKFS 392
Query: 301 LKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPL---------NSKPEPDQSF-EGYHDV 350
LK K P E K P+ G S +P++ F E H+
Sbjct: 393 LKSNSSKPPLDQCEQ-----EKNVPKNSGTNKKHSKKKSSQGNKSVADPNEIFLERRHEA 447
Query: 351 HSYGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKI 410
S + GD T ++ ++ V + + S LK V + ++ + I
Sbjct: 448 KSMSSHLGDHTVDVNHDRNSFKNNENVFVLPS----TRSSEKDLKSTSV-KATTNNANTI 502
Query: 411 SRVKFKTSKP---HDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQ 467
+VK K SK H D GE++ + T K KLVI+LG R + + SP+S+ S+
Sbjct: 503 PKVKIKGSKVPSLHFKDIGEENN---ANGDTGKGTKLVIHLGTRHKSKSGSPKSEMSNSH 559
Query: 468 REQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIK 527
+EQ+L +++G + S + KL V SS L + R
Sbjct: 560 KEQELGSTHGGKTDVTSLFKSSKSSKKEKSVMKLVGETGVQQSSLLGDLGTSKRHA---- 615
Query: 528 FGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQ 587
+ SSA ++SG N + S G
Sbjct: 616 --------------TGKRSSA---------LISGMENANESGTRSRSFG----------H 642
Query: 588 LESRPNASRESNDDTSVLQSLPKDSKPP--LRLKFRKPNLENQNSQVSQPEEEKS----- 640
+S P+ ES S + DS P L+LKF++P+LE + QVSQ EE +
Sbjct: 643 KQSIPSQLTESQGTASFAVNNSPDSLKPSLLKLKFKRPHLEQPSLQVSQTEEPATWASQQ 702
Query: 641 ----LIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGK 696
+ KGQRSKRKRPS T+K +E ++ + S E MDA WIL+KLG DAIGK
Sbjct: 703 EDLNVAKGQRSKRKRPS--TDKMDGSEGSTPSKRHGQSTGDEAMDATWILRKLGNDAIGK 760
Query: 697 RVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
R+E+ SD WH+GVV++ + G L + LD+ + LELG Q VR + Q+ K
Sbjct: 761 RIEIQLASDGKWHQGVVSNVISG--MLCVQLDNGSSENLELGNQAVRLIAQRLK 812
>gi|224055146|ref|XP_002298424.1| predicted protein [Populus trichocarpa]
gi|222845682|gb|EEE83229.1| predicted protein [Populus trichocarpa]
Length = 797
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 315/535 (58%), Positives = 377/535 (70%), Gaps = 53/535 (9%)
Query: 190 MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRK 249
+ D C V SDEKYLQFQVDGNLQY+C TCRGECYQV+DLEDAV+ELWRR+
Sbjct: 236 IFASDGCTVIVMASEIFCSDEKYLQFQVDGNLQYQCSTCRGECYQVKDLEDAVQELWRRR 295
Query: 250 DMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSP 309
D AD+ LIASLRAAAGLP +++IFSI+PYSDD+ENGP +N+FGRS+KLSLKG+V+KSP
Sbjct: 296 DKADRGLIASLRAAAGLPAQEDIFSITPYSDDDENGPAAPRNDFGRSIKLSLKGLVEKSP 355
Query: 310 KKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGD----DTQSPK 365
KK K+HGKK LNKKYP++KG SK E Q H+ HSY + GD DT+S
Sbjct: 356 KKSKDHGKKHLNKKYPKRKGPHAASFSKTESYQ-----HESHSYEHDSGDEKNNDTESQA 410
Query: 366 NEGLDIPSS-VAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPHDL 423
GL SS VAGIV+HTEG+CSI+QPG LKHK+V+EVMVSD ++ S+ VK K++KP DL
Sbjct: 411 KGGLGRCSSPVAGIVNHTEGICSINQPGALKHKFVEEVMVSDGERTSKIVKIKSNKPRDL 470
Query: 424 DSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSL 483
DSG DD + SKSK++KAKKLVINLGARKINV++SP+SDA SCQREQDL SN
Sbjct: 471 DSG-DDAEKPSKSKSVKAKKLVINLGARKINVSSSPKSDAQSCQREQDLKASN------- 522
Query: 484 QRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGL-KIAGRGGNVIKFGRVRQEVSDSNTKV 542
GD VDHS Q RGL K A R GN IKFG+V+ E S N K
Sbjct: 523 --------------------GDGVDHSEQKRGLIKFARREGNFIKFGKVKAEASSLNLKS 562
Query: 543 SRGSSADEHEP---EHMHVLSGKRNIDRSRAAVSRVGEVAALRGDR----KQLESRPNAS 595
G+ D +E +H V S KR+++ SRAAV GEV LR DR KQ E+R +
Sbjct: 563 DGGNHFDAYETTPLDHARVTSSKRSLEGSRAAVGPAGEVPMLRNDRVSLGKQSEARLDTH 622
Query: 596 RESND---DTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRP 652
ESND DT +L SLPKDSK L+LK +KPNLENQ+SQ+ EEEKS +GQRSKRKR
Sbjct: 623 TESNDDSGDTPILHSLPKDSKLSLKLKIKKPNLENQSSQILLHEEEKSNTRGQRSKRKRA 682
Query: 653 SPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNS 707
S F +KT++NEDED ++S+ D SE+M+ANWILKKLGKDAIGKRVEVHQ SDNS
Sbjct: 683 STFMDKTMYNEDEDMSESHLD---SEMMEANWILKKLGKDAIGKRVEVHQPSDNS 734
>gi|359487302|ref|XP_002274438.2| PREDICTED: uncharacterized protein LOC100249974 [Vitis vinifera]
Length = 730
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 232/292 (79%), Positives = 251/292 (85%), Gaps = 9/292 (3%)
Query: 2 VAAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCG 61
+ AEDYARRFESG + SKDI GEEQ QSN NVMCR+CF GE EG ERAR+ML C SCG
Sbjct: 109 MVAEDYARRFESGDLVDTSKDIVGEEQSQSNVNVMCRICFFGEMEGSERARKMLPCNSCG 168
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
KKYHR CLK+W+QNRDLFHWSSW CPSCRICE+CRR+GDPNKFMFCRRCD AYHCYCQ P
Sbjct: 169 KKYHRLCLKSWSQNRDLFHWSSWTCPSCRICEVCRRSGDPNKFMFCRRCDDAYHCYCQQP 228
Query: 122 PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 181
PHKNVSSGPYLCPKHT+CHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV
Sbjct: 229 PHKNVSSGPYLCPKHTRCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKV 288
Query: 182 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQY+C TCRGECYQV+DLEDA
Sbjct: 289 YRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYKCATCRGECYQVKDLEDA 348
Query: 242 VRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEF 293
V+ELWRR+D AD+ + +S A L + I SI N P VLK++F
Sbjct: 349 VQELWRRRDKADRGVFSS-PVAGSLSHTEGICSI--------NQPGVLKHKF 391
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 228/392 (58%), Positives = 271/392 (69%), Gaps = 41/392 (10%)
Query: 373 SSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV-KFKTSKPHDLDSGEDDGK 431
S VAG +SHTEG+CSI+QPG+LKHK+VDE+ V+++D+ SRV + K++KPH D GED GK
Sbjct: 366 SPVAGSLSHTEGICSINQPGVLKHKFVDEIAVNNEDRTSRVIQIKSNKPHGSDVGEDTGK 425
Query: 432 HVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFV 491
SKSKT+K KLVI+LGAR NVTNSPRSDASSCQREQDLTTSNG
Sbjct: 426 QASKSKTMKGTKLVIHLGARNRNVTNSPRSDASSCQREQDLTTSNG-------------- 471
Query: 492 LDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEH 551
D++D+S Q++G K GR GN+IK G+VR E S+ N K RG+ D
Sbjct: 472 -------------DKIDYSGQAKGSKHGGREGNLIKLGKVRTEPSEMNPKFGRGNKDDGV 518
Query: 552 E---PEHMHVLSGKRNIDRSRAAVSRVGEVAALRGD----RKQLESRPNASRESNDDTS- 603
E PE+ VL GKR+I+ S V EV+ RG+ RK ESR N E NDD S
Sbjct: 519 EAIPPENTRVLLGKRSIEGSTNVAGAVTEVS--RGEKVFSRKHPESRLNMYGEGNDDNSS 576
Query: 604 ---VLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTL 660
V SLPKDSKP L+LKF+ P+ ENQ+S E+EKS +KGQRSKRKRPSPF EKT
Sbjct: 577 TPSVSHSLPKDSKPLLKLKFKNPSFENQSSWGLPGEDEKSAVKGQRSKRKRPSPFMEKTS 636
Query: 661 FNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGT 720
F EDED +Q +QD M +IMDANWILKKLGKDAIGKRVEVHQ SDNSWHKG+V D +EGT
Sbjct: 637 FKEDEDGSQFHQDDSMDQIMDANWILKKLGKDAIGKRVEVHQSSDNSWHKGMVIDFIEGT 696
Query: 721 STLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
STL + DD R KTLELGKQ +R + QKQKRS
Sbjct: 697 STLIVKFDDGRAKTLELGKQAIRLISQKQKRS 728
>gi|255553540|ref|XP_002517811.1| protein binding protein, putative [Ricinus communis]
gi|223543083|gb|EEF44618.1| protein binding protein, putative [Ricinus communis]
Length = 734
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 224/267 (83%), Positives = 245/267 (91%), Gaps = 2/267 (0%)
Query: 5 EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKY 64
EDYARRFESG +A A+KD+AG EQG SN NVMCR+CF+GE EG ERARRMLSCKSCGKKY
Sbjct: 126 EDYARRFESGDMAVATKDVAGHEQGLSNANVMCRMCFLGEAEGSERARRMLSCKSCGKKY 185
Query: 65 HRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHK 124
HR+CLK+WAQ+RDLFHWSSW CPSCRICEICRRTGDPNKFMFC+RCD AYHCYCQHPPHK
Sbjct: 186 HRSCLKSWAQHRDLFHWSSWTCPSCRICEICRRTGDPNKFMFCKRCDGAYHCYCQHPPHK 245
Query: 125 NVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD 184
NVSSGPYLCPKHT+CHSCGS+VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD
Sbjct: 246 NVSSGPYLCPKHTRCHSCGSSVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD 305
Query: 185 SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRE 244
SESTPMVCCD+CQRWVHC CDGISDEKYLQFQVDGNLQY+C TCRGECYQV+D EDAV+E
Sbjct: 306 SESTPMVCCDICQRWVHCSCDGISDEKYLQFQVDGNLQYKCATCRGECYQVKDHEDAVQE 365
Query: 245 LWRRKDMADKDLIASLRAAAGLPTEDE 271
LWRR+D AD+ + +S + AG+ E
Sbjct: 366 LWRRRDEADRGVYSS--SIAGVVNHAE 390
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 234/385 (60%), Positives = 273/385 (70%), Gaps = 38/385 (9%)
Query: 373 SSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGK 431
SS+AG+V+H EG CS++Q G+LKHKYVDEVMVSD ++ SR V+ K KPHDLDSG+D K
Sbjct: 380 SSIAGVVNHAEGNCSVNQTGVLKHKYVDEVMVSDGERTSRIVRLKNKKPHDLDSGDDAEK 439
Query: 432 HVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFV 491
H K K++KAKKLVINLGARKINVTNS RSDASSCQR+QD+TT NG
Sbjct: 440 HAIKFKSVKAKKLVINLGARKINVTNSHRSDASSCQRDQDMTTPNG-------------- 485
Query: 492 LDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEH 551
D VDHS Q R LK R GN IKFG+V+ E S+ N K GS AD
Sbjct: 486 -------------DTVDHSVQIRSLKFPRREGNFIKFGKVKNETSNLNPKFQTGSDADGE 532
Query: 552 EPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDR----KQLESRPNASRESNDDTSVLQS 607
+ + V S KR+ID AV V EV LR D+ KQLE R ESNDD+ S
Sbjct: 533 K--MVSVSSSKRSIDGCGTAVGPVDEVPTLRSDKVSIGKQLEVRSETHAESNDDSGD-AS 589
Query: 608 LPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDA 667
LPKDSK L+LK + PNL NQ S+ PEEEKS I+GQRSKRKRPS F +K+LFNE+ED
Sbjct: 590 LPKDSKISLKLKIKNPNLLNQYSRKPPPEEEKSSIRGQRSKRKRPSSFMDKSLFNENEDI 649
Query: 668 AQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITL 727
Q++QDS E+++A+WILKKLGKDAIGKRVEVHQ SDNSWHKGVV+DTVEGTS +S+TL
Sbjct: 650 TQAHQDS---EMLEASWILKKLGKDAIGKRVEVHQPSDNSWHKGVVSDTVEGTSMISVTL 706
Query: 728 DDSRVKTLELGKQGVRFVPQKQKRS 752
DDSRVKTL+LGKQ VRFVPQKQKRS
Sbjct: 707 DDSRVKTLQLGKQAVRFVPQKQKRS 731
>gi|297816482|ref|XP_002876124.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297321962|gb|EFH52383.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 678
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 193/260 (74%), Positives = 223/260 (85%), Gaps = 1/260 (0%)
Query: 5 EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKY 64
EDYA SG + + K++ GE+ ++++ C +C++ E ERA+ MLSCK CGKKY
Sbjct: 112 EDYAVNDGSGDLEVSVKELNGEDHDHHSSSITCHMCYLVEVGKSERAK-MLSCKCCGKKY 170
Query: 65 HRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHK 124
HRNCLK+WAQ+RDLF+WSSW CPSCRICE C GDP KFMFC+RCD AYHC CQ P HK
Sbjct: 171 HRNCLKSWAQHRDLFNWSSWACPSCRICEGCGTLGDPKKFMFCKRCDDAYHCDCQQPRHK 230
Query: 125 NVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRD 184
NVSSGPYLCPKHTKC+SCGS VPGNG S+RWFLG+TCCDACGRLFVKGNYCPVCLKVYRD
Sbjct: 231 NVSSGPYLCPKHTKCYSCGSTVPGNGQSLRWFLGHTCCDACGRLFVKGNYCPVCLKVYRD 290
Query: 185 SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRE 244
SE+TPMVCCD CQRWVHC CDGISDEKY+QFQVDGNLQY+C TCRGECYQV+DLEDAV+E
Sbjct: 291 SEATPMVCCDFCQRWVHCHCDGISDEKYMQFQVDGNLQYKCSTCRGECYQVKDLEDAVQE 350
Query: 245 LWRRKDMADKDLIASLRAAA 264
+W+RKD+ADKDLIASL+A+A
Sbjct: 351 IWKRKDIADKDLIASLKASA 370
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 98/225 (43%), Positives = 133/225 (59%), Gaps = 9/225 (4%)
Query: 531 VRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLES 590
V + S SN K + S + E L GK N D R + GEV + + +
Sbjct: 458 VSKLTSKSNGKQEKLQSEETFSREQHRSLLGKNN-DEKRGSR---GEVTTSKAEGGFIGR 513
Query: 591 RPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEKSLIKGQRSKR 649
+ + N + S+ KDS+ L+L+ +K N E+Q + S E KG RSKR
Sbjct: 514 HSDGKGDLNSGSH--DSMQKDSRRLLKLRIKKHNPESQEGETPSIVYERGKSGKGHRSKR 571
Query: 650 KRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWH 709
KR SP EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW
Sbjct: 572 KRASPPAEKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWE 631
Query: 710 K--GVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
K GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 632 KGVVSEVGGGGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 676
>gi|218186977|gb|EEC69404.1| hypothetical protein OsI_38556 [Oryza sativa Indica Group]
Length = 625
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 208/358 (58%), Positives = 243/358 (67%), Gaps = 35/358 (9%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI- 94
MCR+CF GENEG +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+
Sbjct: 1 MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVL 60
Query: 95 ----------------------------CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
CRR GDPNK MFC+RCD AYHCYCQ P HKNV
Sbjct: 61 LDWSLGFDVNLAKTLVCGVTGPTSGSSVCRRPGDPNKLMFCKRCDGAYHCYCQQPSHKNV 120
Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
+ GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE
Sbjct: 121 THGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 180
Query: 187 STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELW 246
PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C CRGEC Q+RD EDAVRELW
Sbjct: 181 VIPMVCCDVCEKWVHIECDGISEEKYQQFQSDQNLQYTCGACRGECSQIRDTEDAVRELW 240
Query: 247 RRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVD 306
+R+D+ D DL+ASLRAAA LP+ +++ P SDDE+ G V+KN+ +LK S K
Sbjct: 241 KRRDVVDHDLMASLRAAAALPSLEDVSPSHPNSDDEKLGAYVMKNDGRNTLKFSFKSNST 300
Query: 307 KSPKKVKEHGKKWL-----NKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGD 359
K E K + NKK+ +KKG Q + + E ++ S G S GD
Sbjct: 301 KPALDSSEQEKNAIKSSGSNKKHSKKKGNQNNKTVSEQDEIFLEKRNETKSLG-SLGD 357
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 69/144 (47%), Positives = 95/144 (65%), Gaps = 14/144 (9%)
Query: 616 LRLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 666
L+LKF++P E ++Q SQPEE E ++ KGQRSKRKRPS +K +E +
Sbjct: 449 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 506
Query: 667 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 725
AA+ ++ S E MDANWIL+KLGKDAIGKR+EV SD WH+GVV++ + G TL +
Sbjct: 507 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 564
Query: 726 TLDDSRVKTLELGKQGVRFVPQKQ 749
LD+ R + +ELGK+ +R + Q Q
Sbjct: 565 QLDNGRSENIELGKRAIRLIAQSQ 588
>gi|42565848|ref|NP_190778.2| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332645370|gb|AEE78891.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 696
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 187/244 (76%), Positives = 213/244 (87%), Gaps = 1/244 (0%)
Query: 21 KDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFH 80
KD+ E+ + ++ C +C++ E ERA+ MLSCK CGKKYHRNC+K+WAQ+RDLF+
Sbjct: 146 KDLNEEDHDHHSASITCHMCYLVEVGKSERAK-MLSCKCCGKKYHRNCVKSWAQHRDLFN 204
Query: 81 WSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCH 140
WSSW CPSCRICE C GDP KFMFC+RCD AYHC CQHP HKNVSSGPYLCPKHTKC+
Sbjct: 205 WSSWACPSCRICEGCGTLGDPKKFMFCKRCDDAYHCDCQHPRHKNVSSGPYLCPKHTKCY 264
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWV 200
SC S VPGNG S+RWFLG+TCCDACGRLFVKGNYCPVCLKVYRDSE+TPMVCCD CQRWV
Sbjct: 265 SCESTVPGNGQSLRWFLGHTCCDACGRLFVKGNYCPVCLKVYRDSEATPMVCCDFCQRWV 324
Query: 201 HCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASL 260
HCQCDGISDEKY+QFQVDGNLQY+C TCRGE YQV+DLEDAV+E+W+RKDMADKDLIASL
Sbjct: 325 HCQCDGISDEKYMQFQVDGNLQYKCSTCRGESYQVKDLEDAVQEIWKRKDMADKDLIASL 384
Query: 261 RAAA 264
+A+A
Sbjct: 385 KASA 388
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 103/225 (45%), Positives = 139/225 (61%), Gaps = 9/225 (4%)
Query: 531 VRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLES 590
V + S SN K + + + E L GK N D R + GEV L+ + +
Sbjct: 476 VSKTASKSNGKQEKLQAEETFSREERRSLLGK-NSDEKRGSR---GEVTTLKAEGGFIGR 531
Query: 591 RPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLI-KGQRSKR 649
+ + N + S KDS+ L+LK +K N E Q S+ E+S KG RSKR
Sbjct: 532 HSDGKGDLNSGSH--DSSQKDSRRLLKLKIKKHNPEGQESEAPSIVYERSKSGKGHRSKR 589
Query: 650 KRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWH 709
KR SP EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW
Sbjct: 590 KRASPPAEKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWE 649
Query: 710 KGVVTDT--VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
KGVV++ GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 650 KGVVSEVGGAGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 694
>gi|168045006|ref|XP_001774970.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673717|gb|EDQ60236.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 686
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 162/285 (56%), Positives = 207/285 (72%), Gaps = 3/285 (1%)
Query: 2 VAAEDYARRFESGYVATASKDIAGEEQGQSNT---NVMCRLCFVGENEGCERARRMLSCK 58
+A +A R +G D E+G + V+C LC GE G ++A RMLSC+
Sbjct: 135 ASANGWADRERAGMPVRGGSDEEFNEEGHFESLFPQVVCGLCGCGEAIGSDKAGRMLSCQ 194
Query: 59 SCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
+C K+YHR C K WA++RDLF+W+SW C SCR+CE+C R+GD NK MFC+RCD AYH C
Sbjct: 195 ACRKQYHRKCTKYWAEHRDLFNWASWMCGSCRVCEVCLRSGDSNKLMFCKRCDHAYHSSC 254
Query: 119 QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVC 178
HPP K+V GP++CPKH +C SC + VPG G+S +WFL Y+ CDACGRLF +G YCP+C
Sbjct: 255 LHPPLKHVPKGPFVCPKHVRCTSCNTTVPGGGVSSKWFLSYSLCDACGRLFTRGKYCPIC 314
Query: 179 LKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDL 238
LKVYRDSE PMVCCDVC+ WVHC+CDGISDEKY +FQV+ L+Y+C +CRGECY+V DL
Sbjct: 315 LKVYRDSEPAPMVCCDVCEHWVHCECDGISDEKYQEFQVNSQLRYKCASCRGECYKVADL 374
Query: 239 EDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEE 283
+DA E+WRRKD+ D IA +RAAAGLP+ +EI P SD+E+
Sbjct: 375 DDAAVEIWRRKDIRDATQIAEIRAAAGLPSPEEILKAYPSSDEED 419
>gi|4678939|emb|CAB41330.1| putative protein [Arabidopsis thaliana]
Length = 763
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 175/273 (64%), Positives = 205/273 (75%), Gaps = 12/273 (4%)
Query: 3 AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGK 62
A E YA SG + + KD+ E+ + ++ C +C++ E ERA+ MLSCK CGK
Sbjct: 110 AVEVYAANDCSGDLEVSVKDLNEEDHDHHSASITCHMCYLVEVGKSERAK-MLSCKCCGK 168
Query: 63 KYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
KYHRNC+K+WAQ+RDLF+WSSW CPSCRICE C GDP KFMFC+RCD AYHC CQHP
Sbjct: 169 KYHRNCVKSWAQHRDLFNWSSWACPSCRICEGCGTLGDPKKFMFCKRCDDAYHCDCQHPR 228
Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF------LGYTCCDACGRL---FVKGN 173
HKNVSSGPYLCPKHTKC+SC S VPGNG S+R+ L C G L V+G
Sbjct: 229 HKNVSSGPYLCPKHTKCYSCESTVPGNGQSLRYLTFCLVILEIYSCGFWGILVVMLVEGC 288
Query: 174 YC--PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
+ L VYRDSE+TPMVCCD CQRWVHCQCDGISDEKY+QFQVDGNLQY+C TCRGE
Sbjct: 289 LLRGIIVLYVYRDSEATPMVCCDFCQRWVHCQCDGISDEKYMQFQVDGNLQYKCSTCRGE 348
Query: 232 CYQVRDLEDAVRELWRRKDMADKDLIASLRAAA 264
YQV+DLEDAV+E+W+RKDMADKDLIASL+A+A
Sbjct: 349 SYQVKDLEDAVQEIWKRKDMADKDLIASLKASA 381
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 103/225 (45%), Positives = 139/225 (61%), Gaps = 9/225 (4%)
Query: 531 VRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLES 590
V + S SN K + + + E L GK N D R + GEV L+ + +
Sbjct: 469 VSKTASKSNGKQEKLQAEETFSREERRSLLGK-NSDEKRGSR---GEVTTLKAEGGFIGR 524
Query: 591 RPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLI-KGQRSKR 649
+ + N + S KDS+ L+LK +K N E Q S+ E+S KG RSKR
Sbjct: 525 HSDGKGDLNSGSH--DSSQKDSRRLLKLKIKKHNPEGQESEAPSIVYERSKSGKGHRSKR 582
Query: 650 KRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWH 709
KR SP EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW
Sbjct: 583 KRASPPAEKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWE 642
Query: 710 KGVVTDT--VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
KGVV++ GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 643 KGVVSEVGGAGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 687
>gi|302813786|ref|XP_002988578.1| hypothetical protein SELMODRAFT_447369 [Selaginella moellendorffii]
gi|300143685|gb|EFJ10374.1| hypothetical protein SELMODRAFT_447369 [Selaginella moellendorffii]
Length = 774
Score = 338 bits (868), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 154/287 (53%), Positives = 201/287 (70%), Gaps = 9/287 (3%)
Query: 2 VAAEDYARRFES-----GYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLS 56
V AED +++ G V+T S+ EE + C LC + E E ++ RML+
Sbjct: 144 VPAEDNQPDYQTPDDGGGEVSTPSQP---EESVAPPEEIFCGLCQLAEAES-KKQERMLT 199
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C+ C +++HR CLK+WA NRDLF+W+SW+C CR CE C+ TGDPN+ +FC+RCD A+H
Sbjct: 200 CQGCDRRFHRKCLKDWAGNRDLFNWASWRCLHCRTCEDCKVTGDPNRLLFCKRCDEAHHN 259
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
C+ K + GP+LCPKH++CHSCG+ VPG G S RWF Y CDACGRLFVK YCP
Sbjct: 260 NCKQSGAKAPAKGPFLCPKHSQCHSCGTRVPGGGSSSRWFHSYLFCDACGRLFVKDKYCP 319
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVR 236
+C+KVYR+SE TPMV CD C+ WVHC C+GISDEKY +FQ NL++ C CRGEC+Q
Sbjct: 320 ICMKVYRESEPTPMVLCDGCEHWVHCVCEGISDEKYQEFQTIQNLRFTCAACRGECFQAT 379
Query: 237 DLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEE 283
+E+AV ELW+RKD AD+D I SLRA+AGLP+E E+ + P SDDE+
Sbjct: 380 SVEEAVVELWKRKDEADRDQIKSLRASAGLPSESEMARLCPSSDDEQ 426
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 94/182 (51%), Gaps = 23/182 (12%)
Query: 572 VSRVGEVAALRGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQ 631
V +V A RG + + + R +D+ P+ + L+LK +KP+ ++
Sbjct: 440 VFKVNSSAKARGKSSEEAADSSKKRSRSDNVEP----PETERKTLKLKIKKPS----GTE 491
Query: 632 VSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGK 691
V E + +GQRSKRKRP+ E+ E DA +S++D IL +LG
Sbjct: 492 VV---EASNTARGQRSKRKRPASSQEE----EVADAVESDEDDTS--------ILHRLGS 536
Query: 692 DAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKR 751
DA+ KRVEV + SD +W KG +T + S ++ D+ KTL+ GK+ VR + ++++
Sbjct: 537 DAVTKRVEVCRSSDKTWLKGTITHVQQRRSQFTVNFDNGDKKTLKYGKEKVRLLGKRERY 596
Query: 752 SM 753
++
Sbjct: 597 AI 598
>gi|302795017|ref|XP_002979272.1| hypothetical protein SELMODRAFT_444121 [Selaginella moellendorffii]
gi|300153040|gb|EFJ19680.1| hypothetical protein SELMODRAFT_444121 [Selaginella moellendorffii]
Length = 764
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 154/287 (53%), Positives = 200/287 (69%), Gaps = 9/287 (3%)
Query: 2 VAAEDYARRFES-----GYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLS 56
V AED +++ G V+T S+ EE + C LC E E ++ RML+
Sbjct: 144 VPAEDNQPDYQTPDDGGGEVSTPSQP---EESVAPPEEIFCGLCQQAEAES-KKQERMLT 199
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C+ C +++HR CLK+WA NRDLF+W+SW+C CR CE C+ TGDPN+ +FC+RCD A+H
Sbjct: 200 CQGCDRRFHRKCLKDWAGNRDLFNWASWRCLHCRTCEDCKVTGDPNRLLFCKRCDEAHHN 259
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
C+ K + GP+LCPKH++CHSCG+ VPG G S RWF Y CDACGRLFVK YCP
Sbjct: 260 NCKQSGAKAPAKGPFLCPKHSQCHSCGTRVPGGGSSSRWFHSYLFCDACGRLFVKDKYCP 319
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVR 236
+C+KVYR+SE TPMV CD C+ WVHC C+GISDEKY +FQ NL++ C CRGEC+Q
Sbjct: 320 ICMKVYRESEPTPMVLCDGCEHWVHCVCEGISDEKYQEFQTIQNLRFTCAACRGECFQAT 379
Query: 237 DLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEE 283
+E+AV ELW+RKD AD+D I SLRA+AGLP+E E+ + P SDDE+
Sbjct: 380 SVEEAVVELWKRKDEADRDQIKSLRASAGLPSESEMARLCPSSDDEQ 426
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 87/162 (53%), Gaps = 25/162 (15%)
Query: 595 SRESNDDTSVLQSLPKDSKPP------LRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSK 648
S E D+S +S + +PP L+LK +KP+ ++V E + +GQRSK
Sbjct: 453 SSEEAADSSKKRSRSDNVEPPETERKTLKLKIKKPS----GTEVV---EASNTARGQRSK 505
Query: 649 RKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSW 708
RKRP+ E+ E DA +S++D IL +LG DA+ KRVEV + SD +W
Sbjct: 506 RKRPASSQEE----EVADAVESDEDDTS--------ILHRLGSDAVTKRVEVCRSSDKTW 553
Query: 709 HKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
KG +T + S ++ D+ KTL+ GK+ VR + ++++
Sbjct: 554 LKGTITHVQQRRSQFTVNFDNGDKKTLKYGKEKVRLLGKRER 595
>gi|242085692|ref|XP_002443271.1| hypothetical protein SORBIDRAFT_08g016700 [Sorghum bicolor]
gi|241943964|gb|EES17109.1| hypothetical protein SORBIDRAFT_08g016700 [Sorghum bicolor]
Length = 531
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 215/600 (35%), Positives = 302/600 (50%), Gaps = 112/600 (18%)
Query: 190 MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRK 249
MVCCDVC++WVH +CDGISDEKY +FQ D NLQY C CRGEC Q+RD EDA+RELW+R+
Sbjct: 1 MVCCDVCEKWVHIECDGISDEKYQEFQADQNLQYTCAACRGECSQIRDTEDAIRELWKRR 60
Query: 250 DMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSP 309
D+ D +L+ +LRAAA LP+ +++ P SDDE+ G VLKNE +LK SLK K P
Sbjct: 61 DVVDHELMVTLRAAAALPSPEDVSPPYPNSDDEKLGAYVLKNESRNTLKFSLKSNSSKPP 120
Query: 310 KKVKEHGKKWL-----------------NKK--------YPRKKGYQMPLNSKPEPDQSF 344
E K NK R+ ++P NS+ DQS
Sbjct: 121 SDTPEQEKIVFKSPGSNKKSSKKKGGQGNKTDDGHDEIFLERRHDVKLP-NSRL-GDQSI 178
Query: 345 EGYHDVHSY---GNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDE 401
+G HD + N++ + + L PS A
Sbjct: 179 DGNHDRSPFKNDDNAYISSSTRSSEKSLKSPSKKA------------------------- 213
Query: 402 VMVSDDDKISRVKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRS 461
+ ++ D I +VK K SK L +D ++ K+ T KA KLVI+LG+R + SP+S
Sbjct: 214 -VPNNADMIPKVKIKGSKVSTLHY-KDGEENTPKNDTGKATKLVIHLGSRHKTRSGSPKS 271
Query: 462 DASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGR 521
+ S+ QREQDL + +G +VD +SQ + + +
Sbjct: 272 ELSNSQREQDLGSIHG---------------------------GKVDVTSQLKSSRSEVK 304
Query: 522 GGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAAL 581
+V+K R N+ + ++ +H +GKR S A +S + E A
Sbjct: 305 ERSVMKLVR-ETGAPQRNSLLGDLGTSKKH-------ATGKR----SNALISGM-ENANE 351
Query: 582 RGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPL-RLKFRKPNLENQNSQVSQPEEEKS 640
G R + ++ D+ P + KP L +LKF++P+ E N+Q SQPEE S
Sbjct: 352 TGSRNRSFAQKQYHSSQVDENQGTADSPDNLKPSLLKLKFKRPHFEQLNTQASQPEEPTS 411
Query: 641 LI----------KGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLG 690
+ KGQRSKRKRPS EK + A+ +Q S E+MDANWIL+KLG
Sbjct: 412 WVSQQEEQLNVAKGQRSKRKRPS--MEKADGLDGTTPAKRHQQSTDDEVMDANWILRKLG 469
Query: 691 KDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
KDAIGKR+EVH SD WH+G+V++ + G TL I LD+ R + +ELGKQ +R + + K
Sbjct: 470 KDAIGKRIEVHLTSDGKWHQGMVSNVIGG--TLCIQLDNGRSENVELGKQAIRLIASRSK 527
>gi|224106095|ref|XP_002314041.1| predicted protein [Populus trichocarpa]
gi|222850449|gb|EEE87996.1| predicted protein [Populus trichocarpa]
Length = 96
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 75/96 (78%), Positives = 82/96 (85%), Gaps = 3/96 (3%)
Query: 657 EKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDT 716
EKT++NEDE +QS+ DS E+M+ANWILKKLG DAIGKRVEVHQ SDNSWHKGVV+D
Sbjct: 2 EKTMYNEDEGMSQSHLDS---EMMEANWILKKLGYDAIGKRVEVHQPSDNSWHKGVVSDI 58
Query: 717 VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 752
VE TS LSITLDD RVKTLELGKQ VRFV QKQKRS
Sbjct: 59 VEDTSMLSITLDDDRVKTLELGKQAVRFVSQKQKRS 94
>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
[Strongylocentrotus purpuratus]
Length = 4860
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 108/220 (49%), Gaps = 5/220 (2%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C SCG+ YH +CL + + D + W+CP+C+IC+ CR+ GD NK + C CD Y
Sbjct: 393 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 451
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P + + C C CG+ PGNG S RW YT CD+C + KG
Sbjct: 452 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 511
Query: 175 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 231
CP+C K YR + MV C +C R+VH CD KY Q + G Y+CP CR
Sbjct: 512 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 571
Query: 232 CYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDE 271
+ +LE R +D +S R LP D+
Sbjct: 572 PNRGLELERR-RSASPFEDGRRSPFASSSRTPPHLPIPDD 610
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + G R+L+C CG+ YH C+ + + W+C C +CE C ++
Sbjct: 787 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 844
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC PP + V G + C C CGS PG + W
Sbjct: 845 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 902
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 217
YT C C + +C C + YRD+E ++C C CQRW H C+ + E + +
Sbjct: 903 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 955
Query: 218 DGNLQYRCPTCR 229
D + C CR
Sbjct: 956 DKG--FICTLCR 965
>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
[Strongylocentrotus purpuratus]
Length = 4856
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 98/188 (52%), Gaps = 4/188 (2%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C SCG+ YH +CL + + D + W+CP+C+IC+ CR+ GD NK + C CD Y
Sbjct: 392 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 450
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P + + C C CG+ PGNG S RW YT CD+C + KG
Sbjct: 451 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 510
Query: 175 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 231
CP+C K YR + MV C +C R+VH CD KY Q + G Y+CP CR
Sbjct: 511 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 570
Query: 232 CYQVRDLE 239
+ +LE
Sbjct: 571 PNRGLELE 578
Score = 108 bits (271), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + G R+L+C CG+ YH C+ + + W+C C +CE C ++
Sbjct: 767 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 824
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC PP + V G + C C CGS PG + W
Sbjct: 825 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 882
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 217
YT C C + +C C + YRD+E ++C C CQRW H C+ + E + +
Sbjct: 883 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 935
Query: 218 DGNLQYRCPTCR 229
D + C CR
Sbjct: 936 DKG--FICTLCR 945
>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
[Strongylocentrotus purpuratus]
Length = 4873
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 98/188 (52%), Gaps = 4/188 (2%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C SCG+ YH +CL + + D + W+CP+C+IC+ CR+ GD NK + C CD Y
Sbjct: 409 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 467
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P + + C C CG+ PGNG S RW YT CD+C + KG
Sbjct: 468 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 527
Query: 175 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 231
CP+C K YR + MV C +C R+VH CD KY Q + G Y+CP CR
Sbjct: 528 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 587
Query: 232 CYQVRDLE 239
+ +LE
Sbjct: 588 PNRGLELE 595
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + G R+L+C CG+ YH C+ + + W+C C +CE C ++
Sbjct: 784 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 841
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC PP + V G + C C CGS PG + W
Sbjct: 842 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 899
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 217
YT C C + +C C + YRD+E ++C C CQRW H C+ + E + +
Sbjct: 900 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 952
Query: 218 DGNLQYRCPTCR 229
D + C CR
Sbjct: 953 DKG--FICTLCR 962
>gi|291238977|ref|XP_002739402.1| PREDICTED: rCG56742-like, partial [Saccoglossus kowalevskii]
Length = 1566
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 91/178 (51%), Gaps = 4/178 (2%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C SCG+ YH +CL + + W+CP C+IC+ CR+ GD NK + C CD Y
Sbjct: 388 LFCTSCGQHYHGSCLDPPVDVNPVVR-AGWQCPECKICQTCRQPGDDNKMLVCDTCDKGY 446
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P + + + C C CGS PG+G S RW L Y+ CD+C + KG
Sbjct: 447 HTFCLRPVMQTIPKNGWKCKNCRICTDCGSRTPGSGPSSRWHLNYSVCDSCYQQRNKGLC 506
Query: 175 CPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
CP+C K YR + M+ C+ C++WVH CD D Q D L Y C CR
Sbjct: 507 CPICGKAYRQHTAHNAMIQCESCKKWVHVDCDESIDISVYQQLKDDKLTTIYNCVDCR 564
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 93/199 (46%), Gaps = 19/199 (9%)
Query: 33 TNVMCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T MC C F + EG R+L+C CG+ YH C+ + + W+C C
Sbjct: 797 TQDMCVSCGSFGRDAEG-----RLLTCSQCGQCYHPYCVN--IKITKVVLSKGWRCLDCT 849
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C + D + + C CD +YH YC PP +NV G + C C CG+ P G
Sbjct: 850 VCEGCGKASDEGRLLLCDDCDISYHTYCLEPPLQNVPKGGWKCKWCVCCTKCGATSP--G 907
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
+ W YT C C L CPVC K Y++ E ++ C C RW+H +CDG +E
Sbjct: 908 FNSEWQNNYTQCGPCSSLLT----CPVCFKEYKEDEL--IIQCVQCYRWLHAECDGFHNE 961
Query: 211 KYLQFQVDGNLQYRCPTCR 229
++ D Y C CR
Sbjct: 962 DDIERAADQG--YHCLLCR 978
>gi|405958289|gb|EKC24431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
Length = 4990
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 106/212 (50%), Gaps = 16/212 (7%)
Query: 20 SKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCL-KNWAQNRDL 78
++ IAGEE C LC + G + L C SCG YH CL + A + ++
Sbjct: 1820 AETIAGEE-------AFCVLCCQADKIG-----KQLFCTSCGHHYHGGCLHPSVALSPEV 1867
Query: 79 FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTK 138
+ W+CP C++C++CR+ G+ +K + C CD YH +C P + + C
Sbjct: 1868 --RAGWQCPDCKVCQMCRQPGEDSKMLVCDTCDKGYHTFCLKPVMTAIPKNGWKCKNCRV 1925
Query: 139 CHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQR 198
C CGS PG+G S RW L Y+ CD+C + KG CP+C K YR M+ C C++
Sbjct: 1926 CGDCGSRTPGSGPSSRWHLNYSVCDSCYQQRNKGLSCPLCGKAYRQFTQKAMIQCGTCKK 1985
Query: 199 WVHCQCDGISDEKYL-QFQVDGNLQYRCPTCR 229
VH +CD D L + + + + Y C CR
Sbjct: 1986 HVHAECDDAIDNLMLDRVRNEEQVDYMCSVCR 2017
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 83/177 (46%), Gaps = 12/177 (6%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
+++ C CG+ YH C + + W+C C +CE C + D + + C CD
Sbjct: 2160 KLIVCTQCGQCYHPYCAS--VKVTKVILSKGWRCLDCTVCEGCGKPHDEGRLLLCDECDI 2217
Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
+YH YC PP V G + C C +CG+ PG G + W YT C C R +
Sbjct: 2218 SYHIYCLDPPLDQVPKGTWKCKWCVMCINCGTTTPGFGCN--WQNNYTQCGPC-RSKID- 2273
Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CPVC Y+D E ++ C C RW+H CDG+ E ++ D Y+C CR
Sbjct: 2274 --CPVCRHKYQDDEM--IIQCLQCNRWLHALCDGLRSEDDMERAAD--YDYQCLFCR 2324
>gi|281202543|gb|EFA76745.1| PHD zinc finger-containing protein [Polysphondylium pallidum PN500]
Length = 604
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 67/209 (32%), Positives = 102/209 (48%), Gaps = 6/209 (2%)
Query: 23 IAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK-NWAQNRDLFHW 81
+ E +S T C +C +N ++L C C + +H C+ +
Sbjct: 333 VVPAELKKSTTISTCEVCSHTDNRD-----QLLQCVGCLRSFHGKCINLQTLAIETIKKL 387
Query: 82 SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHS 141
++WKC C++CE C+ T + +K +FC CD YH +C +PP + +G + C C
Sbjct: 388 NTWKCTDCKVCEACKDTTNEDKMLFCDVCDRGYHTFCLNPPLERPPTGGWRCSTCVFCIH 447
Query: 142 CGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVH 201
CG+ PG + W YT C+ C L + YC VC KV + E +P + C C RW H
Sbjct: 448 CGTRTPGPQANSAWRGHYTECEQCNVLVAERKYCSVCRKVIKPHEKSPTIQCGYCDRWTH 507
Query: 202 CQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
QCDG+S +F+ + N QY+C CR
Sbjct: 508 SQCDGMSVSNLEKFKDNPNHQYKCQACRN 536
>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
Length = 5215
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 108/225 (48%), Gaps = 10/225 (4%)
Query: 29 GQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
GQ++ +C+ + G ++ C SCG+ YH C+ AQ + + W+C
Sbjct: 358 GQASLECENSVCYTCKTLG--DIANLMFCSSCGEHYHGICV-GLAQLPGV--RAGWQCRK 412
Query: 89 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
CRIC++CR TGD K M C +CD YH CQ P ++ + C C CGS PG
Sbjct: 413 CRICQVCRMTGDETKLMTCEQCDKIYHSTCQRPIVTSIPKYGWKCRCCRVCGDCGSRTPG 472
Query: 149 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 208
GLS RW YT CD+C + KG CP+C + YR MV C +C+++VH CD +
Sbjct: 473 AGLSSRWHAHYTVCDSCYQQRNKGFSCPLCHRAYRAHAHREMVQCTLCRKFVHGTCDPEA 532
Query: 209 DEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMAD 253
D + + + +Y C ++L L +R + D
Sbjct: 533 DLVTYHQRKEAHPEYEY-----VCLMCKNLTQPATLLAKRNSIDD 572
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 83/181 (45%), Gaps = 17/181 (9%)
Query: 33 TNVMCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C ++EGC ++SC CG+ YH C+ + + W+C C
Sbjct: 702 TQDICVMCGALGTDHEGC-----LISCVQCGQCYHPYCVN--VKITKVVLQKGWRCLDCT 754
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C + D + + C CD +YH YC PP V G + C C +CG+ P G
Sbjct: 755 VCEGCGQRNDEARLILCDDCDISYHIYCMDPPLDYVPHGNWKCKWCAICQTCGATDP--G 812
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
+ W C CG N CP C + Y SE ++ C C+RW+H CD I E
Sbjct: 813 FNCSWM---NSCSECGPCASHVN-CPSCSEPY--SEGDLIIQCVQCERWLHGTCDSIKTE 866
Query: 211 K 211
+
Sbjct: 867 E 867
>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
Length = 5713
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 99/202 (49%), Gaps = 14/202 (6%)
Query: 32 NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSC 89
N +V C C G ++ C CG+ YH +C L R + W+C SC
Sbjct: 375 NGDVTCMQCC-----GMGDVSNLVMCSICGQHYHGSCVGLALLPGVR-----AGWQCASC 424
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
R+C++CR+ D +K M C RC+ AYH C P ++ + C C CGS PG
Sbjct: 425 RVCQVCRQPEDVSKVMLCERCEKAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGA 484
Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
GLS RW YT CD+C + KG CP+C K YR + MV C C+++VH CD +D
Sbjct: 485 GLSSRWHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEAD 544
Query: 210 EKYLQF--QVDGNLQYRCPTCR 229
Q +V + +Y C C+
Sbjct: 545 PLTYQHRKEVKPDYEYVCLHCK 566
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 17/177 (9%)
Query: 33 TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C +G + EGC +++C CG+ YH C + + W+C C
Sbjct: 713 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 765
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C D + + C CD +YH YC PP V G + C C +CGSN P G
Sbjct: 766 VCEGCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 823
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
+ W YT C C C C + Y + + ++ C C+RW+HC CD I
Sbjct: 824 FNSSWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSI 874
>gi|328717947|ref|XP_001943997.2| PREDICTED: hypothetical protein LOC100159693, partial
[Acyrthosiphon pisum]
Length = 2904
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 101/208 (48%), Gaps = 17/208 (8%)
Query: 34 NVMCRLC-FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCR 90
NV C +C +G+ ++ C +CG YH CL L + W+C +CR
Sbjct: 383 NVACGICSTLGD------VSNLMMCTACGSHYHGVCL-----GLALLPGVRAGWQCGNCR 431
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
IC++CR+ + K M C CD AYH C P + + C C CGS PG G
Sbjct: 432 ICQVCRQPAEQTKVMLCEGCDKAYHPGCLRPQVTTIPKIGWKCKCCRVCTDCGSRTPGAG 491
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
LS RW YT CD+C + KG+ CP+C + YR + MV C C++++H CD +
Sbjct: 492 LSSRWHAHYTVCDSCYQQRNKGSSCPLCHRAYRAAAHREMVQCISCRKYIHGACD--PEA 549
Query: 211 KYLQFQVDGNL-QYRCPTCRGECYQVRD 237
+Y+ + +Y CP C+ Q RD
Sbjct: 550 EYITSHSKSSASEYMCPLCKNAVQQRRD 577
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 84/172 (48%), Gaps = 10/172 (5%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G ++ ++SC CG+ YH C+ + + W+C C +CE C +
Sbjct: 723 ICVMCGSLGTDQEACLISCSQCGQCYHPFCVN--VKVTKVILQKGWRCLDCTVCEGCGQR 780
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D ++ + C CD +YH YC P V G + C +C +CGSN P G + W
Sbjct: 781 NDESRLILCDECDISYHIYCTDPKLDYVPRGTWKCKWCAQCLTCGSNDP--GFNCSWLNN 838
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
YT C C + CP C + Y D++ ++ C C RW+H +CD I +E
Sbjct: 839 YTECGPCASRSI----CPSCQESYTDNQL--IIKCSQCDRWLHGKCDKIENE 884
>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
Length = 5622
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 99/202 (49%), Gaps = 14/202 (6%)
Query: 32 NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSC 89
N ++ C C G ++ C CG+ YH +C+ L + W+C SC
Sbjct: 366 NGDITCMHCC-----GMGDVSNLVMCSICGQHYHGSCV-----GLALLPGVRAGWQCVSC 415
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
R+C++CR+ D +K M C RC+ AYH C P ++ + C C CGS PG
Sbjct: 416 RVCQVCRQPEDVSKVMLCERCEKAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGA 475
Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
GLS RW YT CD+C + KG CP+C K YR + MV C C+++VH CD +D
Sbjct: 476 GLSSRWHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEAD 535
Query: 210 EKYLQFQ--VDGNLQYRCPTCR 229
Q + V + +Y C C+
Sbjct: 536 PLTYQHRKDVKPDYEYVCLHCK 557
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 81/177 (45%), Gaps = 17/177 (9%)
Query: 33 TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C +G + EGC +++C CG+ YH C + + W+C C
Sbjct: 704 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 756
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C D + + C CD +YH YC PP V G + C C +CGSN P G
Sbjct: 757 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 814
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
+ W YT C C C C + Y +E ++ C C+RW+HC CD I
Sbjct: 815 FNSSWQKNYTQCGPCA----SHAACISCQETY--TEGDLIIQCIQCERWLHCACDSI 865
>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
Length = 5619
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 99/202 (49%), Gaps = 14/202 (6%)
Query: 32 NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSC 89
N ++ C C G ++ C CG+ YH +C+ L + W+C SC
Sbjct: 366 NGDITCMHCC-----GMGDVSNLVMCSICGQHYHGSCV-----GLALLPGVRAGWQCVSC 415
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
R+C++CR+ D +K M C RC+ AYH C P ++ + C C CGS PG
Sbjct: 416 RVCQVCRQPEDVSKVMLCERCEKAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGA 475
Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
GLS RW YT CD+C + KG CP+C K YR + MV C C+++VH CD +D
Sbjct: 476 GLSSRWHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEAD 535
Query: 210 EKYLQFQ--VDGNLQYRCPTCR 229
Q + V + +Y C C+
Sbjct: 536 PLTYQHRKDVKPDYEYVCLHCK 557
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 81/177 (45%), Gaps = 17/177 (9%)
Query: 33 TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C +G + EGC +++C CG+ YH C + + W+C C
Sbjct: 704 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 756
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C D + + C CD +YH YC PP V G + C C +CGSN P G
Sbjct: 757 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 814
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
+ W YT C C C C + Y +E ++ C C+RW+HC CD I
Sbjct: 815 FNSSWQKNYTQCGPCA----SHTACISCQETY--TEGDLIIQCIQCERWLHCACDSI 865
>gi|320166419|gb|EFW43318.1| mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864]
Length = 1858
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 93/211 (44%), Gaps = 20/211 (9%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKN-----WAQNRDLFHWSSWKCPSCR 90
+CR C G E M C C + YH C+K+ + SWKC C
Sbjct: 578 LCRGC--GTRGTDEETSGMHWCNQCCQPYHDFCVKSSFGDAYESTLKEIAQGSWKCWDCI 635
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GN 149
+C C + + C C H C P V SG +LC + KC SCG+ P G
Sbjct: 636 VCTTCNSSFPEETLVVCDNCAVGRHLGCMDIPLAEVPSGRWLCSQCVKCDSCGAQTPRGM 695
Query: 150 GLS-----------VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQ 197
G + W Y+ C CG L +GNYC VC KVY D + TPM+ C+ C
Sbjct: 696 GKTRLPSSFPSSQPCEWMFDYSLCQPCGLLKARGNYCRVCEKVYEDDDYDTPMISCEQCS 755
Query: 198 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
W+H C G+ +E Y + D NL + CP+C
Sbjct: 756 MWLHTHCVGMDEETYEMYSNDENLAFTCPSC 786
>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
Length = 3474
Score = 125 bits (315), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 75/224 (33%), Positives = 107/224 (47%), Gaps = 25/224 (11%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRIC 92
V C C+ G ++ C CG+ YH +C L R + W+C SCR+C
Sbjct: 365 VTCVQCY-----GMGDVSNLVMCSVCGQHYHGSCVGLALLPGVR-----AGWQCVSCRVC 414
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
++CR+ D +K M C RCD AYH C P ++ + C C CGS PG GLS
Sbjct: 415 QVCRQPEDVSKVMLCERCDKAYHPGCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLS 474
Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
RW YT CD+C + KG CP+C K YR + MV C C+++VH CD +D
Sbjct: 475 SRWHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCHGCKKFVHGTCDPEADPLT 534
Query: 213 LQFQVDG--NLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADK 254
Q + + + +Y C C+ ++ + RRKD D+
Sbjct: 535 YQQRKEAKPDYEYVCLHCK-----------SIAMVARRKDSIDE 567
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 85/180 (47%), Gaps = 17/180 (9%)
Query: 33 TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C +G + EGC +++C CG+ YH C + + W+C C
Sbjct: 700 TQDICVMCGSIGMDQEGC-----LIACVQCGQCYHPYCAG--VKITKVILQKGWRCLDCT 752
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C D + + C CD +YH YC PP V G + C +C +CGSN P G
Sbjct: 753 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAQCQTCGSNDP--G 810
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
+ W YT C C C VC + Y++ + ++ C C+RW+HC CD I E
Sbjct: 811 FNSSWQKSYTQCGPCA----SHTACVVCQEAYQEGDL--IIQCVQCERWLHCGCDSIKSE 864
>gi|242016925|ref|XP_002428945.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
gi|212513774|gb|EEB16207.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
Length = 6073
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/236 (31%), Positives = 111/236 (47%), Gaps = 22/236 (9%)
Query: 39 LCFVGENEGCE------RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCR 90
L +GE C +L C SCG +H +C+ L + W+C CR
Sbjct: 370 LLPIGETSQCSTCLSLGNVSNILMCTSCGAHHHGSCV-----GLALLPGVRAGWQCFECR 424
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+C++CR+ + K M C CD AYH C P ++ + C C CGS PG+G
Sbjct: 425 VCQVCRQPSEIGKIMLCESCDKAYHPSCLRPIVTSIPKYGWKCKCCRVCSDCGSRTPGSG 484
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
LS RW ++ CD+C + KG CPVC + YR + MV C C+++VH CD +D
Sbjct: 485 LSSRWHNHFSVCDSCYQQRNKGFCCPVCGRAYRAAAHREMVQCIKCRKYVHGSCDNEADI 544
Query: 211 KYLQFQVDGN--LQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAA 264
+ + N +Y C C ++L R+ +RKD D+ L+ S +A+
Sbjct: 545 SVYAARKETNPDYEYICCIC-------KNLNSMGRQGIKRKDSFDEALLESSLSAS 593
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 94/196 (47%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
+C +C + EGC ++SC CG+ YH C+ + + W+C C +CE
Sbjct: 723 ICVMCGALGTDQEGC-----LISCAQCGQCYHPYCVN--VKVTKVILQKGWRCLDCTVCE 775
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + D ++ C CD +YH YC PP V G + C C CGSN P G +
Sbjct: 776 GCGQRNDDSRLTLCDDCDISYHIYCMDPPLDYVPRGVWKCKWCVVCIRCGSNDP--GFNC 833
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W GYT C C +CP CL+ Y + E ++ C+ C+RW+H CDGI ++ L
Sbjct: 834 NWMNGYTECGPCA----SHTFCPSCLEPYVEGEL--VIQCEQCERWLHGSCDGIRND--L 885
Query: 214 QFQVDGNLQYRCPTCR 229
+ + +Y C CR
Sbjct: 886 DAEKCADEKYTCVLCR 901
>gi|326679526|ref|XP_001919281.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Danio rerio]
Length = 3915
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 65/191 (34%), Positives = 92/191 (48%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+LSC CG+ YH C+ + + W+C C +CE C +
Sbjct: 20 MCVVCGSFGRGAEGRLLSCSQCGQCYHPFCVN--IKITKVVLSKGWRCLECTVCEACGQA 77
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP +NV G + C C CG+ P GL W
Sbjct: 78 SDPGRLLLCDDCDISYHTYCLDPPLQNVPKGSWKCKWCVLCTHCGATSP--GLRCEWQNN 135
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L V CPVC + YR+ E ++ C C RWVH C G++ ++ ++ D
Sbjct: 136 YTQCGPCASLTV----CPVCTRSYREEEL--ILQCRQCDRWVHGSCQGLNSDEDVENAAD 189
Query: 219 GNLQYRCPTCR 229
+ C CR
Sbjct: 190 EG--FDCTLCR 198
>gi|291240901|ref|XP_002740354.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like
[Saccoglossus kowalevskii]
Length = 4402
Score = 122 bits (305), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 100/202 (49%), Gaps = 20/202 (9%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCRICEI 94
+CF+ + G + ++ C C + +H CL K N D+ W C C+ C +
Sbjct: 1052 VCFLCASTG---QQELVYCNVCCEPFHEFCLEEDEKPLDDNTDI-----WCCKRCKFCHV 1103
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGL 151
C R + + C +C YH C P + + ++C K +C SCG+ PG
Sbjct: 1104 CGRQQN---LLQCDKCHNTYHAECLGPNYPTKPTKKKKVWICTKCVRCKSCGATTPGQSS 1160
Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDE 210
S +W ++ C CG+LF GNYCP+C + Y D + + M+ C C+ WVH +C+G++DE
Sbjct: 1161 SAQWSHDFSLCQDCGKLFDIGNYCPLCQQCYTDDDYDSKMMQCPCCESWVHAKCEGLTDE 1220
Query: 211 KY-LQFQVDGNLQYRCPTCRGE 231
Y + + ++ Y C C+ E
Sbjct: 1221 MYQIMCEFPEDIHYTCSKCQPE 1242
>gi|449684588|ref|XP_002166105.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Hydra
magnipapillata]
Length = 229
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 99/195 (50%), Gaps = 18/195 (9%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
++++C C H +CL + W+C C++C C D ++ MFC CD
Sbjct: 14 KLINCSQCSNGGHPSCLDMNKSLLKVIKGYPWQCMECKVCTECLAPHDEHEMMFCDNCDR 73
Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFLGYT----------- 160
YH YC K + G ++C + KC SC S P +G S RW + +T
Sbjct: 74 GYHSYCVGV--KEIPKGRWVCNRCGKCCSCLSRQPVSDGGSGRWKMEFTKPTDGSEPEFL 131
Query: 161 --CCDACGRLFVKGNYCPVCLKVYRDSEST--PMVCCDVCQRWVHCQCDGISDEKYLQFQ 216
C C LF KG++CPVCLKVY D + PMVCCD C RW+H CDGI +++Y++
Sbjct: 132 QNHCRKCSILFRKGSFCPVCLKVYCDDDGVVNPMVCCDNCDRWIHTDCDGIDEQRYIELS 191
Query: 217 VDGNLQYRCPTCRGE 231
D + Y C CRGE
Sbjct: 192 KDHHSAYTCVLCRGE 206
>gi|297736277|emb|CBI24915.3| unnamed protein product [Vitis vinifera]
Length = 75
Score = 120 bits (302), Expect = 2e-24, Method: Composition-based stats.
Identities = 57/73 (78%), Positives = 62/73 (84%)
Query: 680 MDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGK 739
MDANWILKKLGKDAIGKRVEVHQ SDNSWHKG+V D +EGTSTL + DD R KTLELGK
Sbjct: 1 MDANWILKKLGKDAIGKRVEVHQSSDNSWHKGMVIDFIEGTSTLIVKFDDGRAKTLELGK 60
Query: 740 QGVRFVPQKQKRS 752
Q +R + QKQKRS
Sbjct: 61 QAIRLISQKQKRS 73
>gi|427798455|gb|JAA64679.1| Putative phagocytosis engulfment, partial [Rhipicephalus
pulchellus]
Length = 951
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)
Query: 41 FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+VG C+ M+S C CG YH CL W+CP C+ C+
Sbjct: 18 YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 76
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
C R GD + + C CD A+H YC P NV + C C CGS PG+G S R
Sbjct: 77 CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 136
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W + Y+ CD+C + KG CP+C K YR S M C VC++++H +CDG +
Sbjct: 137 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 192
Query: 214 QFQVDGNLQYRCPTC 228
DG+ Y CP C
Sbjct: 193 SSPKDGD--YVCPVC 205
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 96/196 (48%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
+C +C F EG R+++C CG+ YH C+ + + W+C C +CE
Sbjct: 380 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCE 432
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + D ++ + C CD +YH YC PP + V G + C C CG+ PGNG
Sbjct: 433 GCGQPHDESRLLLCDECDISYHTYCLSPPLETVPQGNWKCRWCVICVKCGATEPGNG--S 490
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
+W YT C C + CP+CL Y+DSE ++ C C+RW+H CD IS E+
Sbjct: 491 QWQNNYTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEEDA 544
Query: 214 QFQVDGNLQYRCPTCR 229
+ + Y CP CR
Sbjct: 545 ERCAE--YGYNCPYCR 558
>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
Length = 4906
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 942 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 994
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ P GL
Sbjct: 995 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1052
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CP+C + YRD E ++ C C RW+H C ++ E+ +
Sbjct: 1053 EWQNNYTQCAPCASL----STCPICYRTYRDEEL--IIQCRQCDRWMHAICQNLNTEEEV 1106
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 1107 ENIAD--MGFDCTICR 1120
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 82/179 (45%), Gaps = 21/179 (11%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C +CG+ YH CL + W+CP C++C+ C+ +G+ NK + C CD Y
Sbjct: 359 LFCTTCGQHYHGMCLDIQVTP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 415
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P +V + + C C CG+ S +W CD+C + + N
Sbjct: 416 HTFCLQPVMDSVPTNGWKCKNCRVCAECGTRT-----SCQWHHNCLVCDSCYQQ--QDNL 468
Query: 175 -CPVCLKVYRDSESTPMVCCDVCQRWVHCQCD---GISDEKYLQFQVDGNLQYRCPTCR 229
CP C K+ M+ C +C+RW+H +CD GI E L+ Y C C+
Sbjct: 469 SCPFCDKLCLQDFQKDMLHCHMCKRWIHMECDRSPGIELESQLK-------DYICTLCK 520
>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
guttata]
Length = 4871
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 912 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 964
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ P GL
Sbjct: 965 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1022
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CP+C + YRD E ++ C C RW+H C ++ E+ +
Sbjct: 1023 EWQNNYTQCAPCASL----STCPICYRTYRDEEL--IIQCRQCDRWMHAICQNLNTEEEV 1076
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 1077 ENIAD--MGFDCTICR 1090
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 80/176 (45%), Gaps = 15/176 (8%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C +CG+ YH CL + W+CP C++C+ C+ +G+ NK + C CD Y
Sbjct: 327 LFCTTCGQHYHGMCLDIQVTP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 383
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P V + + C C CG+ S +W CD+C + + N
Sbjct: 384 HTFCLQPVMDAVPTNGWKCKNCRVCAECGTRT-----SCQWHHNCLVCDSCYQQ--QDNL 436
Query: 175 -CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CP C K+ M+ C +C+RW+H CD S L+ Q+ Y C CR
Sbjct: 437 SCPFCEKLCLQDFQKDMLHCHMCKRWIHIDCDR-SPGSELESQLK---DYICTLCR 488
>gi|241859648|ref|XP_002416243.1| hypothetical protein IscW_ISCW023204 [Ixodes scapularis]
gi|215510457|gb|EEC19910.1| hypothetical protein IscW_ISCW023204 [Ixodes scapularis]
Length = 1179
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 91/195 (46%), Gaps = 15/195 (7%)
Query: 41 FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+VG C+ M+S C CG YH CL + A W+CP C+ C+
Sbjct: 18 YVGSQANCQSCEEMVSPSELLFCTLCGAHYHGFCL-DPAVRVTTSTRVGWQCPDCKACQA 76
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
CRR GD + + C CD +H YC P NV + C C CGS PG+G S R
Sbjct: 77 CRRPGDEARLLTCDICDKGFHVYCVKPVVANVPKHGWKCQNCRVCGDCGSRTPGSGPSSR 136
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W + ++ CD+C + KG CP+C K YR S M C +C++++H +CD
Sbjct: 137 WHMNFSVCDSCYQQRNKGVACPLCGKAYRQFSHREDMAQCTMCRKYIHMECDS------- 189
Query: 214 QFQVDGNLQYRCPTC 228
Q + Y CP C
Sbjct: 190 QLANHQDADYVCPVC 204
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 94/196 (47%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
+C +C F EG R+++C CG+ YH C+ N R + W+C C +CE
Sbjct: 287 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCV-NVKVTRMILK-KGWRCLDCTVCE 339
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + D ++ + C CD +YH YC PP +NV G + C C CG+ PG G
Sbjct: 340 GCGQPHDESRLLLCDECDISYHTYCLSPPLENVPQGNWKCRWCVVCLQCGATDPGFG--S 397
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C CP+CL Y++++ ++ C C+RW+H CD I+ E+
Sbjct: 398 HWQNNYTQCGPC----ASKTSCPLCLLKYQENDL--VIQCVQCERWMHGFCDQIACEE-- 449
Query: 214 QFQVDGNLQYRCPTCR 229
+ Y CP CR
Sbjct: 450 DAEKCAEYGYNCPYCR 465
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 991 MCVVCGSFGLGAEGRLLACAQCGQCYHPFCVG--IKITKVVLSKGWRCLECTVCEACGQA 1048
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP +NV + C C CG+ P GL W
Sbjct: 1049 TDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTP--GLRCEWQNN 1106
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CP+CL Y SE T +V C C RW H C + E+ ++ D
Sbjct: 1107 YTLCAPCASL----STCPICLVDY--SEGTIIVQCRQCDRWFHASCQSLHSEEDIEKAAD 1160
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1161 SS--FDCTMCR 1169
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 73/175 (41%), Gaps = 11/175 (6%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C SCG YH CL + W+CP C++C+ C+ G+ K + C CD Y
Sbjct: 357 LFCTSCGLHYHGICLDMAVTP---LRRAGWQCPECKVCQTCKNPGEDTKMLVCDMCDKGY 413
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P + + + C C CG+ G +W C+ C +
Sbjct: 414 HTFCLQPVIDTLPTNGWRCQNCRVCLQCGTRTGG-----QWHHTSLLCENCVQNQDPALC 468
Query: 175 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CP+C + +V C C+RW+H +C+ + Q ++ Y C CR
Sbjct: 469 CPMCSCILDPEHHKDLVFCHTCKRWLHLECE---RQNSGQAEIHPREDYVCSNCR 520
>gi|358334996|dbj|GAA53428.1| histone-lysine N-methyltransferase MLL3, partial [Clonorchis
sinensis]
Length = 3518
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 103/230 (44%), Gaps = 19/230 (8%)
Query: 8 ARRFESGYVATASKDIAGEEQGQSNTNVMCRL--CFVGENEGCERA------RRMLSCKS 59
+RR ++ + + E+ +T V+CR FV E + C +L+C
Sbjct: 469 SRRTKTRKTSMYESKMPDEKDDHPSTVVLCRADDTFVLEQDMCVACGSFGLDTVLLACAQ 528
Query: 60 CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQ 119
CG+ YH C R + W+C C +CE C T + + + C CD +YH YC
Sbjct: 529 CGQCYHPFCADVPKITRTMLE-KGWRCLDCTVCEGCGGTTNESLLLLCDDCDISYHTYCL 587
Query: 120 HPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCL 179
PP + V G + C + C +CG P GL+ +W Y+ C C L CPVC
Sbjct: 588 DPPLQEVPKGGWKCSECVVCTNCGQRDP--GLNGKWHANYSMCAPCASLAT----CPVCT 641
Query: 180 KVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
YR+ E ++ C +C RW H CD + E L+ D + Y C CR
Sbjct: 642 LAYREGEL--LIRCALCSRWSHAGCDQLRTEDELELATD--MGYNCLLCR 687
Score = 96.3 bits (238), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 82/217 (37%), Gaps = 42/217 (19%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C CG YH +CL+ Q W+C C+ C IC + D NK + C CD
Sbjct: 53 LLFCTGCGSHYHGSCLEPSLQPNPTIRIG-WQCAECKACLICNESKDENKMLVCDVCDKG 111
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCG----SNVPGNGLSV----------RWFLGY 159
+H YC PP + + C + C CG S V G G+ V +W Y
Sbjct: 112 FHTYCLRPPVSCIPRNGFKCERCRVCSDCGAGRASTVSGLGVMVEFNNPQLPVIKWHSNY 171
Query: 160 TCCDACGRLFVKGNY-CPVCLKVYRDSESTPMVC----------------CDVCQRWVHC 202
T CD C + CPVC + +R S P C C+R VH
Sbjct: 172 TLCDRCFHSRKRPTASCPVCERAWRCSLQVPSYISTQPQTSTHVTWPGRRCTKCRRMVHA 231
Query: 203 QCDGI----------SDEKYLQFQVDGNLQYRCPTCR 229
CD + S + G + Y CP CR
Sbjct: 232 DCDPLQSVATGTASPSSAMSEDTNIAGGIAYSCPVCR 268
>gi|432916836|ref|XP_004079403.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4802
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 965 MCVVCGSFGLGAEGRLLACAQCGQCYHPFCVG--IKINKVVLSKGWRCLECTVCEACGQA 1022
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP +NV + C C CG+ P GL W
Sbjct: 1023 TDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTP--GLRCEWQSN 1080
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CP+CL Y SE T +V C C RW H C G+ E L+ +
Sbjct: 1081 YTQCAPCASL----STCPICLVNY--SEGTVIVQCRQCDRWFHASCQGLHSEDDLEKAAE 1134
Query: 219 GNLQYRCPTCR 229
+ + C C+
Sbjct: 1135 NS--FDCTICQ 1143
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 75/175 (42%), Gaps = 11/175 (6%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C SCG YH CL + W+CP C++C+ C+ GD K + C CD Y
Sbjct: 357 LFCTSCGLHYHGMCLDMAVTP---LRRAGWQCPECKVCQTCKNHGDDTKMLVCDMCDKGY 413
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P +++ + + C C CG+ G+ W C+ C +
Sbjct: 414 HTFCLQPAMESLPTNGWRCKNCRVCIQCGTRTSGH-----WHHNSLLCENCFQNQDPALC 468
Query: 175 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
C +C + ++ C C+RW+H +C+ + Q +++ Y C CR
Sbjct: 469 CSMCSCILDPEHHKDLLFCQTCKRWLHLECE---RQNSGQTEINPREDYVCFNCR 520
>gi|301606681|ref|XP_002932945.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 3840
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 99/199 (49%), Gaps = 16/199 (8%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+CF+ + G + C+ C + +HR CL+ + + +W C C+ C +C R
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433
Query: 99 GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSNVPGNGL 151
K + C +C +YH C P N + P ++C K +C SCGS PG G
Sbjct: 1434 QQATKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKCVRCKSCGSTTPGKGW 1490
Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDE 210
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ ++DE
Sbjct: 1491 DAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGKCDRWVHSKCENLTDE 1550
Query: 211 KY-LQFQVDGNLQYRCPTC 228
Y + + ++ Y C C
Sbjct: 1551 MYEILSNLPESVAYTCINC 1569
>gi|47228227|emb|CAG07622.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4527
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 91/201 (45%), Gaps = 14/201 (6%)
Query: 31 SNTNVMCR--LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
SN N + +C V + G R+L+C CG+ YH C+ + + W+C
Sbjct: 661 SNDNFTLKQDMCVVCGSFGLGAEGRLLACAQCGQCYHPYCVG--IKINKVVLSKGWRCLE 718
Query: 89 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
C +CE C + DP + + C CD +YH YC PP +NV + C C CG+ P
Sbjct: 719 CTVCEACGQATDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVTCTQCGATTP- 777
Query: 149 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 208
GL W YT C C L CP+CL Y SE T ++ C C RW H C +
Sbjct: 778 -GLRCEWQKNYTQCAPCASLMT----CPICLVDY--SEGTTILQCRQCDRWFHASCQSLH 830
Query: 209 DEKYLQFQVDGNLQYRCPTCR 229
E+ ++ + + C CR
Sbjct: 831 SEEDVEKAAENG--FNCTMCR 849
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
L C SCG+ YH CL + W+CP C+IC+ C+
Sbjct: 138 LFCTSCGQHYHGICLDMAVTP---LRRAGWQCPECKICQTCK 176
>gi|358416718|ref|XP_003583467.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
Length = 2688
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1187 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1245
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +LF
Sbjct: 1246 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1304
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1305 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1364
Query: 228 CRGECY 233
C G +
Sbjct: 1365 CAGATH 1370
>gi|359075420|ref|XP_003587289.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
Length = 2711
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1210 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1268
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +LF
Sbjct: 1269 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1327
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1328 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1387
Query: 228 CRGECY 233
C G +
Sbjct: 1388 CAGATH 1393
>gi|440894918|gb|ELR47236.1| Histone-lysine N-methyltransferase MLL4, partial [Bos grunniens
mutus]
Length = 2524
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1066 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1124
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +LF
Sbjct: 1125 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1183
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1184 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1243
Query: 228 CRGECY 233
C G +
Sbjct: 1244 CAGATH 1249
>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3691
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 69/214 (32%), Positives = 95/214 (44%), Gaps = 22/214 (10%)
Query: 26 EEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
EE NT VM +C V + G R+L+C CG+ YH C+ N
Sbjct: 128 EENSMHNTVVMFSTTDHFTLKQDMCVVCGSFGQGAEGRLLACSQCGQCYHPFCV-NVKMT 186
Query: 76 RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
R + W+C C +CE C DP + + C CD +YH YC PP V G + C
Sbjct: 187 RVVL-TKGWRCLECTVCEACGEASDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKW 245
Query: 136 HTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
+C CGS+ P G+ W Y+CC CG L CP+C + Y E ++ C
Sbjct: 246 CVRCVQCGSSSP--GVRCDWQDNYSCCGPCGSL----RRCPLCQRPYAHDEL--IMQCQQ 297
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
C RWVH C + E+ ++ D + C CR
Sbjct: 298 CDRWVHATCQNLMCEEDVEAAADEG--FDCSLCR 329
>gi|431918577|gb|ELK17795.1| Histone-lysine N-methyltransferase MLL4 [Pteropus alecto]
Length = 3017
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1517 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1575
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1576 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1634
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1635 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1694
Query: 228 CRGECY 233
C G +
Sbjct: 1695 CAGATH 1700
>gi|403293026|ref|XP_003937525.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
boliviensis boliviensis]
Length = 2665
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 18/200 (9%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1165 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1223
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1224 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1282
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1283 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1342
Query: 228 CRGECYQVRDLEDAVRELWR 247
C G AV+ WR
Sbjct: 1343 CAG----------AVQPRWR 1352
>gi|328874899|gb|EGG23264.1| PHD zinc finger-containing protein [Dictyostelium fasciculatum]
Length = 758
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 89/184 (48%), Gaps = 3/184 (1%)
Query: 53 RMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCD 111
+ SC C + YH CL+ N + +WKC C++CE+C +K MFC CD
Sbjct: 494 QTFSCIGCHRVYHGKCLQLNQLAIDTIKRNGNWKCIDCKLCEVCNEGVHEDKMMFCDVCD 553
Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 171
YH +C P +G + C + C CGS G S +W YT C+ C
Sbjct: 554 KGYHTFCCSPKLDAPPTGGWKCSQCVHCIHCGSRSAGPSSSSKWNANYTVCEVCTPKVQD 613
Query: 172 GNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL-QYRCPTCR 229
YC VC K+ + S E P+V C C RW H CD I++E + + + N QY+CPTC+
Sbjct: 614 KKYCTVCRKIIKSSGEKKPIVQCVYCDRWTHAGCDSITEEFLEKMKENPNYHQYKCPTCK 673
Query: 230 GECY 233
Y
Sbjct: 674 TGNY 677
>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ornithorhynchus anatinus]
Length = 4910
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 962 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1014
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ P GL
Sbjct: 1015 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1072
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ E ++ C C RW+H C ++ E+ +
Sbjct: 1073 EWQNNYTQCAPCASL----STCPVCYRNYREEEL--ILQCRQCDRWMHAICQNLNTEEEV 1126
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 1127 ENIAD--IGFDCTMCR 1140
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 83/173 (47%), Gaps = 12/173 (6%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+ +G+ NK + C CD YH
Sbjct: 354 CTTCGQHYHGMCLDIAITP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHT 410
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P +V + + C C CG+ S +W CD+C + + CP
Sbjct: 411 FCLQPVIDSVPTNGWKCKNCRVCAECGTRT-----SAQWHHNCLVCDSCYQQ-QESLSCP 464
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
C K Y M+ C +C+RW+H +CD +D + L+ Q+ +Y C C+
Sbjct: 465 FCGKYYHPDFQKDMLHCHMCKRWIHIECDKPTDTE-LESQL--REEYICMFCK 514
>gi|296233585|ref|XP_002807874.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4 [Callithrix jacchus]
Length = 2660
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 18/200 (9%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1160 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1218
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1219 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1277
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1278 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1337
Query: 228 CRGECYQVRDLEDAVRELWR 247
C G AV+ WR
Sbjct: 1338 CAG----------AVQPRWR 1347
>gi|115495457|ref|NP_083550.2| histone-lysine N-methyltransferase MLL4 [Mus musculus]
gi|341940998|sp|O08550.3|MLL4_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 4
homolog; AltName: Full=Trithorax homolog 2; AltName:
Full=WW domain-binding protein 7; Short=WBP-7
Length = 2713
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279
Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 1280 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 1338
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398
Query: 228 CRG 230
C G
Sbjct: 1399 CAG 1401
>gi|56744180|dbj|BAD81031.1| mixed lineage leukemia 2 [Mus musculus]
Length = 2713
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279
Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 1280 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 1338
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398
Query: 228 CRG 230
C G
Sbjct: 1399 CAG 1401
>gi|410905295|ref|XP_003966127.1| PREDICTED: uncharacterized protein LOC101073293 [Takifugu rubripes]
Length = 3463
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 10/195 (5%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RR 97
+CF+ ++G ML C+ C + +HR CL+ A+ + +W C CR C +C R+
Sbjct: 1732 VCFLCASKG---QHEMLHCQVCCEPFHRFCLEP-AERPSEENKENWCCRRCRFCHVCGRK 1787
Query: 98 TGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
+ + C RC YH C P P +N ++C +C SCG PG + W
Sbjct: 1788 NKNSKPLLECERCQNCYHASCLGPSYPKQNKKRKTWVCVTCIRCKSCGV-TPGKSWDIDW 1846
Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-L 213
C C +LF GNYCP+C K Y D++ + M+ C C WVH +C+ ++DE Y +
Sbjct: 1847 NHEKGLCQDCSKLFEMGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDLTDELYEI 1906
Query: 214 QFQVDGNLQYRCPTC 228
+ ++ Y C C
Sbjct: 1907 LSSLPESVVYSCRPC 1921
>gi|402905199|ref|XP_003915410.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Papio
anubis]
Length = 2716
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1216 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1274
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1275 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1333
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1334 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1393
Query: 228 CRG 230
C G
Sbjct: 1394 CAG 1396
>gi|335289510|ref|XP_003127115.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Sus scrofa]
Length = 2721
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1280 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1338
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398
Query: 228 CRGECY 233
C G +
Sbjct: 1399 CAGATH 1404
>gi|5923931|gb|AAD56420.1|AF186605_1 MLL2 protein [Homo sapiens]
Length = 2605
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1105 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1163
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1164 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1222
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1223 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1282
Query: 228 CRG 230
C G
Sbjct: 1283 CAG 1285
>gi|7662046|ref|NP_055542.1| histone-lysine N-methyltransferase MLL4 [Homo sapiens]
gi|12643900|sp|Q9UMN6.1|MLL4_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein
4; AltName: Full=Trithorax homolog 2; AltName: Full=WW
domain-binding protein 7; Short=WBP-7
gi|5123787|emb|CAB45385.1| trithorax homologue 2 [Homo sapiens]
Length = 2715
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1215 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1273
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1274 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1332
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1333 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1392
Query: 228 CRG 230
C G
Sbjct: 1393 CAG 1395
>gi|71891784|dbj|BAA20763.3| KIAA0304 protein [Homo sapiens]
Length = 2415
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 915 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 973
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 974 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1032
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1033 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1092
Query: 228 CRG 230
C G
Sbjct: 1093 CAG 1095
>gi|432100936|gb|ELK29286.1| Histone-lysine N-methyltransferase MLL4 [Myotis davidii]
Length = 2566
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 950 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1008
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1009 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1067
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1068 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1127
Query: 228 CRGECY 233
C G +
Sbjct: 1128 CAGATH 1133
>gi|395846912|ref|XP_003796132.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Otolemur
garnettii]
Length = 2714
Score = 115 bits (289), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1210 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1268
Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1269 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1327
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1328 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1387
Query: 228 CRG 230
C G
Sbjct: 1388 CAG 1390
>gi|426388428|ref|XP_004060643.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Gorilla
gorilla gorilla]
Length = 2536
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1120 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1178
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1179 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1237
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1238 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1297
Query: 228 CRG 230
C G
Sbjct: 1298 CAG 1300
>gi|332855019|ref|XP_512597.3| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Pan
troglodytes]
Length = 2526
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1026 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1084
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1085 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1143
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1144 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1203
Query: 228 CRG 230
C G
Sbjct: 1204 CAG 1206
>gi|351711122|gb|EHB14041.1| Histone-lysine N-methyltransferase MLL4, partial [Heterocephalus
glaber]
Length = 2592
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1072 LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRASKHLLECERCCH 1130
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1131 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1189
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1190 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1249
Query: 228 CRG 230
C G
Sbjct: 1250 CAG 1252
>gi|397490588|ref|XP_003816282.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Pan paniscus]
Length = 2776
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1276 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1334
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1335 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1393
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1394 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCRP 1453
Query: 228 CRG 230
C G
Sbjct: 1454 CAG 1456
>gi|441627688|ref|XP_003280142.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Nomascus leucogenys]
Length = 2433
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1192 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1250
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1251 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1309
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1310 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1369
Query: 228 CRG 230
C G
Sbjct: 1370 CAG 1372
>gi|444509617|gb|ELV09373.1| Histone-lysine N-methyltransferase MLL4, partial [Tupaia chinensis]
Length = 2209
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1067 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1125
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1126 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1184
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1185 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1244
Query: 228 CRG 230
C G
Sbjct: 1245 CAG 1247
>gi|297276803|ref|XP_001112093.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Macaca
mulatta]
Length = 2789
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1337 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1395
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1396 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCIQLY 1454
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1455 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1514
Query: 228 CRG 230
C G
Sbjct: 1515 CAG 1517
>gi|33990004|gb|AAH56344.1| Wbp7 protein, partial [Mus musculus]
Length = 2013
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 512 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 570
Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 571 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 629
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 630 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 689
Query: 228 CRG 230
C G
Sbjct: 690 CAG 692
>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4301
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 84/177 (47%), Gaps = 17/177 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F +EG ++L+C C + YH C+ + L W+C C +CE
Sbjct: 188 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 240
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
+C + DP++ + C CD +YH YC PP NV G + C C CGSN P G
Sbjct: 241 MCGKASDPSRLLLCDDCDVSYHTYCLEPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 298
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
W YT C C L CPVC + + + E ++ C C RWVH C+ + E
Sbjct: 299 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTE 349
>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Loxodonta africana]
Length = 4785
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 91/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 905 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 962
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ P GL W
Sbjct: 963 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRCEWQNN 1020
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1021 YTQCAPCASL----STCPVCYRHYREEDL--ILQCRQCDRWMHAICQNLNTEEEVENVAD 1074
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1075 --IGFDCSMCR 1083
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 307 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 363
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 364 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 416
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C VC+RWVH +CD +D ++D L +Y C C+
Sbjct: 417 FCGKCYHPEFQEDMLHCSVCKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 466
>gi|301771069|ref|XP_002920938.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Ailuropoda melanoleuca]
Length = 2611
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1111 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1169
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1170 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1228
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGN+CP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1229 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1288
Query: 228 CRGECY 233
C G +
Sbjct: 1289 CAGATH 1294
>gi|301622725|ref|XP_002940678.1| PREDICTED: hypothetical protein LOC100144721 [Xenopus (Silurana)
tropicalis]
Length = 2771
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 101/207 (48%), Gaps = 18/207 (8%)
Query: 32 NTNVMCRLCFVGENEGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSC 89
N +MC LC R R ++L C+ C + +HR CL+ R L + +W C C
Sbjct: 1229 NPTLMCLLC-------ASRGRHKLLYCQVCCEPFHRFCLEE--SERPLPNQEGTWCCRRC 1279
Query: 90 RICEICRRTGDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGP-YLCPKHTKCHSCGSN 145
+ C +C + G K + C C YH C P P K SG + C +C SCG
Sbjct: 1280 KFCNVCGQKGKAKKPLLECELCQTNYHVNCLGPNYPLKAPRSGKGWTCSACIRCRSCGI- 1338
Query: 146 VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQC 204
PG + C C L+ KGN+CP+C++ Y +SE + M+ C C +W+H +C
Sbjct: 1339 APGKDGDLELTEDSKLCSECSTLYDKGNFCPICIRCYEESEYESKMIQCAKCDKWIHSKC 1398
Query: 205 DGISDEKY-LQFQVDGNLQYRCPTCRG 230
+G+SDE Y L + ++ Y CP C G
Sbjct: 1399 EGLSDEGYELLSNLPDSVVYTCPPCLG 1425
>gi|359318839|ref|XP_003432729.2| PREDICTED: histone-lysine N-methyltransferase MLL4, partial [Canis
lupus familiaris]
Length = 2713
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1212 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1270
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1271 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1329
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGN+CP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1330 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1389
Query: 228 CRGECY 233
C G +
Sbjct: 1390 CAGATH 1395
>gi|149056302|gb|EDM07733.1| rCG63528 [Rattus norvegicus]
Length = 2270
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 771 LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 829
Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 830 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 888
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 889 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 948
Query: 228 CRG 230
C G
Sbjct: 949 CAG 951
>gi|195431535|ref|XP_002063792.1| GK15714 [Drosophila willistoni]
gi|194159877|gb|EDW74778.1| GK15714 [Drosophila willistoni]
Length = 1503
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 122/283 (43%), Gaps = 28/283 (9%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 108
+++ C SCG +H C L N R S W C C C+ICR+ D N KF+ C
Sbjct: 216 KLIMCSSCGDHFHSTCIGLANLPDTR-----SGWCCARCTKCQICRQQ-DSNDIKFVKCE 269
Query: 109 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
+C YH C P ++ + C + C CGS PG G S RW YT CD+C +
Sbjct: 270 QCQKIYHASCLRPVISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329
Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 226
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLMAYHKKKEQNPDYDYVCP 389
Query: 227 TCRGECYQV--RDLEDAVRELWRRKDMADKDLIA---SLRAAAGLPTEDEIFSISPYSDD 281
C+ + L + + +D L+ + G PT +++ S ++
Sbjct: 390 NCKTNSTRPPQSQLTETIDMALSESQTSDTQLMVKDIEMDPLEGRPTTNDVIS----EEN 445
Query: 282 EENGP-----VVLKNEFGRSLKLSLK--GVVDKSPKKVKEHGK 317
+ P V N G+ K L+ GV+ + KK GK
Sbjct: 446 HKQPPGAKKKVCFSNLRGKGTKFMLQRMGVMSQISKKRSTRGK 488
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 88/191 (46%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E M++C CG+ YH C + ++ + W+C C +CE C +
Sbjct: 532 ICVMCGSLGIESDAVMITCAQCGQCYHPYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 589
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC +PP + V G + C T C CG N +V +
Sbjct: 590 NDEARLLLCDECDISYHIYCVNPPLETVPQGTWKCSFCTMCQKCGRNPTEKSDNVDSNMS 649
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
C C C VC Y + E ++ C+ C++W H CD ++ + + + D
Sbjct: 650 E--CPPCA----SQTACSVCTNPYANGEM--IIQCEKCEQWSHFLCDSVNAQLTIDY-YD 700
Query: 219 GNLQYRCPTCR 229
N+ Y+C CR
Sbjct: 701 KNI-YKCLKCR 710
>gi|410899461|ref|XP_003963215.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Takifugu
rubripes]
Length = 3715
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 84/177 (47%), Gaps = 17/177 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F +EG ++L+C C + YH C+ + L W+C C +CE
Sbjct: 418 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 470
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
+C + DP++ + C CD +YH YC PP NV G + C C CGSN P G
Sbjct: 471 MCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 528
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
W YT C C L CPVC + + + E ++ C C RWVH C+ + E
Sbjct: 529 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTE 579
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 61/115 (53%), Gaps = 9/115 (7%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C CG+ YH CL+ A + W+CP C++C+ CR+ G+ +K + C CD
Sbjct: 98 LLFCTGCGQHYHAACLEIGATP---IQRAGWQCPECKVCQTCRKPGEDSKMLVCDACDKG 154
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS---NVPGNGLSVRWFLGYTCCDAC 165
YH +C P ++ + P+ C + C CG+ +PG S +WF Y C+AC
Sbjct: 155 YHTFCLQPAMDSLPTDPWKCKRCRVCTDCGARGLELPG---STQWFENYAVCEAC 206
Score = 40.4 bits (93), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 43/184 (23%), Positives = 63/184 (34%), Gaps = 35/184 (19%)
Query: 38 RLCFVGENEGCERARRMLSCKS--CGKKYHRNCLKNWAQNRDLFHWSSWKCP-------- 87
RLC E C+R + C + C + YH C + + CP
Sbjct: 26 RLC-----EHCKRLGATIRCHAEGCSRFYHFPCSAASGSFQSMKQLL-LLCPEHIDKAKE 79
Query: 88 ----SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
+C C +C G+ + +FC C YH C + + CP+ C +C
Sbjct: 80 LGEEAC--CAVCDSAGELSDLLFCTGCGQHYHAACLEIGATPIQRAGWQCPECKVCQTC- 136
Query: 144 SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 203
PG + CDAC + Y CL+ DS T C C+ C
Sbjct: 137 -RKPGEDSKM------LVCDACDK-----GYHTFCLQPAMDSLPTDPWKCKRCRVCTDCG 184
Query: 204 CDGI 207
G+
Sbjct: 185 ARGL 188
>gi|344298323|ref|XP_003420843.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Loxodonta africana]
Length = 2200
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 94/186 (50%), Gaps = 8/186 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL A+ H +W C C+ C +C R G K + C RC
Sbjct: 827 LVFCQVCCDPFHPFCLDE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGTKHLLECERCRH 885
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 886 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 944
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 945 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1004
Query: 228 CRGECY 233
C G +
Sbjct: 1005 CAGATH 1010
>gi|20521928|dbj|BAA96030.2| KIAA1506 protein [Homo sapiens]
Length = 3310
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 404 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 456
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 457 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 514
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 515 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 568
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 569 ENVAD--IGFDCSMCR 582
>gi|156230798|gb|AAI51838.1| MLL3 protein [Homo sapiens]
Length = 3314
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 404 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 456
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 457 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 514
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 515 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 568
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 569 ENVAD--IGFDCSMCR 582
>gi|432892259|ref|XP_004075732.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oryzias
latipes]
Length = 4536
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 101/210 (48%), Gaps = 23/210 (10%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
V+C LC N + C+ C + +H CL R L + +W C CR C+
Sbjct: 1675 VLCFLCASSGNV------EFVYCRVCCEPFHLFCLGE--SERPLQEQFENWCCRLCRFCQ 1726
Query: 94 ICRRTGDPNK--FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSN 145
C R K + C +C +YH C P N + P ++C K +C SCG+
Sbjct: 1727 ACGRQHQKAKQQLVECDKCRNSYHPECLGP---NYPTKPTKKKRIWICTKCVRCKSCGAT 1783
Query: 146 VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQC 204
PG +W ++ C C +LF KGN+CP+C K Y D + M+ C C++WVH +C
Sbjct: 1784 KPGKSWDAQWSHDFSMCHDCAKLFAKGNFCPLCDKSYSDDYYDSKMMECARCKQWVHAKC 1843
Query: 205 DGISDEKY-LQFQVDGNLQYRCPTCRGECY 233
+ I+DE + L ++ N+ Y C C EC+
Sbjct: 1844 ENITDEMFELLSKLPENIAYTCMKC-AECH 1872
>gi|26006129|dbj|BAC41407.1| mKIAA0304 protein [Mus musculus]
Length = 1744
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 270 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 328
Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 329 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 387
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 388 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 447
Query: 228 CRG 230
C G
Sbjct: 448 CAG 450
>gi|327274410|ref|XP_003221970.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Anolis carolinensis]
Length = 4817
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+LSC CG+ YH C+ + + + H W+C C +CE
Sbjct: 911 MCVVCGSFGKGAEG-----RLLSCSQCGQCYHPYCV-SIKITKVVLH-KGWRCLECTVCE 963
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ P GL
Sbjct: 964 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1021
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CP+C YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 1022 EWQNNYTQCAPCASL----STCPICCCNYREEDL--ILQCRQCDRWMHTVCQNLNTEEEV 1075
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + C CR
Sbjct: 1076 ESTADNG--FDCTMCR 1089
Score = 95.5 bits (236), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 99/226 (43%), Gaps = 21/226 (9%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C +CG+ YH CL + W+CP C++C+ C+ +G+ NK + C CD Y
Sbjct: 332 LFCTTCGQHYHGMCLDIQV---TALKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 388
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P +V + + C C CG+ S +W CD+C K
Sbjct: 389 HTFCLQPVMDSVPTNGWKCKYCRVCAECGTRT-----SSQWHHNCLMCDSCYNQQEKLP- 442
Query: 175 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL-QYRCPTCRG--- 230
CP+C K + C +C+RW+H +CD + ++D +L +Y C C+
Sbjct: 443 CPLCEKTSSPDGQKDRLYCHLCRRWIHIECDRSPNN-----ELDSHLKEYVCSLCKHSIV 497
Query: 231 --ECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFS 274
EC D + +L D D + +A G+ ED++ +
Sbjct: 498 EEECALSCDAMETA-QLLPEPDTGCADEMEIEDSAEGVTNEDQVVT 542
>gi|449510125|ref|XP_004176585.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2-like, partial [Taeniopygia guttata]
Length = 4299
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 102/223 (45%), Gaps = 17/223 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 730 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 787
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP + V G + C C CG+ P
Sbjct: 788 ECIVCEVCGKASDPSRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCVQCGAASP 847
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L V CP C + Y + + ++ C C RW+H CD +
Sbjct: 848 --GFHCEWQNNYTHCAPCASLVV----CPFCREKYVEDDL--LIQCRHCDRWLHAACDSL 899
Query: 208 SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKD 250
E+ ++ D + C C + Y V+ + E+ + KD
Sbjct: 900 FTEEEVEQAADEG--FDCSAC--QPYVVKPVPAPSAEMIKAKD 938
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 53/129 (41%), Gaps = 9/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C +G R + C SCG+ +H CL R S W+CP C++C+ +
Sbjct: 126 CSVC-----DGPGELRDLAFCTSCGQHFHGACLDISLTPR---KRSGWQCPQCKVCQNLQ 177
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
D + + C CD YH C P + + + + C C CG G S +W
Sbjct: 178 PGQD-SAMLVCETCDKGYHTSCTEPAAQGLPTTSWKCKNCWVCSDCGQRPAGPVSSCQWS 236
Query: 157 LGYTCCDAC 165
G C C
Sbjct: 237 PGSEVCGDC 245
>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
Length = 4823
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 84/177 (47%), Gaps = 17/177 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F +EG ++L+C C + YH C+ + L W+C C +CE
Sbjct: 688 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 740
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
+C + DP++ + C CD +YH YC PP NV G + C C CGSN P G
Sbjct: 741 MCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 798
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
W YT C C L CPVC + + + E ++ C C RWVH C+ + E
Sbjct: 799 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTE 849
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 75/154 (48%), Gaps = 13/154 (8%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C CG+ YH CL+ A + W+CP C++C+ CR+ G+ +K + C CD
Sbjct: 227 LLFCTGCGQHYHAACLEIGATP---IQRAGWQCPECKVCQTCRKPGEDSKMLVCDACDKG 283
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS---NVPGNGLSVRWFLGYTCCDACGRLFV 170
YH +C P ++ + P+ C + C CG+ +PG S +WF Y C+AC
Sbjct: 284 YHTFCLQPAMDSLPTDPWKCKRCRVCTDCGARGLELPG---STQWFENYAVCEACQHH-- 338
Query: 171 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
+ C VC K D + C VC R VH C
Sbjct: 339 RNCTCSVCNK--PDGSVATLQSCSVCHRLVHSGC 370
>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Equus caballus]
Length = 4910
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 917 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 974
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 975 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1032
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C +S E+ ++ D
Sbjct: 1033 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLSTEEEVENVAD 1086
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1087 --IGFDCSMCR 1095
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 77/153 (50%), Gaps = 10/153 (6%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 319 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 375
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 376 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 428
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
C K Y M+ C++C+RWVH +CD +D
Sbjct: 429 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTD 461
>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ailuropoda melanoleuca]
Length = 4927
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/223 (30%), Positives = 102/223 (45%), Gaps = 24/223 (10%)
Query: 17 ATASKDIAGEEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHR 66
A++SKD EE NT V+ +C V + G R+L+C CG+ YH
Sbjct: 941 ASSSKD--EEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGAEGRLLACSQCGQCYHP 998
Query: 67 NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
C+ + + W+C C +CE C + DP + + C CD +YH YC PP + V
Sbjct: 999 YCVS--IKITKVVLSKGWRCLECTVCEACGKASDPGRLLLCDDCDISYHTYCLAPPLQTV 1056
Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
G + C C CG+ GL W YT C C L + CPVC + YR+ +
Sbjct: 1057 PKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREED 1110
Query: 187 STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
++ C C RW+H C ++ E+ ++ D + + C CR
Sbjct: 1111 L--ILQCRQCDRWMHAVCQNLNTEEEVENIAD--IGFDCSMCR 1149
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 89/184 (48%), Gaps = 15/184 (8%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 374 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 430
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 431 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 483
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQV 235
C K Y M+ C +C+RWVH +CD +D + QF+ + Y C C+ ++
Sbjct: 484 FCGKCYHPELQKDMLHCSMCKRWVHLECDKPADHELDSQFKEE----YICMYCKHIAVEM 539
Query: 236 RDLE 239
L+
Sbjct: 540 DPLQ 543
>gi|133902336|gb|ABO41859.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4137
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 93/202 (46%), Gaps = 13/202 (6%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
V+C LC N + C+ C + +H CL + D W +W C CR C +
Sbjct: 1559 VVCFLCASSGNV------EFVFCQVCCEPFHLFCLGEAERPHDE-QWENWCCRRCRFCHV 1611
Query: 95 C-RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
C R+ + + C +C +YH C HP ++C K +C SCG+ PG
Sbjct: 1612 CGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKA 1671
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
+W ++ C C + KGN CP+C K Y D + + M+ C C RWVH +C+ ++D
Sbjct: 1672 WDAQWSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTD 1731
Query: 210 EK-YLQFQVDGNLQYRCPTCRG 230
+ L + N+ Y C C G
Sbjct: 1732 DMCELMSSLPENVVYTCTNCTG 1753
>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
Length = 4780
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/223 (30%), Positives = 102/223 (45%), Gaps = 24/223 (10%)
Query: 17 ATASKDIAGEEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHR 66
A++SKD EE NT V+ +C V + G R+L+C CG+ YH
Sbjct: 797 ASSSKD--EEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGAEGRLLACSQCGQCYHP 854
Query: 67 NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
C+ + + W+C C +CE C + DP + + C CD +YH YC PP + V
Sbjct: 855 YCVS--IKITKVVLSKGWRCLECTVCEACGKASDPGRLLLCDDCDISYHTYCLAPPLQTV 912
Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
G + C C CG+ GL W YT C C L + CPVC + YR+ +
Sbjct: 913 PKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREED 966
Query: 187 STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
++ C C RW+H C ++ E+ ++ D + + C CR
Sbjct: 967 L--ILQCRQCDRWMHAVCQNLNTEEEVENIAD--IGFDCSMCR 1005
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 89/184 (48%), Gaps = 15/184 (8%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 230 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 286
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 287 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 339
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQV 235
C K Y M+ C +C+RWVH +CD +D + QF+ + Y C C+ ++
Sbjct: 340 FCGKCYHPELQKDMLHCSMCKRWVHLECDKPADHELDSQFKEE----YICMYCKHIAVEM 395
Query: 236 RDLE 239
L+
Sbjct: 396 DPLQ 399
>gi|195028344|ref|XP_001987036.1| GH21693 [Drosophila grimshawi]
gi|193903036|gb|EDW01903.1| GH21693 [Drosophila grimshawi]
Length = 1461
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 110/233 (47%), Gaps = 18/233 (7%)
Query: 31 SNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCP 87
S+ NV C C +G+ +++ C SCG +H C L N R S W C
Sbjct: 210 SDNNVECLTCSSLGD------LSKLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCA 258
Query: 88 SCRICEICRRT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV 146
C C+ICR+ + KF+ C +C YH C P ++ + C + C CGS
Sbjct: 259 RCTKCQICRQQEANDIKFVKCEQCQKIYHANCLRPVISSIPKYGWKCNRCRVCTDCGSRT 318
Query: 147 PGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDG 206
PG G S RW YT CD+C + KG CP+C K YR + MV C C ++VH CD
Sbjct: 319 PGGGSSSRWHSHYTICDSCYQQRNKGFSCPICQKAYRAAAYKEMVKCSWCHKFVHSTCDE 378
Query: 207 ISD--EKYLQFQVDGNLQYRCPTCR-GECYQVRDLEDAVRELWRRKDMADKDL 256
+D + + + + + Y CP C+ G ++ LE ++ + + + +D+
Sbjct: 379 EADLTAYHKRKEYNPDYDYVCPVCKVGSNIKIEPLERSIADPQASEHFSIRDV 431
Score = 92.4 bits (228), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E +++C C + YH C + +R + W+C C +CE C +
Sbjct: 535 ICVMCGSVGVEGDAELITCAQCAQCYHPYC-ASVKHSRGILQ-KGWRCLDCTVCEGCGKK 592
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC +PP + V G + C T C CG N P ++
Sbjct: 593 NDEARLLLCDECDISYHIYCVNPPLEQVPRGNWKCSFCTICQKCGRN-PTEKIN-HSDSN 650
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
C C N C +C K Y D E ++ C+ C++W+H CD I+ + + + D
Sbjct: 651 SPECPPCA----SQNSCSICSKGYSDGEM--IIQCEQCEQWLHFLCDSINSQHTMDY-YD 703
Query: 219 GNLQYRCPTCR 229
N+ Y+C CR
Sbjct: 704 HNM-YKCIKCR 713
>gi|41350061|gb|AAS00364.1| unknown [Homo sapiens]
Length = 2185
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 20 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 73 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 185 ENVAD--IGFDCSMCR 198
>gi|292621658|ref|XP_002664717.1| PREDICTED: hypothetical protein LOC566825 [Danio rerio]
Length = 3750
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 23/206 (11%)
Query: 36 MCRLCFV-GENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCR 90
+C LC G++E ML C+ C + +HR CL + +N++ +W C C+
Sbjct: 1633 VCLLCASKGQHE-------MLFCQVCCEPFHRFCLDPSERPLEENKE-----NWCCRRCK 1680
Query: 91 ICEIC-RRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +C R+ + + C RC YH C P P N P++C +C SCG P
Sbjct: 1681 FCRVCGRKNKESKPLLECERCQNCYHPACLGPNYPKPNKRKKPWVCMTCIRCRSCGV-TP 1739
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 206
G W C C RLF +GNYC +C K Y D++ + M+ C C WVH +C+G
Sbjct: 1740 GKSWDTEWNHDKGLCPDCTRLFDQGNYCTMCFKCYEDNDYDSQMMQCSTCNHWVHAKCEG 1799
Query: 207 ISDEKY-LQFQVDGNLQYRCPTCRGE 231
++D+ Y + + ++ Y C C E
Sbjct: 1800 LTDDLYEILSSLPESVVYSCQPCLKE 1825
>gi|443723098|gb|ELU11679.1| hypothetical protein CAPTEDRAFT_130729, partial [Capitella teleta]
Length = 625
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 104/208 (50%), Gaps = 27/208 (12%)
Query: 41 FVGENEGCERA----------RRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKC 86
V N C RA +++ C C + +H CL + A N + +W C
Sbjct: 4 LVSSNSLCVRAVCYLCGSGGHNQLIYCSVCCEPFHSFCLDKGERPLADNLE-----NWCC 58
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYH----CYCQHPPHKNVSSGPYLCPKHTKCHSC 142
C+ C +C + N + C +C AYH C ++P + + ++CPK KC SC
Sbjct: 59 RKCQFCRVCGKGS--NNLLQCIQCQDAYHPSQSCLQKYPNKPSKNRRIWVCPKCVKCKSC 116
Query: 143 GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVH 201
G+ PG+ W ++ C++CG L KGN+CPVC K Y D + + M+ C C+ WVH
Sbjct: 117 GATSPGDSSDATWMYDFSLCNSCGLLMSKGNFCPVCHKCYADDDWDSKMMQCSTCESWVH 176
Query: 202 CQCDGISDEKY-LQFQVDGNLQYRCPTC 228
+C+G++DE Y + + ++QY CP C
Sbjct: 177 AKCEGLTDEMYSIMSYLPEDVQYHCPRC 204
>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
Length = 4499
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 471 MCVVCGSFGQGEEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 523
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 524 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 581
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 582 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 635
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 636 ENVAD--IGFDCSMCR 649
>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 4029
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 20 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 73 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 185 ENVAD--IGFDCSMCR 198
>gi|195382495|ref|XP_002049965.1| GJ21880 [Drosophila virilis]
gi|194144762|gb|EDW61158.1| GJ21880 [Drosophila virilis]
Length = 1458
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 97/204 (47%), Gaps = 17/204 (8%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRR--TGDPNKFMFCR 108
+++ C SCG +H C L N R S W C C C+ICR+ T D KF+ C
Sbjct: 220 KLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCARCTKCQICRQHETNDI-KFIKCE 273
Query: 109 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
+C YH C P ++ + C + C CGS PG G S RW YT CD+C +
Sbjct: 274 QCQKMYHAMCLRPTISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 333
Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD--EKYLQFQVDGNLQYRCP 226
KG CP+C K YR + MV C C ++VH CD +D + + + + + Y CP
Sbjct: 334 RNKGFSCPICQKAYRAAAYKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEYNPDYDYVCP 393
Query: 227 TCR-----GECYQVRDLEDAVREL 245
C+ + V LE A+ ++
Sbjct: 394 ICKTSSTAAQLKAVDPLERAITDM 417
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 91/195 (46%), Gaps = 20/195 (10%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + G E +++C CG+ YH C + +R + W+C C +CE C +
Sbjct: 532 ICVMCGTLGIESDSVLITCAQCGQCYHPYC-ASVKHSRGILQ-KGWRCLDCTVCEGCGKK 589
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN----VPGNGLSVR 154
D + + C CD +YH YC PP + V G + C T C CG N V + S+
Sbjct: 590 NDEARLLLCDECDISYHIYCVKPPLETVPHGNWKCSFCTICQKCGRNPTEKVKNSDASLS 649
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQ 214
C C N CP+C K Y D E ++ C+ C++W H CD I+ + ++
Sbjct: 650 E------CLPCA----SQNSCPLCRKAYSDGEM--IIQCEQCEQWSHFLCDSINAQYTME 697
Query: 215 FQVDGNLQYRCPTCR 229
+ D N+ Y+C CR
Sbjct: 698 Y-YDNNV-YKCMKCR 710
>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
Length = 4884
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
Length = 4575
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 565 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 622
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 623 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 680
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 681 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 734
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 735 --IGFDCSMCR 743
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 65/136 (47%), Gaps = 14/136 (10%)
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
R++G+ +K + C CD YH +C P K+V + + C C CG+ S +W
Sbjct: 1 RQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQW 55
Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 215
CD C + + N CP C K Y M+ C++C+RWVH +CD +D
Sbjct: 56 HHNCLICDNCYQQ--QDNLCPFCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH----- 108
Query: 216 QVDGNL--QYRCPTCR 229
++D L +Y C C+
Sbjct: 109 ELDPQLKEEYICMYCK 124
>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
Length = 4025
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 20 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 73 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 185 ENVAD--IGFDCSMCR 198
>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
Length = 4911
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
Length = 4911
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
garnettii]
Length = 4945
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 996 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1053
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1054 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1111
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1112 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1165
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1166 --IGFDCSMCR 1174
Score = 108 bits (271), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 87/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 397 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 453
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ SV+W CD+C + + N CP
Sbjct: 454 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SVQWHHNCLICDSCYQQ--EDNLCP 506
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C+VC+RWVH +CD +D ++D L +Y C C+
Sbjct: 507 FCGKCYHPELQKDMLHCNVCKRWVHLECDKPTDH-----ELDSQLKEEYICMFCK 556
>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Homologous to ALR protein; AltName: Full=Lysine
N-methyltransferase 2C; Short=KMT2C; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
Length = 4911
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
jacchus]
Length = 4909
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 953 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1010
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1011 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1068
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1069 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1122
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1123 --IGFDCSMCR 1131
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 468
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C CR
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCR 518
>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan
troglodytes]
Length = 4026
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 20 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 73 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 185 ENVAD--IGFDCSMCR 198
>gi|301606679|ref|XP_002932944.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 1 [Xenopus
(Silurana) tropicalis]
Length = 3855
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 99/214 (46%), Gaps = 31/214 (14%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+CF+ + G + C+ C + +HR CL+ + + +W C C+ C +C R
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433
Query: 99 GDPNK----------------FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKH 136
K + C +C +YH C P N + P ++C K
Sbjct: 1434 QQATKESIGRQNTISDMSLKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKC 1490
Query: 137 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDV 195
+C SCGS PG G +W ++ C C +LF KGN+CP+C K Y D + + M+ C
Sbjct: 1491 VRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGK 1550
Query: 196 CQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 228
C RWVH +C+ ++DE Y + + ++ Y C C
Sbjct: 1551 CDRWVHSKCENLTDEMYEILSNLPESVAYTCINC 1584
>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
Length = 4884
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 957 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1009
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 1010 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 1067
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 1068 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 1121
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 1122 ENVAD--IGFDCSMCR 1135
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 90/184 (48%), Gaps = 15/184 (8%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 468
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD-EKYLQFQVDGNLQYRCPTCRGECYQV 235
C K Y M+ C++C+RWVH +CD +D E QF+ + Y C C+ ++
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDHELESQFKEE----YICMYCKHLAAEM 524
Query: 236 RDLE 239
L+
Sbjct: 525 DPLQ 528
>gi|328773887|gb|EGF83924.1| hypothetical protein BATDEDRAFT_84646 [Batrachochytrium
dendrobatidis JAM81]
Length = 828
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 96/207 (46%), Gaps = 34/207 (16%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLF---HWSSWKCPSCRICEICRRTGDPNKFMFCRRC 110
+L+C CG K+H C++ +++ L W+C +C++C +C GD +K +FC C
Sbjct: 585 LLNCTQCGTKHHPRCIE--FEDKVLITKVMTFDWRCSNCKLCTVCNNAGDDDKLLFCDTC 642
Query: 111 DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN--------------VPGNGLSVRWF 156
D YH YC +PP + + G +LC + C SC V LS++
Sbjct: 643 DRGYHMYCLNPPLEVLPEGSWLCSECAVCKSCKKRPEKQEGTEDMWRHVVIPPSLSLQEI 702
Query: 157 ----------LG-YTC--CDACGRLFVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHC 202
LG Y C C C F +CP+C+ VY DS+ MVCCD C RWVH
Sbjct: 703 QIKPPPATSALGTYLCTYCTDCYDHFEADRFCPLCIHVYSEDSDDLAMVCCDECDRWVHV 762
Query: 203 QCDG-ISDEKYLQFQVDGNLQYRCPTC 228
CD ++D+ Y + + C C
Sbjct: 763 GCDPELTDDVYQKLVEQEEPAFTCALC 789
>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
mulatta]
Length = 4785
Score = 112 bits (280), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 875 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 932
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 933 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 990
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 991 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1044
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1045 --IGFDCSMCR 1053
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 275 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 331
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 332 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 384
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 385 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 434
>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 4032
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 10 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 62
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 63 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATC--AGLRC 120
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 121 EWQNNYTQCAPCASL----SACPVCFRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEV 174
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 175 ESVAD--IGFDCSMCR 188
>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
leucogenys]
Length = 4856
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 891 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 948
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 949 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1006
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1007 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1060
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1061 --IGFDCSMCR 1069
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 348 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 400
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 401 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 450
>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
Length = 4912
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKYYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
Length = 4916
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 906 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 963
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 964 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1021
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1022 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1075
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1076 --IGFDCSMCR 1084
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 306 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 362
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 363 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 415
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 416 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 465
>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4930
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 953 MCVVCGSFGQGEEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1005
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 1006 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 1063
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 1064 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 1117
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 1118 ENVAD--IGFDCSMCR 1131
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ CR++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHT 414
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCNQQ--QDNLCP 467
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D+ ++D L +Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEEYICMYCK 517
>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4931
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 954 MCVVCGSFGQGEEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1006
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 1007 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 1064
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 1065 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 1118
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 1119 ENVAD--IGFDCSMCR 1132
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ CR++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHT 414
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCNQQ--QDNLCP 467
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D+ ++D L +Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEEYICMYCK 517
>gi|195122760|ref|XP_002005879.1| GI18846 [Drosophila mojavensis]
gi|193910947|gb|EDW09814.1| GI18846 [Drosophila mojavensis]
Length = 1465
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 87/293 (29%), Positives = 123/293 (41%), Gaps = 27/293 (9%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
+++ C SCG +H C L N R S W C C C+ICR+ + KF+ C +
Sbjct: 223 KLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCARCTKCQICRQQEANDIKFIKCEQ 277
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
C YH C P ++ + C + C CGS PG G S RW YT CD+C +
Sbjct: 278 CQKIYHATCLRPVISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 337
Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD--EKYLQFQVDGNLQYRCPT 227
KG CP+C K YR + MV C C ++VH CD +D + + + + Y CP
Sbjct: 338 NKGFSCPICQKAYRAAAYKEMVKCSCCHKFVHSTCDEEADLTAYHKTKEYNPDYDYVCPI 397
Query: 228 CRGECYQVRDLEDAVRELWRRKDMADKDLI----ASLRAAAGLPTEDEIFSISPYSDDEE 283
C+ + ++ R L DM + + G P D + S P
Sbjct: 398 CKSNSAGAKIVDPLERSL---ADMQSSEQFNIKHVEIEPLNGKPCMD-VNSEDPLKLPNT 453
Query: 284 NGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNS 336
V L N G+ K L + + + GKK N R KG Q+ L S
Sbjct: 454 KKKVCLTNIRGKGGKFVLHRM-----GAISQLGKKRSN----RGKGRQLVLQS 497
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 90/192 (46%), Gaps = 12/192 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + G E +++C CG+ YH C + +R + W+C C +CE C +
Sbjct: 533 ICVMCGTLGIESDAVLITCAQCGQCYHPYC-ASVKHSRGMLQ-KGWRCLDCTVCEGCGKK 590
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC +PP + V G + C T C CG N P ++
Sbjct: 591 NDEARLLLCDECDISYHIYCVNPPLETVPHGNWKCSFCTICQKCGRN-PTEKVNFNEPSA 649
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
C + N C VC K Y +E ++ C+ C++W H CD ++ + +++ D
Sbjct: 650 PECLPCASQ-----NNCFVCKKSY--TEGDMIIQCEQCEQWSHFLCDSVNTQLTMEY-YD 701
Query: 219 GNLQYRCPTCRG 230
N+ Y+C CR
Sbjct: 702 NNV-YKCMKCRA 712
>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
domestica]
Length = 4862
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 978 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1035
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ PG W
Sbjct: 1036 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGP--RCEWQNN 1093
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E ++ D
Sbjct: 1094 YTQCAPCASLSI----CPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEDEVENVAD 1147
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1148 --IGFDCTMCR 1156
Score = 99.0 bits (245), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 18/176 (10%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+ +G+ +K + C CD YH
Sbjct: 368 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHT 424
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY-C 175
+C P +V + + C C CG+ S +W CD+C + + N C
Sbjct: 425 FCLQPIMDSVPTNGWKCKNCRICAECGTRT-----SSQWHHNCLVCDSCYQ--PQDNLSC 477
Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
P C K Y+ M+ C +C+RW+H +CD +D ++D L +Y C C+
Sbjct: 478 PFCGKCYQPDLQKDMLHCHMCKRWIHIECDKPADT-----ELDSQLKEEYVCMYCK 528
>gi|427798099|gb|JAA64501.1| Putative phagocytosis engulfment, partial [Rhipicephalus
pulchellus]
Length = 926
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)
Query: 41 FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+VG C+ M+S C CG YH CL W+CP C+ C+
Sbjct: 298 YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 356
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
C R GD + + C CD A+H YC P NV + C C CGS PG+G S R
Sbjct: 357 CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 416
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W + Y+ CD+C + KG CP+C K YR S M C VC++++H +CDG +
Sbjct: 417 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 472
Query: 214 QFQVDGNLQYRCPTC 228
DG+ Y CP C
Sbjct: 473 SSPKDGD--YVCPVC 485
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 95/196 (48%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
+C +C F EG R+++C CG+ YH C+ + + W+C C +CE
Sbjct: 660 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCE 712
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + D ++ + C CD +YH YC PP + V G + C C CG+ PGNG
Sbjct: 713 GCGQPHDESRLLLCDECDISYHTYCLSPPLETVPQGNWKCRWCVICVKCGATEPGNG--S 770
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
+W YT C C + CP+CL Y+DSE ++ C C+RW+H CD IS E+
Sbjct: 771 QWQNNYTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEE-- 822
Query: 214 QFQVDGNLQYRCPTCR 229
+ Y CP CR
Sbjct: 823 DAERCAEYGYNCPYCR 838
>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
porcellus]
Length = 4878
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 910 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 967
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 968 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1025
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1026 YTQCAPCASL----SSCPVCYRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1079
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1080 --IGFDCSMCR 1088
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 315 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 371
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CDAC + + N CP
Sbjct: 372 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHSCLVCDACYQQ--QDNLCP 424
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 425 FCGKCYHPELQKDMLHCNICKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 474
>gi|195489371|ref|XP_002092710.1| GE14338 [Drosophila yakuba]
gi|194178811|gb|EDW92422.1| GE14338 [Drosophila yakuba]
Length = 1481
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 122/278 (43%), Gaps = 21/278 (7%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
+++ C +CG +H C L N R S W C C C+ICR+ + K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 272 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331
Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 227
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391
Query: 228 CRGECYQVRDLEDAVREL-WRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGP 286
C+ + A+ + D + + L SL+ P E + ++ P SD+ P
Sbjct: 392 CKSNSSGPGSSQQAIDSIVLSAMDSSSEQL--SLKEIELDPLEGKP-TMDPSSDELHKLP 448
Query: 287 -----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 317
V L + GRS K L GV+ + KK GK
Sbjct: 449 TGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 486
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 91/204 (44%), Gaps = 14/204 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN-VPGNGLSVRWFL 157
D + + C CD +YH YC +PP + V +G + C T C CG N N L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKNEFGESNML 647
Query: 158 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
C +C + CPVC Y + E ++ C+ C+ W H CD ++ + +
Sbjct: 648 E---CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDTVNAQLTID-HY 697
Query: 218 DGNLQYRCPTCRGECYQVRDLEDA 241
D N+ Y+C CR L D+
Sbjct: 698 DNNV-YKCFKCRCSTRSTNSLTDS 720
>gi|194885797|ref|XP_001976493.1| GG22900 [Drosophila erecta]
gi|190659680|gb|EDV56893.1| GG22900 [Drosophila erecta]
Length = 1481
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 121/288 (42%), Gaps = 41/288 (14%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
+++ C +CG +H C L N R S W C C C+ICR+ + K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 272 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331
Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 227
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391
Query: 228 CR------GECYQVRD-----LEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSIS 276
C+ G Q D D++ E K++ L G PT D
Sbjct: 392 CKSNSSGPGSSQQAIDSIVLSAMDSLSEQLSLKEI-------ELDPLEGKPTMD------ 438
Query: 277 PYSDDEENGP-----VVLKNEFGRSLKLSLK--GVVDKSPKKVKEHGK 317
P SD+ P V L + GRS K L GV+ + KK GK
Sbjct: 439 PSSDELHKLPTGKKKVCLTSVRGRSGKFVLHRIGVMSQINKKRSTRGK 486
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 91/210 (43%), Gaps = 26/210 (12%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 151
D + + C CD +YH YC +PP + V +G + C T C CG N N L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 647
Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 211
C +C + CPVC Y + E ++ C+ C+ W H CD ++ +
Sbjct: 648 E---------CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDTVNAQL 692
Query: 212 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
+ D N+ Y+C CR L DA
Sbjct: 693 TID-HYDNNV-YKCFKCRCSTRSTNSLTDA 720
>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Canis lupus familiaris]
Length = 4874
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 924 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 981
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 982 SDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1039
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1040 YTQCAPCASL----SSCPVCCRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1093
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1094 --IGFDCSMCR 1102
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 87/185 (47%), Gaps = 16/185 (8%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+ TG +FC C Y+
Sbjct: 327 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKETGKNTFVLFCFTCSLNYNP 383
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFLGYTCCDACGRLFVKGNYC 175
+C P + V + + T+C +C V G S +W CD+C + + N C
Sbjct: 384 FCVSPLVRIVPTNLF-----TQCRNCRICVECGTRSSSQWHHNCLVCDSCYQQ--QDNLC 436
Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQ 234
P C K Y M+ C++C+RWVH +CD +D + QF+ + Y C C+ +
Sbjct: 437 PFCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDHELDSQFKEE----YICMYCKHIAAE 492
Query: 235 VRDLE 239
+ L+
Sbjct: 493 MDPLQ 497
>gi|432909101|ref|XP_004078112.1| PREDICTED: uncharacterized protein LOC101174945 [Oryzias latipes]
Length = 3692
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 96/195 (49%), Gaps = 10/195 (5%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+CF+ ++G ML C+ C + +HR CL+ A+ + +W C C+ C +C +
Sbjct: 1730 VCFLCASKG---QHEMLHCQVCCEPFHRFCLEP-AERPSEENKENWCCRRCKFCHVCGKK 1785
Query: 99 GDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
K + C RC YH C P P N ++C + +C SCG PG + W
Sbjct: 1786 NQLTKPLLECERCQNCYHASCLGPNYPKLNKKRKAWVCMRCIRCKSCGV-TPGKSWEIDW 1844
Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-L 213
C C +L+ +GNYCP+C K Y D++ + M+ C C WVH +C+ ++DE Y +
Sbjct: 1845 NHDKGLCPDCSKLYEQGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDLTDELYEI 1904
Query: 214 QFQVDGNLQYRCPTC 228
+ ++ Y C C
Sbjct: 1905 LSSLPESVVYSCRPC 1919
>gi|427791139|gb|JAA61021.1| Putative phagocytosis engulfment, partial [Rhipicephalus
pulchellus]
Length = 741
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)
Query: 41 FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+VG C+ M+S C CG YH CL W+CP C+ C+
Sbjct: 178 YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 236
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
C R GD + + C CD A+H YC P NV + C C CGS PG+G S R
Sbjct: 237 CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 296
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W + Y+ CD+C + KG CP+C K YR S M C VC++++H +CDG +
Sbjct: 297 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 352
Query: 214 QFQVDGNLQYRCPTC 228
DG+ Y CP C
Sbjct: 353 SSPKDGD--YVCPVC 365
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 81/196 (41%), Gaps = 50/196 (25%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
+C +C F EG R+++C CG+ YH C+ + + W+C C +CE
Sbjct: 539 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCE 591
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C G PN CR C C CG+ PGNG
Sbjct: 592 GC---GQPN--WKCRWC--------------------------VICVKCGATEPGNG--S 618
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
+W YT C C + CP+CL Y+DSE ++ C C+RW+H CD IS E+
Sbjct: 619 QWQNNYTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEE-- 670
Query: 214 QFQVDGNLQYRCPTCR 229
+ Y CP CR
Sbjct: 671 DAERCAEYGYNCPYCR 686
>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
Length = 4532
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 530 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 582
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 583 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 640
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 641 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 694
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 695 ENVAD--IGFDCSMCR 708
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 37/60 (61%), Gaps = 2/60 (3%)
Query: 82 SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK--HTKC 139
+ W+CP C++C+ C+++G+ +K + C CD YH +C P K+V + + C + H +C
Sbjct: 13 AGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKRWVHLEC 72
>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
[Oryctolagus cuniculus]
Length = 4865
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 910 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKISKVVLSKGWRCLECTVCEACGKA 967
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 968 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1025
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1026 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1079
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1080 --IGFDCSMCR 1088
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 313 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 369
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 370 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 422
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 423 FCGKWYHPELQKDMLHCNMCKRWVHLECDKPTDN-----ELDSQLKEEYICMYCK 472
>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
harrisii]
Length = 4951
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 1010 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1067
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ PG W
Sbjct: 1068 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGP--RCEWQNN 1125
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E ++ D
Sbjct: 1126 YTQCAPCASL----STCPVCSRNYREEDL--ILQCRQCDRWMHAVCQNLNTEDEVENVAD 1179
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1180 --IGFDCTMCR 1188
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 86/185 (46%), Gaps = 20/185 (10%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+ +G+ +K + C CD YH
Sbjct: 403 CTTCGQHYHGMCLDIAVT---ALKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHT 459
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P +V + + C C CG+ S +W CD C + CP
Sbjct: 460 FCLQPVMDSVPTNGWKCKNCRICAECGTRT-----SSQWHHNCLVCDNCYQP-QDNTACP 513
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCRGECYQ 234
C K Y+ M+ C +C+RW+H +CD +D ++D L+ Y C C+ Q
Sbjct: 514 FCGKCYQPDFQKDMLHCQMCKRWIHIECDKPADT-----ELDSQLKEDYVCMCCK----Q 564
Query: 235 VRDLE 239
+ DLE
Sbjct: 565 LGDLE 569
>gi|225380774|gb|ACN88688.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4219
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 92/200 (46%), Gaps = 13/200 (6%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
V+C LC N + C+ C + +H CL + D W +W C CR C +
Sbjct: 1626 VVCFLCASSGNV------EFVFCQVCCEPFHLFCLGEAERPHD-EQWENWCCRRCRFCHV 1678
Query: 95 C-RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
C R+ + + C +C +YH C HP ++C K +C SCG+ PG
Sbjct: 1679 CGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKA 1738
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
+W ++ C C + KGN CP+C K Y D + + M+ C C RWVH +C+ ++D
Sbjct: 1739 WDAQWSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTD 1798
Query: 210 EK-YLQFQVDGNLQYRCPTC 228
+ L + N+ Y C C
Sbjct: 1799 DMCELMSSLPENVVYTCTNC 1818
>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
Length = 4904
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 953 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1010
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1011 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1068
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1069 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1122
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1123 --IGFDCSMCR 1131
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
C K Y M+ C++C+RWVH +CD +D+ ++D L+ Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517
>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
homolog
Length = 4903
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 952 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1009
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1010 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1067
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1068 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1121
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1122 --IGFDCSMCR 1130
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
C K Y M+ C++C+RWVH +CD +D+ ++D L+ Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517
>gi|195347259|ref|XP_002040171.1| GM16061 [Drosophila sechellia]
gi|194135520|gb|EDW57036.1| GM16061 [Drosophila sechellia]
Length = 1476
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 121/278 (43%), Gaps = 21/278 (7%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
+++ C +CG +H C L N R S W C C C+ICR+ + K++ C +
Sbjct: 211 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 265
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 266 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 325
Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 227
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 326 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 385
Query: 228 CRGECYQVRDLEDAVREL-WRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGP 286
C+ + + + D + + L SL+ P E + S+ P SD+ P
Sbjct: 386 CKSNSSGPGSSQQTIDSIVLSAMDSSSEQL--SLKEIELDPLEGKP-SMDPSSDELHKLP 442
Query: 287 -----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 317
V L + GRS K L GV+ + KK GK
Sbjct: 443 AGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 480
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 92/210 (43%), Gaps = 26/210 (12%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 524 ICVMCGSLGIESDSAMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 581
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 151
D + + C CD +YH YC +PP + V +G + C T C CG N N L
Sbjct: 582 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 641
Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 211
C +C + CPVC Y + E+ ++ C+ C+ W H CD ++ +
Sbjct: 642 E---------CPSC----TSQSSCPVCKVSYSNGET--IIQCEHCELWAHFHCDTVNAQL 686
Query: 212 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
+ D N+ Y+C CR L DA
Sbjct: 687 TID-HYDNNV-YKCFKCRCSTRSTNSLADA 714
>gi|166796317|gb|AAI59185.1| mll4 protein [Xenopus (Silurana) tropicalis]
Length = 1622
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 101/207 (48%), Gaps = 18/207 (8%)
Query: 32 NTNVMCRLCFVGENEGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSC 89
N +MC LC R R ++L C+ C + +HR CL+ R L + +W C C
Sbjct: 32 NPTLMCLLC-------ASRGRHKLLYCQVCCEPFHRFCLEE--SERPLPNQEGTWCCRRC 82
Query: 90 RICEICRRTGDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGP-YLCPKHTKCHSCGSN 145
+ C +C + G K + C C YH C P P K SG + C +C SCG
Sbjct: 83 KFCNVCGQKGKAKKPLLECELCQTNYHVNCLGPNYPLKAPRSGKGWTCSACIRCRSCGI- 141
Query: 146 VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQC 204
PG + C C L+ KGN+CP+C++ Y +SE + M+ C C +W+H +C
Sbjct: 142 APGKDGDLELTEDSKLCSECSTLYDKGNFCPICIRCYEESEYESKMIQCAKCDKWIHSKC 201
Query: 205 DGISDEKY-LQFQVDGNLQYRCPTCRG 230
+G+SDE Y L + ++ Y CP C G
Sbjct: 202 EGLSDEGYELLSNLPDSVVYTCPPCLG 228
>gi|4336749|gb|AAD17932.1| myeloid/lymphoid leukemia 2 [Homo sapiens]
Length = 1010
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 96/183 (52%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 553 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 611
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V+W Y+ C C +L+
Sbjct: 612 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVQWSGDYSLCPRCTQLY 670
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 671 EKGNYCPICTRCYEDNDYESKMMQCAQCDYWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 730
Query: 228 CRG 230
C G
Sbjct: 731 CAG 733
>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
griseus]
Length = 4871
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 935 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKISKVVLSKGWRCLECTVCEACGKA 992
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 993 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1050
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CP+C + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1051 YTQCAPCASL----SSCPICCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1104
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1105 --IGFDCSMCR 1113
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 104/224 (46%), Gaps = 22/224 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 342 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 398
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 399 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCYQQ--QDNLCP 451
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVR 236
C K Y M+ C++C+RWVH +CD +D + L Q+ + Y C C+ ++
Sbjct: 452 FCGKCYNPEFQKDMLYCNMCKRWVHLECDKPTDHE-LDSQIKED--YICMYCKHLGAEID 508
Query: 237 DLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSD 280
L EL + D ++G+ EDE+ + P ++
Sbjct: 509 SLHPG-NELEMPELPTD--------YSSGMEIEDEVLFLDPTAN 543
>gi|161611540|gb|AAI55711.1| mll protein [Xenopus (Silurana) tropicalis]
Length = 2316
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 99/214 (46%), Gaps = 31/214 (14%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+CF+ + G + C+ C + +HR CL+ + + +W C C+ C +C R
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433
Query: 99 GDPNK----------------FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKH 136
K + C +C +YH C P N + P ++C K
Sbjct: 1434 QQATKESIGRQNTISDMSLKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKC 1490
Query: 137 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDV 195
+C SCGS PG G +W ++ C C +LF KGN+CP+C K Y D + + M+ C
Sbjct: 1491 VRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGK 1550
Query: 196 CQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 228
C RWVH +C+ ++DE Y + + ++ Y C C
Sbjct: 1551 CDRWVHSKCENLTDEMYEILSNLPESVAYTCINC 1584
>gi|24762433|ref|NP_611847.2| lost PHDs of trr [Drosophila melanogaster]
gi|21626677|gb|AAF47094.2| lost PHDs of trr [Drosophila melanogaster]
gi|85861118|gb|ABC86508.1| HL01030p [Drosophila melanogaster]
Length = 1482
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 120/288 (41%), Gaps = 41/288 (14%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 109
+++ C +CG +H C L N R S W C C C+ICR+ + K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 272 CQKTYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331
Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 227
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391
Query: 228 CR------GECYQVRD-----LEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSIS 276
C+ G Q D D+ E K++ L G PT D
Sbjct: 392 CKSNSSGPGSSQQTIDSIVLSAMDSSSEQLSLKEI-------ELDPLEGKPTMD------ 438
Query: 277 PYSDDEENGP-----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 317
P SD+ P V L + GRS K L GV+ + KK GK
Sbjct: 439 PSSDELHKLPTGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 486
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 91/210 (43%), Gaps = 26/210 (12%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 151
D + + C CD +YH YC +PP + V +G + C T C CG N N L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 647
Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 211
C +C + CPVC Y + E ++ C+ C+ W H CD ++ +
Sbjct: 648 E---------CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDSVNAQL 692
Query: 212 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
+ D N+ Y+C CR L DA
Sbjct: 693 TID-HYDNNV-YKCFKCRCSTRSTNSLADA 720
>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Oreochromis
niloticus]
Length = 4907
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 93/201 (46%), Gaps = 15/201 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ ++ +C V + G ++L+C C + YH C+ + L W+C
Sbjct: 836 SNTDKFVLLQDMCVVCGSFGKGVEGQLLACAQCAQCYHPYCVNSKITKTML--RKGWRCL 893
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CGSN P
Sbjct: 894 ECIVCEVCGKASDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSP 953
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CPVC + + + E ++ C C RWVH C+ +
Sbjct: 954 --GFHCEWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESL 1005
Query: 208 SDEKYLQFQVDGNLQYRCPTC 228
E ++ D + C +C
Sbjct: 1006 YTEDEVEQASDEG--FACTSC 1024
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 77/154 (50%), Gaps = 13/154 (8%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C CG YH CL+ A + W+CP C++C+ CR+ G+ +K + C CD
Sbjct: 230 LLFCTGCGLHYHAACLEIGATP---IQRAGWQCPECKVCQTCRQPGEDSKMLVCDACDKG 286
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN---VPGNGLSVRWFLGYTCCDACGRLFV 170
YH +C P ++ S P+ C + C CG +PG S +WF Y C+ C
Sbjct: 287 YHTFCLQPAMDSLPSDPWKCRRCRVCMVCGVRGLVLPG---SAQWFDNYAVCEGCQHH-- 341
Query: 171 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
+ + C VC K S + + CC +C RWVH +C
Sbjct: 342 RSSICCVCSKAANPSVA--LQCCSMCHRWVHSEC 373
>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
Length = 3396
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 845 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 902
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 903 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 960
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 961 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1014
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 1015 --IGFDCSMCR 1023
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 290 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 346
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 347 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 399
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
C K Y M+ C++C+RWVH +CD +D+ ++D L+ Y C C+
Sbjct: 400 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 449
>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
mutus]
Length = 4905
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 99/224 (44%), Gaps = 22/224 (9%)
Query: 16 VATASKDIAGEEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYH 65
A AS + EE NT V+ +C V + G R+L+C CG+ YH
Sbjct: 865 AADASSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGAEGRLLACSQCGQCYH 924
Query: 66 RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKN 125
C+ + + W+C C +CE C + DP + + C CD +YH YC PP +
Sbjct: 925 PYCVS--IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQT 982
Query: 126 VSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS 185
V G + C C CG+ +G W YT C C L + CPVC + YR+
Sbjct: 983 VPKGGWKCKWCVWCRHCGAT--SSGPRCEWQNNYTQCAPCASL----SSCPVCCRNYREE 1036
Query: 186 ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
+ ++ C C RW+H C S E+ ++ D + + C CR
Sbjct: 1037 DL--ILQCRQCDRWMHAVCQNFSTEEEVENVAD--IGFDCSLCR 1076
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 90/184 (48%), Gaps = 15/184 (8%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 305 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 361
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 362 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QENLCP 414
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD-EKYLQFQVDGNLQYRCPTCRGECYQV 235
C K Y M+ C++C+RWVH +CD +D E LQ + + Y C C+ ++
Sbjct: 415 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPADHEPDLQLREE----YICTYCKHLAAEM 470
Query: 236 RDLE 239
L+
Sbjct: 471 GPLQ 474
>gi|327280514|ref|XP_003224997.1| PREDICTED: hypothetical protein LOC100556600 [Anolis carolinensis]
Length = 2812
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 98/208 (47%), Gaps = 20/208 (9%)
Query: 32 NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
T ++C LC + +++ C+ C +H CL++ Q SW C C+
Sbjct: 1231 TTQLVCLLC------ASKGFHQLVFCQVCCDPFHVFCLEDDEQPLP-EQEESWCCRRCKF 1283
Query: 92 CEICRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGPY------LCPKHTKCHSCGS 144
C +C R +K + C RC YH C P N + P+ +C +C SCG+
Sbjct: 1284 CHVCGRKNKASKQLLECERCRNCYHLACLGP---NYPTKPFRKRKNWVCSACIRCKSCGT 1340
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQ 203
PG W Y+ C AC L KGNYCP+CL Y D++ + M+ C C WVH +
Sbjct: 1341 -APGKNWDTEWSNDYSLCSACSVLHDKGNYCPICLHCYEDNDYESKMMQCAKCDHWVHAK 1399
Query: 204 CDGISDEKY-LQFQVDGNLQYRCPTCRG 230
C+G+SDE Y + + ++ Y C C G
Sbjct: 1400 CEGLSDEGYEILSNLPESVVYACRPCCG 1427
>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
Length = 4897
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 92/201 (45%), Gaps = 15/201 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ ++ +C V + G ++L+C C + YH C+ + L W+C
Sbjct: 817 SNTDKFVLLQDMCVVCGSFGKGVEGQLLACAQCAQCYHPYCVNSKITKTML--RKGWRCL 874
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C DP++ + C CD +YH YC PP V G + C C CGSN P
Sbjct: 875 ECIVCEVCGEASDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSP 934
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CPVC + + + E ++ C C RWVH C+ +
Sbjct: 935 --GFHCEWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESL 986
Query: 208 SDEKYLQFQVDGNLQYRCPTC 228
E ++ D + C +C
Sbjct: 987 YTEDEVEQASDEG--FACTSC 1005
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 79/159 (49%), Gaps = 13/159 (8%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C CG YH CL A + W+CP C++C+ CR+ G+ +K + C C+
Sbjct: 229 LLFCTGCGLHYHATCLDTGATP---ILRAGWQCPECKVCQTCRQPGEDSKMLVCDSCEKG 285
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN---VPGNGLSVRWFLGYTCCDACGRLFV 170
H +C P +V S + C C CG + +PG + +WF YT C+ C
Sbjct: 286 CHTFCLQPAMDSVPSDRWKCRSCRVCMECGVHGLVLPG---TAQWFESYTLCEGCQHH-- 340
Query: 171 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
+ + C VC K D+ S + CC +C RW+H +C +++
Sbjct: 341 RSSICCVCSKP--DNPSVSLQCCSLCHRWMHSECSSLTE 377
>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4455
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 92/215 (42%), Gaps = 22/215 (10%)
Query: 26 EEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
EE NT VM +C V + G R+L+C CG+ YH C+ N
Sbjct: 998 EENSMHNTVVMFSTSDHFTLKQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCV-NIKIT 1056
Query: 76 RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
R + W+C C +CE C DP + + C CD +YH YC PP V G + C
Sbjct: 1057 RVIL-TKGWRCLECTVCEACGDASDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKW 1115
Query: 136 HTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C CGS P G+ W Y+ C C L + CP C + Y +E ++ C
Sbjct: 1116 CVWCVQCGSTSP--GVHSDWQRNYSLCGPCCSL----SRCPACQQAY--AEDDLILQCQQ 1167
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
C RWVH C G+ E ++ D + C C+
Sbjct: 1168 CDRWVHATCQGLCTEDEVEVAADEG--FDCSLCKA 1200
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 45/101 (44%), Gaps = 7/101 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
CRLC + G +L C CG YH +CL L W+CP CR+C C
Sbjct: 464 CRLCAGSGDSGG-----LLMCSCCGSCYHGSCLDPPVTPSPLSRVG-WQCPQCRVCRSCS 517
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHT 137
GD + + C RCD AYH +C PP + + C T
Sbjct: 518 LQGD-SGVLLCARCDKAYHAHCLTPPLDDAPHAAWTCKAET 557
>gi|312384476|gb|EFR29199.1| hypothetical protein AND_02074 [Anopheles darlingi]
Length = 2401
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 68/213 (31%), Positives = 98/213 (46%), Gaps = 12/213 (5%)
Query: 34 NVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
++ CR C + G ++ C CG YH C+ AQ + + W+C SC+ C+
Sbjct: 625 DINCRQCSALGDVG-----NLIICSLCGDHYHGTCV-GLAQLPGV--RTGWQCNSCKKCQ 676
Query: 94 ICRRT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
ICR + + C CD YH C P ++ + C C CG+ PG G S
Sbjct: 677 ICRVPDSSEGRSVACELCDKIYHASCLRPIMTSIPKFGWKCRCCRVCSDCGARTPGAGAS 736
Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
RW YT CD+C + KG CP+C + YR + MV C VC ++VH CD +D
Sbjct: 737 SRWHSHYTVCDSCYQQRNKGFSCPICHRAYRAAAYREMVKCSVCSKFVHSTCDPDADLTV 796
Query: 213 LQFQVDGN--LQYRCPTCRGECYQVRDLEDAVR 243
+ + N +Y C C+ + R L AVR
Sbjct: 797 YNGRKEANPDYEYLCTPCKAAIHSGR-LVAAVR 828
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 96/224 (42%), Gaps = 22/224 (9%)
Query: 16 VATASKDIAGEEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYH 65
+A++S +E G N V+C +C + G ++ +++C CG+ YH
Sbjct: 985 LASSSIKTTEDEPGGENRLVLCSAKDKFVLTQDICVMCGAIGTDQEGCLIACTQCGQCYH 1044
Query: 66 RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKN 125
C + + W+C C ICE C + D + + C CD +YH YC PP +
Sbjct: 1045 PYCTN--VKVTKVILQKGWRCLDCTICEGCGQRNDEARLILCDDCDISYHIYCMDPPLEQ 1102
Query: 126 VSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS 185
V G + C C CG+N P G + W YT C C CP C Y D
Sbjct: 1103 VPQGTWKCKWCAICQKCGTNSP--GFNSGWMNSYTECGPC----ASQTNCPSCNDGYADG 1156
Query: 186 ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
E ++ C C+RW+HC CD I +E + Y C CR
Sbjct: 1157 EL--IIQCHQCERWLHCACDQIKNEAEAERCA--EEAYNCLICR 1196
>gi|194754301|ref|XP_001959434.1| GF12873 [Drosophila ananassae]
gi|190620732|gb|EDV36256.1| GF12873 [Drosophila ananassae]
Length = 1486
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 97/206 (47%), Gaps = 19/206 (9%)
Query: 31 SNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCP 87
S+ NV C C +G+ +++ C +CG +H C L N R S W C
Sbjct: 200 SDNNVECLTCSSLGD------LSKLIMCSTCGDHFHSTCVGLANLPDTR-----SGWNCA 248
Query: 88 SCRICEICRRTGDPN--KFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN 145
C C+ICR D N K++ C +C YH C P + + C + C CGS
Sbjct: 249 RCTKCQICR-VQDSNDLKYVKCEQCQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSR 307
Query: 146 VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCD 205
PG G S RW YT CD+C + KG CP+C K YR + MV C C ++VH CD
Sbjct: 308 TPGGGSSSRWHSHYTICDSCYQQRNKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCD 367
Query: 206 GISD--EKYLQFQVDGNLQYRCPTCR 229
+D + + +++ + Y CP C+
Sbjct: 368 EEADLTAYHKKKELNPDYDYVCPNCK 393
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 87/202 (43%), Gaps = 34/202 (16%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSAMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 147
D + + C CD +YH YC +PP + V +G + C T C CG SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNMP 647
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
C C + C VC Y + E ++ C+ C+ W H CD +
Sbjct: 648 E-------------CPPCA----SQSACNVCKSAYANGEM--IIQCEHCELWSHFLCDTV 688
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
+ + + D N+ Y+C CR
Sbjct: 689 NAQLTID-HYDSNI-YKCLKCR 708
>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
Length = 5261
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 93/202 (46%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 961 SNTDKFVLMQDMCVVCGSFGRGSEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1018
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1019 ECIVCEVCGKASDPSRLLLCDDCDISYHTYCLDPPLNTVPKGGWKCKWCVCCVQCGAVSP 1078
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H CD +
Sbjct: 1079 --GFHCEWQNNYTHCAPCASLVT----CPICQVKYVEEDL--LIQCQHCERWMHAVCDNL 1130
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E+ ++ D + C +C+
Sbjct: 1131 FTEEEVEQAADEG--FDCTSCQ 1150
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 40 CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 99
C V + G R +L C SCG YH CL+ R S W+C C++C+ CR +G
Sbjct: 234 CMVCDAPG--ELRDLLFCTSCGLHYHGTCLEITVTPRK---RSGWQCHECKVCQTCRLSG 288
Query: 100 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ ++ + C C+ YH YC P ++V + + C
Sbjct: 289 EDSRMLVCEACEKCYHTYCLKPAIESVPADSWKC 322
>gi|402865478|ref|XP_003896948.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Papio
anubis]
Length = 1431
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 695 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 752
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 753 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 810
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 811 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 864
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 865 --IGFDCSMCR 873
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 95 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 151
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 152 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 204
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 205 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 254
>gi|195149375|ref|XP_002015633.1| GL11176 [Drosophila persimilis]
gi|194109480|gb|EDW31523.1| GL11176 [Drosophila persimilis]
Length = 1486
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 86/183 (46%), Gaps = 12/183 (6%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 108
+++ C +CG +H C L N R S W C C C+ICR+ D N K++ C
Sbjct: 216 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQ-DSNDLKYVKCE 269
Query: 109 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
+C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 270 QCQKIYHASCFRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329
Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 226
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEQNPDYDYICP 389
Query: 227 TCR 229
C+
Sbjct: 390 NCK 392
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 90/202 (44%), Gaps = 34/202 (16%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E M++C CG+ YH C + ++ + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHSYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 587
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 147
D + + C CD +YH YC +PP + V SG + C T C CG SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPSGNWKCSFCTLCQKCGLNPTEKSDYGDSNMP 647
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
C +C + C VC Y E ++ C+ C+ W H CD I
Sbjct: 648 E-------------CPSC----TSQSSCSVCRNPYSTGEM--IIQCETCELWSHFLCDSI 688
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
+ + +++ D N+ Y+C CR
Sbjct: 689 NVQLTIEY-YDQNV-YKCLKCR 708
>gi|198456152|ref|XP_001360232.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
gi|198135513|gb|EAL24806.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
Length = 1486
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 86/183 (46%), Gaps = 12/183 (6%)
Query: 53 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 108
+++ C +CG +H C L N R S W C C C+ICR+ D N K++ C
Sbjct: 216 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQ-DSNDLKYVKCE 269
Query: 109 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
+C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 270 QCQKIYHASCFRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329
Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 226
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEQNPDYDYICP 389
Query: 227 TCR 229
C+
Sbjct: 390 NCK 392
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 90/202 (44%), Gaps = 34/202 (16%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G E M++C CG+ YH C + ++ + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHSYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 587
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 147
D + + C CD +YH YC +PP + V SG + C T C CG SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPSGNWKCSFCTLCQKCGLNPTEKSDYGDSNMP 647
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
C +C + C VC Y E ++ C+ C+ W H CD I
Sbjct: 648 E-------------CPSC----TSQSSCSVCRNPYSTGEM--IIQCETCELWSHFLCDSI 688
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
+ + +++ D N+ Y+C CR
Sbjct: 689 NVQLTIEY-YDQNV-YKCLKCR 708
>gi|348530100|ref|XP_003452549.1| PREDICTED: hypothetical protein LOC100689867 [Oreochromis niloticus]
Length = 2557
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 102/216 (47%), Gaps = 16/216 (7%)
Query: 54 MLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCRICEIC-RRTGDPNKFMFCR 108
M+ C+ C + +H CL + +N++ +W C C+ C +C RR+ + CR
Sbjct: 1183 MIFCQICCEPFHSFCLSPEERPLEENKE-----NWFCRRCKFCHVCGRRSKSTKPVLQCR 1237
Query: 109 RCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACG 166
RC +YH C P P SS P++C +C SCG PG + W C C
Sbjct: 1238 RCQTSYHPSCLGPTYPKPMNSSVPWVCMTCIRCKSCGVT-PGKTWDLTWNHEQDLCPDCT 1296
Query: 167 RLFVKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYR 224
L KGN+C +C K Y DS + + M+ C C W+H +C+GIS+E + L G + +
Sbjct: 1297 SLHKKGNFCTICHKCYEDSIQPSQMLQCSQCSHWIHYRCEGISEELFGLLTSQPGRVDFT 1356
Query: 225 CPTCRGECYQVRDL-EDAVRELWRRKDMADKDLIAS 259
C C L E+ R L R + DL++S
Sbjct: 1357 CSPCSQHQTSHSILKEELQRRLTARVEEVLTDLLSS 1392
>gi|392350034|ref|XP_003750554.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Rattus
norvegicus]
Length = 3894
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 99/197 (50%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1362 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1416
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1417 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1476
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+G+SDE Y
Sbjct: 1477 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSDEMY 1536
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1537 EILSNLPESVAYTCVNC 1553
>gi|392341954|ref|XP_003754471.1| PREDICTED: histone-lysine N-methyltransferase MLL [Rattus norvegicus]
Length = 3987
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 99/197 (50%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1455 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1509
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1510 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1569
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+G+SDE Y
Sbjct: 1570 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSDEMY 1629
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1630 EILSNLPESVAYTCVNC 1646
>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
(Silurana) tropicalis]
Length = 6019
Score = 108 bits (271), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 93/202 (46%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 798 SNTDKFVLMQDMCVVCGSFGRGSEGHLLACSQCSQCYHPYCVNSRITKVMLL--KGWRCV 855
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 856 ECIVCEVCGKATDPSRLLLCDDCDISYHTYCLDPPLHTVPKGGWKCRWCVSCMQCGAVTP 915
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CPVC Y + + ++ C C+RW+H C+ +
Sbjct: 916 --GFRSEWQNNYTHCAPCASLV----SCPVCHLKYLEGDL--LIQCRHCERWLHAVCENL 967
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E+ ++ D + C +C+
Sbjct: 968 FTEEEVEQAADEG--FDCSSCQ 987
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 3/125 (2%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C SCG YH CL+ S W+CP C++C+ CR+ G+ + C CD
Sbjct: 233 LLFCTSCGLHYHGTCLEITVSP---LKRSGWQCPECKVCQTCRQPGEDTMMLVCDACDKG 289
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
YH +C P + + + + C C CGS +W+ Y+ C C +
Sbjct: 290 YHTFCLKPAIECLPTDSWKCKTCRVCRICGSRTAHMEPGSQWYDNYSVCSKCQEKRNRAE 349
Query: 174 YCPVC 178
C +C
Sbjct: 350 TCVLC 354
>gi|321469512|gb|EFX80492.1| hypothetical protein DAPPUDRAFT_318677 [Daphnia pulex]
Length = 1953
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 98/214 (45%), Gaps = 24/214 (11%)
Query: 29 GQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
G S+ N C +C N L C +CGK YH +C+ ++W+C
Sbjct: 449 GPSDAN--CAVCHSPSN-----VANQLFCVTCGKHYHGSCV---GLGSSPGVRTAWQCNE 498
Query: 89 CRICEICRR------TG-----DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHT 137
C++C CR TG D K + C CD YH C P N+ + C
Sbjct: 499 CKVCITCRTPVAQQGTGAEAVTDRTKMLVCDTCDKNYHPSCVRPLISNIPKLGWKCKNCR 558
Query: 138 KCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
C CGS PG+G S RW +T CD+C + KG CP+C K YR S+ M C C+
Sbjct: 559 VCGDCGSRTPGSGPSSRWHACFTVCDSCYQQRNKGVSCPMCGKAYRHSQRE-MSQCTRCR 617
Query: 198 RWVHCQCDGISDEKYLQFQVDGN--LQYRCPTCR 229
++VH CD +D +Q + D N +Y CP C+
Sbjct: 618 KYVHSGCDPEADRTLVQRKKDMNSDYEYLCPPCK 651
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 40 CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 99
C + + G ++ R++SC CG+ YH C + + W+C C +CE C
Sbjct: 856 CAMCGSFGLDQEGRLISCAQCGQCYHPFCAN--VKVTKVILQKGWRCLDCTVCEGCGERH 913
Query: 100 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF--L 157
D + + C CD +YH YC PP V G + C C CGSN P GL+ W
Sbjct: 914 DEARLLLCDECDISYHIYCMEPPLDYVPQGNWKCKWCAVCQVCGSNEP--GLNANWTHQA 971
Query: 158 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
+ C C L CP C Y + E ++ C C +W+H CD I +E+ +F
Sbjct: 972 NGSLCGPCASL----RQCPSCSSSYNEGEL--IIQCQQCAQWLHAACDLIRNEREAEFCA 1025
Query: 218 DGNLQYRCPTCR 229
+ Y C CR
Sbjct: 1026 EDG--YTCVLCR 1035
>gi|170050214|ref|XP_001859681.1| set domain protein [Culex quinquefasciatus]
gi|167871729|gb|EDS35112.1| set domain protein [Culex quinquefasciatus]
Length = 2934
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 92/200 (46%), Gaps = 21/200 (10%)
Query: 33 TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C +G + EGC +++C CG+ YH C + + W+C C
Sbjct: 627 TQDICVMCGAIGTDQEGC-----LIACTQCGQCYHPYCTN--VKVTKVILQKGWRCLDCT 679
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
ICE C + D + + C CD +YH YC PP + V G + C C CG+N P G
Sbjct: 680 ICEGCGQRNDEGRLILCDDCDISYHTYCMDPPLEQVPQGNWKCKWCAICLKCGTNDP--G 737
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
+ W Y+ C C CP C + Y D E ++ C+ C+RW+HC CD I E
Sbjct: 738 YNCAWLNNYSECGPCASQV----SCPCCGEGYADGEL--IIQCNQCERWLHCGCDQIKSE 791
Query: 211 KYLQ-FQVDGNLQYRCPTCR 229
+ DG Y C CR
Sbjct: 792 NEAERCAEDG---YNCLLCR 808
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 101/229 (44%), Gaps = 17/229 (7%)
Query: 34 NVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
++ CR C + G ++ C CG YH C+ AQ + S W+C SC+ C+
Sbjct: 288 DINCRQCSALGDVG-----NLMMCSICGDHYHGTCV-GLAQLPGV--RSGWQCGSCKKCQ 339
Query: 94 ICRRT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
ICR + + C +CD YH C P ++ + C C CGS PG G S
Sbjct: 340 ICRVPDSSEGRTVGCEQCDKIYHASCLRPIMTSIPKYGWKCRCCRICSDCGSRTPGAGAS 399
Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
RW ++ CD+C + KG CP+C + YR + MV C C ++VH CD +D
Sbjct: 400 SRWHAHFSVCDSCYQQRNKGFSCPICHRAYRAAAHREMVKCSGCNKFVHSTCDAEADLSV 459
Query: 213 LQFQVDGN--LQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIAS 259
+ + N +Y C C+ + R RR D D +++
Sbjct: 460 YHAKKETNPDYEYLCSPCKTAIHSGR------MAAMRRNSSVDDDSMSA 502
>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Ovis aries]
Length = 4922
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 90/196 (45%), Gaps = 19/196 (9%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 923 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 975
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G + C C CG+ G
Sbjct: 976 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGPRG 1033
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
W YT C C L + CPVC + YR+ + ++ C C RW+H C S E+ +
Sbjct: 1034 EWQNNYTQCAPCASL----SACPVCHRNYREEDL--ILQCRQCDRWMHAVCQNFSTEEEV 1087
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 1088 ENVAD--IGFDCSLCR 1101
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 78/155 (50%), Gaps = 10/155 (6%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 330 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 386
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 387 FCLQPVMKSVPTNGWRCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QENLCP 439
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 211
C K Y M+ C++C+RWVH +CD +D +
Sbjct: 440 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPADHE 474
>gi|224083075|ref|XP_002188579.1| PREDICTED: histone-lysine N-methyltransferase MLL [Taeniopygia
guttata]
Length = 3849
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 99/196 (50%), Gaps = 10/196 (5%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+CF+ + G + C+ C + +H+ CL+ + ++ +W C C+ C +C R
Sbjct: 1297 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEESERPQE-DQLENWCCRRCKFCHVCGRQ 1352
Query: 99 GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVR 154
K + C +C +YH C P + + ++C K +C SCGS PG G +
Sbjct: 1353 HQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQ 1412
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY- 212
W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1413 WSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYE 1472
Query: 213 LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1473 ILSNLPESVAYTCINC 1488
>gi|119578438|gb|EAW58034.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_a [Homo
sapiens]
Length = 4539
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 370 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 427
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 428 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 487
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 488 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 539
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 540 FTEDDVEQAADEG--FDCVSCQ 559
>gi|66811728|ref|XP_640043.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|60468063|gb|EAL66073.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 795
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 88/190 (46%), Gaps = 27/190 (14%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCRICEICRRTGDPNKFMFCRRCD 111
+++C SC KKYH CL + D + + WKC C+ CE+C +G K +FC CD
Sbjct: 578 LITCSSCSKKYHAKCLNLHQKCIDKYREDPTQWKCTDCKSCELCDDSGHDEKMLFCDVCD 637
Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS-VRWFLGYTCCDACGRLFV 170
YH +C PP G + C C C S V N L+ ++W YTCCD+C F
Sbjct: 638 KGYHTFCLTPPLSQTPEGGWRCNDCAFCIHCYSRVDKNSLNKIKWKENYTCCDSC---FS 694
Query: 171 KG-----NYCPVCLKVYRD--SESTPMVCCDVCQRWVHCQC-----DGISDEKYLQFQVD 218
KG YCP+C +D E + C C + VH C D + +E ++
Sbjct: 695 KGFSEKSKYCPICSHSIKDEGEEEDSITTCQYCHKSVHDHCDQNIIDNLENEHFI----- 749
Query: 219 GNLQYRCPTC 228
Y+CP C
Sbjct: 750 ----YKCPNC 755
>gi|326933334|ref|XP_003212761.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Meleagris gallopavo]
Length = 3851
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1310 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1364
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1365 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1424
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1425 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1484
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1485 EILSNLPESVAYTCINC 1501
>gi|431901376|gb|ELK08402.1| Histone-lysine N-methyltransferase MLL2 [Pteropus alecto]
Length = 5640
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1369 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1426
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1427 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1486
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1487 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHASCESL 1538
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1539 FTEDDVEQAADEG--FDCVSCQ 1558
Score = 87.0 bits (214), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 61/131 (46%), Gaps = 8/131 (6%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
V C +C G E C+ + C SCG YH CL R + W+CP C++C+
Sbjct: 246 VRCAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQA 297
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
CR+ G+ +K + C CD YH +C PP + + + C C +CG+ +
Sbjct: 298 CRKPGNDSKMLVCETCDKGYHTFCLKPPMEELPPHSWKCKACRVCRACGAGSAELNPNSE 357
Query: 155 WFLGYTCCDAC 165
WF Y+ C C
Sbjct: 358 WFENYSLCHRC 368
>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
harrisii]
Length = 5047
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 908 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 965
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 966 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAVSP 1025
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP C Y + + ++ C C+RW+H C+ +
Sbjct: 1026 --GFHCEWQNSYTHCGPCASLVT----CPACRAPYVEEDL--LIQCRHCERWMHAGCESL 1077
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E+ ++ D + C +C+
Sbjct: 1078 FTEEEVEQAADEG--FDCASCQ 1097
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 67/141 (47%), Gaps = 8/141 (5%)
Query: 27 EQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
E+ + + C +C +G R +L C SCG+ YH CL R + W+C
Sbjct: 218 EEAEHLEDAHCVVC-----DGLGELRDLLFCTSCGQHYHGACLDTALTARK---RAGWQC 269
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV 146
P C++C+ CR+ G+ + + C CD YH +C P +++ + C C +CG+
Sbjct: 270 PDCKVCQTCRQPGEDSMMLVCEACDKGYHTFCLKPAIQSLPPDSWKCKTCRVCRACGACP 329
Query: 147 PGNGLSVRWFLGYTCCDACGR 167
+ +W+ Y+ C+ C R
Sbjct: 330 AELDPNCQWYENYSLCERCQR 350
>gi|313471390|sp|Q6PDK2.2|MLL2_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=Lysine N-methyltransferase 2D; Short=KMT2D
Length = 5588
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1324 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1381
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1382 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1441
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CPVC Y + + ++ C C+RW+H C+ +
Sbjct: 1442 --GFHCEWQNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESL 1493
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1494 FTEDEVEQAADEG--FDCVSCQ 1513
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C EG + +L C SCG YH CL R +SW+CP C++C+ CR
Sbjct: 229 CAVC-----EGPGQLCDLLFCTSCGHHYHGACLDTALTARK---RASWQCPECKVCQSCR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|359718904|ref|NP_001028448.3| histone-lysine N-methyltransferase MLL2 [Mus musculus]
Length = 5588
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1324 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1381
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1382 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1441
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CPVC Y + + ++ C C+RW+H C+ +
Sbjct: 1442 --GFHCEWQNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESL 1493
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1494 FTEDEVEQAADEG--FDCVSCQ 1513
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C EG + +L C SCG YH CL R +SW+CP C++C+ CR
Sbjct: 229 CAVC-----EGPGQLCDLLFCTSCGHHYHGACLDTALTARK---RASWQCPECKVCQSCR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|432105765|gb|ELK31956.1| Histone-lysine N-methyltransferase MLL, partial [Myotis davidii]
Length = 3463
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +HR CL+ R L +W C C+ C +C R
Sbjct: 914 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 968
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 969 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1028
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1029 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1088
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1089 EILSSLPESVAYTCVNC 1105
>gi|417414196|gb|JAA53397.1| Putative histone-lysine n-methyltransferase mll, partial [Desmodus
rotundus]
Length = 3966
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1403 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1457
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1458 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1517
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1518 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1577
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1578 EILSNLPESVAYTCVNC 1594
>gi|363742545|ref|XP_417896.3| PREDICTED: histone-lysine N-methyltransferase MLL [Gallus gallus]
Length = 3871
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1314 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1368
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1369 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1428
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1429 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1488
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1489 EILSNLPESVAYTCINC 1505
>gi|392341685|ref|XP_001062568.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Rattus norvegicus]
Length = 5543
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1324 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1381
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1382 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1441
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1442 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1493
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1494 FTEDEVEQAADEG--FDCVSCQ 1513
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340
Query: 157 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
Y+ C C + V+G+ V +E P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369
>gi|281343718|gb|EFB19302.1| hypothetical protein PANDA_017001 [Ailuropoda melanoleuca]
Length = 4932
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 806 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 863
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 864 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 923
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 924 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYMEEDL--LIQCRHCERWMHAGCESL 975
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 976 FTEDDVEQAADEG--FDCVSCQ 995
>gi|392355921|ref|XP_002729900.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Rattus
norvegicus]
Length = 5543
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1324 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1381
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1382 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1441
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1442 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1493
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1494 FTEDEVEQAADEG--FDCVSCQ 1513
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340
Query: 157 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
Y+ C C + V+G+ V +E P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369
>gi|334330381|ref|XP_001380704.2| PREDICTED: histone-lysine N-methyltransferase MLL [Monodelphis
domestica]
Length = 3960
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1421 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1475
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1476 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1535
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1536 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1595
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1596 EILSNLPESVAYTCVNC 1612
>gi|355752689|gb|EHH56809.1| hypothetical protein EGM_06289 [Macaca fascicularis]
Length = 3844
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1309 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1363
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1364 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1423
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1424 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1483
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1484 EILSNLPESVAYTCVNC 1500
>gi|348573849|ref|XP_003472703.1| PREDICTED: histone-lysine N-methyltransferase MLL-like, partial
[Cavia porcellus]
Length = 2799
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +HR CL+ R L +W C C+ C +C R
Sbjct: 269 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 323
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 324 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 383
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 384 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 443
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 444 EILSNLPESVAYTCINC 460
>gi|449267369|gb|EMC78314.1| Histone-lysine N-methyltransferase HRX, partial [Columba livia]
Length = 3786
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1280 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1334
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1335 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1394
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1395 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1454
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1455 EILSNLPESVAYTCINC 1471
>gi|410972021|ref|XP_003992459.1| PREDICTED: histone-lysine N-methyltransferase MLL [Felis catus]
Length = 3554
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1023 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1077
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1078 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1137
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1138 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1197
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1198 EILSNLPESVAYTCVNC 1214
>gi|184394|gb|AAA58669.1| HRX [Homo sapiens]
Length = 3969
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|355567103|gb|EHH23482.1| hypothetical protein EGK_06957, partial [Macaca mulatta]
Length = 3824
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1288 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1342
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1343 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1402
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1403 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1462
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1463 EILSNLPESVAYTCVNC 1479
>gi|344254289|gb|EGW10393.1| Histone-lysine N-methyltransferase MLL2 [Cricetulus griseus]
Length = 4002
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1738 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1795
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1796 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1855
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CPVC Y + + ++ C C+RW+H C+ +
Sbjct: 1856 --GFHCEWQNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESL 1907
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1908 FTEDEVEQAADEG--FDCVSCQ 1927
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C+ C+ CR
Sbjct: 642 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTAR---KRAGWQCPECKECQACR 693
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 694 KPGNDSKMLVCETCDKGYHTFCLKPPIEELPAHSWKCMTCRVCRACGVGSAELNPNSEWF 753
Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
Y+ C C G+ F V G PVC
Sbjct: 754 ENYSLCHRCHKAQGGQPFISVAGQRLPVC 782
>gi|431908264|gb|ELK11862.1| Histone-lysine N-methyltransferase HRX [Pteropus alecto]
Length = 3459
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 957 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1011
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1012 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1071
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1072 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1131
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1132 EILSNLPESVAYTCVNC 1148
>gi|344293012|ref|XP_003418218.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Loxodonta africana]
Length = 3962
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1429 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1483
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1484 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1543
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1544 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1603
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1604 EILSNLPESVAYTCVNC 1620
>gi|426244626|ref|XP_004016122.1| PREDICTED: histone-lysine N-methyltransferase MLL [Ovis aries]
Length = 3710
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1178 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1232
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1233 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1292
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1293 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1352
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1353 EILSNLPESVAYTCVNC 1369
>gi|114640631|ref|XP_508792.2| PREDICTED: histone-lysine N-methyltransferase MLL [Pan troglodytes]
Length = 3969
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|157109809|ref|XP_001650834.1| set domain protein [Aedes aegypti]
gi|108878936|gb|EAT43161.1| AAEL005378-PA [Aedes aegypti]
Length = 1458
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 96/214 (44%), Gaps = 22/214 (10%)
Query: 26 EEQGQSNTNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
EE G N V+C +C + G ++ +++C CG+ YH C +
Sbjct: 733 EEPGGDNRLVLCSAKDKFVLTQDICVMCGAIGTDQEGCLIACTQCGQCYHPYCTN--VKV 790
Query: 76 RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
+ W+C C ICE C + D + + C CD +YH YC PP ++V G + C
Sbjct: 791 TKVILQKGWRCLDCTICEGCGQRNDEGRLILCDDCDISYHIYCMDPPLEHVPQGNWKCKW 850
Query: 136 HTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C CGS+ PG+ + W Y+ C C CPVC + Y + E ++ C+
Sbjct: 851 CAICLKCGSSNPGH--NSNWLNNYSECGPCASQV----NCPVCAEGYVEGEL--IIQCNT 902
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
C+RW+HC CD I E + + Y C CR
Sbjct: 903 CERWLHCGCDQIKTENDAERCAEEG--YNCTLCR 934
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 67/232 (28%), Positives = 102/232 (43%), Gaps = 16/232 (6%)
Query: 34 NVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
++ CR C G ++ C CG YH C+ AQ + + W+C SC+ C+
Sbjct: 414 DINCRQC-----SGLGDVGNLMMCSICGDHYHGKCV-GLAQLPGV--RAGWQCSSCKKCQ 465
Query: 94 ICRRT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
ICR + + C +CD YH C P ++ + C C CGS PG G S
Sbjct: 466 ICRVPDSSEGRTVGCEQCDKIYHASCLRPVMTSIPKYGWKCKCCRVCSDCGSRTPGAGAS 525
Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
RW YT CD+C + KG CP+C + YR + MV C C ++VH CD ++
Sbjct: 526 SRWHAHYTVCDSCYQQRNKGFSCPICHRAYRAAAHREMVKCSGCNKFVHSTCDPEAELTV 585
Query: 213 LQFQVDGN--LQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRA 262
+ + N +Y C C+ + R + R + D + AS+ +
Sbjct: 586 YHAKKENNPDYEYLCNPCKASLHTGR-----FSAMRRTSSIDDDSMSASMES 632
>gi|403263194|ref|XP_003923935.1| PREDICTED: histone-lysine N-methyltransferase MLL [Saimiri
boliviensis boliviensis]
Length = 3985
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1451 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1505
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1506 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1565
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1566 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1625
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1626 EILSNLPESVAYTCVNC 1642
>gi|395744200|ref|XP_002823221.2| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 3 [Pongo
abelii]
Length = 5293
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1093 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1150
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1151 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1210
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1211 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1262
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1263 FTEDDVEQAADEG--FDCVSCQ 1282
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340
Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPISSVAEQHTPVC 369
>gi|56550039|ref|NP_005924.2| histone-lysine N-methyltransferase MLL isoform 2 precursor [Homo
sapiens]
gi|146345435|sp|Q03164.5|MLL1_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=CXXC-type zinc finger protein
7; AltName: Full=Lysine N-methyltransferase 2A;
Short=KMT2A; AltName: Full=Trithorax-like protein;
AltName: Full=Zinc finger protein HRX; Contains: RecName:
Full=MLL cleavage product N320; AltName: Full=N-terminal
cleavage product of 320 kDa; Short=p320; Contains:
RecName: Full=MLL cleavage product C180; AltName:
Full=C-terminal cleavage product of 180 kDa; Short=p180
gi|34305635|gb|AAQ63624.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) [Homo sapiens]
Length = 3969
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|119587784|gb|EAW67380.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_a [Homo sapiens]
Length = 3969
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|402895434|ref|XP_003910832.1| PREDICTED: histone-lysine N-methyltransferase MLL [Papio anubis]
Length = 3968
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1606
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623
>gi|2358287|gb|AAC51735.1| ALR [Homo sapiens]
Length = 4957
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 13/183 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 788 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 845
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 846 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 905
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 906 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 957
Query: 208 SDE 210
E
Sbjct: 958 FTE 960
>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
Length = 2526
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 91/192 (47%), Gaps = 14/192 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
LC V + G M+SC +C + YH C+ + W+C C ICE C
Sbjct: 442 LCLVCGSIGKGPEASMVSCANCSQTYHTYCVTLHDKMNSAILGRGWRCLDCTICEGCGNG 501
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFL 157
GD K + C CD +YH YC PP ++V SGP+ C ++C C GN L+ +
Sbjct: 502 GDEEKLLLCDECDVSYHVYCMKPPLESVPSGPWRCHWCSRCRRCNHKATSGNDLTPKGL- 560
Query: 158 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
C +C L V CP C + Y+ ++ ++ C +C++W H C+ + E+ L+
Sbjct: 561 ----CHSCASLQV----CPCCNRGYQINDK--IIRCSLCKKWQHGACENLHTEEQLEQAA 610
Query: 218 DGNLQYRCPTCR 229
+ RC +CR
Sbjct: 611 QNRM--RCASCR 620
>gi|395841650|ref|XP_003793647.1| PREDICTED: uncharacterized protein LOC100944849 [Otolemur garnettii]
Length = 5488
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1329 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1386
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1387 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1446
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1447 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1498
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1499 FTEDDVEQAADEG--FDCVSCQ 1518
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 62/129 (48%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CTVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG++ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRICRTCGASSAELNPNSEWF 340
Query: 157 LGYTCCDAC 165
++ C C
Sbjct: 341 ENFSLCHRC 349
>gi|350583914|ref|XP_003481621.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Sus scrofa]
Length = 5154
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 947 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1004
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1005 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1064
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1065 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1116
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1117 FTEDDVEQAADEG--FDCVSCQ 1136
>gi|345799715|ref|XP_536554.3| PREDICTED: histone-lysine N-methyltransferase MLL [Canis lupus
familiaris]
Length = 3829
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1300 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1354
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1355 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1414
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1415 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1474
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1475 EILSNLPESVAYTCVNC 1491
>gi|301785015|ref|XP_002927929.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Ailuropoda
melanoleuca]
Length = 3981
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1450 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1504
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1505 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1564
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1565 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1624
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1625 EILSNLPESVAYTCVNC 1641
>gi|426372409|ref|XP_004053116.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 2 [Gorilla
gorilla gorilla]
Length = 5284
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1093 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1150
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1151 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1210
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1211 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1262
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1263 FTEDDVEQAADEG--FDCVSCQ 1282
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
sapiens]
Length = 5265
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1096 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1153
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1154 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1213
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1214 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1265
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1266 FTEDDVEQAADEG--FDCVSCQ 1285
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
Y+ C C G+ V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369
>gi|395848655|ref|XP_003796965.1| PREDICTED: histone-lysine N-methyltransferase MLL [Otolemur
garnettii]
Length = 4062
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1528 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1582
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1583 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1642
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1643 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1702
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1703 EILSNLPESVAYTCVNC 1719
>gi|119587787|gb|EAW67383.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_d [Homo sapiens]
Length = 4002
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1466 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1520
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1521 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1580
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1581 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1640
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1641 EILSNLPESVAYTCVNC 1657
>gi|397510996|ref|XP_003846168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Pan paniscus]
Length = 5373
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1200 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1257
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1258 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1317
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1318 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1369
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1370 FTEDDVEQAADEG--FDCVSCQ 1389
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPIRSVAEQHTPVC 369
>gi|403297007|ref|XP_003939383.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Saimiri
boliviensis boliviensis]
Length = 5498
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1386 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1443
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1444 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1503
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1504 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1555
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1556 FTEDDVEQAADEG--FDCVSCQ 1575
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHHCHKAQGGQPLSSVAEQHTPVC 369
>gi|242005679|ref|XP_002423690.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
gi|212506866|gb|EEB10952.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
Length = 3311
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 105/230 (45%), Gaps = 41/230 (17%)
Query: 12 ESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKN 71
E+G+ S+D + +C LC G +++ C SC + YH C+
Sbjct: 906 ETGFCLIGSEDFS--------VRALCFLC------GSSGQEKLIHCASCCEPYHEFCIDE 951
Query: 72 WAQNRDLFHWS-SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC--------QHPP 122
W W CP C +C C +T + C +CD +YH C H P
Sbjct: 952 AQLKLQNNTWKFDWVCPRCTVCFTCGKTSGQQ--LKCVKCDNSYHIECVDRVGGRLLHSP 1009
Query: 123 HKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKV 181
+ P++C +C SC NG+ V F+G C AC L KGN+CP+C +
Sbjct: 1010 DR-----PWVCSICLRCKSC------NGVDVSVFVGNLPLCRACFVLRQKGNFCPLCQRC 1058
Query: 182 YRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
Y D + + M+ C C+ WVH +C+G+SDEKY L F + +++Y C C
Sbjct: 1059 YNDDDYDSKMMECGQCKCWVHAKCEGLSDEKYQVLSF-LPESVEYVCRMC 1107
>gi|426372407|ref|XP_004053115.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Gorilla
gorilla gorilla]
Length = 5550
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1359 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1416
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1417 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1476
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1477 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1528
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1529 FTEDDVEQAADEG--FDCVSCQ 1548
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340
Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPIRSVAEQHTPVC 369
>gi|160333334|ref|NP_001103749.1| histone-lysine N-methyltransferase MLL [Danio rerio]
gi|158714185|gb|ABW79914.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4218
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 91/200 (45%), Gaps = 13/200 (6%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
V+C LC N + C+ + +H CL + D W +W C CR C +
Sbjct: 1626 VVCFLCASSGNV------EFVFCQVRCEPFHLFCLGEAERPHD-EQWENWCCRRCRFCHV 1678
Query: 95 C-RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
C R+ + + C +C +YH C HP ++C K +C SCG+ PG
Sbjct: 1679 CGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKA 1738
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
+W ++ C C + KGN CP+C K Y D + + M+ C C RWVH +C+ ++D
Sbjct: 1739 WDAQWSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTD 1798
Query: 210 EK-YLQFQVDGNLQYRCPTC 228
+ L + N+ Y C C
Sbjct: 1799 DMCELMSSLPENVVYTCTNC 1818
>gi|332208875|ref|XP_003253537.1| PREDICTED: histone-lysine N-methyltransferase MLL [Nomascus
leucogenys]
Length = 3968
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1606
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623
>gi|297691727|ref|XP_002823219.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Pongo
abelii]
Length = 5559
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1359 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1416
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1417 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1476
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1477 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1528
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1529 FTEDDVEQAADEG--FDCVSCQ 1548
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340
Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPISSVAEQHTPVC 369
>gi|349603659|gb|AEP99439.1| Histone-lysine N-methyltransferase MLL3-like protein, partial
[Equus caballus]
Length = 452
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 62 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 119
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 120 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 177
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
YT C C L + CPVC + YR E ++ C C RW+H C +S E+ ++ D
Sbjct: 178 YTQCAPCASL----SSCPVCYRNYR--EEDLILQCRQCDRWMHAVCQNLSTEEEVENVAD 231
Query: 219 GNLQYRCPTCR 229
+ + C CR
Sbjct: 232 --IGFDCSMCR 240
>gi|124486682|ref|NP_001074518.1| histone-lysine N-methyltransferase MLL [Mus musculus]
Length = 3963
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSDEMY 1606
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623
>gi|397498815|ref|XP_003820170.1| PREDICTED: histone-lysine N-methyltransferase MLL [Pan paniscus]
Length = 4202
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1666 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1720
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1721 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1780
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1781 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1840
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1841 EILSNLPESVAYTCVNC 1857
>gi|301783643|ref|XP_002927255.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Ailuropoda
melanoleuca]
Length = 5483
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1368 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1425
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1426 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1485
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1486 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYMEEDL--LIQCRHCERWMHAGCESL 1537
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1538 FTEDDVEQAADEG--FDCVSCQ 1557
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
Y+ C C G+L V PVC
Sbjct: 341 ENYSLCHRCHKAQGGQLVSSVAEQQPPVC 369
>gi|241687917|ref|XP_002401627.1| mll protein, putative [Ixodes scapularis]
gi|215504524|gb|EEC14018.1| mll protein, putative [Ixodes scapularis]
Length = 1259
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/237 (31%), Positives = 106/237 (44%), Gaps = 30/237 (12%)
Query: 1 MVAAEDY--ARRFESGYVATASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSC 57
+ E+Y A+ +G+ +S+ A +C LC GE E +L C
Sbjct: 777 LFGEEEYRRAQIIATGFALVSSRRFA--------VQAVCFLCASAGEEE-------LLFC 821
Query: 58 KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCY 117
C + YH CL + L SW CP C+ C C ++ + C +C YH
Sbjct: 822 TVCCEPYHWFCLDPEEAPQGLDK-ESWCCPRCQTCIACGHRSSVSQLLRCSKCQQTYHTD 880
Query: 118 CQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
C P + + S +LC K +C SCG++ + W + C C L KGNY
Sbjct: 881 CLGPGYPSKPSRKKKIWLCVKCIRCKSCGTSSKQSA----WNFDLSLCQDCMLLREKGNY 936
Query: 175 CPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
CP+C K Y D + + MV C CQ+W+H +CDGIS+E Y L + L Y C C
Sbjct: 937 CPLCEKCYEDDDYESMMVQCSQCQKWIHARCDGISEELYQVLSLLPETEL-YLCRIC 992
>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=ALL1-related protein; AltName: Full=Lysine
N-methyltransferase 2D; Short=KMT2D; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 2
gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
sapiens]
Length = 5537
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1368 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1425
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1426 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1485
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1486 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1537
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1538 FTEDDVEQAADEG--FDCVSCQ 1557
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
Y+ C C G+ V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369
>gi|348580193|ref|XP_003475863.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Cavia
porcellus]
Length = 5577
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1368 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1425
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1426 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1485
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1486 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1537
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1538 FTEDDVEQAADEG--FDCVSCQ 1557
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 59/129 (45%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C M C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELC----NMFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRVCGAGSSELNPNSEWF 340
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|297262270|ref|XP_001099471.2| PREDICTED: histone-lysine N-methyltransferase MLL2 [Macaca mulatta]
Length = 5505
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1315 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1372
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1373 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1432
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1433 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1484
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1485 FTEDDVEQAADEG--FDCISCQ 1504
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 77/175 (44%), Gaps = 18/175 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDACGRL-------FVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHCQ 203
Y+ C C R V + PVC + +S TP D +V CQ
Sbjct: 341 ENYSLCHRCHRAQGGQPVSSVAEQHTPVCSRFSPPESGDTPTDEPDAL--YVACQ 393
>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
Length = 5262
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 84/183 (45%), Gaps = 13/183 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1093 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1150
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1151 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1210
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1211 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1262
Query: 208 SDE 210
E
Sbjct: 1263 FTE 1265
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
Y+ C C G+ V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369
>gi|410964289|ref|XP_003988688.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Felis catus]
Length = 5559
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1353 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1410
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1411 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1470
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1471 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1522
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1523 FTEDDVEQAADEG--FDCVSCQ 1542
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|402885854|ref|XP_003919662.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Papio anubis]
Length = 5547
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1341 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1398
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1399 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1458
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1459 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1510
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1511 FTEDDVEQXADEG--FDCISCQ 1530
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 62/131 (47%), Gaps = 8/131 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDACGR 167
Y+ C C R
Sbjct: 341 ENYSLCHRCHR 351
>gi|332206905|ref|XP_003252537.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Nomascus leucogenys]
Length = 5407
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1380 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1437
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1438 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1497
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1498 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1549
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1550 FTEDDVEQAADEG--FDCVSCQ 1569
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDACGRL-------FVKGNYCPVC 178
Y+ C C + V + PVC
Sbjct: 341 ENYSVCHRCHKAQGGQPVSSVAEQHTPVC 369
>gi|2160396|dbj|BAA03407.1| MLL [Homo sapiens]
Length = 1909
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|355564192|gb|EHH20692.1| hypothetical protein EGK_03605 [Macaca mulatta]
Length = 5538
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1367 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1424
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1425 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1484
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1485 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1536
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1537 FTEDDVEQAADEG--FDCISCQ 1556
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 77/175 (44%), Gaps = 18/175 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDACGRL-------FVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHCQ 203
Y+ C C R V + PVC + +S TP D +V CQ
Sbjct: 341 ENYSLCHRCHRAQGGQPVSSVAEQHTPVCSRFSPPESGDTPTDEPDAL--YVACQ 393
>gi|440904942|gb|ELR55394.1| Histone-lysine N-methyltransferase MLL, partial [Bos grunniens mutus]
Length = 3846
Score = 106 bits (265), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1314 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1368
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1369 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1428
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1429 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1488
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1489 EILSNLPESVAYTCVNC 1505
>gi|345792161|ref|XP_543684.3| PREDICTED: histone-lysine N-methyltransferase MLL2 [Canis lupus
familiaris]
Length = 5552
Score = 106 bits (265), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1350 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1407
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1408 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1467
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1468 --GFHCEWQNSYTHCGPCASLVT----CPICHTPYVEEDL--LIQCRHCERWMHAGCESL 1519
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1520 FTEDDVEQAADEG--FDCVSCQ 1539
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 69/149 (46%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
Y+ C C G+L V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQLIGSVAEQHPPVC 369
>gi|390467630|ref|XP_002807137.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Callithrix jacchus]
Length = 5289
Score = 106 bits (265), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1353 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1410
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1411 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1470
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1471 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1522
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1523 FTEDDVEQAADEG--FDCVSCQ 1542
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNSNSEWF 340
Query: 157 LGYTCCDAC-----GRLF--VKGNYCPVC 178
Y+ C C G+ V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPLSSVAEQHTPVC 369
>gi|297482744|ref|XP_002693122.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Bos
taurus]
gi|296480196|tpg|DAA22311.1| TPA: myeloid/lymphoid or mixed-lineage leukemia-like [Bos taurus]
Length = 3821
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1289 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1343
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1344 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1403
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1404 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1463
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1464 EILSNLPESVAYTCVNC 1480
>gi|328776663|ref|XP_394941.4| PREDICTED: hypothetical protein LOC411466 [Apis mellifera]
Length = 4678
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 82/180 (45%), Gaps = 17/180 (9%)
Query: 33 TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C +G + EGC +++C CG+ YH C + + W+C C
Sbjct: 359 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 411
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C D + + C CD +YH YC PP V G + C C +CGSN P G
Sbjct: 412 VCEGCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 469
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
+ W YT C C C C + Y + + ++ C C+RW+HC CD I E
Sbjct: 470 FNSSWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSIKSE 523
>gi|426226681|ref|XP_004007467.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Ovis aries]
Length = 5387
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1425 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1482
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1483 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1542
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1543 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1594
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1595 FTEDDVEQAADEG--FDCVSCQ 1614
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|194666944|ref|XP_583302.4| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|297474553|ref|XP_002687353.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|296487853|tpg|DAA29966.1| TPA: myeloid/lymphoid or mixed-lineage leukemia 2-like [Bos taurus]
Length = 5503
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1331 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCV 1388
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1389 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1448
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1449 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1500
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1501 FTEDDVEQAADEG--FDCVSCQ 1520
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|86129850|gb|ABC86577.1| myeloid/lymphoid or mixed-lineage leukemia protein [Danio rerio]
Length = 1154
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 92/200 (46%), Gaps = 13/200 (6%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
V+C LC N + C+ C + +H CL + D W +W C CR C +
Sbjct: 142 VVCFLCASSGNV------EFVFCQVCCEPFHLFCLGEAERPHDE-QWENWCCRRCRFCHV 194
Query: 95 C-RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
C R+ + + C +C +YH C HP ++C K +C SCG+ PG
Sbjct: 195 CGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKA 254
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
+W ++ C C + KGN CP+C K Y D + + M+ C C RWVH +C+ ++D
Sbjct: 255 WDAQWSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTD 314
Query: 210 EK-YLQFQVDGNLQYRCPTC 228
+ L + N+ Y C C
Sbjct: 315 DMCELMSSLPENVVYTCTNC 334
>gi|119587786|gb|EAW67382.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_c [Homo sapiens]
Length = 3130
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 594 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 648
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 649 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 708
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 709 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 768
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 769 EILSNLPESVAYTCVNC 785
>gi|297458806|ref|XP_585092.4| PREDICTED: histone-lysine N-methyltransferase MLL [Bos taurus]
Length = 3826
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1294 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1348
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1349 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1408
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1409 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1468
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1469 EILSNLPESVAYTCVNC 1485
>gi|384494147|gb|EIE84638.1| hypothetical protein RO3G_09348 [Rhizopus delemar RA 99-880]
Length = 690
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 88/207 (42%), Gaps = 20/207 (9%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSS--WKCPSCRICEICRRTGDPNKFMFCRRCD 111
++ C C +KYH C N + + S W CP C++C +CR GD + M C CD
Sbjct: 435 LVKCSRCTRKYHPVC-ANLTTPKQVVGAESYPWLCPECKVCFVCRTAGDESTLMICDGCD 493
Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN------------VPGNGLSVRWFLGY 159
+H C P ++ G +LC KCH C P +
Sbjct: 494 RGWHTGCCTPKVDHIPEGEWLCQLCAKCHGCNERGMKDESQYTHVAAPKSDKCKYPVYLA 553
Query: 160 TCCDACGRLFVKGNYCPVCLKVY----RDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQ 214
T CD C F + +CPVCLK Y D E MV CD C WVH +CD ++ E+Y
Sbjct: 554 TYCDKCVIDFKEDRFCPVCLKTYSDEENDEEDNEMVACDTCDHWVHTRCDESLTPERYQM 613
Query: 215 FQVDGNLQYRCPTCRGECYQVRDLEDA 241
D + +Y CP C D E A
Sbjct: 614 LCDDESAKYSCPMCEDRIKSTVDTEAA 640
>gi|62088596|dbj|BAD92745.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) variant [Homo sapiens]
Length = 2880
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 344 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 398
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 399 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 458
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 459 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 518
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 519 EILSNLPESVAYTCVNC 535
>gi|256081465|ref|XP_002576990.1| myst-related protein [Schistosoma mansoni]
gi|353229452|emb|CCD75623.1| myst-related protein [Schistosoma mansoni]
Length = 1074
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 83/176 (47%), Gaps = 11/176 (6%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L+C CG+ YH C + R + W+C C +CE C T + + + C C+ +
Sbjct: 784 LLACSQCGQCYHSFCAEVPKITRTMIE-KGWRCLDCTVCEGCGGTSNESLLLLCDDCNIS 842
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
+H YC PP K V G + C C +CG P GL+ +W Y+ C C L
Sbjct: 843 FHTYCLDPPLKEVPKGGWKCTDCVICTNCGQKDP--GLNGKWHANYSVCAPCASL----T 896
Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CP+C YR+ E +V C +C RW H CD + E L+ D L Y C CR
Sbjct: 897 TCPICNLAYREEEL--LVRCALCTRWAHANCDQLRTEDELEIATD--LGYNCLLCR 948
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 101/243 (41%), Gaps = 44/243 (18%)
Query: 26 EEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
E + T++ C LC E+ G +L C CG YH +CL+ Q W+
Sbjct: 302 EALASNCTSIQCGLC---ESPG--DLTELLFCTGCGSHYHASCLEPPLQPSPTIRIG-WQ 355
Query: 86 CPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-- 143
C C+ C IC + D NK + C CD YH YC PP ++ + C + C CG
Sbjct: 356 CAECKTCLICNESKDENKMLVCDVCDKGYHTYCLKPPVSSIPKNGFRCERCRVCSDCGGG 415
Query: 144 --SNVPG--------NGL--SVRWFLGYTCCDACGRLFVKGN-YCPVCLKV--------- 181
S + G N L +VRW YT CD C + N CPVC +
Sbjct: 416 RSSTLSGLEGPVAFNNQLNPNVRWHSNYTLCDRCFHAHKRPNSCCPVCERAWRCSLPVPE 475
Query: 182 --YRDSESTPMVC----CDVCQRWVHCQCDGISDEKYL------QFQVDG--NLQYRCPT 227
YR+ ST +V C C+R VH +CD S++ + + G Y CP
Sbjct: 476 NFYRNPNSTFLVWPGSRCSQCRRMVHAECDPSSNQATVSPLSAASEETSGICGTNYVCPV 535
Query: 228 CRG 230
CR
Sbjct: 536 CRA 538
>gi|350588548|ref|XP_003357368.2| PREDICTED: histone-lysine N-methyltransferase MLL [Sus scrofa]
Length = 2525
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 9/183 (4%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRC 110
+ + C+ C + +H+ CL+ R L +W C C+ C +C R K + C +C
Sbjct: 2 KFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKC 59
Query: 111 DAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
+YH C P + + ++C K +C SCGS PG G +W ++ C C +
Sbjct: 60 RNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAK 119
Query: 168 LFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRC 225
LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + + ++ Y C
Sbjct: 120 LFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPESVAYTC 179
Query: 226 PTC 228
C
Sbjct: 180 VNC 182
>gi|351695440|gb|EHA98358.1| Histone-lysine N-methyltransferase MLL3 [Heterocephalus glaber]
Length = 4724
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 337 CTTCGQHYHGMCLDIVVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 393
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 394 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLICDTCYQQ--QDNLCP 446
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 447 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 496
>gi|345490044|ref|XP_001603865.2| PREDICTED: LOW QUALITY PROTEIN: hypothetical protein LOC100120205
[Nasonia vitripennis]
Length = 5138
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 81/174 (46%), Gaps = 17/174 (9%)
Query: 36 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
+C +C +G + EGC +++C CG+ YH C + + W+C C +CE
Sbjct: 475 ICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCTVCE 527
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C D + + C CD +YH YC PP + V G + C +C +CG+N P G +
Sbjct: 528 GCGERNDEGRLILCDDCDISYHIYCTDPPLECVPQGTWKCKWCAQCQTCGANDP--GFNS 585
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
W YT C C C C + Y E ++ C C+RW+HC CD I
Sbjct: 586 NWQKNYTQCGPCS----SHTACAACNESY--GEGDLIIQCVQCERWLHCMCDAI 633
>gi|327288610|ref|XP_003229019.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Anolis carolinensis]
Length = 3817
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 99/196 (50%), Gaps = 10/196 (5%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+CF+ + G + C+ C + +H+ CL++ + + +W C C+ C +C R
Sbjct: 1336 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEDGERPLE-DQLENWCCRRCKFCHVCGRQ 1391
Query: 99 GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVR 154
K + C +C +YH C P + + ++C K +C SCG+ PG G +
Sbjct: 1392 HQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGATTPGKGWDAQ 1451
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY- 212
W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1452 WSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYE 1511
Query: 213 LQFQVDGNLQYRCPTC 228
L + ++ Y C C
Sbjct: 1512 LLSNLPESVAYTCINC 1527
>gi|432114496|gb|ELK36344.1| Histone-lysine N-methyltransferase MLL2 [Myotis davidii]
Length = 3462
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 1355 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1412
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1413 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1472
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1473 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1524
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1525 FTEDDVEQAADEG--FDCVSCQ 1544
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 261 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTAR---KRAGWQCPECKVCQACR 312
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C SCG+ + WF
Sbjct: 313 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRSCGAGSAELNPNSEWF 372
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 373 ENYSLCYRC 381
>gi|410046801|ref|XP_003313790.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Pan
troglodytes]
Length = 2476
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 92/202 (45%), Gaps = 15/202 (7%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 965 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 1022
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 1023 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 1082
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
G W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1083 --GFHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESL 1134
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1135 FTEDDVEQAADEG--FDCVSCQ 1154
>gi|440792783|gb|ELR13991.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 506
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 92/197 (46%), Gaps = 16/197 (8%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
CR+C + E E A ++ C C +YH +CL+ +N W+C C+ CE C+
Sbjct: 199 CRMCL--KEESAEGA--LIRCTECKDQYHPDCLELKKENIPKMMSFGWRCMHCKKCETCK 254
Query: 97 RTGDPNKFMFCRRC-DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
TGD K R D +H +C PP K G + C + +C SCG G S RW
Sbjct: 255 DTGDEEKARAARAFHDMGFHTFCLSPPLKRPPIGGWFCRECVECKSCGGKTAGKAKSCRW 314
Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVC--CDVCQRWVHCQCDGISDEKY 212
GYT C+ C + + YCPVC VY+D ++ P + C C+ VH CDG
Sbjct: 315 HRGYTMCEMCYKRYKHNKYCPVCTLVYQDRDARNPALLRSCVSCRHCVHAGCDG------ 368
Query: 213 LQFQVDGNLQYRCPTCR 229
F Y+CP CR
Sbjct: 369 -NF-AGVTSPYQCPPCR 383
>gi|410911878|ref|XP_003969417.1| PREDICTED: uncharacterized protein LOC101064190 [Takifugu rubripes]
Length = 2720
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 85/181 (46%), Gaps = 8/181 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCD 111
M+ C+ C + +H CL + R L + +W C C+ C +C RR+ + + CRRC
Sbjct: 996 MIFCQICCEPFHSFCLS--PEERPLKDNKENWCCRRCKFCHVCGRRSKNTKPVLQCRRCQ 1053
Query: 112 AAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
+YH C P P P++C +C SCG PG + W C C L
Sbjct: 1054 TSYHPACLGPTYPKPMNCKIPWVCMTCIRCKSCGV-TPGKSWDLAWNHDEDLCPDCTLLH 1112
Query: 170 VKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
KGN+C +C K Y D+ + T M+ C C W+H C+GISDE Y + C C
Sbjct: 1113 NKGNFCTICHKCYDDNMQHTEMIQCSACNHWIHYSCEGISDELYGLVSNQREDSFTCQPC 1172
Query: 229 R 229
R
Sbjct: 1173 R 1173
>gi|428181743|gb|EKX50606.1| hypothetical protein GUITHDRAFT_60438, partial [Guillardia theta
CCMP2712]
Length = 149
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 85/155 (54%), Gaps = 10/155 (6%)
Query: 54 MLSCKSCGKKYHRNC----LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
L C+ CG +H+ C LK + R++ W+CP+CRICE+C+ + ++ + C
Sbjct: 1 FLFCRDCGDSFHKYCFDLTLKIPPEKRNM-----WRCPACRICEVCKGEENWDEMLCCDE 55
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
CD +H YC PP K + + + C + +C SCGS PG S RW YT C +C +
Sbjct: 56 CDRGFHIYCLRPPLKQIPAEGWRCSECVRCLSCGSKTPGPKGSDRWRKDYTLCSSCWVEY 115
Query: 170 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
K NYCP+C KV S+ MV CD CQ WVH C
Sbjct: 116 EKKNYCPIC-KVVTSSKDIKMVNCDSCQMWVHVTC 149
>gi|256081467|ref|XP_002576991.1| myst-related protein [Schistosoma mansoni]
gi|353229451|emb|CCD75622.1| myst-related protein [Schistosoma mansoni]
Length = 914
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 83/176 (47%), Gaps = 11/176 (6%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L+C CG+ YH C + R + W+C C +CE C T + + + C C+ +
Sbjct: 624 LLACSQCGQCYHSFCAEVPKITRTMIE-KGWRCLDCTVCEGCGGTSNESLLLLCDDCNIS 682
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
+H YC PP K V G + C C +CG P GL+ +W Y+ C C L
Sbjct: 683 FHTYCLDPPLKEVPKGGWKCTDCVICTNCGQKDP--GLNGKWHANYSVCAPCASLTT--- 737
Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CP+C YR+ E +V C +C RW H CD + E L+ D L Y C CR
Sbjct: 738 -CPICNLAYREEEL--LVRCALCTRWAHANCDQLRTEDELEIATD--LGYNCLLCR 788
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 101/243 (41%), Gaps = 44/243 (18%)
Query: 26 EEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
E + T++ C LC E+ G +L C CG YH +CL+ Q W+
Sbjct: 142 EALASNCTSIQCGLC---ESPG--DLTELLFCTGCGSHYHASCLEPPLQPSPTIRIG-WQ 195
Query: 86 CPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-- 143
C C+ C IC + D NK + C CD YH YC PP ++ + C + C CG
Sbjct: 196 CAECKTCLICNESKDENKMLVCDVCDKGYHTYCLKPPVSSIPKNGFRCERCRVCSDCGGG 255
Query: 144 --SNVPG--------NGL--SVRWFLGYTCCDACGRLFVKGN-YCPVCLKV--------- 181
S + G N L +VRW YT CD C + N CPVC +
Sbjct: 256 RSSTLSGLEGPVAFNNQLNPNVRWHSNYTLCDRCFHAHKRPNSCCPVCERAWRCSLPVPE 315
Query: 182 --YRDSESTPMVC----CDVCQRWVHCQCDGISDEKYL------QFQVDG--NLQYRCPT 227
YR+ ST +V C C+R VH +CD S++ + + G Y CP
Sbjct: 316 NFYRNPNSTFLVWPGSRCSQCRRMVHAECDPSSNQATVSPLSAASEETSGICGTNYVCPV 375
Query: 228 CRG 230
CR
Sbjct: 376 CRA 378
>gi|149041498|gb|EDL95339.1| myeloid/lymphoid or mixed-lineage leukemia (mapped) [Rattus
norvegicus]
Length = 3725
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 98/200 (49%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1190 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1244
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1245 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1304
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+G+S +
Sbjct: 1305 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSGTED 1364
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1365 EMYEILSNLPESVAYTCVNC 1384
>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
Length = 4785
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 87/174 (50%), Gaps = 15/174 (8%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 348 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 400
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCR 229
C K Y M+ C++C+RW+H +CD +D++ QF+ +Y C C+
Sbjct: 401 FCGKCYNPELQKDMLHCNMCKRWIHLECDKPTDQELDSQFRE----EYICTYCK 450
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 28/50 (56%)
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C +CE C + DP + + C CD +YH YC PP + V G + C
Sbjct: 822 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKC 871
>gi|297682047|ref|XP_002818745.1| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Pongo
abelii]
Length = 1215
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 83/177 (46%), Gaps = 12/177 (6%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
R+L+C C + YH C+ + + W+C C +CE C + DP + + C CD
Sbjct: 635 RLLACSQCCQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDI 692
Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
+YH YC PP + V G + C C CG+ GL W YT C C L
Sbjct: 693 SYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL---- 746
Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
+ CPVC + YR E ++ C C RW+H C ++ E+ ++ D + + C CR
Sbjct: 747 SSCPVCYRNYR--EEDLILQCRQCDRWMHAVCQNLNTEEEVENVAD--IGFDCSMCR 799
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 21 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 77
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 78 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 130
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 131 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTD-----HELDTQLKEEYICMYCK 180
>gi|383848022|ref|XP_003699651.1| PREDICTED: uncharacterized protein LOC100881339 [Megachile
rotundata]
Length = 4805
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 17/177 (9%)
Query: 33 TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C +G + EGC +++C CG+ YH C + + W+C C
Sbjct: 472 TQDICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCT 524
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C D + + C CD +YH YC PP V G + C C +CGSN P G
Sbjct: 525 VCEGCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--G 582
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
+ W YT C C C C + Y + + ++ C C+RW+HC CD I
Sbjct: 583 FNSSWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSI 633
>gi|347968475|ref|XP_563394.4| AGAP002741-PA [Anopheles gambiae str. PEST]
gi|333467986|gb|EAL40845.4| AGAP002741-PA [Anopheles gambiae str. PEST]
Length = 4925
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 106/244 (43%), Gaps = 53/244 (21%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKN--------------------------- 71
LCF+ + G E ML C C + YH+ C+K+
Sbjct: 1713 LCFLCGSAGLES---MLFCVCCCEPYHQYCVKDEYNLRTGTGTGLDDTGNMSLLDVTLGA 1769
Query: 72 ---WAQNRDLFHWSSWKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHK-NV 126
Q + L +W CP C +C C TG K C++C YH C + +
Sbjct: 1770 SPQQQQEQLLIARYNWMCPRCTVCFSCNMATGAKVK---CQKCAKHYHTTCLGTSKRLHG 1826
Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 185
+ P +C +C SCG+ +V F+G C C RL KGNYCP+C K Y D+
Sbjct: 1827 ADRPLICAACLRCKSCGTT------NVTKFIGNLPMCTPCFRLRQKGNYCPLCQKCYEDN 1880
Query: 186 E-STPMVCCDVCQRWVHCQCDGISDEKYLQFQV-DGNLQYRCPTCRGECYQVRDLEDAVR 243
+ M+ C C+RWVH +C+G++DE+Y V N+++ C C + D+
Sbjct: 1881 DFDLKMMECGDCRRWVHARCEGLTDEQYNMLSVLPENIEFVCKKC------AKHSSDSTA 1934
Query: 244 ELWR 247
LWR
Sbjct: 1935 HLWR 1938
>gi|688443|gb|AAA62593.1| All-1 protein, partial [Mus musculus]
Length = 3866
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 98/204 (48%), Gaps = 18/204 (8%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
V+C LC E+ + C+ C + +H+ CL+ R L +W C C+ C
Sbjct: 1331 VVCFLCSSSEHV------EFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCH 1382
Query: 94 ICRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGN 149
+C R K + C +C +YH C P + + ++C K +C SCGS PG
Sbjct: 1383 VCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGK 1442
Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
G +W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S
Sbjct: 1443 GWDAQWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLS 1502
Query: 209 DEKYLQFQVDGNL----QYRCPTC 228
+ +++ NL Y C C
Sbjct: 1503 GTEDEMYEILSNLPESVAYTCVNC 1526
>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3, partial [Pan paniscus]
Length = 4810
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 275 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 331
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 332 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 384
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 385 FCGKYYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 434
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 71/146 (48%), Gaps = 10/146 (6%)
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
W+C C +CE C + DP + + C CD +YH YC PP + V G + C C CG
Sbjct: 849 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLXPPLQTVPKGGWKCKWCVWCRHCG 908
Query: 144 SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 203
+ GL W YT C C L + CPVC + YR+ + ++ C C RW+H
Sbjct: 909 AT--SAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAV 960
Query: 204 CDGISDEKYLQFQVDGNLQYRCPTCR 229
C ++ E+ ++ D + + C CR
Sbjct: 961 CQNLNTEEEVENVAD--IGFDCSMCR 984
>gi|627837|pir||A48205 All-1 protein +GTE form - mouse (fragment)
Length = 3869
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 98/204 (48%), Gaps = 18/204 (8%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
V+C LC E+ + C+ C + +H+ CL+ R L +W C C+ C
Sbjct: 1334 VVCFLCSSSEHV------EFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCH 1385
Query: 94 ICRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGN 149
+C R K + C +C +YH C P + + ++C K +C SCGS PG
Sbjct: 1386 VCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGK 1445
Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
G +W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S
Sbjct: 1446 GWDAQWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLS 1505
Query: 209 DEKYLQFQVDGNL----QYRCPTC 228
+ +++ NL Y C C
Sbjct: 1506 GTEDEMYEILSNLPESVAYTCVNC 1529
>gi|354496911|ref|XP_003510567.1| PREDICTED: histone-lysine N-methyltransferase MLL [Cricetulus
griseus]
Length = 3907
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 97/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1370 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1424
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1425 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1484
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ + DE Y
Sbjct: 1485 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLLDEMY 1544
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1545 EILSNLPESVAYTCVNC 1561
>gi|449672214|ref|XP_002156610.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Hydra
magnipapillata]
Length = 686
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/187 (31%), Positives = 89/187 (47%), Gaps = 13/187 (6%)
Query: 48 CERARRM---LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 104
C+++ M L C SCG+ +H C+ + W+C C++C+ C++ GD K
Sbjct: 246 CQKSDNMQSQLFCTSCGRHFHSYCVDMNIPITPVVRMG-WQCSFCKVCQGCKQPGDEEKM 304
Query: 105 MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDA 164
+ C +CD YH YC +PP V + C KC CGS+ PG+G S RW ++ CD
Sbjct: 305 LCCDQCDKGYHIYCLNPPISVVPKSVWKCVSCRKCSDCGSSKPGSGPSCRWHNNFSLCDR 364
Query: 165 CGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYR 224
C + KG CP+C + R + + C C + VH +C +D +Y
Sbjct: 365 CYQQRKKGQSCPICKRAVRLFNNGDAIQCKKCFKCVHGECHS---------PLDDGAEYI 415
Query: 225 CPTCRGE 231
CP C E
Sbjct: 416 CPDCIEE 422
>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
gorilla]
Length = 4782
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 264 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 320
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 321 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 373
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 229
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 374 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 423
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 72/154 (46%), Gaps = 12/154 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLK-----NWAQNRDLFH-WSSWKCPSCRIC 92
+C V + G R+L+C CG+ YH C+ + + R+ + W+C C +C
Sbjct: 864 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKGNTGFCEFRNQNERFKGWRCLECTVC 923
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 152
E C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 924 EACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLR 981
Query: 153 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
W YT C C L + CPVC + YR+ +
Sbjct: 982 CEWQNNYTQCAPCASL----SSCPVCYRNYREED 1011
>gi|344249614|gb|EGW05718.1| Histone-lysine N-methyltransferase HRX [Cricetulus griseus]
Length = 3512
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 97/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1164 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1218
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1219 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1278
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ + DE Y
Sbjct: 1279 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLLDEMY 1338
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 1339 EILSNLPESVAYTCVNC 1355
>gi|322792929|gb|EFZ16759.1| hypothetical protein SINV_09310 [Solenopsis invicta]
Length = 549
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 72/222 (32%), Positives = 102/222 (45%), Gaps = 37/222 (16%)
Query: 54 MLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF------- 104
++ C CG+ YH +C L R + W+C SCR+C++CR+ D +K
Sbjct: 223 LVMCSVCGQHYHGSCVGLALLPGVR-----AGWQCVSCRVCQVCRQPEDVSKINVLQNIK 277
Query: 105 ----------MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
M C RCD AYH C P ++ + C C CGS PG GLS R
Sbjct: 278 NIHSKSYTHVMLCERCDKAYHPGCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLSSR 337
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQ 214
W YT CD+C + KG CP+C K YR + MV C C+++VH CD +D Q
Sbjct: 338 WHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCHGCKKFVHGTCDPEADPLTYQ 397
Query: 215 FQVDG--NLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADK 254
+ D + +Y C C+ ++ + RRKD D+
Sbjct: 398 QRKDAKPDYEYVCLHCK-----------SIAMVARRKDSIDE 428
>gi|357617693|gb|EHJ70933.1| hypothetical protein KGM_14791 [Danaus plexippus]
Length = 4460
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 98/206 (47%), Gaps = 17/206 (8%)
Query: 32 NTNVMCRLC-FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
++ CR C +G+ +++C +CG YH C+ AQ + + W C SCR
Sbjct: 371 TADIDCRTCRTIGD------MANLMTCVTCGGHYHGTCV-GLAQLPGV--RAGWSCRSCR 421
Query: 91 ICEICRRTGDPNKF-----MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN 145
+C++CR + C CD YH C P V + C C CG+
Sbjct: 422 VCQVCRGEAGGGAGGEARAVACEHCDKLYHAACLRPVMATVPKYGWKCKCCRVCSDCGAR 481
Query: 146 VPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCD 205
PG G S RW YT CD+C + KG+ CP+C + YR + M+ C C+R+VH CD
Sbjct: 482 SPGAGPSSRWHAHYTVCDSCYQQRNKGSCCPLCRRAYRAAAYRDMIRCSACRRYVHGMCD 541
Query: 206 GISD-EKYLQFQ-VDGNLQYRCPTCR 229
++ + Y Q + + + +Y CP C+
Sbjct: 542 PEAEPQNYKQKKGENSSYEYTCPICK 567
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 89/200 (44%), Gaps = 22/200 (11%)
Query: 33 TNVMCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
T +C +C VG ++EGC +++C CG+ YH C+ + + W+C C
Sbjct: 713 TQDLCVMCGAVGTDSEGC-----LIACSQCGQTYHPYCVN--IKVSQVIVSLGWRCLDCT 765
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C GD + C CD A+H YC P V G + C + +C CG+
Sbjct: 766 VCEGCGSRGDEPLLVLCDDCDTAWHTYCARPALAEVPRGAWRCGRCRRCLVCGTRD---- 821
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 210
+ W YT C C L + C VC + Y D E ++ C C RW+H CD I E
Sbjct: 822 -TALWCDNYTECAPCASLVM----CCVCSEPYSDGEL--IIQCTACSRWLHAACDSIRSE 874
Query: 211 KYLQFQVDGNLQYRCPTCRG 230
+ Y+C CRG
Sbjct: 875 ADAETCCRAG--YKCTWCRG 892
>gi|328716144|ref|XP_001947369.2| PREDICTED: hypothetical protein LOC100162709 isoform 1 [Acyrthosiphon
pisum]
Length = 1495
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/182 (29%), Positives = 79/182 (43%), Gaps = 17/182 (9%)
Query: 37 CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C+LC + ++ C C YH CL + +W+C C+ C C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
+ D +K +FC CD YH YC + V G + C + C SCG + PG G S +W
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPGPGDS-KW 1398
Query: 156 FLGY-------------TCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHC 202
F + T C C R + KG++CP C++ Y M C+ C R++H
Sbjct: 1399 FYEFKKTEKTGSKVYCRTLCAPCSRSWKKGHFCPNCMRCYPIKNVERMTQCNSCDRYLHS 1458
Query: 203 QC 204
+C
Sbjct: 1459 EC 1460
>gi|14626491|gb|AAK70213.1| MLL3-like protein [Mus musculus]
Length = 420
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 84/177 (47%), Gaps = 12/177 (6%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
R+L+C CG+ YH C+ + + W+C C +CE C + DP + + C CD
Sbjct: 10 RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDI 67
Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
+YH YC PP + V G + C C CG+ GL W YT C C L
Sbjct: 68 SYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL---- 121
Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
+ CPVC + YR E ++ C C RW+H C ++ E+ ++ D + + C CR
Sbjct: 122 SSCPVCCRNYR--EEDLILQCRQCDRWMHAVCQNLNTEEEVENVAD--IGFDCSMCR 174
>gi|348526824|ref|XP_003450919.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oreochromis
niloticus]
Length = 4517
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 90/203 (44%), Gaps = 20/203 (9%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
V+C LC N + C+ C + +H CL R L + +W C CR C+
Sbjct: 1645 VLCFLCASSGNV------EFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRYCQ 1696
Query: 94 ICRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSNV 146
C R K + C +C +YH C P N + P ++C K +C SCG+
Sbjct: 1697 ACGRQHQKTKQLLECDKCHNSYHPECLGP---NYPTRPTKKKRIWVCTKCVRCKSCGTTK 1753
Query: 147 PGNGLSVRWFLGYTCCDACGRLFVKGNYCPV-CLKVYRDSESTPMVCCDVCQRWVHCQCD 205
PG +W ++ C C +LF KGN+CP+ D + M+ C C WVH +C+
Sbjct: 1754 PGKSWDAQWSHDFSMCHDCAKLFAKGNFCPLCDKCYDDDDYDSKMMLCGRCNHWVHAKCE 1813
Query: 206 GISDEKYLQFQVDGNLQYRCPTC 228
++DE Y ++ Y C C
Sbjct: 1814 NLTDEMYELLSKPESVAYTCTKC 1836
>gi|339244153|ref|XP_003378002.1| putative PHD finger protein [Trichinella spiralis]
gi|316973126|gb|EFV56753.1| putative PHD finger protein [Trichinella spiralis]
Length = 864
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/201 (30%), Positives = 94/201 (46%), Gaps = 17/201 (8%)
Query: 32 NTNVMCR-LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
++ V CR +C V + G + M++C CG+ YH C N N + H W+C C
Sbjct: 208 DSYVFCRDMCVVCGSFGRGQEGHMVACTQCGQCYHTYC-ANVTLNSVIVH-RGWRCLDCT 265
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE C D + C CD +YH YC PP ++ G + C + C CG+ P NG
Sbjct: 266 VCEGCGTGDDEQHLLLCDECDVSYHMYCLDPPLDSIPQGAWRCKWCSTCQFCGAT-PPNG 324
Query: 151 L--SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 208
+ S++ C C L+ C C Y++ + ++ CD+C RW H C+G+
Sbjct: 325 MLDSIK---NLRACFKCASLYS----CCFCHLQYKEEDM--IILCDICHRWSHANCNGLC 375
Query: 209 DEKYLQFQVDGNLQYRCPTCR 229
E L+ +D + C CR
Sbjct: 376 AEDILKKGLDAG--FICVYCR 394
>gi|395520196|ref|XP_003764223.1| PREDICTED: histone-lysine N-methyltransferase MLL [Sarcophilus
harrisii]
Length = 3995
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1446 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1500
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1501 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1560
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1561 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1620
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1621 EMYEILSNLPESVAYTCVNC 1640
>gi|308199413|ref|NP_001184033.1| histone-lysine N-methyltransferase MLL isoform 1 precursor [Homo
sapiens]
Length = 3972
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1607
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1608 EMYEILSNLPESVAYTCVNC 1627
>gi|119587788|gb|EAW67384.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_e [Homo sapiens]
Length = 3972
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1607
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1608 EMYEILSNLPESVAYTCVNC 1627
>gi|297269329|ref|XP_001093874.2| PREDICTED: histone-lysine N-methyltransferase MLL [Macaca mulatta]
Length = 3986
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1447 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1501
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1502 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1561
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1562 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1621
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1622 EMYEILSNLPESVAYTCVNC 1641
>gi|390469747|ref|XP_002754504.2| PREDICTED: histone-lysine N-methyltransferase MLL [Callithrix
jacchus]
Length = 3994
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1467 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1521
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1522 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1581
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1582 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1641
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1642 EMYEILSNLPESVAYTCVNC 1661
>gi|1490271|emb|CAA93625.1| ALL-1 protein [Homo sapiens]
Length = 4005
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1466 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1520
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1521 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1580
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1581 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1640
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1641 EMYEILSNLPESVAYTCVNC 1660
>gi|395743560|ref|XP_002822597.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL [Pongo abelii]
Length = 4012
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1473 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1527
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1528 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1587
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1588 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1647
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1648 EMYEILSNLPESVAYTCVNC 1667
>gi|341940997|sp|P55200.3|MLL1_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=Zinc finger protein HRX;
Contains: RecName: Full=MLL cleavage product N320;
AltName: Full=N-terminal cleavage product of 320 kDa;
Short=p320; Contains: RecName: Full=MLL cleavage product
C180; AltName: Full=C-terminal cleavage product of 180
kDa; Short=p180
Length = 3966
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 1606
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1607 EMYEILSNLPESVAYTCVNC 1626
>gi|148693675|gb|EDL25622.1| mCG1547 [Mus musculus]
Length = 3706
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1172 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1226
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1227 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1286
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1287 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 1346
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 1347 EMYEILSNLPESVAYTCVNC 1366
>gi|395526071|ref|XP_003765195.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like
[Sarcophilus harrisii]
Length = 1005
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 85/164 (51%), Gaps = 7/164 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G K + C RC
Sbjct: 584 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRSTKHLLECERCRH 642
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG W + C +C +L+
Sbjct: 643 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGA-APGKNWDSEWSGDCSLCPSCTQLY 701
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 212
KGN+CP+C + Y D++ + M+ C C WVH +C+G+SDE Y
Sbjct: 702 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEGY 745
>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
(Silurana) tropicalis]
Length = 5215
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 81/177 (45%), Gaps = 16/177 (9%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
L C +CG+ YH CL + W+CP C++C+ C+ +GD N+ + C CD Y
Sbjct: 682 LFCTTCGQHYHGMCLDIAVTP---LKRAGWQCPDCKVCQNCKHSGDDNQMLVCDTCDKGY 738
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
H +C P +V + + C C CG+ S W L CD C + V
Sbjct: 739 HTFCLQPVMDSVPTNGWKCKNCRICTECGTRT-----SSLWHLNCLLCDPCFQQQVSLP- 792
Query: 175 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
CP+C K + M+ C VC+RW+H C EK + +D L+ Y C C+
Sbjct: 793 CPICDKPLQPELQKDMLHCHVCKRWIHLDC-----EKCTENDIDDQLKEDYACTLCK 844
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 68/146 (46%), Gaps = 10/146 (6%)
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
W+C C +CE C + DP + + C CD +YH +C PP + V G + C C +C
Sbjct: 1113 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTFCLDPPLQTVPKGGWKCKWCVSCTNCK 1172
Query: 144 SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 203
+ P GL W YT C C L + CPVC + Y + E ++ C C RW H
Sbjct: 1173 AITP--GLRCEWQNNYTQCAPCASL----SACPVCCQNYIEEEL--ILQCRQCIRWSHAS 1224
Query: 204 CDGISDEKYLQFQVDGNLQYRCPTCR 229
C ++ E ++ D + C C+
Sbjct: 1225 CQNLNTEAEVELAADSG--FDCAACK 1248
>gi|553800|gb|AAA92511.1| trithorax, partial [Homo sapiens]
Length = 1012
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 117 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 171
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 172 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 231
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 232 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 291
Query: 213 -LQFQVDGNLQYRCPTC 228
+ + ++ Y C C
Sbjct: 292 EILSNLPESVAYTCVNC 308
>gi|122937787|gb|ABM68621.1| AAEL000054-PA [Aedes aegypti]
Length = 3489
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 117/262 (44%), Gaps = 48/262 (18%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLK---NWAQ------NRDLFHWSS------ 83
LCF+ + G + +L C C + YH+ C+K N Q N L +S
Sbjct: 857 LCFLCGSSGLDE---LLFCVCCCEPYHQYCVKDEYNIRQVSLDDTNVSLLELTSTTMNAG 913
Query: 84 ------------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 129
W CP C +C C TG K C++C YH C + + +
Sbjct: 914 SSPQQQALNRFNWMCPRCTVCYTCNMATGSKVK---CQKCGKNYHTTCLGTSKRLLGADR 970
Query: 130 PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 187
P +C KC SC + +V F+G C C RL KGN+CP+C + Y D++
Sbjct: 971 PLICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEDNDFD 1024
Query: 188 TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLEDAVRE 244
M+ C C+RWVH +C+G++DE+Y + + N+++ C C EC V DAV
Sbjct: 1025 LKMMECGDCKRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNECANV--WRDAVAA 1082
Query: 245 LWRRKDMADKDLIASLRAAAGL 266
++ ++ L++ R A L
Sbjct: 1083 EFKAGLLSVVKLLSKSRQACAL 1104
>gi|403362853|gb|EJY81162.1| PHD zinc finger-containing protein [Oxytricha trifallax]
Length = 473
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 82/183 (44%), Gaps = 12/183 (6%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
+ L C C K YH C + N +L W+C C+ C+ C + D +K + C CD
Sbjct: 273 KSLKCFRCLKMYHSTCHQP-PLNTELVKRFQWECSDCKTCKNCNQNNDEDKIIICDMCDK 331
Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV-----PGNGLSVRWFLG-YTCCDACG 166
A H +C +PP + S + C C SC + GL W G Y C C
Sbjct: 332 AVHIHCLNPPLFQIPSHNWFCKDCVNCLSCDKELGPISQKSQGL---WCEGIYRMCKDCN 388
Query: 167 RLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCP 226
+GN+C VC K Y + + CD CQ W+H CDG EK + + D +Y CP
Sbjct: 389 YQLQQGNFCKVCRKSYSQDSNEDFIQCDECQDWIHAACDGFDSEKLAKMKDDE--KYSCP 446
Query: 227 TCR 229
C+
Sbjct: 447 ICK 449
>gi|380807935|gb|AFE75843.1| histone-lysine N-methyltransferase MLL4, partial [Macaca mulatta]
Length = 314
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 104 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 162
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 163 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 221
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 222 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 281
Query: 228 CRG 230
C G
Sbjct: 282 CAG 284
>gi|74189196|dbj|BAC35712.2| unnamed protein product [Mus musculus]
Length = 814
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 229
C K Y M+ C++C+RWVH +CD +D+ ++D L+ Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517
>gi|348563138|ref|XP_003467365.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Cavia
porcellus]
Length = 2692
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 86/180 (47%), Gaps = 24/180 (13%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1215 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCLK 1273
Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
+C +C SCG+ PG V W Y+ C C +L+ KG
Sbjct: 1274 -------------------ICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLYEKG 1313
Query: 173 NYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRG 230
NYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C C G
Sbjct: 1314 NYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGPCAG 1373
>gi|157103255|ref|XP_001647894.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108884726|gb|EAT48951.1| AAEL000054-PA, partial [Aedes aegypti]
Length = 3069
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 117/262 (44%), Gaps = 48/262 (18%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLK---NWAQ------NRDLFHWSS------ 83
LCF+ + G + +L C C + YH+ C+K N Q N L +S
Sbjct: 656 LCFLCGSSGLDE---LLFCVCCCEPYHQYCVKDEYNIRQVSLDDTNVSLLELTSTTMNAG 712
Query: 84 ------------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 129
W CP C +C C TG K C++C YH C + + +
Sbjct: 713 SSPQQQALNRFNWMCPRCTVCYTCNMATGSKVK---CQKCGKNYHTTCLGTSKRLLGADR 769
Query: 130 PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 187
P +C KC SC + +V F+G C C RL KGN+CP+C + Y D++
Sbjct: 770 PLICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEDNDFD 823
Query: 188 TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLEDAVRE 244
M+ C C+RWVH +C+G++DE+Y + + N+++ C C EC V DAV
Sbjct: 824 LKMMECGDCKRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNECANV--WRDAVAA 881
Query: 245 LWRRKDMADKDLIASLRAAAGL 266
++ ++ L++ R A L
Sbjct: 882 EFKAGLLSVVKLLSKSRQACAL 903
>gi|324499811|gb|ADY39929.1| Histone-lysine N-methyltransferase trr [Ascaris suum]
Length = 2347
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 87/191 (45%), Gaps = 11/191 (5%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C + + G + M+SC +C + YH C+ + W+C C +CE C
Sbjct: 376 VCLICGSIGNDIEGTMVSCATCAQSYHTFCVGLHDKLNSTVVKRGWRCLDCTVCEGCGDG 435
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC PP + + G + C C C + +P NG + G
Sbjct: 436 RDESNLLLCDECDISYHIYCLDPPLECIPHGSWRCKWCATCRRCSAQIP-NGTDTQRMEG 494
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
C+ C L CP CL++Y E ++ C C RW+H +C+ I E+ L+ +
Sbjct: 495 L--CETCYSL----RKCPKCLRLYEIGEH--IIKCQHCSRWLHGKCEEICGEEMLEAAAE 546
Query: 219 GNLQYRCPTCR 229
+RC CR
Sbjct: 547 NG--FRCSLCR 555
>gi|156353194|ref|XP_001622959.1| predicted protein [Nematostella vectensis]
gi|156209597|gb|EDO30859.1| predicted protein [Nematostella vectensis]
Length = 634
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 78/163 (47%), Gaps = 12/163 (7%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M CK C + +H CL + D SW C SC C +C G +K + C +C
Sbjct: 1 MFFCKVCSEPFHGFCLDEEPIDED-----SWCCDSCSTCVVC---GQQDKLLMCDKCQRG 52
Query: 114 YHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 170
YH C P + V G ++C + +C CGS G W +T C CG +
Sbjct: 53 YHVDCLGPSYPVVPEGSEDTWICGRCAQCKLCGSKSAGEDPEAVWMHEFTHCYDCGTAWD 112
Query: 171 KGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 212
GNYCP+C K Y D++ + M+ C+ CQ WVH C I+ ++Y
Sbjct: 113 NGNYCPICEKCYSDNDFDSKMMHCNDCQHWVHASCQNINPDEY 155
>gi|170058059|ref|XP_001864757.1| trithorax [Culex quinquefasciatus]
gi|167877298|gb|EDS40681.1| trithorax [Culex quinquefasciatus]
Length = 3165
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 72/259 (27%), Positives = 113/259 (43%), Gaps = 43/259 (16%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNW---------AQNRDLFHWSS------ 83
LCF+ + G + ML C C + YH+ C+K+ N L +S
Sbjct: 721 LCFLCGSSGLDE---MLFCVCCCEPYHQYCVKDEYNIRHASLDETNISLLELTSTTIVNS 777
Query: 84 -----------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG-P 130
W CP C +C C TG K C++C YH C + + + P
Sbjct: 778 SPAQQALNRFNWMCPRCTVCYTCNMATGTKVK---CQKCCKNYHTTCLGTSKRLLGADRP 834
Query: 131 YLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-ST 188
+C KC SC + +V F+G C C RL KGN+CP+C + Y +++
Sbjct: 835 MICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEENDFDL 888
Query: 189 PMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWR 247
M+ C CQRWVH +C+G++DE+Y + + N+++ C C DAV ++
Sbjct: 889 KMMECGDCQRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNESANVWRDAVAAEFK 948
Query: 248 RKDMADKDLIASLRAAAGL 266
++ L++ R A L
Sbjct: 949 AGLLSVVKLLSKSRQACAL 967
>gi|340710026|ref|XP_003393599.1| PREDICTED: hypothetical protein LOC100646252 [Bombus terrestris]
Length = 3530
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 105/234 (44%), Gaps = 33/234 (14%)
Query: 6 DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
D+ +++ V + G E Q +C LC G ++ C+ C + YH
Sbjct: 880 DFWEQYDPTEVGAKGFALIGSELFQ--IPAICYLC------GSAGKEPLIHCQCCCEPYH 931
Query: 66 RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPH 123
CL+ W + +W CP C IC+ C P + C RC ++H C
Sbjct: 932 AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLS--K 983
Query: 124 KNVSSG------PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
VS+ PY+C KC SCGS G++V C C +L +GNYCP+
Sbjct: 984 SGVSARLYSPERPYVCQNCVKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNYCPL 1038
Query: 178 CLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
C + Y +++ T M+ C C WVH C+GISDE+Y L + D +++ C C
Sbjct: 1039 CQRCYNENDFDTKMMECSECSYWVHAYCEGISDERYQILSYLPD-TIEFTCSQC 1091
>gi|312371947|gb|EFR20005.1| hypothetical protein AND_20789 [Anopheles darlingi]
Length = 4717
Score = 99.4 bits (246), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 112/263 (42%), Gaps = 47/263 (17%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLK--------NWAQNRDLFHWS-------- 82
LCF+ + G E ML C C + YH+ C+K N D + S
Sbjct: 1583 LCFLCGSAGLED---MLFCVCCCEPYHQYCVKDEYNLRAGNGGALDDTLNVSLLDVTLGA 1639
Query: 83 --------------SWKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHK-NV 126
+W CP C +C C TG K C++C YH C + +
Sbjct: 1640 SPQEQQQQLLLGRFNWMCPRCTVCFSCNMATGSKVK---CQKCAKYYHTTCLGTSKRLHG 1696
Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 185
+ P +C +C SC + +V F+G C C RL KGNYCP+C K Y D+
Sbjct: 1697 ADRPLICADCLRCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNYCPLCQKCYEDN 1750
Query: 186 E-STPMVCCDVCQRWVHCQCDGISDEKYLQFQV-DGNLQYRCPTCRGECYQVRDLEDAVR 243
+ M+ C C+RWVH +C+G++DE+Y V N+++ C C DAV
Sbjct: 1751 DFDLKMMECGDCRRWVHARCEGLTDEQYNMLSVLPENIEFICKKCGKHNETANMWRDAVA 1810
Query: 244 ELWRRKDMADKDLIASLRAAAGL 266
++ ++ L++ R A L
Sbjct: 1811 AEFKAGLLSVVKLLSKSRQACAL 1833
>gi|350413847|ref|XP_003490133.1| PREDICTED: hypothetical protein LOC100748492 [Bombus impatiens]
Length = 3522
Score = 99.4 bits (246), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 105/237 (44%), Gaps = 39/237 (16%)
Query: 6 DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
D+ +++ V + G E Q +C LC G ++ C+ C + YH
Sbjct: 880 DFWEQYDPTEVGAKGFALIGSELFQ--IPAICYLC------GSAGREPLIHCQCCCEPYH 931
Query: 66 RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQ---- 119
CL+ W + +W CP C IC+ C P + C RC ++H C
Sbjct: 932 AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLSKSG 985
Query: 120 -----HPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 174
+ P + PY+C KC SCGS G++V C C +L +GNY
Sbjct: 986 VSARLYSPER-----PYVCQNCVKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNY 1035
Query: 175 CPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
CP+C + Y +++ T M+ C C WVH C+GISDE+Y L + D +++ C C
Sbjct: 1036 CPLCQRCYNENDFDTKMMECSECSYWVHAYCEGISDERYQILSYLPD-TIEFTCSQC 1091
>gi|383861703|ref|XP_003706324.1| PREDICTED: uncharacterized protein LOC100882965 [Megachile rotundata]
Length = 3434
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 106/234 (45%), Gaps = 33/234 (14%)
Query: 6 DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
D+ +++ V + G E + +C LC G ++ C+ C + YH
Sbjct: 877 DFWEQYDPAEVGAKGFALIGSELFH--ISAICYLC------GSAGKEPLIHCQCCCEPYH 928
Query: 66 RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPH 123
CL+ W + +W CP C IC+ C P + C RC ++H C
Sbjct: 929 AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLS--K 980
Query: 124 KNVSSG------PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
VS+ PY+C KC SCGS G++V C C +L +GNYCP+
Sbjct: 981 SGVSARLYSPERPYVCQSCVKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNYCPL 1035
Query: 178 CLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
C + Y +++ T M+ C C WVH +C+G+SDE+Y L + D +++ C C
Sbjct: 1036 CQRCYNENDFDTKMMECSECSYWVHARCEGLSDERYQILSYLPDS-IEFTCSQC 1088
>gi|328778088|ref|XP_392252.4| PREDICTED: histone-lysine N-methyltransferase trithorax [Apis
mellifera]
Length = 3195
Score = 99.0 bits (245), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 96/204 (47%), Gaps = 31/204 (15%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 93
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 780 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 829
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 147
C P + C RC ++H C VS+ PY+C KC SCGS
Sbjct: 830 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGVSARLYSPERPYVCQSCVKCKSCGSE-- 883
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 206
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH QC+G
Sbjct: 884 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSYWVHAQCEG 940
Query: 207 ISDEKY--LQFQVDGNLQYRCPTC 228
+SDE+Y L + D +++ C C
Sbjct: 941 LSDERYQILSYLPD-TIEFTCSQC 963
>gi|47221226|emb|CAG13162.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3783
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 91/202 (45%), Gaps = 16/202 (7%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
V+C LC N + C+ C + +H CL R L + +W C CR C+
Sbjct: 1448 VLCFLCASSGNV------EFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQ 1499
Query: 94 ICRRTGDPNK--FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPG 148
C R K + C +C +YH C P H + ++C +C CG+ PG
Sbjct: 1500 ACGRQHQKTKQQLLECDKCRNSYHPECLGPSHPTRPTKKKRVWVCNNCVRCKCCGATKPG 1559
Query: 149 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGI 207
+W ++ C C +LF K N+C +C K Y D E+ M+ C C VH +C+ +
Sbjct: 1560 KSWDAQWSHDFSMCHDCAKLFAKRNFCHICTKCYEDDEADAKMIECGRCHHRVHAKCEKL 1619
Query: 208 SDEKY-LQFQVDGNLQYRCPTC 228
+D+ Y L ++ ++ Y C C
Sbjct: 1620 TDDMYELLSKLPESVAYTCTKC 1641
>gi|431895734|gb|ELK05153.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 921
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 79/156 (50%), Gaps = 10/156 (6%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 294 CTTCGQHYHGMCLDVAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 350
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 176
+C P K+V + + C C +CG+ S +W CD+C + + N CP
Sbjct: 351 FCLQPVMKSVPTNGWKCKNCRICVACGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 403
Query: 177 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
C K Y M+ C++C+RWVH +CD +D +
Sbjct: 404 FCGKSYHPELQKDMLHCNMCKRWVHLECDKPADHEL 439
>gi|121483956|gb|ABM54289.1| MLL [Pan paniscus]
Length = 523
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 91/180 (50%), Gaps = 11/180 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 71 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 125
Query: 98 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 126 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 185
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 186 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 245
>gi|402592532|gb|EJW86460.1| hypothetical protein WUBG_02629, partial [Wuchereria bancrofti]
Length = 2207
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 88/191 (46%), Gaps = 14/191 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
LC + + G + M++C +C + YH C+ + W+C C +CE C
Sbjct: 268 LCLICGSIGKDAEGTMVTCVTCSQSYHTYCVGLHDKLNSTIVRRGWRCLDCTVCEGCGDG 327
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC PP + + GP+ C + C CG+ + N + F
Sbjct: 328 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRCGNQI-FNVTDNQNF-- 384
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
C+ C L CP CL++Y ++ ++ C C RW+H +C+ + E+ F+
Sbjct: 385 ---CETCFTL----RKCPKCLRLYEIGDN--IIKCQHCARWLHGKCEELYGEE--MFETA 433
Query: 219 GNLQYRCPTCR 229
+RC CR
Sbjct: 434 SENGFRCSLCR 444
>gi|410910074|ref|XP_003968515.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Takifugu rubripes]
Length = 4478
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 90/201 (44%), Gaps = 15/201 (7%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
V+C LC N + C+ C + +H CL R L + +W C CR C+
Sbjct: 1668 VLCFLCASSGNV------EFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQ 1719
Query: 94 ICRRTGDPNKFMF-CRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
C R K + C +C +YH C HP ++C K +C CG+ PG
Sbjct: 1720 ACGRQHQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCNKCVRCKCCGATKPGK 1779
Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
+W ++ C C +LF K N C +C K Y D E+ MV C C VH +C+ ++
Sbjct: 1780 SWDAQWSHDFSMCHDCAKLFAKRNICVLCNKCYEDDEADGKMVECGRCHHRVHAKCEKLT 1839
Query: 209 DEKY-LQFQVDGNLQYRCPTC 228
D+ Y L ++ ++ Y C C
Sbjct: 1840 DDMYELLSKLPESVAYTCTKC 1860
>gi|170581736|ref|XP_001895813.1| F/Y-rich N-terminus family protein [Brugia malayi]
gi|158597106|gb|EDP35332.1| F/Y-rich N-terminus family protein [Brugia malayi]
Length = 2144
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 86/194 (44%), Gaps = 20/194 (10%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
LC + + G M++C +C + YH C+ + W+C C +CE C
Sbjct: 166 LCLICGSIGKGAEGTMVTCVTCSQSYHTYCVGLHDKLNSTIVRRGWRCLDCTVCEGCGDG 225
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG---SNVPGNGLSVRW 155
D + + C CD +YH YC PP + + GP+ C + C CG SN+ N
Sbjct: 226 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRCGNQISNITDN------ 279
Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 215
C+ C L CP CL++Y ++ ++ C C RW+H +C+ + E+ F
Sbjct: 280 ---QNFCETCFTL----RKCPKCLRLYEIGDN--IIKCQHCARWLHGKCEELYGEE--MF 328
Query: 216 QVDGNLQYRCPTCR 229
+ +RC CR
Sbjct: 329 ETASENGFRCSLCR 342
>gi|3309543|gb|AAC41377.1| MLL [Takifugu rubripes]
Length = 4498
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 90/202 (44%), Gaps = 16/202 (7%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICE 93
V+C LC N + C+ C + +H CL R L + +W C CR C+
Sbjct: 1668 VLCFLCASSGNV------EFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQ 1719
Query: 94 ICRRTGDPNK--FMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
C R K + C +C +YH C HP ++C K +C CG+ PG
Sbjct: 1720 ACGRQHQKTKQQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCNKCVRCKCCGATKPG 1779
Query: 149 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGI 207
+W ++ C C +LF K N C +C K Y D E+ MV C C VH +C+ +
Sbjct: 1780 KSWDAQWSHDFSMCHDCAKLFAKRNICVLCNKCYEDDEADGKMVECGRCHHRVHAKCEKL 1839
Query: 208 SDEKY-LQFQVDGNLQYRCPTC 228
+D+ Y L ++ ++ Y C C
Sbjct: 1840 TDDMYELLSKLPESVAYTCTKC 1861
>gi|341878859|gb|EGT34794.1| CBN-SET-16 protein [Caenorhabditis brenneri]
Length = 2498
Score = 96.7 bits (239), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 93/204 (45%), Gaps = 17/204 (8%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M++C +C + YH C+ + W+C C ICE C + GD M C CD
Sbjct: 447 MIACLNCAQTYHTYCVLLHEKINSAIMTHGWRCLDCTICEGCGKGGDDKNLMLCDECDVP 506
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV-PGNGLSVRWFLGYTCCDACGRLFVKG 172
YH YC PP + V +G + C ++C C V G+ L+ R C C L
Sbjct: 507 YHTYCLKPPIEKVPTGSWRCQWCSRCRRCNHKVSSGSELTARGL-----CHPCDSL---- 557
Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI-SDEKYLQFQVDGNLQYRCPTCRGE 231
+C C + Y+ ++ ++ C C +W H +C+G+ +DE+ Q ++ + RC CR
Sbjct: 558 QHCARCRQGYQLNDK--LIRCSQCNKWEHGRCEGLYTDEQLEQAALN---RMRCAACRPN 612
Query: 232 CYQVRDLEDAVRELWRRKDMADKD 255
+ L D V +W DKD
Sbjct: 613 RIRNNGLSD-VDTVWCDSIALDKD 635
>gi|307180358|gb|EFN68384.1| Histone-lysine N-methyltransferase trithorax [Camponotus
floridanus]
Length = 3218
Score = 96.7 bits (239), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 106/234 (45%), Gaps = 33/234 (14%)
Query: 6 DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
D+ +++ V + G E + +C LC G ++ C+ C + YH
Sbjct: 751 DFWEQYDPSEVGAKGFALIGSELF--HIPAICYLC------GSAGKEPLIHCQCCCEPYH 802
Query: 66 RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPH 123
CL+ W + +W CP C IC+ C P + C RC ++H C
Sbjct: 803 AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLS--K 854
Query: 124 KNVSSG------PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
VSS PY+C KC SCGS G++V C C +L +GNYCP+
Sbjct: 855 SGVSSRLYNPDRPYVCQSCIKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNYCPL 909
Query: 178 CLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTC 228
C + Y +++ T M+ C C WVH +C+G+SDE+Y L + D +++ C C
Sbjct: 910 CQRCYNENDFDTKMMECSECSCWVHARCEGLSDERYQILSYLPDS-IEFTCSQC 962
>gi|392897209|ref|NP_499819.3| Protein SET-16 [Caenorhabditis elegans]
gi|316891988|emb|CAB03348.3| Protein SET-16 [Caenorhabditis elegans]
Length = 2475
Score = 96.3 bits (238), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 65/235 (27%), Positives = 99/235 (42%), Gaps = 27/235 (11%)
Query: 33 TNVMCR----------LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS 82
T ++CR +C V + G M++C +C + YH C+ +
Sbjct: 411 TAIVCRVNDEFLQKACMCLVCGSIGKGPEGSMVACSNCAQTYHTYCVTLHDKLNSAVVGR 470
Query: 83 SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 142
W+C C +CE C GD + C CD +YH YC P + GP+ C ++C C
Sbjct: 471 GWRCLDCTVCEGCGTGGDEANLLLCDECDVSYHIYCMKPLLDKIPQGPWRCQWCSRCRRC 530
Query: 143 GSNVP-GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVH 201
GN L+ + C C L CP C + Y+ +E ++ C C +W H
Sbjct: 531 NHKAASGNDLTSQGL-----CFPCASL----RKCPRCERNYQLNEK--LIRCSQCSKWQH 579
Query: 202 CQCDGI-SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 255
C+G+ +DE+ Q +D + RC CR + Q D V +W DKD
Sbjct: 580 GACEGLYTDEQLEQAAID---RMRCSACRPKRVQPSGFSD-VDTVWCDYVALDKD 630
>gi|312071355|ref|XP_003138570.1| F/Y-rich family protein [Loa loa]
Length = 1597
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 14/199 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
LC + + G + M++C +C + YH C+ + W+C C +CE C
Sbjct: 273 LCLICGSIGKDVEGTMVTCVTCSQSYHTYCVGLHDKLNSTLIKRGWRCLDCTVCEGCGDG 332
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC PP + + GP+ C + C C SN N + F
Sbjct: 333 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRC-SNQISNIADNQHF-- 389
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
C+ C L CP CL+ Y +S ++ C C RW+H +C+ + ++ F+
Sbjct: 390 ---CETCFTL----RRCPKCLRFYEIGDS--IIKCQHCARWLHGKCEELYGDE--MFETA 438
Query: 219 GNLQYRCPTCRGECYQVRD 237
+RC CR + V D
Sbjct: 439 SENGFRCSLCRPQGNVVGD 457
>gi|393908177|gb|EJD74941.1| F/Y-rich family protein [Loa loa]
Length = 2288
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 86/191 (45%), Gaps = 14/191 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
LC + + G + M++C +C + YH C+ + W+C C +CE C
Sbjct: 296 LCLICGSIGKDVEGTMVTCVTCSQSYHTYCVGLHDKLNSTLIKRGWRCLDCTVCEGCGDG 355
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D + + C CD +YH YC PP + + GP+ C + C C SN N + F
Sbjct: 356 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRC-SNQISNIADNQHF-- 412
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 218
C+ C L CP CL+ Y +S ++ C C RW+H +C+ + ++ F+
Sbjct: 413 ---CETCFTL----RRCPKCLRFYEIGDS--IIKCQHCARWLHGKCEELYGDE--MFETA 461
Query: 219 GNLQYRCPTCR 229
+RC CR
Sbjct: 462 SENGFRCSLCR 472
>gi|432880997|ref|XP_004073754.1| PREDICTED: uncharacterized protein LOC101157226 [Oryzias latipes]
Length = 2812
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 85/182 (46%), Gaps = 7/182 (3%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCDA 112
M+ C+ C + +H CL ++ +W C C+ C +C RR+ + CRRC
Sbjct: 1050 MIFCQICCEPFHSFCLTPEECPQE-DSKENWCCRRCKFCHVCGRRSKSAKPVLQCRRCQT 1108
Query: 113 AYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 170
YH C P P S P++C +C SCG PG + W C C L
Sbjct: 1109 CYHPSCLGPTYPKPVNCSLPWVCMTCIRCKSCGVT-PGKTWDLAWNHEQDLCPECTILNK 1167
Query: 171 KGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEK-YLQFQVDGNLQYRCPTC 228
KG++C VC K Y D S M+ C C+ W+H +C+G+S+E L + + + C C
Sbjct: 1168 KGHFCTVCQKCYEDGSRPLQMIQCSECRHWIHPKCEGLSEELCGLMSTLPDSGGFTCTPC 1227
Query: 229 RG 230
RG
Sbjct: 1228 RG 1229
>gi|332025910|gb|EGI66066.1| Histone-lysine N-methyltransferase trithorax [Acromyrmex echinatior]
Length = 3452
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 97/216 (44%), Gaps = 30/216 (13%)
Query: 6 DYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYH 65
D+ +++ V + G E +C LC G ++ C+ C + YH
Sbjct: 875 DFWEQYDPSEVGAKGFALIGSELFH--IPAICYLC------GSAGKEPLIHCQCCCEPYH 926
Query: 66 RNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPH 123
CL+ W + +W CP C IC+ C P + C RC ++H C
Sbjct: 927 AFCLEPSEW----NACAQPNWCCPRCTICQSCHLRSGPK--LSCIRCRQSFHHSCLS--K 978
Query: 124 KNVSSG------PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
VSS PY+C KC SCGS G++V C C +L +GNYCP+
Sbjct: 979 SGVSSRLYSPDRPYVCQSCIKCKSCGSE----GVNVH-VGNLPLCSMCFKLRQQGNYCPL 1033
Query: 178 CLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 212
C + Y +++ T M+ C C WVH +C+G+SDE+Y
Sbjct: 1034 CQRCYNENDFDTKMMECSECSCWVHARCEGLSDERY 1069
>gi|348533938|ref|XP_003454461.1| PREDICTED: hypothetical protein LOC100700132 [Oreochromis niloticus]
Length = 2924
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 69/142 (48%), Gaps = 5/142 (3%)
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPG 148
+C +C G + C RC YH C P P +N ++C T+C SCG PG
Sbjct: 1411 VCFLCASKGQHEPLLECERCQNCYHASCLGPNYPKQNKKRKAWVCMTCTRCKSCGV-TPG 1469
Query: 149 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGI 207
W C C +L+ +GNYCP+C K Y D++ + M+ C C WVH +C+ +
Sbjct: 1470 KSWDTEWNHDKGLCPDCSKLYDQGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDL 1529
Query: 208 SDEKY-LQFQVDGNLQYRCPTC 228
+DE Y + + ++ Y C C
Sbjct: 1530 TDELYEILSSLPESVVYSCRPC 1551
>gi|355732606|gb|AES10757.1| histone-lysine N-methyltransferase MLL4-like protein [Mustela
putorius furo]
Length = 212
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 11 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 69
Query: 113 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +LF
Sbjct: 70 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 128
Query: 170 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 227
KGN+CP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 129 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 188
Query: 228 CRG 230
C G
Sbjct: 189 CAG 191
>gi|60360484|dbj|BAD90486.1| mKIAA4050 protein [Mus musculus]
Length = 762
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 26 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 80
Query: 98 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 81 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 140
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 141 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 200
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 201 EMYEILSNLPESVAYTCVNC 220
>gi|326430870|gb|EGD76440.1| mixed-lineage leukemia protein [Salpingoeca sp. ATCC 50818]
Length = 2027
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 103/264 (39%), Gaps = 24/264 (9%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHW----SSWKCPSCRICEICRRTGDPNKFMFCRR 109
+ C+ C + YHR C + D W W C C+ C C + + + C +
Sbjct: 641 FVHCRVCCQPYHRFCTGDSRLQSDE-RWRDAAEQWMCIDCQTCCACGQAEPRSTLLTCAK 699
Query: 110 CDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
C H +C P Y+C C CGS PG W + C+ C
Sbjct: 700 CTRHIHSHCAGDGVPQGLKRDDVYMCSACVVCTKCGSTSPGEFKGSTWHCQFELCEECHG 759
Query: 168 LFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQC--DGISDEKYLQFQVDG--NLQ 222
+KGN CP+C K+Y D + TPM CC+ C+RW+H C L + + G +
Sbjct: 760 QTLKGNICPMCDKLYSDDDFDTPMFCCEKCERWLHATCVDPAFKTNMELYYLISGVQTID 819
Query: 223 YRCPTC----RGECYQVRDLEDAVRELWRRK------DMADKDLIASLRAAAGLPTEDEI 272
Y C C +G L A + ++ + DM ++ +L A + P D
Sbjct: 820 YHCQDCVRLSKGVLPDFAALWKATQAAFQERLVGLVEDMKQQEGYTALAAVSPAPVRDHT 879
Query: 273 F--SISPYSDDEENGPVVLKNEFG 294
+ + SDD P +E G
Sbjct: 880 SDDTRTRASDDTRTRPSTGTDEDG 903
>gi|195064789|ref|XP_001996640.1| GH19675 [Drosophila grimshawi]
gi|193892772|gb|EDV91638.1| GH19675 [Drosophila grimshawi]
Length = 3837
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 66/245 (26%), Positives = 105/245 (42%), Gaps = 48/245 (19%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNC----------------------------LK 70
LCF+ + G + ++ C C + YH+ C +
Sbjct: 1253 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFDDTLMNSLLETSSISAIT 1309
Query: 71 NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 129
N A + L +W CP C +C C + + C++C YH C + + +
Sbjct: 1310 NTALTQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGADR 1366
Query: 130 PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 187
P +C KC SC + V F+G C AC +L KGN+CP+C K Y D++
Sbjct: 1367 PLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDDNDFD 1420
Query: 188 TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELW 246
M+ C C +WVH +C+G+SDE+Y L + ++++ C C C D+ E W
Sbjct: 1421 LKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRC----DVSRNKAEEW 1476
Query: 247 RRKDM 251
R+ M
Sbjct: 1477 RQAVM 1481
>gi|1042097|gb|AAB34770.1| trx Zinc-finger region homolog [Homo sapiens]
Length = 366
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 92/183 (50%), Gaps = 9/183 (4%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRC 110
+ + C+ C + +H+ CL+ R L +W C C+ C +C R K + C +C
Sbjct: 156 KFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKC 213
Query: 111 DAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
+YH C P + + ++C K +C SCGS PG G +W ++ C C +
Sbjct: 214 RNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAK 273
Query: 168 LFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRC 225
LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + + + Y C
Sbjct: 274 LFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPECVAYTC 333
Query: 226 PTC 228
C
Sbjct: 334 VNC 336
>gi|322792358|gb|EFZ16342.1| hypothetical protein SINV_07789 [Solenopsis invicta]
Length = 3272
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 88/186 (47%), Gaps = 28/186 (15%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 93
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 856 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 905
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 147
C P + C RC ++H C +SS PY+C +C SCGS
Sbjct: 906 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGISSRLYSPDRPYVCQSCIRCKSCGSE-- 959
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 206
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH +C+G
Sbjct: 960 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSCWVHARCEG 1016
Query: 207 ISDEKY 212
+SDE+Y
Sbjct: 1017 LSDERY 1022
>gi|195392284|ref|XP_002054789.1| trx [Drosophila virilis]
gi|194152875|gb|EDW68309.1| trx [Drosophila virilis]
Length = 3822
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 114/268 (42%), Gaps = 52/268 (19%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 69
LCF+ + G + ++ C C + YH+ C+
Sbjct: 1273 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMNSLLETSNNACAI 1329
Query: 70 ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
N A N+ L +W CP C +C C + + C++C YH C + +
Sbjct: 1330 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1386
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C AC +L KGN+CP+C K Y D
Sbjct: 1387 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1440
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLED-- 240
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C C R+ D
Sbjct: 1441 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRNKADEW 1500
Query: 241 --AVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1501 RQAVMEEFKSSLYSVLKLLSKSRQACAL 1528
>gi|10720313|sp|Q24742.1|TRX_DROVI RecName: Full=Histone-lysine N-methyltransferase trithorax
gi|899254|emb|CAA90349.1| predicted trithorax protein [Drosophila virilis]
Length = 3828
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 114/268 (42%), Gaps = 52/268 (19%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 69
LCF+ + G + ++ C C + YH+ C+
Sbjct: 1253 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMTSLLETSNNACAI 1309
Query: 70 ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
N A N+ L +W CP C +C C + + C++C YH C + +
Sbjct: 1310 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1366
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C AC +L KGN+CP+C K Y D
Sbjct: 1367 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1420
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLED-- 240
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C C R+ D
Sbjct: 1421 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRNKADEW 1480
Query: 241 --AVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1481 RQAVMEEFKSSLYSVLKLLSKSRQACAL 1508
>gi|195446233|ref|XP_002070689.1| GK10889 [Drosophila willistoni]
gi|194166774|gb|EDW81675.1| GK10889 [Drosophila willistoni]
Length = 3189
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 116/267 (43%), Gaps = 53/267 (19%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1155 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLDTSVN 1208
Query: 83 ---------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVS 127
+W CP C +C C + + C++C YH C + +
Sbjct: 1209 PSTCAPSMNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHATCLGTSKRLLG 1266
Query: 128 SG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 185
+ P +C KC SC + V F+G C AC +L KGNYCP+C K Y D+
Sbjct: 1267 ADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTACFKLRKKGNYCPICQKCYDDN 1320
Query: 186 E-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE-- 239
+ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E ++ E
Sbjct: 1321 DFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSHIKADEWR 1380
Query: 240 DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1381 QAVMEEFKSSLYSVLKLLSKSRQACAL 1407
>gi|195109821|ref|XP_001999480.1| GI24532 [Drosophila mojavensis]
gi|193916074|gb|EDW14941.1| GI24532 [Drosophila mojavensis]
Length = 3756
Score = 92.4 bits (228), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 105/249 (42%), Gaps = 52/249 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 69
LCF+ + G + ++ C C + YH+ C+
Sbjct: 1234 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMNSLLDTSSNACAI 1290
Query: 70 ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
N A N+ L +W CP C +C C + + C++C YH C + +
Sbjct: 1291 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1347
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C AC +L KGN+CP+C K Y D
Sbjct: 1348 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1401
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAV 242
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C C R D
Sbjct: 1402 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRQKADE- 1460
Query: 243 RELWRRKDM 251
WR+ M
Sbjct: 1461 ---WRQAVM 1466
>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
Length = 4387
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 93/200 (46%), Gaps = 26/200 (13%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+ +LCF+ + G E+ ML C SC + YH C + SW C C C
Sbjct: 855 TLAKLCFLCGSAGREK---MLVCSSCCEWYHVWCAEEAG------GGGSWTCARCVWCAA 905
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYC--QHPP-HKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
C R + CR C YH C PP H+ S P +C KC SC SN
Sbjct: 906 CARPA---ARLRCRSCARPYHAACLPSAPPDHR--SDWPQICSSCLKCKSCDSN------ 954
Query: 152 SVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
V F+G C C +L KGNYCP+C YRD++ + M+ C C RWVH C+G+S
Sbjct: 955 RVNKFVGSLPFCRPCFKLRQKGNYCPLCQACYRDNDFDSKMMECGWCARWVHASCEGLSG 1014
Query: 210 EKY-LQFQVDGNLQYRCPTC 228
E Y L + +++Y C C
Sbjct: 1015 EGYQLLSALPPSIEYICCKC 1034
>gi|47219426|emb|CAG10790.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1776
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 76/160 (47%), Gaps = 8/160 (5%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCD 111
M+ C+ C + +H CL + R L + +W C C+ C +C RR+ + + CRRC
Sbjct: 184 MIFCQICCEPFHSFCL--LPEERPLKDNKENWCCRRCKFCHVCGRRSKNTKPVLQCRRCQ 241
Query: 112 AAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
+YH C P P P++C +C SCG PG + W C C L
Sbjct: 242 TSYHPACLGPTYPKPMNCKIPWVCMTCIRCKSCGV-TPGKTWDLAWNHDEDLCPDCTLLH 300
Query: 170 VKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGIS 208
KGN+C +C K Y D+ M+ C C W+H C+GIS
Sbjct: 301 KKGNFCTICHKCYDDNMRHAEMIQCSACNHWIHYSCEGIS 340
>gi|167526642|ref|XP_001747654.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773758|gb|EDQ87394.1| predicted protein [Monosiga brevicollis MX1]
Length = 1547
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 77/158 (48%), Gaps = 13/158 (8%)
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHS 141
W+C +C+ C++C R + C C H C P +++ ++C +C
Sbjct: 633 WQCKNCQTCDVCTRIEPTQHLLSCDVCGVHRHAACASAATPSYMLATQRWVCTDCVQCEH 692
Query: 142 CG-SNVPGNGLSVR------WFLGYTCCDACGRLFVKGNYCPVCLKVYR-DSESTPMVCC 193
CG ++V G+ + W + C CG ++GN+CPVC K YR D MV C
Sbjct: 693 CGATDVRGHRPDPKLREEPTWQCDFRLCFDCGLNKLRGNFCPVCGKTYRGDDYDVKMVGC 752
Query: 194 DVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTCR 229
D C RW+H +CD I + +Y L F V ++ Y CP CR
Sbjct: 753 DRCDRWLHAECDDIDEARYHLLTF-VPSSMSYFCPDCR 789
>gi|349803513|gb|AEQ17229.1| putative trx zinc-finger region [Pipa carvalhoi]
Length = 506
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 76/160 (47%), Gaps = 29/160 (18%)
Query: 85 KCPSCRI---CEICRRTGDPNKF-----------MFCRRCDAAYHCYCQHPPHKNVSSGP 130
+CP C++ C +C D KF M C +C +N
Sbjct: 312 QCPGCQVPDDCGVCTNCLDKPKFGGRNIKKQCCKMECNKC-------------RNKKKRV 358
Query: 131 YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STP 189
++C K +C SCGS PG G +W ++ C C +LF KGN+CP+C K Y D + +
Sbjct: 359 WICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESK 418
Query: 190 MVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 228
M+ C C RWVH +C+ ++DE Y + + ++ Y C C
Sbjct: 419 MMQCGKCDRWVHSKCESLTDEMYEILSNLPESVAYTCINC 458
>gi|444725290|gb|ELW65863.1| Histone-lysine N-methyltransferase MLL [Tupaia chinensis]
Length = 3806
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 103 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
+ + C +C +YH C P + + ++C K +C SCGS PG G +W +
Sbjct: 1399 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 1458
Query: 160 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
+ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + +
Sbjct: 1459 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 1518
Query: 218 DGNLQYRCPTC 228
++ Y C C
Sbjct: 1519 PESVAYTCVNC 1529
>gi|426370676|ref|XP_004052287.1| PREDICTED: histone-lysine N-methyltransferase MLL [Gorilla gorilla
gorilla]
Length = 3837
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 103 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
+ + C +C +YH C P + + ++C K +C SCGS PG G +W +
Sbjct: 1363 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 1422
Query: 160 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
+ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + +
Sbjct: 1423 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 1482
Query: 218 DGNLQYRCPTC 228
++ Y C C
Sbjct: 1483 PESVAYTCVNC 1493
>gi|351705860|gb|EHB08779.1| Histone-lysine N-methyltransferase HRX [Heterocephalus glaber]
Length = 3899
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 103 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
+ + C +C +YH C P + + ++C K +C SCGS PG G +W +
Sbjct: 1437 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 1496
Query: 160 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
+ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + +
Sbjct: 1497 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 1556
Query: 218 DGNLQYRCPTC 228
++ Y C C
Sbjct: 1557 PESVAYTCINC 1567
>gi|308497100|ref|XP_003110737.1| CRE-SET-16 protein [Caenorhabditis remanei]
gi|308242617|gb|EFO86569.1| CRE-SET-16 protein [Caenorhabditis remanei]
Length = 2509
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 96/225 (42%), Gaps = 37/225 (16%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCL--------------------KNWAQNRDL 78
LC V + G M++C +C + YH C+ + + N +
Sbjct: 437 LCLVCGSIGKGVEASMVACSNCAQTYHTYCISLHDKVGSQMIIKMLSTIIFQKFQLNSAV 496
Query: 79 FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTK 138
W+C C CE C GD K + C CD +YH YC PP + + GP+ C ++
Sbjct: 497 IT-RGWRCLDCTFCEGCGAGGDEEKLLLCEECDVSYHMYCIKPPLEAIPKGPWRCQWCSR 555
Query: 139 CHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
C C + GN L+ + C +C L C C + Y+ +E ++ C C
Sbjct: 556 CRRCNHKSTSGNDLTSKGL-----CHSCQSL----QSCSRCNRGYQLNEK--IIKCSQCS 604
Query: 198 RWVHCQCDGI-SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 241
+W H C+G+ +DE+ Q ++ + RC +CR Q+ +A
Sbjct: 605 KWHHGFCEGLHTDEQLEQAALN---RMRCTSCRPNRAQINGFSEA 646
>gi|195329040|ref|XP_002031219.1| GM25861 [Drosophila sechellia]
gi|194120162|gb|EDW42205.1| GM25861 [Drosophila sechellia]
Length = 3603
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321
Query: 83 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
+W CP C +C C + + C++C YH C + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493
Query: 240 -DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521
>gi|899268|emb|CAA58584.1| ALL-1 protein [Homo sapiens]
Length = 395
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 88/176 (50%), Gaps = 11/176 (6%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 222 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 276
Query: 98 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 277 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 336
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S
Sbjct: 337 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLS 392
>gi|195501651|ref|XP_002097884.1| GE26459 [Drosophila yakuba]
gi|194183985|gb|EDW97596.1| GE26459 [Drosophila yakuba]
Length = 2853
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1283 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1336
Query: 83 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
+W CP C +C C + + C++C YH C + +
Sbjct: 1337 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1394
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1395 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1448
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1449 NDFDLKMMECGDCAQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1508
Query: 240 -DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1509 RQAVMEEFKASLYSVLKLLSKSRQACAL 1536
>gi|194900731|ref|XP_001979909.1| GG21380 [Drosophila erecta]
gi|190651612|gb|EDV48867.1| GG21380 [Drosophila erecta]
Length = 3741
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 116/269 (43%), Gaps = 55/269 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1281 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1334
Query: 83 -----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKN 125
+W CP C +C C + + C++C YH C +
Sbjct: 1335 ASTGPSASSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRL 1392
Query: 126 VSSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYR 183
+ + P +C KC SC + V F+G C C +L KGN+CP+C + Y
Sbjct: 1393 LGADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYD 1446
Query: 184 DSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE 239
D++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1447 DNDFDLKMMECGDCAQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSRIKAEE 1506
Query: 240 --DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1507 WRQAVMEEFKASLYSVLKLLSKSRQACAL 1535
>gi|469800|emb|CAA83516.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052593|emb|CAA90513.1| trithorax protein trxII [Drosophila melanogaster]
gi|1311653|gb|AAB35873.1| large trx isoform=trithorax gene product large isoform {alternatively
spliced, exon II-containing isoform} [Drosophila,
embryos, Peptide, 3726 aa]
Length = 3726
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321
Query: 83 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
+W CP C +C C + + C++C YH C + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493
Query: 240 -DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521
>gi|17136556|ref|NP_476769.1| trithorax, isoform D [Drosophila melanogaster]
gi|19550184|ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster]
gi|290457684|sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName:
Full=Lysine N-methyltransferase 2A
gi|10726522|gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster]
gi|23171244|gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster]
Length = 3726
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321
Query: 83 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
+W CP C +C C + + C++C YH C + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493
Query: 240 -DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521
>gi|326676507|ref|XP_689347.4| PREDICTED: hypothetical protein LOC560856 [Danio rerio]
Length = 1761
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 98/204 (48%), Gaps = 24/204 (11%)
Query: 36 MCRLCFV-GENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCR 90
+C LC G++E M+ C+ C + +H CL + +N++ +W C C+
Sbjct: 1098 VCLLCASKGQHE-------MVYCQMCCEPFHHFCLPPDDRPKEENKE-----NWCCRRCK 1145
Query: 91 ICEIC-RRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +C R++ + C+RC YH C P P + ++C +C SCG P
Sbjct: 1146 FCHVCGRKSKQTKPVLQCKRCMYCYHPSCLGPTYPKPVKMNTSWVCMLCIRCKSCGVT-P 1204
Query: 148 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 206
G W C C +L KG++C VCLK Y + E + M+ C C WVH +C+G
Sbjct: 1205 GKSSDTSWNHELELCPDCNKLHSKGDFCTVCLKCYEEHEFDSSMMQCARCAHWVHPKCEG 1264
Query: 207 ISDEKY-LQFQVDG-NLQYRCPTC 228
++D+ + + ++ G +L + C C
Sbjct: 1265 LTDDLHEILCRLRGKSLVFSCAPC 1288
>gi|158818|gb|AAA29025.1| zinc-binding protein [Drosophila melanogaster]
Length = 3759
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321
Query: 83 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
+W CP C +C C + + C++C YH C + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493
Query: 240 -DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521
>gi|198452207|ref|XP_002137435.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198131831|gb|EDY67993.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 3779
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 112/261 (42%), Gaps = 54/261 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1191 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLETSLS 1244
Query: 83 -------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
+W CP C +C C + + C++C YH C + + +
Sbjct: 1245 GAHSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGAD 1302
Query: 130 -PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE- 186
P +C KC SC + V F+G C C +L KGNYCP+C K Y D++
Sbjct: 1303 RPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNYCPICQKCYDDNDF 1356
Query: 187 STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVREL 245
M+ C C +WVH +C+G+SDE+Y L + ++++ C C RD +
Sbjct: 1357 DLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKRCAR-----RDSSRLKADE 1411
Query: 246 WRRKDMADKDLIASLRAAAGL 266
WR+ M ++ ASL + L
Sbjct: 1412 WRQAVM--EEFKASLYSVLKL 1430
>gi|194764639|ref|XP_001964436.1| GF23177 [Drosophila ananassae]
gi|190614708|gb|EDV30232.1| GF23177 [Drosophila ananassae]
Length = 3708
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 117/267 (43%), Gaps = 53/267 (19%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1266 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEETTLMGSLLETSVT 1319
Query: 83 ---------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVS 127
+W CP C +C C + + C++C YH C + +
Sbjct: 1320 AVGSSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLG 1377
Query: 128 SG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 185
+ P +C KC SC + V F+G C C +L KGN+CP+C K Y D+
Sbjct: 1378 ADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTNCFKLRKKGNFCPICQKCYDDN 1431
Query: 186 E-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE-- 239
+ M+ C C++WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1432 DFDLKMMECGDCRQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKMKAEEWR 1491
Query: 240 DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1492 QAVMEEFKVSLYSVLKLLSKSRQACAL 1518
>gi|350400841|ref|XP_003485981.1| PREDICTED: hypothetical protein LOC100740971 [Bombus impatiens]
Length = 2805
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 77/188 (40%), Gaps = 23/188 (12%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 2585 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2643
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
D +K +FC CD YH YC + V G + C + C +CGS PG S R
Sbjct: 2644 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCVNCGSREPGGINSDRNS 2701
Query: 155 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVY---RDSESTPMVCCDVC 196
W Y T C C +L+ KG YCP C + + R +V C C
Sbjct: 2702 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEVNLVHCSAC 2761
Query: 197 QRWVHCQC 204
+++H C
Sbjct: 2762 DKYLHLGC 2769
>gi|17136558|ref|NP_476770.1| trithorax, isoform B [Drosophila melanogaster]
gi|19550181|ref|NP_599108.1| trithorax, isoform C [Drosophila melanogaster]
gi|62472551|ref|NP_001014621.1| trithorax, isoform E [Drosophila melanogaster]
gi|23171245|gb|AAN13600.1| trithorax, isoform B [Drosophila melanogaster]
gi|23171246|gb|AAN13601.1| trithorax, isoform C [Drosophila melanogaster]
gi|61679333|gb|AAX52951.1| trithorax, isoform E [Drosophila melanogaster]
Length = 3358
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 900 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 953
Query: 83 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
+W CP C +C C + + C++C YH C + +
Sbjct: 954 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1011
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1012 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1065
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1066 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1125
Query: 240 -DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1126 RQAVMEEFKASLYSVLKLLSKSRQACAL 1153
>gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052594|emb|CAA90514.1| trithorax protein trxI [Drosophila melanogaster]
Length = 3358
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 900 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 953
Query: 83 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 126
+W CP C +C C + + C++C YH C + +
Sbjct: 954 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1011
Query: 127 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 184
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1012 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1065
Query: 185 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 239
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1066 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1125
Query: 240 -DAVRELWRRKDMADKDLIASLRAAAGL 266
AV E ++ + L++ R A L
Sbjct: 1126 RQAVMEEFKASLYSVLKLLSKSRQACAL 1153
>gi|307199377|gb|EFN80002.1| Supporter of activation of yellow protein [Harpegnathos saltator]
Length = 1532
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 79/193 (40%), Gaps = 23/193 (11%)
Query: 32 NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
+ + C++C N+ + ++ C +C H +C+ +W+C C+
Sbjct: 1306 DVKLKCKMCLKVLNKH-NKTEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKT 1364
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
C C D +K +FC CD YH YC + V G + C + C +CGS PG
Sbjct: 1365 CAQCHDPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREPGGAN 1422
Query: 152 SVR-----WFLGY------------TCCDACGRLFVKGNYCPVCLKVY---RDSESTPMV 191
S R W Y T C C +L+ KG YCP C + + R +V
Sbjct: 1423 SDRNSVAQWQHEYKKGEKNTRVYVSTLCIPCSKLWRKGRYCPHCSRCHTAQRLDLEPNLV 1482
Query: 192 CCDVCQRWVHCQC 204
C C +++H C
Sbjct: 1483 HCSACDKYLHLDC 1495
>gi|383863051|ref|XP_003706996.1| PREDICTED: uncharacterized protein LOC100875915 [Megachile rotundata]
Length = 3343
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 3123 CKMCLKTLNKHG-KVEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 3181
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
D +K +FC CD YH YC + V G + C + C +C S PG S R
Sbjct: 3182 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCASKEPGGINSDRNS 3239
Query: 155 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 195
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 3240 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 3298
Query: 196 CQRWVHCQC 204
C +++H C
Sbjct: 3299 CDKYLHLDC 3307
>gi|390178053|ref|XP_003736554.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859306|gb|EIM52627.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 3474
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 112/261 (42%), Gaps = 54/261 (20%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 82
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 886 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLETSLS 939
Query: 83 -------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
+W CP C +C C + + C++C YH C + + +
Sbjct: 940 GAHSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGAD 997
Query: 130 -PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE- 186
P +C KC SC + V F+G C C +L KGNYCP+C K Y D++
Sbjct: 998 RPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNYCPICQKCYDDNDF 1051
Query: 187 STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVREL 245
M+ C C +WVH +C+G+SDE+Y L + ++++ C C RD +
Sbjct: 1052 DLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKRCAR-----RDSSRLKADE 1106
Query: 246 WRRKDMADKDLIASLRAAAGL 266
WR+ M ++ ASL + L
Sbjct: 1107 WRQAVM--EEFKASLYSVLKL 1125
>gi|307177781|gb|EFN66778.1| Supporter of activation of yellow protein [Camponotus floridanus]
Length = 3066
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 79/189 (41%), Gaps = 25/189 (13%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 2845 CKMCLKVLNKH-NKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2903
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
D +K +FC CD YH YC + V G + C + C +CGS G S R
Sbjct: 2904 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREAGGANSDRNS 2961
Query: 155 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 195
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 2962 VAQWQHEYKKGEKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAQRLDLEAN-LVHCSA 3020
Query: 196 CQRWVHCQC 204
C +++H +C
Sbjct: 3021 CDKYLHLEC 3029
>gi|332019339|gb|EGI59845.1| PHD finger protein 10 [Acromyrmex echinatior]
Length = 1472
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 82/194 (42%), Gaps = 25/194 (12%)
Query: 32 NTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
+T + C++C N+ + ++ C +C H +C+ +W+C C+
Sbjct: 1247 DTKLKCKMCLKVLNKH-NKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKT 1305
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
C C D +K +FC CD YH YC + V G + C + C +CGS G
Sbjct: 1306 CVQCHDPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREAGGAN 1363
Query: 152 SVR-----WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPM 190
S R W Y T C C +L+ KG YCP C + + D E+ +
Sbjct: 1364 SDRNSVAQWQHEYKKGEKNTRIYVSTLCVPCSKLWRKGRYCPHCSRCHTAQRLDLEAN-L 1422
Query: 191 VCCDVCQRWVHCQC 204
V C C +++H +C
Sbjct: 1423 VHCSACDKYLHLEC 1436
>gi|11072099|gb|AAG26335.2| MLL protein [Homo sapiens]
Length = 380
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 95/200 (47%), Gaps = 15/200 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 97
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 123 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 177
Query: 98 TGDPNKFMF-CRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
K + C +C +YH C +P ++C K +C SCGS PG G
Sbjct: 178 QHRATKQLLECNKCRNSYHLECLGPTYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 237
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 212
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C +S +
Sbjct: 238 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCGNLSGTED 297
Query: 213 LQFQVDGNL----QYRCPTC 228
+++ NL Y C C
Sbjct: 298 EMYEILSNLPESVAYTCVNC 317
>gi|357614029|gb|EHJ68865.1| putative PHD finger protein 10 [Danaus plexippus]
Length = 2413
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 81/192 (42%), Gaps = 23/192 (11%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
R L C +C K H +C++ SW+C C+ C C R D +K +FC CD
Sbjct: 2163 RFLVCSNCNAKLHPSCVELGPDTIRKCREYSWQCAECKTCCACSRPADDDKMLFCDLCDR 2222
Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR--W------------FLG 158
+H YC V +G + C + + C SCGS G W
Sbjct: 2223 GFHIYCVG--LHTVPNGRWHCVECSVCKSCGSRSAGGPGGGPGDWHHQTRRGPGGHKLYS 2280
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSEST-PMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
++ C C R + G YCP+C + + + T +V C +C R +H +C + + +V
Sbjct: 2281 HSLCTPCARAYRIGRYCPLCDRSFIGPKGTMQLVICKLCDRQLHQEC---VKQTTSRLKV 2337
Query: 218 DGNLQYRCPTCR 229
L Y C CR
Sbjct: 2338 ---LDYTCGECR 2346
>gi|405951732|gb|EKC19620.1| Histone-lysine N-methyltransferase MLL4 [Crassostrea gigas]
Length = 4493
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 65/140 (46%), Gaps = 13/140 (9%)
Query: 35 VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
V+C LC G ML C C + +H CL+ + ++ H +W C C+ C++
Sbjct: 944 VICFLC------GSAGKHEMLYCNVCCEPFHEFCLEEEERPHEI-HSDNWCCKKCQSCQV 996
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
C G N + C +C YH C +P + ++C K KC SCG+ PG+GL
Sbjct: 997 C---GRQNNLLQCDKCQNTYHPECLGPNYPTKPSKKKNIWICTKCVKCKSCGATTPGSGL 1053
Query: 152 SVRWFLGYTCCDACGRLFVK 171
+ W + C CG+L K
Sbjct: 1054 AATWMYDFQLCYECGQLMDK 1073
>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
occidentalis]
Length = 2686
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 62/216 (28%), Positives = 90/216 (41%), Gaps = 28/216 (12%)
Query: 5 EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGK 62
ED R G S Q Q + V+C +C V N + C C +
Sbjct: 844 EDITRILNIGLPVIVSN------QLQLSNVVLCFVCASSVKHNPA-------VYCAMCCE 890
Query: 63 KYHRNCLKNWAQNRDL--FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
YH C+ Q RD+ + +W CP C+ C C + N+ + C C YH C
Sbjct: 891 PYHTFCV----QIRDVENINTMNWLCPKCQACFECGKRDKRNQMLKCNSCREVYHTECIP 946
Query: 121 PPHKNVSS---GPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPV 177
P + + S + C K +C SC +N + W C C L +GNYCP+
Sbjct: 947 PVYPSKPSKRKNIWTCIKCIRCKSCETNA---RVMSGWNFDLQLCTECVSLRQRGNYCPL 1003
Query: 178 CLKVY-RDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
C K Y D M+ C C++WVH C+ ++ E+Y
Sbjct: 1004 CEKCYDADDYDINMIECARCRKWVHASCEELTAEEY 1039
>gi|256078227|ref|XP_002575398.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
gi|353230389|emb|CCD76560.1| putative mixed-lineage leukemia protein, mll [Schistosoma mansoni]
Length = 3002
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 10/162 (6%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
++L C SC + +H C++ + R H+ C +C C++CRR P + C RC
Sbjct: 965 QLLFCVSCAEPFHFYCVERQFRPRRKEHFV---CRNCTTCKVCRR---PASDLRCIRCST 1018
Query: 113 AYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 170
YH C PP K G + CP+ + C CG W C AC +
Sbjct: 1019 GYHPECLADFPPAKTSQRGCWTCPECSVCLHCGVRA-CKPAETTWSYETNKCAACSQAES 1077
Query: 171 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
KG+ CP C + Y + + M+ CD CQ W+H C ++ ++Y
Sbjct: 1078 KGDVCPECDRAYLPT-TKQMIQCDSCQLWMHRTCTKLTVDEY 1118
>gi|328785896|ref|XP_003250672.1| PREDICTED: hypothetical protein LOC725681 [Apis mellifera]
Length = 2891
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 2671 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2729
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
D +K +FC CD YH YC + V G + C + C +C S PG S R
Sbjct: 2730 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCSSREPGGINSDRNS 2787
Query: 155 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 195
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 2788 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 2846
Query: 196 CQRWVHCQC 204
C +++H C
Sbjct: 2847 CDKYLHLGC 2855
>gi|380029720|ref|XP_003698514.1| PREDICTED: uncharacterized protein LOC100870597 [Apis florea]
Length = 3312
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 3092 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 3150
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
D +K +FC CD YH YC + V G + C + C +C S PG S R
Sbjct: 3151 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCSSREPGGINSDRNS 3208
Query: 155 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 195
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 3209 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 3267
Query: 196 CQRWVHCQC 204
C +++H C
Sbjct: 3268 CDKYLHLGC 3276
>gi|444515379|gb|ELV10878.1| Histone-lysine N-methyltransferase MLL2 [Tupaia chinensis]
Length = 3975
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 314 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 365
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 366 KPGNDSKMLVCEMCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELDPNSEWF 425
Query: 157 LGYTCCDAC 165
Y+ C C
Sbjct: 426 ENYSLCHRC 434
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 59/141 (41%), Gaps = 17/141 (12%)
Query: 89 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
C +C R + + C +C YH YC + VSSG C C CG+ PG
Sbjct: 1038 CVVCGSFGRGAE-GHLLACSQCSQCYHPYCVN---SKVSSGLKRC---VSCMQCGAASPG 1090
Query: 149 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 208
W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1091 --FHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLF 1142
Query: 209 DEKYLQFQVDGNLQYRCPTCR 229
E ++ D + C +C+
Sbjct: 1143 TEDDVEQAADEG--FDCVSCQ 1161
>gi|124111214|gb|ABM91995.1| MLL [Pan troglodytes]
Length = 390
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 103 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
+ + C +C +YH C P + + ++C K +C SCGS PG G +W +
Sbjct: 1 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 60
Query: 160 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
+ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + +
Sbjct: 61 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 120
Query: 218 DGNLQYRCPTC 228
++ Y C C
Sbjct: 121 PESVAYTCVNC 131
>gi|326431279|gb|EGD76849.1| hypothetical protein PTSG_12693 [Salpingoeca sp. ATCC 50818]
Length = 3557
Score = 82.8 bits (203), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 77/183 (42%), Gaps = 17/183 (9%)
Query: 54 MLSCKSCGKKYHRNCLKN------WAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFM 105
ML C +C +++H C +A N + + W C C C C T M
Sbjct: 783 MLLCMACARRFHPRCAGMQPSSPLFAANEESVR-TCWLCADCTTCSRCDSTASARSRARM 841
Query: 106 FCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDAC 165
CR C +H C P + S Y C +C C N +S W + C C
Sbjct: 842 ACRVCGKEFHPGCAGLPKRPPS---YCCEDCRRCLDCHRT--ANEVS-SWSATFDYCTPC 895
Query: 166 GRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRC 225
+L +GN CPVC YRD ++ MV C+ C WVH +C+G+ D + + Y C
Sbjct: 896 SQLRKRGNVCPVCKVSYRD-DTPDMVLCETCDTWVHAECEGLDDAGLRRLGATDD-PYHC 953
Query: 226 PTC 228
TC
Sbjct: 954 STC 956
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 86/213 (40%), Gaps = 33/213 (15%)
Query: 36 MCRLCF---VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN--RDLFHW--SSWKCPS 88
MC LC VG ++ M+ C +C + +H CL + R +H+ S ++CP
Sbjct: 1038 MCTLCHSAGVGGDDDL-----MVFCDTCCEAFHLGCLLDTVPTTYRHAWHYDFSKFRCPR 1092
Query: 89 CRICEICRRTGDPN------------KFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKH 136
C C+ C R P+ + + C RC H C P H ++ + ++CP+
Sbjct: 1093 CLQCQCCARALAPDLVIGRIINMDTAEHVVCSRCQTICHTECIPPSHPSMDTKHWVCPEC 1152
Query: 137 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDV 195
KC C R+ + T L +CPVC + S + C+
Sbjct: 1153 IKCSMCRRWQAARSSDWRYCMNCT-------LAQDTQHCPVCKDEFEIKLSLQYILQCNA 1205
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
C+ VH CD + + + + N Y CPTC
Sbjct: 1206 CKGLVHDHCDPLIKSRRTRKKYIEN-GYECPTC 1237
>gi|395755470|ref|XP_003779950.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like, partial
[Pongo abelii]
Length = 225
Score = 82.4 bits (202), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 71/147 (48%), Gaps = 10/147 (6%)
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC- 142
W+C C +CE C + DP + + C CD +YH YC PP + V G + C +++C C
Sbjct: 10 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWYSRCVWCR 69
Query: 143 GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHC 202
GL W YT C C L + CPVC + YR+ ++ C C RW+H
Sbjct: 70 HCGATSAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREDL---ILQCRQCDRWMHA 122
Query: 203 QCDGISDEKYLQFQVDGNLQYRCPTCR 229
C ++ E+ ++ D + + C CR
Sbjct: 123 VCQNLNTEEEVENVAD--IGFDCSMCR 147
>gi|403360488|gb|EJY79922.1| Histone-lysine N-methyltransferase [Oxytricha trifallax]
Length = 2438
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
MC LC N LSC CG+ +H CL+ +++ + WKC +C+ CEIC
Sbjct: 1208 MCYLCGSFGN-----GEDFLSCTLCGESFHTYCLQ-LPEDQVSKYQQYWKCLNCKFCEIC 1261
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS--------NVP 147
++C CD A+H +C P K++ + + C + KC CG+ N
Sbjct: 1262 ASATQEAFLLYCDVCDKAFHSFCLKPQLKSIPNCQWKCQECFKCQQCGTKEFFSAKDNEE 1321
Query: 148 GNGLSVRWF---LGYTCCDACGRLFVKGNYCPVCLKVYRDSES 187
L V F ++ C CG+ K ++C +C K DS+S
Sbjct: 1322 KRNLEVTDFEFSQNFSFCYQCGKNEYKKSFCKICQKKTEDSDS 1364
Score = 79.0 bits (193), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 92/221 (41%), Gaps = 32/221 (14%)
Query: 55 LSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
L C +C K YH CLK N N L + W+C C C C + + + C++C+A
Sbjct: 579 LKCSNCSKNYHLQCLKLPNIHSNLGL-RETDWRCQDCIRCTNCLSLRNRDSMLICQKCNA 637
Query: 113 AYHCYC------QHPP----HKNVSSGP-------------YLCPKHTKCHSCGSNVPGN 149
+H C Q P K++ G Y C + +C +CGS G
Sbjct: 638 GFHYDCLDQSVKQGVPSISNEKDLKLGIETRKTTYSQLNQFYKCEQCVECENCGSKEAGE 697
Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS----ESTPMVCCDVCQRWVHCQCD 205
+ +W Y C C + +C VC K + D+ + + C C VH CD
Sbjct: 698 KRNNKWSKDYKLCTTCNKKRSNKEFCLVCEKFWPDTKEKQDQLQTIQCVQCLMCVHQDCD 757
Query: 206 GI-SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVREL 245
I + LQ DG+L+Y C CR ++R + + + +L
Sbjct: 758 RIFKNPNVLQQFSDGSLKYNCQKCRQNV-RLRFISEIIEKL 797
>gi|326436266|gb|EGD81836.1| hypothetical protein PTSG_02551 [Salpingoeca sp. ATCC 50818]
Length = 830
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 79/167 (47%), Gaps = 21/167 (12%)
Query: 76 RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
+D+ ++W +CR+ E C T D +FC CD +H YC ++ G ++C
Sbjct: 658 KDIMTGTAW---NCRVFEKCGSTKDEKDILFCDECDRGFHTYCTG--LTSLPRGRWICSH 712
Query: 136 HTKCHSCGSNVPGNGLSVRW------------FLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ C C P + + +W FL T CDAC + + G++CP+CL+++
Sbjct: 713 CSVCDGCNFK-PDDPSTYKWSHFTDRRDNQRRFL-KTFCDACFKKWNTGDFCPICLELF- 769
Query: 184 DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
S ++ C +C R VH C+ ++ E F+ G ++ C C G
Sbjct: 770 -EPSADLLECSLCSRLVHKDCEDLTPEDEETFKAQGWGRFACSICSG 815
>gi|449679774|ref|XP_002159618.2| PREDICTED: uncharacterized protein LOC100210554 [Hydra
magnipapillata]
Length = 1172
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 89/211 (42%), Gaps = 43/211 (20%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M+ C C + +H CL + + +W C C+ C +C G + C C
Sbjct: 1 MIFCSLCCEPFHTFCL-----DVEPVFKKNWYCDRCKYCTVC---GMKESLLMCDICHDC 52
Query: 114 YHCYCQHPPHKNVSSG----------------------PYLCPKH----TKCHSC----- 142
YH C P ++ + G Y+ + +C C
Sbjct: 53 YHAECLGPFYQCETEGEDEIWFRWNWLQEKDYNDEHYSAYITKINKAGAVQCTICNKDFY 112
Query: 143 -GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR-DSESTPMVCCDVCQRWV 200
GS V N S RW +T C+ C + KGNYCPVCL +Y D ++ M+ C C+ WV
Sbjct: 113 YGSEVTMNK-SSRWMQNFTMCEKCDFNWKKGNYCPVCLVLYNDDDDALKMMFCTKCEAWV 171
Query: 201 HCQCDGISDEKY-LQFQVDGNLQYRCPTCRG 230
H +C+GI+++ Y + + ++ Y C C G
Sbjct: 172 HMECEGITEDDYEILADLPDDVPYLCKLCTG 202
>gi|68070837|ref|XP_677332.1| SET-domain protein [Plasmodium berghei strain ANKA]
gi|56497407|emb|CAH97187.1| SET-domain protein, putative [Plasmodium berghei]
Length = 1439
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 10/159 (6%)
Query: 80 HWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
H+ + C C C C ++T + ++ C+ C+ H C +P ++ +
Sbjct: 969 HYKKYICKECYRCIYCCESIYNYKQTPNVANYVICKSCNMVAHGSCCYPNVPDIYLFNWK 1028
Query: 133 CPKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 191
C KC C SN+ + W L CC C + + K N+C +C + Y+ +S V
Sbjct: 1029 CDDCLKCSKCDYSNLCFINYN-EWELHLDCCINCYKEYEKKNFCIICNEKYKVDDSNKWV 1087
Query: 192 CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
CDVC+ W+H CD D + ++ Y+CPTCR
Sbjct: 1088 ECDVCKFWIHLSCDKDEDRNIETLAI-KHINYKCPTCRS 1125
>gi|330804473|ref|XP_003290219.1| hypothetical protein DICPUDRAFT_15851 [Dictyostelium purpureum]
gi|325079683|gb|EGC33272.1| hypothetical protein DICPUDRAFT_15851 [Dictyostelium purpureum]
Length = 630
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 85/192 (44%), Gaps = 31/192 (16%)
Query: 54 MLSCKSCGKKYHRNCLK------NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFC 107
++ C C KK+H CL + +N L +WKC C+ CE+C+ D +K + C
Sbjct: 416 LIKCSECQKKFHPQCLGLHQTCVDSIRNNTL----AWKCTDCKNCEVCQNDVDESKIIIC 471
Query: 108 RRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN--GLSVRWFLGYTCCDAC 165
CD +H YC +PP + SG + C C C P N +++W YT CD+C
Sbjct: 472 DVCDKGFHTYCLNPPLSSPPSGGWRCSNCVFCTHCYIR-PENSENKNIKWKDNYTSCDSC 530
Query: 166 GRLFVKG-----NYCPVCLKVYRD----SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQ 216
F KG YC +C + +ST C C + VH CD I +
Sbjct: 531 ---FSKGFSDKSKYCSICRHSLKSEEEEEDSTTQ--CTYCGKLVHDDCDSIITDNL---- 581
Query: 217 VDGNLQYRCPTC 228
+ + Y+CP C
Sbjct: 582 ENEHFIYKCPGC 593
>gi|196016261|ref|XP_002117984.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
gi|190579457|gb|EDV19552.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
Length = 183
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 10/146 (6%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+++C CG+ YH C+ + + W+C C +CE C + GD ++ + C CD +
Sbjct: 46 LIACFQCGQSYHHYCVSAKLTRSVIV--NGWRCLDCAVCEGCGKAGDEDRLLLCDECDIS 103
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
YH YC +P V G + C + CH CGSN P G++ +W YT C C V
Sbjct: 104 YHTYCLNPQLDKVPEGEWKCHRCVSCHDCGSNFP--GINCQWTSNYTQCGPCASTSV--- 158
Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRW 199
CP C K Y + + +V C+ C R+
Sbjct: 159 -CPKCNKKYVNDD--IIVQCNNCDRY 181
>gi|313212234|emb|CBY36242.1| unnamed protein product [Oikopleura dioica]
Length = 906
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 75/176 (42%), Gaps = 14/176 (7%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C +CG H CL + W+C +C+IC+ CR++ D + + C CD
Sbjct: 174 LLFCTTCGAHSHARCLNEGIIVTGEVR-AGWQCYTCKICQQCRKSDDDAQMIICETCDKG 232
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 173
+H YC +P +V + C C C GN + + W Y+ C C
Sbjct: 233 WHTYCLNPVMDSVPKDGWSCTNCRNCIEC-----GNKIHLEWQNDYSTCPVCW----SKQ 283
Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
C VC K Y++ + ++ C C RW H + + E + N ++C CR
Sbjct: 284 ACSVCNKDYKEDDV--LLKCSECLRWQHALHEQVYSESDADSMAEQN--FKCKLCR 335
>gi|149032108|gb|EDL87020.1| rCG50635 [Rattus norvegicus]
Length = 609
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQACR 280
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340
Query: 157 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
Y+ C C + V+G+ V +E P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369
>gi|332839578|ref|XP_003313789.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Pan
troglodytes]
Length = 429
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 63/137 (45%), Gaps = 10/137 (7%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+ C SCG YH CL R + W+CP C++C+ CR+ G+ +K + C CD
Sbjct: 293 LFFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQACRKPGNDSKMLVCETCDKG 349
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL----- 168
YH +C PP + + + + C C +CG+ + WF Y+ C C +
Sbjct: 350 YHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWFENYSLCHRCHKAQGGQP 409
Query: 169 --FVKGNYCPVCLKVYR 183
V + PVC + +R
Sbjct: 410 IRSVAEQHTPVCSRAWR 426
>gi|82596471|ref|XP_726275.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23481616|gb|EAA17840.1| Bromodomain, putative [Plasmodium yoelii yoelii]
Length = 4805
Score = 77.8 bits (190), Expect = 2e-11, Method: Composition-based stats.
Identities = 44/158 (27%), Positives = 71/158 (44%), Gaps = 10/158 (6%)
Query: 80 HWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
H+ + C C C C ++T + ++ C+ C+ H C P ++ +
Sbjct: 1337 HYKKYICKECYRCIYCCESIYNYKQTPNVANYVICKSCNMVAHGSCCFPNVPDIYLFNWK 1396
Query: 133 CPKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 191
C KC C SN+ + W L CC C + + K N+C +C + Y+ +S V
Sbjct: 1397 CDDCLKCSKCDYSNLCFINYN-EWELHLDCCINCYKEYEKKNFCIICNEKYKVDDSNKWV 1455
Query: 192 CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CDVC+ W+H CD D + ++ Y+CPTCR
Sbjct: 1456 ECDVCKFWIHLSCDKDEDRNIETLAIK-HINYKCPTCR 1492
>gi|148672214|gb|EDL04161.1| mCG145001 [Mus musculus]
Length = 630
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 61/126 (48%), Gaps = 5/126 (3%)
Query: 40 CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 99
C V E G + +L C SCG YH CL R + W+CP C++C+ CR+ G
Sbjct: 250 CAVCEGPG--QLCDLLFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQSCRKPG 304
Query: 100 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
+ +K + C CD YH +C PP +++ + + C C +CG+ + WF Y
Sbjct: 305 NDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWFENY 364
Query: 160 TCCDAC 165
+ C C
Sbjct: 365 SLCHRC 370
>gi|388854432|emb|CCF52016.1| related to histone acetyltransferase 3 (myst) [Ustilago hordei]
Length = 1215
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 7/104 (6%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN----RDLFHWSSWKCPSCR 90
+C C E+ + + ++SC CG H +CLK W +N R + W+C C+
Sbjct: 83 VCAFCLRTAEHPKGDTPKLLVSCHECGSSGHPSCLK-WGRNPTKVRQALSYD-WRCIECK 140
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
CE+CR GD + MFC +CD +H YC PP G + CP
Sbjct: 141 KCEVCRDKGDDAQLMFCDKCDRGWHLYCLSPPLSKPPKGQWHCP 184
>gi|328716146|ref|XP_003245847.1| PREDICTED: hypothetical protein LOC100162709 isoform 2 [Acyrthosiphon
pisum]
Length = 1426
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 62/148 (41%), Gaps = 4/148 (2%)
Query: 37 CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C+LC + ++ C C YH CL + +W+C C+ C C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 155
+ D +K +FC CD YH YC + V G + C + C SCG + PG G S +W
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPGPGDS-KW 1398
Query: 156 FLGYTCCDACGRLFVKGNYCPVCLKVYR 183
F + + G C C ++++
Sbjct: 1399 FYEFKKTEKTGSKVYCRTLCAPCSRMHQ 1426
>gi|156374109|ref|XP_001629651.1| predicted protein [Nematostella vectensis]
gi|156216656|gb|EDO37588.1| predicted protein [Nematostella vectensis]
Length = 251
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 78/177 (44%), Gaps = 12/177 (6%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
+++ C CG+ +H C+ + + W+C C +CE C + D + + C CD
Sbjct: 15 QLIVCSQCGQCFHPYCVG--VKVNKMILSKGWRCLDCTLCEGCGKGSDEARLLLCDSCDI 72
Query: 113 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 172
+YH YC PP + V G + C C CG+ G W YT C C
Sbjct: 73 SYHTYCLDPPLEKVPPGGWKCKWCVSCDDCGATSAGT--QCEWQSNYTQCGPC----ASK 126
Query: 173 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CPVC Y ++ M+ C C RW+H CDG+ E+ + D Y+C CR
Sbjct: 127 TSCPVCNIKYNLNDL--MIQCLHCDRWLHGSCDGLMTEEEVDRAAD--YGYQCLYCR 179
>gi|74150118|dbj|BAE24369.1| unnamed protein product [Mus musculus]
Length = 742
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 58/121 (47%), Gaps = 5/121 (4%)
Query: 31 SNTN---VMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
SNT+ +M +C V + G +L+C C + YH C+ + L W+C
Sbjct: 622 SNTDKFVLMQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCV 679
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C +CE+C + DP++ + C CD +YH YC PP V G + C C CG+ P
Sbjct: 680 ECIVCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP 739
Query: 148 G 148
G
Sbjct: 740 G 740
>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
Length = 4664
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 76/196 (38%), Gaps = 59/196 (30%)
Query: 36 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 735 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 787
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
C + DP + + C CD +YH YC PP + V G
Sbjct: 788 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKG------------------------ 823
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 213
G+ C K YR+ + ++ C C RW+H C +S E+ +
Sbjct: 824 ----GWKC------------------KCYREEDL--ILQCRQCDRWMHAVCQNLSTEEEV 859
Query: 214 QFQVDGNLQYRCPTCR 229
+ D + + C CR
Sbjct: 860 ENVAD--IGFDCSMCR 873
>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 6753
Score = 75.5 bits (184), Expect = 1e-10, Method: Composition-based stats.
Identities = 46/178 (25%), Positives = 78/178 (43%), Gaps = 8/178 (4%)
Query: 58 KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRC 110
KS KKY R + + + C C C C ++T + ++ C+ C
Sbjct: 1640 KSKNKKYRRCINYIPSVEHSDITYKKFICKDCYRCIYCCESIYDYKQTPNVANYVICKNC 1699
Query: 111 DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 170
+ H C P ++ + C KC+ C + W L CC C + +
Sbjct: 1700 NMVAHGSCCFPNVPDIYLFNWKCDDCLKCNKCNYSNLCYINYNEWELHLDCCINCYKEYE 1759
Query: 171 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
K N+C +C + Y + +S V CDVC+ W+H CD ++ + ++ + N+ Y+CPTC
Sbjct: 1760 KKNFCIMCNEKYDEDDSKKWVQCDVCKFWIHLSCDK-NESRNIETLSNKNIDYKCPTC 1816
>gi|358252877|dbj|GAA50314.1| histone-lysine N-methyltransferase MLL4 [Clonorchis sinensis]
Length = 1769
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 77/180 (42%), Gaps = 27/180 (15%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
++L C SC + +H C++ + R H+ CR C C+ P + C RC A
Sbjct: 497 QLLFCVSCAEPFHFYCVERQFRPRRKDHFI------CRNCTECKECHSPAADLRCIRCSA 550
Query: 113 AYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGS--------------NVPGNGLSVRWF 156
YH C + P ++V G ++CP C CGS P G R
Sbjct: 551 GYHPSCLSDYAPAQSVHRGNWVCPHCASCVHCGSKPLHKRQEAGTETGKPPNTGGCSRST 610
Query: 157 LGYTC----CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 212
++ C AC +G+ CP C + Y + + M+ CD CQ W+H C ++ ++Y
Sbjct: 611 TAWSSEPNKCAACCSAEARGDICPECDRAYLPT-TKQMIQCDTCQLWMHRTCTKLTADEY 669
>gi|319411664|emb|CBQ73708.1| related to histone acetyltransferase 3 (myst) [Sporisorium
reilianum SRZ2]
Length = 1223
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 9/105 (8%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN-----RDLFHWSSWKCPSC 89
+C C + + + ++SC CG H +CL+ W +N + L + W+C C
Sbjct: 87 LCAFCLQTADRPKGDTPKLLISCYECGSSGHPSCLR-WGRNPTKVGKALSY--DWRCIEC 143
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
+ CEICR GD + MFC RCD +H YC PP G + CP
Sbjct: 144 KKCEICRDKGDDAQLMFCDRCDRGWHLYCLSPPLLKPPKGQWHCP 188
>gi|221057732|ref|XP_002261374.1| SET-domain protein [Plasmodium knowlesi strain H]
gi|194247379|emb|CAQ40779.1| SET-domain protein, putative [Plasmodium knowlesi strain H]
Length = 6442
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 81 WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ + C C C C ++T + ++ C+ C+ H C P ++ + C
Sbjct: 1582 YKKYVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1641
Query: 134 PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
+ KC+ C SN+ + W CC C + + K N+C +C + Y +S V
Sbjct: 1642 DECLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1700
Query: 193 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CDVC+ W+H CD + + + Y+CPTCR
Sbjct: 1701 CDVCKFWIHLSCDKNENRNIETLSIKS-INYKCPTCR 1736
>gi|443696184|gb|ELT96955.1| hypothetical protein CAPTEDRAFT_106026 [Capitella teleta]
Length = 319
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 48/95 (50%), Gaps = 3/95 (3%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
LC V + GC L C SCG+ YH NCL Q + S W+CP C+IC+ CR+
Sbjct: 219 LCIVCDVPGC--ISDQLFCTSCGQHYHGNCLDPPVQVNPVVR-SGWQCPECKICQTCRQP 275
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
GD NK + C CD YH +C P + + C
Sbjct: 276 GDDNKMLVCDTCDKGYHIFCLRPVMTTIPKNGWKC 310
>gi|20977848|gb|AAM33377.1|AF492830_1 CDK6/MLL fusion protein [Homo sapiens]
Length = 182
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 103 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 159
+ + C +C +YH C P + + ++C K +C SCGS PG G +W +
Sbjct: 49 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 108
Query: 160 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 217
+ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + +
Sbjct: 109 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 168
Query: 218 DGNLQYRCPTC 228
++ Y C C
Sbjct: 169 PESVAYTCVNC 179
>gi|240952194|ref|XP_002399348.1| zinc finger protein, putative [Ixodes scapularis]
gi|215490554|gb|EEC00197.1| zinc finger protein, putative [Ixodes scapularis]
Length = 379
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 60/125 (48%), Gaps = 5/125 (4%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
+C++C + + ++SC CGK H CL + W+C C++C IC
Sbjct: 232 VCKVC--NNVDASKEGEELISCSECGKVGHVTCLDILPEMAVAIKSYRWQCMECKMCNIC 289
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG-NGLSVR 154
T + K MFC RCD YH +C K+V +G ++C +C +CG PG G V+
Sbjct: 290 MATDNEEKMMFCDRCDRGYHSFCVG--MKSVPAGRWICRLCGRCATCGVASPGPEGPRVQ 347
Query: 155 WFLGY 159
W Y
Sbjct: 348 WHHEY 352
>gi|389584530|dbj|GAB67262.1| SET domain containing protein [Plasmodium cynomolgi strain B]
Length = 5788
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 73/162 (45%), Gaps = 10/162 (6%)
Query: 81 WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ + C C C C ++T + ++ C+ C+ H C P ++ + C
Sbjct: 1113 YKKYVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1172
Query: 134 PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
KC+ C SN+ + W CC C + + K N+C +C + Y +S V
Sbjct: 1173 DDCLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1231
Query: 193 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQ 234
CDVC+ W+H CD ++ + ++ ++ Y+CPTCR +
Sbjct: 1232 CDVCKFWIHLSCDK-NESRNIETLSIKSINYKCPTCRSGSFH 1272
>gi|443696185|gb|ELT96956.1| hypothetical protein CAPTEDRAFT_106029, partial [Capitella teleta]
Length = 175
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 70/148 (47%), Gaps = 8/148 (5%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
LC + G R++ C CG+ YH C N +R + W+C C +CE C R
Sbjct: 26 LCVSCGSLGANEESRLIVCSQCGQCYHPYC-ANVKLSRIILE-KGWRCLDCTVCEGCGRP 83
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 158
D ++ + C CD +YH YC PP ++V G + C C +CG+ PG + W
Sbjct: 84 HDESRLILCDECDISYHIYCLDPPLESVPRGTWKCKWCAICVTCGTTAPGTNCA--WQNN 141
Query: 159 YTCCDACGRLFVKGNYCPVCLKVYRDSE 186
Y+ C C + CP+C + Y++ +
Sbjct: 142 YSQCGPCYSTVM----CPLCYRSYKEDD 165
>gi|320163082|gb|EFW39981.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 660
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 74/190 (38%), Gaps = 23/190 (12%)
Query: 56 SCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYH 115
+C SC H +C + W+C +C+ C +C T + + + C CD YH
Sbjct: 196 ACASCNNGGHVSCFHMTKLMGETVQTYGWECSNCKSCAMCNSTENETEMLLCDVCDRGYH 255
Query: 116 CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL-------SVRWFLGYTC------- 161
C K + G ++C C +C S + G L V W Y
Sbjct: 256 IQCID--LKTMPLGRWVCSLCNACTNCHSKLAGAILPRKEMDRPVDWLTFYATDVQSMKF 313
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C A + + YC +CLKV E + CD C R H CD L + G
Sbjct: 314 CSA-RSSWSQHEYCAICLKVNHPGEKLRLYKCDGCLRQTHPSCDK------LYKRPRGRT 366
Query: 222 QYRCPTCRGE 231
+Y CP CRG+
Sbjct: 367 RYFCPICRGD 376
>gi|156101223|ref|XP_001616305.1| SET domain containing protein [Plasmodium vivax Sal-1]
gi|148805179|gb|EDL46578.1| SET domain containing protein [Plasmodium vivax]
Length = 6587
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 69/162 (42%), Gaps = 10/162 (6%)
Query: 81 WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ + C C C C ++T + ++ C+ C+ H C P ++ + C
Sbjct: 1629 YKKFVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1688
Query: 134 PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
KC+ C SN+ + W CC C + + K N+C +C + Y +S V
Sbjct: 1689 DDCLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1747
Query: 193 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQ 234
CDVC+ W+H CD + + + Y+CPTCR +
Sbjct: 1748 CDVCKFWIHLSCDKNENRNIETLSIKS-INYKCPTCRSGSFH 1788
>gi|428178234|gb|EKX47110.1| hypothetical protein GUITHDRAFT_137718 [Guillardia theta CCMP2712]
Length = 1125
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 46/94 (48%), Gaps = 1/94 (1%)
Query: 137 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDV 195
KC+SCG+ PG W YT C CG L K YC VC KV ++ E+ + C V
Sbjct: 715 VKCNSCGAKTPGKKPMDAWKQAYTQCSRCGELHAKKRYCSVCDKVLKEIETEDQILKCSV 774
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
C WVH +CD + + + + + Y CP R
Sbjct: 775 CDLWVHGRCDDLDELSLVGMNLLSSEHYVCPKHR 808
>gi|340719315|ref|XP_003398100.1| PREDICTED: hypothetical protein LOC100644567 [Bombus terrestris]
Length = 2857
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 63/153 (41%), Gaps = 9/153 (5%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 96
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 2697 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2755
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 154
D +K +FC CD YH YC + V G + C + C +CGS PG S R
Sbjct: 2756 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCVNCGSREPGGINSDRNS 2813
Query: 155 ---WFLGYTCCDACGRLFVKGNYCPVCLKVYRD 184
W Y D R++V C C K+ D
Sbjct: 2814 VAQWQHEYKKGDKNTRVYV-STLCVPCSKLRGD 2845
>gi|449670407|ref|XP_004207258.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Hydra magnipapillata]
Length = 491
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 42/81 (51%), Gaps = 1/81 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
+L CK CG K H +CL A+ + SW+C C+ C IC TGDP+ +FC CD
Sbjct: 209 LLICKDCGNKAHPSCLSYSAELVEQIRSDGSWQCIDCKACIICEGTGDPDTLLFCDACDK 268
Query: 113 AYHCYCQHPPHKNVSSGPYLC 133
YH C P + SG + C
Sbjct: 269 GYHMNCHEPKLTQMPSGKWAC 289
>gi|390350878|ref|XP_788653.3| PREDICTED: zinc finger protein DPF3-like [Strongylocentrotus
purpuratus]
Length = 418
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 57/126 (45%), Gaps = 8/126 (6%)
Query: 16 VATASKDIAGEEQ------GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRN 67
V +AS+D GE G TN C C EN+ + ++SC CG+ H
Sbjct: 288 VTSASEDSLGEVSKSSKGGGGPETNNYCDFCLGDATENKKTQTPEDLISCSDCGRSGHPT 347
Query: 68 CLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVS 127
CL+ W+C C+ C +C + + ++ +FC CD YH YC +PP +
Sbjct: 348 CLQFTDTMIQKVKGYRWQCIECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLNPPMQAPP 407
Query: 128 SGPYLC 133
G ++C
Sbjct: 408 EGSWIC 413
>gi|149056303|gb|EDM07734.1| rCG53696 [Rattus norvegicus]
Length = 169
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 69/137 (50%), Gaps = 6/137 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 112
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 29 LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 87
Query: 113 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 169
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 88 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 146
Query: 170 VKGNYCPVCLKVYRDSE 186
KGNYCP+C + Y D++
Sbjct: 147 EKGNYCPICTRCYEDND 163
>gi|355702665|gb|AES02007.1| myeloid/lymphoid or mixed-lineage leukemia [Mustela putorius furo]
Length = 167
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 82/161 (50%), Gaps = 8/161 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRCD 111
+ C+ C + +H+ CL+ R L +W C C+ C +C R K + C +C
Sbjct: 9 FVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKCR 66
Query: 112 AAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 168
+YH C P + + ++C K +C SCGS PG G +W ++ C C +L
Sbjct: 67 NSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKL 126
Query: 169 FVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 208
F KGN+CP+C K Y D + + M+ C C RWVH +C+ +S
Sbjct: 127 FAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLS 167
>gi|321261507|ref|XP_003195473.1| histone acetyltransferase [Cryptococcus gattii WM276]
gi|317461946|gb|ADV23686.1| Histone acetyltransferase, putative [Cryptococcus gattii WM276]
Length = 947
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
M+SC +CG+ H CL R W C C++CE C GD ++ MFC CD
Sbjct: 41 MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKMCEQCEIKGDDSRLMFCDTCDR 100
Query: 113 AYHCYCQHPPHKNVSSGPYLCPK 135
+H YC +PP G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123
>gi|156374107|ref|XP_001629650.1| predicted protein [Nematostella vectensis]
gi|156216655|gb|EDO37587.1| predicted protein [Nematostella vectensis]
Length = 265
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 1/80 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
ML C SCG+ YH CL + L W+CP C++C+ CR+ GD NK + C CD
Sbjct: 187 MLFCTSCGRHYHGRCLDPAVEITSLVR-MGWQCPDCKVCQGCRQPGDDNKMLVCDVCDRG 245
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
YH +C PP + + C
Sbjct: 246 YHTFCLDPPMTTIPKTGWKC 265
>gi|405122036|gb|AFR96804.1| Myst4 protein [Cryptococcus neoformans var. grubii H99]
Length = 943
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
M+SC +CG+ H CL R W C C+ CE C GD ++ MFC CD
Sbjct: 41 MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCEIKGDDSRLMFCDTCDR 100
Query: 113 AYHCYCQHPPHKNVSSGPYLCPK 135
+H YC +PP G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123
>gi|328716148|ref|XP_003245848.1| PREDICTED: hypothetical protein LOC100162709 isoform 3 [Acyrthosiphon
pisum]
Length = 1397
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 48/113 (42%), Gaps = 3/113 (2%)
Query: 37 CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C+LC + ++ C C YH CL + +W+C C+ C C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
+ D +K +FC CD YH YC + V G + C + C SCG + PG
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPG 1392
>gi|58269200|ref|XP_571756.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57227992|gb|AAW44449.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 940
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
M+SC +CG+ H CL R W C C+ CE C GD ++ MFC CD
Sbjct: 41 MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCAIKGDDSRLMFCDTCDR 100
Query: 113 AYHCYCQHPPHKNVSSGPYLCPK 135
+H YC +PP G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123
>gi|134114447|ref|XP_774152.1| hypothetical protein CNBG4520 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256785|gb|EAL19505.1| hypothetical protein CNBG4520 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 940
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
M+SC +CG+ H CL R W C C+ CE C GD ++ MFC CD
Sbjct: 41 MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCAIKGDDSRLMFCDTCDR 100
Query: 113 AYHCYCQHPPHKNVSSGPYLCPK 135
+H YC +PP G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123
>gi|71018437|ref|XP_759449.1| hypothetical protein UM03302.1 [Ustilago maydis 521]
gi|46099056|gb|EAK84289.1| hypothetical protein UM03302.1 [Ustilago maydis 521]
Length = 1283
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 4/84 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWS---SWKCPSCRICEICRRTGDPNKFMFCRRC 110
++SC CG H +CLK W + H + +W+C C+ CE+C GD + MFC RC
Sbjct: 215 LISCYECGSSGHPSCLK-WGRKSTKVHKALSYNWRCIECKKCEVCDDKGDDAQLMFCDRC 273
Query: 111 DAAYHCYCQHPPHKNVSSGPYLCP 134
D +H YC P G + CP
Sbjct: 274 DRGWHLYCLTPALSKPPKGQWHCP 297
>gi|308801407|ref|XP_003078017.1| trithorax-like (ISS) [Ostreococcus tauri]
gi|116056468|emb|CAL52757.1| trithorax-like (ISS) [Ostreococcus tauri]
Length = 2007
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 53/93 (56%), Gaps = 6/93 (6%)
Query: 139 CHSCGSNVPGN-GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
C SCG V G+ GL + G C C +L+ +G +CPVC +V++ + PMV CD C
Sbjct: 180 CGSCG--VTGDEGLKSKNAQGR--CSLCQKLYKEGQFCPVCDRVWQWATGDPMVGCDRCD 235
Query: 198 RWVHCQCDGISDEKYLQFQVDG-NLQYRCPTCR 229
W+H +CD ++ E + + DG L Y CP CR
Sbjct: 236 MWIHRECDALAAEVLDREENDGEELAYECPKCR 268
>gi|299749795|ref|XP_002911422.1| histone acetyltransferase mst2 [Coprinopsis cinerea okayama7#130]
gi|298408603|gb|EFI27928.1| histone acetyltransferase mst2 [Coprinopsis cinerea okayama7#130]
Length = 2272
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)
Query: 45 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 104
N E+ +M C CG+ H C++ A D+ W+C C+ICE+C R GD +F
Sbjct: 649 NVRTEQPEQMTHCIECGRSGHPTCMQ-LAHIGDVIRSYPWRCIECKICEVCSRKGDDVRF 707
Query: 105 --------------------MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
MFC CD +H YC +PP G + CP+ C
Sbjct: 708 VQLDLLGQTTDNPLLFQEKMMFCDSCDRGWHMYCLNPPMDETPPGKWSCPQ------CSP 761
Query: 145 NVPGNGLSV 153
P G+ +
Sbjct: 762 LFPEQGIPM 770
>gi|47225576|emb|CAG12059.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1216
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 2/83 (2%)
Query: 132 LCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPM 190
+C +C SCG PG + W C C +LF GNYCP+C K Y D++ + M
Sbjct: 1035 VCMTCIRCKSCGV-TPGKSWDIEWNHEKGLCQDCSKLFEMGNYCPICFKCYEDNDYDSQM 1093
Query: 191 VCCDVCQRWVHCQCDGISDEKYL 213
+ C C WVH +C+ +++ ++
Sbjct: 1094 MQCGTCNHWVHAKCEDLTEGSHV 1116
>gi|390332559|ref|XP_003723528.1| PREDICTED: uncharacterized protein LOC580929 isoform 1
[Strongylocentrotus purpuratus]
Length = 3300
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 35 VMCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
++CR C G E C R + +LSC CG H +CLK + ++ S W+C C+
Sbjct: 208 LVCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKT 265
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGN 149
C IC + G + + C C+ YH C PP K + G + C P T K G PG
Sbjct: 266 CAICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGR 324
Query: 150 GLSVRWFLG 158
+ R F G
Sbjct: 325 PGTNRLFRG 333
>gi|390332561|ref|XP_003723529.1| PREDICTED: uncharacterized protein LOC580929 isoform 2
[Strongylocentrotus purpuratus]
Length = 3278
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 35 VMCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
++CR C G E C R + +LSC CG H +CLK + ++ S W+C C+
Sbjct: 208 LVCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKT 265
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGN 149
C IC + G + + C C+ YH C PP K + G + C P T K G PG
Sbjct: 266 CAICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGR 324
Query: 150 GLSVRWFLG 158
+ R F G
Sbjct: 325 PGTNRLFRG 333
>gi|118394814|ref|XP_001029767.1| SET domain containing protein [Tetrahymena thermophila]
gi|89284034|gb|EAR82104.1| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 2437
Score = 67.8 bits (164), Expect = 2e-08, Method: Composition-based stats.
Identities = 49/179 (27%), Positives = 75/179 (41%), Gaps = 19/179 (10%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR-RTGDPNKFMFCRRCDAAYH 115
C CG YH C+ NR+ + C +C C IC + D N + C C +H
Sbjct: 603 CFYCGAYYHTQCILQEEDNRN----QHFACDTCEPCSICHGKITDDN--ITCCECKTHFH 656
Query: 116 CYCQHPPHKNVSSGP-----YLCPKHTKCHSCGSNVPG--NGLSVRWFLGYTCCDACGRL 168
C ++++ + C +C C + + NG S + Y C+ C
Sbjct: 657 KKCGFSIAYDMNTSDKQIMRWYCESCVQCCICNNKLSDFQNGYSFKDDQIY--CNDCNEQ 714
Query: 169 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI--SDEKYLQFQVDGNLQYRC 225
K YCP+C K + + MV C C W+H CD I D+ Y +++ + QYRC
Sbjct: 715 LQKKEYCPICKKFWSQETNKDMVQC-TCAMWIHRACDPILKDDKLYDEYKNNLRQQYRC 772
Score = 56.6 bits (135), Expect = 5e-05, Method: Composition-based stats.
Identities = 48/230 (20%), Positives = 86/230 (37%), Gaps = 65/230 (28%)
Query: 52 RRMLSCKSCGKKYHRNCL-------------KNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+ +L C SC + YH CL + NR+ W CP C++C++C +
Sbjct: 1131 KSLLFCTSCFESYHPYCLMIPGRQEYFKEKMERAMNNRE------WNCPKCQVCKVCSKG 1184
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPP----HKNVSSGPYLCPKHTKCHSCGSNVPGNG---- 150
+ K +FCR+CDA H C+ +++ + + C C C S +
Sbjct: 1185 PNITKNLFCRKCDAMVHFECEFKDVQVWNESKNELYWQCSDCFNCAKCSSKSLIDESDKQ 1244
Query: 151 --LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTP------------------- 189
+++ + ++ C CG L +C C K + + P
Sbjct: 1245 LMINLDFTDNFSLCYKCGFLDAYYRFCKFCNKYCKKIPTLPKPANQQPLDDLFQEKSVRL 1304
Query: 190 ----------MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
+ C +CQ+ H +C + +Y+Q Q+ C TC+
Sbjct: 1305 KYLDQYLEEVLYQCKLCQQVYHKKCFEKAYPEYIQ-------QFFCYTCK 1347
>gi|409078326|gb|EKM78689.1| hypothetical protein AGABI1DRAFT_107193 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1494
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 1/81 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M++C CG+ H +C++ ++ D+ WKC C+ CE+C GD + +FC CD
Sbjct: 76 MVTCSECGRSGHPSCME-LSKIGDMIRTYPWKCIECKNCELCGDKGDDERILFCDGCDRG 134
Query: 114 YHCYCQHPPHKNVSSGPYLCP 134
+H C PP + G + CP
Sbjct: 135 WHFDCMQPPINELPEGEWYCP 155
>gi|340376023|ref|XP_003386533.1| PREDICTED: zinc finger protein ubi-d4-like [Amphimedon
queenslandica]
Length = 402
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 49/102 (48%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C + N+ + RM+SC CG+ H +CL+ + W+C C+
Sbjct: 298 NPYCDFCLGDDTLNQKSGKPERMVSCADCGRSGHPSCLQFSPSLAAVVLTYRWQCIECKS 357
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C ++ + ++ +FC CD YH YC PP K G + C
Sbjct: 358 CSLCGKSDNDDQLLFCDDCDRGYHMYCLKPPMKEAPEGSWSC 399
>gi|196002948|ref|XP_002111341.1| hypothetical protein TRIADDRAFT_55230 [Trichoplax adhaerens]
gi|190585240|gb|EDV25308.1| hypothetical protein TRIADDRAFT_55230 [Trichoplax adhaerens]
Length = 879
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+LSC CG H +CLK + W+C C+ C C+ G+P+ +FC CD
Sbjct: 207 LLSCVDCGNSGHPSCLKYSPELTSRVKTEPWQCIECKTCSYCQNAGNPDNLLFCDACDKG 266
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
+H C PP + SG ++C
Sbjct: 267 FHMECLSPPLTGMPSGRWVC 286
>gi|390332557|ref|XP_786050.3| PREDICTED: uncharacterized protein LOC580929 isoform 3
[Strongylocentrotus purpuratus]
Length = 2843
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 35 VMCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
++CR C G E C R + +LSC CG H +CLK + ++ S W+C C+
Sbjct: 208 LVCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKT 265
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGN 149
C IC + G + + C C+ YH C PP K + G + C P T K G PG
Sbjct: 266 CAICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGR 324
Query: 150 GLSVRWFLG 158
+ R F G
Sbjct: 325 PGTNRLFRG 333
>gi|332030886|gb|EGI70522.1| Zinc finger protein ubi-d4 [Acromyrmex echinatior]
Length = 527
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 2/118 (1%)
Query: 18 TASKDIAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
+A K I G + + + C C EN+ + ++SC CG+ H CL+ A
Sbjct: 399 SAEKKIGGGKSSAAQPSPYCDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANM 458
Query: 76 RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD YH YC PP + G + C
Sbjct: 459 IVSVRKYRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516
>gi|299115811|emb|CBN74374.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 3157
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 74/195 (37%), Gaps = 47/195 (24%)
Query: 53 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWS----SWKCPSCRICEIC--------RRTGD 100
+ L C +C YH CL R++ + W+C C+ C+ C RR GD
Sbjct: 1085 QTLVCSTCLMHYHPGCLDPPMTPREIASHAHSKEEWRCDYCQTCQGCGKGNEDEMRRGGD 1144
Query: 101 PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 160
P + C +H + P + GP K W
Sbjct: 1145 P---LVCHERGKVHHVHLDAP--EGCDPGPRAQEKI------------------WL---- 1177
Query: 161 CCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKYLQFQVDG 219
C C + + NYCP C Y + ++ + C C+ WVH C+G+S +Y Q VDG
Sbjct: 1178 -CSPCLDKYKERNYCPKCGVTYDEHDNMVQAIGCAACEFWVHASCEGLSTAEY-QMLVDG 1235
Query: 220 -----NLQYRCPTCR 229
+Y CP CR
Sbjct: 1236 KDSWFGAEYLCPVCR 1250
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 91 ICEICRRTG--DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 142
+CE+C T D + + C CD AYH YC P G ++C + C +C
Sbjct: 1411 VCEVCTETAKSDESLLLMCELCDRAYHTYCLTPSTDKPPEGTWICGQCISCTTC 1464
>gi|340381940|ref|XP_003389479.1| PREDICTED: hypothetical protein LOC100636799 [Amphimedon
queenslandica]
Length = 469
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 78/188 (41%), Gaps = 37/188 (19%)
Query: 107 CRRCDAAYH--CYCQHPPHKNVSSG-PYLCPKHTKCHSCGSNVPGN-------------G 150
C C+ YH C H + + C TKC SC SN+P
Sbjct: 260 CCECNEWYHEDCLLGHTSRPRTDNKRVWKCVTCTKCISCHSNMPNKIDDSGRSLLSSLLA 319
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 209
+W GY C C L KGN CPVC + Y D++ + MV C C WVH C+ ++D
Sbjct: 320 TPTQWSDGYCYCSDCIVLKAKGNSCPVCGECYLDNDFDSKMVQCSQCDNWVHSHCENMTD 379
Query: 210 EKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTE 269
E+Y + DL D+V + R ++ +I+S++ A G
Sbjct: 380 EEY--------------------EILSDLPDSVEYVCRLCIAYNRTMISSIKTALGPIWV 419
Query: 270 DEIFSISP 277
++I ++P
Sbjct: 420 NDINVLTP 427
>gi|196010657|ref|XP_002115193.1| hypothetical protein TRIADDRAFT_28636 [Trichoplax adhaerens]
gi|190582576|gb|EDV22649.1| hypothetical protein TRIADDRAFT_28636 [Trichoplax adhaerens]
Length = 167
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 2/92 (2%)
Query: 33 TNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
+N C C V N+ ++ R++SC CG+ H +CLK D +W+C C+
Sbjct: 66 SNRYCDFCLGDVNLNKKTGQSERLISCADCGRSGHPSCLKFTPSLTDTVLMYAWQCIECK 125
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
C IC + + +K +FC CD YH YC PP
Sbjct: 126 SCSICGTSDNDDKLLFCDDCDRGYHMYCLSPP 157
>gi|209877148|ref|XP_002140016.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209555622|gb|EEA05667.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 2678
Score = 66.2 bits (160), Expect = 6e-08, Method: Composition-based stats.
Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 7/130 (5%)
Query: 104 FMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTC 161
++ C C +H C +P P + C C CG W +T
Sbjct: 697 YVLCNICHVGFHGSCSNPFIPPLIAEKKYFKCSTCCYCSHCGYRDHNFMDYAAWDTTFTS 756
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDG--ISDEKYLQFQVDG 219
C C R F KG YC +C +++ S + CD C+ W+H +CD I+ + L+
Sbjct: 757 CIRCCRGFEKGQYCAICRQIWSSSWDGDWLQCDTCRFWIHTECDTNLINSIEVLK---SP 813
Query: 220 NLQYRCPTCR 229
++ Y CP CR
Sbjct: 814 SVSYHCPVCR 823
>gi|390349864|ref|XP_003727298.1| PREDICTED: uncharacterized protein LOC100893490 [Strongylocentrotus
purpuratus]
Length = 631
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L CK C K H +C+K Q + S W+C C+ C +C GD + +FC CD
Sbjct: 248 LLVCKDCTIKVHPSCMKYSRQLAERSRLSPWQCIDCKTCHVCNDAGDADTLLFCDSCDKG 307
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
YH C +P + G ++C
Sbjct: 308 YHMACHNPKVEEKPLGRWVC 327
>gi|340381804|ref|XP_003389411.1| PREDICTED: hypothetical protein LOC100638610 [Amphimedon
queenslandica]
Length = 2366
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 57/135 (42%), Gaps = 8/135 (5%)
Query: 8 ARRFESG-------YVATASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKS 59
AR E G V A K + ++ N +C C EN ++ ++LSC
Sbjct: 162 ARMVEKGDLEKKGRLVKVAVKILEPFPSPKARPNSICSFCLGTEENNRDKQYEQLLSCHE 221
Query: 60 CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQ 119
CG H +CLK + + W C C+ C C + + + + C CD +H C
Sbjct: 222 CGNSGHPSCLKYSKELVEFITAEPWLCLECKKCIYCNASANADDLLICDACDKGFHMVCL 281
Query: 120 HPPHKNVSSGPYLCP 134
PP ++ G ++CP
Sbjct: 282 DPPISSLPEGRWVCP 296
>gi|356554670|ref|XP_003545667.1| PREDICTED: uncharacterized protein LOC100810450 [Glycine max]
Length = 832
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 40/74 (54%), Gaps = 4/74 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH +CL + Q + H W CPSC IC++C D NK + C CD AYH YC P
Sbjct: 683 KYYHVSCLSS-KQLKSYGH--CWYCPSC-ICQVCLTDKDDNKIVLCDACDHAYHVYCMKP 738
Query: 122 PHKNVSSGPYLCPK 135
P ++ G + C K
Sbjct: 739 PQNSIPKGKWFCIK 752
>gi|426199317|gb|EKV49242.1| hypothetical protein AGABI2DRAFT_177299 [Agaricus bisporus var.
bisporus H97]
Length = 1474
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 1/81 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M++C CG+ H +C++ ++ ++ WKC C+ CE+C GD + +FC CD
Sbjct: 76 MVTCSECGRSGHPSCME-LSKIGEMIRTYPWKCIECKNCELCGDKGDDERILFCDGCDRG 134
Query: 114 YHCYCQHPPHKNVSSGPYLCP 134
+H C PP + G + CP
Sbjct: 135 WHFDCMQPPINELPEGEWYCP 155
>gi|157128953|ref|XP_001661565.1| requim, req/dpf2 [Aedes aegypti]
gi|108872440|gb|EAT36665.1| AAEL011279-PA [Aedes aegypti]
Length = 433
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 53/123 (43%), Gaps = 2/123 (1%)
Query: 13 SGYVATASKDIAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLK 70
S + A + E+G++ + C C EN+ ++SC CG+ H CL+
Sbjct: 301 SNASSNAPSPVPPIEKGRAIPSPYCDFCLGDARENKKTLEPEELVSCSDCGRSGHPTCLQ 360
Query: 71 NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP 130
A W+C C+ C IC + + ++ +FC CD YH YC PP G
Sbjct: 361 FTANMIISVRKYRWQCIECKYCTICGTSDNDDQLLFCDDCDRGYHMYCLSPPLLTPPEGS 420
Query: 131 YLC 133
+ C
Sbjct: 421 WSC 423
>gi|170059917|ref|XP_001865571.1| zinc-finger protein DPF3 [Culex quinquefasciatus]
gi|167878516|gb|EDS41899.1| zinc-finger protein DPF3 [Culex quinquefasciatus]
Length = 450
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 51/109 (46%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
E+G++ + C C EN+ A ++SC CG+ H CL+ A W
Sbjct: 332 EKGRAVPSPYCDFCLGDARENKKTFEAEELVSCSDCGRSGHPTCLQFTANMIISVRKYRW 391
Query: 85 KCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C +C + + ++ +FC CD YH YC PP + G + C
Sbjct: 392 QCIECKYCTMCGTSDNDDQLLFCDDCDRGYHMYCLSPPLISPPEGSWSC 440
>gi|325192337|emb|CCA26782.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 2128
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 85/206 (41%), Gaps = 26/206 (12%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR 90
++T LC V + G A +L C C + H C+ + + F +++ CP+C
Sbjct: 1162 AHTYAWLDLCVVCASAG--SASSLLFCMDCAQTVHTFCVLSDTSAWNKF--NAFHCPNCL 1217
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
C +C T + C +C H C PP + S+ C + +C C +
Sbjct: 1218 QCRVCDET---EGVLACPKCGKGAHGACLDPPIDDPSA--IYCGECVECKHCQTPGTPRT 1272
Query: 151 LSVRWFLGYTCCD-----ACGRLFVK--GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 203
S+ L CC R F K + CP+CL+ E+ + CD C+RW H
Sbjct: 1273 YSLVPDLCLQCCSRQERWKSARQFTKPLADKCPICLETCNVKEA---IHCDACERWTHST 1329
Query: 204 CDGISDEKYLQFQVDGNLQYRCPTCR 229
CD ++ D + Y CP+CR
Sbjct: 1330 CDPMAFR-------DKDALYVCPSCR 1348
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 60/140 (42%), Gaps = 29/140 (20%)
Query: 91 ICEICRR-TGDPNKFMF-CRRCDAAYHCYCQHPPHKNV--------------SSGPYLCP 134
IC CR + D N+ + C+RC+ +H C PP+ + S P+ C
Sbjct: 740 ICSGCRLPSMDSNQSLLSCQRCNKRFHATCTDPPYVDTITVWDPHDHSELDHISLPFTCA 799
Query: 135 KHTKCHSCGSNVPGNGLSVRWFLGYTC---CDACGRLFVKGNYCPVCLKVYRDSES-TP- 189
C C N N VRW L T C AC L+ +C +C YR E+ TP
Sbjct: 800 DCDVCAGCNQNRTENPW-VRWRLPLTIVSLCGACELLYRSDQFCSIC---YRALEALTPR 855
Query: 190 ----MVCCDVCQRWVHCQCD 205
++ C C+ +VH +C+
Sbjct: 856 KELSLLSCSSCRHFVHPECE 875
>gi|449543136|gb|EMD34113.1| hypothetical protein CERSUDRAFT_141605, partial [Ceriporiopsis
subvermispora B]
Length = 260
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M+SC CG+ H CL + A D+ WKC C+ CEIC D N+ MFC CD
Sbjct: 1 MVSCAECGRSAHPTCL-DLADIGDVMRSYDWKCMECKNCEICHSKEDDNRMMFCDFCDRG 59
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
+H C PP G + C
Sbjct: 60 WHMDCLDPPLSEAPPGKWHC 79
>gi|260818085|ref|XP_002603915.1| hypothetical protein BRAFLDRAFT_131253 [Branchiostoma floridae]
gi|229289239|gb|EEN59926.1| hypothetical protein BRAFLDRAFT_131253 [Branchiostoma floridae]
Length = 479
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 51/119 (42%), Gaps = 5/119 (4%)
Query: 18 TASKDIAGEEQGQSNTNVMCRLCFVGENEGCERA---RRMLSCKSCGKKYHRNCLKNWAQ 74
T D E +S ++ C C + C R+ +L CK C K H +C++ A
Sbjct: 211 THGPDFEQEPLVKSPSDPRCDYCLLTSE--CNRSGEEEDLLICKDCNAKAHPSCMRYSAD 268
Query: 75 NRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
S W+C C+ C IC +GD +FC CD YH C P + G ++C
Sbjct: 269 LARRSRMSPWQCIDCKTCYICDDSGDAETLLFCDACDKGYHMACHEPAVTHKPLGKWVC 327
>gi|443897765|dbj|GAC75104.1| hypothetical protein PANT_14d00040 [Pseudozyma antarctica T-34]
Length = 1176
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 47/104 (45%), Gaps = 7/104 (6%)
Query: 36 MCRLCFVGENEGCERARRML-SCKSCGKKYHRNCLKNWAQN----RDLFHWSSWKCPSCR 90
+C C + E ++L SC CG H CL+ W + R + W+C C+
Sbjct: 91 LCAFCQQPADRPKENTPKLLISCFECGSSGHPACLR-WGRKPTKVRSALSYE-WRCIECK 148
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
CEIC GD + MFC CD +H YC PP G + CP
Sbjct: 149 KCEICCDKGDDAQLMFCDGCDRGWHLYCLSPPLAKPPKGQWQCP 192
>gi|66359990|ref|XP_627173.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
gi|46228588|gb|EAK89458.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
Length = 2244
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 55/130 (42%), Gaps = 3/130 (2%)
Query: 103 KFMFCRRCDAAYHCYC--QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 160
+F+ C C YH C P + C KC CG G W ++
Sbjct: 538 EFVVCGTCGICYHGSCGNSFVPPLLFGGNNFNCSNCCKCIHCGYRDNGFMDYASWDSTFS 597
Query: 161 CCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN 220
C C + F +G +C +C K++ S + CD+C+ WVH CD +E ++F +
Sbjct: 598 SCIRCCKGFERGQFCSICRKIWTSSWEGEWLQCDICKFWVHYDCDKDLNEP-IEFYSNVK 656
Query: 221 LQYRCPTCRG 230
Y CP CR
Sbjct: 657 NLYNCPACRS 666
>gi|391344898|ref|XP_003746731.1| PREDICTED: zinc finger protein DPF3-like [Metaseiulus occidentalis]
Length = 470
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 47/101 (46%), Gaps = 2/101 (1%)
Query: 35 VMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
+ C C EN+ ++ M++C CG+ H CL+ + W+C C+ C
Sbjct: 348 LYCDFCLGTESENKKTKQPEEMVTCADCGRSGHPTCLQFTDVMTNNVKKYRWQCIECKTC 407
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C + + ++ +FC CD YH YC PP K G + C
Sbjct: 408 TLCGTSENDDQMLFCDDCDRGYHMYCLSPPLKEPPEGSWSC 448
>gi|196007802|ref|XP_002113767.1| hypothetical protein TRIADDRAFT_26973 [Trichoplax adhaerens]
gi|190584171|gb|EDV24241.1| hypothetical protein TRIADDRAFT_26973 [Trichoplax adhaerens]
Length = 443
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 64/141 (45%), Gaps = 9/141 (6%)
Query: 8 ARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYH 65
+R S ++ A +A + +N+ C +CF E N+ E ++ C CG H
Sbjct: 298 SRNSSSDHIGNARGKLAANKTVSTNS---CGVCFASEYVNDLGE-IEELIKCSQCGSLTH 353
Query: 66 RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKN 125
CL+ + + W+C C+ C C D +K MFC RCD YH +C +
Sbjct: 354 PTCLELTPEMVKVIQTYHWQCMDCKTCTACSDPYDEDKMMFCDRCDRGYHTFCVG--LDS 411
Query: 126 VSSGPYLCPKHTKCHSCGSNV 146
+ SG ++CP T+ HS N
Sbjct: 412 IPSGNWICPSCTQ-HSGNKNA 431
>gi|307169876|gb|EFN62385.1| Zinc finger protein ubi-d4 A [Camponotus floridanus]
Length = 528
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 50/112 (44%), Gaps = 2/112 (1%)
Query: 24 AGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW 81
AG + + + C C EN+ + ++SC CG+ H CL+ A
Sbjct: 406 AGGKSSAAQPSPYCDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRK 465
Query: 82 SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD YH YC PP + G + C
Sbjct: 466 YRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 517
>gi|118344068|ref|NP_001071860.1| zinc finger protein [Ciona intestinalis]
gi|70571572|dbj|BAE06775.1| zinc finger protein [Ciona intestinalis]
Length = 399
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 10/110 (9%)
Query: 30 QSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS----S 83
Q+ +N C C EN+ + ++SC CG+ H CL Q D+ + S
Sbjct: 273 QTQSNNYCDFCLGDADENKKTGESEELVSCSDCGRSGHPTCL----QFTDIMTMNVKKYS 328
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C +C + + + +FC CD YH YC P +N G ++C
Sbjct: 329 WQCIECKSCHVCGTSDNDEQLLFCDDCDRGYHMYCLQPRMENPPEGSWIC 378
>gi|281205681|gb|EFA79870.1| hypothetical protein PPL_06690 [Polysphondylium pallidum PN500]
Length = 486
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 11/109 (10%)
Query: 132 LCPKHTKCHSCGSNVPGNGLSVRWFLGYT---CCDACGRLFVKGNYCPVCLKVYRDSEST 188
+ P+ +C CGS PG G + +W G + C++CG +K CP+C K+Y ++ +
Sbjct: 200 MVPRVNRC-VCGSTTPGKGPTCKWRKGPSGEVLCNSCGLQNMKKPKCPLCGKMYNKNKDS 258
Query: 189 PMVC-------CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
C CD C +WV +CDGI D L Y CP CR
Sbjct: 259 STECDSDEWIRCDDCSQWVMTECDGIKDISLYDDTQPNPLHYSCPKCRN 307
>gi|195132645|ref|XP_002010753.1| GI21713 [Drosophila mojavensis]
gi|193907541|gb|EDW06408.1| GI21713 [Drosophila mojavensis]
Length = 2287
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 2/98 (2%)
Query: 50 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
R + C SC +K H +C++ + +W+C C+ C CR + P K ++C +
Sbjct: 1971 RPEAFIRCYSCRRKVHPSCIEMPQRMVGRVRNYNWQCAECKCCIKCRSSQQPGKMLYCEQ 2030
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
CD YH YC K V G + C + + C CG+ P
Sbjct: 2031 CDRGYHIYCLG--IKTVPEGRWSCERCSICMRCGATRP 2066
>gi|198431091|ref|XP_002124209.1| PREDICTED: similar to monocytic leukemia zinc finger protein [Ciona
intestinalis]
Length = 2554
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 45/99 (45%), Gaps = 1/99 (1%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C+ E + +LSC CG H C+K + S W+C C+ C +
Sbjct: 257 ICSYCYGTAECNKTGKQEELLSCADCGSSGHPICMKLSSDLVPKIRGSRWQCIECKSCRV 316
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C G+ + +FC CD +H C +PP + G ++C
Sbjct: 317 CGSKGNADNLLFCDSCDRGFHMECCNPPLLKMPKGSFIC 355
>gi|327259541|ref|XP_003214595.1| PREDICTED: zinc finger protein DPF3-like [Anolis carolinensis]
Length = 398
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 49/107 (45%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G +N C C G N + R ++SC CG+ H CL+ A + W+C
Sbjct: 274 GAIISNNYCDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTANMTEAVKTYQWQC 333
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 334 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLNPPVSEPPEGSWSC 380
>gi|427778555|gb|JAA54729.1| Putative d4 [Rhipicephalus pulchellus]
Length = 487
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 2/105 (1%)
Query: 31 SNTNVMCRLCFV--GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
+N + C C GEN+ + ++SC CG+ H +CL+ W+C
Sbjct: 346 ANPSPYCDFCLGDNGENKKTRQPEELVSCSDCGRSAHPSCLQFTPNMTVSVKKYRWQCIE 405
Query: 89 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 406 CKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLQPPLSEPPEGLWSC 450
>gi|194762684|ref|XP_001963464.1| GF20276 [Drosophila ananassae]
gi|190629123|gb|EDV44540.1| GF20276 [Drosophila ananassae]
Length = 2062
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 4/129 (3%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
G E Q++ + C +C ++ + + C SC ++ H +C+ + +
Sbjct: 1720 GRESSQASRLLNCGVCLRSQHRNARDMPEVFIRCYSCRRRVHPSCIDMPQRMVGRVRNYN 1779
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
W+C C+ C CR P K +FC +CD YH YC + V G + C + C CG
Sbjct: 1780 WQCSGCKCCIKCRSNQRPGKMLFCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCMRCG 1837
Query: 144 SNVPGNGLS 152
+ P GLS
Sbjct: 1838 ATRP-EGLS 1845
>gi|156384761|ref|XP_001633301.1| predicted protein [Nematostella vectensis]
gi|156220369|gb|EDO41238.1| predicted protein [Nematostella vectensis]
Length = 315
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C V EN+ R +LSC CG+ H +CL+ + W+C C+
Sbjct: 189 NSYCDFCLGDVSENKKSGRPEELLSCSDCGRSGHPSCLQFTPKLTYNVKKYRWQCIECKS 248
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C + + ++ +FC CD YH YC +PP G ++C
Sbjct: 249 CTLCGTSDNDDQLLFCDDCDRGYHMYCLNPPMDKPPEGHWMC 290
>gi|269785119|ref|NP_001161515.1| Cer-d4 protein [Saccoglossus kowalevskii]
gi|268054007|gb|ACY92490.1| Cer-d4 [Saccoglossus kowalevskii]
Length = 392
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 48/103 (46%), Gaps = 2/103 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C EN+ + ++SC CG+ H +CL+ + W+C
Sbjct: 286 GIAAPNNYCDFCLGDASENKKTGVSEELISCSDCGRSGHPSCLQFTTKMTSNVKKYRWQC 345
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
C+ C +C + + ++ +FC CD YH YC +PP + G
Sbjct: 346 IECKSCHLCGTSDNDDQLLFCDDCDRGYHMYCLNPPMSHPPEG 388
>gi|156353196|ref|XP_001622960.1| predicted protein [Nematostella vectensis]
gi|156354354|ref|XP_001623361.1| predicted protein [Nematostella vectensis]
gi|156209598|gb|EDO30860.1| predicted protein [Nematostella vectensis]
gi|156210052|gb|EDO31261.1| predicted protein [Nematostella vectensis]
Length = 132
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 4/113 (3%)
Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 160
+ C +C YH C P + V G ++C + +C CGS G W +T
Sbjct: 1 LLMCDKCQRGYHVDCLGPSYPVVPEGSEDTWICGRCAQCKLCGSKSAGEDPEAVWMHEFT 60
Query: 161 CCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 212
C CG + GNYCP+C K Y D++ + M+ C+ CQ WVH C I+ ++Y
Sbjct: 61 HCYDCGTAWDNGNYCPICEKCYSDNDFDSKMMHCNDCQHWVHASCQNINPDEY 113
>gi|71895933|ref|NP_001025643.1| PHD finger protein 10 [Xenopus (Silurana) tropicalis]
gi|60550967|gb|AAH91612.1| PHD finger protein 10 [Xenopus (Silurana) tropicalis]
Length = 430
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 11/141 (7%)
Query: 10 RFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNC 68
R E + SK ++G + +SN N +C +C G++ + R+ R++ C C H +C
Sbjct: 286 RKEGSKRSVLSKSVSGY-KPKSNPNAICGICLKGKDANKKGRSERLIHCSQCDNSGHPSC 344
Query: 69 LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSS 128
L + + W+C C+ C IC + + MFC CD YH +C + S
Sbjct: 345 LDMSPELVTVIKKYPWQCMECKTCIICGQPHHEEEMMFCDTCDRGYHTFCVG--LGALPS 402
Query: 129 GPYLCPKHTKCHSCGSNVPGN 149
G ++C C NVP
Sbjct: 403 GRWIC-------DCCQNVPST 416
>gi|84997431|ref|XP_953437.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304433|emb|CAI76812.1| hypothetical protein, conserved [Theileria annulata]
Length = 3595
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 58/134 (43%), Gaps = 12/134 (8%)
Query: 105 MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 163
+ C C + H C +P N+ + C T+C SCG + W L + C
Sbjct: 1141 VVCVSCSISAHRSCCYPMVPNLLFIESWKCDYCTQCISCGYRDHNTADYLNWGLFFFFCL 1200
Query: 164 ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD--- 218
C L + NYC +C KV+ D S V C+ C+ W+H +CD ++ + D
Sbjct: 1201 KCWELLERSNYCGICYKVWTHFDGSSQKWVQCEGCKLWIHIECDDLA-----RLITDCPS 1255
Query: 219 -GNLQYRCPTCRGE 231
N YRC CR E
Sbjct: 1256 SRNQNYRCCICRSE 1269
>gi|290973625|ref|XP_002669548.1| predicted protein [Naegleria gruberi]
gi|284083097|gb|EFC36804.1| predicted protein [Naegleria gruberi]
Length = 370
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 53/100 (53%), Gaps = 9/100 (9%)
Query: 40 CFV--GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRR 97
C+V NEG ++ + ML+CK+CG K H C +N + L + SW+C C+ CE+C+
Sbjct: 32 CYVCHKTNEGTDK-QTMLTCKTCGLKVHAGCYEN--THFSLKYADSWECVDCKKCEVCKD 88
Query: 98 TG--DPNKFMFCRRCDAAYHCYCQHPPHKNVSSG--PYLC 133
+ N+ + C RCD YH C + V G P+ C
Sbjct: 89 ANYREGNEILMCNRCDKGYHQLCCSEKFRTVPEGDKPWFC 128
>gi|195345773|ref|XP_002039443.1| GM22724 [Drosophila sechellia]
gi|194134669|gb|EDW56185.1| GM22724 [Drosophila sechellia]
Length = 1889
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 3/112 (2%)
Query: 37 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C +C ++ + + C +C K+ H +C+ A+ +W+C C+ C C
Sbjct: 1568 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPARMVGRVRNYNWQCAGCKCCIKC 1627
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
R + P K +FC +CD YH YC K V G + C + C CG+ P
Sbjct: 1628 RSSQRPGKMLFCEQCDRGYHIYCLG--LKTVPDGRWSCERCCFCVRCGATKP 1677
>gi|405964745|gb|EKC30194.1| Histone acetyltransferase MYST4 [Crassostrea gigas]
Length = 387
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 40/80 (50%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C+ C K H +C+ A S W+C C+ C +C+ +GDP+ +FC CD
Sbjct: 163 ILVCQDCNAKAHPSCMGYNAILARRTLESPWQCIDCKTCTVCQDSGDPDTMLFCDACDKG 222
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
YH C P ++ G + C
Sbjct: 223 YHMTCHEPAIEDKPQGKWEC 242
>gi|194864252|ref|XP_001970846.1| GG10866 [Drosophila erecta]
gi|190662713|gb|EDV59905.1| GG10866 [Drosophila erecta]
Length = 497
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 53/120 (44%), Gaps = 2/120 (1%)
Query: 16 VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
ATA+K E+ + + C C EN+ ++SC CG+ H +CL+ A
Sbjct: 367 FATANKVKQRVERDIAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTA 426
Query: 74 QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 427 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLMTPPEGSWSC 486
>gi|37362278|gb|AAQ91267.1| requiem, apoptosis response zinc finger gene [Danio rerio]
Length = 368
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H +CL+ A W+C
Sbjct: 240 GLALPNDYCDFCLGDSSMNQKTGQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQC 299
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP + G + C
Sbjct: 300 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLSPPMSDPPEGSWSC 346
>gi|412986144|emb|CCO17344.1| predicted protein [Bathycoccus prasinos]
Length = 1990
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 138 KCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
+C SCG + L + F C C ++ +G YCP C KV+ + PM+ CD C+
Sbjct: 223 QCASCGVSCARKELDAKQF-----CLLCAKMHGEGQYCPCCGKVWHYANCGPMIQCDTCE 277
Query: 198 RWVHCQCDGISDEKYLQFQVDG--------NLQYRCPTC 228
WVH CD + E L+ + D + Y CPTC
Sbjct: 278 MWVHDLCDATAAE-ILKKEADALKEGREEEEIPYNCPTC 315
>gi|156380495|ref|XP_001631804.1| predicted protein [Nematostella vectensis]
gi|156218850|gb|EDO39741.1| predicted protein [Nematostella vectensis]
Length = 273
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 3/102 (2%)
Query: 30 QSNTNVMCRLCFVGENEGCERA--RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
++ N +C C +G E + ++SC CG H +CLK W+C
Sbjct: 172 KATPNPLCGFC-LGPAESNKEGDYEELISCADCGNSGHPSCLKYSPALTARVQSEPWQCI 230
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
C+ C +CR GD + +FC CD +H C PP + +G
Sbjct: 231 ECKTCSVCRDAGDADNLLFCDMCDRGFHMECLDPPMSEMPTG 272
>gi|71029596|ref|XP_764441.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351395|gb|EAN32158.1| hypothetical protein, conserved [Theileria parva]
Length = 3588
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 59/134 (44%), Gaps = 12/134 (8%)
Query: 105 MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 163
+ C C + H C +P N+ + C T+C SCG + W L + C
Sbjct: 1176 VVCVSCSTSAHRSCCYPMVPNLLFIESWKCDYCTQCISCGYRDITCADYLNWGLFFFFCL 1235
Query: 164 ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD--- 218
C L + NYC +C KV+ D+ S V C+ C+ W+H +CD ++ + D
Sbjct: 1236 KCWELLERSNYCGICYKVWTNFDTSSQKWVQCEGCKLWIHIECDDLA-----RLITDCPS 1290
Query: 219 -GNLQYRCPTCRGE 231
N YRC CR E
Sbjct: 1291 SRNQNYRCLICRSE 1304
>gi|320167672|gb|EFW44571.1| MYST histone acetyltransferase 2 [Capsaspora owczarzaki ATCC 30864]
Length = 570
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 46/102 (45%), Gaps = 3/102 (2%)
Query: 34 NVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
+ +C LC ++ C ++ C +C H +CL W+C +C+ C
Sbjct: 100 DAVCALC---QSATCAARDSLIMCSNCSDCAHPSCLNLTKAAAAKVKTYPWRCSNCKTCS 156
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
+C + G K MFC CD H +C PP K+ S + CP+
Sbjct: 157 VCDKAGHEKKMMFCITCDRGTHSFCAQPPMKDPSEVAWSCPE 198
>gi|62177137|ref|NP_997861.2| D4, zinc and double PHD fingers family 2, like [Danio rerio]
gi|62026699|gb|AAH92130.1| D4, zinc and double PHD fingers family 2, like [Danio rerio]
gi|182892074|gb|AAI65789.1| Dpf2l protein [Danio rerio]
Length = 405
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H +CL+ A W+C
Sbjct: 277 GLALPNDYCDFCLGDSSMNQKTGQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQC 336
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP + G + C
Sbjct: 337 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLSPPMSDPPEGSWSC 383
>gi|291221144|ref|XP_002730583.1| PREDICTED: d4-like [Saccoglossus kowalevskii]
Length = 493
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 7/107 (6%)
Query: 33 TNVMCRLCFVGENEGCER---ARRMLSCKSCGKKYHRNCLKNWAQNRDL-FHWSSWKCPS 88
T+ C C + C R +L CK C K H +C+ N+ Q + S W+C
Sbjct: 198 TDTKCDYCLLTAE--CNRNGKTEDLLVCKDCNAKAHPSCM-NYTQELAVRARMSPWQCFD 254
Query: 89 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
C+ C +C +GD + +FC CD YH C P +G ++C K
Sbjct: 255 CKTCCVCGDSGDADNLLFCDACDKGYHMACHTPQILRKPTGKWMCIK 301
>gi|405965654|gb|EKC31016.1| Histone acetyltransferase MYST4 [Crassostrea gigas]
Length = 2037
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 1/100 (1%)
Query: 35 VMCRLCFVGENEGCERARR-MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
++C C EN+ + ++SC CG H +CLK + + W+C C+ C
Sbjct: 207 LICCFCLGDENKNRDGVPEDLISCAECGNSGHPSCLKFSPELTETVKKLRWQCIDCKTCS 266
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+++G + +FC CD +H C PP G + C
Sbjct: 267 FCQKSGREDNMLFCDLCDRGFHMECCDPPLSKAPKGKWKC 306
>gi|312377713|gb|EFR24474.1| hypothetical protein AND_10892 [Anopheles darlingi]
Length = 539
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
E+ ++ + C C EN+ ++SC CG+ H +CL+ A W
Sbjct: 420 EKSRAVPSPYCDFCLGDARENKKTLEPEELVSCSDCGRSGHPSCLQFTANMIISVRKYRW 479
Query: 85 KCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C IC + + ++ +FC CD YH YC PP + G + C
Sbjct: 480 QCIECKYCTICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVSPPEGSWSC 528
>gi|45382857|ref|NP_989970.1| zinc finger protein DPF3 [Gallus gallus]
gi|18202301|sp|P58270.1|DPF3_CHICK RecName: Full=Zinc finger protein DPF3; AltName: Full=Zinc finger
protein cer-d4
gi|14010362|gb|AAK51968.1|AF362754_1 cer-d4 [Gallus gallus]
Length = 427
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C G N + R ++SC CG+ H CL+ + W+C C+
Sbjct: 308 NNYCDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKS 367
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 368 CSLCGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 409
>gi|449502401|ref|XP_004174505.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein DPF3
[Taeniopygia guttata]
Length = 392
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C G N + R ++SC CG+ H CL+ + W+C C+
Sbjct: 273 NNYCDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKS 332
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 333 CSLCGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 374
>gi|345480756|ref|XP_001605917.2| PREDICTED: zinc finger protein DPF3 [Nasonia vitripennis]
Length = 551
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 2/113 (1%)
Query: 23 IAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFH 80
+ G++ + + C C EN+ + ++SC CG+ H CL+ A
Sbjct: 428 MDGKKAKPAQPSPYCDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVR 487
Query: 81 WSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD YH YC PP + G + C
Sbjct: 488 KYRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLTSPPEGSWSC 540
>gi|195048475|ref|XP_001992534.1| GH24152 [Drosophila grimshawi]
gi|193893375|gb|EDV92241.1| GH24152 [Drosophila grimshawi]
Length = 2464
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 2/93 (2%)
Query: 55 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 114
+ C SC ++ H +C++ + +W+C C+ C CR + P K ++C +CD Y
Sbjct: 2085 IRCYSCRRRVHPSCIEMPQRMVGRVRNYNWQCAECKCCIKCRSSQQPGKMLYCEQCDRGY 2144
Query: 115 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
H YC K V G + C + + C CG+ P
Sbjct: 2145 HIYCLG--VKTVPEGRWSCERCSICMRCGATRP 2175
>gi|355733003|gb|AES10880.1| myeloid/lymphoid or mixed-lineage leukemia 3 isoform 2 [Mustela
putorius furo]
Length = 102
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 2/95 (2%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 98
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 8 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 65
Query: 99 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
DP + + C CD +YH YC PP + V G + C
Sbjct: 66 SDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKC 100
>gi|256070756|ref|XP_002571708.1| requim req/dpf2 [Schistosoma mansoni]
gi|350646387|emb|CCD58946.1| requim, req/dpf2, putative [Schistosoma mansoni]
Length = 450
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C E N+ R+ ML C CG+ H +CL+ H W+C C+ C +
Sbjct: 331 CDFCLGDESLNKKTGRSEDMLRCSDCGRFAHFSCLQFTPNMITSVHTYRWQCIECKTCWL 390
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + + +FC CD YH YC +PP G + C
Sbjct: 391 CGTSENDEQMLFCDDCDRGYHMYCLNPPLSEPPEGSWSC 429
>gi|91094021|ref|XP_967377.1| PREDICTED: similar to d4 CG2682-PB [Tribolium castaneum]
Length = 525
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ ++SC CG+ H +CL + SW+C C+ C +
Sbjct: 418 CDFCLGDSRENKKTGVMEELVSCSDCGRSGHPSCLLFTENMKISVKKYSWQCIECKCCSV 477
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 478 CGNSDNDDQLLFCDDCDRGYHMYCLSPPLTDPPEGSWSC 516
>gi|162287269|ref|NP_001104639.1| zinc finger protein DPF3 [Danio rerio]
gi|215275221|sp|A9LMC0.1|DPF3_DANRE RecName: Full=Zinc finger protein DPF3
gi|159906528|gb|ABX10892.1| D4 zinc and double PHD fingers family 3 protein [Danio rerio]
Length = 391
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C G N +A ++SC CG+ H +CL+ W+C C+
Sbjct: 273 NDYCDFCLGDSGSNRKTGQAEELVSCSDCGRSGHPSCLQFTDNMMQAVRTYQWQCIECKS 332
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 333 CSLCGTSENDDQLLFCDDCDRGYHMYCLKPPMTQPPEGSWSC 374
>gi|348523632|ref|XP_003449327.1| PREDICTED: zinc finger protein neuro-d4-like [Oreochromis
niloticus]
Length = 381
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 53/124 (42%), Gaps = 6/124 (4%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 260 GSVIANGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 317
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTKCHSCGS 144
C+ C +C + + ++ +FC CD YH YC PP G + LC + K + G
Sbjct: 318 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSCHLCLRQLKEKASGI 377
Query: 145 NVPG 148
PG
Sbjct: 378 EDPG 381
>gi|357436505|ref|XP_003588528.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355477576|gb|AES58779.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 973
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 47/94 (50%), Gaps = 6/94 (6%)
Query: 139 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C CG ++P N L G C C RL +YC +C KV+ S+S V CD
Sbjct: 368 CDECGLDLPFNMSKKTKDLTPGGQLLCKTCARLMKSKHYCGICKKVWNQSDSGSWVRCDG 427
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
C+ WVH +CD IS + F+ G+ Y CP C+
Sbjct: 428 CKVWVHAECDKISS---ILFKNLGSTDYFCPACK 458
>gi|14010360|gb|AAK51967.1|AF362753_1 cer-d4 [Gallus gallus]
Length = 378
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C G N + R ++SC CG+ H CL+ + W+C C+
Sbjct: 259 NNYCDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKS 318
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 319 CSLCGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 360
>gi|353523840|ref|NP_001088070.2| PHD finger protein 10 [Xenopus laevis]
gi|296439270|sp|Q63ZP1.2|PHF10_XENLA RecName: Full=PHD finger protein 10
Length = 506
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 2/110 (1%)
Query: 10 RFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNC 68
R E + SK ++G + +S N +C +C G++ + R+ R++ C C H +C
Sbjct: 362 RKEGSKRSVLSKSVSGY-KPKSIPNAICGICLKGKDANKKGRSERLIHCSQCDNSGHPSC 420
Query: 69 LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
L A+ + W+C C+ C IC + + MFC CD YH +C
Sbjct: 421 LDMSAELVAVIKKYPWQCMECKTCIICGQPHHEEEMMFCDTCDRGYHTFC 470
>gi|26336851|dbj|BAC32109.1| unnamed protein product [Mus musculus]
Length = 440
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414
Query: 117 YCQHPPHKNVSSGPYLC 133
+C P K+V + + C
Sbjct: 415 FCLQPVMKSVPTNGWKC 431
>gi|301769749|ref|XP_002920299.1| PREDICTED: zinc finger protein DPF3-like [Ailuropoda melanoleuca]
Length = 633
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 509 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 568
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 569 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 615
>gi|443694019|gb|ELT95254.1| hypothetical protein CAPTEDRAFT_227914 [Capitella teleta]
Length = 675
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 61/133 (45%), Gaps = 13/133 (9%)
Query: 8 ARRFESG----YVATASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGK 62
AR+ SG + T S D AG S+ C +C + + +M++C C
Sbjct: 532 ARKKVSGSHTEWAPTISSDSAG-----SDVVPKCAVCGGTSQRNKDNQEEQMVTCIKCHT 586
Query: 63 KYH-RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
H CL+ A+ + H SW+C C++C C GD K MFC CD +H +C
Sbjct: 587 PAHPSTCLELSAEMIPIIHTYSWQCMDCKMCAKCNDAGDEEKMMFCDHCDRGFHTFCLG- 645
Query: 122 PHKNVSSGPYLCP 134
+ + +G ++CP
Sbjct: 646 -LRVIPTGRWVCP 657
>gi|380014950|ref|XP_003691477.1| PREDICTED: zinc finger protein DPF3-like [Apis florea]
Length = 527
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 418 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 477
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 478 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516
>gi|350413485|ref|XP_003490006.1| PREDICTED: zinc finger protein DPF3-like [Bombus impatiens]
Length = 468
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 359 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 418
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 419 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 457
>gi|393908955|gb|EJD75261.1| hypothetical protein LOAG_17567 [Loa loa]
Length = 371
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 49/104 (47%), Gaps = 2/104 (1%)
Query: 32 NTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC 89
+ + +C LC +N+ + +++SC CG+ H +CLK W+C C
Sbjct: 248 DVSTVCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIEC 307
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ C IC + + ++ +FC CD +H YC PP G + C
Sbjct: 308 KSCAICGFSDNDDQLLFCDDCDRGFHLYCLRPPLSQAPEGEWSC 351
>gi|388580883|gb|EIM21195.1| hypothetical protein WALSEDRAFT_38994 [Wallemia sebi CBS 633.66]
Length = 808
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 6/84 (7%)
Query: 54 MLSCKSCGKKYHRNCL---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRC 110
M++C C H CL QN ++WS C C+ CE+C+ G+ + +FC C
Sbjct: 1 MITCGQCQSSAHPTCLHFTPTLTQNTLSYNWS---CVDCKGCEVCKNKGNEDDIIFCDLC 57
Query: 111 DAAYHCYCQHPPHKNVSSGPYLCP 134
D +H +C +PP +G + CP
Sbjct: 58 DRGWHMHCLNPPMNEPPAGDFACP 81
>gi|198455671|ref|XP_001357517.2| GA15428 [Drosophila pseudoobscura pseudoobscura]
gi|198135345|gb|EAL24641.2| GA15428 [Drosophila pseudoobscura pseudoobscura]
Length = 507
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 44 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 407 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 466
Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+FC CD YH YC PP G + C
Sbjct: 467 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 496
>gi|325181008|emb|CCA15418.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 639
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 30/59 (50%)
Query: 172 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 230
G YCPVC +VY D ++ VCCD C+ WVH CD L N Y CP C G
Sbjct: 580 GQYCPVCNQVYEDDDAASFVCCDSCEMWVHSACDTDLTPAKLATLAGTNETYICPLCGG 638
>gi|195476347|ref|XP_002086095.1| GE11247 [Drosophila yakuba]
gi|194185954|gb|EDW99565.1| GE11247 [Drosophila yakuba]
Length = 497
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 44 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 397 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 456
Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+FC CD YH YC PP G + C
Sbjct: 457 LLFCDDCDRGYHMYCLSPPLMTPPEGSWSC 486
>gi|18032212|gb|AAL56647.1|AF217500_1 histone acetyltransferase MOZ2 [Homo sapiens]
Length = 2072
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++A +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|392564180|gb|EIW57358.1| hypothetical protein TRAVEDRAFT_125931 [Trametes versicolor
FP-101664 SS1]
Length = 270
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 42/94 (44%), Gaps = 4/94 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M+SC CG+ H +C+ D W+C +C+ C +CRR G+ + C CD
Sbjct: 1 MVSCVDCGRSGHPSCM-GLDNMGDAMRGYDWQCATCKSCSVCRRKGNEASMLICDHCDRG 59
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+H C PP + G + CP C G P
Sbjct: 60 WHMSCFDPPFRAPPEGTWHCP---SCPRVGETFP 90
>gi|307195046|gb|EFN77104.1| Zinc finger protein ubi-d4 A [Harpegnathos saltator]
Length = 534
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 425 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 484
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 485 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 523
>gi|52354758|gb|AAH82869.1| LOC494765 protein [Xenopus laevis]
Length = 417
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 53/110 (48%), Gaps = 2/110 (1%)
Query: 10 RFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNC 68
R E + SK ++G + +S N +C +C G++ + R+ R++ C C H +C
Sbjct: 273 RKEGSKRSVLSKSVSGY-KPKSIPNAICGICLKGKDANKKGRSERLIHCSQCDNSGHPSC 331
Query: 69 LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
L A+ + W+C C+ C IC + + MFC CD YH +C
Sbjct: 332 LDMSAELVAVIKKYPWQCMECKTCIICGQPHHEEEMMFCDTCDRGYHTFC 381
>gi|145344711|ref|XP_001416870.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577096|gb|ABO95163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1782
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 5/94 (5%)
Query: 138 KCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
+C SCG + S+R C C +L +G +CPVC +V+ S MV CD C+
Sbjct: 211 RCASCGVT---DDESLRSKNSNGLCALCRKLHKEGQFCPVCDRVWHWSAGDAMVGCDRCE 267
Query: 198 RWVHCQCDGISDEKYLQFQ--VDGNLQYRCPTCR 229
W+H +CD ++ E + Q D ++ Y CP CR
Sbjct: 268 MWIHRECDAVAAEVLDREQNGEDEDIPYACPVCR 301
>gi|332842752|ref|XP_001140541.2| PREDICTED: zinc finger protein DPF3 [Pan troglodytes]
Length = 367
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 243 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 302
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 303 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 349
>gi|340717364|ref|XP_003397154.1| PREDICTED: zinc finger protein DPF3-like [Bombus terrestris]
Length = 527
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 418 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 477
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 478 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516
>gi|328701324|ref|XP_001945217.2| PREDICTED: zinc finger protein ubi-d4-like isoform 1 [Acyrthosiphon
pisum]
Length = 521
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 52/121 (42%), Gaps = 5/121 (4%)
Query: 14 GYVATASKDIAGEEQGQSNTNVMCRLCFVGE-NEGCERARRMLSCKSCGKKYHRNCLKNW 72
GY T S G +N + C C E + + ++SC CG+ H CL+
Sbjct: 395 GYTKTKS----GGNNKSANPSPYCDFCLGDEVSNKSGQPEILISCSDCGRSGHPTCLQFT 450
Query: 73 AQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
A W+C C+ C IC + + ++ +FC CD YH YC PP + G +
Sbjct: 451 ANMIISVGKYRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHVYCLTPPLTSPPEGCWS 510
Query: 133 C 133
C
Sbjct: 511 C 511
>gi|344273540|ref|XP_003408579.1| PREDICTED: zinc finger protein DPF3-like [Loxodonta africana]
Length = 427
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 303 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 362
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 363 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 409
>gi|354468677|ref|XP_003496778.1| PREDICTED: histone acetyltransferase MYST4 isoform 1 [Cricetulus
griseus]
Length = 1756
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++A +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|195148883|ref|XP_002015392.1| GL11042 [Drosophila persimilis]
gi|194109239|gb|EDW31282.1| GL11042 [Drosophila persimilis]
Length = 567
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 44 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 467 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 526
Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+FC CD YH YC PP G + C
Sbjct: 527 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 556
>gi|354468679|ref|XP_003496779.1| PREDICTED: histone acetyltransferase MYST4 isoform 2 [Cricetulus
griseus]
Length = 2047
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++A +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|328778645|ref|XP_395098.4| PREDICTED: zinc finger protein DPF3-like [Apis mellifera]
Length = 533
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 424 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 483
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 484 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 522
>gi|363735536|ref|XP_421609.3| PREDICTED: histone acetyltransferase KAT6B [Gallus gallus]
Length = 2025
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 60/131 (45%), Gaps = 5/131 (3%)
Query: 8 ARRFESGYVAT---ASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKK 63
A ++ SG+ A+ S ++Q +++ +C C E+ ++ +LSC CG
Sbjct: 184 APKYSSGFPASLPPVSLLPHEKDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSS 243
Query: 64 YHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPP 122
H +CLK + W+C C+ C CR G + + +FC CD +H C PP
Sbjct: 244 GHPSCLKFCPELTSNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPP 303
Query: 123 HKNVSSGPYLC 133
+ G ++C
Sbjct: 304 LSRMPKGMWIC 314
>gi|148670787|gb|EDL02734.1| D4, zinc and double PHD fingers, family 3, isoform CRA_a [Mus
musculus]
Length = 381
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 252 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 311
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 312 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 358
>gi|221120366|ref|XP_002164134.1| PREDICTED: histone acetyltransferase KAT6B-like [Hydra
magnipapillata]
Length = 832
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 50/111 (45%), Gaps = 9/111 (8%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L C CG H +C++ + W+C C+ C IC+ G+ +FC CD
Sbjct: 214 LLVCDECGNSGHPSCMQYSKELTARVRQEPWQCMECKKCNICKDQGEAANLLFCDACDKG 273
Query: 114 YHCYCQHPPHKNVSSGPYLC-----PKHTKCHSCGSNVPGNGL----SVRW 155
YH C PP ++ G ++C ++ K S+VP + L V+W
Sbjct: 274 YHMACLDPPLDDMPIGTWICDNCLSERNGKRRRISSSVPASLLLSTPDVKW 324
>gi|158292808|ref|XP_314129.4| AGAP005225-PA [Anopheles gambiae str. PEST]
gi|157017167|gb|EAA09477.4| AGAP005225-PA [Anopheles gambiae str. PEST]
Length = 468
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
E+ ++ + C C EN+ ++SC CG+ H +CL+ A W
Sbjct: 349 EKSRAVPSPYCDFCLGDARENKKTFEPEELVSCSDCGRSGHPSCLQFTANMIISVRKYRW 408
Query: 85 KCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C IC + + ++ +FC CD YH YC PP + G + C
Sbjct: 409 QCIECKYCTICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVSPPEGSWSC 457
>gi|395857461|ref|XP_003801110.1| PREDICTED: zinc finger protein DPF3 [Otolemur garnettii]
Length = 489
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 365 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 424
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 425 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 471
>gi|328701326|ref|XP_003241562.1| PREDICTED: zinc finger protein ubi-d4-like isoform 2 [Acyrthosiphon
pisum]
Length = 458
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 52/121 (42%), Gaps = 5/121 (4%)
Query: 14 GYVATASKDIAGEEQGQSNTNVMCRLCFVGE-NEGCERARRMLSCKSCGKKYHRNCLKNW 72
GY T S G +N + C C E + + ++SC CG+ H CL+
Sbjct: 332 GYTKTKS----GGNNKSANPSPYCDFCLGDEVSNKSGQPEILISCSDCGRSGHPTCLQFT 387
Query: 73 AQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
A W+C C+ C IC + + ++ +FC CD YH YC PP + G +
Sbjct: 388 ANMIISVGKYRWQCIECKCCSICGTSDNDDQLLFCDDCDRGYHVYCLTPPLTSPPEGCWS 447
Query: 133 C 133
C
Sbjct: 448 C 448
>gi|321460287|gb|EFX71331.1| hypothetical protein DAPPUDRAFT_308933 [Daphnia pulex]
Length = 501
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 48/110 (43%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
E + ++ + C C EN+ ++SC CG+ H +CL+ A
Sbjct: 382 EMKARAPPSPYCDFCLGDATENKKSGHPEELVSCADCGRSGHPSCLQFTANMIISVKQYR 441
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C +C + + + +FC CD YH YC PP G + C
Sbjct: 442 WQCIECKCCSLCGNSDNDEQLLFCDDCDRGYHMYCLKPPLSEPPEGSWSC 491
>gi|308235954|ref|NP_001184101.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
Length = 388
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ +A ++SC CG+ H +CL+ W+C
Sbjct: 264 GLALPNNYCDFCLGDSKINKKTNQAEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 323
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 324 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSC 370
>gi|194758284|ref|XP_001961392.1| GF11022 [Drosophila ananassae]
gi|190622690|gb|EDV38214.1| GF11022 [Drosophila ananassae]
Length = 812
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 37 CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ ++SC CG+ H +CL+ A W+C C+ C I
Sbjct: 389 CDFCLGDQRENKKTSMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSI 448
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
C + + ++ +FC CD YH YC PP
Sbjct: 449 CGTSDNDDQLLFCDDCDRGYHMYCLSPP 476
>gi|356503907|ref|XP_003520741.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 7/90 (7%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C +L YC +C K++ S+ VCCD C WVH +CD IS + + + N
Sbjct: 359 CKPCAKLIKSKQYCGICKKIWHHSDGGNWVCCDGCNVWVHAECDKISSKHFKDLE---NT 415
Query: 222 QYRCPTCRGECYQVRDLEDAVRELWRRKDM 251
Y CP C+G+ + E + + ++ K+M
Sbjct: 416 DYYCPDCKGK----FNCESSTSQTYKSKNM 441
>gi|402876619|ref|XP_003902055.1| PREDICTED: zinc finger protein DPF3-like, partial [Papio anubis]
Length = 277
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C G N + R ++SC CG+ H CL+ + W+C C+
Sbjct: 158 NNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKS 217
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 218 CILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 259
>gi|432891023|ref|XP_004075510.1| PREDICTED: zinc finger protein neuro-d4-like isoform 1 [Oryzias
latipes]
Length = 381
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 260 GSVIANGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 317
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 318 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 363
>gi|410059939|ref|XP_003318982.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Pan
troglodytes]
Length = 365
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347
Query: 117 YCQHPPHKNVSSGPYLC 133
+C P K+V + + C
Sbjct: 348 FCLQPVMKSVPTNGWKC 364
>gi|19921648|ref|NP_610163.1| d4, isoform A [Drosophila melanogaster]
gi|16417832|gb|AAL18868.1|AF427473_1 dd4 protein [Drosophila melanogaster]
gi|16198077|gb|AAL13829.1| LD29238p [Drosophila melanogaster]
gi|21626860|gb|AAF57340.2| d4, isoform A [Drosophila melanogaster]
gi|220942560|gb|ACL83823.1| d4-PA [synthetic construct]
gi|220952536|gb|ACL88811.1| d4-PA [synthetic construct]
Length = 497
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 44 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 397 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 456
Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+FC CD YH YC PP G + C
Sbjct: 457 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 486
>gi|270003134|gb|EEZ99581.1| hypothetical protein TcasGA2_TC001567 [Tribolium castaneum]
Length = 481
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ ++SC CG+ H +CL + SW+C C+ C +
Sbjct: 374 CDFCLGDSRENKKTGVMEELVSCSDCGRSGHPSCLLFTENMKISVKKYSWQCIECKCCSV 433
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 434 CGNSDNDDQLLFCDDCDRGYHMYCLSPPLTDPPEGSWSC 472
>gi|405972707|gb|EKC37461.1| Zinc finger protein ubi-d4 [Crassostrea gigas]
Length = 591
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 4/118 (3%)
Query: 9 RRFESGYVATASKDIAGE--EQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKY 64
+R +SGY+ ++++ N C C EN+ + ++SC CG+
Sbjct: 334 KRLDSGYLVEKYEELSHSCILMYMVMANNYCDFCLGDSEENKKSNQPEELVSCSDCGRSG 393
Query: 65 HRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
H CL+ A W+C C+ C +C + + ++ +FC CD YH YC +PP
Sbjct: 394 HPTCLQFTANMIISVKKYPWQCIECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLNPP 451
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 51/112 (45%), Gaps = 2/112 (1%)
Query: 24 AGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW 81
+G ++ + N C C EN+ + ++SC CG+ H CL+ A
Sbjct: 463 SGMDKRDISANNYCDFCLGDSEENKKSNQPEELVSCSDCGRSGHPTCLQFTANMIISVKK 522
Query: 82 SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 523 YPWQCIECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLNPPLSEPPEGNWSC 574
>gi|442622301|ref|NP_001260707.1| d4, isoform D [Drosophila melanogaster]
gi|440214084|gb|AGB93242.1| d4, isoform D [Drosophila melanogaster]
Length = 496
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 52/120 (43%), Gaps = 2/120 (1%)
Query: 16 VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
AT +K E+ + + C C EN+ ++SC CG+ H +CL+ A
Sbjct: 366 FATGNKVKQRVERDIAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTA 425
Query: 74 QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 426 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 485
>gi|24585823|ref|NP_724404.1| d4, isoform C [Drosophila melanogaster]
gi|7302246|gb|AAF57339.1| d4, isoform C [Drosophila melanogaster]
Length = 495
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 52/120 (43%), Gaps = 2/120 (1%)
Query: 16 VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
AT +K E+ + + C C EN+ ++SC CG+ H +CL+ A
Sbjct: 365 FATGNKVKQRVERDIAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTA 424
Query: 74 QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 425 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 484
>gi|357607405|gb|EHJ65481.1| putative requim, req/dpf2 [Danaus plexippus]
Length = 513
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 46/106 (43%), Gaps = 2/106 (1%)
Query: 30 QSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
Q + + C C EN+ R++SC CG+ H CL+ W+C
Sbjct: 400 QVSPSPYCDFCLGDDRENKKTGTPERLVSCSDCGRSGHPTCLQFTVNMIVSVRKYRWQCI 459
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 460 ECKCCSVCGTSDNDDQLLFCDDCDRGYHMYCLAPPLDAPPEGSWSC 505
>gi|383856201|ref|XP_003703598.1| PREDICTED: zinc finger protein ubi-d4-like [Megachile rotundata]
Length = 559
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 37 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 450 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 509
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 510 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 548
>gi|403276505|ref|XP_003929938.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 371
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347
Query: 117 YCQHPPHKNVSSGPYLC 133
+C P K+V + + C
Sbjct: 348 FCLQPVMKSVPTNGWKC 364
>gi|395503977|ref|XP_003756337.1| PREDICTED: zinc finger protein DPF3 [Sarcophilus harrisii]
Length = 375
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N+ R ++SC CG+ H CL+ + W+C
Sbjct: 251 GTVIPNNYCDFCLGGSSMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 310
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 311 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 357
>gi|345803644|ref|XP_854603.2| PREDICTED: zinc finger protein DPF3 [Canis lupus familiaris]
Length = 322
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 198 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 257
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 258 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 304
>gi|426234251|ref|XP_004011110.1| PREDICTED: zinc finger protein DPF3 [Ovis aries]
Length = 408
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 284 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 343
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 344 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 390
>gi|25396154|pir||A88925 protein F33E11.3 [imported] - Caenorhabditis elegans
Length = 1192
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 46/103 (44%), Gaps = 2/103 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M+ C +C YH C++ + L W C CR+C IC + ++ +FC RCD
Sbjct: 437 MICCATCKIAYHPQCIEMPERMAALVKTYEWSCVDCRLCSICNKPEKEDEIVFCDRCDRG 496
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 156
+H YC K + G ++C + + N + +V F
Sbjct: 497 FHTYCVG--LKKLPQGTWICDTYCAIENMKFNRRASAAAVGGF 537
>gi|432891025|ref|XP_004075511.1| PREDICTED: zinc finger protein neuro-d4-like isoform 2 [Oryzias
latipes]
Length = 371
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 250 GSVIANGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 307
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 308 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 353
>gi|390407656|ref|NP_001254554.1| zinc finger protein DPF3 isoform 1 [Mus musculus]
gi|215274004|sp|P58269.2|DPF3_MOUSE RecName: Full=Zinc finger protein DPF3; AltName:
Full=BRG1-associated factor 45C; Short=BAF45C; AltName:
Full=Zinc finger protein cer-d4
gi|26332973|dbj|BAC30204.1| unnamed protein product [Mus musculus]
Length = 378
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 254 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 313
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 314 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360
>gi|410962581|ref|XP_003987847.1| PREDICTED: zinc finger protein DPF3 [Felis catus]
Length = 411
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 287 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 346
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 347 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 393
>gi|341879787|gb|EGT35722.1| hypothetical protein CAEBREN_06378 [Caenorhabditis brenneri]
Length = 375
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 2/81 (2%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
++SC CG+ H +C+ N+ QN + + S W+C C+ C IC + + +K +FC CD
Sbjct: 279 LISCHDCGRSGHPSCM-NFNQNVTKIINRSGWQCLECKSCTICGTSENDDKLLFCDDCDR 337
Query: 113 AYHCYCQHPPHKNVSSGPYLC 133
YH YC P + Y C
Sbjct: 338 GYHLYCLRPALEKAPDDEYSC 358
>gi|268571913|ref|XP_002641182.1| Hypothetical protein CBG09043 [Caenorhabditis briggsae]
Length = 373
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 40/80 (50%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
++SC CG+ H +C+ + S W+C C+ C IC + + +K +FC CD
Sbjct: 277 LVSCHDCGRSGHPSCMSFNQNVTMIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDRG 336
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
YH YC +PP + Y C
Sbjct: 337 YHLYCLNPPLEKAPDDEYSC 356
>gi|126282822|ref|XP_001375927.1| PREDICTED: zinc finger protein DPF3-like [Monodelphis domestica]
Length = 384
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N+ R ++SC CG+ H CL+ + W+C
Sbjct: 260 GTVIPNNYCDFCLGGSSMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 319
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 320 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 366
>gi|296215435|ref|XP_002754118.1| PREDICTED: zinc finger protein DPF3 isoform 1 [Callithrix jacchus]
gi|332229063|ref|XP_003263707.1| PREDICTED: zinc finger protein DPF3 isoform 2 [Nomascus leucogenys]
gi|397507377|ref|XP_003824173.1| PREDICTED: zinc finger protein DPF3 isoform 3 [Pan paniscus]
gi|215274167|sp|Q92784.3|DPF3_HUMAN RecName: Full=Zinc finger protein DPF3; AltName:
Full=BRG1-associated factor 45C; Short=BAF45C; AltName:
Full=Zinc finger protein cer-d4
gi|60459281|gb|AAX20019.1| DPF3 [Homo sapiens]
Length = 378
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 254 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 313
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 314 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360
>gi|297695458|ref|XP_002824959.1| PREDICTED: zinc finger protein DPF3 isoform 2 [Pongo abelii]
Length = 378
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 254 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 313
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 314 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360
>gi|116008472|ref|NP_724405.2| d4, isoform B [Drosophila melanogaster]
gi|113194576|gb|AAM68376.2| d4, isoform B [Drosophila melanogaster]
Length = 339
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 44 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 239 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 298
Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+FC CD YH YC PP G + C
Sbjct: 299 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 328
>gi|403223612|dbj|BAM41742.1| predicted protein [Theileria orientalis strain Shintoku]
Length = 4555
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 59/133 (44%), Gaps = 12/133 (9%)
Query: 105 MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 163
+ C C+ A H C +P N+ + C T+C SCG + W L + C
Sbjct: 1805 VVCVSCNVAAHRGCCNPVVPNLLFIESWKCDSCTQCISCGYRDTTCIEYLNWGLFFFFCL 1864
Query: 164 ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD--- 218
C L + NYC VC KV+ DS + V C+ C+ W+H +CD ++ + D
Sbjct: 1865 KCWELLERSNYCGVCYKVWTNFDSNTQKWVQCEGCKLWIHVECDDLA-----RIITDCPS 1919
Query: 219 -GNLQYRCPTCRG 230
+ YRC CR
Sbjct: 1920 SKSQNYRCCVCRS 1932
>gi|354488945|ref|XP_003506626.1| PREDICTED: zinc finger protein DPF3-like, partial [Cricetulus
griseus]
Length = 367
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 243 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 302
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 303 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 349
>gi|324499809|gb|ADY39928.1| Chromodomain-helicase-DNA-binding protein 3 [Ascaris suum]
Length = 1844
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 51/127 (40%), Gaps = 22/127 (17%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C K YH CL + HWS CPSC
Sbjct: 260 EVCQQGGEIILCDTCPKAYHMVCLDPDMEEAPEGHWS---CPSCEAAGIPQKDEEEEKKV 316
Query: 93 ----EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
E CR D + C C ++YH YC +PP V G + CP+ C N P
Sbjct: 317 ATNMEYCRVCKDVGWLLCCDTCPSSYHAYCMNPPLTEVPEGEWSCPR-CLCPEP-KNRPE 374
Query: 149 NGLSVRW 155
LS RW
Sbjct: 375 KVLSWRW 381
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/113 (24%), Positives = 42/113 (37%), Gaps = 25/113 (22%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
CE+C++ G+ + C C AYH C P + G + CP C + G +P
Sbjct: 259 CEVCQQGGE---IILCDTCPKAYHMVCLDPDMEEAPEGHWSCP---SCEAAG--IPQKD- 309
Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
+ ++ YC VC V ++CCD C H C
Sbjct: 310 ----------EEEEKKVATNMEYCRVCKDV------GWLLCCDTCPSSYHAYC 346
>gi|25148780|ref|NP_498281.2| Protein DPFF-1, isoform a [Caenorhabditis elegans]
gi|22096399|sp|Q09477.2|YP99_CAEEL RecName: Full=Uncharacterized zinc finger protein C28H8.9
gi|351058508|emb|CCD65970.1| Protein DPFF-1, isoform a [Caenorhabditis elegans]
Length = 372
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 2/81 (2%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 112
++SC CG+ H +CL N+ QN + S W+C C+ C IC + + +K +FC CD
Sbjct: 276 LVSCHDCGRSGHPSCL-NFNQNVTKIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDR 334
Query: 113 AYHCYCQHPPHKNVSSGPYLC 133
YH YC P + Y C
Sbjct: 335 GYHLYCLTPALEKAPDDEYSC 355
>gi|417410666|gb|JAA51801.1| Putative phd zn-finger protein, partial [Desmodus rotundus]
Length = 433
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 20 SKDIAGEEQGQSNTNVMCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDL 78
SK + G + ++ N +C +C G+ G +A ++ C CG H +CL + +
Sbjct: 299 SKSVPGY-KPKAPPNALCGICLKGKESGRRGKAESLIHCSQCGNSGHPSCLDMTTELVSM 357
Query: 79 FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
W+C C+ C +C + + MFC CD YH +C
Sbjct: 358 IKTYPWQCMECKTCIVCGQPHHEEEMMFCDVCDRGYHTFC 397
>gi|355778710|gb|EHH63746.1| hypothetical protein EGM_16777, partial [Macaca fascicularis]
Length = 363
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 244 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 303
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 304 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350
>gi|355693412|gb|EHH28015.1| hypothetical protein EGK_18348, partial [Macaca mulatta]
Length = 368
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 244 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 303
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 304 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350
>gi|338719785|ref|XP_001489567.3| PREDICTED: zinc finger protein DPF3-like [Equus caballus]
Length = 415
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 291 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 350
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 351 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 397
>gi|344241713|gb|EGV97816.1| Histone acetyltransferase MYST4 [Cricetulus griseus]
Length = 709
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++A +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|148235385|ref|NP_001079117.1| zinc finger protein ubi-d4 A [Xenopus laevis]
gi|18203564|sp|Q9W638.1|REQUA_XENLA RecName: Full=Zinc finger protein ubi-d4 A; AltName: Full=Apoptosis
response zinc finger protein A; AltName: Full=Protein
requiem A; Short=xReq A
gi|4808462|dbj|BAA77574.1| Requiem protein [Xenopus laevis]
Length = 388
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H +CL+ A W+C
Sbjct: 264 GIALPNNYCDFCLGDSKINKKTNQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQC 323
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 324 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLVPPVAEPPEGSWSC 370
>gi|440907612|gb|ELR57740.1| Zinc finger protein DPF3, partial [Bos grunniens mutus]
Length = 368
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 244 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 303
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 304 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350
>gi|348573139|ref|XP_003472349.1| PREDICTED: zinc finger protein DPF3-like [Cavia porcellus]
Length = 369
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 245 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 304
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 305 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 351
>gi|255075355|ref|XP_002501352.1| set domain protein [Micromonas sp. RCC299]
gi|226516616|gb|ACO62610.1| set domain protein [Micromonas sp. RCC299]
Length = 2166
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 39/69 (56%), Gaps = 2/69 (2%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN- 220
C C +L +G YCPVC +V++ + MV CD C WVHC CD + ++ Q G+
Sbjct: 226 CRLCAKLHREGQYCPVCDRVWQWANCPAMVGCDSCDFWVHCACDEPA-RTVMEAQERGDE 284
Query: 221 LQYRCPTCR 229
+ Y CP CR
Sbjct: 285 VDYHCPRCR 293
>gi|410916367|ref|XP_003971658.1| PREDICTED: zinc finger protein DPF3-like [Takifugu rubripes]
Length = 391
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 44/102 (43%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C + N +A ++SC CG+ H CL+ W+C C+
Sbjct: 273 NDYCDFCLGDQDSNRKTGQAEELVSCSDCGRSGHPTCLQFTDNMMQAVRTYQWQCIECKS 332
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 333 CSICGTSENDDQLLFCDDCDRGYHMYCLKPPMTQPPEGSWSC 374
>gi|195027395|ref|XP_001986568.1| GH21439 [Drosophila grimshawi]
gi|193902568|gb|EDW01435.1| GH21439 [Drosophila grimshawi]
Length = 504
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 2/123 (1%)
Query: 13 SGYVATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLK 70
S +A+ K+ E+ + + C C EN+ ++SC CG+ H +CL+
Sbjct: 371 SNPLASPKKNKLRAERDVAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQ 430
Query: 71 NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP 130
W+C C+ C IC + + ++ +FC CD YH YC PP G
Sbjct: 431 FTPNMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGS 490
Query: 131 YLC 133
+ C
Sbjct: 491 WSC 493
>gi|4808460|dbj|BAA77573.1| Requiem protein [Xenopus laevis]
Length = 290
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 166 GLALPNNYCDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 225
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 226 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 272
>gi|351714578|gb|EHB17497.1| Histone acetyltransferase MYST4 [Heterocephalus glaber]
Length = 2108
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 3/133 (2%)
Query: 3 AAEDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCG 61
AA Y F S + S ++Q +++ +C C E+ ++ +LSC CG
Sbjct: 185 AAPQYPSAFPSS-LPPVSLLPHEKDQPRADPIPICSFCLGTKESNREKKPEELLSCADCG 243
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQH 120
H +CLK + W+C C+ C CR G + + +FC CD +H C
Sbjct: 244 SSGHPSCLKFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD 303
Query: 121 PPHKNVSSGPYLC 133
PP + G ++C
Sbjct: 304 PPLSRMPKGMWIC 316
>gi|195567753|ref|XP_002107423.1| GD15571 [Drosophila simulans]
gi|194204830|gb|EDX18406.1| GD15571 [Drosophila simulans]
Length = 770
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 44/94 (46%), Gaps = 2/94 (2%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+ C +C K+ H +C+ + +W+C C+ C CR + P K +FC +CD
Sbjct: 478 FIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKCRSSQRPGKMLFCEQCDRG 537
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
YH YC K V G + C + C CG+ P
Sbjct: 538 YHIYCLG--LKTVPDGRWSCERCCFCVRCGATKP 569
>gi|403418283|emb|CCM04983.1| predicted protein [Fibroporia radiculosa]
Length = 1278
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 42 VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP 101
VGE E M+SC CG+ H +CLK + + W C C+ CEICR+
Sbjct: 130 VGEPE------LMVSCAECGRSGHPSCLK-LVEMSETIRLYPWICSECKNCEICRKKEGE 182
Query: 102 NKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
N+ + C CD +H C PP + G
Sbjct: 183 NRMIMCDFCDRGWHMDCLQPPLVEMPPG 210
>gi|357463899|ref|XP_003602231.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355491279|gb|AES72482.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 1053
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 47/99 (47%), Gaps = 12/99 (12%)
Query: 139 CHSCGSNVPGN------GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
C +CG +P GL+ G C C RL +YC +C KV S+S V
Sbjct: 389 CEACGLALPYKMSKKIKGLTPN---GQLLCKTCTRLTKSKHYCGICKKVSNHSDSGSWVR 445
Query: 193 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
CD C+ WVH +CD IS + + Y CPTCRG+
Sbjct: 446 CDGCKVWVHAECDKISSNHFKDLE---TTDYFCPTCRGK 481
>gi|355755784|gb|EHH59531.1| hypothetical protein EGM_09668 [Macaca fascicularis]
Length = 340
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 219 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 276
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 277 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 330
>gi|260837382|ref|XP_002613683.1| hypothetical protein BRAFLDRAFT_250354 [Branchiostoma floridae]
gi|229299071|gb|EEN69692.1| hypothetical protein BRAFLDRAFT_250354 [Branchiostoma floridae]
Length = 809
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 45/101 (44%), Gaps = 5/101 (4%)
Query: 36 MCRLCFVGENEGCER---ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
+C C +G E C R A +LSC CG H +CLK Q W+C C+ C
Sbjct: 191 VCSFC-LGTAE-CNRDGQAEELLSCADCGNSGHPSCLKYSPQLTAKVRSMRWQCIDCKTC 248
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C D + +FC CD +H C +PP + G + C
Sbjct: 249 TACENKNDLDNILFCDACDRGFHMKCCNPPLTKMPKGNWEC 289
>gi|194226394|ref|XP_001914899.1| PREDICTED: histone acetyltransferase MYST3 [Equus caballus]
Length = 2012
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK A+ W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSAELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|432924374|ref|XP_004080595.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase
KAT6B-like [Oryzias latipes]
Length = 2014
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E+ +RA +LSC CG H +CLK W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKRAEELLSCADCGSSGHPSCLKFSPDLTSNVKKLRWQCIECKTCSS 273
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNAEEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313
>gi|317419460|emb|CBN81497.1| Histone acetyltransferase MYST4 [Dicentrarchus labrax]
Length = 2149
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E+ +R +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + ++ +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313
>gi|301780976|ref|XP_002925892.1| PREDICTED: zinc finger protein neuro-d4-like isoform 1 [Ailuropoda
melanoleuca]
gi|410983100|ref|XP_003997881.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Felis catus]
Length = 414
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 293 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 350
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 351 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404
>gi|205830430|ref|NP_001128627.1| zinc finger protein neuro-d4 isoform a [Homo sapiens]
gi|297704629|ref|XP_002829197.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Pongo abelii]
gi|395847017|ref|XP_003796183.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Otolemur
garnettii]
gi|402905395|ref|XP_003915505.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Papio anubis]
gi|387540168|gb|AFJ70711.1| zinc finger protein neuro-d4 isoform a [Macaca mulatta]
Length = 414
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 293 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 350
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 351 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404
>gi|432100457|gb|ELK29089.1| Histone acetyltransferase MYST4 [Myotis davidii]
Length = 2022
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|429329891|gb|AFZ81650.1| hypothetical protein BEWA_010670 [Babesia equi]
Length = 3609
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 58/131 (44%), Gaps = 12/131 (9%)
Query: 107 CRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDAC 165
C C+ + H C +PP N+ + C ++C SCG + W L + C C
Sbjct: 1097 CVSCNLSAHRNCCNPPVPNLLFIESWKCDWCSQCISCGYRDTNGTEYLHWGLFFLLCLKC 1156
Query: 166 GRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD----G 219
K N+C VC KV+ DS + V C+ C+ W+H +CD ++ Q D
Sbjct: 1157 WESLEKNNFCGVCYKVWTNYDSVTQKWVQCEGCKLWIHVECDDLA-----QIITDCPSSR 1211
Query: 220 NLQYRCPTCRG 230
+ YRC CR
Sbjct: 1212 SQNYRCKVCRS 1222
>gi|6002696|gb|AAF00100.1|AF119231_1 histone acetyltransferase MORF beta [Homo sapiens]
Length = 2073
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|449505049|ref|XP_002192975.2| PREDICTED: histone acetyltransferase KAT6B [Taeniopygia guttata]
Length = 1842
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|348506563|ref|XP_003440828.1| PREDICTED: zinc finger protein DPF3-like [Oreochromis niloticus]
Length = 391
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 44/102 (43%), Gaps = 2/102 (1%)
Query: 34 NVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
N C C + N +A ++SC CG+ H CL+ W+C C+
Sbjct: 273 NDYCDFCLGDQDSNRKTGQAEELVSCSDCGRSGHPTCLQFTENMMQAVRTYQWQCIECKS 332
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 333 CSLCGTSENDDQLLFCDDCDRGYHMYCLKPPMTQPPEGSWSC 374
>gi|431909715|gb|ELK12873.1| Zinc finger protein neuro-d4 [Pteropus alecto]
Length = 425
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 304 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 361
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 362 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 415
>gi|297459745|ref|XP_001254780.2| PREDICTED: zinc finger protein DPF3 [Bos taurus]
gi|297479887|ref|XP_002691098.1| PREDICTED: zinc finger protein DPF3 [Bos taurus]
gi|296483064|tpg|DAA25179.1| TPA: Zinc finger protein DPF3-like [Bos taurus]
Length = 474
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 350 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 409
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 410 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 456
>gi|197091705|gb|ACH42085.1| PHD zinc finger protein 10 [Crassostrea gigas]
Length = 367
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 10/119 (8%)
Query: 36 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
+C +C G+N + ++ C C K+ H CL + + W+C C+ C C
Sbjct: 224 VCSICTQGQNSEVKGESDLVVCSECNKEGHPGCLDLTNEMVTVIKTYPWQCMDCKTCVEC 283
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
D +K MFC CD YH +C K++ +G H KC SC P N +V+
Sbjct: 284 MDPYDEDKMMFCDLCDRGYHTFCVG--LKSIPTG------HWKCKSCKG--PENPQTVQ 332
>gi|301773210|ref|XP_002922022.1| PREDICTED: histone acetyltransferase MYST4-like [Ailuropoda
melanoleuca]
gi|281342250|gb|EFB17834.1| hypothetical protein PANDA_010953 [Ailuropoda melanoleuca]
Length = 2063
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|187957110|gb|AAI50619.1| MYST histone acetyltransferase (monocytic leukemia) 4 [Homo
sapiens]
Length = 2073
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|100816397|ref|NP_036462.2| histone acetyltransferase KAT6B isoform 1 [Homo sapiens]
gi|143811424|sp|Q8WYB5.3|KAT6B_HUMAN RecName: Full=Histone acetyltransferase KAT6B; AltName:
Full=Histone acetyltransferase MOZ2; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4;
AltName: Full=Monocytic leukemia zinc finger
protein-related factor
gi|119574944|gb|EAW54559.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_c [Homo sapiens]
Length = 2073
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|397483738|ref|XP_003813054.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Pan paniscus]
Length = 2075
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|355782819|gb|EHH64740.1| hypothetical protein EGM_18047 [Macaca fascicularis]
Length = 2069
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|355562477|gb|EHH19071.1| hypothetical protein EGK_19714 [Macaca mulatta]
Length = 2077
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|297301091|ref|XP_002805720.1| PREDICTED: histone acetyltransferase MYST4-like isoform 1 [Macaca
mulatta]
Length = 1893
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|195382643|ref|XP_002050039.1| GJ20409 [Drosophila virilis]
gi|194144836|gb|EDW61232.1| GJ20409 [Drosophila virilis]
Length = 490
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 52/120 (43%), Gaps = 2/120 (1%)
Query: 16 VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
+A+ K+ E+ + + C C EN+ ++SC CG+ H +CL+
Sbjct: 360 LASPKKNKLRAEREVAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTP 419
Query: 74 QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 420 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 479
>gi|344243487|gb|EGV99590.1| Zinc finger protein DPF3 [Cricetulus griseus]
Length = 408
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 44/96 (45%), Gaps = 2/96 (2%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 243 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 302
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPP 122
C+ C +C + + ++ +FC CD YH YC +PP
Sbjct: 303 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPP 338
>gi|332834457|ref|XP_003312688.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Pan troglodytes]
Length = 2070
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|402880388|ref|XP_003903785.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Papio anubis]
Length = 2070
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|341942234|sp|Q8BRB7.3|KAT6B_MOUSE RecName: Full=Histone acetyltransferase KAT6B; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4;
AltName: Full=Protein querkopf
Length = 1872
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|195479715|ref|XP_002100999.1| GE17369 [Drosophila yakuba]
gi|194188523|gb|EDX02107.1| GE17369 [Drosophila yakuba]
Length = 2002
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 37 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1696 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKC 1755
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1756 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCMRCGATKP 1805
>gi|73953062|ref|XP_536397.2| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Canis lupus
familiaris]
Length = 2090
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|348508657|ref|XP_003441870.1| PREDICTED: histone acetyltransferase MYST4 [Oreochromis niloticus]
Length = 2141
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E+ +R +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + ++ +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313
>gi|426365193|ref|XP_004049670.1| PREDICTED: histone acetyltransferase KAT6B [Gorilla gorilla
gorilla]
Length = 2072
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|397483742|ref|XP_003813056.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Pan paniscus]
Length = 1892
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|403293045|ref|XP_003937534.1| PREDICTED: zinc finger protein neuro-d4 [Saimiri boliviensis
boliviensis]
Length = 293
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 172 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 229
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 230 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 283
>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
finger protein HRX) (ALL-1) (Trithorax-like protein)
(Lysine N-methyltransferase 2A) (CXXC-type zinc finger
protein 7) [Ciona intestinalis]
Length = 3406
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 74/200 (37%), Gaps = 64/200 (32%)
Query: 35 VMCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW---SSWKCPSC 89
+C +C F E A ++ C C + YH C + N D+ ++W C C
Sbjct: 1022 TVCLICGSFSSAKSISEGA--LVHCACCCEPYHPFCAEPDFLNTDVLAQMKRNTWICRKC 1079
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
+ C +C G P + CRRC +Y H++C G + P N
Sbjct: 1080 QCCHVC---GHPKNLLVCRRCKKSY---------------------HSEC--LGPSYPTN 1113
Query: 150 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 209
S D ES M+ C C+RWVH +C+ +S
Sbjct: 1114 CYS-----------------------------DEDYESR-MIQCSGCKRWVHSKCESLSV 1143
Query: 210 EKY-LQFQVDGNLQYRCPTC 228
E Y L + N+ Y+CP C
Sbjct: 1144 EMYTLLAHMPNNVTYKCPDC 1163
>gi|374349205|ref|NP_001243397.1| histone acetyltransferase KAT6B isoform 2 [Homo sapiens]
gi|119574942|gb|EAW54557.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_a [Homo sapiens]
Length = 1890
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|6002694|gb|AAF00099.1|AF119230_1 histone acetyltransferase MORF alpha [Homo sapiens]
Length = 1890
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|426255810|ref|XP_004021541.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Ovis aries]
Length = 1869
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|332244078|ref|XP_003271198.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Nomascus leucogenys]
Length = 2055
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 186 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 245
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 246 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 295
>gi|317419461|emb|CBN81498.1| Histone acetyltransferase MYST4 [Dicentrarchus labrax]
Length = 1996
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E+ +R +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + ++ +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313
>gi|110556652|ref|NP_059507.2| histone acetyltransferase KAT6B [Mus musculus]
gi|327365366|ref|NP_001192170.1| histone acetyltransferase KAT6B [Mus musculus]
gi|148669523|gb|EDL01470.1| mCG123147, isoform CRA_a [Mus musculus]
Length = 1763
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|6716789|gb|AAF26744.1|AF222800_1 histone acetyltransferase querkopf [Mus musculus]
Length = 1763
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|74184716|dbj|BAE27963.1| unnamed protein product [Mus musculus]
Length = 1763
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|6002686|gb|AAF00095.1| histone acetyltransferase MORF [Homo sapiens]
gi|20521021|dbj|BAA20837.2| KIAA0383 [Homo sapiens]
gi|152012887|gb|AAI50271.1| MYST4 protein [Homo sapiens]
gi|168267336|dbj|BAG09724.1| MYST histone acetyltransferase 4 [synthetic construct]
Length = 1781
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|338716911|ref|XP_003363544.1| PREDICTED: histone acetyltransferase MYST4 isoform 3 [Equus
caballus]
Length = 1878
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|410975403|ref|XP_003994122.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Felis catus]
Length = 2078
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|390472131|ref|XP_002807481.2| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Callithrix jacchus]
Length = 2066
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|291404131|ref|XP_002718449.1| PREDICTED: MYST histone acetyltransferase (monocytic leukemia) 4
isoform 2 [Oryctolagus cuniculus]
Length = 1774
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|291404129|ref|XP_002718448.1| PREDICTED: MYST histone acetyltransferase (monocytic leukemia) 4
isoform 1 [Oryctolagus cuniculus]
Length = 2065
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|241640825|ref|XP_002409304.1| requim, req/dpf2, putative [Ixodes scapularis]
gi|215501331|gb|EEC10825.1| requim, req/dpf2, putative [Ixodes scapularis]
Length = 379
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 2/105 (1%)
Query: 31 SNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
+N + C C GEN+ + ++SC CG+ H +CL+ W+C
Sbjct: 256 ANPSPYCDFCLGDNGENKKTRQPEELVSCSDCGRSAHPSCLQFTPNMTVSVKKYRWQCIE 315
Query: 89 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 316 CKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLTPPLSEPPEGLWSC 360
>gi|194893051|ref|XP_001977800.1| GG18040 [Drosophila erecta]
gi|190649449|gb|EDV46727.1| GG18040 [Drosophila erecta]
Length = 1982
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 58/137 (42%), Gaps = 4/137 (2%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
G E S C +C ++ + + C +C K+ H +C+ + +
Sbjct: 1671 GNEFSSSTRLSSCGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCIDMPQRMVGRVRNYN 1730
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
W+C C+ C CR + P K ++C +CD YH YC + V G + C + C CG
Sbjct: 1731 WQCAGCKCCIKCRSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCIRCG 1788
Query: 144 SNVPGNGLSVRWFLGYT 160
+ P GL+ L T
Sbjct: 1789 ATKP-EGLAHVAALSQT 1804
>gi|395820446|ref|XP_003783577.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Otolemur
garnettii]
Length = 2062
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|297467918|ref|XP_872746.3| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Bos taurus]
gi|297491533|ref|XP_002698931.1| PREDICTED: histone acetyltransferase KAT6B [Bos taurus]
gi|296472060|tpg|DAA14175.1| TPA: MYST histone acetyltransferase (monocytic leukemia) 4 [Bos
taurus]
Length = 2054
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|403298004|ref|XP_003939830.1| PREDICTED: histone acetyltransferase KAT6B [Saimiri boliviensis
boliviensis]
Length = 2051
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|402880392|ref|XP_003903787.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Papio anubis]
Length = 1887
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|397483740|ref|XP_003813055.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Pan paniscus]
Length = 1783
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|395820450|ref|XP_003783579.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Otolemur
garnettii]
Length = 1880
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|440893247|gb|ELR46092.1| Histone acetyltransferase MYST4 [Bos grunniens mutus]
Length = 2054
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|68565919|sp|Q8WML3.1|KAT6B_MACFA RecName: Full=Histone acetyltransferase KAT6B; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4
gi|17025966|dbj|BAB72094.1| histone acetyltransferase MORF [Macaca fascicularis]
Length = 1784
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|395501560|ref|XP_003755161.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Sarcophilus
harrisii]
Length = 1862
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|374349207|ref|NP_001243398.1| histone acetyltransferase KAT6B isoform 3 [Homo sapiens]
gi|119574943|gb|EAW54558.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_b [Homo sapiens]
Length = 1781
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|410053854|ref|XP_001162157.3| PREDICTED: zinc finger protein neuro-d4 [Pan troglodytes]
Length = 519
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 398 GTVIPNGYCDFCLGGSKKTGCPED--LISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 455
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 456 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 509
>gi|194042830|ref|XP_001928984.1| PREDICTED: histone acetyltransferase MYST4 [Sus scrofa]
Length = 2065
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|147904561|ref|NP_001081346.1| zinc finger protein ubi-d4 B [Xenopus laevis]
gi|47682308|gb|AAH70839.1| LOC397786 protein [Xenopus laevis]
gi|52078456|gb|AAH82478.1| LOC397786 protein [Xenopus laevis]
Length = 387
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 263 GLALPNNYCDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 322
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 323 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 369
>gi|417406752|gb|JAA50020.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
Length = 1807
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|380798715|gb|AFE71233.1| zinc finger protein neuro-d4 isoform a, partial [Macaca mulatta]
Length = 407
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 286 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 343
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 344 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 397
>gi|341888296|gb|EGT44231.1| hypothetical protein CAEBREN_09890 [Caenorhabditis brenneri]
gi|341902393|gb|EGT58328.1| hypothetical protein CAEBREN_07005 [Caenorhabditis brenneri]
Length = 454
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 3/89 (3%)
Query: 46 EGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 104
+ CER M+ C +C YH +C++ + + W C CR+C +C + G+ N+
Sbjct: 339 DSCERVGGEMVCCATCNIAYHPHCIEMPERMAMIVKTYEWSCVDCRVCSVCHKPGEENEV 398
Query: 105 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+FC RCD +H C K+ G ++C
Sbjct: 399 VFCDRCDRGFHNSCVG--LKSTPIGSWIC 425
>gi|149689991|ref|XP_001504001.1| PREDICTED: histone acetyltransferase MYST4 isoform 1 [Equus
caballus]
Length = 2061
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|348576162|ref|XP_003473856.1| PREDICTED: histone acetyltransferase MYST4-like isoform 1 [Cavia
porcellus]
Length = 2053
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|45383492|ref|NP_989662.1| zinc finger protein ubi-d4 [Gallus gallus]
gi|18202299|sp|P58268.1|REQU_CHICK RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName: Full=Protein
requiem
gi|14010356|gb|AAK51965.1|AF362751_1 requiem [Gallus gallus]
Length = 405
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 279 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 338
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 339 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPPMSEPPEGSWSC 385
>gi|402880390|ref|XP_003903786.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Papio anubis]
Length = 1778
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|119577164|gb|EAW56760.1| D4, zinc and double PHD fingers family 1, isoform CRA_a [Homo
sapiens]
Length = 425
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 304 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 361
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 362 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 415
>gi|417406735|gb|JAA50012.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
Length = 1778
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|395820448|ref|XP_003783578.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Otolemur
garnettii]
Length = 1771
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|395501558|ref|XP_003755160.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Sarcophilus
harrisii]
Length = 2045
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|426255806|ref|XP_004021539.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Ovis aries]
Length = 2052
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|297301093|ref|XP_002805721.1| PREDICTED: histone acetyltransferase MYST4-like isoform 2 [Macaca
mulatta]
Length = 1784
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|21391996|gb|AAM48352.1| LD10526p [Drosophila melanogaster]
Length = 1843
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 37 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1532 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1591
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1592 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1641
>gi|356570970|ref|XP_003553655.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C +L YC +C K++ S+ VCCD C WVH +CD IS + + + N
Sbjct: 358 CKPCAKLIKSRQYCGICKKIWHHSDGGNWVCCDGCNVWVHAECDKISSKLFKDLE---NA 414
Query: 222 QYRCPTCRGECYQVRDLEDAVRELWRRKDMA 252
Y CP C+G+ + E + + ++ K+++
Sbjct: 415 DYYCPDCKGK----FNYESSTSQTYKSKNIS 441
>gi|126272817|ref|XP_001366112.1| PREDICTED: histone acetyltransferase MYST4 [Monodelphis domestica]
Length = 2045
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|324512595|gb|ADY45214.1| Zinc finger protein [Ascaris suum]
Length = 382
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 49/104 (47%), Gaps = 2/104 (1%)
Query: 32 NTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC 89
+ +++C LC EN+ +++SC CG+ H +CLK + W+C C
Sbjct: 260 DVSLICDLCLGDRNENKKTSLPEQLVSCHDCGRSGHPSCLKFTENMITSTNKYGWQCIEC 319
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ C IC + + ++ +FC CD +H YC P G + C
Sbjct: 320 KSCAICGTSDNDDQLLFCDDCDRGFHLYCLRPRLATAPEGEWSC 363
>gi|297298207|ref|XP_002805197.1| PREDICTED: hypothetical protein LOC694878 [Macaca mulatta]
Length = 472
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G N C C G N + R ++SC CG+ H CL+ + W+C
Sbjct: 348 GTVIPNNYCDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQC 407
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 408 IECKSCILCGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 454
>gi|302141752|emb|CBI18955.3| unnamed protein product [Vitis vinifera]
Length = 795
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 11/117 (9%)
Query: 22 DIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKS--C-GKKYHRNCLKNWAQNRDL 78
++ ++Q ++ +C++C E +L C C K YH++CL +
Sbjct: 626 EVGIQQQKETKYFQLCKIC----GSDMEFGEHLLECGHPFCPNKYYHKSCLTS---TELR 678
Query: 79 FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
+ W CPSC +C C D K + C CD AYH YC +PP ++ G + C K
Sbjct: 679 MYGPCWYCPSC-LCRACLTDRDDEKIILCDGCDHAYHIYCMNPPRTSIPRGKWFCRK 734
>gi|426255808|ref|XP_004021540.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Ovis aries]
Length = 1760
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|45550083|ref|NP_608334.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|442616987|ref|NP_001259718.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|442616993|ref|NP_001259721.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
gi|62901062|sp|Q9VWF2.3|SAYP_DROME RecName: Full=Supporter of activation of yellow protein; AltName:
Full=Protein enhancer of yellow 3
gi|45447061|gb|AAF48990.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|257153436|gb|ACV44475.1| LD27440p [Drosophila melanogaster]
gi|440216955|gb|AGB95558.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|440216958|gb|AGB95561.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
Length = 2006
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 37 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1695 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1754
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1755 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1804
>gi|395501556|ref|XP_003755159.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Sarcophilus
harrisii]
Length = 1753
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|338716908|ref|XP_003363543.1| PREDICTED: histone acetyltransferase MYST4 isoform 2 [Equus
caballus]
Length = 1769
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|442616989|ref|NP_001259719.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
gi|440216956|gb|AGB95559.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
Length = 2012
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 37 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1701 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1760
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1761 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1810
>gi|395741628|ref|XP_002820847.2| PREDICTED: histone acetyltransferase KAT6B, partial [Pongo abelii]
Length = 1870
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 8 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 67
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 68 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 107
>gi|442616991|ref|NP_001259720.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
gi|440216957|gb|AGB95560.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
Length = 2011
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 37 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1700 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1759
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1760 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1809
>gi|431904096|gb|ELK09518.1| Histone acetyltransferase MYST4 [Pteropus alecto]
Length = 1926
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNRDKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|344274631|ref|XP_003409118.1| PREDICTED: histone acetyltransferase MYST4 [Loxodonta africana]
Length = 1878
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|56790323|ref|NP_001007153.1| D4, zinc and double PHD fingers family 2 [Danio rerio]
gi|54035542|gb|AAH83281.1| D4, zinc and double PHD fingers family 2 [Danio rerio]
gi|182890270|gb|AAI65788.1| Dpf2 protein [Danio rerio]
Length = 400
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H +CL+ W+C
Sbjct: 272 GLALPNNYCDFCLGDSSLNQKTGQSEELVSCSDCGRSGHPSCLQFTPIMMAAVKTYRWQC 331
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 332 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPMSVPPEGSWSC 378
>gi|426235049|ref|XP_004011503.1| PREDICTED: PHD finger protein 10 [Ovis aries]
Length = 451
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 41/86 (47%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C CG H +CL A+ + W+C C+ C
Sbjct: 330 NALCGICLKGKESSKRGKAEPLVHCSQCGNSGHPSCLDMPAELVSMIKTYPWQCMECKAC 389
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
+C + + MFC CD YH +C
Sbjct: 390 IVCGQPHHEEEMMFCDVCDRGYHTFC 415
>gi|348518782|ref|XP_003446910.1| PREDICTED: zinc finger protein ubi-d4-like [Oreochromis niloticus]
Length = 399
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N ++ ++SC CG+ H +CL+ W+C
Sbjct: 271 GLALPNNYCDFCLGDSKTNHKTGQSEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 330
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 331 IECKCCNMCGTSENDDQLLFCDDCDRGYHMYCLNPPMSEPPEGSWSC 377
>gi|13431818|sp|Q9QX66.2|DPF1_MOUSE RecName: Full=Zinc finger protein neuro-d4; AltName:
Full=BRG1-associated factor 45B; Short=BAF45B; AltName:
Full=D4, zinc and double PHD fingers family 1
Length = 387
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 266 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 323
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 324 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377
>gi|18203563|sp|Q9W636.2|REQUB_XENLA RecName: Full=Zinc finger protein ubi-d4 B; AltName: Full=Apoptosis
response zinc finger protein B; AltName: Full=Protein
requiem B; Short=xReq B
Length = 366
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 242 GLALPNNYCDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 301
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 302 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 348
>gi|335289596|ref|XP_003355927.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Sus scrofa]
gi|115527907|gb|AAI25154.1| DPF1 protein [Homo sapiens]
gi|119577167|gb|EAW56763.1| D4, zinc and double PHD fingers family 1, isoform CRA_d [Homo
sapiens]
gi|208966118|dbj|BAG73073.1| D4, zinc and double PHD fingers family 1 [synthetic construct]
Length = 387
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 266 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 323
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 324 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377
>gi|347800720|ref|NP_001099199.2| zinc finger protein neuro-d4 [Rattus norvegicus]
Length = 387
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 266 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 323
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 324 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377
>gi|327276821|ref|XP_003223166.1| PREDICTED: histone acetyltransferase MYST4-like [Anolis
carolinensis]
Length = 2024
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCLELTTNVKALRWQCIECKTCSA 273
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 313
>gi|7305309|ref|NP_038902.1| zinc finger protein neuro-d4 [Mus musculus]
gi|6649546|gb|AAF21455.1|U48238_1 zinc finger protein neuro-d4 [Mus musculus]
gi|30481687|gb|AAH52348.1| D4, zinc and double PHD fingers family 1 [Mus musculus]
gi|148692120|gb|EDL24067.1| neuronal d4 domain family member [Mus musculus]
Length = 388
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 267 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 324
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 325 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378
>gi|348576164|ref|XP_003473857.1| PREDICTED: histone acetyltransferase MYST4-like isoform 2 [Cavia
porcellus]
Length = 1762
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|205830434|ref|NP_001128628.1| zinc finger protein neuro-d4 isoform c [Homo sapiens]
gi|395751088|ref|XP_002829195.2| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Pongo abelii]
gi|395751090|ref|XP_003779218.1| PREDICTED: zinc finger protein neuro-d4 [Pongo abelii]
gi|395847019|ref|XP_003796184.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Otolemur
garnettii]
gi|395847023|ref|XP_003796186.1| PREDICTED: zinc finger protein neuro-d4 isoform 4 [Otolemur
garnettii]
gi|402905397|ref|XP_003915506.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Papio anubis]
gi|402905399|ref|XP_003915507.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Papio anubis]
gi|410983102|ref|XP_003997882.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Felis catus]
gi|193787694|dbj|BAG52900.1| unnamed protein product [Homo sapiens]
Length = 332
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 211 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 268
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 269 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 322
>gi|432897021|ref|XP_004076387.1| PREDICTED: zinc finger protein ubi-d4-like [Oryzias latipes]
Length = 399
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N ++ ++SC CG+ H +CL+ W+C
Sbjct: 271 GLALPNNYCDFCLGDSKTNHKTGQSEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 330
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 331 IECKCCNMCGTSENDDQLLFCDDCDRGYHMYCLNPPMSEPPEGSWSC 377
>gi|126010798|gb|AAI33649.1| DPF1 protein [Bos taurus]
Length = 388
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 267 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 324
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 325 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378
>gi|116003927|ref|NP_001070323.1| zinc finger protein neuro-d4 [Bos taurus]
gi|115305354|gb|AAI23606.1| D4, zinc and double PHD fingers family 1 [Bos taurus]
Length = 387
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 266 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 323
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 324 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377
>gi|432877939|ref|XP_004073268.1| PREDICTED: zinc finger protein ubi-d4-like [Oryzias latipes]
Length = 407
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H CL+ W+C
Sbjct: 278 GLALPNDYCDFCLGDSTLNQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQC 337
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 338 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLTPPMTEPPEGSWSC 384
>gi|189526911|ref|XP_697383.3| PREDICTED: hypothetical protein LOC568932 [Danio rerio]
Length = 2011
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ +R +LSC CG H +CLK A W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSADLTANVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C C+ G + ++ +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSSCQIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|149056396|gb|EDM07827.1| neuronal d4 domain family member, isoform CRA_b [Rattus norvegicus]
gi|166796914|gb|AAI59416.1| D4, zinc and double PHD fingers family 1 [Rattus norvegicus]
Length = 332
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 211 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 268
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 269 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 322
>gi|345305893|ref|XP_001506182.2| PREDICTED: histone acetyltransferase MYST4 isoform 1
[Ornithorhynchus anatinus]
Length = 2066
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 49/109 (44%), Gaps = 2/109 (1%)
Query: 27 EQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK 85
+Q +++ +C C E+ ++ +LSC CG H +CLK + W+
Sbjct: 206 DQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQ 265
Query: 86 CPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C C+ C CR G + +FC CD +H C PP + G ++C
Sbjct: 266 CIECKTCSACRIQGKNAENMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|302843023|ref|XP_002953054.1| hypothetical protein VOLCADRAFT_105771 [Volvox carteri f.
nagariensis]
gi|300261765|gb|EFJ45976.1| hypothetical protein VOLCADRAFT_105771 [Volvox carteri f.
nagariensis]
Length = 2579
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 60/134 (44%), Gaps = 16/134 (11%)
Query: 14 GYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
G A D AG +G + + + R+C + + +S +SCG++ N + N
Sbjct: 1226 GLGARLESDKAGIPEG-AQSGLRRRVCVGAQQHPPWDSAEEMSIRSCGQRSRCNIIHNLL 1284
Query: 74 QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
R LF CP R+C + D N+ + C CD YHCYC PP V +G + C
Sbjct: 1285 --RLLF-----SCP--RVCWLDE---DKNRILLCDGCDGEYHCYCVEPPLLEVPAGAWFC 1332
Query: 134 PKHTKCHSCGSNVP 147
P C + G +P
Sbjct: 1333 P---SCTARGLGLP 1343
>gi|260830577|ref|XP_002610237.1| hypothetical protein BRAFLDRAFT_245816 [Branchiostoma floridae]
gi|229295601|gb|EEN66247.1| hypothetical protein BRAFLDRAFT_245816 [Branchiostoma floridae]
Length = 354
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 48/112 (42%), Gaps = 5/112 (4%)
Query: 23 IAGEEQGQSNTNVMCRLCFVGENEGCERA--RRMLSCKSCGKKYHRNCLKNWAQNRDLFH 80
+ + G SN C C E E + ++SC CG+ H CL+ +
Sbjct: 244 VPAKGSGPSN---YCDFCLGDEKENKKSGVPEELISCSDCGRSGHPTCLQFTPHMTESVK 300
Query: 81 WSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 132
W+C C+ C +C + + ++ +FC CD YH YC PP + G L
Sbjct: 301 KYRWQCIECKSCHLCGTSENDDQLLFCDDCDRGYHMYCLSPPLSSPPEGKTL 352
>gi|348514482|ref|XP_003444769.1| PREDICTED: zinc finger protein ubi-d4-like [Oreochromis niloticus]
Length = 405
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H CL+ W+C
Sbjct: 276 GLALPNDYCDFCLGDSTLNQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQC 335
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 336 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLTPPMTEPPEGSWSC 382
>gi|270013653|gb|EFA10101.1| hypothetical protein TcasGA2_TC012280 [Tribolium castaneum]
Length = 559
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 57/131 (43%), Gaps = 18/131 (13%)
Query: 24 AGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
+G EQG +C +C V G RM++C+ C H +CL ++ ++
Sbjct: 283 SGREQGG-----VCSVCHVQNKMGPND--RMVACRECT---HYSCLNGDDMMLRMYPDNT 332
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP-PHKNVSSGP--YLC-----PK 135
W+CP C+ C IC T D C C AYH C P H+ P +LC P+
Sbjct: 333 WQCPHCKTCVICFETSDAGYLTVCAVCADAYHAGCHQPRIHEKFIKPPAKWLCINCEMPE 392
Query: 136 HTKCHSCGSNV 146
K + SN+
Sbjct: 393 ELKINEIQSNI 403
>gi|410895483|ref|XP_003961229.1| PREDICTED: histone acetyltransferase KAT6B-like [Takifugu rubripes]
Length = 2123
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKQPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + ++ +FC CD +H C +PP + G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCNPPLSRMPKGTWIC 313
>gi|308497276|ref|XP_003110825.1| hypothetical protein CRE_04507 [Caenorhabditis remanei]
gi|308242705|gb|EFO86657.1| hypothetical protein CRE_04507 [Caenorhabditis remanei]
Length = 372
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 38/80 (47%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
++SC CG+ H +CL + S W+C C+ C IC + + +K +FC CD
Sbjct: 276 LVSCHDCGRSGHPSCLSFNENVTKIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDRG 335
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
YH YC P + Y C
Sbjct: 336 YHLYCLRPALEKAPDDEYSC 355
>gi|195425644|ref|XP_002061104.1| GK10624 [Drosophila willistoni]
gi|194157189|gb|EDW72090.1| GK10624 [Drosophila willistoni]
Length = 515
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 43/99 (43%), Gaps = 2/99 (2%)
Query: 37 CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
C C EN+ ++SC CG+ H +CL+ W+C C+ C I
Sbjct: 406 CDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTPNMIISVKRYRWQCIECKYCSI 465
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C + + ++ +FC CD YH YC PP G + C
Sbjct: 466 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 504
>gi|440894955|gb|ELR47273.1| Zinc finger protein neuro-d4, partial [Bos grunniens mutus]
Length = 383
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 262 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 319
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 320 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 373
>gi|37998957|dbj|BAD00088.1| chimeric MOZ-ASXH2 fusion protein [Homo sapiens]
Length = 2228
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|115533182|ref|NP_001041113.1| Protein PHF-10, isoform b [Caenorhabditis elegans]
gi|351062484|emb|CCD70456.1| Protein PHF-10, isoform b [Caenorhabditis elegans]
Length = 447
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 2/80 (2%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M+ C +C YH C++ + L W C CR+C IC + ++ +FC RCD
Sbjct: 345 MICCATCKIAYHPQCIEMPERMAALVKTYEWSCVDCRLCSICNKPEKEDEIVFCDRCDRG 404
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
+H YC K + G ++C
Sbjct: 405 FHTYCVG--LKKLPQGTWIC 422
>gi|195122592|ref|XP_002005795.1| GI20660 [Drosophila mojavensis]
gi|193910863|gb|EDW09730.1| GI20660 [Drosophila mojavensis]
Length = 492
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 51/120 (42%), Gaps = 2/120 (1%)
Query: 16 VATASKDIAGEEQGQSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWA 73
+A+ K E+ + + C C EN+ ++SC CG+ H +CL+
Sbjct: 362 LASPKKPKLRTEREVAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTP 421
Query: 74 QNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 422 NMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 481
>gi|410913627|ref|XP_003970290.1| PREDICTED: zinc finger protein ubi-d4-like [Takifugu rubripes]
Length = 407
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H CL+ W+C
Sbjct: 278 GLALPNDYCDFCLGDSTLNQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQC 337
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 338 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLSPPMTEPPEGSWSC 384
>gi|358338934|dbj|GAA39327.2| zinc finger protein ubi-d4, partial [Clonorchis sinensis]
Length = 331
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 44/105 (41%), Gaps = 2/105 (1%)
Query: 31 SNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPS 88
S+T C C N+ + ML C CG+ H CL+ A W+C
Sbjct: 165 SDTEYACDFCLGDARLNKKTGQPEDMLRCSDCGRCAHFTCLQFTANMVSSVRTYRWQCIE 224
Query: 89 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + + +FC CD YH YC PP G + C
Sbjct: 225 CKTCWLCGTSENDEQMLFCDDCDRGYHMYCLSPPLSEPPEGSWSC 269
>gi|324512933|gb|ADY45341.1| PHD finger protein 10 [Ascaris suum]
Length = 481
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 54/126 (42%), Gaps = 25/126 (19%)
Query: 30 QSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCP 87
QS C +C G + +MLSC +C K H +C L N L + W C
Sbjct: 344 QSGNENRCSIC------GVTNSAQMLSCATCSTKVHPDCAGLPERVVNVALNYM--WSCI 395
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPK-------HT 137
C+ C +C + + + MFC RCD YH +C PP +G ++C +
Sbjct: 396 ECKKCTVCEKPDNEDAMMFCDRCDRGYHTFCVGLSAPP-----TGTWVCTNFCADQTIQS 450
Query: 138 KCHSCG 143
KC+ C
Sbjct: 451 KCNKCS 456
>gi|242000110|ref|XP_002434698.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
gi|215498028|gb|EEC07522.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
Length = 1882
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 14/101 (13%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
E C++ ++ C +C + YH CL+ + WS CP C E
Sbjct: 421 EVCQQGGEIILCDTCPRAYHLVCLEPELEEPPEGKWS---CPHCEGEGIQEQEEDEHMEF 477
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
CR D + + C C AA+H +C +PP KNV +G + CP+
Sbjct: 478 CRVCKDGGELLCCDSCPAAFHTFCLNPPLKNVPTGKWNCPR 518
>gi|410915434|ref|XP_003971192.1| PREDICTED: zinc finger protein ubi-d4-like [Takifugu rubripes]
Length = 398
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N ++ ++SC CG+ H +CL+ W+C
Sbjct: 270 GIALPNNYCDFCLGDSKTNHKTGQSEELVSCSDCGRSGHPSCLQFTPIMMAAVKTYRWQC 329
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 330 IECKCCNMCGTSENDDQLLFCDDCDRGYHMYCLNPPMSEPPEGSWSC 376
>gi|33869836|gb|AAH21191.1| DPF1 protein, partial [Homo sapiens]
Length = 414
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 293 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 350
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 351 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404
>gi|301765980|ref|XP_002918393.1| PREDICTED: histone acetyltransferase MYST3-like [Ailuropoda
melanoleuca]
Length = 1702
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|356551207|ref|XP_003543969.1| PREDICTED: uncharacterized protein LOC100786712 [Glycine max]
Length = 525
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH CL + + + + W CPSC IC++C D +K + C CD AYH YC P
Sbjct: 376 KYYHVRCL---SSKQLKSYGNCWYCPSC-ICQVCLTDKDDDKIVLCDGCDHAYHIYCMKP 431
Query: 122 PHKNVSSGPYLCPK 135
P ++ G + C K
Sbjct: 432 PQNSIPKGKWFCIK 445
>gi|356522510|ref|XP_003529889.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 989
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 161 CCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN 220
CC C +L YC +C +++ S+ VCCD C WVH +CD IS + + + N
Sbjct: 360 CCKYCSKLRKSKQYCGICKRIWHHSDGGNWVCCDGCNVWVHAECDKISSKVFKDLE---N 416
Query: 221 LQYRCPTCRGE 231
Y CP C+G+
Sbjct: 417 TDYYCPDCKGK 427
>gi|442623365|ref|NP_001260899.1| toutatis, isoform G [Drosophila melanogaster]
gi|440214304|gb|AGB93432.1| toutatis, isoform G [Drosophila melanogaster]
Length = 3094
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
SN+ C+ C GENE ++L C C K YH C K N W ++C
Sbjct: 2598 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2652
Query: 87 PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
+ R C +C R K ++C C AYH C PP V G + CH C S
Sbjct: 2653 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2706
Query: 145 NVP 147
P
Sbjct: 2707 RAP 2709
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2600 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2659
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
CG + P + + CD C R + Y P LKV R
Sbjct: 2660 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2696
>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
Length = 2227
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 14/101 (13%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
E C++ ++ C +C K YH CL ++ WS CP+C E
Sbjct: 460 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 516
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
CR D + + C C +AYH +C +PP ++ G + CP+
Sbjct: 517 CRICKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGDWRCPR 557
>gi|194883931|ref|XP_001976050.1| GG22641 [Drosophila erecta]
gi|190659237|gb|EDV56450.1| GG22641 [Drosophila erecta]
Length = 3148
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
SN+ C+ C GENE ++L C C K YH C K N W ++C
Sbjct: 2644 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2698
Query: 87 PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
+ R C +C R K ++C C AYH C PP V G + CH C S
Sbjct: 2699 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2752
Query: 145 NVP 147
P
Sbjct: 2753 RAP 2755
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2646 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2705
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
CG + P + + CD C R + Y P LKV R
Sbjct: 2706 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2742
>gi|161076540|ref|NP_001097270.1| toutatis, isoform E [Drosophila melanogaster]
gi|157400285|gb|ABV53763.1| toutatis, isoform E [Drosophila melanogaster]
Length = 3131
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
SN+ C+ C GENE ++L C C K YH C K N W ++C
Sbjct: 2635 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2689
Query: 87 PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
+ R C +C R K ++C C AYH C PP V G + CH C S
Sbjct: 2690 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2743
Query: 145 NVP 147
P
Sbjct: 2744 RAP 2746
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2637 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2696
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
CG + P + + CD C R + Y P LKV R
Sbjct: 2697 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2733
>gi|195485690|ref|XP_002091194.1| GE13512 [Drosophila yakuba]
gi|194177295|gb|EDW90906.1| GE13512 [Drosophila yakuba]
Length = 3129
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
SN+ C+ C GENE ++L C C K YH C K N W ++C
Sbjct: 2630 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2684
Query: 87 PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
+ R C +C R K ++C C AYH C PP V G + CH C S
Sbjct: 2685 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2738
Query: 145 NVP 147
P
Sbjct: 2739 RAP 2741
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2632 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2691
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
CG + P + + CD C R + Y P LKV R
Sbjct: 2692 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2728
>gi|442623363|ref|NP_001260898.1| toutatis, isoform F [Drosophila melanogaster]
gi|440214303|gb|AGB93431.1| toutatis, isoform F [Drosophila melanogaster]
Length = 3058
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
SN+ C+ C GENE ++L C C K YH C K N W ++C
Sbjct: 2562 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2616
Query: 87 PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
+ R C +C R K ++C C AYH C PP V G + CH C S
Sbjct: 2617 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2670
Query: 145 NVP 147
P
Sbjct: 2671 RAP 2673
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2564 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2623
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
CG + P + + CD C R + Y P LKV R
Sbjct: 2624 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2660
>gi|417406854|gb|JAA50068.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
Length = 2010
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|443693752|gb|ELT95039.1| hypothetical protein CAPTEDRAFT_19718 [Capitella teleta]
Length = 421
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 48/106 (45%), Gaps = 2/106 (1%)
Query: 30 QSNTNVMCRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
+S + C C EN+ ++SC CG+ H +CL+ A W+C
Sbjct: 299 RSKASRYCDFCLGDDIENKKTGMKEELVSCADCGRSGHPSCLQFTANMIISVKKYPWQCI 358
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC +PP G + C
Sbjct: 359 ECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLNPPLSEPPEGSWSC 404
>gi|195399353|ref|XP_002058285.1| GJ15575 [Drosophila virilis]
gi|194150709|gb|EDW66393.1| GJ15575 [Drosophila virilis]
Length = 2162
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 6/112 (5%)
Query: 40 CFVGENEGCERARRM----LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C V + AR M + C SC ++ H +C++ + +W+C C+ C C
Sbjct: 1896 CGVCQRTQHRNARNMPEAFIRCYSCRRRVHPSCIEMPHRMVGRVRNYNWQCADCKCCIKC 1955
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+ P K ++C +CD YH YC K V G + C + + C CG+ P
Sbjct: 1956 GSSQQPGKMLYCEQCDRGYHIYCLG--VKTVPEGRWSCERCSICMRCGATRP 2005
>gi|380792697|gb|AFE68224.1| histone acetyltransferase KAT6B, partial [Macaca mulatta]
Length = 1077
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|392333207|ref|XP_003752828.1| PREDICTED: histone acetyltransferase KAT6B-like [Rattus norvegicus]
Length = 1855
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ ++SC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELVSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|6648954|gb|AAF21305.1|AF108133_1 neuro-d4 [Mus musculus]
Length = 323
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 202 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAPVRTYRWQCI 259
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 260 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 313
>gi|148232579|ref|NP_001090745.1| D4, zinc and double PHD fingers family 1 [Xenopus (Silurana)
tropicalis]
gi|120537304|gb|AAI29026.1| dpf1 protein [Xenopus (Silurana) tropicalis]
Length = 383
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 262 GSVIANGYCDFCLGGAKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 319
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 320 ECKSCILCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 365
>gi|354482358|ref|XP_003503365.1| PREDICTED: histone acetyltransferase MYST3 [Cricetulus griseus]
Length = 1992
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSTELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|47225244|emb|CAG09744.1| unnamed protein product [Tetraodon nigroviridis]
Length = 393
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H CL+ W+C
Sbjct: 267 GLALPNDYCDFCLGDSTLNQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQC 326
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 327 IECKCCNVCGTSENDDQLLFCDDCDRGYHMYCLSPPMTEPPEGSWSC 373
>gi|395535344|ref|XP_003769687.1| PREDICTED: PHD finger protein 10 [Sarcophilus harrisii]
Length = 598
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N MC +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 477 NAMCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSIIKTYPWQCMECKTC 536
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 537 IICGQPHHEEEMMFCDVCDRGYHTFC 562
>gi|12642598|gb|AAK00302.1|AF314193_1 Toutatis [Drosophila melanogaster]
Length = 3109
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
SN+ C+ C GENE ++L C C K YH C K N W ++C
Sbjct: 2584 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2638
Query: 87 PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
+ R C +C R K ++C C AYH C PP V G + CH C S
Sbjct: 2639 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIS 2692
Query: 145 NVP 147
P
Sbjct: 2693 RAP 2695
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2586 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2645
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
CG + P + + CD C R + Y P LKV R
Sbjct: 2646 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2682
>gi|334312613|ref|XP_001373063.2| PREDICTED: histone acetyltransferase MYST3 [Monodelphis domestica]
Length = 1951
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|355697899|gb|EHH28447.1| Histone acetyltransferase MYST3 [Macaca mulatta]
Length = 2099
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 304 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 363
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 364 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 403
>gi|345781619|ref|XP_003432152.1| PREDICTED: histone acetyltransferase KAT6A [Canis lupus familiaris]
Length = 2017
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|432099934|gb|ELK28828.1| Histone acetyltransferase MYST3 [Myotis davidii]
Length = 1861
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|426359475|ref|XP_004046999.1| PREDICTED: histone acetyltransferase KAT6A [Gorilla gorilla
gorilla]
Length = 2005
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|426256598|ref|XP_004023426.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
[Ovis aries]
Length = 1931
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|119918267|ref|XP_874495.2| PREDICTED: histone acetyltransferase KAT6A [Bos taurus]
gi|297491293|ref|XP_002698753.1| PREDICTED: histone acetyltransferase KAT6A [Bos taurus]
gi|296472344|tpg|DAA14459.1| TPA: MYST histone acetyltransferase (monocytic leukemia) 3 [Bos
taurus]
Length = 2018
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|150378463|ref|NP_001092882.1| histone acetyltransferase KAT6A [Homo sapiens]
gi|150378493|ref|NP_006757.2| histone acetyltransferase KAT6A [Homo sapiens]
gi|150378543|ref|NP_001092883.1| histone acetyltransferase KAT6A [Homo sapiens]
gi|215274095|sp|Q92794.2|KAT6A_HUMAN RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
AltName: Full=Monocytic leukemia zinc finger protein;
AltName: Full=Runt-related transcription factor-binding
protein 2; AltName: Full=Zinc finger protein 220
gi|119583643|gb|EAW63239.1| MYST histone acetyltransferase (monocytic leukemia) 3, isoform
CRA_a [Homo sapiens]
gi|208965270|dbj|BAG72649.1| MYST histone acetyltransferase (monocytic leukemia) 3 [synthetic
construct]
gi|225000792|gb|AAI72379.1| MYST histone acetyltransferase (monocytic leukemia) 3 [synthetic
construct]
Length = 2004
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|297682771|ref|XP_002819083.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
[Pongo abelii]
Length = 2010
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|403303692|ref|XP_003942458.1| PREDICTED: histone acetyltransferase KAT6A [Saimiri boliviensis
boliviensis]
Length = 1968
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|395507492|ref|XP_003758058.1| PREDICTED: histone acetyltransferase KAT6A [Sarcophilus harrisii]
Length = 1993
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|149410700|ref|XP_001509833.1| PREDICTED: histone acetyltransferase MYST3-like isoform 2
[Ornithorhynchus anatinus]
Length = 2003
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|118101408|ref|XP_424402.2| PREDICTED: histone acetyltransferase KAT6A [Gallus gallus]
Length = 1981
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|348562787|ref|XP_003467190.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4-like
[Cavia porcellus]
Length = 358
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 237 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 294
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 295 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 348
>gi|1517914|gb|AAC50662.1| monocytic leukaemia zinc finger protein [Homo sapiens]
Length = 2004
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|348557736|ref|XP_003464675.1| PREDICTED: histone acetyltransferase MYST3 [Cavia porcellus]
Length = 2016
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 207 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 266
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 267 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 306
>gi|440901050|gb|ELR52053.1| Histone acetyltransferase MYST3 [Bos grunniens mutus]
Length = 1923
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|397505596|ref|XP_003846081.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
[Pan paniscus]
Length = 2002
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|339522307|gb|AEJ84318.1| D4 zinc and double PHD fingers family 2 [Capra hircus]
Length = 391
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 48/116 (41%), Gaps = 8/116 (6%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 142
C+ C IC + + ++ +FC CD YH YC P G + +CH C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDACDRGYHMYCLTPSMSEPPEGSW------RCHLC 374
>gi|332241000|ref|XP_003269676.1| PREDICTED: histone acetyltransferase KAT6A [Nomascus leucogenys]
Length = 2004
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|149470805|ref|XP_001506848.1| PREDICTED: zinc finger protein ubi-d4-like [Ornithorhynchus
anatinus]
Length = 425
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 44/103 (42%), Gaps = 2/103 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 263 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 322
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
C+ C IC + + ++ +FC CD YH YC PP G
Sbjct: 323 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPPMSEPPEG 365
>gi|114619920|ref|XP_001140373.1| PREDICTED: histone acetyltransferase KAT6A isoform 1 [Pan
troglodytes]
gi|332826020|ref|XP_003311745.1| PREDICTED: histone acetyltransferase KAT6A [Pan troglodytes]
Length = 2002
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|395857485|ref|XP_003801122.1| PREDICTED: histone acetyltransferase KAT6A [Otolemur garnettii]
Length = 2002
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|326932701|ref|XP_003212452.1| PREDICTED: histone acetyltransferase MYST3-like [Meleagris
gallopavo]
Length = 1981
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|74197305|dbj|BAC32253.2| unnamed protein product [Mus musculus]
Length = 933
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|402878102|ref|XP_003902742.1| PREDICTED: histone acetyltransferase KAT6A [Papio anubis]
Length = 2010
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|431902227|gb|ELK08728.1| Histone acetyltransferase MYST3 [Pteropus alecto]
Length = 1731
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|312081277|ref|XP_003142959.1| hypothetical protein LOAG_07378 [Loa loa]
Length = 147
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 52/116 (44%), Gaps = 2/116 (1%)
Query: 20 SKDIAGEEQGQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRD 77
+ IA + + +C LC +N+ + +++SC CG+ H +CLK
Sbjct: 12 AASIAALLSPSIDVSTVCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLT 71
Query: 78 LFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
W+C C+ C IC + + ++ +FC CD +H YC PP G + C
Sbjct: 72 STGKYGWQCIECKSCAICGFSDNDDQLLFCDDCDRGFHLYCLRPPLSQAPEGEWSC 127
>gi|224120768|ref|XP_002318412.1| SET domain protein [Populus trichocarpa]
gi|222859085|gb|EEE96632.1| SET domain protein [Populus trichocarpa]
Length = 1078
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 139 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C CG+++P + G C C RL ++C +C KV+ S+S V CD
Sbjct: 407 CEGCGTSLPLKPAKKIKGTSPGGQLLCKTCARLTKSKHFCGICKKVWNHSDSGSWVRCDG 466
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
C+ WVH +CD IS + F+ G Y CP C+ +
Sbjct: 467 CKVWVHAECDKISSNR---FKDLGGTDYYCPACKAK 499
>gi|118344120|ref|NP_001071881.1| zinc finger protein [Ciona intestinalis]
gi|70571741|dbj|BAE06812.1| zinc finger protein [Ciona intestinalis]
Length = 667
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 3/100 (3%)
Query: 35 VMCRLCFV-GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 93
++C +C G + A ++ C C H +CL+ + + +W+C C+ C
Sbjct: 399 IVCGICSKDGSSNKKGEAEELIKCSQCDNHGHPSCLEMSVEQVSVIETYNWQCMECKTCT 458
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
IC + MFC RCD YH +C + + SG + C
Sbjct: 459 ICSMPHREDLMMFCDRCDRGYHTFCVS--LRAIPSGVWAC 496
>gi|226498206|ref|NP_001147779.1| LOC100281389 [Zea mays]
gi|195613724|gb|ACG28692.1| PHD-finger family protein [Zea mays]
gi|219885501|gb|ACL53125.1| unknown [Zea mays]
gi|413921539|gb|AFW61471.1| PHD-finger family [Zea mays]
Length = 558
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 50/105 (47%), Gaps = 15/105 (14%)
Query: 36 MCRLCFVGENEGCERARRMLSCKS--CGKK-YHRNCLKNWA----QNRDLFHWSSWKCPS 88
+C+LC E+E ++ + C C K YH CLK + R+L W CPS
Sbjct: 419 LCKLCGTCEDEN----KKFVVCGHGYCSFKFYHALCLKESQIASEKQRNL---KCWYCPS 471
Query: 89 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C +C C + D K + C CD AYH YC PP +V G + C
Sbjct: 472 C-LCRRCFKNKDDEKIVLCDGCDEAYHTYCMDPPRSSVPRGKWFC 515
>gi|195333469|ref|XP_002033414.1| GM20421 [Drosophila sechellia]
gi|194125384|gb|EDW47427.1| GM20421 [Drosophila sechellia]
Length = 2123
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 1633 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 1687
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 1688 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 1738
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 1633 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 1692
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 1693 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 1725
>gi|449488248|ref|XP_004176107.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
[Taeniopygia guttata]
Length = 2010
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|296222095|ref|XP_002757039.1| PREDICTED: histone acetyltransferase KAT6A [Callithrix jacchus]
Length = 2003
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|161076538|ref|NP_523701.3| toutatis, isoform A [Drosophila melanogaster]
gi|157400284|gb|AAF58638.3| toutatis, isoform A [Drosophila melanogaster]
Length = 2999
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2509 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2563
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2564 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2614
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2509 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2568
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 2569 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2601
>gi|344281584|ref|XP_003412558.1| PREDICTED: histone acetyltransferase MYST3 [Loxodonta africana]
Length = 2011
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|224082358|ref|XP_002306661.1| predicted protein [Populus trichocarpa]
gi|222856110|gb|EEE93657.1| predicted protein [Populus trichocarpa]
Length = 524
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 37/73 (50%), Gaps = 4/73 (5%)
Query: 61 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
GK YH CL N + + + W CPSC +C C D +K + C CD AYH YC
Sbjct: 377 GKYYHVRCLTN---RQLILYGPRWYCPSC-LCRGCLTDKDDDKIVLCDGCDHAYHLYCMI 432
Query: 121 PPHKNVSSGPYLC 133
PP +V G + C
Sbjct: 433 PPRISVPKGKWFC 445
>gi|124487239|ref|NP_001074618.1| histone acetyltransferase KAT6A [Mus musculus]
gi|148700926|gb|EDL32873.1| mCG13090 [Mus musculus]
Length = 2003
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|168045004|ref|XP_001774969.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673716|gb|EDQ60235.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 293
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 77/159 (48%), Gaps = 22/159 (13%)
Query: 604 VLQSLPKDSKPP---LRLKFRK-PNLE------NQNSQVSQPEEEKSLIKGQRSKRKRPS 653
+L S P P LRLK +K P E +++++ S+ + E ++ Q SK+KR +
Sbjct: 142 LLHSSPGSDSPVTRRLRLKIKKQPAREVVIPTTSKDNKASEHDGELAIKAHQTSKKKRTA 201
Query: 654 PFTEKTLFNEDEDAAQSNQDSLM--SEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKG 711
+A S Q L D +WIL++LG DAI KRVEV DN W+KG
Sbjct: 202 --------TPQRNAEGSTQRRLAIGDATDDDSWILQRLGTDAISKRVEVFWPIDNIWYKG 253
Query: 712 VVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 750
+ ++ + DD +TLE GK+ VR + +K
Sbjct: 254 TIVAVF--STQFCVDYDDGDQETLEFGKEKVRLLSTTKK 290
>gi|158300661|ref|XP_320523.4| AGAP012009-PA [Anopheles gambiae str. PEST]
gi|157013268|gb|EAA00692.4| AGAP012009-PA [Anopheles gambiae str. PEST]
Length = 2037
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 14/101 (13%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
E C++ ++ C +C K YH CL ++ WS CP+C E
Sbjct: 417 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 473
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
CR D + + C C +AYH +C +PP ++ G + CP+
Sbjct: 474 CRVCKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGEWRCPR 514
>gi|68565903|sp|Q8BZ21.2|KAT6A_MOUSE RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
AltName: Full=Monocytic leukemia zinc finger homolog;
AltName: Full=Monocytic leukemia zinc finger protein
Length = 2003
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
echinatior]
Length = 1304
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 77/202 (38%), Gaps = 55/202 (27%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGC----ERARRMLSC--KSCGKKYHRNCLKNWAQNRDL 78
G+ Q + C C G C ER + C +CGK YH NCL +W Q+
Sbjct: 583 GKSDEQEDETFKCIDCLSGVAPACFLCNEREGDRIRCIVPACGKHYHSNCLLSWPQS--- 639
Query: 79 FHWSSWK--CPSCRICEIC-------RRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNV-- 126
HW + CP +C C +R+ PN+ M C RC ++YH P +V
Sbjct: 640 -HWQGGRLTCPY-HVCHTCSSDNPQDKRSRAPNEKMARCVRCPSSYHASTLCLPAGSVIL 697
Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
++ +CPKH K P L+ W C C R
Sbjct: 698 TANQIICPKHYK-------APHPPLNAAW------CFLCTR------------------- 725
Query: 187 STPMVCCDVCQRWVHCQCDGIS 208
++CCD C H +C GI+
Sbjct: 726 GGSLICCDTCPTSFHLECLGIN 747
>gi|195150317|ref|XP_002016101.1| GL10676 [Drosophila persimilis]
gi|194109948|gb|EDW31991.1| GL10676 [Drosophila persimilis]
Length = 3244
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
SN+ C+ C GENE ++L C C K YH C K N W ++C
Sbjct: 2867 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2921
Query: 87 PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
+ R C +C R K ++C C AYH C PP V G + CH C +
Sbjct: 2922 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIA 2975
Query: 145 NVP 147
P
Sbjct: 2976 RAP 2978
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2869 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2928
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
CG + P + + CD C R + Y P LKV R
Sbjct: 2929 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2965
>gi|327286450|ref|XP_003227943.1| PREDICTED: histone acetyltransferase MYST3-like [Anolis
carolinensis]
Length = 2017
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|338710071|ref|XP_001916345.2| PREDICTED: zinc finger protein neuro-d4-like [Equus caballus]
Length = 205
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 84 GTVIPNGYCDFCLGGSKKTGCPE--DLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 141
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 142 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 195
>gi|148669525|gb|EDL01472.1| mCG123147, isoform CRA_c [Mus musculus]
Length = 938
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 216 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 275
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 325
>gi|194752946|ref|XP_001958780.1| GF12391 [Drosophila ananassae]
gi|190620078|gb|EDV35602.1| GF12391 [Drosophila ananassae]
Length = 3047
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2542 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2596
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2597 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2647
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2542 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2601
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 2602 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2634
>gi|195582482|ref|XP_002081057.1| GD25895 [Drosophila simulans]
gi|194193066|gb|EDX06642.1| GD25895 [Drosophila simulans]
Length = 2944
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2454 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2508
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2509 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2559
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2454 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2513
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 2514 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2546
>gi|170036699|ref|XP_001846200.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
gi|167879513|gb|EDS42896.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
Length = 1982
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 46/102 (45%), Gaps = 15/102 (14%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI------------CE 93
E C++ ++ C +C K YH CL+ ++ WS CP+C E
Sbjct: 403 EVCQQGGEIILCDTCPKAYHLVCLEPELEDTPEGKWS---CPTCEADGGVAEDDDDEHQE 459
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
CR D + + C C +AYH +C PP ++ G + CP+
Sbjct: 460 FCRICKDGGELLCCDMCPSAYHTFCLTPPLDDIPDGDWRCPR 501
>gi|63146269|gb|AAH95974.1| Myst4 protein, partial [Mus musculus]
Length = 828
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|255544976|ref|XP_002513549.1| trithorax, putative [Ricinus communis]
gi|223547457|gb|EEF48952.1| trithorax, putative [Ricinus communis]
Length = 1018
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 139 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C CG ++P + G C C +L +YC +C K++ S+S V CD
Sbjct: 351 CEGCGVSLPFKLSKKMKSSITGGQFLCKTCAKLTKLKHYCGICKKIWNHSDSGSWVRCDG 410
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
C+ WVH +CD IS+ + F+ G Y CP C+ +
Sbjct: 411 CKVWVHAECDKISNSR---FKDLGATDYYCPACKAK 443
>gi|56001099|dbj|BAD72833.1| monocytic leukemia zinc finger protein [Rattus norvegicus]
Length = 1991
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 201 ICSFCLGTKEQNREKKPEDLISCADCGNSGHPSCLKFSPELTVRVRALRWQCIECKTCSS 260
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 261 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 300
>gi|213972547|ref|NP_001094040.1| histone acetyltransferase KAT6A [Rattus norvegicus]
gi|68565633|sp|Q5TKR9.2|KAT6A_RAT RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
AltName: Full=Monocytic leukemia zinc finger homolog;
AltName: Full=Monocytic leukemia zinc finger protein
gi|149057780|gb|EDM09023.1| MYST histone acetyltransferase (monocytic leukemia) 3 [Rattus
norvegicus]
Length = 1998
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEDLISCADCGNSGHPSCLKFSPELTVRVRALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|351707582|gb|EHB10501.1| Histone acetyltransferase MYST3 [Heterocephalus glaber]
Length = 2068
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTIRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLIRMPKGMWIC 307
>gi|195027235|ref|XP_001986489.1| GH20497 [Drosophila grimshawi]
gi|193902489|gb|EDW01356.1| GH20497 [Drosophila grimshawi]
Length = 3415
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 50/123 (40%), Gaps = 17/123 (13%)
Query: 31 SNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC---- 86
SN+ C+ C GENE ++L C C K YH C K N W ++C
Sbjct: 2871 SNSLQNCQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKA 2925
Query: 87 PSCRICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
+ R C +C R K ++C C AYH C PP V G + CH C +
Sbjct: 2926 TNERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIT 2979
Query: 145 NVP 147
P
Sbjct: 2980 RAP 2982
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 140
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2873 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2932
Query: 141 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
CG + P + + CD C R + Y P LKV R
Sbjct: 2933 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2969
>gi|357631309|gb|EHJ78887.1| hypothetical protein KGM_12125 [Danaus plexippus]
Length = 501
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/100 (26%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 22 DIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW 81
++ +E T+ +C +C + + G R++ C+ C K H +CL++ +
Sbjct: 257 EMNNDEPLSHETSGVCTVCLIQKTRGSND--RLVECRDCNNKAHLSCLQSGSGILKPRPD 314
Query: 82 SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
++W+CP C+ C +C T D C C +YH C P
Sbjct: 315 NTWQCPHCKTCVVCCETNDAGILTVCSICSDSYHALCHTP 354
>gi|326923554|ref|XP_003208000.1| PREDICTED: histone acetyltransferase MYST4-like [Meleagris
gallopavo]
Length = 2028
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 61/134 (45%), Gaps = 8/134 (5%)
Query: 8 ARRFESGYVAT---ASKDIAGEEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCG-- 61
A ++ SG+ A+ S ++Q +++ +C C E+ ++ +LSC CG
Sbjct: 184 APKYSSGFPASLPPVSLLPHEKDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSS 243
Query: 62 -KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQ 119
K H +CLK + W+C C+ C CR G + + +FC CD +H C
Sbjct: 244 GKLEHPSCLKFCPELTSNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECC 303
Query: 120 HPPHKNVSSGPYLC 133
PP + G ++C
Sbjct: 304 DPPLSRMPKGMWIC 317
>gi|195352984|ref|XP_002042990.1| GM16309 [Drosophila sechellia]
gi|194127055|gb|EDW49098.1| GM16309 [Drosophila sechellia]
Length = 1418
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 86/229 (37%), Gaps = 75/229 (32%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGC-----------------ERAR-------RMLSCKS- 59
+E G T+++C C VGE EGC E A ++L+C
Sbjct: 835 AKEVGAVGTSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDAPSHTPIEDKLLTCSQP 894
Query: 60 -CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRR 109
CGK++H +C K W Q H S +CP +C C + DP +K C R
Sbjct: 895 VCGKRFHTSCCKYWPQASSSKH--SARCPR-HVCHTC-VSNDPSGRFQQLGSSKLAKCVR 950
Query: 110 CDAAYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
C A YH +C + +++ +CP+H N+ V Y C
Sbjct: 951 CPATYHQDSHCIPAGTQMLNATNIICPRH--------NIAKADAHVNVLWCYIC------ 996
Query: 168 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
VKG +VCC+ C VH C I ++E Y+
Sbjct: 997 --VKGGE---------------LVCCETCPIAVHAHCRNIPIKTNENYI 1028
>gi|154757359|gb|AAI51762.1| MYST4 protein [Bos taurus]
Length = 349
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 204 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 263
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 264 QCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 313
>gi|334324322|ref|XP_001381625.2| PREDICTED: PHD finger protein 10-like [Monodelphis domestica]
Length = 662
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N MC +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 541 NAMCGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSIIKTYPWQCMECKTC 600
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 601 IICGQPHHEEEMMFCDVCDRGYHTFC 626
>gi|327272024|ref|XP_003220786.1| PREDICTED: PHD finger protein 10-like [Anolis carolinensis]
Length = 489
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 2/102 (1%)
Query: 18 TASKDIAGEEQGQSNTNVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNR 76
+ SK + G + ++ N +C +C G E+ +A ++ C C H +CL A+
Sbjct: 353 SVSKSVPGY-KPKAFPNAICGICLKGKESNKKGKAEALIHCSQCENSGHPSCLDMSAELV 411
Query: 77 DLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
+ W+C C+ C IC + + MFC CD YH +C
Sbjct: 412 AIIKTYPWQCMECKTCIICGQPHHEEEMMFCDLCDRGYHTFC 453
>gi|327276253|ref|XP_003222884.1| PREDICTED: zinc finger protein neuro-d4-like [Anolis carolinensis]
Length = 388
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 267 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 324
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 325 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSC 370
>gi|29387208|gb|AAH48199.1| MYST4 protein, partial [Homo sapiens]
gi|33874216|gb|AAH14143.1| MYST4 protein, partial [Homo sapiens]
Length = 325
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 205 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRW 264
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 265 QCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|297299302|ref|XP_001094798.2| PREDICTED: histone acetyltransferase MYST3 [Macaca mulatta]
Length = 1905
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 73/191 (38%), Gaps = 18/191 (9%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 153
CR G + + +FC CD +H C PP + G L +C + G+
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGNTLA-----VEACSGEL---GVEA 319
Query: 154 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQR----WVHCQCDGISD 209
W + C A VKG C +R V +C++ W+ GI
Sbjct: 320 GWHMS-ACVGAFRGAKVKGG-CGQRGTSHRSHSGQTCVVVVMCKQLYWNWISTFTSGIRL 377
Query: 210 EKYLQFQVDGN 220
E+ Q DGN
Sbjct: 378 EE--QLAADGN 386
>gi|356518627|ref|XP_003527980.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
max]
Length = 1067
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 6/96 (6%)
Query: 139 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C +CG ++P L G C C RL +YC +C KV+ S+S V CD
Sbjct: 403 CEACGLSLPYKMLKKTKDSSPGGQFLCRTCARLTKSKHYCGICKKVWNHSDSGSWVRCDG 462
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
C+ WVH +CD IS + + Y CPTC+ +
Sbjct: 463 CKVWVHAECDKISSNLFKNLE---GTDYYCPTCKAK 495
>gi|392353369|ref|XP_003751480.1| PREDICTED: histone acetyltransferase KAT6B-like [Rattus norvegicus]
Length = 1640
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 51/110 (46%), Gaps = 2/110 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ ++SC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELVSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C C+ C CR G + + +FC CD +H C PP + G ++C
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|291409041|ref|XP_002720841.1| PREDICTED: MYST histone acetyltransferase 2-like [Oryctolagus
cuniculus]
Length = 1806
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|359492251|ref|XP_002284634.2| PREDICTED: uncharacterized protein LOC100247132 [Vitis vinifera]
Length = 386
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 11/117 (9%)
Query: 22 DIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKS--C-GKKYHRNCLKNWAQNRDL 78
++ ++Q ++ +C++C E +L C C K YH++CL +
Sbjct: 217 EVGIQQQKETKYFQLCKIC----GSDMEFGEHLLECGHPFCPNKYYHKSCLTSTELR--- 269
Query: 79 FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
+ W CPSC +C C D K + C CD AYH YC +PP ++ G + C K
Sbjct: 270 MYGPCWYCPSC-LCRACLTDRDDEKIILCDGCDHAYHIYCMNPPRTSIPRGKWFCRK 325
>gi|4808456|dbj|BAA77571.1| Requiem protein [Xenopus laevis]
Length = 198
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 74 GLALPNNYCDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 133
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 134 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 180
>gi|4808454|dbj|BAA77570.1| Requiem protein [Xenopus laevis]
Length = 386
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 46/107 (42%), Gaps = 4/107 (3%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ ++SC CG+ H +CL+ A W+C
Sbjct: 264 GIALPNNYCDFCLGDSKINKKTNQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQC 323
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + N +FC CD YH YC PP G + C
Sbjct: 324 IECKCCNICGTSE--NDLLFCDDCDRGYHMYCLVPPVAEPPEGSWSC 368
>gi|55726215|emb|CAH89880.1| hypothetical protein [Pongo abelii]
Length = 1275
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|410912120|ref|XP_003969538.1| PREDICTED: PHD finger protein 10-like [Takifugu rubripes]
Length = 493
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 3/87 (3%)
Query: 34 NVMCRLCFVGENEGCERAR--RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
+ +C +C G E +R R ++ C C H +CL + + SW+C C+
Sbjct: 368 SAICGICQKG-REANKRGRPEALIHCSQCDNSGHPSCLDMSGELVSVIQTYSWQCMECKT 426
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYC 118
C +C++ ++ MFC +CD YH +C
Sbjct: 427 CTVCQQPHHEDEMMFCDKCDRGYHTFC 453
>gi|148745647|gb|AAI42660.1| MYST3 protein [Homo sapiens]
Length = 1149
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|355779658|gb|EHH64134.1| Histone acetyltransferase MYST3 [Macaca fascicularis]
Length = 2276
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 444 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 503
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 504 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 543
>gi|198457110|ref|XP_001360553.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
gi|198135863|gb|EAL25128.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
Length = 3214
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2720 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2774
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2775 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIARAP 2825
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2720 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2779
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 2780 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2812
>gi|449455758|ref|XP_004145618.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Cucumis
sativus]
Length = 1073
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C +C RL +YC +C K++ S+S V CD C+ WVH +CD IS F+ G+
Sbjct: 434 CKSCTRLTNSKHYCGICKKIWNHSDSGSWVRCDGCKVWVHAECDKISSN---LFKDLGST 490
Query: 222 QYRCPTCRGE 231
Y CPTC+ +
Sbjct: 491 DYFCPTCKAK 500
>gi|321470558|gb|EFX81534.1| hypothetical protein DAPPUDRAFT_347174 [Daphnia pulex]
Length = 1890
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 45/108 (41%), Gaps = 12/108 (11%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C G+ E ++L C C K YH C + N W ++C R C
Sbjct: 1611 CQFCHSGDKED-----QLLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQRNC 1665
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCH 140
+C + G+ + C +C AYH C PP V G +LC CH
Sbjct: 1666 IVCGKPGNKTISVLCDQCPKAYHIECLQPPLAKVPRGKWLC---VLCH 1710
Score = 46.2 bits (108), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 39/100 (39%), Gaps = 17/100 (17%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK-------HTKCHSCGS 144
C+ C ++ + C CD YH YC PP N+ G + C + C CG
Sbjct: 1611 CQFCHSGDKEDQLLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQRNCIVCGK 1670
Query: 145 NVPGN-GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
PGN +SV CD C + + P KV R
Sbjct: 1671 --PGNKTISV-------LCDQCPKAYHIECLQPPLAKVPR 1701
>gi|195436452|ref|XP_002066182.1| GK22224 [Drosophila willistoni]
gi|194162267|gb|EDW77168.1| GK22224 [Drosophila willistoni]
Length = 3148
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2657 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2711
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2712 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2762
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2657 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2716
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 2717 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2749
>gi|74228562|dbj|BAE25366.1| unnamed protein product [Mus musculus]
Length = 1291
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|344306723|ref|XP_003422034.1| PREDICTED: PHD finger protein 10-like [Loxodonta africana]
Length = 533
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 412 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTVELVSMIKTYPWQCMECKTC 471
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
+C + ++ MFC CD YH +C
Sbjct: 472 IVCGQPHHEDEMMFCDVCDRGYHTFC 497
>gi|255564717|ref|XP_002523353.1| trithorax, putative [Ricinus communis]
gi|223537441|gb|EEF39069.1| trithorax, putative [Ricinus communis]
Length = 1057
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 42/93 (45%), Gaps = 5/93 (5%)
Query: 139 CHSCGSNVPGNGLS--VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVC 196
C SC +P + F C C +L YC +C K++ S+ VCCD C
Sbjct: 399 CDSCNLILPCKTIKRKASVFQTELICKHCAKLRKSKQYCGICKKIWHHSDGGNWVCCDGC 458
Query: 197 QRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
WVH +CD IS + + + N Y CP CR
Sbjct: 459 NVWVHAECDNISRKLFKDLE---NFDYYCPDCR 488
>gi|345316943|ref|XP_001509649.2| PREDICTED: PHD finger protein 10, partial [Ornithorhynchus
anatinus]
Length = 468
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 347 NAICGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTTELVSMIKTYPWQCMECKTC 406
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 407 IICGQPHHEEEMMFCDVCDRGYHTFC 432
>gi|6850865|emb|CAB71104.1| putative protein [Arabidopsis thaliana]
Length = 902
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C +L YC +C +++ S+ VCCD C WVH +CD I++E++ + + +
Sbjct: 338 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHNN-- 395
Query: 222 QYRCPTCR 229
Y CP C+
Sbjct: 396 -YYCPDCK 402
>gi|358336360|dbj|GAA54889.1| histone acetyltransferase MYST3 [Clonorchis sinensis]
Length = 1190
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 30/68 (44%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
L CK CG + H CL W + + S W+C C+ C +C+ + C CD
Sbjct: 808 FLICKDCGLRAHPTCLDYWPELTERARQSPWQCTDCKTCTVCQNKQITTDLLVCDACDKG 867
Query: 114 YHCYCQHP 121
+H C P
Sbjct: 868 FHIECHVP 875
>gi|195447676|ref|XP_002071320.1| GK25190 [Drosophila willistoni]
gi|194167405|gb|EDW82306.1| GK25190 [Drosophila willistoni]
Length = 2262
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 11/113 (9%)
Query: 14 GYVATASKDIAGEEQGQSNTNVM----CRLCFVGENEGCERARRM----LSCKSCGKKYH 65
G + AS D +++ Q+N+ + C +C ++ AR M + C SC + H
Sbjct: 1842 GASSVASSDNEQQQRPQNNSRLSHGSNCGVCLRNQHRN---ARNMPEAFIRCYSCRRNVH 1898
Query: 66 RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
+C++ + +W+C C+ C CRR K ++C +CD YH YC
Sbjct: 1899 PSCIEMPQRMLGRVRNYNWQCAECKCCIKCRRRQKEGKMLYCEQCDRGYHIYC 1951
>gi|297679669|ref|XP_002817646.1| PREDICTED: PHD finger protein 10 [Pongo abelii]
Length = 498
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 377 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 436
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 437 IICGQPHHEEEMMFCDMCDRGYHTFC 462
>gi|345778329|ref|XP_532272.3| PREDICTED: PHD finger protein 10 isoform 1 [Canis lupus familiaris]
Length = 410
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|338722861|ref|XP_001499514.3| PREDICTED: PHD finger protein 10 [Equus caballus]
Length = 451
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E +A ++ C C H +CL + + W+C C+ C
Sbjct: 330 NALCGICLKGKETNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 390 IICGQPHHEEEMMFCDVCDRGYHTFC 415
>gi|195123885|ref|XP_002006432.1| GI21040 [Drosophila mojavensis]
gi|193911500|gb|EDW10367.1| GI21040 [Drosophila mojavensis]
Length = 2976
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2620 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2674
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2675 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2725
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2620 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2679
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 2680 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2712
>gi|432852854|ref|XP_004067418.1| PREDICTED: PHD finger protein 10-like [Oryzias latipes]
Length = 442
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ R ++ C C H +CL ++ + W+C C+ C
Sbjct: 317 NAICGICQKGKESNKKGRPEALIHCSQCDNSGHPSCLDMSSELVSVIQTYRWQCMECKTC 376
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
+C++ ++ MFC +CD YH +C
Sbjct: 377 TVCQQPHHEDEMMFCDKCDRGYHTFC 402
>gi|402868771|ref|XP_003898462.1| PREDICTED: PHD finger protein 10 [Papio anubis]
Length = 498
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
Query: 17 ATASKDIAGEEQGQSNTNVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQN 75
+ SK + G + ++ N +C +C G E+ +A ++ C C H +CL +
Sbjct: 361 SVLSKSVPGY-KPKAIPNALCGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVEL 419
Query: 76 RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYC 118
+ W+C C+ C IC + + MFC CD YH +C
Sbjct: 420 VSMIKTYPWQCMECKTCIICGQPHHEEEMMFCDVCDRGYHTFC 462
>gi|357130254|ref|XP_003566765.1| PREDICTED: uncharacterized protein LOC100821699 [Brachypodium
distachyon]
Length = 918
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 55/124 (44%), Gaps = 9/124 (7%)
Query: 14 GYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSC--KSCGKKY-HRNCLK 70
V + ++ GEE C++C E++ +R L C C KY H CLK
Sbjct: 739 AIVISCTEPAEGEELPNIVVGGSCKMCGTPEDDD----KRFLICGHSHCPYKYYHIRCLK 794
Query: 71 N-WAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
+ ++ W CPSC +C +C GD + + C CD AYH YC P +V G
Sbjct: 795 SKQIASKVQRDKPCWYCPSC-LCRVCLSDGDDEQTILCDGCDEAYHLYCMTPRRTSVPKG 853
Query: 130 PYLC 133
+ C
Sbjct: 854 KWYC 857
>gi|296439269|sp|Q4V7A6.2|PHF10_RAT RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
factor 45a; Short=BAF45a
Length = 497
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 376 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 435
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 436 IICGQPHHEEEMMFCDVCDRGYHTFC 461
>gi|154152087|ref|NP_001093826.1| zinc finger protein ubi-d4 [Bos taurus]
gi|296218741|ref|XP_002755572.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Callithrix
jacchus]
gi|426252022|ref|XP_004019718.1| PREDICTED: zinc finger protein ubi-d4 [Ovis aries]
gi|118582243|gb|ABL07500.1| zinc-finger protein ubi-d4 [Capra hircus]
gi|151557067|gb|AAI49970.1| DPF2 protein [Bos taurus]
gi|152941218|gb|ABS45046.1| D4, zinc and double PHD fingers family 2 [Bos taurus]
gi|296471617|tpg|DAA13732.1| TPA: D4, zinc and double PHD fingers family 2 [Bos taurus]
gi|417400089|gb|JAA47013.1| Putative transcription factor requiem/neuro-d4 [Desmodus rotundus]
Length = 391
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|145332921|ref|NP_001078326.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
gi|332646730|gb|AEE80251.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
Length = 982
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C +L YC +C +++ S+ VCCD C WVH +CD I++E++ + + +
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408
Query: 222 QYRCPTCR 229
Y CP C+
Sbjct: 409 NYYCPDCK 416
>gi|291414421|ref|XP_002723450.1| PREDICTED: D4, zinc and double PHD fingers family 2-like
[Oryctolagus cuniculus]
Length = 388
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 262 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 321
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 322 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 368
>gi|308488788|ref|XP_003106588.1| hypothetical protein CRE_15947 [Caenorhabditis remanei]
gi|308253938|gb|EFO97890.1| hypothetical protein CRE_15947 [Caenorhabditis remanei]
Length = 452
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 4/92 (4%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M+ C C YH C++ + L W C CR+C IC + N+ +FC +CD
Sbjct: 352 MICCSVCQIVYHPRCIEMPDRMAALVRTYEWSCVDCRVCSICNKPEKENEIVFCDKCDRG 411
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN 145
+H +C K++ G ++C T C N
Sbjct: 412 FHTFCVG--LKSLPRGTWIC--DTYCSETNRN 439
>gi|341942257|sp|Q9D8M7.4|PHF10_MOUSE RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
factor 45a; Short=BAF45a
Length = 497
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 376 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 435
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 436 IICGQPHHEEEMMFCDVCDRGYHTFC 461
>gi|390470774|ref|XP_003734353.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Callithrix
jacchus]
Length = 405
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 279 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 338
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 339 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 385
>gi|74208866|dbj|BAE21185.1| unnamed protein product [Mus musculus]
Length = 1148
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|54020946|ref|NP_001005717.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
gi|49522323|gb|AAH75306.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
Length = 428
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%)
Query: 44 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 103
EN+ M+SC CG+ H +CL+ W+C C+ C +C + + ++
Sbjct: 321 ENKKTGSKEEMVSCADCGRSGHPSCLQFSPNMIISVKKYPWQCIECKSCGLCGTSDNDDQ 380
Query: 104 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+FC CD YH YC PP G + C
Sbjct: 381 LLFCDDCDRGYHMYCLKPPLSEPPEGSWSC 410
>gi|443725765|gb|ELU13216.1| hypothetical protein CAPTEDRAFT_167868 [Capitella teleta]
Length = 236
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 52/122 (42%), Gaps = 16/122 (13%)
Query: 35 VMCRLCFVGENEGCER---ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
V+C C + C R +L C C K H +C+ + S W+C C+
Sbjct: 14 VLCDYCL--QTASCNRKGAQEDLLICTDCQAKAHPSCMDYSSDLARRARRSPWQCIDCKT 71
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-----------PKHTKCH 140
C +C GDP+ +FC CD YH C P ++ +G ++C P+ T C
Sbjct: 72 CCLCEDAGDPDAMLFCDACDKGYHMSCHSPVIEDKPTGKWVCSRCCQEIEADAPETTFCG 131
Query: 141 SC 142
SC
Sbjct: 132 SC 133
>gi|356518577|ref|XP_003527955.1| PREDICTED: uncharacterized protein LOC100795906 [Glycine max]
Length = 646
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 4/74 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH CL N+ + W CPSC +C +C D ++ + C CD AYH YC P
Sbjct: 499 KYYHVRCL---TINQLKSYGHCWYCPSC-LCRVCLTDQDDDRIVLCDGCDHAYHIYCMKP 554
Query: 122 PHKNVSSGPYLCPK 135
P ++ G + C K
Sbjct: 555 PRTSIPRGNWFCRK 568
>gi|195382825|ref|XP_002050129.1| GJ21968 [Drosophila virilis]
gi|194144926|gb|EDW61322.1| GJ21968 [Drosophila virilis]
Length = 3086
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2585 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2639
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2640 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2690
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2585 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2644
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 2645 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2677
>gi|12841710|dbj|BAB25323.1| unnamed protein product [Mus musculus]
Length = 410
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTVELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|395544822|ref|XP_003774305.1| PREDICTED: zinc finger protein ubi-d4 [Sarcophilus harrisii]
Length = 423
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 297 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 356
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 357 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 403
>gi|297267436|ref|XP_002808108.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein ubi-d4-like
[Macaca mulatta]
Length = 391
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|145339751|ref|NP_191733.3| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
gi|259016183|sp|Q9M364.2|ATX3_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX3; AltName:
Full=Protein SET DOMAIN GROUP 14; AltName:
Full=Trithorax-homolog protein 3; Short=TRX-homolog
protein 3
gi|225898735|dbj|BAH30498.1| hypothetical protein [Arabidopsis thaliana]
gi|332646729|gb|AEE80250.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
Length = 1018
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C +L YC +C +++ S+ VCCD C WVH +CD I++E++ + + +
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408
Query: 222 QYRCPTCR 229
Y CP C+
Sbjct: 409 NYYCPDCK 416
>gi|110742931|dbj|BAE99361.1| trithorax 3 [Arabidopsis thaliana]
Length = 1018
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C +L YC +C +++ S+ VCCD C WVH +CD I++E++ + + +
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408
Query: 222 QYRCPTCR 229
Y CP C+
Sbjct: 409 NYYCPDCK 416
>gi|431910278|gb|ELK13351.1| Zinc finger protein ubi-d4 [Pteropus alecto]
Length = 391
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|194328734|ref|NP_060758.2| PHD finger protein 10 isoform a [Homo sapiens]
gi|296439276|sp|Q8WUB8.3|PHF10_HUMAN RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
factor 45a; Short=BAF45a; AltName: Full=XAP135
gi|119567827|gb|EAW47442.1| PHD finger protein 10, isoform CRA_a [Homo sapiens]
Length = 498
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 377 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 436
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 437 IICGQPHHEEEMMFCDMCDRGYHTFC 462
>gi|431904613|gb|ELK09995.1| PHD finger protein 10 [Pteropus alecto]
Length = 451
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 330 NALCGICLKGKESNKKGKAESLIHCSQCDSSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 390 IICGQPHHEEEMMFCDVCDRGYHTFC 415
>gi|224132822|ref|XP_002321418.1| SET domain protein [Populus trichocarpa]
gi|222868414|gb|EEF05545.1| SET domain protein [Populus trichocarpa]
Length = 1070
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 158 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
G C C RL ++C +C KV+ S+S CD C+ W+H +CD IS F+
Sbjct: 427 GQFLCKKCARLTKSKHFCGICKKVWNHSDSGSWARCDGCKVWIHAECDRISSN---HFKD 483
Query: 218 DGNLQYRCPTCRGE 231
G + Y CPTC+ +
Sbjct: 484 LGGIDYYCPTCKAK 497
>gi|224066781|ref|XP_002302212.1| predicted protein [Populus trichocarpa]
gi|222843938|gb|EEE81485.1| predicted protein [Populus trichocarpa]
Length = 604
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 55/124 (44%), Gaps = 8/124 (6%)
Query: 11 FESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSC-GKKYHRNCL 69
FE +A+ ++ +G N C++C G G ++ C GK YH CL
Sbjct: 410 FEEASNFSANNEVKLSSEGTGNV-CTCKIC--GSPVGNGEKIKICDHSECPGKYYHVRCL 466
Query: 70 KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
Q H W CPSC +C +C D +K + C CD AYH YC PP +V G
Sbjct: 467 TT-RQIDSCGH--RWYCPSC-LCRVCITDRDDDKIVLCDGCDHAYHLYCMIPPRISVPKG 522
Query: 130 PYLC 133
+ C
Sbjct: 523 KWFC 526
>gi|403165473|ref|XP_003325473.2| hypothetical protein PGTG_07306 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165738|gb|EFP81054.2| hypothetical protein PGTG_07306 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1108
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 2/96 (2%)
Query: 54 MLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK-FMFCRRCD 111
M+SC CG+ H +C++ N + W C CR C C + GD ++ + C CD
Sbjct: 294 MVSCWECGQSGHFSCMELNNLTIKSHAKSYPWLCLECRRCHGCDKKGDDDQNMLLCAVCD 353
Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+H C +PP + V SG + CP + C +P
Sbjct: 354 RGWHGECLNPPLRTVPSGDFTCPFDHQSTQCIPPLP 389
>gi|194328736|ref|NP_579866.2| PHD finger protein 10 isoform b [Homo sapiens]
Length = 496
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 375 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 434
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 435 IICGQPHHEEEMMFCDMCDRGYHTFC 460
>gi|255522851|ref|NP_077212.3| PHD finger protein 10 [Mus musculus]
Length = 497
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 376 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 435
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 436 IICGQPHHEEEMMFCDVCDRGYHTFC 461
>gi|444724505|gb|ELW65108.1| Zinc finger protein ubi-d4 [Tupaia chinensis]
Length = 412
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 286 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 345
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 346 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 392
>gi|443690042|gb|ELT92280.1| hypothetical protein CAPTEDRAFT_224752 [Capitella teleta]
Length = 1892
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 1/99 (1%)
Query: 36 MCRLCFVGENEGCERA-RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C +++ + +++SC CG H +CLK + W+C C+ C +
Sbjct: 210 LCSFCLGADDKNRDGVPEQLISCADCGNCGHPSCLKFSDSLVERVGHMRWQCIECKKCSL 269
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C TG + +FC CD H C PP + G ++C
Sbjct: 270 CGETGKEDNMLFCDACDRGIHMECCIPPLTSAPEGKWVC 308
>gi|387542916|gb|AFJ72085.1| PHD finger protein 10 isoform a [Macaca mulatta]
Length = 498
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 377 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 436
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 437 IICGQPHHEEEMMFCDVCDRGYHTFC 462
>gi|67078518|ref|NP_001019918.1| PHD finger protein 10 [Rattus norvegicus]
gi|66910931|gb|AAH98049.1| PHD finger protein 10 [Rattus norvegicus]
Length = 410
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|197098008|ref|NP_001127678.1| zinc finger protein ubi-d4 [Pongo abelii]
gi|332250195|ref|XP_003274239.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Nomascus
leucogenys]
gi|397516926|ref|XP_003828672.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Pan paniscus]
gi|402892869|ref|XP_003909629.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Papio anubis]
gi|403293484|ref|XP_003937746.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Saimiri
boliviensis boliviensis]
gi|426369137|ref|XP_004051553.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Gorilla gorilla
gorilla]
gi|56403615|emb|CAI29608.1| hypothetical protein [Pongo abelii]
Length = 405
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 279 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 338
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 339 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 385
>gi|30584805|gb|AAP36655.1| Homo sapiens requiem, apoptosis response zinc finger gene
[synthetic construct]
gi|61370771|gb|AAX43549.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
Length = 392
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|149047117|gb|EDL99837.1| PHD finger protein 10 [Rattus norvegicus]
Length = 449
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 328 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 387
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 388 IICGQPHHEEEMMFCDVCDRGYHTFC 413
>gi|440907399|gb|ELR57553.1| Zinc finger protein ubi-d4, partial [Bos grunniens mutus]
Length = 380
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 254 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 313
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 314 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 360
>gi|336370765|gb|EGN99105.1| hypothetical protein SERLA73DRAFT_160636 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1506
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 28/51 (54%)
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
WKC C+ CE+CR GD + +FC CD +H C PP + G + CP
Sbjct: 9 WKCLECKNCEVCREKGDDERILFCDFCDRGWHMDCLQPPLQESPPGKWHCP 59
>gi|5454004|ref|NP_006259.1| zinc finger protein ubi-d4 [Homo sapiens]
gi|350534556|ref|NP_001233580.1| zinc finger protein ubi-d4 [Pan troglodytes]
gi|73983120|ref|XP_866588.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Canis lupus
familiaris]
gi|332250193|ref|XP_003274238.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Nomascus
leucogenys]
gi|397516924|ref|XP_003828671.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Pan paniscus]
gi|402892867|ref|XP_003909628.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Papio anubis]
gi|403293482|ref|XP_003937745.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Saimiri
boliviensis boliviensis]
gi|410974412|ref|XP_003993641.1| PREDICTED: zinc finger protein ubi-d4 [Felis catus]
gi|426369135|ref|XP_004051552.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Gorilla gorilla
gorilla]
gi|2842711|sp|Q92785.2|REQU_HUMAN RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName:
Full=BRG1-associated factor 45D; Short=BAF45D; AltName:
Full=D4, zinc and double PHD fingers family 2; AltName:
Full=Protein requiem
gi|2121234|gb|AAB58307.1| requiem homolog [Homo sapiens]
gi|2529705|gb|AAB81203.1| requiem [Homo sapiens]
gi|15928853|gb|AAH14889.1| D4, zinc and double PHD fingers family 2 [Homo sapiens]
gi|28144169|gb|AAO26041.1| requiem, apoptosis response zinc finger gene [Homo sapiens]
gi|30582275|gb|AAP35364.1| requiem, apoptosis response zinc finger gene [Homo sapiens]
gi|61361059|gb|AAX41982.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
gi|61361064|gb|AAX41983.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
gi|119594781|gb|EAW74375.1| D4, zinc and double PHD fingers family 2, isoform CRA_a [Homo
sapiens]
gi|119594782|gb|EAW74376.1| D4, zinc and double PHD fingers family 2, isoform CRA_a [Homo
sapiens]
gi|123983164|gb|ABM83323.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|123997873|gb|ABM86538.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|158257320|dbj|BAF84633.1| unnamed protein product [Homo sapiens]
gi|208967739|dbj|BAG72515.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|343962233|dbj|BAK62704.1| zinc-finger protein ubi-d4 [Pan troglodytes]
gi|355566318|gb|EHH22697.1| Protein requiem [Macaca mulatta]
gi|355751970|gb|EHH56090.1| Protein requiem [Macaca fascicularis]
gi|380815318|gb|AFE79533.1| zinc finger protein ubi-d4 [Macaca mulatta]
gi|410218232|gb|JAA06335.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410249592|gb|JAA12763.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410288496|gb|JAA22848.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410336195|gb|JAA37044.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
Length = 391
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|380792751|gb|AFE68251.1| PHD finger protein 10 isoform a, partial [Macaca mulatta]
Length = 480
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 377 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 436
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 437 IICGQPHHEEEMMFCDVCDRGYHTFC 462
>gi|311247329|ref|XP_003122585.1| PREDICTED: zinc finger protein ubi-d4 [Sus scrofa]
Length = 391
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|149725409|ref|XP_001492666.1| PREDICTED: zinc finger protein ubi-d4 [Equus caballus]
Length = 391
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|26331782|dbj|BAC29621.1| unnamed protein product [Mus musculus]
Length = 1010
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|432091134|gb|ELK24346.1| Zinc finger protein ubi-d4 [Myotis davidii]
Length = 405
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 279 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 338
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 339 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 385
>gi|380792753|gb|AFE68252.1| PHD finger protein 10 isoform b, partial [Macaca mulatta]
Length = 478
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 375 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 434
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 435 IICGQPHHEEEMMFCDVCDRGYHTFC 460
>gi|356573885|ref|XP_003555086.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
max]
Length = 1003
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C RL +YC +C K++ S+S V CD C+ WVH +CD IS F+ G
Sbjct: 364 CKTCARLTKSKHYCGICKKIWNYSDSGSWVRCDGCKVWVHAECDKISSN---LFKNLGGS 420
Query: 222 QYRCPTCR 229
Y CPTC+
Sbjct: 421 DYFCPTCK 428
>gi|410960397|ref|XP_004001392.1| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 10 [Felis catus]
Length = 440
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 319 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 378
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
+C + + MFC CD YH +C
Sbjct: 379 IVCGQPHHEEEMMFCDVCDRGYHTFC 404
>gi|12805463|gb|AAH02206.1| PHD finger protein 10 [Mus musculus]
gi|148688526|gb|EDL20473.1| PHD finger protein 10, isoform CRA_b [Mus musculus]
Length = 408
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 287 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 346
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 347 IICGQPHHEEEMMFCDVCDRGYHTFC 372
>gi|395852334|ref|XP_003798694.1| PREDICTED: zinc finger protein ubi-d4 [Otolemur garnettii]
Length = 391
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|356507582|ref|XP_003522543.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
max]
Length = 1035
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 45/96 (46%), Gaps = 6/96 (6%)
Query: 139 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C +CG ++P L G C C RL +YC +C KV+ S+S V CD
Sbjct: 370 CEACGLSLPYKMLKKTKDSSPGGQFLCKTCARLTKSKHYCGICKKVWNHSDSGSWVRCDG 429
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 231
C+ WVH +CD I + + Y CPTC+ +
Sbjct: 430 CKVWVHAECDKICSNLFKNLE---GTDYYCPTCKAK 462
>gi|332825485|ref|XP_518861.3| PREDICTED: PHD finger protein 10 [Pan troglodytes]
Length = 451
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 330 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 390 IICGQPHHEEEMMFCDMCDRGYHTFC 415
>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
Length = 1983
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 14/101 (13%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
E C++ ++ C +C K YH CL ++ WS CP+C E
Sbjct: 382 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 438
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
CR D + + C C +AYH +C PP ++ G + CP+
Sbjct: 439 CRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPR 479
>gi|148688525|gb|EDL20472.1| PHD finger protein 10, isoform CRA_a [Mus musculus]
Length = 469
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 348 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 407
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 408 IICGQPHHEEEMMFCDVCDRGYHTFC 433
>gi|301121094|ref|XP_002908274.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103305|gb|EEY61357.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 634
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 9/72 (12%)
Query: 168 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDG-------ISD--EKYLQFQVD 218
L +G YCPVC +VY D + VCCD C+ WVH CD + D E + +
Sbjct: 562 LRAQGQYCPVCNEVYEDDDQNTFVCCDSCELWVHGACDPSLTPYVVLLDMMESIIAAMAN 621
Query: 219 GNLQYRCPTCRG 230
+Y CP C G
Sbjct: 622 TEDKYICPLCAG 633
>gi|119567828|gb|EAW47443.1| PHD finger protein 10, isoform CRA_b [Homo sapiens]
Length = 449
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 328 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 387
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 388 IICGQPHHEEEMMFCDMCDRGYHTFC 413
>gi|332263993|ref|XP_003281033.1| PREDICTED: PHD finger protein 10 [Nomascus leucogenys]
Length = 451
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 330 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 390 IICGQPHHEEEMMFCDMCDRGYHTFC 415
>gi|355569151|gb|EHH25368.1| hypothetical protein EGK_21333 [Macaca mulatta]
Length = 410
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|281351742|gb|EFB27326.1| hypothetical protein PANDA_014721 [Ailuropoda melanoleuca]
Length = 397
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 276 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 335
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
+C + + MFC CD YH +C
Sbjct: 336 IVCGQPHHEEEMMFCDVCDRGYHTFC 361
>gi|7023354|dbj|BAA91934.1| unnamed protein product [Homo sapiens]
gi|48146663|emb|CAG33554.1| PHF10 [Homo sapiens]
gi|82571445|gb|AAI10324.1| PHD finger protein 10 [Homo sapiens]
gi|261858284|dbj|BAI45664.1| PHD finger protein 10 [synthetic construct]
Length = 410
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDMCDRGYHTFC 374
>gi|358349267|ref|XP_003638660.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
gi|355504595|gb|AES85798.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
Length = 1149
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 44/98 (44%), Gaps = 14/98 (14%)
Query: 139 CHSCGSNVPGNGLS-------VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 191
C SCG +P + FL C C +L YC +C K++ S+ V
Sbjct: 283 CASCGLMLPCKTMKKVKDSSHAPQFL----CKHCVKLRKSKQYCGICKKIWHHSDGGNWV 338
Query: 192 CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CCD C WVH +CD IS E + + N Y CP C+
Sbjct: 339 CCDGCNVWVHAECDKISTEHFKDLE---NTDYYCPDCK 373
>gi|348561411|ref|XP_003466506.1| PREDICTED: PHD finger protein 10-like [Cavia porcellus]
Length = 614
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 493 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMSVELVSMIKTYPWQCMECKTC 552
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 553 IICGQPHHEEEMMFCDVCDRGYHTFC 578
>gi|74219112|dbj|BAE26697.1| unnamed protein product [Mus musculus]
Length = 391
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTRQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|344295884|ref|XP_003419640.1| PREDICTED: zinc finger protein ubi-d4 [Loxodonta africana]
Length = 391
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|237839305|ref|XP_002368950.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
gi|211966614|gb|EEB01810.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
Length = 551
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 59/142 (41%), Gaps = 8/142 (5%)
Query: 5 EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARR----MLSCKSC 60
E+ A R + + A ++ + E+G+ R C R R +L C C
Sbjct: 297 EELAARGKVFFAACGAEPLRTGEEGEDADE---RFVLSCRQRSCRRDPRDFKDLLVCFRC 353
Query: 61 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
+ +H +C + N +L W C C+ CE C+ + + + C CD AYH C
Sbjct: 354 RQSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRAYHMDCME 412
Query: 121 PPHKNVSSGPYLCPKHTKCHSC 142
PP + V G + C +C C
Sbjct: 413 PPVEEVPDGTWFCADCGRCACC 434
>gi|426355212|ref|XP_004045024.1| PREDICTED: PHD finger protein 10 [Gorilla gorilla gorilla]
Length = 451
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 330 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELISMIKTYPWQCMECKTC 389
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 390 IICGQPHHEEEMMFCDMCDRGYHTFC 415
>gi|72015501|ref|XP_785947.1| PREDICTED: uncharacterized protein LOC580820 [Strongylocentrotus
purpuratus]
Length = 1065
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 45/105 (42%), Gaps = 9/105 (8%)
Query: 34 NVMCRLCFVGENEGCERA-RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
+C LC + ++ C C H +CL+ + W+C C+ C
Sbjct: 845 TAICGLCLKDRRSNTKGVPENLVHCSQCDNSGHPSCLEMNDELVATIKTYPWQCMECKTC 904
Query: 93 EICRRTGDP---NKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
C GDP +K MFC +CD YH +C ++ +G +LCP
Sbjct: 905 SQC---GDPTHEDKMMFCDKCDRGYHTFCVG--LTDIPTGNWLCP 944
>gi|18088065|gb|AAH20954.1| PHD finger protein 10 [Homo sapiens]
gi|123981058|gb|ABM82358.1| PHD finger protein 10 [synthetic construct]
gi|123995863|gb|ABM85533.1| PHD finger protein 10 [synthetic construct]
Length = 408
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 287 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 346
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 347 IICGQPHHEEEMMFCDMCDRGYHTFC 372
>gi|148744463|gb|AAI42960.1| MYST3 protein [Homo sapiens]
Length = 815
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|390462274|ref|XP_002747237.2| PREDICTED: PHD finger protein 10 [Callithrix jacchus]
Length = 451
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 330 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 389
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 390 IICGQPHHEEEMMFCDVCDRGYHTFC 415
>gi|291240495|ref|XP_002740154.1| PREDICTED: PHD finger protein 10-like [Saccoglossus kowalevskii]
Length = 459
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 58/122 (47%), Gaps = 15/122 (12%)
Query: 19 ASKDIAGEEQGQSNTNVMCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRD 77
A K GE + + + +C +C G++ ++ + +++ C C H CL+ N
Sbjct: 322 AKKTTNGEYKPKDIPDAVCGICLKGKDSSKKKFSEQLVHCSQCDNSGHPTCLQ---MNDS 378
Query: 78 LFH---WSSWKCPSCRICEICRRTGDP---NKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
L H W+C C+ C +C GDP +K MFC CD +H +C K++ +G +
Sbjct: 379 LVHVIKTYPWQCMECKTCTLC---GDPTHEDKMMFCDDCDRGHHTFCVG--LKSIPTGQW 433
Query: 132 LC 133
C
Sbjct: 434 TC 435
>gi|26333367|dbj|BAC30401.1| unnamed protein product [Mus musculus]
Length = 461
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|351701967|gb|EHB04886.1| Zinc finger protein ubi-d4 [Heterocephalus glaber]
Length = 601
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 475 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 534
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 535 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 581
>gi|242014022|ref|XP_002427697.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212512132|gb|EEB14959.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 639
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 3/109 (2%)
Query: 36 MCRLC-FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C++C F + + R SC CGKK H C+++ Q F S W+C C+ C
Sbjct: 390 LCKICSFNIDFKSSRNLDRWTSCHFCGKKAHVTCIQDTEQ-WTRFKLSKWQCRDCKNCST 448
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
C+ + C CD AYH C + K+ S+ + C K ++ S G
Sbjct: 449 CKNKFSDGDLIVCGLCDDAYHLTCAN-VKKSKSNQKWFCNKCSRVFSDG 496
>gi|328875267|gb|EGG23632.1| hypothetical protein DFA_05766 [Dictyostelium fasciculatum]
Length = 1603
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/132 (22%), Positives = 56/132 (42%), Gaps = 15/132 (11%)
Query: 36 MCRLCFVGENEGCE----RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 91
+C++C G+ ++ C CG+ +H C+ + + +WKC C+
Sbjct: 857 LCKVCLSGDVPSVVGKSFVPSTLICCVDCGEVFHTFCIGLPEEVASVIDRLTWKCADCKC 916
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP----YLCPKHTKCHSCG--SN 145
C +C + + + C RCD +H YC + + P ++CP +K S G +
Sbjct: 917 CSVCMALDNEDLLLICDRCDLGFHTYC-----AGLDALPEEDDWVCPSCSKIQSNGDEQD 971
Query: 146 VPGNGLSVRWFL 157
V + +W L
Sbjct: 972 VKVETVKEKWLL 983
>gi|426388550|ref|XP_004060697.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4
[Gorilla gorilla gorilla]
Length = 388
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 49/114 (42%), Gaps = 6/114 (5%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 267 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 324
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 138
C+ C +C + + ++ +FC D YH YC PP G + LC +H K
Sbjct: 325 ECKSCSLCGTSENDDQLLFCDDSDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378
>gi|355749042|gb|EHH53525.1| hypothetical protein EGM_14185 [Macaca fascicularis]
Length = 410
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|13487236|gb|AAK27451.1|AF338735_1 hypothetical PHD zinc finger protein XAP135 [Homo sapiens]
Length = 410
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDMCDRGYHTFC 374
>gi|26344145|dbj|BAC35729.1| unnamed protein product [Mus musculus]
Length = 435
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|403305903|ref|XP_003943488.1| PREDICTED: PHD finger protein 10 [Saimiri boliviensis boliviensis]
Length = 410
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|159164819|pdb|2YSM|A Chain A, Solution Structure Of The First And Second Phd Domain From
MyeloidLYMPHOID OR MIXED-Lineage Leukemia Protein 3
Homolog
Length = 111
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 57 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 116
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 25 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 81
Query: 117 YCQHPPHKNVSSGPYLC 133
+C P K+V + + C
Sbjct: 82 FCLQPVMKSVPTNGWKC 98
>gi|347966735|ref|XP_001689318.2| AGAP001877-PA [Anopheles gambiae str. PEST]
gi|333469922|gb|EDO63223.2| AGAP001877-PA [Anopheles gambiae str. PEST]
Length = 2382
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 51/115 (44%), Gaps = 5/115 (4%)
Query: 36 MCRLCFVGENEG-CERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C +C EN+ + + C C +K H +C+ + W+C C++C
Sbjct: 2153 LCAVCMGPENKNKYSKPELFVRCTRCRRKAHPSCIGMSSVMYKRVQQYKWQCSECKLCMK 2212
Query: 95 CRR--TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
C R +K ++C +CD YH C+ +N+ G + C T C CG+ P
Sbjct: 2213 CNRQPAAIDSKMVYCDQCDRGYHLACKG--LRNLPEGRWHCNICTICGLCGAQTP 2265
>gi|291190717|ref|NP_001167047.1| Zinc finger protein ubi-d4 [Salmo salar]
gi|223647844|gb|ACN10680.1| Zinc finger protein ubi-d4 [Salmo salar]
Length = 402
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ ++ + SC CG+ H +CL+ W+C
Sbjct: 274 GLAIPNNYCDFCLGDSALNQKTGQSEELQSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 333
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC P G + C
Sbjct: 334 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLSPAMAEPPEGSWSC 380
>gi|432875795|ref|XP_004072911.1| PREDICTED: histone acetyltransferase KAT6A-like [Oryzias latipes]
Length = 1964
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 236 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTARVKALWWQCIECKTCSS 295
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 296 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 335
>gi|84000081|ref|NP_001033141.1| PHD finger protein 10 [Bos taurus]
gi|122136994|sp|Q2T9V9.1|PHF10_BOVIN RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
factor 45a; Short=BAF45a
gi|83405479|gb|AAI11244.1| PHD finger protein 10 [Bos taurus]
gi|296483819|tpg|DAA25934.1| TPA: PHD finger protein 10 [Bos taurus]
Length = 410
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 289 NALCGICLKGKESSRRGKAEPLVHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|440893739|gb|ELR46406.1| PHD finger protein 10, partial [Bos grunniens mutus]
Length = 469
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 348 NALCGICLKGKESSRRGKAEPLVHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 407
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 408 IICGQPHHEEEMMFCDVCDRGYHTFC 433
>gi|13938144|gb|AAH07188.1| Dpf2 protein, partial [Mus musculus]
Length = 351
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 225 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 284
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 285 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 331
>gi|328785548|ref|XP_003250614.1| PREDICTED: hypothetical protein LOC100576266 [Apis mellifera]
Length = 659
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 7/102 (6%)
Query: 23 IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
I+ E Q S T + CR C + E E +++C+ C + H +C+ + +
Sbjct: 409 ISQESQDSSKTTIKDGPCRQCSLCAKEKQEN---LVACRDCTVRAHPSCIYS-PEEMIQK 464
Query: 80 HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
S+W+C C+ C IC T D C CD AYH YC P
Sbjct: 465 AGSNWQCERCKSCTICCETSDAGPLATCFTCDEAYHYYCHTP 506
>gi|380029159|ref|XP_003698249.1| PREDICTED: uncharacterized protein LOC100865213 [Apis florea]
Length = 659
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 7/102 (6%)
Query: 23 IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
I+ E Q S T + CR C + E E +++C+ C + H +C+ + +
Sbjct: 409 ISQESQDSSKTTIKDGPCRQCSLCAKEKQEN---LVACRDCTVRAHPSCIYS-PEEMIQK 464
Query: 80 HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
S+W+C C+ C IC T D C CD AYH YC P
Sbjct: 465 AGSNWQCERCKSCTICCETSDAGPLATCFTCDEAYHYYCHTP 506
>gi|354505054|ref|XP_003514587.1| PREDICTED: zinc finger protein ubi-d4-like [Cricetulus griseus]
gi|344258641|gb|EGW14745.1| Zinc finger protein ubi-d4 [Cricetulus griseus]
Length = 391
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|427797535|gb|JAA64219.1| Putative histone acetyltransferase myst family, partial
[Rhipicephalus pulchellus]
Length = 2019
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 40/95 (42%), Gaps = 7/95 (7%)
Query: 51 ARRMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
A +LSC SC H +CLK N L W+C CR+C C + + C
Sbjct: 248 AEELLSCHSCTLSAHPSCLKHNKELALVLLSSRKWQCSQCRMCSRCGNKKEGEHLLCCEV 307
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
CD+ +H C PP G + KC SC S
Sbjct: 308 CDSHFHLRCLKPPLLKAPKGSW------KCTSCSS 336
>gi|187469691|gb|AAI66788.1| Dpf2 protein [Rattus norvegicus]
Length = 390
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 264 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 323
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 324 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 370
>gi|6755314|ref|NP_035392.1| zinc finger protein ubi-d4 [Mus musculus]
gi|2500148|sp|Q61103.1|REQU_MOUSE RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName:
Full=BRG1-associated factor 45D; Short=BAF45D; AltName:
Full=D4, zinc and double PHD fingers family 2; AltName:
Full=Protein requiem
gi|1167972|gb|AAC52783.1| ubi-d4 [Mus musculus]
gi|12836275|dbj|BAB23583.1| unnamed protein product [Mus musculus]
gi|15215228|gb|AAH12709.1| D4, zinc and double PHD fingers family 2 [Mus musculus]
gi|74184334|dbj|BAE25702.1| unnamed protein product [Mus musculus]
gi|74201274|dbj|BAE26098.1| unnamed protein product [Mus musculus]
gi|74201435|dbj|BAE26153.1| unnamed protein product [Mus musculus]
gi|74206142|dbj|BAE23543.1| unnamed protein product [Mus musculus]
gi|148701237|gb|EDL33184.1| D4, zinc and double PHD fingers family 2 [Mus musculus]
Length = 391
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|195503632|ref|XP_002098733.1| GE10528 [Drosophila yakuba]
gi|194184834|gb|EDW98445.1| GE10528 [Drosophila yakuba]
Length = 1441
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 83/221 (37%), Gaps = 75/221 (33%)
Query: 33 TNVMCRLCFVGENEGC-----------------ERAR-------RMLSCKS--CGKKYHR 66
T+++C C VGE EGC E A ++L+C CGK++H
Sbjct: 856 TSLVCHECNVGEPEGCVICHQVESPAVPSTPMKEEAPSHIPIEDKLLTCSQPLCGKRFHT 915
Query: 67 NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRRCDAAYH-- 115
+C K W Q H S +CP +C C + DP +K C RC A YH
Sbjct: 916 SCCKYWPQANSSKH--SARCPR-HVCHTC-VSDDPSGKFQQLGSSKLAKCVRCPATYHQD 971
Query: 116 CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYC 175
+C + +++ +CP+H N+ V Y C VKG
Sbjct: 972 SHCIPAGTQMLNATHIICPRH--------NIAKADAHVNVLWCYIC--------VKGGE- 1014
Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
+VCC+ C VH C I ++E Y+
Sbjct: 1015 --------------LVCCETCPIAVHAHCRNIPIKTNENYI 1041
>gi|327291384|ref|XP_003230401.1| PREDICTED: zinc finger protein ubi-d4-like, partial [Anolis
carolinensis]
Length = 178
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 52 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 111
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C IC + + ++ +FC CD YH YC PP G + C
Sbjct: 112 IECKCCNICGTSENDDQLLFCDDCDRGYHMYCLTPPMSEPPEGSWSC 158
>gi|427797307|gb|JAA64105.1| Putative histone acetyltransferase myst family, partial
[Rhipicephalus pulchellus]
Length = 2011
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 40/95 (42%), Gaps = 7/95 (7%)
Query: 51 ARRMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
A +LSC SC H +CLK N L W+C CR+C C + + C
Sbjct: 248 AEELLSCHSCTLSAHPSCLKHNKELALVLLSSRKWQCSQCRMCSRCGNKKEGEHLLCCEV 307
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
CD+ +H C PP G + KC SC S
Sbjct: 308 CDSHFHLRCLKPPLLKAPKGSW------KCTSCSS 336
>gi|291414252|ref|XP_002723376.1| PREDICTED: PHD finger protein 10 [Oryctolagus cuniculus]
Length = 626
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 505 NAICGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMSTELVSMIKTYPWQCMECKTC 564
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
+C + + MFC CD YH +C
Sbjct: 565 IVCGQPHHEEEMMFCDVCDRGYHTFC 590
>gi|157817959|ref|NP_001101986.1| zinc finger protein ubi-d4 [Rattus norvegicus]
gi|149062118|gb|EDM12541.1| D4, zinc and double PHD fingers family 2 (predicted) [Rattus
norvegicus]
Length = 391
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|326915596|ref|XP_003204100.1| PREDICTED: PHD finger protein 10-like, partial [Meleagris
gallopavo]
Length = 361
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 240 NAICGICLKGKESNKKGKAEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTC 299
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 300 IICGQPHHEEEMMFCDVCDRGYHTFC 325
>gi|410922269|ref|XP_003974605.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase
KAT6A-like [Takifugu rubripes]
Length = 2234
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 43/100 (43%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E +R ++SC CG H +CLK + W+C C+ C
Sbjct: 229 ICSFCLGTKEQNRDKRPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 288
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ G + +FC CD +H C PP + G ++C
Sbjct: 289 CQDQGKNAENMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 43/101 (42%), Gaps = 14/101 (13%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 94
E C++ ++ C +C + YH CL + WS CP C E
Sbjct: 355 EVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGKWS---CPHCEAEGTQEQDDDEHNEF 411
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 412 CRLCKDGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPR 452
>gi|301090958|ref|XP_002895674.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
gi|262097084|gb|EEY55136.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
Length = 2943
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/262 (22%), Positives = 86/262 (32%), Gaps = 88/262 (33%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNR----DLFHWSSWKCPSCRICEICRRTG---------- 99
+ C CG+ +H C+ + R D + W+CP+C++CEIC + G
Sbjct: 1593 FIFCVDCGEGFHSFCVSGMSAARLEDSDQLR-AYWRCPNCKMCEICGQPGAVCGAESSAR 1651
Query: 100 ---------DPNK-------------FMFCRRCDAAYHCYCQHPPHK---------NVSS 128
PN + C CD +H C P K SS
Sbjct: 1652 VPATAGNVDSPNTDTETSLELAKTESLLLCGHCDRGFHGSCLVPAIKLPLNPKKRDGTSS 1711
Query: 129 GPYL-CPKHTKCHSCGSN--------VPGNGLSV-------------------------- 153
P + C C +C S+ P + L
Sbjct: 1712 SPVIYCASCVSCVNCKSSREYLDSDVAPNDHLDALERTYSYEQDKCLHCHNRKEREVQAL 1771
Query: 154 ---RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI--- 207
L DA R CP+C + + D++ ++ CD C+RWVH CD +
Sbjct: 1772 RERTRLLTEVWMDAARRSKKDAEKCPLCRRKW-DADLEELMQCDACERWVHPPCDDLLKK 1830
Query: 208 SDEKYLQFQVDGNLQYRCPTCR 229
++Y D N Y C CR
Sbjct: 1831 EPKRYQTLVSDPNAVYVCAACR 1852
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 47/138 (34%), Gaps = 29/138 (21%)
Query: 97 RTGDPNK--FMFCRRCDAAYHCYCQHPPHKNVS--------------SGPYLCPKHTKCH 140
R GD + + C +CD +H C PPH +S P++C T C
Sbjct: 1063 RRGDAGEEELLACAQCDNQFHATCCDPPHAPLSLVSPDDGDVLVADLKTPFVCSDCTSCA 1122
Query: 141 SC----GSNVPGNGLSVRW------FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE---S 187
C S RW C C + +C VC V D + S
Sbjct: 1123 GCRCRKSDEARAEEPSPRWSQWRLPLQTAALCTTCIPYYKANRFCGVCNLVLDDEQLATS 1182
Query: 188 TPMVCCDVCQRWVHCQCD 205
++ C C W+H C+
Sbjct: 1183 VDLLTCATCHHWIHADCE 1200
>gi|168001639|ref|XP_001753522.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695401|gb|EDQ81745.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 428
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 48/99 (48%), Gaps = 22/99 (22%)
Query: 148 GNGLSVRWFL----------GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPM-----VC 192
G G + RW + T C+AC F +G YCP C+++YR+ + V
Sbjct: 72 GKGGTCRWHVRNYGSQKDPKHVTLCNACKINFDQGKYCPFCVQIYREKDPDSFDGKEWVG 131
Query: 193 CD--VCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
CD C+RWVH +C+ IS QVD Y CP+CR
Sbjct: 132 CDNRTCRRWVHVECE-ISGGN----QVDSTAFYLCPSCR 165
>gi|221507890|gb|EEE33477.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
VEG]
Length = 546
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 59/142 (41%), Gaps = 8/142 (5%)
Query: 5 EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARR----MLSCKSC 60
E+ A R + + A ++ + E+G+ R C R R +L C C
Sbjct: 292 EELAARGKVFFSACGAEPLRTGEEGEDADE---RFVLSCRQRSCRRDPRDFKDLLVCFRC 348
Query: 61 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
+ +H +C + N +L W C C+ CE C+ + + + C CD AYH C
Sbjct: 349 RQSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRAYHMDCME 407
Query: 121 PPHKNVSSGPYLCPKHTKCHSC 142
PP + V G + C +C C
Sbjct: 408 PPVEEVPDGTWFCADCGRCACC 429
>gi|6648956|gb|AAF21306.1|AF108134_1 ubi-d4/requiem [Mus musculus]
Length = 380
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 254 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 313
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 314 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 360
>gi|328716042|ref|XP_003245819.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Acyrthosiphon pisum]
Length = 2002
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 45/104 (43%), Gaps = 20/104 (19%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 91
E C++ ++ C +C + YH CL ++ WS CP C
Sbjct: 378 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCESEGGQEQEEDEHQEF 434
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
C +C+ G+ + C C AAYH +C PP +V G + CP+
Sbjct: 435 CRVCKDGGE---LLCCDSCPAAYHTFCLSPPITDVPDGDWKCPR 475
>gi|47227720|emb|CAG09717.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2476
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E +R ++SC CG H +CLK + W+C C+ C
Sbjct: 441 ICSFCLGTKEQNRDKRPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 500
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 501 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 540
>gi|348564966|ref|XP_003468275.1| PREDICTED: zinc finger protein ubi-d4-like [Cavia porcellus]
Length = 391
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 265 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 324
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 325 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|348522233|ref|XP_003448630.1| PREDICTED: histone acetyltransferase MYST3-like [Oreochromis
niloticus]
Length = 2258
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 243 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTARVKALWWQCIECKTCSS 302
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 303 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 342
>gi|26389386|dbj|BAC25728.1| unnamed protein product [Mus musculus]
Length = 803
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 36 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|344257401|gb|EGW13505.1| PHD finger protein 10 [Cricetulus griseus]
Length = 329
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 208 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSIIKTYPWQCMECKTC 267
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 268 IICGQPHHEEEMMFCDVCDRGYHTFC 293
>gi|405951463|gb|EKC19373.1| Bromodomain adjacent to zinc finger domain protein 2B [Crassostrea
gigas]
Length = 2317
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 43/114 (37%), Gaps = 34/114 (29%)
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+C++CRR + + + C CD YH YC P N+ G + C+ C S G
Sbjct: 2033 LCQLCRRDDNEAQLLLCDGCDQGYHTYCFKPKMDNIPDGDWY------CYECISKATGE- 2085
Query: 151 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
CC CG+ + +V CD+C R +H C
Sbjct: 2086 ---------PCCVVCGKRMGR------------------IVECDLCPRAIHLDC 2112
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 51/123 (41%), Gaps = 24/123 (19%)
Query: 30 QSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC--- 86
+S V+C+LC +NE ++L C C + YH C K N W ++C
Sbjct: 2027 KSIMKVLCQLCRRDDNEA-----QLLLCDGCDQGYHTYCFKPKMDNIPDGDWYCYECISK 2081
Query: 87 ----PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 142
P C +C +R G + + C C A H C +PP + P+ C +C
Sbjct: 2082 ATGEPCCVVC--GKRMG---RIVECDLCPRAIHLDCLNPPLPRM-------PRKWVCPAC 2129
Query: 143 GSN 145
+N
Sbjct: 2130 TAN 2132
>gi|221483410|gb|EEE21729.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
GT1]
Length = 556
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 89/239 (37%), Gaps = 37/239 (15%)
Query: 5 EDYARRFESGYVATASKDIAGEEQGQSNTNVMCRLCFVGENEGCERARR----MLSCKSC 60
E+ A R + + A ++ + E+G+ R C R R +L C C
Sbjct: 302 EELAARGKVFFAACGAEPLRTGEEGEDADE---RFVLSCRQRSCRRDPRDFKDLLVCFRC 358
Query: 61 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
+H +C + N +L W C C+ CE C+ + + + C CD AYH C
Sbjct: 359 RHSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRAYHMDCME 417
Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL--------SVRWF---------------- 156
PP + V G + C +C C + + S+R
Sbjct: 418 PPVEEVPDGTWFCADCGRCACCDRRLSDEKILDPHSCVGSMRRLCFDCKERHRRGKRSRL 477
Query: 157 --LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTP---MVCCDVCQRWVHCQCDGISDE 210
LG + DA + + C VC+K E P V CD+C++ VH C + E
Sbjct: 478 SRLGSSQGDAGTHSAKRTSLCDVCVKSLCACEGKPPKMRVACDLCKQVVHADCARLPQE 536
>gi|328872173|gb|EGG20540.1| hypothetical protein DFA_00401 [Dictyostelium fasciculatum]
Length = 436
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 51/114 (44%), Gaps = 8/114 (7%)
Query: 142 CGSNVPGNGLSVRWFLG---YTCCDACGRLFVKGNYCPVCLKVYRDSE----STPMVCCD 194
CGS PG G + +W G C++CG +K C +C VY E S + CD
Sbjct: 278 CGSTTPGKGPTCKWRKGPNGEVLCNSCGLQNMKKPKCLLCGIVYNSKEAMASSISWIRCD 337
Query: 195 VCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWR 247
C++WV +CD G+ D L Y CP CR + + + + R L +
Sbjct: 338 DCKQWVMSKCDSGMGDISLYDDSNPNPLHYSCPKCRTDPSKPKTTRNNHRSLLK 391
>gi|170595283|ref|XP_001902318.1| Hypothetical C28H8.9 in chromosome III [Brugia malayi]
gi|158590068|gb|EDP28834.1| Hypothetical C28H8.9 in chromosome III, putative [Brugia malayi]
Length = 149
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 49/104 (47%), Gaps = 2/104 (1%)
Query: 32 NTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC 89
+ + +C LC +N+ + +++SC CG+ H +CLK W+C C
Sbjct: 26 DVSTVCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIEC 85
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ C IC + + ++ +FC CD +H YC PP G + C
Sbjct: 86 KSCAICGFSDNDDQLLFCDDCDRGFHLYCLRPPLPQAPEGEWSC 129
>gi|1083466|pir||A55302 probable transcription factor requiem - mouse
gi|606661|gb|AAA64637.1| Requiem [Mus musculus]
Length = 371
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 2/107 (1%)
Query: 29 GQSNTNVMCRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
G + N C C N+ + ++SC CG+ H +CL+ W+C
Sbjct: 245 GLALPNNYCDFCLGDSKINKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQC 304
Query: 87 PSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC P G + C
Sbjct: 305 IECKCCNLCGTSENDDQLLFCDDCDRGYHMYCLTPSMSEPPEGSWSC 351
>gi|402591828|gb|EJW85757.1| Dpf2 protein [Wuchereria bancrofti]
Length = 149
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 49/104 (47%), Gaps = 2/104 (1%)
Query: 32 NTNVMCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC 89
+ + +C LC +N+ + +++SC CG+ H +CLK W+C C
Sbjct: 26 DVSTVCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIEC 85
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ C IC + + ++ +FC CD +H YC PP G + C
Sbjct: 86 KSCAICGFSDNDDQLLFCDDCDRGFHLYCLRPPLPQAPEGEWSC 129
>gi|350396306|ref|XP_003484507.1| PREDICTED: hypothetical protein LOC100744391 [Bombus impatiens]
Length = 658
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 7/102 (6%)
Query: 23 IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
I E Q S T V CR C + E E +++C+ C + H +C+ + +
Sbjct: 408 IPQESQDSSKTTVKDGPCRQCSLCAKEKQET---LVACRDCTVRAHPSCIYS-PEEMIQK 463
Query: 80 HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
S+W+C C+ C IC T D C CD AYH YC P
Sbjct: 464 AGSNWQCERCKSCTICCETSDAGPLATCFTCDDAYHYYCHTP 505
>gi|301779690|ref|XP_002925264.1| PREDICTED: PHD finger protein 10-like [Ailuropoda melanoleuca]
Length = 636
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 515 NALCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTC 574
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
+C + + MFC CD YH +C
Sbjct: 575 IVCGQPHHEEEMMFCDVCDRGYHTFC 600
>gi|290992402|ref|XP_002678823.1| predicted protein [Naegleria gruberi]
gi|284092437|gb|EFC46079.1| predicted protein [Naegleria gruberi]
Length = 457
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 5/88 (5%)
Query: 28 QGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
+G+ T +C++C E+ + + C SC +H C ++ + H +W C
Sbjct: 305 RGRPRTRKLCKIC-----NSFEQETKFIQCLSCNSYFHTFCYTPNLEHLEQVHKDNWLCS 359
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYH 115
C+IC CR+ + +FC CD +H
Sbjct: 360 DCKICLKCRKGPNEGTLVFCDYCDCGFH 387
>gi|290976199|ref|XP_002670828.1| predicted protein [Naegleria gruberi]
gi|284084391|gb|EFC38084.1| predicted protein [Naegleria gruberi]
Length = 349
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 42/74 (56%), Gaps = 10/74 (13%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSES----TPMVCCDVCQRWVHCQCDGISDEKYLQFQV 217
C+ACG + KG++C C ++Y++S++ P + CD C RWVH C E+ F++
Sbjct: 36 CNACGLHYKKGHFCIYCNQIYKESDADDKEEPWIGCDSCHRWVHQNC-----ERQNGFEI 90
Query: 218 DGNLQYRCPTCRGE 231
N Y CP CR +
Sbjct: 91 KPN-GYLCPCCRNQ 103
>gi|340722214|ref|XP_003399503.1| PREDICTED: hypothetical protein LOC100648836 [Bombus terrestris]
Length = 659
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 7/102 (6%)
Query: 23 IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
I E Q S T V CR C + E E +++C+ C + H +C+ + +
Sbjct: 409 IPQESQDSSKTTVKDGPCRQCSLCAKEKQET---LVACRDCTVRAHPSCIYS-PEEMVQK 464
Query: 80 HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
S+W+C C+ C IC T D C CD AYH YC P
Sbjct: 465 AGSNWQCERCKSCTICCETSDAGPLATCFTCDDAYHYYCHTP 506
>gi|169146772|emb|CAQ13473.1| novel protein similar to human D4, zinc and double PHD fingers
family 1 (DPF1) [Danio rerio]
Length = 127
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 46/106 (43%), Gaps = 3/106 (2%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 6 GSVIPNGYCDFCLGGSKKTGCPE--DLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 63
Query: 88 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ C +C + + ++ +FC CD YH YC PP G + C
Sbjct: 64 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 109
>gi|118402055|ref|XP_001033347.1| PHD-finger family protein [Tetrahymena thermophila]
gi|89287695|gb|EAR85684.1| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 510
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 6/121 (4%)
Query: 112 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 171
+ ++ YC P KN+SS P + TK S L + T CD C + + +
Sbjct: 165 SCFNQYC--PKDKNISSVPSPQQESTKKQSVNGQYKQAKLE-KTGEKVTFCDLCCQRYQE 221
Query: 172 GNYCPVCLKVYRDS---ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 228
++C C +VY D + + CD C RW H C+ +K N+QY CP C
Sbjct: 222 KHFCYYCQQVYFDDYNIDDKEWILCDTCDRWCHLHCEEEKIKKAFSSSQSENVQYDCPRC 281
Query: 229 R 229
R
Sbjct: 282 R 282
>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
Length = 1093
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 73/202 (36%), Gaps = 55/202 (27%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGC----ERARRMLSC--KSCGKKYHRNCLKNWAQNRDL 78
G+ Q + C C G C ER + C +CGK YH CL W Q+
Sbjct: 529 GKSDEQEDETFKCIDCLSGVAPACFLCNEREGDRIRCIVPACGKHYHSKCLIPWPQS--- 585
Query: 79 FHWSSWK--CPSCRICEIC-------RRTGDPN-KFMFCRRCDAAYHC--YCQHPPHKNV 126
HW + CP +C C R+ PN K C RC ++YH C + +
Sbjct: 586 -HWQGGRLTCPY-HVCHTCSSDNPQNNRSRAPNEKVAKCVRCPSSYHASALCLPAGSEIL 643
Query: 127 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 186
++ +CPKH K P L+ W C C R
Sbjct: 644 TASQIICPKHYK-------APHPPLNAAW------CFLCTR------------------- 671
Query: 187 STPMVCCDVCQRWVHCQCDGIS 208
++CCD C H +C GI+
Sbjct: 672 GGSLICCDTCPTSFHLECLGIN 693
>gi|118344196|ref|NP_001071920.1| zinc finger protein [Ciona intestinalis]
gi|92081536|dbj|BAE93315.1| zinc finger protein [Ciona intestinalis]
Length = 257
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 34/80 (42%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
ML CK C K H +C+K + W+C C+ C C D +FC CD A
Sbjct: 36 MLFCKDCDAKAHPSCMKYSSTLAAQALSYPWQCVECKTCSSCFTARDGASILFCDGCDKA 95
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
YH C P G +LC
Sbjct: 96 YHMLCHEPEVITKPEGKWLC 115
>gi|390367174|ref|XP_003731194.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Strongylocentrotus purpuratus]
Length = 2202
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 46/102 (45%), Gaps = 15/102 (14%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR------------ICE 93
E C++ ++ C +C K +H CL + WS CP+C E
Sbjct: 352 EVCQQGGEIILCDTCPKAFHLVCLDPELETAPEGKWS---CPNCEGEGIPEPEPADEHME 408
Query: 94 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
CR D + + C +C ++YH +C +PP + + ++CP+
Sbjct: 409 FCRVCHDGGELLCCEQCPSSYHIFCLNPPLRKIPDDDWVCPR 450
>gi|62896783|dbj|BAD96332.1| PHD finger protein 10 isoform a variant [Homo sapiens]
Length = 410
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + W+C C+ C
Sbjct: 289 NAICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSTIKTYPWQCMECKTC 348
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 349 IICGQPHHEEEMMFCDMCDRGYHTFC 374
>gi|302144034|emb|CBI23139.3| unnamed protein product [Vitis vinifera]
Length = 1018
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 10/120 (8%)
Query: 139 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C CG +P + V G C C RL YC +C K+ S+S V CD
Sbjct: 352 CDGCGLRIPLKSTKKMKVLTPKGRFLCKTCDRLLKSKQYCGICKKMQNQSDSGTWVRCDG 411
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 255
C+ WVH +C IS + F+ G Y CP C+ + + E + E W+ K +K+
Sbjct: 412 CKVWVHAECGKISSK---LFKNLGATDYYCPACKAK----FNFELSDSERWQPKVKCNKN 464
>gi|298708138|emb|CBJ30479.1| myst-related protein [Ectocarpus siliculosus]
Length = 1620
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 15/101 (14%)
Query: 47 GC---ERARRMLSCKS--CGKKYHRNCLKNWAQNRDLFHWSSWKCPSC-----RI-CEIC 95
GC +R +L C CG +YH CL W W CP C R+ C +C
Sbjct: 560 GCMQNDRPTEILQCDGPMCGLEYHYGCLDPPLDKVPSSKW--WYCPDCVRTDNRVGCRVC 617
Query: 96 RRTGDPNKFMFC--RRCDAAYHCYCQHPPHKNVSSGPYLCP 134
+ D +K + C C+ +H YC PP K V G + CP
Sbjct: 618 KVDVDYDKLLKCDGPGCELEWHTYCLKPPVKTVPKGDFFCP 658
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480
>gi|74183205|dbj|BAE22542.1| unnamed protein product [Mus musculus]
Length = 331
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 2/106 (1%)
Query: 26 EEQGQSNTNVMCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
++Q +++ +C C E+ ++ +LSC CG H +CLK + W
Sbjct: 206 KDQPRADPIPICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRW 265
Query: 85 KCPSCRICEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 129
+C C+ C CR G + + +FC CD +H C PP + G
Sbjct: 266 QCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKG 311
>gi|320167424|gb|EFW44323.1| jumonji [Capsaspora owczarzaki ATCC 30864]
Length = 2147
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%)
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
+CE+C R D +K + C CD YH YC HPP V G + CP
Sbjct: 388 VCEVCLRPDDESKIILCDSCDHGYHVYCLHPPLPRVPDGDWYCP 431
>gi|302393562|ref|NP_001116784.3| histone acetyltransferase MYST3 [Danio rerio]
Length = 2247
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 230 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 289
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 290 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 329
>gi|190339720|gb|AAI63677.1| MYST histone acetyltransferase (monocytic leukemia) 3 [Danio rerio]
Length = 2246
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 229 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 288
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 289 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480
>gi|297821052|ref|XP_002878409.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324247|gb|EFH54668.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1002
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C +L YC +C +++ S+ VCCD C WVH CD I++E++ + + +
Sbjct: 355 CKHCSKLRKFNQYCGICKRIWHPSDDGDWVCCDGCNVWVHAGCDNITNERFKELEHN--- 411
Query: 222 QYRCPTCR 229
Y CP C+
Sbjct: 412 NYYCPDCK 419
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480
>gi|293332508|ref|NP_001169841.1| uncharacterized protein LOC100383733 [Zea mays]
gi|224031939|gb|ACN35045.1| unknown [Zea mays]
gi|413941582|gb|AFW74231.1| hypothetical protein ZEAMMB73_231911 [Zea mays]
Length = 555
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 9/104 (8%)
Query: 36 MCRLCFVGENEGCERARRMLSCKS--CGKK-YHRNCLKN-WAQNRDLFHWSSWKCPSCRI 91
+C+ C E+E R+ + C C K YH CLK + + W CPSC +
Sbjct: 417 LCKNCGTCEDED----RKFMVCGHGLCSFKFYHVLCLKERQIASEKQKNLKCWYCPSC-L 471
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
C C + D K + C CD AYH YC PP ++V G + C +
Sbjct: 472 CRRCFKDKDDEKIVLCDGCDEAYHIYCMDPPCESVPRGKWFCTR 515
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 376 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 475
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 372 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 471
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 382 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 481
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 49/121 (40%), Gaps = 16/121 (13%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR-----------ICEI 94
E C++ ++ C +C + YH C + WS CP C E
Sbjct: 329 EVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWS---CPHCEGEGIKEQEEDDHMEF 385
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 154
CR D + + C C +AYH +C +PP K + G + CP+ C V L+ R
Sbjct: 386 CRVCKDGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPR-CSCEPLKGRV-AKILTWR 443
Query: 155 W 155
W
Sbjct: 444 W 444
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 372 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 471
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 376 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 475
>gi|359490859|ref|XP_002268525.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
vinifera]
Length = 1094
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 10/120 (8%)
Query: 139 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 195
C CG +P + V G C C RL YC +C K+ S+S V CD
Sbjct: 422 CDGCGLRIPLKSTKKMKVLTPKGRFLCKTCDRLLKSKQYCGICKKMQNQSDSGTWVRCDG 481
Query: 196 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 255
C+ WVH +C IS + F+ G Y CP C+ + + E + E W+ K +K+
Sbjct: 482 CKVWVHAECGKISSK---LFKNLGATDYYCPACKAK----FNFELSDSERWQPKVKCNKN 534
>gi|15292405|gb|AAK93471.1| LP06732p [Drosophila melanogaster]
gi|220947368|gb|ACL86227.1| tou-PB [synthetic construct]
gi|220956830|gb|ACL90958.1| tou-PB [synthetic construct]
Length = 683
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 92
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 193 CQFCTSGENE-----DKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 247
Query: 93 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 248 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 298
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 144
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 193 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 252
Query: 145 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 183
+ P + + CD C R + Y P LKV R
Sbjct: 253 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 285
>gi|383850174|ref|XP_003700672.1| PREDICTED: uncharacterized protein LOC100875893 [Megachile
rotundata]
Length = 659
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 7/102 (6%)
Query: 23 IAGEEQGQSNTNVM---CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF 79
++ E Q S + V CR C + E E +++C+ C + H +C+ + +
Sbjct: 409 VSQEPQDSSKSAVKDGPCRQCSLCAKEKQET---LVACRDCTVRAHPSCIYS-PEEMIQK 464
Query: 80 HWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
SSW+C C+ C IC T D C CD AYH YC P
Sbjct: 465 AGSSWQCERCKSCTICCETSDAGPLATCFTCDEAYHYYCHTP 506
>gi|348680969|gb|EGZ20785.1| hypothetical protein PHYSODRAFT_298767 [Phytophthora sojae]
Length = 606
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 32/61 (52%), Gaps = 1/61 (1%)
Query: 168 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPT 227
L G YCPVC +VY D + + VCCD C+ WVH CD S Y+ ++D T
Sbjct: 539 LRALGQYCPVCSEVYEDDDQSTFVCCDSCELWVHGACDP-SLTPYVAVKIDRATTLISNT 597
Query: 228 C 228
C
Sbjct: 598 C 598
>gi|294948371|ref|XP_002785717.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
ATCC 50983]
gi|239899765|gb|EER17513.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
ATCC 50983]
Length = 1340
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 48/103 (46%), Gaps = 18/103 (17%)
Query: 139 CHSC--GSN---VPGNGLS---VRWFL---GYTCC-DACGRLFVKGNYCPVCLKVYRDSE 186
CH C G N P G + +R L TC D C R YCPVCL+ +
Sbjct: 123 CHGCILGLNQRGCPSQGPASPCIRMLLTVDSLTCIKDLCDR------YCPVCLRAWSTVW 176
Query: 187 STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
MV CD C+ WVH +CD + ++ D +++Y CP CR
Sbjct: 177 CDDMVQCDGCEFWVHAKCDNFTCKEQFTELTDKDVKYFCPICR 219
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 387 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 443
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 444 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 486
>gi|194907101|ref|XP_001981487.1| GG12082 [Drosophila erecta]
gi|190656125|gb|EDV53357.1| GG12082 [Drosophila erecta]
Length = 1441
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 89/239 (37%), Gaps = 81/239 (33%)
Query: 15 YVATASKDIAGEEQGQSNTNVMCRLCFVGENEGC-----------------ERAR----- 52
+V +K +AG T+++C C VGE EGC E A
Sbjct: 844 HVTADAKVVAG------GTSLVCHECNVGETEGCVICHQVESPAVPSTPMKEEAPSHSPI 897
Query: 53 --RMLSCKS--CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP------- 101
++L+C CGK++H +C K W Q S +CP +C C + DP
Sbjct: 898 EDKLLTCSQPLCGKRFHTSCCKYWPQANS--SKLSARCPR-HVCHTC-VSDDPSGKFQQL 953
Query: 102 --NKFMFCRRCDAAYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFL 157
+K C RC A YH +C + +++ +CP+H N+ V
Sbjct: 954 GSSKLAKCVRCPATYHQDSHCIPAGTQMLNATHIICPRH--------NIAKADAHVNVLW 1005
Query: 158 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
Y C VKG +VCC+ C VH C I ++E Y+
Sbjct: 1006 CYIC--------VKGGE---------------LVCCETCPIAVHAHCRNIPIKTNENYI 1041
>gi|190337311|gb|AAI63678.1| Myst3 protein [Danio rerio]
Length = 2247
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 230 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 289
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 290 CQDQGKNADNMLFCDSCDRGFHMECCDPPLMRMPKGMWIC 329
>gi|345493038|ref|XP_003426985.1| PREDICTED: hypothetical protein LOC100678755 isoform 1 [Nasonia
vitripennis]
gi|345493040|ref|XP_003426986.1| PREDICTED: hypothetical protein LOC100678755 isoform 2 [Nasonia
vitripennis]
Length = 728
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 51/120 (42%), Gaps = 14/120 (11%)
Query: 24 AGEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSS 83
A ++ + N C LC + E + +C+ C + H +C+ + + H +S
Sbjct: 486 AALKESPKDPNRHCSLCSKDKQEA------LTACRDCTVRAHPSCIYTPEEIMNKTH-TS 538
Query: 84 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
W+C C+ C +C T + + C CD A+H C H P VS + CH C
Sbjct: 539 WQCERCKTCVVCYETSEAGPLVACYSCDDAFHYTC-HTPRIPVSKAKW------NCHECS 591
>gi|449496893|ref|XP_002189921.2| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 10 [Taeniopygia
guttata]
Length = 426
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ + ++ C C H +CL + + W+C C+ C
Sbjct: 305 NAICGICLKGKESNKKGKPEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTC 364
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 365 IICGQPHHEEEMMFCDVCDRGYHTFC 390
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 366 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 422
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 423 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 465
>gi|328871667|gb|EGG20037.1| hypothetical protein DFA_07153 [Dictyostelium fasciculatum]
Length = 1433
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/113 (22%), Positives = 53/113 (46%), Gaps = 18/113 (15%)
Query: 37 CRLCFVGENEGCERA----------RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 86
C +CF+ + G ++ +++C +C + +H++C+ + + + + S W C
Sbjct: 716 CVICFLSTSGGNKKFYHQKTKKNQNTTLVTCFACERSFHQDCITDQPNSNN--NNSEWYC 773
Query: 87 P-----SCRI-CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C++ C +C++ + F+ C +C YH YC P V P+ C
Sbjct: 774 SIDCSMTCQVRCNVCQKGDHEDSFVLCDKCSDGYHIYCLSPQLSEVPYDPWEC 826
>gi|294868766|ref|XP_002765684.1| bromodomain-containing protein, putative [Perkinsus marinus ATCC
50983]
gi|239865763|gb|EEQ98401.1| bromodomain-containing protein, putative [Perkinsus marinus ATCC
50983]
Length = 1071
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%)
Query: 174 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
YCPVCL+ + MV CD C+ WVH +CD + ++ D +++Y CP CR
Sbjct: 353 YCPVCLRAWSTVWCDDMVQCDGCEFWVHAKCDNFTCKEQFTELTDKDVKYFCPICR 408
>gi|229594235|ref|XP_001024908.3| PHD-finger family protein [Tetrahymena thermophila]
gi|225566987|gb|EAS04663.3| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 425
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 1/90 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+L CK+C K +H C + + + W C C++C C + N+ + C CD
Sbjct: 279 ILVCKNCNKSFHAECC-DPPLEKGIVSKYDWFCTECKLCIACNKNTKENELLMCDCCDRP 337
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
+H C P ++ G + C KC CG
Sbjct: 338 FHMSCLEPARTDIPEGRWFCKDCEKCPCCG 367
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 368 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 424
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 425 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 467
>gi|413950797|gb|AFW83446.1| hypothetical protein ZEAMMB73_198866 [Zea mays]
Length = 870
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 7/99 (7%)
Query: 133 CPKHTKCHSCGSNVPGNGLS-VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 191
C + +C SCG+ P + + + + C C R+ YC +CLK + V
Sbjct: 372 CRRVLQCESCGNCFPNKDTNKMVYVMEQLACRLCARILALKKYCGICLKNLQHKYGGRRV 431
Query: 192 CCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 229
CC C+ WVH +CD S+ K LQ + +Y CP CR
Sbjct: 432 CCHGCESWVHAECDENCSNLKDLQ-----DKKYHCPYCR 465
>gi|323450933|gb|EGB06812.1| hypothetical protein AURANDRAFT_28864 [Aureococcus anophagefferens]
Length = 266
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 2/97 (2%)
Query: 49 ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC--RRTGDPNKFMF 106
+R +L C CG+ H C + ++W+CP+C++CE+C + D ++ ++
Sbjct: 66 KRGGELLFCVDCGEACHAMCASTPIDSMSDAARATWRCPNCKVCELCGESKVDDESRLLY 125
Query: 107 CRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 143
C CD AYH C P G ++C C CG
Sbjct: 126 CDLCDKAYHLDCVTPKLDVAPPGRWICGLCVTCRHCG 162
>gi|3540206|gb|AAC34356.1| Hypothetical protein [Arabidopsis thaliana]
Length = 1250
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 36/72 (50%), Gaps = 4/72 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH CL + + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 1107 KYYHIRCLTS---RQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 1162
Query: 122 PHKNVSSGPYLC 133
P ++V +G + C
Sbjct: 1163 PCESVPNGEWFC 1174
>gi|449445828|ref|XP_004140674.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
gi|449487413|ref|XP_004157614.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
Length = 1055
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 221
C C +L YC VC K++ S+ VCCD C WVH +CD IS + F+ +
Sbjct: 415 CKHCHKLRQSKQYCGVCKKIWHHSDGGNWVCCDGCNVWVHAECDKISSK---LFKDLAHS 471
Query: 222 QYRCPTCR 229
+Y CP C+
Sbjct: 472 EYYCPDCK 479
>gi|449278058|gb|EMC86025.1| PHD finger protein 10 [Columba livia]
Length = 414
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ + ++ C C H +CL + + W+C C+ C
Sbjct: 293 NAICGICLKGKESNKKGKPEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTC 352
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
IC + + MFC CD YH +C
Sbjct: 353 IICGQPHHEEEMMFCDVCDRGYHTFC 378
>gi|297842501|ref|XP_002889132.1| hypothetical protein ARALYDRAFT_339887 [Arabidopsis lyrata subsp.
lyrata]
gi|297334973|gb|EFH65391.1| hypothetical protein ARALYDRAFT_339887 [Arabidopsis lyrata subsp.
lyrata]
Length = 1160
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH CL + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 1017 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 1072
Query: 122 PHKNVSSGPYLC 133
P ++V +G + C
Sbjct: 1073 PCESVPNGEWFC 1084
>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
Length = 2221
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 43/108 (39%), Gaps = 17/108 (15%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 89
C+ C G+NE ++L C C K YH C K +N W C C
Sbjct: 1937 CQFCHSGDNED-----KLLLCDGCDKGYHTYCFKPKMEN---IPEGDWYCHECMNKATGE 1988
Query: 90 RICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
R C +C + + + + C C AYH C HP V G + C K
Sbjct: 1989 RNCIVCGKKSSTSGTRLILCELCPRAYHTDCIHPIMHKVPRGKWYCSK 2036
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/113 (23%), Positives = 40/113 (35%), Gaps = 30/113 (26%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
C+ C + +K + C CD YH YC P +N+ G + CH C + G
Sbjct: 1937 CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWY------CHECMNKATGE-- 1988
Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
C CG+ + T ++ C++C R H C
Sbjct: 1989 --------RNCIVCGK--------------KSSTSGTRLILCELCPRAYHTDC 2019
>gi|195174305|ref|XP_002027919.1| GL27102 [Drosophila persimilis]
gi|194115608|gb|EDW37651.1| GL27102 [Drosophila persimilis]
Length = 2142
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 2/94 (2%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+ C SC ++ H +C+ + +W+C C+ C C+ + P K ++C +CD
Sbjct: 1837 FIRCYSCRQRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKCKSSQRPGKMLYCEQCDRG 1896
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
YH YC K V + C + C CG+ P
Sbjct: 1897 YHIYCLG--LKTVPDERWSCERCCICMRCGAVKP 1928
>gi|189239425|ref|XP_001814901.1| PREDICTED: similar to Toutatis [Tribolium castaneum]
Length = 2075
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 43/108 (39%), Gaps = 17/108 (15%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 89
C+ C G+NE ++L C C K YH C K +N W C C
Sbjct: 1791 CQFCHSGDNED-----KLLLCDGCDKGYHTYCFKPKMEN---IPEGDWYCHECMNKATGE 1842
Query: 90 RICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
R C +C + + + + C C AYH C HP V G + C K
Sbjct: 1843 RNCIVCGKKSSTSGTRLILCELCPRAYHTDCIHPIMHKVPRGKWYCSK 1890
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 26/113 (23%), Positives = 40/113 (35%), Gaps = 30/113 (26%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
C+ C + +K + C CD YH YC P +N+ G + CH C + G
Sbjct: 1791 CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWY------CHECMNKATGE-- 1842
Query: 152 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 204
C CG+ + T ++ C++C R H C
Sbjct: 1843 --------RNCIVCGK--------------KSSTSGTRLILCELCPRAYHTDC 1873
>gi|198471111|ref|XP_002133666.1| GA22685 [Drosophila pseudoobscura pseudoobscura]
gi|198145773|gb|EDY72293.1| GA22685 [Drosophila pseudoobscura pseudoobscura]
Length = 2132
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 2/94 (2%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+ C SC ++ H +C+ + +W+C C+ C C+ + P K ++C +CD
Sbjct: 1827 FIRCYSCRQRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKCKSSQRPGKMLYCEQCDRG 1886
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 147
YH YC K V + C + C CG+ P
Sbjct: 1887 YHIYCLG--LKTVPDERWSCERCCICMRCGAVKP 1918
>gi|17569817|ref|NP_510140.1| Protein CHD-3 [Caenorhabditis elegans]
gi|6165993|sp|Q22516.2|CHD3_CAEEL RecName: Full=Chromodomain-helicase-DNA-binding protein 3 homolog;
Short=CHD-3
gi|3879819|emb|CAA91810.1| Protein CHD-3 [Caenorhabditis elegans]
gi|11095331|gb|AAG29837.1| CHD-3 [Caenorhabditis elegans]
Length = 1787
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 43/109 (39%), Gaps = 25/109 (22%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------- 90
E C + ++ C +C + YH C+ +N + W CP C
Sbjct: 269 EVCNQDGELMLCDTCTRAYHVACID---ENMEQPPEGDWSCPHCEEHGPDVLIVEEEPAK 325
Query: 91 ----ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
C IC+ T + + C C ++YH YC PP + G + CP+
Sbjct: 326 ANMDYCRICKETSN---ILLCDTCPSSYHAYCIDPPLTEIPEGEWSCPR 371
>gi|255558536|ref|XP_002520293.1| DNA binding protein, putative [Ricinus communis]
gi|223540512|gb|EEF42079.1| DNA binding protein, putative [Ricinus communis]
Length = 510
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH CL N N + W CPSC +C C D ++ + C CD AYH YC P
Sbjct: 378 KYYHVRCLTN---NLLKSYGPRWYCPSC-LCRTCFVDRDDDQIVLCDGCDHAYHMYCMSP 433
Query: 122 PHKNVSSGPYLC 133
P ++ G + C
Sbjct: 434 PRTSIPRGKWFC 445
>gi|209882276|ref|XP_002142575.1| PHD-finger domain-containing protein [Cryptosporidium muris RN66]
gi|209558181|gb|EEA08226.1| PHD-finger domain-containing protein [Cryptosporidium muris RN66]
Length = 305
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 44/96 (45%), Gaps = 1/96 (1%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
+ C C K YH C + + + S W C C C +CR++G + + C C+ A
Sbjct: 132 FIECSICKKSYHLTCCDPIIEKVSI-NNSKWICSDCNGCIVCRKSGREDYQVLCDVCNRA 190
Query: 114 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 149
+H YC +P +V G ++C C C N+ N
Sbjct: 191 FHIYCLYPTLDSVPQGIWICDDCYVCAFCQGNIKYN 226
>gi|47118082|gb|AAT11171.1| monocytic leukemia zinc finger protein [Danio rerio]
Length = 2246
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 36 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 229 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKSCSS 288
Query: 95 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 289 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328
>gi|313233623|emb|CBY09794.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 43/98 (43%), Gaps = 1/98 (1%)
Query: 37 CRLCFVGENEGCERARRMLSCKS-CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 95
C LC G + E ++ C C + H C+ A +W+C C+ C C
Sbjct: 225 CDLCSNGPDTSSEDMSMLVKCSGPCKRLTHPYCVNLPANIVKNVSTYAWECQDCKHCSKC 284
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+ +K +FC CD H YC +PP KN SG + C
Sbjct: 285 GLDENDDKLLFCDDCDRGVHLYCLNPPLKNAPSGRWTC 322
>gi|303286287|ref|XP_003062433.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455950|gb|EEH53252.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 450
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 37/69 (53%), Gaps = 3/69 (4%)
Query: 162 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV--CQRWVHCQCDGISDEKYLQFQVDG 219
C C +L +G +CP C KV++ + MV CD C+ WVH CD + E + +
Sbjct: 85 CALCAKLHKEGQFCPACDKVWQWANCPAMVGCDAPGCEFWVHASCDARAKE-VMDAPENE 143
Query: 220 NLQYRCPTC 228
+++Y CP C
Sbjct: 144 DIEYHCPRC 152
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 92
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 382 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 481
>gi|156392562|ref|XP_001636117.1| predicted protein [Nematostella vectensis]
gi|156223217|gb|EDO44054.1| predicted protein [Nematostella vectensis]
Length = 229
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
C++CRR GD K + C CD +H YC PP K++ G + CP
Sbjct: 1 CKLCRRKGDAEKMLLCDACDRGHHMYCLKPPIKHIPEGNWFCP 43
>gi|363731649|ref|XP_003641005.1| PREDICTED: PHD finger protein 10-like [Gallus gallus]
Length = 534
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 34 NVMCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 92
N +C +C G E+ +A ++ C C H +CL + + W+C C+ C
Sbjct: 413 NAICGICLKGKESNKKGKAEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTC 472
Query: 93 EICRRTGDPNKFMFCRRCDAAYHCYC 118
+C + + MFC CD YH +C
Sbjct: 473 IMCGQPHHEEEMMFCDVCDRGYHTFC 498
>gi|195432472|ref|XP_002064247.1| GK19801 [Drosophila willistoni]
gi|194160332|gb|EDW75233.1| GK19801 [Drosophila willistoni]
Length = 478
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 48/121 (39%), Gaps = 6/121 (4%)
Query: 26 EEQGQSNTNVMCRLCFVGE-NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
E+Q Q C C E G ++CKSC +K H CL +N + +
Sbjct: 131 EDQQQQRNMDTCVKCSKSEPKRGSGHKSNFITCKSCMQKCHFACLPLNFENLTMAR-KKY 189
Query: 85 KCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 144
KC CR C C G + C C YH C +PP + L + KCH C +
Sbjct: 190 KCEKCRYCSYCNSKGKEILIILCSSCVDGYHFECHNPPL----NASILDDREWKCHKCDT 245
Query: 145 N 145
N
Sbjct: 246 N 246
>gi|339246939|ref|XP_003375103.1| zinc finger protein ubi-d4 A [Trichinella spiralis]
gi|316971607|gb|EFV55360.1| zinc finger protein ubi-d4 A [Trichinella spiralis]
Length = 694
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 6/100 (6%)
Query: 26 EEQGQSNTNVMCRLC----FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW 81
+ QG + C LC F + GC A ++++C CG+ H CL+ +
Sbjct: 139 QRQGMVELSSTCDLCQGDIFENKRTGC--AEQLINCSDCGRAGHPYCLQFSSNMIISTKK 196
Query: 82 SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
W+C C+ C IC + + +FC CD +H YC P
Sbjct: 197 YGWQCIECKSCAICGTSEHDEQLLFCDDCDRGFHMYCLTP 236
>gi|357135761|ref|XP_003569477.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like
[Brachypodium distachyon]
Length = 1037
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 45/94 (47%), Gaps = 7/94 (7%)
Query: 138 KCHSCGSNVPG-NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVC 196
+C SCG+ P + + + + C C R+ YC +CLK ++ VCC C
Sbjct: 381 QCESCGNCFPNKDSNKMVYVMEQLACKHCARILRSKEYCGICLKSWQHKCGRRWVCCHGC 440
Query: 197 QRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 229
+ W+H +CD SD K LQ + Y CP CR
Sbjct: 441 ESWIHAECDKKCSDLKDLQ-----DKSYFCPYCR 469
>gi|300508320|pdb|2KWJ|A Chain A, Solution Structures Of The Double Phd Fingers Of Human
Transcriptional Protein Dpf3 Bound To A Histone Peptide
Containing Acetylation At Lysine 14
gi|300508322|pdb|2KWK|A Chain A, Solution Structures Of The Double Phd Fingers Of Human
Transcriptional Protein Dpf3b Bound To A H3 Peptide Wild
Type
gi|300508324|pdb|2KWN|A Chain A, Solution Structure Of The Double Phd (Plant Homeodomain)
Fingers Of Human Transcriptional Protein Dpf3b Bound To
A Histone H4 Peptide Containing Acetylation At Lysine 16
gi|300508326|pdb|2KWO|A Chain A, Solution Structure Of The Double Phd (Plant Homeodomain)
Fingers Of Human Transcriptional Protein Dpf3b Bound To
A Histone H4 Peptide Containing N-Terminal Acetylation
At Serine 1
Length = 114
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 40/84 (47%)
Query: 50 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 109
R ++SC CG+ H CL+ + W+C C+ C +C + + ++ +FC
Sbjct: 19 RPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCILCGTSENDDQLLFCDD 78
Query: 110 CDAAYHCYCQHPPHKNVSSGPYLC 133
CD YH YC +PP G + C
Sbjct: 79 CDRGYHMYCLNPPVAEPPEGSWSC 102
>gi|42563280|ref|NP_177849.2| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|95147302|gb|ABF57286.1| At1g77250 [Arabidopsis thaliana]
gi|332197833|gb|AEE35954.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 522
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH CL + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434
Query: 122 PHKNVSSGPYLC 133
P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446
>gi|51969394|dbj|BAD43389.1| unnamed protein product [Arabidopsis thaliana]
gi|51969560|dbj|BAD43472.1| unnamed protein product [Arabidopsis thaliana]
gi|51969870|dbj|BAD43627.1| unnamed protein product [Arabidopsis thaliana]
Length = 522
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH CL + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434
Query: 122 PHKNVSSGPYLC 133
P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446
>gi|198437529|ref|XP_002126456.1| PREDICTED: similar to Bromodomain adjacent to zinc finger domain
protein 1A (ATP-utilizing chromatin assembly and
remodeling factor 1) (hACF1) (ATP-dependent
chromatin-remodeling protein) (Williams syndrome
transcription factor-related chromatin-remodeling fa...
[Ciona intestinalis]
Length = 1458
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 24/43 (55%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
C ICRR GD K + C CD +H YC P K V SG + CP
Sbjct: 1178 CRICRRKGDGEKMLLCDNCDRGHHMYCLRPALKIVPSGDWFCP 1220
>gi|390478964|ref|XP_002762152.2| PREDICTED: zinc finger protein neuro-d4 [Callithrix jacchus]
Length = 364
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 50/124 (40%), Gaps = 16/124 (12%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLK---NWAQNRDLFHWSSW 84
G N C C G + GC ++SC CG+ H +CL+ N + W
Sbjct: 233 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 290
Query: 85 KCPSCRICEICRRTG-------DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
+C SC +C G ++ +FC CD YH YC PP G + LC
Sbjct: 291 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 350
Query: 135 KHTK 138
+H K
Sbjct: 351 RHLK 354
>gi|51969444|dbj|BAD43414.1| unnamed protein product [Arabidopsis thaliana]
Length = 522
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 62 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 121
K YH CL + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434
Query: 122 PHKNVSSGPYLC 133
P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446
>gi|45382851|ref|NP_989971.1| zinc finger protein neuro-d4 [Gallus gallus]
gi|18202298|sp|P58267.1|DPF1_CHICK RecName: Full=Zinc finger protein neuro-d4; AltName: Full=D4, zinc
and double PHD fingers family 1
gi|14010358|gb|AAK51966.1|AF362752_1 neuro-d4 [Gallus gallus]
Length = 380
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 36/80 (45%)
Query: 54 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 113
M++C CG+ H +CL+ W+C C+ C +C + + +FC CD
Sbjct: 283 MIACADCGRAGHPSCLQFTLAMAAAARSYRWQCIECKNCSLCGSAENDEQLLFCDDCDRG 342
Query: 114 YHCYCQHPPHKNVSSGPYLC 133
YH YC PP G + C
Sbjct: 343 YHMYCISPPVAEPPEGTWSC 362
>gi|442621474|ref|NP_001263029.1| Mes-4, isoform B [Drosophila melanogaster]
gi|440217972|gb|AGB96409.1| Mes-4, isoform B [Drosophila melanogaster]
Length = 1423
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 81/221 (36%), Gaps = 75/221 (33%)
Query: 33 TNVMCRLCFVGENEGCERAR------------------------RMLSCKS--CGKKYHR 66
T+++C C VGE EGC ++L+C CGK++H
Sbjct: 848 TSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFHT 907
Query: 67 NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRRCDAAYHCY 117
+C K W Q H S +CP +C C + DP +K C RC A YH
Sbjct: 908 SCCKYWPQASSSKH--SARCPR-HVCHTC-VSDDPSGKFQQLGSSKLAKCVRCPATYHQL 963
Query: 118 CQHPPH--KNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYC 175
+ P + +++ +CP+H N+ V Y C VKG
Sbjct: 964 SKCIPAGTQMLNTTNIICPRH--------NIAKADAHVNVLWCYIC--------VKGGE- 1006
Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
+VCC+ C VH C I ++E Y+
Sbjct: 1007 --------------LVCCETCPIAVHAHCRNIPIKTNESYI 1033
>gi|348508478|ref|XP_003441781.1| PREDICTED: tyrosine-protein kinase BAZ1B-like [Oreochromis niloticus]
Length = 1521
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 6/63 (9%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
C++CRR GD K + C C+ A+H +C P V +G +LCP +C V G
Sbjct: 1202 CKVCRRKGDDEKLILCDECNKAFHLFCLRPALYRVPNGEWLCP------ACQPTVARRGS 1255
Query: 152 SVR 154
VR
Sbjct: 1256 RVR 1258
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 20/104 (19%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 91
E C++ ++ C +C + YH CL ++ WS CP C
Sbjct: 380 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCENEGPAEQDDDEHQEF 436
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
C IC+ G+ + C C +AYH +C +PP + G + CP+
Sbjct: 437 CRICKDGGE---LLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPR 477
>gi|24650756|ref|NP_733239.1| Mes-4, isoform A [Drosophila melanogaster]
gi|29427833|sp|Q8MT36.2|MES4_DROME RecName: Full=Probable histone-lysine N-methyltransferase Mes-4;
AltName: Full=Maternal-effect sterile 4 homolog
gi|23172478|gb|AAF56762.2| Mes-4, isoform A [Drosophila melanogaster]
gi|94400569|gb|ABF17912.1| FI01019p [Drosophila melanogaster]
Length = 1427
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 81/221 (36%), Gaps = 75/221 (33%)
Query: 33 TNVMCRLCFVGENEGCERAR------------------------RMLSCKS--CGKKYHR 66
T+++C C VGE EGC ++L+C CGK++H
Sbjct: 852 TSLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFHT 911
Query: 67 NCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRRCDAAYHCY 117
+C K W Q H S +CP +C C + DP +K C RC A YH
Sbjct: 912 SCCKYWPQASSSKH--SARCPR-HVCHTC-VSDDPSGKFQQLGSSKLAKCVRCPATYHQL 967
Query: 118 CQHPPH--KNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYC 175
+ P + +++ +CP+H N+ V Y C VKG
Sbjct: 968 SKCIPAGTQMLNTTNIICPRH--------NIAKADAHVNVLWCYIC--------VKGGE- 1010
Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 213
+VCC+ C VH C I ++E Y+
Sbjct: 1011 --------------LVCCETCPIAVHAHCRNIPIKTNESYI 1037
>gi|196014713|ref|XP_002117215.1| hypothetical protein TRIADDRAFT_61269 [Trichoplax adhaerens]
gi|190580180|gb|EDV20265.1| hypothetical protein TRIADDRAFT_61269 [Trichoplax adhaerens]
Length = 1478
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
C ICRR GD + C CD +H YC PP ++ +G + CP
Sbjct: 1141 CRICRRKGDAELMLLCDECDRGHHTYCLRPPLNSIPAGNWYCP 1183
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 20/104 (19%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 91
E C++ ++ C +C + YH CL ++ WS CP C
Sbjct: 377 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCENEGPAEQDDDEHQEF 433
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
C IC+ G+ + C C +AYH +C +PP + G + CP+
Sbjct: 434 CRICKDGGE---LLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPR 474
>gi|218188776|gb|EEC71203.1| hypothetical protein OsI_03117 [Oryza sativa Indica Group]
Length = 1012
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 7/111 (6%)
Query: 121 PPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-WFLGYTCCDACGRLFVKGNYCPVCL 179
P N + P + +C SCG+ P S+ + + C C ++ YC VCL
Sbjct: 369 PMDGNTTGQPARYKRALQCESCGNCFPNKDPSMMVYVMEQLACRQCAKILRSKEYCGVCL 428
Query: 180 KVYRDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 229
K ++ VCC C+ WVH +CD S+ K L+ + Y CP CR
Sbjct: 429 KSWQHKCGGRWVCCHGCESWVHAECDKKCSNLKDLR-----DNSYFCPYCR 474
>gi|345481883|ref|XP_001605650.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Nasonia vitripennis]
Length = 2009
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 13/100 (13%)
Query: 46 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR----------ICEIC 95
E C++ ++ C +C + YH CL+ + WS CP C E C
Sbjct: 373 EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS---CPHCENDGALEDDDEHMEFC 429
Query: 96 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 135
R D + + C C +AYH +C +PP + G + CP+
Sbjct: 430 RVCKDGGELLCCDSCTSAYHTHCLNPPLTEIPDGDWKCPR 469
>gi|297596335|ref|NP_001042415.2| Os01g0218900 [Oryza sativa Japonica Group]
gi|56784089|dbj|BAD81418.1| unknown protein [Oryza sativa Japonica Group]
gi|222617993|gb|EEE54125.1| hypothetical protein OsJ_00898 [Oryza sativa Japonica Group]
gi|255673003|dbj|BAF04329.2| Os01g0218900 [Oryza sativa Japonica Group]
Length = 405
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 5/96 (5%)
Query: 135 KHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCC 193
K C CG+ +P S + G C C +L YC +C K++ ++ VCC
Sbjct: 306 KSPGCDICGNRLPCKIASKKKQAGERLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCC 365
Query: 194 DVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
D CQ WVH +C D+ ++ + N Y CP C+
Sbjct: 366 DECQIWVHVEC----DQTCIKMEDLENADYFCPDCK 397
>gi|256070387|ref|XP_002571524.1| zinc finger protein [Schistosoma mansoni]
Length = 1690
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
C ICRR D + + C C+ A+H YC PP K V +G + CP
Sbjct: 1253 CRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCP 1295
>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
Length = 1279
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 65/158 (41%), Gaps = 12/158 (7%)
Query: 39 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP--SCRICEICR 96
LCF E G + S ++CGKKYHR C+ N R +S+KCP C C +
Sbjct: 726 LCFACEQPGGLEGLQTCSVRNCGKKYHRACISN--NPRAALKDNSFKCPLHKCANCTYPQ 783
Query: 97 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKC--HSCGSNVPGNGLSVR 154
+ P + C RC AYH C + ++ LCPKH H+ + G R
Sbjct: 784 ASTYP--LVRCIRCPIAYHTCCVPAGCLHENAIYLLCPKHQPVEKHAKSNICLACGDGGR 841
Query: 155 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 192
F CCD C + + V SE +P C
Sbjct: 842 LF----CCDTCPAAYHQECLKDVLALTGTPSEDSPWYC 875
>gi|301780978|ref|XP_002925893.1| PREDICTED: zinc finger protein neuro-d4-like isoform 2 [Ailuropoda
melanoleuca]
gi|410983104|ref|XP_003997883.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Felis catus]
Length = 380
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 16/124 (12%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 249 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 306
Query: 88 SCRICEICRRTGD----------PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
C+ C +C + + ++ +FC CD YH YC PP G + LC
Sbjct: 307 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 366
Query: 135 KHTK 138
+H K
Sbjct: 367 RHLK 370
>gi|222618974|gb|EEE55106.1| hypothetical protein OsJ_02868 [Oryza sativa Japonica Group]
Length = 1032
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 7/115 (6%)
Query: 117 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-WFLGYTCCDACGRLFVKGNYC 175
+ + P N + P + +C SCG+ P S+ + + C C ++ YC
Sbjct: 355 FFEVPMDGNTTGQPARYKRALQCESCGNCFPNKDPSMMVYVMEQLACRQCAKILRSKEYC 414
Query: 176 PVCLKVYRDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 229
VCLK ++ VCC C+ WVH +CD S+ K L+ + Y CP CR
Sbjct: 415 GVCLKSWQHKCGGRWVCCHGCESWVHAECDKKCSNLKDLR-----DNSYFCPYCR 464
>gi|205830432|ref|NP_004638.2| zinc finger protein neuro-d4 isoform b [Homo sapiens]
gi|297704631|ref|XP_002829198.1| PREDICTED: zinc finger protein neuro-d4 isoform 4 [Pongo abelii]
gi|395847021|ref|XP_003796185.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Otolemur
garnettii]
gi|402905403|ref|XP_003915509.1| PREDICTED: zinc finger protein neuro-d4 isoform 5 [Papio anubis]
gi|313104100|sp|Q92782.2|DPF1_HUMAN RecName: Full=Zinc finger protein neuro-d4; AltName:
Full=BRG1-associated factor 45B; Short=BAF45B; AltName:
Full=D4, zinc and double PHD fingers family 1
Length = 380
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 16/124 (12%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 249 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 306
Query: 88 SCRICEICRRTGD----------PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
C+ C +C + + ++ +FC CD YH YC PP G + LC
Sbjct: 307 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 366
Query: 135 KHTK 138
+H K
Sbjct: 367 RHLK 370
>gi|357616639|gb|EHJ70297.1| hypothetical protein KGM_09919 [Danaus plexippus]
Length = 1569
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 47/119 (39%), Gaps = 19/119 (15%)
Query: 37 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 89
C+ C G+NE ++L C C K YH C K + W W+C +
Sbjct: 1256 CQFCLSGDNED-----QLLLCDGCDKGYHTYCFKPRMEKIPDGDWYCWECVNKARGGSRE 1310
Query: 90 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 148
R+C +C + + C C AYH C +PP + G + C + C S P
Sbjct: 1311 RVCIVCGGAAR-GRALPCALCVRAYHLDCHYPPLTKMPRGKWYCSQ------CASRAPA 1362
>gi|350645335|emb|CCD59958.1| zinc finger protein, putative [Schistosoma mansoni]
Length = 1690
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
C ICRR D + + C C+ A+H YC PP K V +G + CP
Sbjct: 1253 CRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCP 1295
>gi|405959089|gb|EKC25157.1| Bromodomain adjacent to zinc finger domain protein 1A [Crassostrea
gigas]
Length = 1488
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 134
C ICRR GD + + C +CD +H YC P K+V G + CP
Sbjct: 1132 CRICRRKGDAEQMLLCDKCDRGHHMYCLKPRLKHVPKGDWFCP 1174
>gi|332206683|ref|XP_003252426.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4
[Nomascus leucogenys]
Length = 459
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 50/124 (40%), Gaps = 16/124 (12%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLK---NWAQNRDLFHWSSW 84
G N C C G + GC ++SC CG+ H +CL+ N + W
Sbjct: 328 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 385
Query: 85 KCPSCRICEICRRTG-------DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
+C SC +C G ++ +FC CD YH YC PP G + LC
Sbjct: 386 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 445
Query: 135 KHTK 138
+H K
Sbjct: 446 RHLK 449
>gi|145534470|ref|XP_001452979.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420679|emb|CAK85582.1| unnamed protein product [Paramecium tetraurelia]
Length = 286
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 40/69 (57%), Gaps = 4/69 (5%)
Query: 162 CDACGRLFVKGNYCPVCLKVY--RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDG 219
CD C +L+ KGN+C C +VY D E+ V CD CQ+W H C+ + + +Q + +
Sbjct: 87 CDKCSKLYNKGNFCDFCEQVYGSYDDEAV-WVQCDSCQKWNHIVCEQKNRNQNIQIEFET 145
Query: 220 NLQYRCPTC 228
+ QY C TC
Sbjct: 146 S-QYHCLTC 153
>gi|392575621|gb|EIW68754.1| hypothetical protein TREMEDRAFT_63213 [Tremella mesenterica DSM
1558]
Length = 2086
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 32/61 (52%), Gaps = 6/61 (9%)
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 150
+CE+CR G P+K + C +CD YH YC PP K + + + C SC PGNG
Sbjct: 470 VCEVCRSGGAPDKMLLCDKCDCGYHIYCLDPPLKGLPAY-----EEWYCTSCLLG-PGNG 523
Query: 151 L 151
Sbjct: 524 F 524
>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Nasonia vitripennis]
Length = 1382
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 83/220 (37%), Gaps = 52/220 (23%)
Query: 5 EDYARRFESGYVATASKDIAGEEQGQSNT-NVMCRLCFVGENEGC----ER--ARRMLSC 57
E+ A + + + SK+I +E+ + N C C G C ER R S
Sbjct: 619 ENNAYKEDLKEIKAKSKEIQEKEEAAFDYDNFKCIDCLSGVAPPCFVCHERDGERTKCSI 678
Query: 58 KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--------KFMFCRR 109
+CGK YH +CLK+W Q + + CP IC C N KF C +
Sbjct: 679 LACGKHYHPDCLKSWPQCQ--WQGGRLTCPH-HICHTCASDNPQNSHPRSAGEKFAKCVK 735
Query: 110 CDAAYHCYCQHPPHKN--VSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 167
C + YH P + ++ +CPKH K S+ P
Sbjct: 736 CPSTYHASISCLPAGSTILTGSQIVCPKHYK----SSHPP-------------------- 771
Query: 168 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 207
V +C +C +E ++CCD C H +C GI
Sbjct: 772 --VNATWCFLC------TEGGSLICCDTCPTSFHLECLGI 803
>gi|307199491|gb|EFN80104.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Harpegnathos saltator]
Length = 1948
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
GEE Q++ C E C++ ++ C +C + YH CL+ + WS
Sbjct: 359 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 408
Query: 85 KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
CP C E CR D + + C C +AYH +C +PP + G +
Sbjct: 409 -CPHCEGEGITGAADDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 467
Query: 132 LCPK 135
CP+
Sbjct: 468 KCPR 471
>gi|344298355|ref|XP_003420859.1| PREDICTED: zinc finger protein neuro-d4-like [Loxodonta africana]
Length = 264
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 50/124 (40%), Gaps = 16/124 (12%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLK---NWAQNRDLFHWSSW 84
G N C C G + GC ++SC CG+ H +CL+ N + W
Sbjct: 133 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 190
Query: 85 KCPSCRICEICRRTGDP-------NKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
+C SC +C G ++ +FC CD YH YC PP G + LC
Sbjct: 191 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 250
Query: 135 KHTK 138
+H K
Sbjct: 251 QHLK 254
>gi|218187759|gb|EEC70186.1| hypothetical protein OsI_00918 [Oryza sativa Indica Group]
Length = 405
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 5/96 (5%)
Query: 135 KHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCC 193
K C CG+ +P S + G C C +L YC +C K++ ++ VCC
Sbjct: 306 KLPGCDICGNRLPCKIASKKKQAGERLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCC 365
Query: 194 DVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
D CQ WVH +C D+ ++ + N Y CP C+
Sbjct: 366 DECQIWVHVEC----DQTCIKMEDLENADYFCPDCK 397
>gi|115435312|ref|NP_001042414.1| Os01g0218800 [Oryza sativa Japonica Group]
gi|56784088|dbj|BAD81417.1| putative trithorax 3 [Oryza sativa Japonica Group]
gi|113531945|dbj|BAF04328.1| Os01g0218800 [Oryza sativa Japonica Group]
Length = 991
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 139 CHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
C SCG+ VP + G C C +L YC +C K++ ++ VCCD CQ
Sbjct: 342 CDSCGNRVPPKIAKKKKQAGEQLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCCDECQ 401
Query: 198 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
WVH +CD + + N Y CP C+
Sbjct: 402 IWVHVECDLTC----INMEDLENADYFCPDCK 429
>gi|218188422|gb|EEC70849.1| hypothetical protein OsI_02356 [Oryza sativa Indica Group]
Length = 1226
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 11/103 (10%)
Query: 36 MCRLCFVGENEGCERARRMLSC---KSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCR 90
+C++C E E+ +R L C K YH +CLK A ++ L W CPSC
Sbjct: 1074 LCKMCGNPE----EKDKRFLVCGHTHCLYKYYHISCLKATQIASDKQLDK-PCWYCPSC- 1127
Query: 91 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
+C +C D + + C CD AYH YC P ++ G + C
Sbjct: 1128 LCRVCHSDRDDDLTILCDGCDEAYHLYCITPRRTSIPKGKWYC 1170
>gi|380020464|ref|XP_003694103.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Apis florea]
Length = 1964
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
GEE Q++ C E C++ ++ C +C + YH CL+ + WS
Sbjct: 359 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 408
Query: 85 KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
CP C E CR D + + C C +AYH +C +PP + G +
Sbjct: 409 -CPHCEGEGIAGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 467
Query: 132 LCPK 135
CP+
Sbjct: 468 KCPR 471
>gi|110777198|ref|XP_624414.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Apis mellifera]
Length = 1966
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
GEE Q++ C E C++ ++ C +C + YH CL+ + WS
Sbjct: 360 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 409
Query: 85 KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
CP C E CR D + + C C +AYH +C +PP + G +
Sbjct: 410 -CPHCEGEGIAGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 468
Query: 132 LCPK 135
CP+
Sbjct: 469 KCPR 472
>gi|383851360|ref|XP_003701201.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Megachile rotundata]
Length = 1967
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
GEE Q++ C E C++ ++ C +C + YH CL+ + WS
Sbjct: 361 GEEGIQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 410
Query: 85 KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
CP C E CR D + + C C +AYH +C +PP + G +
Sbjct: 411 -CPHCEGEGITGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 469
Query: 132 LCPK 135
CP+
Sbjct: 470 KCPR 473
>gi|355703500|gb|EHH29991.1| hypothetical protein EGK_10556 [Macaca mulatta]
Length = 385
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 16/124 (12%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 254 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 311
Query: 88 SCRICEICRRTGD----------PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
C+ C +C + + ++ +FC CD YH YC PP G + LC
Sbjct: 312 ECKSCSLCGTSENDGASLAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 371
Query: 135 KHTK 138
+H K
Sbjct: 372 RHLK 375
>gi|348534080|ref|XP_003454531.1| PREDICTED: PHD finger protein 10-like [Oreochromis niloticus]
Length = 491
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/99 (23%), Positives = 46/99 (46%), Gaps = 3/99 (3%)
Query: 36 MCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 94
+C +C G+ + + ++ C C H +CL + + W+C C+ C +
Sbjct: 368 ICGICQKGKEANKKGKPEALIHCSECENSGHPSCLDMSEELVSMIQTYRWQCMECKTCTV 427
Query: 95 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 133
C++ ++ MFC CD YH +C ++ +G ++C
Sbjct: 428 CQQPHHEDEMMFCDMCDRGYHTFCVG--MDSIPTGLWIC 464
>gi|218187758|gb|EEC70185.1| hypothetical protein OsI_00917 [Oryza sativa Indica Group]
Length = 991
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 139 CHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 197
C SCG+ VP + G C C +L YC +C K++ ++ VCCD CQ
Sbjct: 342 CDSCGNRVPPKIAKKKKQAGEQLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCCDECQ 401
Query: 198 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 229
WVH +CD + + N Y CP C+
Sbjct: 402 IWVHVECDLTC----INMEDLENADYFCPDCK 429
>gi|449444070|ref|XP_004139798.1| PREDICTED: uncharacterized protein LOC101205573 [Cucumis sativus]
Length = 574
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 62 KKYHRNCL-KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 120
K YH CL K ++ D + W CPSC +C C D +K + C CD +H YC
Sbjct: 434 KCYHTRCLTKKQLKSYD----ACWYCPSC-LCRACLINQDDDKIVLCDGCDHGFHIYCMR 488
Query: 121 PPHKNVSSGPYLCPK 135
PP + G + C K
Sbjct: 489 PPLAAIPKGKWFCSK 503
>gi|340726708|ref|XP_003401695.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Bombus terrestris]
Length = 1974
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
GEE Q++ C E C++ ++ C +C + YH CL+ + WS
Sbjct: 369 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 418
Query: 85 KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
CP C E CR D + + C C +AYH +C +PP + G +
Sbjct: 419 -CPHCEGEGIAGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 477
Query: 132 LCPK 135
CP+
Sbjct: 478 KCPR 481
>gi|350424653|ref|XP_003493868.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Bombus impatiens]
Length = 1965
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 24/124 (19%)
Query: 25 GEEQGQSNTNVMCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSW 84
GEE Q++ C E C++ ++ C +C + YH CL+ + WS
Sbjct: 360 GEEGLQTDHQDYC--------EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS-- 409
Query: 85 KCPSCR-------------ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY 131
CP C E CR D + + C C +AYH +C +PP + G +
Sbjct: 410 -CPHCEGEGIAGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDW 468
Query: 132 LCPK 135
CP+
Sbjct: 469 KCPR 472
>gi|335289600|ref|XP_003355929.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Sus scrofa]
gi|119577165|gb|EAW56761.1| D4, zinc and double PHD fingers family 1, isoform CRA_b [Homo
sapiens]
Length = 353
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 16/124 (12%)
Query: 29 GQSNTNVMCRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP 87
G N C C G + GC ++SC CG+ H +CL+ W+C
Sbjct: 222 GTVIPNGYCDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCI 279
Query: 88 SCRICEICRRTGD----------PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCP 134
C+ C +C + + ++ +FC CD YH YC PP G + LC
Sbjct: 280 ECKSCSLCGTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCL 339
Query: 135 KHTK 138
+H K
Sbjct: 340 RHLK 343
>gi|432892838|ref|XP_004075862.1| PREDICTED: tyrosine-protein kinase BAZ1B-like [Oryzias latipes]
Length = 1572
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 6/63 (9%)
Query: 92 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 151
C++CRR GD K + C C+ A+H +C P + +G +LCP +C V G
Sbjct: 1183 CKVCRRKGDDEKLILCDECNKAFHLFCLRPALYRIPTGEWLCP------ACQPTVARRGS 1236
Query: 152 SVR 154
+R
Sbjct: 1237 RLR 1239
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.132 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,315,885,204
Number of Sequences: 23463169
Number of extensions: 545413839
Number of successful extensions: 1520511
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2132
Number of HSP's successfully gapped in prelim test: 2700
Number of HSP's that attempted gapping in prelim test: 1501520
Number of HSP's gapped (non-prelim): 16224
length of query: 754
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 603
effective length of database: 8,816,256,848
effective search space: 5316202879344
effective search space used: 5316202879344
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 81 (35.8 bits)