BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 005013
(719 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449436315|ref|XP_004135938.1| PREDICTED: uncharacterized protein LOC101208296 [Cucumis sativus]
gi|449488832|ref|XP_004158186.1| PREDICTED: uncharacterized protein LOC101230410 [Cucumis sativus]
Length = 847
Score = 927 bits (2396), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/735 (65%), Positives = 559/735 (76%), Gaps = 52/735 (7%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF GENE ERAR+MLSCK+CGKKYHR+CLK+WAQ+RDLFHWSSW CPSCR CE+C
Sbjct: 149 MCRICFFGENESSERARKMLSCKTCGKKYHRSCLKSWAQHRDLFHWSSWTCPSCRACEVC 208
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RRTGDPNKFMFC+RCD AYHCYCQHPPHKNVSSGPYLCPKHT+CHSCGSNVPGNG SVRW
Sbjct: 209 RRTGDPNKFMFCKRCDGAYHCYCQHPPHKNVSSGPYLCPKHTRCHSCGSNVPGNGQSVRW 268
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYT CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCD+CQRWVHC CD ISDEKYLQF
Sbjct: 269 FLGYTFCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDICQRWVHCHCDSISDEKYLQF 328
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q+DGNLQY+C CRGECYQV++LEDAV+E+WRR+D AD+DLI +LRAAAGLPT+DEIFSI
Sbjct: 329 QIDGNLQYKCTACRGECYQVKNLEDAVQEIWRRRDEADRDLIVNLRAAAGLPTQDEIFSI 388
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLN 300
SPYSDDEENGP V+KNEFGRSLKLSLKG DK PKK K++GKK NKKY ++KG PL
Sbjct: 389 SPYSDDEENGPAVVKNEFGRSLKLSLKGFADKVPKKSKDYGKKSSNKKYAKEKG--TPLA 446
Query: 301 SKPEPDQSFEGYHDVHSYGNSFGDDTQSP-----KNEGLDIPSSVAGIVSHTEGVCSISQ 355
++ E DQ+FE +DV G G++ NEGLD S VAG +SH EG CS++Q
Sbjct: 447 NQSELDQNFEVRNDVQQSGFGEGNEKNGGLLPQNNNEGLDT-SPVAGSLSHNEGTCSVNQ 505
Query: 356 PGILKHKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLG 414
PG+LKHK+VDEVMVSD++K S+ V+ K SK LD+GED GK+ SKSKT K KKLVINLG
Sbjct: 506 PGVLKHKFVDEVMVSDEEKTSKVVQIKASKAQGLDTGEDSGKYASKSKTAKGKKLVINLG 565
Query: 415 ARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDH 474
ARKINV SP+SDASSCQR QDL SN G++V++
Sbjct: 566 ARKINVATSPKSDASSCQRGQDLAVSN---------------------------GEKVNN 598
Query: 475 SSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHE---PEHMHVLSGKRNID 531
SSQS GLK +V FG+VR SD+NT RG++A E P+ V S KRN++
Sbjct: 599 SSQSTGLKAGETENSVPSFGKVRFGSSDTNTTFGRGNTASGSEVGPPDGTRVFSRKRNME 658
Query: 532 RSRAAVSRVGEVAALRGDR----KQLESRPNASRESNDD---TSVLQSLPKDSKPPLRLK 584
S AV +G V+ ++ ++ KQLES + + +DD T + QSLP+DSKP L+ K
Sbjct: 659 GSTPAVGSLGGVSTVKEEKVPSGKQLESGSHICNDGHDDNGQTPLPQSLPRDSKPLLKFK 718
Query: 585 FRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEI 644
F+KP L+N Q+S EEEKSL+KGQRSKRKRPSP EK FNE ED +S+QD+L+
Sbjct: 719 FKKPPLDN---QISCHEEEKSLVKGQRSKRKRPSPLMEKVPFNEVEDLTRSHQDNLLD-- 773
Query: 645 MDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGK 704
DANWILKKLGKDAIGKRVEV SD SW KGVV D ++GTSTLS+ LDD R KTLELGK
Sbjct: 774 -DANWILKKLGKDAIGKRVEVQHPSDKSWQKGVVRDMIDGTSTLSVALDDGREKTLELGK 832
Query: 705 QGVRFVPQKQKRSMS 719
QG+R VP KQKRS S
Sbjct: 833 QGIRLVPLKQKRSKS 847
>gi|224106097|ref|XP_002314042.1| predicted protein [Populus trichocarpa]
gi|222850450|gb|EEE87997.1| predicted protein [Populus trichocarpa]
Length = 845
Score = 909 bits (2350), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/735 (64%), Positives = 553/735 (75%), Gaps = 53/735 (7%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C++CFVG+ G ERAR+ML CKSCGKKYHR+CLK WA++RDLFHWSSW CPSC+ CE+C
Sbjct: 144 FCQICFVGQTGGSERARKMLPCKSCGKKYHRSCLKTWARHRDLFHWSSWTCPSCQTCEVC 203
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
R+TGDPNKF+FC+RCD AYHCYCQHPPHKNVSSGPYLCPKHT+CHSCGS+VPGNGLSVRW
Sbjct: 204 RKTGDPNKFVFCKRCDGAYHCYCQHPPHKNVSSGPYLCPKHTRCHSCGSSVPGNGLSVRW 263
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCD+CQRWVHC CDGISDEKYLQF
Sbjct: 264 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDICQRWVHCHCDGISDEKYLQF 323
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
QVDGNLQY+C TCRGECYQV+DL+DA++ELWRR+D AD+ LIASLRAAAGLP +++IFSI
Sbjct: 324 QVDGNLQYQCATCRGECYQVKDLKDAIQELWRRRDKADRGLIASLRAAAGLPAQEDIFSI 383
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLN 300
SPYSD + NGP L+N+F S+ LSLKG+ KSPKK +HGKK NKK+P+KKG
Sbjct: 384 SPYSDGDGNGPEALRNDFRHSINLSLKGIGGKSPKKSNDHGKKHWNKKFPKKKGCHAASI 443
Query: 301 SKPEPDQSFEGYHDVHSYGNSFGD----DTQSPKNEGLDIP-SSVAGIVSHTEGVCSISQ 355
SK EP Q HD+HS + D D++S G D S VAGIV+HTEGVCSISQ
Sbjct: 444 SKSEPHQ-----HDIHSSVHDMDDCKIYDSESQAKGGSDKSCSPVAGIVNHTEGVCSISQ 498
Query: 356 PGILKHKYVDEVMVSDDDKISRV-KFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLG 414
PG+LKHK+VDEVMVSD ++ S V K K++KPHD+DSG D KH KSK++KAK+LVINLG
Sbjct: 499 PGVLKHKFVDEVMVSDGERTSNVFKIKSNKPHDVDSGGDTEKHAGKSKSVKAKRLVINLG 558
Query: 415 ARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDH 474
ARKINV++ P+SD SCQ E DL SN D DH
Sbjct: 559 ARKINVSSPPKSDVQSCQSELDLKASNR---------------------------DTADH 591
Query: 475 SSQSRGL-KIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEP---EHMHVLSGKRNI 530
S Q+RGL K A R GN+IKFG+V+ E S+ N K GS +D +E +H V S K+++
Sbjct: 592 SGQTRGLIKFARREGNLIKFGKVKAEASNFNPKSDGGSHSDGYETVPLDHARVSSAKKSL 651
Query: 531 DRSRAAVSRVG-EVAALRGDR----KQLESRPNASRESNDD---TSVLQSLPKDSKPPLR 582
+ SRA V G EV LR D+ KQ E RP+ ESN D T + SLPK+SK L+
Sbjct: 652 EGSRAVVRPAGGEVPTLRSDKLSLGKQSEVRPDTHTESNGDSGDTPIFHSLPKESKLSLK 711
Query: 583 LKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMS 642
LK +KPNLENQ+S + EEEKS I+GQRSKRKR S EKT++NEDE S+ DS M+
Sbjct: 712 LKIKKPNLENQSSLIHLHEEEKSNIRGQRSKRKRASSLMEKTMYNEDEGMPPSHLDSEMT 771
Query: 643 EIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLEL 702
E AN ILKKLGKDAIGKRVEVHQ SDNSWHKGVV+D VEGTS LS+TLDD VKTL+L
Sbjct: 772 E---ANRILKKLGKDAIGKRVEVHQPSDNSWHKGVVSDIVEGTSKLSVTLDDGIVKTLKL 828
Query: 703 GKQGVRFVPQKQKRS 717
GKQ VR V QKQKRS
Sbjct: 829 GKQAVRIVSQKQKRS 843
>gi|356544287|ref|XP_003540585.1| PREDICTED: uncharacterized protein LOC100815407 [Glycine max]
Length = 845
Score = 853 bits (2205), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/728 (61%), Positives = 544/728 (74%), Gaps = 33/728 (4%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
CR+C GENEG E+A++MLSCKSCGKKYHRNCL++W +NRDLFHWSSW CP CRICE CR
Sbjct: 137 CRICKCGENEGSEKAQKMLSCKSCGKKYHRNCLRSWGRNRDLFHWSSWTCPLCRICEACR 196
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
RTGDP+KFMFC+RCD AYHCYC PPHK+V +GPYLC KH +CHSCGSNVPGNGLSVRWF
Sbjct: 197 RTGDPSKFMFCKRCDGAYHCYCLQPPHKSVCNGPYLCTKHARCHSCGSNVPGNGLSVRWF 256
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQ 181
+ YT CDACGRLF KGNYCPVCLKVYRDSESTPMVCCD CQ WVHCQCD IS+EKY QFQ
Sbjct: 257 MAYTNCDACGRLFTKGNYCPVCLKVYRDSESTPMVCCDTCQLWVHCQCDNISEEKYHQFQ 316
Query: 182 VDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSIS 241
VDGNLQY+CPTCRGECYQV++ EDA +E+WRR+++A++DLI+SLRAAAGLPT++EIFSIS
Sbjct: 317 VDGNLQYKCPTCRGECYQVKNPEDAAQEIWRRRNIAERDLISSLRAAAGLPTQEEIFSIS 376
Query: 242 PYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNS 301
P+SDDE++GP+ LK+E RS K SLK + + SPKK +KK +KK Q + S
Sbjct: 377 PFSDDEDSGPLKLKSESARSFKFSLKNLANDSPKKKTS------SKKTAKKKNSQSFMTS 430
Query: 302 KPEPDQSFEGYHDV---HSYGNSFGDDTQSPKNEGLDIPSSVA-GIVSHTEGVCSISQPG 357
K + S EG+ D+ HS + DD QS +NEG D+ SS A G +S TE I+QPG
Sbjct: 431 KIDTHNSCEGHSDIKSLHSLDDDKNDDIQSQRNEGPDVYSSPATGSLSQTEASFPINQPG 490
Query: 358 ILKHKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGAR 416
ILK K+VDEVMVSD+++ R V+ K++K H DS E+ GKH K++ +K KKLVINLGAR
Sbjct: 491 ILKQKFVDEVMVSDEERKPRVVRIKSNKAHIPDSEEESGKHSLKTQNVKGKKLVINLGAR 550
Query: 417 KINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKL--GDGDRVDH 474
KINV +SPRSD+SSCQ++QD T NG ED S R KF LDR D +++ G G +VD
Sbjct: 551 KINVASSPRSDSSSCQKDQDPVTVNGNEDRSQWRKGDKFALDRQDDTARHIDGKGIKVD- 609
Query: 475 SSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSR 534
S QS+ +++GR GN+IK G+V+ ++S+ N RG+ +D K +ID
Sbjct: 610 SGQSKFFRVSGREGNLIKLGKVKPDISEFNLTSGRGNMSDGRI---------KHSID--- 657
Query: 535 AAVSRVGEVAALRGDRKQLESRPNASRES-----NDDTSVLQSLPKDSKPPLRLKFRKPN 589
+++VG A RG+R L + S ++ N++ + SLPKDSKP LR KF+KP+
Sbjct: 658 GMINQVGIKATSRGERTYLGRQSEGSSDAYETDDNNNRTPSHSLPKDSKPLLRFKFKKPS 717
Query: 590 LENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANW 649
+E+QNS EEEK IKGQRSKRKRPSPF EK FNE E +QS+QDS M IMDANW
Sbjct: 718 IESQNS--PHQEEEKMTIKGQRSKRKRPSPFKEKASFNESEGVSQSHQDSAMDGIMDANW 775
Query: 650 ILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRF 709
IL KLG DAIGKRVEVHQ SDNSWHKG+VTD VEGTS L + LDD +VKT+EL KQGVRF
Sbjct: 776 ILMKLGNDAIGKRVEVHQTSDNSWHKGLVTDVVEGTSKLYVALDDGKVKTVELRKQGVRF 835
Query: 710 VPQKQKRS 717
VPQKQKRS
Sbjct: 836 VPQKQKRS 843
>gi|356529861|ref|XP_003533505.1| PREDICTED: uncharacterized protein LOC100809429 [Glycine max]
Length = 820
Score = 841 bits (2173), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/726 (60%), Positives = 529/726 (72%), Gaps = 58/726 (7%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
CR+C GENEG E+A++MLSCKSCGKKYHRNCL++W +NRDLFHWSSW CP CRICE CR
Sbjct: 141 CRICKCGENEGSEKAQKMLSCKSCGKKYHRNCLRSWGRNRDLFHWSSWTCPLCRICEACR 200
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
RTGDP+KFMFC+RCD AYHCYC PPHK+V +GPYLC KH +CHSCGSNVPGNGLSVRWF
Sbjct: 201 RTGDPSKFMFCKRCDGAYHCYCLQPPHKSVCNGPYLCTKHARCHSCGSNVPGNGLSVRWF 260
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQ 181
+ YT CDACGRLF KGNYCPVCLKVYRDSESTPMVCCD CQ WVHCQCD ISDEKY QFQ
Sbjct: 261 MSYTNCDACGRLFTKGNYCPVCLKVYRDSESTPMVCCDSCQLWVHCQCDNISDEKYHQFQ 320
Query: 182 VDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSIS 241
+DGNLQY+CPTCRGECYQV++ EDA RE+WRR+++A++DLIASLRAAAGLPT++EIFSIS
Sbjct: 321 LDGNLQYKCPTCRGECYQVKNPEDAAREIWRRRNIAERDLIASLRAAAGLPTQEEIFSIS 380
Query: 242 PYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNS 301
P+SDDE++GP+ LK+E RS K SLK + + SPKK +KK +KK Q+ + S
Sbjct: 381 PFSDDEDSGPLKLKSESARSFKFSLKNLANDSPKKKSS------SKKTAKKKDSQLFMTS 434
Query: 302 KPEPDQSFEGYHDV---HSYGNSFGDDTQSPKNEGLDIPSS-VAGIVSHTEGVCSISQPG 357
K + S EG+ D+ HS + DD QS +NEG D+ SS AG +S TE I QPG
Sbjct: 435 KIDTHNSCEGHSDIKSLHSLDDDKNDDIQSQRNEGPDVYSSPAAGSLSQTEASFPIDQPG 494
Query: 358 ILKHKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGAR 416
ILK K+VDEVMVSD+++ R V+ K++K DS E+ GKH K++ +K KKLVINLGAR
Sbjct: 495 ILKQKFVDEVMVSDEERKPRVVRIKSNKALIPDSEEESGKHSLKTQNVKGKKLVINLGAR 554
Query: 417 KINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSS 476
KINV +SPRSD SSCQ++QD T N G++VD S
Sbjct: 555 KINVASSPRSDTSSCQKDQDPVTVN---------------------------GNKVD-SG 586
Query: 477 QSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAA 536
QS+ +++GR GN+IK G+V+ +VS+ N RG+ +D K +ID
Sbjct: 587 QSKIFRVSGREGNLIKLGKVKPDVSEFNLTSGRGNMSDGRI---------KHSID---GM 634
Query: 537 VSRVGEVAALRGDRKQLESRPNASRES-----NDDTSVLQSLPKDSKPPLRLKFRKPNLE 591
+++VG A RG+R L + S ++ N++ + SLPKDSKP LR KF+KP++E
Sbjct: 635 INQVGIKAPSRGERTYLGKQSEGSSDAYETDDNNNRTPSHSLPKDSKPLLRFKFKKPSIE 694
Query: 592 NQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWIL 651
+QNS SQ EEEK IKGQRSKRKRPSPF EKT FNE E +QS QDS M IMDANWIL
Sbjct: 695 SQNS--SQQEEEKMTIKGQRSKRKRPSPFKEKTTFNESEGVSQSRQDSAMDGIMDANWIL 752
Query: 652 KKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVP 711
KLG DAIGKRVEVHQ SDNSWHKGVVTD VEGTS L + LDD +VK +EL KQGVRFVP
Sbjct: 753 MKLGNDAIGKRVEVHQTSDNSWHKGVVTDVVEGTSKLYVALDDGKVKNVELRKQGVRFVP 812
Query: 712 QKQKRS 717
QKQKRS
Sbjct: 813 QKQKRS 818
>gi|297833588|ref|XP_002884676.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297330516|gb|EFH60935.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 778
Score = 789 bits (2038), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/724 (58%), Positives = 497/724 (68%), Gaps = 97/724 (13%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF+GE EG ERARRMLSCK+CGKKYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+C
Sbjct: 147 MCRMCFLGEGEGSERARRMLSCKTCGKKYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVC 206
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RRTGDPNKFMFC+RCDAAYHCYCQHPPHKNVSSGPYLCPKHT+CHSC S VPGNGLSVRW
Sbjct: 207 RRTGDPNKFMFCKRCDAAYHCYCQHPPHKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRW 266
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FL YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCD+CQRWVHC CDGISD+KYLQF
Sbjct: 267 FLSYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDICQRWVHCHCDGISDDKYLQF 326
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
QVDG LQY+C TCRGECYQV+DL+DAV+ELW++KD+ DK+LIASLRAAAGLPT++EIFSI
Sbjct: 327 QVDGKLQYKCATCRGECYQVKDLQDAVQELWKKKDVVDKELIASLRAAAGLPTDEEIFSI 386
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLN 300
P+SDD+ENGPV GRSLK S+KG+V+KSPKK KE+GK L+KK+ KKG L
Sbjct: 387 FPFSDDDENGPVS-----GRSLKFSIKGLVEKSPKKSKEYGKHSLSKKHASKKGSHTKL- 440
Query: 301 SKPEPDQSFEGYHDVHSYGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILK 360
EP+ E + G D+ NE D+ SSVAGI CS +P I+K
Sbjct: 441 ---EPELHQEVGSERLRLGGVRIDNVGFQINEQSDVNSSVAGI-------CSTHEPKIVK 490
Query: 361 HKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKIN 419
HK VD+VMV+D++K SR V+ K SKPHD DS ED ++ + K++KAKKLVINLGARKIN
Sbjct: 491 HKRVDDVMVTDEEKPSRIVRIKCSKPHDSDS-EDTLRNAGEEKSVKAKKLVINLGARKIN 549
Query: 420 VTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSR 479
V+ S +S+ S L R S LG GD+VD + + R
Sbjct: 550 VSGSSKSNVVS-------------------------HLSRDKDQSTLG-GDKVDQTGEVR 583
Query: 480 GLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSR 539
LKI+GR FG+ + E S
Sbjct: 584 TLKISGR------FGKTQSEGS-------------------------------------- 599
Query: 540 VGEVAALRGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-S 598
A G Q + + +D TS+ +L K+++P L+ K RKPN +Q S V +
Sbjct: 600 ----KATFGSITQFPASTSEGNHVDDKTSISPALQKEARPLLKFKLRKPNSGDQTSSVTT 655
Query: 599 QPEEEK-SLIKGQRSKRKRPSPFTEKTLFNEDEDA-AQSNQDSLMS-EIMDANWILKKLG 655
Q E+EK S KGQRSKRKRPS + ED +A S+QDS + E+MDANWILKKLG
Sbjct: 656 QSEDEKLSSAKGQRSKRKRPSSLVDMASLKEDGEATTHSHQDSSRNDEMMDANWILKKLG 715
Query: 656 KDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 715
KD+IGKRVEVH S NSWHKG VTD TSTLS++LDD +KT ELGK VRF+PQKQK
Sbjct: 716 KDSIGKRVEVH-GSQNSWHKGTVTDVSGDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQK 774
Query: 716 RSMS 719
RS S
Sbjct: 775 RSRS 778
>gi|297736278|emb|CBI24916.3| unnamed protein product [Vitis vinifera]
Length = 679
Score = 789 bits (2037), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/520 (74%), Positives = 436/520 (83%), Gaps = 11/520 (2%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF GE EG ERAR+ML C SCGKKYHR CLK+W+QNRDLFHWSSW CPSCRICE+C
Sbjct: 143 MCRICFFGEMEGSERARKMLPCNSCGKKYHRLCLKSWSQNRDLFHWSSWTCPSCRICEVC 202
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RR+GDPNKFMFCRRCD AYHCYCQ PPHKNVSSGPYLCPKHT+CHSCGSNVPGNGLSVRW
Sbjct: 203 RRSGDPNKFMFCRRCDDAYHCYCQQPPHKNVSSGPYLCPKHTRCHSCGSNVPGNGLSVRW 262
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF
Sbjct: 263 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 322
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
QVDGNLQY+C TCRGECYQV+DLEDAV+ELWRR+D AD+DLIASLRA A LPT+DEIFSI
Sbjct: 323 QVDGNLQYKCATCRGECYQVKDLEDAVQELWRRRDKADRDLIASLRAKARLPTQDEIFSI 382
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLN 300
SPYSDDEENGPV LK+EFGRSLKLSLKG VDKSPKK KE+GK+ NKK +KKG+Q PL
Sbjct: 383 SPYSDDEENGPVSLKSEFGRSLKLSLKGSVDKSPKKTKEYGKQSSNKKNVKKKGHQTPLI 442
Query: 301 SKPEPDQSFEGYHDVHSYGNSFGDD--TQSPKNEGLDIPSS-VAGIVSHTEGVCSISQPG 357
SK E QSFEG+ D + S GDD Q +++G + SS VAG +SHTEG+CSI+QPG
Sbjct: 443 SKKESHQSFEGHDDAQPFEYSLGDDKNEQPNRSDGRGVFSSPVAGSLSHTEGICSINQPG 502
Query: 358 ILKHKYVDEVMVSDDDKISRV-KFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGAR 416
+LKHK+VDE+ V+++D+ SRV + K++KPH D GED GK SKSKT+K KLVI+LGAR
Sbjct: 503 VLKHKFVDEIAVNNEDRTSRVIQIKSNKPHGSDVGEDTGKQASKSKTMKGTKLVIHLGAR 562
Query: 417 KINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGD--GDRVDH 474
NVTNSPRSDASSCQREQDLTTSNG ED S QRM D+HD +K GD GD++D+
Sbjct: 563 NRNVTNSPRSDASSCQREQDLTTSNGSEDTSQQRMG-----DKHDRIAKFGDSKGDKIDY 617
Query: 475 SSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSAD 514
S Q++G K GR GN+IK G+VR E S+ N K RG+ D
Sbjct: 618 SGQAKGSKHGGREGNLIKLGKVRTEPSEMNPKFGRGNKDD 657
>gi|145338256|ref|NP_187459.2| PHD finger-containing protein [Arabidopsis thaliana]
gi|110739634|dbj|BAF01725.1| hypothetical protein [Arabidopsis thaliana]
gi|110741394|dbj|BAF02246.1| hypothetical protein [Arabidopsis thaliana]
gi|332641110|gb|AEE74631.1| PHD finger-containing protein [Arabidopsis thaliana]
Length = 779
Score = 770 bits (1988), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/725 (57%), Positives = 494/725 (68%), Gaps = 98/725 (13%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF+GE EG +RARRMLSCK CGKKYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+C
Sbjct: 147 MCRMCFLGEGEGSDRARRMLSCKDCGKKYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVC 206
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RRTGDPNKFMFC+RCDAAYHCYCQHPPHKNVSSGPYLCPKHT+CHSC S VPGNGLSVRW
Sbjct: 207 RRTGDPNKFMFCKRCDAAYHCYCQHPPHKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRW 266
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FL YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCD+CQRWVHC CDGISD+KY+QF
Sbjct: 267 FLSYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDICQRWVHCHCDGISDDKYMQF 326
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
QVDG LQY+C TCRGECYQV+DL+DAV+ELW++KD+ DK+LIASLRAAAGLPTE+EIFSI
Sbjct: 327 QVDGKLQYKCATCRGECYQVKDLQDAVQELWKKKDVVDKELIASLRAAAGLPTEEEIFSI 386
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLN 300
P+SDDEENGPV GRSLK S+KG+V+KSPKK KE+G +K + +
Sbjct: 387 FPFSDDEENGPVS-----GRSLKFSIKGLVEKSPKKSKEYG----KHSSSKKHASKKGSH 437
Query: 301 SKPEPDQSFEGYHDVHSYGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILK 360
+K EP+ E + G D+ NE D+ SSVAGI CS +P I+K
Sbjct: 438 TKLEPEVHQEIGSERRRLGGVRIDNVGFQINEQSDVNSSVAGI-------CSTHEPKIVK 490
Query: 361 HKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKIN 419
HK VD+VMV+D++K SR V+ K SKPHD DS ED ++ + K++KAKKLVINLGARKIN
Sbjct: 491 HKRVDDVMVTDEEKPSRIVRIKCSKPHDSDS-EDTLRNAGEEKSVKAKKLVINLGARKIN 549
Query: 420 VTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSR 479
V+ S +S+ S L R S LG GD+VD + + R
Sbjct: 550 VSGSSKSNVVS-------------------------HLSRDKDQSTLG-GDKVDQTGEVR 583
Query: 480 GLKIAGRGGNVIKFGRVRQEVSDSN-TKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVS 538
LKI+GR FG+ + E S + V++ +A E H+
Sbjct: 584 TLKISGR------FGKTQSEGSKATFGSVTQFPAASTSEGNHV----------------- 620
Query: 539 RVGEVAALRGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV- 597
+D TS+ +L K+++P L+ K RKPN +Q S V
Sbjct: 621 -------------------------DDKTSISPALQKEARPLLKFKLRKPNSGDQTSSVT 655
Query: 598 SQPEEEK-SLIKGQRSKRKRPSPFTEKTLFNEDEDA-AQSNQD-SLMSEIMDANWILKKL 654
+Q E+EK S KGQRSKRKRPS + ED +A S+QD S E+MDANWILKKL
Sbjct: 656 TQSEDEKLSSAKGQRSKRKRPSSLVDMASLKEDGEATTHSHQDNSRNDEMMDANWILKKL 715
Query: 655 GKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQ 714
GKD+IGKRVEVH S NSW KG VTD TSTLS++LDD +KT ELGK VRF+PQKQ
Sbjct: 716 GKDSIGKRVEVH-GSQNSWRKGTVTDVSGDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQ 774
Query: 715 KRSMS 719
KRS S
Sbjct: 775 KRSRS 779
>gi|6648214|gb|AAF21212.1|AC013483_36 unknown protein [Arabidopsis thaliana]
Length = 764
Score = 733 bits (1892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/725 (55%), Positives = 480/725 (66%), Gaps = 113/725 (15%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF+GE EG +RARRMLSCK CGKKYH+NCLK+WAQ+RDLFHWSSW CPSCR+CE+C
Sbjct: 147 MCRMCFLGEGEGSDRARRMLSCKDCGKKYHKNCLKSWAQHRDLFHWSSWSCPSCRVCEVC 206
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RRTGDPNKFMFC+RCDAAYHCYCQHPPHKNVSSGPYLCPKHT+CHSC S VPGNGLSVRW
Sbjct: 207 RRTGDPNKFMFCKRCDAAYHCYCQHPPHKNVSSGPYLCPKHTRCHSCDSTVPGNGLSVRW 266
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FL YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCD+CQRWVHC CDGISD+KY+QF
Sbjct: 267 FLSYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDICQRWVHCHCDGISDDKYMQF 326
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
QVDG LQY+C TCRGECYQV+DL+DAV+ELW++KD+ DK+LIASLRAAA
Sbjct: 327 QVDGKLQYKCATCRGECYQVKDLQDAVQELWKKKDVVDKELIASLRAAA----------- 375
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLN 300
DDEENGPV GRSLK S+KG+V+KSPKK KE+G +K + +
Sbjct: 376 ----DDEENGPVS-----GRSLKFSIKGLVEKSPKKSKEYG----KHSSSKKHASKKGSH 422
Query: 301 SKPEPDQSFEGYHDVHSYGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILK 360
+K EP+ E + G D+ NE D+ SSVAGI CS +P I+K
Sbjct: 423 TKLEPEVHQEIGSERRRLGGVRIDNVGFQINEQSDVNSSVAGI-------CSTHEPKIVK 475
Query: 361 HKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKIN 419
HK VD+VMV+D++K SR V+ K SKPHD DS ED ++ + K++KAKKLVINLGARKIN
Sbjct: 476 HKRVDDVMVTDEEKPSRIVRIKCSKPHDSDS-EDTLRNAGEEKSVKAKKLVINLGARKIN 534
Query: 420 VTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSR 479
V+ S +S+ S L R S LG GD+VD + + R
Sbjct: 535 VSGSSKSNVVS-------------------------HLSRDKDQSTLG-GDKVDQTGEVR 568
Query: 480 GLKIAGRGGNVIKFGRVRQEVSDSN-TKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVS 538
LKI+GR FG+ + E S + V++ +A E H+
Sbjct: 569 TLKISGR------FGKTQSEGSKATFGSVTQFPAASTSEGNHV----------------- 605
Query: 539 RVGEVAALRGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV- 597
+D TS+ +L K+++P L+ K RKPN +Q S V
Sbjct: 606 -------------------------DDKTSISPALQKEARPLLKFKLRKPNSGDQTSSVT 640
Query: 598 SQPEEEK-SLIKGQRSKRKRPSPFTEKTLFNEDEDA-AQSNQD-SLMSEIMDANWILKKL 654
+Q E+EK S KGQRSKRKRPS + ED +A S+QD S E+MDANWILKKL
Sbjct: 641 TQSEDEKLSSAKGQRSKRKRPSSLVDMASLKEDGEATTHSHQDNSRNDEMMDANWILKKL 700
Query: 655 GKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQ 714
GKD+IGKRVEVH S NSW KG VTD TSTLS++LDD +KT ELGK VRF+PQKQ
Sbjct: 701 GKDSIGKRVEVH-GSQNSWRKGTVTDVSGDTSTLSVSLDDGSIKTFELGKHSVRFIPQKQ 759
Query: 715 KRSMS 719
KRS S
Sbjct: 760 KRSRS 764
>gi|413916644|gb|AFW56576.1| RING/FYVE/PHD-type zinc finger family protein [Zea mays]
Length = 819
Score = 601 bits (1550), Expect = e-169, Method: Compositional matrix adjust.
Identities = 340/735 (46%), Positives = 443/735 (60%), Gaps = 74/735 (10%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCRLCF GENEG +A +ML CK C K+YHRNCLK+W ++RDLFHWSSW CPSCR CE+C
Sbjct: 135 MCRLCFSGENEGSTKAAKMLPCKLCSKRYHRNCLKSWGEHRDLFHWSSWVCPSCRSCEVC 194
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RR GDPNK MFC+RCD YHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 195 RRPGDPNKLMFCKRCDDPYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 254
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 255 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 314
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q D NLQY C CRGEC Q+RD EDA+RELW+R+D+AD +L+ +LRAAA LP+ +++ +
Sbjct: 315 QADQNLQYTCAACRGECSQIRDTEDAIRELWKRRDVADHELMITLRAAAKLPSLEDVSPL 374
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGY 295
P SDDE+ G VLK+E +LK SLK K P E K NKK +KKG
Sbjct: 375 YPNSDDEKLGAYVLKSESRNTLKFSLKSNSSKPPPDTPEQEKVVFKSSGSNKKPSKKKGG 434
Query: 296 QMPLNSKPEPDQSFEGYHDVHSYGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQ 355
Q + + E HDV S + GD + ++ + +S + + S
Sbjct: 435 QGNKTNDGHDEIFLERRHDVKSSNSRLGDQSIDGNHDMSPFKNDDNAYISSS----TRSS 490
Query: 356 PGILKHKYVDEVMVSDDDKISRVKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGA 415
LK + V ++ D I +VK K SK L +D ++ SK+ T KA KLVI+LG+
Sbjct: 491 EKNLKSPSMKAV-TNNADMIPKVKIKGSKVSSLHY-KDGEENTSKADTGKATKLVIHLGS 548
Query: 416 RKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHS 475
R + SP+S+ S+ QREQDL + + G ++D +
Sbjct: 549 RHKTRSGSPKSELSNYQREQDLGSIH---------------------------GRKLDVT 581
Query: 476 SQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRA 535
SQ +G + + +V+K R V N+ + ++ +H +GK RS A
Sbjct: 582 SQLKGSRSEVKERSVMKLVR-ETGVQQRNSLLGDLGTSKKH-------ATGK----RSNA 629
Query: 536 AVSRVGEVAALRGDRKQLESRPNASRESN----DDTSVLQSLPKDSKPP-LRLKFRKPNL 590
+S G+ +RP A ++S+ D+ P + KP L+LKF++P+
Sbjct: 630 LIS-----GMENGNETGTRNRPFAQKQSHSSQVDENQGTADSPDNLKPSLLKLKFKRPHY 684
Query: 591 ENQNSQVSQPEEEKSLI----------KGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSL 640
E N+Q SQPEE S + KGQRSKRKRPS EK + A+ + S
Sbjct: 685 EQLNTQASQPEEPTSWVSQQEDQFNVAKGQRSKRKRPS--MEKADGLDGTTPAKRHHQST 742
Query: 641 MSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTL 700
E+MDANWIL+KLGKDAIGKR+EVH SD WH+G+V++ + G TL I LD+ R + +
Sbjct: 743 DDEVMDANWILRKLGKDAIGKRIEVHLTSDGKWHQGMVSNVMGG--TLCIQLDNGRSENV 800
Query: 701 ELGKQGVRFVPQKQK 715
ELGKQ +R + + K
Sbjct: 801 ELGKQAIRLIASRSK 815
>gi|115488844|ref|NP_001066909.1| Os12g0527800 [Oryza sativa Japonica Group]
gi|77556508|gb|ABA99304.1| PHD-finger family protein, expressed [Oryza sativa Japonica Group]
gi|113649416|dbj|BAF29928.1| Os12g0527800 [Oryza sativa Japonica Group]
gi|215717023|dbj|BAG95386.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 688
Score = 592 bits (1527), Expect = e-166, Method: Compositional matrix adjust.
Identities = 340/745 (45%), Positives = 445/745 (59%), Gaps = 91/745 (12%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF GENEG +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+C
Sbjct: 1 MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVC 60
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RR GDPNK MFC+RCD AYHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 61 RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 120
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 180
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q D NLQY C CRGEC Q+RD EDAVRELW+R+D+ D DL+ASLRAAA LP+ +++
Sbjct: 181 QSDQNLQYTCGACRGECSQIRDTEDAVRELWKRRDVVDHDLMASLRAAAALPSLEDVSPS 240
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGY 295
P SDDE+ G V+KN+ +LK S K K E K + NKK+ +KKG
Sbjct: 241 HPNSDDEKLGAYVMKNDGRNTLKFSFKSNSTKPALDSSEQEKNAIKSSGSNKKHSKKKGN 300
Query: 296 QMPLNSKPEPDQSFEGYHDVHSYGNSFGD-------DTQSPKNEGLDIPSSVAGIVSHTE 348
Q + + E ++ S G S GD D S KN D + V E
Sbjct: 301 QNNKTVSEQDEIFLEKRNETKSLG-SLGDQIADVTRDKSSFKN---DADAFVLSSAQSAE 356
Query: 349 GVCSISQPGILKHKYVDEVMVSDDDKISRVKFKTSKP---HDLDSGEDDGKHVSKSKTIK 405
+ H + D I +VK K +K H D GE++ +KS T K
Sbjct: 357 KALKLQSAKAAAH---------NADMIPKVKIKGTKVPSLHFKDVGEEN---AAKSDTGK 404
Query: 406 AKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSK 465
KLVI++G+R + + SP+S+ S+ Q+EQ+L + +G
Sbjct: 405 GTKLVIHIGSRHKSRSGSPKSEMSNSQKEQELVSMHG----------------------- 441
Query: 466 LGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLS 525
+VD +SQ + + + +V+K VR+ N+ + ++ +H +
Sbjct: 442 ----GKVDVTSQFKSSRSEIKEKSVMKL--VRETGVQQNSLLGDLGASKKH-------AT 488
Query: 526 GKRNIDRSRAAVSRVGEVAALRGDRK----QLESRPNASRESNDDTSVLQSLPKDSKPPL 581
GKR S A VS + E A+ G R Q +S + + + + + + P KP L
Sbjct: 489 GKR----SNAIVSAM-ENASESGTRSRSFGQKQSVNHLTENQGNASFSVNNSPDSLKPSL 543
Query: 582 -RLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 631
+LKF++P E ++Q SQPEE E ++ KGQRSKRKRPS +K +E +
Sbjct: 544 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 601
Query: 632 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 690
AA+ ++ S E MDANWIL+KLGKDAIGKR+EV SD WH+GVV++ + G TL +
Sbjct: 602 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 659
Query: 691 TLDDSRVKTLELGKQGVRFVPQKQK 715
LD+ R + +ELGK+ +R + Q+ K
Sbjct: 660 QLDNGRSENIELGKRAIRLIAQRSK 684
>gi|222617191|gb|EEE53323.1| hypothetical protein OsJ_36320 [Oryza sativa Japonica Group]
Length = 756
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 340/744 (45%), Positives = 444/744 (59%), Gaps = 91/744 (12%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF GENEG +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+C
Sbjct: 1 MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVC 60
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RR GDPNK MFC+RCD AYHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 61 RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 120
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 180
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q D NLQY C CRGEC Q+RD EDAVRELW+R+D+ D DL+ASLRAAA LP+ +++
Sbjct: 181 QSDQNLQYTCGACRGECSQIRDTEDAVRELWKRRDVVDHDLMASLRAAAALPSLEDVSPS 240
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWL-----NKKYPRKKGY 295
P SDDE+ G V+KN+ +LK S K K E K + NKK+ +KKG
Sbjct: 241 HPNSDDEKLGAYVMKNDGRNTLKFSFKSNSTKPALDSSEQEKNAIKSSGSNKKHSKKKGN 300
Query: 296 QMPLNSKPEPDQSFEGYHDVHSYGNSFGD-------DTQSPKNEGLDIPSSVAGIVSHTE 348
Q + + E ++ S G S GD D S KN D + V E
Sbjct: 301 QNNKTVSEQDEIFLEKRNETKSLG-SLGDQIADVTRDKSSFKN---DADAFVLSSAQSAE 356
Query: 349 GVCSISQPGILKHKYVDEVMVSDDDKISRVKFKTSKP---HDLDSGEDDGKHVSKSKTIK 405
+ H + D I +VK K +K H D GE++ +KS T K
Sbjct: 357 KALKLQSAKAAAH---------NADMIPKVKIKGTKVPSLHFKDVGEEN---AAKSDTGK 404
Query: 406 AKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSK 465
KLVI++G+R + + SP+S+ S+ Q+EQ+L + +G
Sbjct: 405 GTKLVIHIGSRHKSRSGSPKSEMSNSQKEQELVSMHG----------------------- 441
Query: 466 LGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLS 525
+VD +SQ + + + +V+K VR+ N+ + ++ +H +
Sbjct: 442 ----GKVDVTSQFKSSRSEIKEKSVMKL--VRETGVQQNSLLGDLGASKKHA-------T 488
Query: 526 GKRNIDRSRAAVSRVGEVAALRGDRK----QLESRPNASRESNDDTSVLQSLPKDSKPPL 581
GKR S A VS + E A+ G R Q +S + + + + + + P KP L
Sbjct: 489 GKR----SNAIVSAM-ENASESGTRSRSFGQKQSVNHLTENQGNASFSVNNSPDSLKPSL 543
Query: 582 -RLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 631
+LKF++P E ++Q SQPEE E ++ KGQRSKRKRPS +K +E +
Sbjct: 544 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 601
Query: 632 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 690
AA+ ++ S E MDANWIL+KLGKDAIGKR+EV SD WH+GVV++ + G TL +
Sbjct: 602 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 659
Query: 691 TLDDSRVKTLELGKQGVRFVPQKQ 714
LD+ R + +ELGK+ +R + Q Q
Sbjct: 660 QLDNGRSENIELGKRAIRLIAQSQ 683
>gi|357151790|ref|XP_003575905.1| PREDICTED: uncharacterized protein LOC100821635 [Brachypodium
distachyon]
Length = 809
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 329/738 (44%), Positives = 438/738 (59%), Gaps = 84/738 (11%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCRLC GENEG +A +ML CK C KKYH+ C+K W ++RDLFHWSSW CPSCR CE+C
Sbjct: 129 MCRLCISGENEGSSKAAKMLPCKLCNKKYHKKCVKYWGEHRDLFHWSSWVCPSCRSCEVC 188
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RR GDPNK MFC+RCD AYHCYCQ P HKNVS GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 189 RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVSHGPYLCPKHTRCHSCGSGVPGSGHSTRW 248
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 249 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 308
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q D NLQY C +CRGEC Q+RD EDAVRELW+R+++ D DL+ SLRAAA LP+ +++
Sbjct: 309 QADENLQYTCASCRGECSQIRDAEDAVRELWKRRNIVDHDLMVSLRAAAALPSLEDVSPS 368
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKK-------WLNKKYPRKK 293
+P SDDE G V KN+ +LK S K K P + + G++ NKK+ +KK
Sbjct: 369 NPNSDDERLGAFVPKNDGRNTLKFSFKSNSSKPP--LDQSGQEKNVPKTSGSNKKHSKKK 426
Query: 294 GYQMPLNSKPEPDQSF-EGYHDVHSYGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCS 352
G Q + S +PD+ F E H+ SY N G + N G + + + V +
Sbjct: 427 GNQGNI-SVGDPDEIFLEKRHEAKSYSNLGGHTIEG--NHG-------QSTIKNDDSVFT 476
Query: 353 ISQPGILKHKYVDEVMVSDDDKISRVKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVIN 412
+S + ++ ++ D I +VK + SK L + + +KS K KLV +
Sbjct: 477 LSAT-----RSSEKGAANNADMIPKVKIRGSKAPSLHFKDVGEVNTAKSDAGKGTKLVFH 531
Query: 413 LGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRV 472
G R + + SP+S+ ++ +EQ+L + +G ++
Sbjct: 532 FGTRHKSGSGSPKSEMTNSHKEQELGSLHG---------------------------GKI 564
Query: 473 DHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDR 532
D +SQ + K + +V+K R V N+ + ++ +H ++GKR
Sbjct: 565 DVTSQFKSSKSEKKEKSVMKLVR-ETGVQQRNSLLGDLGTSKKH-------VTGKR---- 612
Query: 533 SRAAVSRVGEVAALRGDRKQLESRPNASRESNDDTSVLQSLPKDSKPP------LRLKFR 586
S A +S + E A G R + + D SLP ++ P L+LKF+
Sbjct: 613 SNAIISGM-ENAGESGTRSRSFGHKQSIPNQLTDNQATASLPVNNSPDSLKPSLLKLKFK 671
Query: 587 KPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQ 637
+P+ E ++QV+QPEE E ++ KGQRSKRKRPS +K +E + + +Q
Sbjct: 672 RPHFEQPSAQVAQPEETATWASQQEELNVAKGQRSKRKRPS--MDKMDGSEGKTPGKRHQ 729
Query: 638 DSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRV 697
S E MDA WIL+KLGKDAIGKR+E+ SD WH+GVV++ + G TL + LDD
Sbjct: 730 QSTGDEAMDATWILRKLGKDAIGKRIEIQLPSDGKWHQGVVSNVLSG--TLCVQLDDGSS 787
Query: 698 KTLELGKQGVRFVPQKQK 715
+ LELGKQ VR V Q+ K
Sbjct: 788 ENLELGKQAVRLVAQRSK 805
>gi|414878222|tpg|DAA55353.1| TPA: RING/FYVE/PHD-type zinc finger family protein [Zea mays]
Length = 818
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 337/741 (45%), Positives = 443/741 (59%), Gaps = 88/741 (11%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCRLCF GENEG +A +ML CK C K+YHRNCLK+W ++RDLFHWSSW CPSCR CE+C
Sbjct: 136 MCRLCFSGENEGSTKAAKMLPCKLCSKRYHRNCLKSWGEHRDLFHWSSWVCPSCRSCEVC 195
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RR GDPNK MFC+RCD AYHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 196 RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 255
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCP+CLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 256 FLGYTCCDACGRLFVKGNYCPICLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 315
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q D NLQY C CRGEC Q+RD EDA+RELW+R+D+AD +L+A+LRAAA LP+ +++
Sbjct: 316 QADQNLQYTCAACRGECSQIRDTEDAIRELWKRRDVADHELMATLRAAAALPSLEDVSPP 375
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLN 300
SDDE+ G LKNE +LK SLK K P E K KK +
Sbjct: 376 YQNSDDEKLGAYALKNESRNTLKFSLKSNSSKPPPDTPEQEKIVFKSSGSNKKPSKKKSG 435
Query: 301 SKPEP----DQSF-EGYHDVHSYGNSFGDDT------QSP--KNEGLDIPSSVAGIVSHT 347
+ D+ F E H V S + GD T +SP ++ + + SS + +
Sbjct: 436 QANKTVDGHDEIFLERRHAVKSSNSCLGDQTINENHDRSPFKNDDNVYVSSSTRSLEKN- 494
Query: 348 EGVCSISQPGILKHKYVDEVMVSDDDKISRVKFKTSKPHDLDSGEDDGKHVSKSKTIKAK 407
+ P + + + ++ D I +VK K SK L +D ++ K+ T KA
Sbjct: 495 -----LKSPSM-------KAVANNADMIPKVKIKGSKVSSLHY-KDGEENTPKNDTGKAT 541
Query: 408 KLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLG 467
KLVI+LG+R + SP+S+ S+ QREQDL + +G
Sbjct: 542 KLVIHLGSRHKTRSGSPKSELSNSQREQDLGSIHG------------------------- 576
Query: 468 DGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGK 527
++D +SQ + + + +V+K R V N+ + ++ +H +GK
Sbjct: 577 --GKIDVTSQLKSSRNEVKERSVMKLVR-DTGVQQRNSLLGDLGTSKKH-------ATGK 626
Query: 528 RNIDRSRAAVSRVGEVAALRGDRKQ--LESRPNASRESNDDTSVLQSLPKDSKPPL-RLK 584
R S A +S + E A G R + + + ++S+ N T+ P KP L +LK
Sbjct: 627 R----SNALISGM-ENANETGTRNRSFAQKQSHSSQVENHGTA---DSPDSLKPSLLKLK 678
Query: 585 FRKPNLENQNSQVSQPEEEKSLI----------KGQRSKRKRPSPFTEKTLFNEDEDAAQ 634
F++P+ E N+Q SQPEE S + KGQRSKRKRPS EK + A+
Sbjct: 679 FKRPHFEQLNTQASQPEEPTSWVSQQEEQLNVAKGQRSKRKRPS--MEKADGLDGITPAK 736
Query: 635 SNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDD 694
+Q S E+MDANWIL+KLGKDAIGKR+EVH SD WH+G+V++ + G TL I LD+
Sbjct: 737 RHQQS-TDEVMDANWILRKLGKDAIGKRIEVHLTSDGKWHQGMVSNVMGG--TLCIRLDN 793
Query: 695 SRVKTLELGKQGVRFVPQKQK 715
R + +ELGKQ +R + + K
Sbjct: 794 GRSENVELGKQAIRLIASRSK 814
>gi|326519042|dbj|BAJ92681.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 816
Score = 557 bits (1435), Expect = e-156, Method: Compositional matrix adjust.
Identities = 328/739 (44%), Positives = 422/739 (57%), Gaps = 79/739 (10%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCRLCF GENEG +A +ML CK C KKYH+ C+KNW ++RDLFHWSSW C SCR CE+C
Sbjct: 129 MCRLCFSGENEGSSKAAKMLPCKLCNKKYHKKCVKNWGEHRDLFHWSSWICSSCRSCEVC 188
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RR GDPNK MFC+RCD AYHCYCQ P HKNV+ GPYLCPKHT+CHSCGS VPG+G S RW
Sbjct: 189 RRPGDPNKLMFCKRCDGAYHCYCQQPSHKNVTHGPYLCPKHTRCHSCGSGVPGSGHSTRW 248
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE PMVCCDVC++WVH +CDGIS+EKY QF
Sbjct: 249 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSEVIPMVCCDVCEKWVHIECDGISEEKYQQF 308
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q D NLQY C +CRGEC Q+RD EDAVRELW+R+++ D DL+ SLRAAAGLP+ +++ S
Sbjct: 309 QADQNLQYTCASCRGECSQIRDAEDAVRELWKRRNVVDHDLMISLRAAAGLPSLEDV-SP 367
Query: 241 SPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPL- 299
P SDDE G +VLKN+ +LK SLK K P E K P+ G
Sbjct: 368 CPNSDDERLGALVLKNDGRNTLKFSLKSNSSKPPLDQCEQ-----EKNVPKNSGTNKKHS 422
Query: 300 --------NSKPEPDQSF-EGYHDVHSYGNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGV 350
S +P++ F E H+ S + GD T ++ ++ V +
Sbjct: 423 KKKSSQGNKSVADPNEIFLERRHEAKSMSSHLGDHTVDVNHDRNSFKNNENVFVLPS--- 479
Query: 351 CSISQPGILKHKYVDEVMVSDDDKISRVKFKTSKP---HDLDSGEDDGKHVSKSKTIKAK 407
+ S LK V + ++ + I +VK K SK H D GE++ + T K
Sbjct: 480 -TRSSEKDLKSTSV-KATTNNANTIPKVKIKGSKVPSLHFKDIGEENN---ANGDTGKGT 534
Query: 408 KLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLG 467
KLVI+LG R + + SP+S+ S+ +EQ+L +++G + S + KL
Sbjct: 535 KLVIHLGTRHKSKSGSPKSEMSNSHKEQELGSTHGGKTDVTSLFKSSKSSKKEKSVMKLV 594
Query: 468 DGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGK 527
V SS L + R + SSA ++SG
Sbjct: 595 GETGVQQSSLLGDLGTSKRHA------------------TGKRSSA---------LISGM 627
Query: 528 RNIDRSRAAVSRVGEVAALRGDRKQLESRPNASRESNDDTSVLQSLPKDSKPP--LRLKF 585
N + S G +S P+ ES S + DS P L+LKF
Sbjct: 628 ENANESGTRSRSFG----------HKQSIPSQLTESQGTASFAVNNSPDSLKPSLLKLKF 677
Query: 586 RKPNLENQNSQVSQPEEEKS---------LIKGQRSKRKRPSPFTEKTLFNEDEDAAQSN 636
++P+LE + QVSQ EE + + KGQRSKRKRPS T+K +E ++ +
Sbjct: 678 KRPHLEQPSLQVSQTEEPATWASQQEDLNVAKGQRSKRKRPS--TDKMDGSEGSTPSKRH 735
Query: 637 QDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSR 696
S E MDA WIL+KLG DAIGKR+E+ SD WH+GVV++ + G L + LD+
Sbjct: 736 GQSTGDEAMDATWILRKLGNDAIGKRIEIQLASDGKWHQGVVSNVISG--MLCVQLDNGS 793
Query: 697 VKTLELGKQGVRFVPQKQK 715
+ LELG Q VR + Q+ K
Sbjct: 794 SENLELGNQAVRLIAQRLK 812
>gi|224055146|ref|XP_002298424.1| predicted protein [Populus trichocarpa]
gi|222845682|gb|EEE83229.1| predicted protein [Populus trichocarpa]
Length = 797
Score = 520 bits (1340), Expect = e-145, Method: Compositional matrix adjust.
Identities = 315/535 (58%), Positives = 377/535 (70%), Gaps = 53/535 (9%)
Query: 155 MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRK 214
+ D C V SDEKYLQFQVDGNLQY+C TCRGECYQV+DLEDAV+ELWRR+
Sbjct: 236 IFASDGCTVIVMASEIFCSDEKYLQFQVDGNLQYQCSTCRGECYQVKDLEDAVQELWRRR 295
Query: 215 DMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSP 274
D AD+ LIASLRAAAGLP +++IFSI+PYSDD+ENGP +N+FGRS+KLSLKG+V+KSP
Sbjct: 296 DKADRGLIASLRAAAGLPAQEDIFSITPYSDDDENGPAAPRNDFGRSIKLSLKGLVEKSP 355
Query: 275 KKVKEHGKKWLNKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGD----DTQSPK 330
KK K+HGKK LNKKYP++KG SK E Q H+ HSY + GD DT+S
Sbjct: 356 KKSKDHGKKHLNKKYPKRKGPHAASFSKTESYQ-----HESHSYEHDSGDEKNNDTESQA 410
Query: 331 NEGLDIPSS-VAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPHDL 388
GL SS VAGIV+HTEG+CSI+QPG LKHK+V+EVMVSD ++ S+ VK K++KP DL
Sbjct: 411 KGGLGRCSSPVAGIVNHTEGICSINQPGALKHKFVEEVMVSDGERTSKIVKIKSNKPRDL 470
Query: 389 DSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSL 448
DSG DD + SKSK++KAKKLVINLGARKINV++SP+SDA SCQREQDL SN
Sbjct: 471 DSG-DDAEKPSKSKSVKAKKLVINLGARKINVSSSPKSDAQSCQREQDLKASN------- 522
Query: 449 QRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGL-KIAGRGGNVIKFGRVRQEVSDSNTKV 507
GD VDHS Q RGL K A R GN IKFG+V+ E S N K
Sbjct: 523 --------------------GDGVDHSEQKRGLIKFARREGNFIKFGKVKAEASSLNLKS 562
Query: 508 SRGSSADEHEP---EHMHVLSGKRNIDRSRAAVSRVGEVAALRGDR----KQLESRPNAS 560
G+ D +E +H V S KR+++ SRAAV GEV LR DR KQ E+R +
Sbjct: 563 DGGNHFDAYETTPLDHARVTSSKRSLEGSRAAVGPAGEVPMLRNDRVSLGKQSEARLDTH 622
Query: 561 RESND---DTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRP 617
ESND DT +L SLPKDSK L+LK +KPNLENQ+SQ+ EEEKS +GQRSKRKR
Sbjct: 623 TESNDDSGDTPILHSLPKDSKLSLKLKIKKPNLENQSSQILLHEEEKSNTRGQRSKRKRA 682
Query: 618 SPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNS 672
S F +KT++NEDED ++S+ D SE+M+ANWILKKLGKDAIGKRVEVHQ SDNS
Sbjct: 683 STFMDKTMYNEDEDMSESHLD---SEMMEANWILKKLGKDAIGKRVEVHQPSDNS 734
>gi|359487302|ref|XP_002274438.2| PREDICTED: uncharacterized protein LOC100249974 [Vitis vinifera]
Length = 730
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 208/258 (80%), Positives = 225/258 (87%), Gaps = 9/258 (3%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF GE EG ERAR+ML C SCGKKYHR CLK+W+QNRDLFHWSSW CPSCRICE+C
Sbjct: 143 MCRICFFGEMEGSERARKMLPCNSCGKKYHRLCLKSWSQNRDLFHWSSWTCPSCRICEVC 202
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RR+GDPNKFMFCRRCD AYHCYCQ PPHKNVSSGPYLCPKHT+CHSCGSNVPGNGLSVRW
Sbjct: 203 RRSGDPNKFMFCRRCDDAYHCYCQQPPHKNVSSGPYLCPKHTRCHSCGSNVPGNGLSVRW 262
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF
Sbjct: 263 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 322
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
QVDGNLQY+C TCRGECYQV+DLEDAV+ELWRR+D AD+ + +S A L + I SI
Sbjct: 323 QVDGNLQYKCATCRGECYQVKDLEDAVQELWRRRDKADRGVFSS-PVAGSLSHTEGICSI 381
Query: 241 SPYSDDEENGPVVLKNEF 258
N P VLK++F
Sbjct: 382 --------NQPGVLKHKF 391
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 228/392 (58%), Positives = 271/392 (69%), Gaps = 41/392 (10%)
Query: 338 SSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRV-KFKTSKPHDLDSGEDDGK 396
S VAG +SHTEG+CSI+QPG+LKHK+VDE+ V+++D+ SRV + K++KPH D GED GK
Sbjct: 366 SPVAGSLSHTEGICSINQPGVLKHKFVDEIAVNNEDRTSRVIQIKSNKPHGSDVGEDTGK 425
Query: 397 HVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFV 456
SKSKT+K KLVI+LGAR NVTNSPRSDASSCQREQDLTTSNG
Sbjct: 426 QASKSKTMKGTKLVIHLGARNRNVTNSPRSDASSCQREQDLTTSNG-------------- 471
Query: 457 LDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEH 516
D++D+S Q++G K GR GN+IK G+VR E S+ N K RG+ D
Sbjct: 472 -------------DKIDYSGQAKGSKHGGREGNLIKLGKVRTEPSEMNPKFGRGNKDDGV 518
Query: 517 E---PEHMHVLSGKRNIDRSRAAVSRVGEVAALRGD----RKQLESRPNASRESNDDTS- 568
E PE+ VL GKR+I+ S V EV+ RG+ RK ESR N E NDD S
Sbjct: 519 EAIPPENTRVLLGKRSIEGSTNVAGAVTEVS--RGEKVFSRKHPESRLNMYGEGNDDNSS 576
Query: 569 ---VLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTL 625
V SLPKDSKP L+LKF+ P+ ENQ+S E+EKS +KGQRSKRKRPSPF EKT
Sbjct: 577 TPSVSHSLPKDSKPLLKLKFKNPSFENQSSWGLPGEDEKSAVKGQRSKRKRPSPFMEKTS 636
Query: 626 FNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGT 685
F EDED +Q +QD M +IMDANWILKKLGKDAIGKRVEVHQ SDNSWHKG+V D +EGT
Sbjct: 637 FKEDEDGSQFHQDDSMDQIMDANWILKKLGKDAIGKRVEVHQSSDNSWHKGMVIDFIEGT 696
Query: 686 STLSITLDDSRVKTLELGKQGVRFVPQKQKRS 717
STL + DD R KTLELGKQ +R + QKQKRS
Sbjct: 697 STLIVKFDDGRAKTLELGKQAIRLISQKQKRS 728
>gi|255553540|ref|XP_002517811.1| protein binding protein, putative [Ricinus communis]
gi|223543083|gb|EEF44618.1| protein binding protein, putative [Ricinus communis]
Length = 734
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 201/236 (85%), Positives = 219/236 (92%), Gaps = 2/236 (0%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MCR+CF+GE EG ERARRMLSCKSCGKKYHR+CLK+WAQ+RDLFHWSSW CPSCRICEIC
Sbjct: 157 MCRMCFLGEAEGSERARRMLSCKSCGKKYHRSCLKSWAQHRDLFHWSSWTCPSCRICEIC 216
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
RRTGDPNKFMFC+RCD AYHCYCQHPPHKNVSSGPYLCPKHT+CHSCGS+VPGNGLSVRW
Sbjct: 217 RRTGDPNKFMFCKRCDGAYHCYCQHPPHKNVSSGPYLCPKHTRCHSCGSSVPGNGLSVRW 276
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCD+CQRWVHC CDGISDEKYLQF
Sbjct: 277 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDICQRWVHCSCDGISDEKYLQF 336
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDE 236
QVDGNLQY+C TCRGECYQV+D EDAV+ELWRR+D AD+ + +S + AG+ E
Sbjct: 337 QVDGNLQYKCATCRGECYQVKDHEDAVQELWRRRDEADRGVYSS--SIAGVVNHAE 390
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 234/385 (60%), Positives = 273/385 (70%), Gaps = 38/385 (9%)
Query: 338 SSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISR-VKFKTSKPHDLDSGEDDGK 396
SS+AG+V+H EG CS++Q G+LKHKYVDEVMVSD ++ SR V+ K KPHDLDSG+D K
Sbjct: 380 SSIAGVVNHAEGNCSVNQTGVLKHKYVDEVMVSDGERTSRIVRLKNKKPHDLDSGDDAEK 439
Query: 397 HVSKSKTIKAKKLVINLGARKINVTNSPRSDASSCQREQDLTTSNGIEDPSLQRMNSKFV 456
H K K++KAKKLVINLGARKINVTNS RSDASSCQR+QD+TT NG
Sbjct: 440 HAIKFKSVKAKKLVINLGARKINVTNSHRSDASSCQRDQDMTTPNG-------------- 485
Query: 457 LDRHDGSSKLGDGDRVDHSSQSRGLKIAGRGGNVIKFGRVRQEVSDSNTKVSRGSSADEH 516
D VDHS Q R LK R GN IKFG+V+ E S+ N K GS AD
Sbjct: 486 -------------DTVDHSVQIRSLKFPRREGNFIKFGKVKNETSNLNPKFQTGSDADGE 532
Query: 517 EPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDR----KQLESRPNASRESNDDTSVLQS 572
+ + V S KR+ID AV V EV LR D+ KQLE R ESNDD+ S
Sbjct: 533 K--MVSVSSSKRSIDGCGTAVGPVDEVPTLRSDKVSIGKQLEVRSETHAESNDDSGD-AS 589
Query: 573 LPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDA 632
LPKDSK L+LK + PNL NQ S+ PEEEKS I+GQRSKRKRPS F +K+LFNE+ED
Sbjct: 590 LPKDSKISLKLKIKNPNLLNQYSRKPPPEEEKSSIRGQRSKRKRPSSFMDKSLFNENEDI 649
Query: 633 AQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITL 692
Q++QDS E+++A+WILKKLGKDAIGKRVEVHQ SDNSWHKGVV+DTVEGTS +S+TL
Sbjct: 650 TQAHQDS---EMLEASWILKKLGKDAIGKRVEVHQPSDNSWHKGVVSDTVEGTSMISVTL 706
Query: 693 DDSRVKTLELGKQGVRFVPQKQKRS 717
DDSRVKTL+LGKQ VRFVPQKQKRS
Sbjct: 707 DDSRVKTLQLGKQAVRFVPQKQKRS 731
>gi|218186977|gb|EEC69404.1| hypothetical protein OsI_38556 [Oryza sativa Indica Group]
Length = 625
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 226/439 (51%), Positives = 268/439 (61%), Gaps = 57/439 (12%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI- 59
MCR+CF GENEG +A +ML CK C KKYHR+CLKNW ++RDLFHWSSW CPSCR CE+
Sbjct: 1 MCRICFSGENEGSTKAAKMLPCKLCNKKYHRSCLKNWGEHRDLFHWSSWVCPSCRSCEVL 60
Query: 60 ----------------------------CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
CRR GDPNK MFC+RCD AYHCYCQ P HKNV
Sbjct: 61 LDWSLGFDVNLAKTLVCGVTGPTSGSSVCRRPGDPNKLMFCKRCDGAYHCYCQQPSHKNV 120
Query: 92 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 151
+ GPYLCPKHT+CHSCGS VPG+G S RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE
Sbjct: 121 THGPYLCPKHTRCHSCGSGVPGSGHSTRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 180
Query: 152 STPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELW 211
PMVCCDVC++WVH +CDGIS+EKY QFQ D NLQY C CRGEC Q+RD EDAVRELW
Sbjct: 181 VIPMVCCDVCEKWVHIECDGISEEKYQQFQSDQNLQYTCGACRGECSQIRDTEDAVRELW 240
Query: 212 RRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVD 271
+R+D+ D DL+ASLRAAA LP+ +++ P SDDE+ G V+KN+ +LK S K
Sbjct: 241 KRRDVVDHDLMASLRAAAALPSLEDVSPSHPNSDDEKLGAYVMKNDGRNTLKFSFKSNST 300
Query: 272 KSPKKVKEHGKKWL-----NKKYPRKKGYQMPLNSKPEPDQSFEGYHDVHSYGNSFGD-- 324
K E K + NKK+ +KKG Q + + E ++ S G S GD
Sbjct: 301 KPALDSSEQEKNAIKSSGSNKKHSKKKGNQNNKTVSEQDEIFLEKRNETKSLG-SLGDQI 359
Query: 325 -----DTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDEVMVSDDDKISRVK 379
D S KN D + V E + H + D I +VK
Sbjct: 360 ADVTRDKSSFKN---DADAFVLSSAQSAEKALKLQSAKAAAH---------NADMIPKVK 407
Query: 380 FKTSKP---HDLDSGEDDG 395
K +K H D GE++
Sbjct: 408 IKGTKVPSLHFKDVGEENA 426
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 69/144 (47%), Positives = 95/144 (65%), Gaps = 14/144 (9%)
Query: 581 LRLKFRKPNLENQNSQVSQPEE---------EKSLIKGQRSKRKRPSPFTEKTLFNEDED 631
L+LKF++P E ++Q SQPEE E ++ KGQRSKRKRPS +K +E +
Sbjct: 449 LKLKFKRPIFEQPSTQSSQPEEPGTWASPQEELNVAKGQRSKRKRPS--LDKMDGSESKA 506
Query: 632 -AAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSI 690
AA+ ++ S E MDANWIL+KLGKDAIGKR+EV SD WH+GVV++ + G TL +
Sbjct: 507 PAAKRHEQSTGEEAMDANWILRKLGKDAIGKRIEVQLASDGKWHQGVVSNVING--TLCL 564
Query: 691 TLDDSRVKTLELGKQGVRFVPQKQ 714
LD+ R + +ELGK+ +R + Q Q
Sbjct: 565 QLDNGRSENIELGKRAIRLIAQSQ 588
>gi|297816482|ref|XP_002876124.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297321962|gb|EFH52383.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 678
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 184/228 (80%), Positives = 205/228 (89%), Gaps = 1/228 (0%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C++ E ERA+ MLSCK CGKKYHRNCLK+WAQ+RDLF+WSSW CPSCRICE C
Sbjct: 144 CHMCYLVEVGKSERAK-MLSCKCCGKKYHRNCLKSWAQHRDLFNWSSWACPSCRICEGCG 202
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
GDP KFMFC+RCD AYHC CQ P HKNVSSGPYLCPKHTKC+SCGS VPGNG S+RWF
Sbjct: 203 TLGDPKKFMFCKRCDDAYHCDCQQPRHKNVSSGPYLCPKHTKCYSCGSTVPGNGQSLRWF 262
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQ 181
LG+TCCDACGRLFVKGNYCPVCLKVYRDSE+TPMVCCD CQRWVHC CDGISDEKY+QFQ
Sbjct: 263 LGHTCCDACGRLFVKGNYCPVCLKVYRDSEATPMVCCDFCQRWVHCHCDGISDEKYMQFQ 322
Query: 182 VDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAA 229
VDGNLQY+C TCRGECYQV+DLEDAV+E+W+RKD+ADKDLIASL+A+A
Sbjct: 323 VDGNLQYKCSTCRGECYQVKDLEDAVQEIWKRKDIADKDLIASLKASA 370
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 122/338 (36%), Positives = 174/338 (51%), Gaps = 18/338 (5%)
Query: 392 EDDGKHVSKSKTIKAKKLVINL--GARKINVTNSP---RSDASSCQREQDLTTSNGIEDP 446
ED + + K K I K L+ +L AR + T S ++ + NG E+
Sbjct: 345 EDAVQEIWKRKDIADKDLIASLKASARVVGQTGGAPLVNQPGSVERKVSEKAMVNGEEEK 404
Query: 447 SLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGL--KIAGRGGNVIKFGR--VRQEVSD 502
L+ + K + S K G ++ +++ L I R V V + S
Sbjct: 405 PLRVLRIKSSRTQDSDSEKFGKHSTELNTVKAKKLVISIGPRKTGVTNSTSCDVSKLTSK 464
Query: 503 SNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLESRPNASRE 562
SN K + S + E L GK N D R + GEV + + + + +
Sbjct: 465 SNGKQEKLQSEETFSREQHRSLLGKNN-DEKRGSR---GEVTTSKAEGGFIGRHSDGKGD 520
Query: 563 SNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQV-SQPEEEKSLIKGQRSKRKRPSPFT 621
N + S+ KDS+ L+L+ +K N E+Q + S E KG RSKRKR SP
Sbjct: 521 LNSGSH--DSMQKDSRRLLKLRIKKHNPESQEGETPSIVYERGKSGKGHRSKRKRASPPA 578
Query: 622 EKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHK--GVVT 679
EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW K
Sbjct: 579 EKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWEKGVVSEV 638
Query: 680 DTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 717
GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 639 GGGGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 676
>gi|42565848|ref|NP_190778.2| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332645370|gb|AEE78891.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 696
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 184/228 (80%), Positives = 205/228 (89%), Gaps = 1/228 (0%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C++ E ERA+ MLSCK CGKKYHRNC+K+WAQ+RDLF+WSSW CPSCRICE C
Sbjct: 162 CHMCYLVEVGKSERAK-MLSCKCCGKKYHRNCVKSWAQHRDLFNWSSWACPSCRICEGCG 220
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
GDP KFMFC+RCD AYHC CQHP HKNVSSGPYLCPKHTKC+SC S VPGNG S+RWF
Sbjct: 221 TLGDPKKFMFCKRCDDAYHCDCQHPRHKNVSSGPYLCPKHTKCYSCESTVPGNGQSLRWF 280
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQ 181
LG+TCCDACGRLFVKGNYCPVCLKVYRDSE+TPMVCCD CQRWVHCQCDGISDEKY+QFQ
Sbjct: 281 LGHTCCDACGRLFVKGNYCPVCLKVYRDSEATPMVCCDFCQRWVHCQCDGISDEKYMQFQ 340
Query: 182 VDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAA 229
VDGNLQY+C TCRGE YQV+DLEDAV+E+W+RKDMADKDLIASL+A+A
Sbjct: 341 VDGNLQYKCSTCRGESYQVKDLEDAVQEIWKRKDMADKDLIASLKASA 388
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 103/225 (45%), Positives = 139/225 (61%), Gaps = 9/225 (4%)
Query: 496 VRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLES 555
V + S SN K + + + E L GK N D R + GEV L+ + +
Sbjct: 476 VSKTASKSNGKQEKLQAEETFSREERRSLLGK-NSDEKRGSR---GEVTTLKAEGGFIGR 531
Query: 556 RPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLI-KGQRSKR 614
+ + N + S KDS+ L+LK +K N E Q S+ E+S KG RSKR
Sbjct: 532 HSDGKGDLNSGSH--DSSQKDSRRLLKLKIKKHNPEGQESEAPSIVYERSKSGKGHRSKR 589
Query: 615 KRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWH 674
KR SP EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW
Sbjct: 590 KRASPPAEKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWE 649
Query: 675 KGVVTDT--VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 717
KGVV++ GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 650 KGVVSEVGGAGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 694
>gi|168045006|ref|XP_001774970.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673717|gb|EDQ60236.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 686
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 154/248 (62%), Positives = 194/248 (78%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+C LC GE G ++A RMLSC++C K+YHR C K WA++RDLF+W+SW C SCR+CE+C
Sbjct: 172 VCGLCGCGEAIGSDKAGRMLSCQACRKQYHRKCTKYWAEHRDLFNWASWMCGSCRVCEVC 231
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
R+GD NK MFC+RCD AYH C HPP K+V GP++CPKH +C SC + VPG G+S +W
Sbjct: 232 LRSGDSNKLMFCKRCDHAYHSSCLHPPLKHVPKGPFVCPKHVRCTSCNTTVPGGGVSSKW 291
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
FL Y+ CDACGRLF +G YCP+CLKVYRDSE PMVCCDVC+ WVHC+CDGISDEKY +F
Sbjct: 292 FLSYSLCDACGRLFTRGKYCPICLKVYRDSEPAPMVCCDVCEHWVHCECDGISDEKYQEF 351
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
QV+ L+Y+C +CRGECY+V DL+DA E+WRRKD+ D IA +RAAAGLP+ +EI
Sbjct: 352 QVNSQLRYKCASCRGECYKVADLDDAAVEIWRRKDIRDATQIAEIRAAAGLPSPEEILKA 411
Query: 241 SPYSDDEE 248
P SD+E+
Sbjct: 412 YPSSDEED 419
>gi|4678939|emb|CAB41330.1| putative protein [Arabidopsis thaliana]
Length = 763
Score = 355 bits (910), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 166/239 (69%), Positives = 189/239 (79%), Gaps = 12/239 (5%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C++ E ERA+ MLSCK CGKKYHRNC+K+WAQ+RDLF+WSSW CPSCRICE C
Sbjct: 144 CHMCYLVEVGKSERAK-MLSCKCCGKKYHRNCVKSWAQHRDLFNWSSWACPSCRICEGCG 202
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
GDP KFMFC+RCD AYHC CQHP HKNVSSGPYLCPKHTKC+SC S VPGNG S+R+
Sbjct: 203 TLGDPKKFMFCKRCDDAYHCDCQHPRHKNVSSGPYLCPKHTKCYSCESTVPGNGQSLRYL 262
Query: 122 ------LGYTCCDACGRL---FVKGNYC--PVCLKVYRDSESTPMVCCDVCQRWVHCQCD 170
L C G L V+G + L VYRDSE+TPMVCCD CQRWVHCQCD
Sbjct: 263 TFCLVILEIYSCGFWGILVVMLVEGCLLRGIIVLYVYRDSEATPMVCCDFCQRWVHCQCD 322
Query: 171 GISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAA 229
GISDEKY+QFQVDGNLQY+C TCRGE YQV+DLEDAV+E+W+RKDMADKDLIASL+A+A
Sbjct: 323 GISDEKYMQFQVDGNLQYKCSTCRGESYQVKDLEDAVQEIWKRKDMADKDLIASLKASA 381
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 103/225 (45%), Positives = 139/225 (61%), Gaps = 9/225 (4%)
Query: 496 VRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAALRGDRKQLES 555
V + S SN K + + + E L GK N D R + GEV L+ + +
Sbjct: 469 VSKTASKSNGKQEKLQAEETFSREERRSLLGK-NSDEKRGSR---GEVTTLKAEGGFIGR 524
Query: 556 RPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQVSQPEEEKSLI-KGQRSKR 614
+ + N + S KDS+ L+LK +K N E Q S+ E+S KG RSKR
Sbjct: 525 HSDGKGDLNSGSH--DSSQKDSRRLLKLKIKKHNPEGQESEAPSIVYERSKSGKGHRSKR 582
Query: 615 KRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWH 674
KR SP EK+ FNEDED + S +DSL+ E++DA+WILKKLGKDA GK+V++H+ SD+SW
Sbjct: 583 KRASPPAEKSAFNEDEDVSLSREDSLLDEMLDASWILKKLGKDAKGKKVQIHEASDDSWE 642
Query: 675 KGVVTDT--VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 717
KGVV++ GTS L +TL++ +VKT+ELGKQGVRFVPQKQKR+
Sbjct: 643 KGVVSEVGGAGGTSKLMVTLENGKVKTVELGKQGVRFVPQKQKRT 687
>gi|302813786|ref|XP_002988578.1| hypothetical protein SELMODRAFT_447369 [Selaginella moellendorffii]
gi|300143685|gb|EFJ10374.1| hypothetical protein SELMODRAFT_447369 [Selaginella moellendorffii]
Length = 774
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 144/248 (58%), Positives = 185/248 (74%), Gaps = 1/248 (0%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C LC + E E ++ RML+C+ C +++HR CLK+WA NRDLF+W+SW+C CR CE C
Sbjct: 180 FCGLCQLAEAES-KKQERMLTCQGCDRRFHRKCLKDWAGNRDLFNWASWRCLHCRTCEDC 238
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ TGDPN+ +FC+RCD A+H C+ K + GP+LCPKH++CHSCG+ VPG G S RW
Sbjct: 239 KVTGDPNRLLFCKRCDEAHHNNCKQSGAKAPAKGPFLCPKHSQCHSCGTRVPGGGSSSRW 298
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
F Y CDACGRLFVK YCP+C+KVYR+SE TPMV CD C+ WVHC C+GISDEKY +F
Sbjct: 299 FHSYLFCDACGRLFVKDKYCPICMKVYRESEPTPMVLCDGCEHWVHCVCEGISDEKYQEF 358
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q NL++ C CRGEC+Q +E+AV ELW+RKD AD+D I SLRA+AGLP+E E+ +
Sbjct: 359 QTIQNLRFTCAACRGECFQATSVEEAVVELWKRKDEADRDQIKSLRASAGLPSESEMARL 418
Query: 241 SPYSDDEE 248
P SDDE+
Sbjct: 419 CPSSDDEQ 426
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 94/182 (51%), Gaps = 23/182 (12%)
Query: 537 VSRVGEVAALRGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPLRLKFRKPNLENQNSQ 596
V +V A RG + + + R +D+ P+ + L+LK +KP+ ++
Sbjct: 440 VFKVNSSAKARGKSSEEAADSSKKRSRSDNVEP----PETERKTLKLKIKKPS----GTE 491
Query: 597 VSQPEEEKSLIKGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGK 656
V E + +GQRSKRKRP+ E+ E DA +S++D IL +LG
Sbjct: 492 VV---EASNTARGQRSKRKRPASSQEE----EVADAVESDEDDTS--------ILHRLGS 536
Query: 657 DAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKR 716
DA+ KRVEV + SD +W KG +T + S ++ D+ KTL+ GK+ VR + ++++
Sbjct: 537 DAVTKRVEVCRSSDKTWLKGTITHVQQRRSQFTVNFDNGDKKTLKYGKEKVRLLGKRERY 596
Query: 717 SM 718
++
Sbjct: 597 AI 598
>gi|302795017|ref|XP_002979272.1| hypothetical protein SELMODRAFT_444121 [Selaginella moellendorffii]
gi|300153040|gb|EFJ19680.1| hypothetical protein SELMODRAFT_444121 [Selaginella moellendorffii]
Length = 764
Score = 333 bits (855), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 144/248 (58%), Positives = 184/248 (74%), Gaps = 1/248 (0%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C LC E E ++ RML+C+ C +++HR CLK+WA NRDLF+W+SW+C CR CE C
Sbjct: 180 FCGLCQQAEAES-KKQERMLTCQGCDRRFHRKCLKDWAGNRDLFNWASWRCLHCRTCEDC 238
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ TGDPN+ +FC+RCD A+H C+ K + GP+LCPKH++CHSCG+ VPG G S RW
Sbjct: 239 KVTGDPNRLLFCKRCDEAHHNNCKQSGAKAPAKGPFLCPKHSQCHSCGTRVPGGGSSSRW 298
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
F Y CDACGRLFVK YCP+C+KVYR+SE TPMV CD C+ WVHC C+GISDEKY +F
Sbjct: 299 FHSYLFCDACGRLFVKDKYCPICMKVYRESEPTPMVLCDGCEHWVHCVCEGISDEKYQEF 358
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSI 240
Q NL++ C CRGEC+Q +E+AV ELW+RKD AD+D I SLRA+AGLP+E E+ +
Sbjct: 359 QTIQNLRFTCAACRGECFQATSVEEAVVELWKRKDEADRDQIKSLRASAGLPSESEMARL 418
Query: 241 SPYSDDEE 248
P SDDE+
Sbjct: 419 CPSSDDEQ 426
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 86/162 (53%), Gaps = 25/162 (15%)
Query: 560 SRESNDDTSVLQSLPKDSKPP------LRLKFRKPNLENQNSQVSQPEEEKSLIKGQRSK 613
S E D+S +S + +PP L+LK +KP+ ++ E + +GQRSK
Sbjct: 453 SSEEAADSSKKRSRSDNVEPPETERKTLKLKIKKPS-------GTEVVEASNTARGQRSK 505
Query: 614 RKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSW 673
RKRP+ E+ E DA +S++D IL +LG DA+ KRVEV + SD +W
Sbjct: 506 RKRPASSQEE----EVADAVESDEDDTS--------ILHRLGSDAVTKRVEVCRSSDKTW 553
Query: 674 HKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 715
KG +T + S ++ D+ KTL+ GK+ VR + ++++
Sbjct: 554 LKGTITHVQQRRSQFTVNFDNGDKKTLKYGKEKVRLLGKRER 595
>gi|242085692|ref|XP_002443271.1| hypothetical protein SORBIDRAFT_08g016700 [Sorghum bicolor]
gi|241943964|gb|EES17109.1| hypothetical protein SORBIDRAFT_08g016700 [Sorghum bicolor]
Length = 531
Score = 288 bits (738), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 215/600 (35%), Positives = 302/600 (50%), Gaps = 112/600 (18%)
Query: 155 MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRK 214
MVCCDVC++WVH +CDGISDEKY +FQ D NLQY C CRGEC Q+RD EDA+RELW+R+
Sbjct: 1 MVCCDVCEKWVHIECDGISDEKYQEFQADQNLQYTCAACRGECSQIRDTEDAIRELWKRR 60
Query: 215 DMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSP 274
D+ D +L+ +LRAAA LP+ +++ P SDDE+ G VLKNE +LK SLK K P
Sbjct: 61 DVVDHELMVTLRAAAALPSPEDVSPPYPNSDDEKLGAYVLKNESRNTLKFSLKSNSSKPP 120
Query: 275 KKVKEHGKKWL-----------------NKK--------YPRKKGYQMPLNSKPEPDQSF 309
E K NK R+ ++P NS+ DQS
Sbjct: 121 SDTPEQEKIVFKSPGSNKKSSKKKGGQGNKTDDGHDEIFLERRHDVKLP-NSRL-GDQSI 178
Query: 310 EGYHDVHSY---GNSFGDDTQSPKNEGLDIPSSVAGIVSHTEGVCSISQPGILKHKYVDE 366
+G HD + N++ + + L PS A
Sbjct: 179 DGNHDRSPFKNDDNAYISSSTRSSEKSLKSPSKKA------------------------- 213
Query: 367 VMVSDDDKISRVKFKTSKPHDLDSGEDDGKHVSKSKTIKAKKLVINLGARKINVTNSPRS 426
+ ++ D I +VK K SK L +D ++ K+ T KA KLVI+LG+R + SP+S
Sbjct: 214 -VPNNADMIPKVKIKGSKVSTLHY-KDGEENTPKNDTGKATKLVIHLGSRHKTRSGSPKS 271
Query: 427 DASSCQREQDLTTSNGIEDPSLQRMNSKFVLDRHDGSSKLGDGDRVDHSSQSRGLKIAGR 486
+ S+ QREQDL + +G +VD +SQ + + +
Sbjct: 272 ELSNSQREQDLGSIHG---------------------------GKVDVTSQLKSSRSEVK 304
Query: 487 GGNVIKFGRVRQEVSDSNTKVSRGSSADEHEPEHMHVLSGKRNIDRSRAAVSRVGEVAAL 546
+V+K R N+ + ++ +H +GKR S A +S + E A
Sbjct: 305 ERSVMKLVR-ETGAPQRNSLLGDLGTSKKH-------ATGKR----SNALISGM-ENANE 351
Query: 547 RGDRKQLESRPNASRESNDDTSVLQSLPKDSKPPL-RLKFRKPNLENQNSQVSQPEEEKS 605
G R + ++ D+ P + KP L +LKF++P+ E N+Q SQPEE S
Sbjct: 352 TGSRNRSFAQKQYHSSQVDENQGTADSPDNLKPSLLKLKFKRPHFEQLNTQASQPEEPTS 411
Query: 606 LI----------KGQRSKRKRPSPFTEKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLG 655
+ KGQRSKRKRPS EK + A+ +Q S E+MDANWIL+KLG
Sbjct: 412 WVSQQEEQLNVAKGQRSKRKRPS--MEKADGLDGTTPAKRHQQSTDDEVMDANWILRKLG 469
Query: 656 KDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 715
KDAIGKR+EVH SD WH+G+V++ + G TL I LD+ R + +ELGKQ +R + + K
Sbjct: 470 KDAIGKRIEVHLTSDGKWHQGMVSNVIGG--TLCIQLDNGRSENVELGKQAIRLIASRSK 527
>gi|224106095|ref|XP_002314041.1| predicted protein [Populus trichocarpa]
gi|222850449|gb|EEE87996.1| predicted protein [Populus trichocarpa]
Length = 96
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 75/96 (78%), Positives = 82/96 (85%), Gaps = 3/96 (3%)
Query: 622 EKTLFNEDEDAAQSNQDSLMSEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDT 681
EKT++NEDE +QS+ DS E+M+ANWILKKLG DAIGKRVEVHQ SDNSWHKGVV+D
Sbjct: 2 EKTMYNEDEGMSQSHLDS---EMMEANWILKKLGYDAIGKRVEVHQPSDNSWHKGVVSDI 58
Query: 682 VEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQKRS 717
VE TS LSITLDD RVKTLELGKQ VRFV QKQKRS
Sbjct: 59 VEDTSMLSITLDDDRVKTLELGKQAVRFVSQKQKRS 94
>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
[Strongylocentrotus purpuratus]
Length = 4860
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 108/220 (49%), Gaps = 5/220 (2%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C SCG+ YH +CL + + D + W+CP+C+IC+ CR+ GD NK + C CD Y
Sbjct: 393 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 451
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P + + C C CG+ PGNG S RW YT CD+C + KG
Sbjct: 452 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 511
Query: 140 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 196
CP+C K YR + MV C +C R+VH CD KY Q + G Y+CP CR
Sbjct: 512 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 571
Query: 197 CYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDE 236
+ +LE R +D +S R LP D+
Sbjct: 572 PNRGLELERR-RSASPFEDGRRSPFASSSRTPPHLPIPDD 610
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + G R+L+C CG+ YH C+ + + W+C C +CE C ++
Sbjct: 787 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 844
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC PP + V G + C C CGS PG + W
Sbjct: 845 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 902
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 182
YT C C + +C C + YRD+E ++C C CQRW H C+ + E + +
Sbjct: 903 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 955
Query: 183 DGNLQYRCPTCR 194
D + C CR
Sbjct: 956 DKG--FICTLCR 965
>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
[Strongylocentrotus purpuratus]
Length = 4856
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 98/188 (52%), Gaps = 4/188 (2%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C SCG+ YH +CL + + D + W+CP+C+IC+ CR+ GD NK + C CD Y
Sbjct: 392 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 450
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P + + C C CG+ PGNG S RW YT CD+C + KG
Sbjct: 451 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 510
Query: 140 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 196
CP+C K YR + MV C +C R+VH CD KY Q + G Y+CP CR
Sbjct: 511 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 570
Query: 197 CYQVRDLE 204
+ +LE
Sbjct: 571 PNRGLELE 578
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + G R+L+C CG+ YH C+ + + W+C C +CE C ++
Sbjct: 767 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 824
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC PP + V G + C C CGS PG + W
Sbjct: 825 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 882
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 182
YT C C + +C C + YRD+E ++C C CQRW H C+ + E + +
Sbjct: 883 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 935
Query: 183 DGNLQYRCPTCR 194
D + C CR
Sbjct: 936 DKG--FICTLCR 945
>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
[Strongylocentrotus purpuratus]
Length = 4873
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 98/188 (52%), Gaps = 4/188 (2%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C SCG+ YH +CL + + D + W+CP+C+IC+ CR+ GD NK + C CD Y
Sbjct: 409 LFCTSCGQHYHGSCL-DPPVSIDPVVRAGWQCPNCKICQTCRQPGDDNKMLVCDTCDKGY 467
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P + + C C CG+ PGNG S RW YT CD+C + KG
Sbjct: 468 HTFCLKPAMITIPKNGWKCKTCRVCTDCGARTPGNGPSSRWHHNYTVCDSCYQQRNKGYC 527
Query: 140 CPVCLKVYR-DSESTPMVCCDVCQRWVHCQCDG-ISDEKYLQFQVDGN-LQYRCPTCRGE 196
CP+C K YR + MV C +C R+VH CD KY Q + G Y+CP CR
Sbjct: 528 CPICGKAYRHHTTHKVMVQCHLCNRYVHADCDDRTVISKYQQSKAAGQPTPYKCPDCRHR 587
Query: 197 CYQVRDLE 204
+ +LE
Sbjct: 588 PNRGLELE 595
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 88/192 (45%), Gaps = 14/192 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + G R+L+C CG+ YH C+ + + W+C C +CE C ++
Sbjct: 784 MCLSCGSFGLGSEGRLLTCSQCGQCYHPYCVS--IKITKVVLSKGWRCLDCTVCEGCGKS 841
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC PP + V G + C C CGS PG + W
Sbjct: 842 SDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGE--NADWMNN 899
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDVCQRWVHCQCDGISDEKYLQFQV 182
YT C C + +C C + YRD+E ++C C CQRW H C+ + E + +
Sbjct: 900 YTQCGPCASM----THCAYCYRSYRDNE---LLCQCSHCQRWEHALCNSLYTEDETERAM 952
Query: 183 DGNLQYRCPTCR 194
D + C CR
Sbjct: 953 DKG--FICTLCR 962
>gi|291238977|ref|XP_002739402.1| PREDICTED: rCG56742-like, partial [Saccoglossus kowalevskii]
Length = 1566
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 91/178 (51%), Gaps = 4/178 (2%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C SCG+ YH +CL + + W+CP C+IC+ CR+ GD NK + C CD Y
Sbjct: 388 LFCTSCGQHYHGSCLDPPVDVNPVVR-AGWQCPECKICQTCRQPGDDNKMLVCDTCDKGY 446
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P + + + C C CGS PG+G S RW L Y+ CD+C + KG
Sbjct: 447 HTFCLRPVMQTIPKNGWKCKNCRICTDCGSRTPGSGPSSRWHLNYSVCDSCYQQRNKGLC 506
Query: 140 CPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 194
CP+C K YR + M+ C+ C++WVH CD D Q D L Y C CR
Sbjct: 507 CPICGKAYRQHTAHNAMIQCESCKKWVHVDCDESIDISVYQQLKDDKLTTIYNCVDCR 564
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC C F + EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 800 MCVSCGSFGRDAEG-----RLLTCSQCGQCYHPYCVN--IKITKVVLSKGWRCLDCTVCE 852
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + D + + C CD +YH YC PP +NV G + C C CG+ P G +
Sbjct: 853 GCGKASDEGRLLLCDDCDISYHTYCLEPPLQNVPKGGWKCKWCVCCTKCGATSP--GFNS 910
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L CPVC K Y++ E ++ C C RW+H +CDG +E +
Sbjct: 911 EWQNNYTQCGPCSSLLT----CPVCFKEYKEDEL--IIQCVQCYRWLHAECDGFHNEDDI 964
Query: 179 QFQVDGNLQYRCPTCR 194
+ D Y C CR
Sbjct: 965 ERAADQG--YHCLLCR 978
>gi|281202543|gb|EFA76745.1| PHD zinc finger-containing protein [Polysphondylium pallidum PN500]
Length = 604
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 92/179 (51%), Gaps = 1/179 (0%)
Query: 18 RMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
++L C C + +H C+ + ++WKC C++CE C+ T + +K +FC CD
Sbjct: 358 QLLQCVGCLRSFHGKCINLQTLAIETIKKLNTWKCTDCKVCEACKDTTNEDKMLFCDVCD 417
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 136
YH +C +PP + +G + C C CG+ PG + W YT C+ C L +
Sbjct: 418 RGYHTFCLNPPLERPPTGGWRCSTCVFCIHCGTRTPGPQANSAWRGHYTECEQCNVLVAE 477
Query: 137 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 195
YC VC KV + E +P + C C RW H QCDG+S +F+ + N QY+C CR
Sbjct: 478 RKYCSVCRKVIKPHEKSPTIQCGYCDRWTHSQCDGMSVSNLEKFKDNPNHQYKCQACRN 536
>gi|405958289|gb|EKC24431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
Length = 4990
Score = 132 bits (331), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 67/196 (34%), Positives = 99/196 (50%), Gaps = 9/196 (4%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCL-KNWAQNRDLFHWSSWKCPSCRICEI 59
C LC + G + L C SCG YH CL + A + ++ + W+CP C++C++
Sbjct: 1829 FCVLCCQADKIG-----KQLFCTSCGHHYHGGCLHPSVALSPEV--RAGWQCPDCKVCQM 1881
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
CR+ G+ +K + C CD YH +C P + + C C CGS PG+G S R
Sbjct: 1882 CRQPGEDSKMLVCDTCDKGYHTFCLKPVMTAIPKNGWKCKNCRVCGDCGSRTPGSGPSSR 1941
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL- 178
W L Y+ CD+C + KG CP+C K YR M+ C C++ VH +CD D L
Sbjct: 1942 WHLNYSVCDSCYQQRNKGLSCPLCGKAYRQFTQKAMIQCGTCKKHVHAECDDAIDNLMLD 2001
Query: 179 QFQVDGNLQYRCPTCR 194
+ + + + Y C CR
Sbjct: 2002 RVRNEEQVDYMCSVCR 2017
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 83/177 (46%), Gaps = 12/177 (6%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
+++ C CG+ YH C + + W+C C +CE C + D + + C CD
Sbjct: 2160 KLIVCTQCGQCYHPYCAS--VKVTKVILSKGWRCLDCTVCEGCGKPHDEGRLLLCDECDI 2217
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 137
+YH YC PP V G + C C +CG+ PG G + W YT C C R +
Sbjct: 2218 SYHIYCLDPPLDQVPKGTWKCKWCVMCINCGTTTPGFGCN--WQNNYTQCGPC-RSKID- 2273
Query: 138 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
CPVC Y+D E ++ C C RW+H CDG+ E ++ D Y+C CR
Sbjct: 2274 --CPVCRHKYQDDEM--IIQCLQCNRWLHALCDGLRSEDDMERAAD--YDYQCLFCR 2324
>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
Length = 5215
Score = 129 bits (323), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/200 (34%), Positives = 99/200 (49%), Gaps = 8/200 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++ C SCG+ YH C+ AQ + + W+C CRIC++CR TGD K M C +CD
Sbjct: 381 LMFCSSCGEHYHGICV-GLAQLPGV--RAGWQCRKCRICQVCRMTGDETKLMTCEQCDKI 437
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 138
YH CQ P ++ + C C CGS PG GLS RW YT CD+C + KG
Sbjct: 438 YHSTCQRPIVTSIPKYGWKCRCCRVCGDCGSRTPGAGLSSRWHAHYTVCDSCYQQRNKGF 497
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECY 198
CP+C + YR MV C +C+++VH CD +D + + + +Y C
Sbjct: 498 SCPLCHRAYRAHAHREMVQCTLCRKFVHGTCDPEADLVTYHQRKEAHPEYEY-----VCL 552
Query: 199 QVRDLEDAVRELWRRKDMAD 218
++L L +R + D
Sbjct: 553 MCKNLTQPATLLAKRNSIDD 572
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 82/178 (46%), Gaps = 17/178 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C ++EGC ++SC CG+ YH C+ + + W+C C +CE
Sbjct: 705 ICVMCGALGTDHEGC-----LISCVQCGQCYHPYCVN--VKITKVVLQKGWRCLDCTVCE 757
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + D + + C CD +YH YC PP V G + C C +CG+ P G +
Sbjct: 758 GCGQRNDEARLILCDDCDISYHIYCMDPPLDYVPHGNWKCKWCAICQTCGATDP--GFNC 815
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 176
W C CG N CP C + Y SE ++ C C+RW+H CD I E+
Sbjct: 816 SWM---NSCSECGPCASHVN-CPSCSEPY--SEGDLIIQCVQCERWLHGTCDSIKTEE 867
>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
Length = 5713
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/180 (36%), Positives = 93/180 (51%), Gaps = 9/180 (5%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
++ C CG+ YH +C+ L + W+C SCR+C++CR+ D +K M C RC+
Sbjct: 392 LVMCSICGQHYHGSCV-----GLALLPGVRAGWQCASCRVCQVCRQPEDVSKVMLCERCE 446
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 136
AYH C P ++ + C C CGS PG GLS RW YT CD+C + K
Sbjct: 447 KAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLSSRWHSHYTVCDSCYQQRNK 506
Query: 137 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF--QVDGNLQYRCPTCR 194
G CP+C K YR + MV C C+++VH CD +D Q +V + +Y C C+
Sbjct: 507 GFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEADPLTYQHRKEVKPDYEYVCLHCK 566
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 80/174 (45%), Gaps = 17/174 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C +CE
Sbjct: 716 ICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCTVCE 768
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C D + + C CD +YH YC PP V G + C C +CGSN P G +
Sbjct: 769 GCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--GFNS 826
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
W YT C C C C + Y + + ++ C C+RW+HC CD I
Sbjct: 827 SWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSI 874
>gi|328717947|ref|XP_001943997.2| PREDICTED: hypothetical protein LOC100159693, partial
[Acyrthosiphon pisum]
Length = 2904
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/187 (34%), Positives = 93/187 (49%), Gaps = 10/187 (5%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
++ C +CG YH CL L + W+C +CRIC++CR+ + K M C CD
Sbjct: 398 LMMCTACGSHYHGVCL-----GLALLPGVRAGWQCGNCRICQVCRQPAEQTKVMLCEGCD 452
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 136
AYH C P + + C C CGS PG GLS RW YT CD+C + K
Sbjct: 453 KAYHPGCLRPQVTTIPKIGWKCKCCRVCTDCGSRTPGAGLSSRWHAHYTVCDSCYQQRNK 512
Query: 137 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL-QYRCPTCRG 195
G+ CP+C + YR + MV C C++++H CD + +Y+ + +Y CP C+
Sbjct: 513 GSSCPLCHRAYRAAAHREMVQCISCRKYIHGACD--PEAEYITSHSKSSASEYMCPLCKN 570
Query: 196 ECYQVRD 202
Q RD
Sbjct: 571 AVQQRRD 577
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 84/172 (48%), Gaps = 10/172 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G ++ ++SC CG+ YH C+ + + W+C C +CE C +
Sbjct: 723 ICVMCGSLGTDQEACLISCSQCGQCYHPFCVN--VKVTKVILQKGWRCLDCTVCEGCGQR 780
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D ++ + C CD +YH YC P V G + C +C +CGSN P G + W
Sbjct: 781 NDESRLILCDECDISYHIYCTDPKLDYVPRGTWKCKWCAQCLTCGSNDP--GFNCSWLNN 838
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 175
YT C C + CP C + Y D++ ++ C C RW+H +CD I +E
Sbjct: 839 YTECGPCASRSI----CPSCQESYTDNQL--IIKCSQCDRWLHGKCDKIENE 884
>gi|320166419|gb|EFW43318.1| mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864]
Length = 1858
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 93/211 (44%), Gaps = 20/211 (9%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKN-----WAQNRDLFHWSSWKCPSCR 55
+CR C G E M C C + YH C+K+ + SWKC C
Sbjct: 578 LCRGC--GTRGTDEETSGMHWCNQCCQPYHDFCVKSSFGDAYESTLKEIAQGSWKCWDCI 635
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GN 114
+C C + + C C H C P V SG +LC + KC SCG+ P G
Sbjct: 636 VCTTCNSSFPEETLVVCDNCAVGRHLGCMDIPLAEVPSGRWLCSQCVKCDSCGAQTPRGM 695
Query: 115 GLS-----------VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQ 162
G + W Y+ C CG L +GNYC VC KVY D + TPM+ C+ C
Sbjct: 696 GKTRLPSSFPSSQPCEWMFDYSLCQPCGLLKARGNYCRVCEKVYEDDDYDTPMISCEQCS 755
Query: 163 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 193
W+H C G+ +E Y + D NL + CP+C
Sbjct: 756 MWLHTHCVGMDEETYEMYSNDENLAFTCPSC 786
>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
Length = 5622
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 65/180 (36%), Positives = 93/180 (51%), Gaps = 9/180 (5%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
++ C CG+ YH +C+ L + W+C SCR+C++CR+ D +K M C RC+
Sbjct: 383 LVMCSICGQHYHGSCV-----GLALLPGVRAGWQCVSCRVCQVCRQPEDVSKVMLCERCE 437
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 136
AYH C P ++ + C C CGS PG GLS RW YT CD+C + K
Sbjct: 438 KAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLSSRWHSHYTVCDSCYQQRNK 497
Query: 137 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQ--VDGNLQYRCPTCR 194
G CP+C K YR + MV C C+++VH CD +D Q + V + +Y C C+
Sbjct: 498 GFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEADPLTYQHRKDVKPDYEYVCLHCK 557
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 80/174 (45%), Gaps = 17/174 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C +CE
Sbjct: 707 ICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCTVCE 759
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C D + + C CD +YH YC PP V G + C C +CGSN P G +
Sbjct: 760 GCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--GFNS 817
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
W YT C C C C + Y +E ++ C C+RW+HC CD I
Sbjct: 818 SWQKNYTQCGPCA----SHAACISCQETY--TEGDLIIQCIQCERWLHCACDSI 865
>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
Length = 5619
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 65/180 (36%), Positives = 93/180 (51%), Gaps = 9/180 (5%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
++ C CG+ YH +C+ L + W+C SCR+C++CR+ D +K M C RC+
Sbjct: 383 LVMCSICGQHYHGSCV-----GLALLPGVRAGWQCVSCRVCQVCRQPEDVSKVMLCERCE 437
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 136
AYH C P ++ + C C CGS PG GLS RW YT CD+C + K
Sbjct: 438 KAYHPSCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLSSRWHSHYTVCDSCYQQRNK 497
Query: 137 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQ--VDGNLQYRCPTCR 194
G CP+C K YR + MV C C+++VH CD +D Q + V + +Y C C+
Sbjct: 498 GFSCPLCRKAYRAAAYREMVQCSACKKFVHGTCDPEADPLTYQHRKDVKPDYEYVCLHCK 557
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 80/174 (45%), Gaps = 17/174 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C +CE
Sbjct: 707 ICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCTVCE 759
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C D + + C CD +YH YC PP V G + C C +CGSN P G +
Sbjct: 760 GCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--GFNS 817
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
W YT C C C C + Y +E ++ C C+RW+HC CD I
Sbjct: 818 SWQKNYTQCGPCA----SHTACISCQETY--TEGDLIIQCIQCERWLHCACDSI 865
>gi|242016925|ref|XP_002428945.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
gi|212513774|gb|EEB16207.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
Length = 6073
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/236 (31%), Positives = 111/236 (47%), Gaps = 22/236 (9%)
Query: 4 LCFVGENEGCE------RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCR 55
L +GE C +L C SCG +H +C+ L + W+C CR
Sbjct: 370 LLPIGETSQCSTCLSLGNVSNILMCTSCGAHHHGSCV-----GLALLPGVRAGWQCFECR 424
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 115
+C++CR+ + K M C CD AYH C P ++ + C C CGS PG+G
Sbjct: 425 VCQVCRQPSEIGKIMLCESCDKAYHPSCLRPIVTSIPKYGWKCKCCRVCSDCGSRTPGSG 484
Query: 116 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 175
LS RW ++ CD+C + KG CPVC + YR + MV C C+++VH CD +D
Sbjct: 485 LSSRWHNHFSVCDSCYQQRNKGFCCPVCGRAYRAAAHREMVQCIKCRKYVHGSCDNEADI 544
Query: 176 KYLQFQVDGN--LQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAA 229
+ + N +Y C C ++L R+ +RKD D+ L+ S +A+
Sbjct: 545 SVYAARKETNPDYEYICCIC-------KNLNSMGRQGIKRKDSFDEALLESSLSAS 593
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 94/196 (47%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C + EGC ++SC CG+ YH C+ + + W+C C +CE
Sbjct: 723 ICVMCGALGTDQEGC-----LISCAQCGQCYHPYCVN--VKVTKVILQKGWRCLDCTVCE 775
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + D ++ C CD +YH YC PP V G + C C CGSN P G +
Sbjct: 776 GCGQRNDDSRLTLCDDCDISYHIYCMDPPLDYVPRGVWKCKWCVVCIRCGSNDP--GFNC 833
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W GYT C C +CP CL+ Y + E ++ C+ C+RW+H CDGI ++ L
Sbjct: 834 NWMNGYTECGPCA----SHTFCPSCLEPYVEGEL--VIQCEQCERWLHGSCDGIRND--L 885
Query: 179 QFQVDGNLQYRCPTCR 194
+ + +Y C CR
Sbjct: 886 DAEKCADEKYTCVLCR 901
>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
Length = 3474
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 102/205 (49%), Gaps = 20/205 (9%)
Query: 19 MLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
++ C CG+ YH +C L R + W+C SCR+C++CR+ D +K M C RCD
Sbjct: 379 LVMCSVCGQHYHGSCVGLALLPGVR-----AGWQCVSCRVCQVCRQPEDVSKVMLCERCD 433
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 136
AYH C P ++ + C C CGS PG GLS RW YT CD+C + K
Sbjct: 434 KAYHPGCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLSSRWHSHYTVCDSCYQQRNK 493
Query: 137 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDG--NLQYRCPTCR 194
G CP+C K YR + MV C C+++VH CD +D Q + + + +Y C C+
Sbjct: 494 GFSCPLCRKAYRAAAYREMVQCHGCKKFVHGTCDPEADPLTYQQRKEAKPDYEYVCLHCK 553
Query: 195 GECYQVRDLEDAVRELWRRKDMADK 219
++ + RRKD D+
Sbjct: 554 -----------SIAMVARRKDSIDE 567
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 84/177 (47%), Gaps = 17/177 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C +CE
Sbjct: 703 ICVMCGSIGMDQEGC-----LIACVQCGQCYHPYCAG--VKITKVILQKGWRCLDCTVCE 755
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C D + + C CD +YH YC PP V G + C +C +CGSN P G +
Sbjct: 756 GCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAQCQTCGSNDP--GFNS 813
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 175
W YT C C C VC + Y++ + ++ C C+RW+HC CD I E
Sbjct: 814 SWQKSYTQCGPCA----SHTACVVCQEAYQEGDL--IIQCVQCERWLHCGCDSIKSE 864
>gi|326679526|ref|XP_001919281.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Danio rerio]
Length = 3915
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/191 (34%), Positives = 92/191 (48%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+LSC CG+ YH C+ + + W+C C +CE C +
Sbjct: 20 MCVVCGSFGRGAEGRLLSCSQCGQCYHPFCVN--IKITKVVLSKGWRCLECTVCEACGQA 77
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP +NV G + C C CG+ P GL W
Sbjct: 78 SDPGRLLLCDDCDISYHTYCLDPPLQNVPKGSWKCKWCVLCTHCGATSP--GLRCEWQNN 135
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L V CPVC + YR+ E ++ C C RWVH C G++ ++ ++ D
Sbjct: 136 YTQCGPCASLTV----CPVCTRSYREEEL--ILQCRQCDRWVHGSCQGLNSDEDVENAAD 189
Query: 184 GNLQYRCPTCR 194
+ C CR
Sbjct: 190 EG--FDCTLCR 198
>gi|449684588|ref|XP_002166105.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Hydra
magnipapillata]
Length = 229
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 99/195 (50%), Gaps = 18/195 (9%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
++++C C H +CL + W+C C++C C D ++ MFC CD
Sbjct: 14 KLINCSQCSNGGHPSCLDMNKSLLKVIKGYPWQCMECKVCTECLAPHDEHEMMFCDNCDR 73
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFLGYT----------- 125
YH YC K + G ++C + KC SC S P +G S RW + +T
Sbjct: 74 GYHSYCVGV--KEIPKGRWVCNRCGKCCSCLSRQPVSDGGSGRWKMEFTKPTDGSEPEFL 131
Query: 126 --CCDACGRLFVKGNYCPVCLKVYRDSEST--PMVCCDVCQRWVHCQCDGISDEKYLQFQ 181
C C LF KG++CPVCLKVY D + PMVCCD C RW+H CDGI +++Y++
Sbjct: 132 QNHCRKCSILFRKGSFCPVCLKVYCDDDGVVNPMVCCDNCDRWIHTDCDGIDEQRYIELS 191
Query: 182 VDGNLQYRCPTCRGE 196
D + Y C CRGE
Sbjct: 192 KDHHSAYTCVLCRGE 206
>gi|291240901|ref|XP_002740354.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like
[Saccoglossus kowalevskii]
Length = 4402
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 100/202 (49%), Gaps = 20/202 (9%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCRICEI 59
+CF+ + G + ++ C C + +H CL K N D+ W C C+ C +
Sbjct: 1052 VCFLCASTG---QQELVYCNVCCEPFHEFCLEEDEKPLDDNTDI-----WCCKRCKFCHV 1103
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGL 116
C R + + C +C YH C P + + ++C K +C SCG+ PG
Sbjct: 1104 CGRQQN---LLQCDKCHNTYHAECLGPNYPTKPTKKKKVWICTKCVRCKSCGATTPGQSS 1160
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDE 175
S +W ++ C CG+LF GNYCP+C + Y D + + M+ C C+ WVH +C+G++DE
Sbjct: 1161 SAQWSHDFSLCQDCGKLFDIGNYCPLCQQCYTDDDYDSKMMQCPCCESWVHAKCEGLTDE 1220
Query: 176 KY-LQFQVDGNLQYRCPTCRGE 196
Y + + ++ Y C C+ E
Sbjct: 1221 MYQIMCEFPEDIHYTCSKCQPE 1242
>gi|297736277|emb|CBI24915.3| unnamed protein product [Vitis vinifera]
Length = 75
Score = 120 bits (302), Expect = 2e-24, Method: Composition-based stats.
Identities = 57/73 (78%), Positives = 62/73 (84%)
Query: 645 MDANWILKKLGKDAIGKRVEVHQQSDNSWHKGVVTDTVEGTSTLSITLDDSRVKTLELGK 704
MDANWILKKLGKDAIGKRVEVHQ SDNSWHKG+V D +EGTSTL + DD R KTLELGK
Sbjct: 1 MDANWILKKLGKDAIGKRVEVHQSSDNSWHKGMVIDFIEGTSTLIVKFDDGRAKTLELGK 60
Query: 705 QGVRFVPQKQKRS 717
Q +R + QKQKRS
Sbjct: 61 QAIRLISQKQKRS 73
>gi|427798455|gb|JAA64679.1| Putative phagocytosis engulfment, partial [Rhipicephalus
pulchellus]
Length = 951
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)
Query: 6 FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+VG C+ M+S C CG YH CL W+CP C+ C+
Sbjct: 18 YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 76
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
C R GD + + C CD A+H YC P NV + C C CGS PG+G S R
Sbjct: 77 CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 136
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W + Y+ CD+C + KG CP+C K YR S M C VC++++H +CDG +
Sbjct: 137 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 192
Query: 179 QFQVDGNLQYRCPTC 193
DG+ Y CP C
Sbjct: 193 SSPKDGD--YVCPVC 205
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 96/196 (48%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C F EG R+++C CG+ YH C+ + + W+C C +CE
Sbjct: 380 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCE 432
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + D ++ + C CD +YH YC PP + V G + C C CG+ PGNG
Sbjct: 433 GCGQPHDESRLLLCDECDISYHTYCLSPPLETVPQGNWKCRWCVICVKCGATEPGNG--S 490
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
+W YT C C + CP+CL Y+DSE ++ C C+RW+H CD IS E+
Sbjct: 491 QWQNNYTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEEDA 544
Query: 179 QFQVDGNLQYRCPTCR 194
+ + Y CP CR
Sbjct: 545 ERCAE--YGYNCPYCR 558
>gi|241859648|ref|XP_002416243.1| hypothetical protein IscW_ISCW023204 [Ixodes scapularis]
gi|215510457|gb|EEC19910.1| hypothetical protein IscW_ISCW023204 [Ixodes scapularis]
Length = 1179
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 91/195 (46%), Gaps = 15/195 (7%)
Query: 6 FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+VG C+ M+S C CG YH CL + A W+CP C+ C+
Sbjct: 18 YVGSQANCQSCEEMVSPSELLFCTLCGAHYHGFCL-DPAVRVTTSTRVGWQCPDCKACQA 76
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
CRR GD + + C CD +H YC P NV + C C CGS PG+G S R
Sbjct: 77 CRRPGDEARLLTCDICDKGFHVYCVKPVVANVPKHGWKCQNCRVCGDCGSRTPGSGPSSR 136
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W + ++ CD+C + KG CP+C K YR S M C +C++++H +CD
Sbjct: 137 WHMNFSVCDSCYQQRNKGVACPLCGKAYRQFSHREDMAQCTMCRKYIHMECDS------- 189
Query: 179 QFQVDGNLQYRCPTC 193
Q + Y CP C
Sbjct: 190 QLANHQDADYVCPVC 204
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 94/196 (47%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C F EG R+++C CG+ YH C+ N R + W+C C +CE
Sbjct: 287 LCAMCGSFGRAEEG-----RLIACAQCGQCYHPYCV-NVKVTRMILK-KGWRCLDCTVCE 339
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + D ++ + C CD +YH YC PP +NV G + C C CG+ PG G
Sbjct: 340 GCGQPHDESRLLLCDECDISYHTYCLSPPLENVPQGNWKCRWCVVCLQCGATDPGFG--S 397
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C CP+CL Y++++ ++ C C+RW+H CD I+ E+
Sbjct: 398 HWQNNYTQCGPC----ASKTSCPLCLLKYQENDL--VIQCVQCERWMHGFCDQIACEE-- 449
Query: 179 QFQVDGNLQYRCPTCR 194
+ Y CP CR
Sbjct: 450 DAEKCAEYGYNCPYCR 465
>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
Length = 4906
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 942 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 994
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ P GL
Sbjct: 995 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1052
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CP+C + YRD E ++ C C RW+H C ++ E+ +
Sbjct: 1053 EWQNNYTQCAPCASL----STCPICYRTYRDEEL--IIQCRQCDRWMHAICQNLNTEEEV 1106
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 1107 ENIAD--MGFDCTICR 1120
Score = 95.9 bits (237), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 86/187 (45%), Gaps = 22/187 (11%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C +CG+ YH CL + W+CP C++C+ C+ +G+ NK + C CD Y
Sbjct: 359 LFCTTCGQHYHGMCLDIQVTP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 415
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P +V + + C C CG+ S +W CD+C + + N
Sbjct: 416 HTFCLQPVMDSVPTNGWKCKNCRVCAECGTRT-----SCQWHHNCLVCDSCYQQ--QDNL 468
Query: 140 -CPVCLKVYRDSESTPMVCCDVCQRWVHCQCD---GISDEKYLQFQVDGNLQYRCPTCR- 194
CP C K+ M+ C +C+RW+H +CD GI E L+ Y C C+
Sbjct: 469 SCPFCDKLCLQDFQKDMLHCHMCKRWIHMECDRSPGIELESQLK-------DYICTLCKQ 521
Query: 195 GECYQVR 201
GE Q +
Sbjct: 522 GEGDQTQ 528
>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
guttata]
Length = 4871
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 912 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 964
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ P GL
Sbjct: 965 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1022
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CP+C + YRD E ++ C C RW+H C ++ E+ +
Sbjct: 1023 EWQNNYTQCAPCASL----STCPICYRTYRDEEL--IIQCRQCDRWMHAICQNLNTEEEV 1076
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 1077 ENIAD--MGFDCTICR 1090
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 84/184 (45%), Gaps = 16/184 (8%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C +CG+ YH CL + W+CP C++C+ C+ +G+ NK + C CD Y
Sbjct: 327 LFCTTCGQHYHGMCLDIQVTP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 383
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P V + + C C CG+ S +W CD+C + + N
Sbjct: 384 HTFCLQPVMDAVPTNGWKCKNCRVCAECGTRT-----SCQWHHNCLVCDSCYQQ--QDNL 436
Query: 140 -CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR-GEC 197
CP C K+ M+ C +C+RW+H CD S L+ Q+ Y C CR GE
Sbjct: 437 SCPFCEKLCLQDFQKDMLHCHMCKRWIHIDCDR-SPGSELESQLK---DYICTLCRQGEG 492
Query: 198 YQVR 201
Q +
Sbjct: 493 DQTQ 496
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 991 MCVVCGSFGLGAEGRLLACAQCGQCYHPFCVG--IKITKVVLSKGWRCLECTVCEACGQA 1048
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP +NV + C C CG+ P GL W
Sbjct: 1049 TDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTP--GLRCEWQNN 1106
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CP+CL Y SE T +V C C RW H C + E+ ++ D
Sbjct: 1107 YTLCAPCASL----STCPICLVDY--SEGTIIVQCRQCDRWFHASCQSLHSEEDIEKAAD 1160
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1161 SS--FDCTMCR 1169
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 73/175 (41%), Gaps = 11/175 (6%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C SCG YH CL + W+CP C++C+ C+ G+ K + C CD Y
Sbjct: 357 LFCTSCGLHYHGICLDMAVTP---LRRAGWQCPECKVCQTCKNPGEDTKMLVCDMCDKGY 413
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P + + + C C CG+ G +W C+ C +
Sbjct: 414 HTFCLQPVIDTLPTNGWRCQNCRVCLQCGTRTGG-----QWHHTSLLCENCVQNQDPALC 468
Query: 140 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
CP+C + +V C C+RW+H +C+ + Q ++ Y C CR
Sbjct: 469 CPMCSCILDPEHHKDLVFCHTCKRWLHLECE---RQNSGQAEIHPREDYVCSNCR 520
>gi|432916836|ref|XP_004079403.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4802
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 965 MCVVCGSFGLGAEGRLLACAQCGQCYHPFCVG--IKINKVVLSKGWRCLECTVCEACGQA 1022
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP +NV + C C CG+ P GL W
Sbjct: 1023 TDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTP--GLRCEWQSN 1080
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CP+CL Y SE T +V C C RW H C G+ E L+ +
Sbjct: 1081 YTQCAPCASL----STCPICLVNY--SEGTVIVQCRQCDRWFHASCQGLHSEDDLEKAAE 1134
Query: 184 GNLQYRCPTCR 194
+ + C C+
Sbjct: 1135 NS--FDCTICQ 1143
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 75/175 (42%), Gaps = 11/175 (6%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C SCG YH CL + W+CP C++C+ C+ GD K + C CD Y
Sbjct: 357 LFCTSCGLHYHGMCLDMAVTP---LRRAGWQCPECKVCQTCKNHGDDTKMLVCDMCDKGY 413
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P +++ + + C C CG+ G+ W C+ C +
Sbjct: 414 HTFCLQPAMESLPTNGWRCKNCRVCIQCGTRTSGH-----WHHNSLLCENCFQNQDPALC 468
Query: 140 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
C +C + ++ C C+RW+H +C+ + Q +++ Y C CR
Sbjct: 469 CSMCSCILDPEHHKDLLFCQTCKRWLHLECE---RQNSGQTEINPREDYVCFNCR 520
>gi|301606681|ref|XP_002932945.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 3840
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 99/199 (49%), Gaps = 16/199 (8%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+CF+ + G + C+ C + +HR CL+ + + +W C C+ C +C R
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433
Query: 64 GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSNVPGNGL 116
K + C +C +YH C P N + P ++C K +C SCGS PG G
Sbjct: 1434 QQATKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKCVRCKSCGSTTPGKGW 1490
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDE 175
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ ++DE
Sbjct: 1491 DAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGKCDRWVHSKCENLTDE 1550
Query: 176 KY-LQFQVDGNLQYRCPTC 193
Y + + ++ Y C C
Sbjct: 1551 MYEILSNLPESVAYTCINC 1569
>gi|47228227|emb|CAG07622.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4527
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 87/191 (45%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 671 MCVVCGSFGLGAEGRLLACAQCGQCYHPYCVG--IKINKVVLSKGWRCLECTVCEACGQA 728
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP +NV + C C CG+ P GL W
Sbjct: 729 TDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVTCTQCGATTP--GLRCEWQKN 786
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L CP+CL Y SE T ++ C C RW H C + E+ ++ +
Sbjct: 787 YTQCAPCASLMT----CPICLVDY--SEGTTILQCRQCDRWFHASCQSLHSEEDVEKAAE 840
Query: 184 GNLQYRCPTCR 194
+ C CR
Sbjct: 841 NG--FNCTMCR 849
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
L C SCG+ YH CL + W+CP C+IC+ C+
Sbjct: 138 LFCTSCGQHYHGICLDMAVTP---LRRAGWQCPECKICQTCK 176
>gi|359075420|ref|XP_003587289.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
Length = 2711
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1210 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1268
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +LF
Sbjct: 1269 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1327
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1328 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1387
Query: 193 CRGECY 198
C G +
Sbjct: 1388 CAGATH 1393
>gi|358416718|ref|XP_003583467.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
Length = 2688
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1187 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1245
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +LF
Sbjct: 1246 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1304
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1305 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1364
Query: 193 CRGECY 198
C G +
Sbjct: 1365 CAGATH 1370
>gi|440894918|gb|ELR47236.1| Histone-lysine N-methyltransferase MLL4, partial [Bos grunniens
mutus]
Length = 2524
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1066 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1124
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +LF
Sbjct: 1125 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 1183
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1184 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1243
Query: 193 CRGECY 198
C G +
Sbjct: 1244 CAGATH 1249
>gi|328874899|gb|EGG23264.1| PHD zinc finger-containing protein [Dictyostelium fasciculatum]
Length = 758
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 89/184 (48%), Gaps = 3/184 (1%)
Query: 18 RMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
+ SC C + YH CL+ N + +WKC C++CE+C +K MFC CD
Sbjct: 494 QTFSCIGCHRVYHGKCLQLNQLAIDTIKRNGNWKCIDCKLCEVCNEGVHEDKMMFCDVCD 553
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 136
YH +C P +G + C + C CGS G S +W YT C+ C
Sbjct: 554 KGYHTFCCSPKLDAPPTGGWKCSQCVHCIHCGSRSAGPSSSSKWNANYTVCEVCTPKVQD 613
Query: 137 GNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL-QYRCPTCR 194
YC VC K+ + S E P+V C C RW H CD I++E + + + N QY+CPTC+
Sbjct: 614 KKYCTVCRKIIKSSGEKKPIVQCVYCDRWTHAGCDSITEEFLEKMKENPNYHQYKCPTCK 673
Query: 195 GECY 198
Y
Sbjct: 674 TGNY 677
>gi|431918577|gb|ELK17795.1| Histone-lysine N-methyltransferase MLL4 [Pteropus alecto]
Length = 3017
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1517 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1575
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1576 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1634
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1635 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1694
Query: 193 CRGECY 198
C G +
Sbjct: 1695 CAGATH 1700
>gi|403293026|ref|XP_003937525.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
boliviensis boliviensis]
Length = 2665
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 18/200 (9%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1165 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1223
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1224 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1282
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1283 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1342
Query: 193 CRGECYQVRDLEDAVRELWR 212
C G AV+ WR
Sbjct: 1343 CAG----------AVQPRWR 1352
>gi|358334996|dbj|GAA53428.1| histone-lysine N-methyltransferase MLL3, partial [Clonorchis
sinensis]
Length = 3518
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 83/176 (47%), Gaps = 11/176 (6%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L+C CG+ YH C R + W+C C +CE C T + + + C CD +
Sbjct: 523 LLACAQCGQCYHPFCADVPKITRTMLE-KGWRCLDCTVCEGCGGTTNESLLLLCDDCDIS 581
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 138
YH YC PP + V G + C + C +CG P GL+ +W Y+ C C L
Sbjct: 582 YHTYCLDPPLQEVPKGGWKCSECVVCTNCGQRDP--GLNGKWHANYSMCAPCASLAT--- 636
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
CPVC YR+ E ++ C +C RW H CD + E L+ D + Y C CR
Sbjct: 637 -CPVCTLAYREGEL--LIRCALCSRWSHAGCDQLRTEDELELATD--MGYNCLLCR 687
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 82/217 (37%), Gaps = 42/217 (19%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C CG YH +CL+ Q W+C C+ C IC + D NK + C CD
Sbjct: 53 LLFCTGCGSHYHGSCLEPSLQPNPTIRIG-WQCAECKACLICNESKDENKMLVCDVCDKG 111
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCG----SNVPGNGLSV----------RWFLGY 124
+H YC PP + + C + C CG S V G G+ V +W Y
Sbjct: 112 FHTYCLRPPVSCIPRNGFKCERCRVCSDCGAGRASTVSGLGVMVEFNNPQLPVIKWHSNY 171
Query: 125 TCCDACGRLFVKGNY-CPVCLKVYRDSESTPMVC----------------CDVCQRWVHC 167
T CD C + CPVC + +R S P C C+R VH
Sbjct: 172 TLCDRCFHSRKRPTASCPVCERAWRCSLQVPSYISTQPQTSTHVTWPGRRCTKCRRMVHA 231
Query: 168 QCDGI----------SDEKYLQFQVDGNLQYRCPTCR 194
CD + S + G + Y CP CR
Sbjct: 232 DCDPLQSVATGTASPSSAMSEDTNIAGGIAYSCPVCR 268
>gi|296233585|ref|XP_002807874.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4 [Callithrix jacchus]
Length = 2660
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 18/200 (9%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1160 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1218
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1219 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1277
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1278 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1337
Query: 193 CRGECYQVRDLEDAVRELWR 212
C G AV+ WR
Sbjct: 1338 CAG----------AVQPRWR 1347
>gi|56744180|dbj|BAD81031.1| mixed lineage leukemia 2 [Mus musculus]
Length = 2713
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279
Query: 78 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 1280 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 1338
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398
Query: 193 CRG 195
C G
Sbjct: 1399 CAG 1401
>gi|335289510|ref|XP_003127115.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Sus scrofa]
Length = 2721
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1280 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1338
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398
Query: 193 CRGECY 198
C G +
Sbjct: 1399 CAGATH 1404
>gi|115495457|ref|NP_083550.2| histone-lysine N-methyltransferase MLL4 [Mus musculus]
gi|341940998|sp|O08550.3|MLL4_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 4
homolog; AltName: Full=Trithorax homolog 2; AltName:
Full=WW domain-binding protein 7; Short=WBP-7
Length = 2713
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1221 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1279
Query: 78 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 1280 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 1338
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1339 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1398
Query: 193 CRG 195
C G
Sbjct: 1399 CAG 1401
>gi|402905199|ref|XP_003915410.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Papio
anubis]
Length = 2716
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1216 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1274
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1275 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1333
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1334 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1393
Query: 193 CRG 195
C G
Sbjct: 1394 CAG 1396
>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ornithorhynchus anatinus]
Length = 4910
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 962 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 1014
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ P GL
Sbjct: 1015 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1072
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ E ++ C C RW+H C ++ E+ +
Sbjct: 1073 EWQNNYTQCAPCASL----STCPVCYRNYREEEL--ILQCRQCDRWMHAICQNLNTEEEV 1126
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 1127 ENIAD--IGFDCTMCR 1140
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 83/173 (47%), Gaps = 12/173 (6%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+ +G+ NK + C CD YH
Sbjct: 354 CTTCGQHYHGMCLDIAITP---LKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHT 410
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P +V + + C C CG+ S +W CD+C + + CP
Sbjct: 411 FCLQPVIDSVPTNGWKCKNCRVCAECGTRT-----SAQWHHNCLVCDSCYQQ-QESLSCP 464
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
C K Y M+ C +C+RW+H +CD +D + L+ Q+ +Y C C+
Sbjct: 465 FCGKYYHPDFQKDMLHCHMCKRWIHIECDKPTDTE-LESQL--REEYICMFCK 514
>gi|5923931|gb|AAD56420.1|AF186605_1 MLL2 protein [Homo sapiens]
Length = 2605
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1105 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1163
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1164 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1222
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1223 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1282
Query: 193 CRG 195
C G
Sbjct: 1283 CAG 1285
>gi|7662046|ref|NP_055542.1| histone-lysine N-methyltransferase MLL4 [Homo sapiens]
gi|12643900|sp|Q9UMN6.1|MLL4_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein
4; AltName: Full=Trithorax homolog 2; AltName: Full=WW
domain-binding protein 7; Short=WBP-7
gi|5123787|emb|CAB45385.1| trithorax homologue 2 [Homo sapiens]
Length = 2715
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1215 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1273
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1274 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1332
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1333 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1392
Query: 193 CRG 195
C G
Sbjct: 1393 CAG 1395
>gi|410905295|ref|XP_003966127.1| PREDICTED: uncharacterized protein LOC101073293 [Takifugu rubripes]
Length = 3463
Score = 115 bits (288), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 10/195 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RR 62
+CF+ ++G ML C+ C + +HR CL+ A+ + +W C CR C +C R+
Sbjct: 1732 VCFLCASKG---QHEMLHCQVCCEPFHRFCLEP-AERPSEENKENWCCRRCRFCHVCGRK 1787
Query: 63 TGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ + C RC YH C P P +N ++C +C SCG PG + W
Sbjct: 1788 NKNSKPLLECERCQNCYHASCLGPSYPKQNKKRKTWVCVTCIRCKSCGV-TPGKSWDIDW 1846
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-L 178
C C +LF GNYCP+C K Y D++ + M+ C C WVH +C+ ++DE Y +
Sbjct: 1847 NHEKGLCQDCSKLFEMGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDLTDELYEI 1906
Query: 179 QFQVDGNLQYRCPTC 193
+ ++ Y C C
Sbjct: 1907 LSSLPESVVYSCRPC 1921
>gi|395846912|ref|XP_003796132.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Otolemur
garnettii]
Length = 2714
Score = 115 bits (288), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1210 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1268
Query: 78 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1269 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1327
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1328 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1387
Query: 193 CRG 195
C G
Sbjct: 1388 CAG 1390
>gi|432100936|gb|ELK29286.1| Histone-lysine N-methyltransferase MLL4 [Myotis davidii]
Length = 2566
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 950 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1008
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1009 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1067
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1068 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1127
Query: 193 CRGECY 198
C G +
Sbjct: 1128 CAGATH 1133
>gi|426388428|ref|XP_004060643.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Gorilla
gorilla gorilla]
Length = 2536
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1120 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1178
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1179 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1237
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1238 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1297
Query: 193 CRG 195
C G
Sbjct: 1298 CAG 1300
>gi|332855019|ref|XP_512597.3| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Pan
troglodytes]
Length = 2526
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1026 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1084
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1085 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1143
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1144 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1203
Query: 193 CRG 195
C G
Sbjct: 1204 CAG 1206
>gi|71891784|dbj|BAA20763.3| KIAA0304 protein [Homo sapiens]
Length = 2415
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 915 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 973
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 974 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1032
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1033 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1092
Query: 193 CRG 195
C G
Sbjct: 1093 CAG 1095
>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3691
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ N R + W+C C +CE C
Sbjct: 151 MCVVCGSFGQGAEGRLLACSQCGQCYHPFCV-NVKMTRVVL-TKGWRCLECTVCEACGEA 208
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP V G + C +C CGS+ P G+ W
Sbjct: 209 SDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKWCVRCVQCGSSSP--GVRCDWQDN 266
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
Y+CC CG L CP+C + Y E ++ C C RWVH C + E+ ++ D
Sbjct: 267 YSCCGPCGSL----RRCPLCQRPYAHDEL--IMQCQQCDRWVHATCQNLMCEEDVEAAAD 320
Query: 184 GNLQYRCPTCR 194
+ C CR
Sbjct: 321 EG--FDCSLCR 329
>gi|441627688|ref|XP_003280142.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Nomascus leucogenys]
Length = 2433
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1192 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1250
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1251 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1309
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1310 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1369
Query: 193 CRG 195
C G
Sbjct: 1370 CAG 1372
>gi|397490588|ref|XP_003816282.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Pan paniscus]
Length = 2776
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1276 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1334
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1335 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1393
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1394 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCRP 1453
Query: 193 CRG 195
C G
Sbjct: 1454 CAG 1456
>gi|351711122|gb|EHB14041.1| Histone-lysine N-methyltransferase MLL4, partial [Heterocephalus
glaber]
Length = 2592
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1072 LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRASKHLLECERCCH 1130
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1131 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1189
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1190 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1249
Query: 193 CRG 195
C G
Sbjct: 1250 CAG 1252
>gi|33990004|gb|AAH56344.1| Wbp7 protein, partial [Mus musculus]
Length = 2013
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 512 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 570
Query: 78 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 571 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 629
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 630 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 689
Query: 193 CRG 195
C G
Sbjct: 690 CAG 692
>gi|444509617|gb|ELV09373.1| Histone-lysine N-methyltransferase MLL4, partial [Tupaia chinensis]
Length = 2209
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1067 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1125
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1126 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1184
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1185 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1244
Query: 193 CRG 195
C G
Sbjct: 1245 CAG 1247
>gi|297276803|ref|XP_001112093.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Macaca
mulatta]
Length = 2789
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1337 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1395
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1396 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCIQLY 1454
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1455 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1514
Query: 193 CRG 195
C G
Sbjct: 1515 CAG 1517
>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Loxodonta africana]
Length = 4785
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 91/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 905 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 962
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ P GL W
Sbjct: 963 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRCEWQNN 1020
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1021 YTQCAPCASL----STCPVCYRHYREEDL--ILQCRQCDRWMHAICQNLNTEEEVENVAD 1074
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1075 --IGFDCSMCR 1083
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 307 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 363
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 364 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 416
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C VC+RWVH +CD +D ++D L +Y C C+
Sbjct: 417 FCGKCYHPEFQEDMLHCSVCKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 466
>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4301
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 84/177 (47%), Gaps = 17/177 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F +EG ++L+C C + YH C+ + L W+C C +CE
Sbjct: 188 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 240
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
+C + DP++ + C CD +YH YC PP NV G + C C CGSN P G
Sbjct: 241 MCGKASDPSRLLLCDDCDVSYHTYCLEPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 298
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 175
W YT C C L CPVC + + + E ++ C C RWVH C+ + E
Sbjct: 299 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTE 349
>gi|195431535|ref|XP_002063792.1| GK15714 [Drosophila willistoni]
gi|194159877|gb|EDW74778.1| GK15714 [Drosophila willistoni]
Length = 1503
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 122/283 (43%), Gaps = 28/283 (9%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 73
+++ C SCG +H C L N R S W C C C+ICR+ D N KF+ C
Sbjct: 216 KLIMCSSCGDHFHSTCIGLANLPDTR-----SGWCCARCTKCQICRQQ-DSNDIKFVKCE 269
Query: 74 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 133
+C YH C P ++ + C + C CGS PG G S RW YT CD+C +
Sbjct: 270 QCQKIYHASCLRPVISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329
Query: 134 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 191
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLMAYHKKKEQNPDYDYVCP 389
Query: 192 TCRGECYQV--RDLEDAVRELWRRKDMADKDLIA---SLRAAAGLPTEDEIFSISPYSDD 246
C+ + L + + +D L+ + G PT +++ S ++
Sbjct: 390 NCKTNSTRPPQSQLTETIDMALSESQTSDTQLMVKDIEMDPLEGRPTTNDVIS----EEN 445
Query: 247 EENGP-----VVLKNEFGRSLKLSLK--GVVDKSPKKVKEHGK 282
+ P V N G+ K L+ GV+ + KK GK
Sbjct: 446 HKQPPGAKKKVCFSNLRGKGTKFMLQRMGVMSQISKKRSTRGK 488
Score = 92.0 bits (227), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 88/191 (46%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E M++C CG+ YH C + ++ + W+C C +CE C +
Sbjct: 532 ICVMCGSLGIESDAVMITCAQCGQCYHPYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 589
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC +PP + V G + C T C CG N +V +
Sbjct: 590 NDEARLLLCDECDISYHIYCVNPPLETVPQGTWKCSFCTMCQKCGRNPTEKSDNVDSNMS 649
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
C C C VC Y + E ++ C+ C++W H CD ++ + + + D
Sbjct: 650 E--CPPCA----SQTACSVCTNPYANGEM--IIQCEKCEQWSHFLCDSVNAQLTIDY-YD 700
Query: 184 GNLQYRCPTCR 194
N+ Y+C CR
Sbjct: 701 KNI-YKCLKCR 710
>gi|359318839|ref|XP_003432729.2| PREDICTED: histone-lysine N-methyltransferase MLL4, partial [Canis
lupus familiaris]
Length = 2713
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1212 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1270
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1271 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1329
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGN+CP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1330 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1389
Query: 193 CRGECY 198
C G +
Sbjct: 1390 CAGATH 1395
>gi|301771069|ref|XP_002920938.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Ailuropoda melanoleuca]
Length = 2611
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 96/186 (51%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1111 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 1169
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 1170 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 1228
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGN+CP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 1229 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1288
Query: 193 CRGECY 198
C G +
Sbjct: 1289 CAGATH 1294
>gi|344298323|ref|XP_003420843.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Loxodonta africana]
Length = 2200
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 94/186 (50%), Gaps = 8/186 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL A+ H +W C C+ C +C R G K + C RC
Sbjct: 827 LVFCQVCCDPFHPFCLDE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGTKHLLECERCRH 885
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 886 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 944
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 945 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 1004
Query: 193 CRGECY 198
C G +
Sbjct: 1005 CAGATH 1010
>gi|149056302|gb|EDM07733.1| rCG63528 [Rattus norvegicus]
Length = 2270
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 771 LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 829
Query: 78 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 830 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 888
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 889 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 948
Query: 193 CRG 195
C G
Sbjct: 949 CAG 951
>gi|410899461|ref|XP_003963215.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Takifugu
rubripes]
Length = 3715
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 90/195 (46%), Gaps = 19/195 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F +EG ++L+C C + YH C+ + L W+C C +CE
Sbjct: 418 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 470
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
+C + DP++ + C CD +YH YC PP NV G + C C CGSN P G
Sbjct: 471 MCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 528
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L CPVC + + + E ++ C C RWVH C+ + E +
Sbjct: 529 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTEDEV 582
Query: 179 QFQVDGNLQYRCPTC 193
+ D + C C
Sbjct: 583 EQASDEG--FACTYC 595
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 61/115 (53%), Gaps = 9/115 (7%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C CG+ YH CL+ A + W+CP C++C+ CR+ G+ +K + C CD
Sbjct: 98 LLFCTGCGQHYHAACLEIGATP---IQRAGWQCPECKVCQTCRKPGEDSKMLVCDACDKG 154
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS---NVPGNGLSVRWFLGYTCCDAC 130
YH +C P ++ + P+ C + C CG+ +PG S +WF Y C+AC
Sbjct: 155 YHTFCLQPAMDSLPTDPWKCKRCRVCTDCGARGLELPG---STQWFENYAVCEAC 206
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 42/116 (36%), Gaps = 13/116 (11%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 116
C +C G+ + +FC C YH C + + CP+ C +C PG
Sbjct: 86 CAVCDSAGELSDLLFCTGCGQHYHAACLEIGATPIQRAGWQCPECKVCQTC--RKPGEDS 143
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
+ CDAC + Y CL+ DS T C C+ C G+
Sbjct: 144 KM------LVCDACDK-----GYHTFCLQPAMDSLPTDPWKCKRCRVCTDCGARGL 188
>gi|26006129|dbj|BAC41407.1| mKIAA0304 protein [Mus musculus]
Length = 1744
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 94/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 270 LVFCQVCCDPFHPFCLEE-AERPSPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 328
Query: 78 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 329 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 387
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 388 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 447
Query: 193 CRG 195
C G
Sbjct: 448 CAG 450
>gi|20521928|dbj|BAA96030.2| KIAA1506 protein [Homo sapiens]
Length = 3310
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 404 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 456
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 457 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 514
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 515 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 568
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 569 ENVAD--IGFDCSMCR 582
>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
Length = 4823
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 84/177 (47%), Gaps = 17/177 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F +EG ++L+C C + YH C+ + L W+C C +CE
Sbjct: 688 MCVVCGSFGKGSEG-----QLLACAQCAQCYHPYCVNSKITKTKL--RKGWRCLECIVCE 740
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
+C + DP++ + C CD +YH YC PP NV G + C C CGSN P G
Sbjct: 741 MCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTP--GFHC 798
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 175
W YT C C L CPVC + + + E ++ C C RWVH C+ + E
Sbjct: 799 EWQNNYTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTE 849
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 75/154 (48%), Gaps = 13/154 (8%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C CG+ YH CL+ A + W+CP C++C+ CR+ G+ +K + C CD
Sbjct: 227 LLFCTGCGQHYHAACLEIGATP---IQRAGWQCPECKVCQTCRKPGEDSKMLVCDACDKG 283
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS---NVPGNGLSVRWFLGYTCCDACGRLFV 135
YH +C P ++ + P+ C + C CG+ +PG S +WF Y C+AC
Sbjct: 284 YHTFCLQPAMDSLPTDPWKCKRCRVCTDCGARGLELPG---STQWFENYAVCEACQHH-- 338
Query: 136 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
+ C VC K D + C VC R VH C
Sbjct: 339 RNCTCSVCNK--PDGSVATLQSCSVCHRLVHSGC 370
>gi|156230798|gb|AAI51838.1| MLL3 protein [Homo sapiens]
Length = 3314
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 404 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 456
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 457 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 514
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 515 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 568
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 569 ENVAD--IGFDCSMCR 582
>gi|327274410|ref|XP_003221970.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Anolis carolinensis]
Length = 4817
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+LSC CG+ YH C+ + + + H W+C C +CE
Sbjct: 911 MCVVCGSFGKGAEG-----RLLSCSQCGQCYHPYCV-SIKITKVVLH-KGWRCLECTVCE 963
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ P GL
Sbjct: 964 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSP--GLRC 1021
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CP+C YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 1022 EWQNNYTQCAPCASL----STCPICCCNYREEDL--ILQCRQCDRWMHTVCQNLNTEEEV 1075
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + C CR
Sbjct: 1076 ESTADNG--FDCTMCR 1089
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 99/226 (43%), Gaps = 21/226 (9%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C +CG+ YH CL + W+CP C++C+ C+ +G+ NK + C CD Y
Sbjct: 332 LFCTTCGQHYHGMCLDIQV---TALKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGY 388
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P +V + + C C CG+ S +W CD+C K
Sbjct: 389 HTFCLQPVMDSVPTNGWKCKYCRVCAECGTRT-----SSQWHHNCLMCDSCYNQQEKLP- 442
Query: 140 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL-QYRCPTCRG--- 195
CP+C K + C +C+RW+H +CD + ++D +L +Y C C+
Sbjct: 443 CPLCEKTSSPDGQKDRLYCHLCRRWIHIECDRSPNN-----ELDSHLKEYVCSLCKHSIV 497
Query: 196 --ECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFS 239
EC D + +L D D + +A G+ ED++ +
Sbjct: 498 EEECALSCDAMETA-QLLPEPDTGCADEMEIEDSAEGVTNEDQVVT 542
>gi|195028344|ref|XP_001987036.1| GH21693 [Drosophila grimshawi]
gi|193903036|gb|EDW01903.1| GH21693 [Drosophila grimshawi]
Length = 1461
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/210 (31%), Positives = 101/210 (48%), Gaps = 11/210 (5%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 74
+++ C SCG +H C L N R S W C C C+ICR+ + KF+ C +
Sbjct: 227 KLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCARCTKCQICRQQEANDIKFVKCEQ 281
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
C YH C P ++ + C + C CGS PG G S RW YT CD+C +
Sbjct: 282 CQKIYHANCLRPVISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 341
Query: 135 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD--EKYLQFQVDGNLQYRCPT 192
KG CP+C K YR + MV C C ++VH CD +D + + + + + Y CP
Sbjct: 342 NKGFSCPICQKAYRAAAYKEMVKCSWCHKFVHSTCDEEADLTAYHKRKEYNPDYDYVCPV 401
Query: 193 CR-GECYQVRDLEDAVRELWRRKDMADKDL 221
C+ G ++ LE ++ + + + +D+
Sbjct: 402 CKVGSNIKIEPLERSIADPQASEHFSIRDV 431
Score = 92.4 bits (228), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E +++C C + YH C + +R + W+C C +CE C +
Sbjct: 535 ICVMCGSVGVEGDAELITCAQCAQCYHPYC-ASVKHSRGILQ-KGWRCLDCTVCEGCGKK 592
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC +PP + V G + C T C CG N P ++
Sbjct: 593 NDEARLLLCDECDISYHIYCVNPPLEQVPRGNWKCSFCTICQKCGRN-PTEKIN-HSDSN 650
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
C C N C +C K Y D E ++ C+ C++W+H CD I+ + + + D
Sbjct: 651 SPECPPCA----SQNSCSICSKGYSDGEM--IIQCEQCEQWLHFLCDSINSQHTMDY-YD 703
Query: 184 GNLQYRCPTCR 194
N+ Y+C CR
Sbjct: 704 HNM-YKCIKCR 713
>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Equus caballus]
Length = 4910
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 917 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 974
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 975 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1032
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C +S E+ ++ D
Sbjct: 1033 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLSTEEEVENVAD 1086
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1087 --IGFDCSMCR 1095
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 77/153 (50%), Gaps = 10/153 (6%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 319 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 375
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 376 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 428
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 174
C K Y M+ C++C+RWVH +CD +D
Sbjct: 429 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTD 461
>gi|195382495|ref|XP_002049965.1| GJ21880 [Drosophila virilis]
gi|194144762|gb|EDW61158.1| GJ21880 [Drosophila virilis]
Length = 1458
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 97/204 (47%), Gaps = 17/204 (8%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRR--TGDPNKFMFCR 73
+++ C SCG +H C L N R S W C C C+ICR+ T D KF+ C
Sbjct: 220 KLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCARCTKCQICRQHETNDI-KFIKCE 273
Query: 74 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 133
+C YH C P ++ + C + C CGS PG G S RW YT CD+C +
Sbjct: 274 QCQKMYHAMCLRPTISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 333
Query: 134 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD--EKYLQFQVDGNLQYRCP 191
KG CP+C K YR + MV C C ++VH CD +D + + + + + Y CP
Sbjct: 334 RNKGFSCPICQKAYRAAAYKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEYNPDYDYVCP 393
Query: 192 TCR-----GECYQVRDLEDAVREL 210
C+ + V LE A+ ++
Sbjct: 394 ICKTSSTAAQLKAVDPLERAITDM 417
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 91/195 (46%), Gaps = 20/195 (10%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + G E +++C CG+ YH C + +R + W+C C +CE C +
Sbjct: 532 ICVMCGTLGIESDSVLITCAQCGQCYHPYC-ASVKHSRGILQ-KGWRCLDCTVCEGCGKK 589
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN----VPGNGLSVR 119
D + + C CD +YH YC PP + V G + C T C CG N V + S+
Sbjct: 590 NDEARLLLCDECDISYHIYCVKPPLETVPHGNWKCSFCTICQKCGRNPTEKVKNSDASLS 649
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQ 179
C C N CP+C K Y D E ++ C+ C++W H CD I+ + ++
Sbjct: 650 E------CLPCA----SQNSCPLCRKAYSDGEM--IIQCEQCEQWSHFLCDSINAQYTME 697
Query: 180 FQVDGNLQYRCPTCR 194
+ D N+ Y+C CR
Sbjct: 698 Y-YDNNV-YKCMKCR 710
>gi|443723098|gb|ELU11679.1| hypothetical protein CAPTEDRAFT_130729, partial [Capitella teleta]
Length = 625
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 104/208 (50%), Gaps = 27/208 (12%)
Query: 6 FVGENEGCERA----------RRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKC 51
V N C RA +++ C C + +H CL + A N + +W C
Sbjct: 4 LVSSNSLCVRAVCYLCGSGGHNQLIYCSVCCEPFHSFCLDKGERPLADNLE-----NWCC 58
Query: 52 PSCRICEICRRTGDPNKFMFCRRCDAAYH----CYCQHPPHKNVSSGPYLCPKHTKCHSC 107
C+ C +C + N + C +C AYH C ++P + + ++CPK KC SC
Sbjct: 59 RKCQFCRVCGKGS--NNLLQCIQCQDAYHPSQSCLQKYPNKPSKNRRIWVCPKCVKCKSC 116
Query: 108 GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVH 166
G+ PG+ W ++ C++CG L KGN+CPVC K Y D + + M+ C C+ WVH
Sbjct: 117 GATSPGDSSDATWMYDFSLCNSCGLLMSKGNFCPVCHKCYADDDWDSKMMQCSTCESWVH 176
Query: 167 CQCDGISDEKY-LQFQVDGNLQYRCPTC 193
+C+G++DE Y + + ++QY CP C
Sbjct: 177 AKCEGLTDEMYSIMSYLPEDVQYHCPRC 204
>gi|41350061|gb|AAS00364.1| unknown [Homo sapiens]
Length = 2185
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 20 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 73 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 185 ENVAD--IGFDCSMCR 198
>gi|195489371|ref|XP_002092710.1| GE14338 [Drosophila yakuba]
gi|194178811|gb|EDW92422.1| GE14338 [Drosophila yakuba]
Length = 1481
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 122/278 (43%), Gaps = 21/278 (7%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 74
+++ C +CG +H C L N R S W C C C+ICR+ + K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 272 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331
Query: 135 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 192
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391
Query: 193 CRGECYQVRDLEDAVREL-WRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGP 251
C+ + A+ + D + + L SL+ P E + ++ P SD+ P
Sbjct: 392 CKSNSSGPGSSQQAIDSIVLSAMDSSSEQL--SLKEIELDPLEGKP-TMDPSSDELHKLP 448
Query: 252 -----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 282
V L + GRS K L GV+ + KK GK
Sbjct: 449 TGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 486
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 91/204 (44%), Gaps = 14/204 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN-VPGNGLSVRWFL 122
D + + C CD +YH YC +PP + V +G + C T C CG N N L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKNEFGESNML 647
Query: 123 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 182
C +C + CPVC Y + E ++ C+ C+ W H CD ++ + +
Sbjct: 648 E---CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDTVNAQLTID-HY 697
Query: 183 DGNLQYRCPTCRGECYQVRDLEDA 206
D N+ Y+C CR L D+
Sbjct: 698 DNNV-YKCFKCRCSTRSTNSLTDS 720
>gi|195122760|ref|XP_002005879.1| GI18846 [Drosophila mojavensis]
gi|193910947|gb|EDW09814.1| GI18846 [Drosophila mojavensis]
Length = 1465
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 87/293 (29%), Positives = 123/293 (41%), Gaps = 27/293 (9%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 74
+++ C SCG +H C L N R S W C C C+ICR+ + KF+ C +
Sbjct: 223 KLIMCCSCGDHFHSTCIGLANLPDTR-----SGWSCARCTKCQICRQQEANDIKFIKCEQ 277
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
C YH C P ++ + C + C CGS PG G S RW YT CD+C +
Sbjct: 278 CQKIYHATCLRPVISSIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 337
Query: 135 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD--EKYLQFQVDGNLQYRCPT 192
KG CP+C K YR + MV C C ++VH CD +D + + + + Y CP
Sbjct: 338 NKGFSCPICQKAYRAAAYKEMVKCSCCHKFVHSTCDEEADLTAYHKTKEYNPDYDYVCPI 397
Query: 193 CRGECYQVRDLEDAVRELWRRKDMADKDLI----ASLRAAAGLPTEDEIFSISPYSDDEE 248
C+ + ++ R L DM + + G P D + S P
Sbjct: 398 CKSNSAGAKIVDPLERSL---ADMQSSEQFNIKHVEIEPLNGKPCMD-VNSEDPLKLPNT 453
Query: 249 NGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKKYPRKKGYQMPLNS 301
V L N G+ K L + + + GKK N R KG Q+ L S
Sbjct: 454 KKKVCLTNIRGKGGKFVLHRM-----GAISQLGKKRSN----RGKGRQLVLQS 497
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 90/192 (46%), Gaps = 12/192 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + G E +++C CG+ YH C + +R + W+C C +CE C +
Sbjct: 533 ICVMCGTLGIESDAVLITCAQCGQCYHPYC-ASVKHSRGMLQ-KGWRCLDCTVCEGCGKK 590
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC +PP + V G + C T C CG N P ++
Sbjct: 591 NDEARLLLCDECDISYHIYCVNPPLETVPHGNWKCSFCTICQKCGRN-PTEKVNFNEPSA 649
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
C + N C VC K Y +E ++ C+ C++W H CD ++ + +++ D
Sbjct: 650 PECLPCASQ-----NNCFVCKKSY--TEGDMIIQCEQCEQWSHFLCDSVNTQLTMEY-YD 701
Query: 184 GNLQYRCPTCRG 195
N+ Y+C CR
Sbjct: 702 NNV-YKCMKCRA 712
>gi|427798099|gb|JAA64501.1| Putative phagocytosis engulfment, partial [Rhipicephalus
pulchellus]
Length = 926
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)
Query: 6 FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+VG C+ M+S C CG YH CL W+CP C+ C+
Sbjct: 298 YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 356
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
C R GD + + C CD A+H YC P NV + C C CGS PG+G S R
Sbjct: 357 CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 416
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W + Y+ CD+C + KG CP+C K YR S M C VC++++H +CDG +
Sbjct: 417 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 472
Query: 179 QFQVDGNLQYRCPTC 193
DG+ Y CP C
Sbjct: 473 SSPKDGD--YVCPVC 485
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 93/191 (48%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC + + G R+++C CG+ YH C+ + + W+C C +CE C +
Sbjct: 660 LCAMCGSFGRAEEGRLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCEGCGQP 717
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D ++ + C CD +YH YC PP + V G + C C CG+ PGNG +W
Sbjct: 718 HDESRLLLCDECDISYHTYCLSPPLETVPQGNWKCRWCVICVKCGATEPGNG--SQWQNN 775
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C + CP+CL Y+DSE ++ C C+RW+H CD IS E+ +
Sbjct: 776 YTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEE--DAERC 827
Query: 184 GNLQYRCPTCR 194
Y CP CR
Sbjct: 828 AEYGYNCPYCR 838
>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
Length = 4575
Score = 112 bits (279), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 565 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 622
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 623 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 680
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 681 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 734
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 735 --IGFDCSMCR 743
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 65/136 (47%), Gaps = 14/136 (10%)
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
R++G+ +K + C CD YH +C P K+V + + C C CG+ S +W
Sbjct: 1 RQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQW 55
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
CD C + + N CP C K Y M+ C++C+RWVH +CD +D
Sbjct: 56 HHNCLICDNCYQQ--QDNLCPFCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH----- 108
Query: 181 QVDGNL--QYRCPTCR 194
++D L +Y C C+
Sbjct: 109 ELDPQLKEEYICMYCK 124
>gi|432892259|ref|XP_004075732.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oryzias
latipes]
Length = 4536
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 101/206 (49%), Gaps = 20/206 (9%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
LCF+ + G + C+ C + +H CL R L + +W C CR C+ C R
Sbjct: 1676 LCFLCASSG---NVEFVYCRVCCEPFHLFCL--GESERPLQEQFENWCCRLCRFCQACGR 1730
Query: 63 TGDPNK--FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSNVPGN 114
K + C +C +YH C P N + P ++C K +C SCG+ PG
Sbjct: 1731 QHQKAKQQLVECDKCRNSYHPECLGP---NYPTKPTKKKRIWICTKCVRCKSCGATKPGK 1787
Query: 115 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGIS 173
+W ++ C C +LF KGN+CP+C K Y D + M+ C C++WVH +C+ I+
Sbjct: 1788 SWDAQWSHDFSMCHDCAKLFAKGNFCPLCDKSYSDDYYDSKMMECARCKQWVHAKCENIT 1847
Query: 174 DEKY-LQFQVDGNLQYRCPTCRGECY 198
DE + L ++ N+ Y C C EC+
Sbjct: 1848 DEMFELLSKLPENIAYTCMKC-AECH 1872
>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
Length = 4884
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Homologous to ALR protein; AltName: Full=Lysine
N-methyltransferase 2C; Short=KMT2C; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
Length = 4911
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
Length = 4911
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
Length = 4916
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 906 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 963
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 964 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1021
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1022 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1075
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1076 --IGFDCSMCR 1084
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 306 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 362
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 363 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 415
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 416 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 465
>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
Length = 4911
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
Length = 4912
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 959 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1016
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1017 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1074
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1075 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1128
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1129 --IGFDCSMCR 1137
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 468
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 469 FCGKYYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 518
>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
Length = 4499
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 471 MCVVCGSFGQGEEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 523
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 524 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 581
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 582 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 635
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 636 ENVAD--IGFDCSMCR 649
>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
garnettii]
Length = 4945
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 996 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1053
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1054 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1111
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1112 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1165
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1166 --IGFDCSMCR 1174
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 87/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 397 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 453
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ SV+W CD+C + + N CP
Sbjct: 454 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SVQWHHNCLICDSCYQQ--EDNLCP 506
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C+VC+RWVH +CD +D ++D L +Y C C+
Sbjct: 507 FCGKCYHPELQKDMLHCNVCKRWVHLECDKPTDH-----ELDSQLKEEYICMFCK 556
>gi|427791139|gb|JAA61021.1| Putative phagocytosis engulfment, partial [Rhipicephalus
pulchellus]
Length = 741
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 92/195 (47%), Gaps = 14/195 (7%)
Query: 6 FVGENEGCERARRMLS------CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+VG C+ M+S C CG YH CL W+CP C+ C+
Sbjct: 178 YVGSQANCQSCEEMVSVPELLFCTVCGAHYHGFCLDPPVVVTPTSRLG-WQCPDCKTCQG 236
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
C R GD + + C CD A+H YC P NV + C C CGS PG+G S R
Sbjct: 237 CGRAGDDARLLTCDVCDKAFHVYCVKPMVANVPKHGWKCQSCRVCGDCGSRTPGSGPSSR 296
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W + Y+ CD+C + KG CP+C K YR S M C VC++++H +CDG +
Sbjct: 297 WHMNYSVCDSCYQQRNKGVACPLCGKAYRQFSNRADMAQCTVCRKFIHVECDG----QLA 352
Query: 179 QFQVDGNLQYRCPTC 193
DG+ Y CP C
Sbjct: 353 SSPKDGD--YVCPVC 365
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 79/191 (41%), Gaps = 43/191 (22%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC + + G R+++C CG+ YH C+ + + W+C C +CE C
Sbjct: 539 LCAMCGSFGRAEEGRLIACAQCGQCYHPYCVN--VKVTKMILKKGWRCLDCTVCEGC--- 593
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
G PN CR C C CG+ PGNG +W
Sbjct: 594 GQPN--WKCRWC--------------------------VICVKCGATEPGNG--SQWQNN 623
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C + CP+CL Y+DSE ++ C C+RW+H CD IS E+ +
Sbjct: 624 YTQCGPCWSM----TTCPLCLLKYKDSEL--VIQCVQCERWMHGMCDQISSEE--DAERC 675
Query: 184 GNLQYRCPTCR 194
Y CP CR
Sbjct: 676 AEYGYNCPYCR 686
>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
domestica]
Length = 4862
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 978 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1035
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ PG W
Sbjct: 1036 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGP--RCEWQNN 1093
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E ++ D
Sbjct: 1094 YTQCAPCASLSI----CPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEDEVENVAD 1147
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1148 --IGFDCTMCR 1156
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 18/176 (10%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+ +G+ +K + C CD YH
Sbjct: 368 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHT 424
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY-C 140
+C P +V + + C C CG+ S +W CD+C + + N C
Sbjct: 425 FCLQPIMDSVPTNGWKCKNCRICAECGTRT-----SSQWHHNCLVCDSCYQ--PQDNLSC 477
Query: 141 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
P C K Y+ M+ C +C+RW+H +CD +D ++D L +Y C C+
Sbjct: 478 PFCGKCYQPDLQKDMLHCHMCKRWIHIECDKPADT-----ELDSQLKEEYVCMYCK 528
>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ailuropoda melanoleuca]
Length = 4927
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 971 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1028
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1029 SDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1086
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1087 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENIAD 1140
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1141 --IGFDCSMCR 1149
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 89/184 (48%), Gaps = 15/184 (8%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 374 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 430
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 431 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 483
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQV 200
C K Y M+ C +C+RWVH +CD +D + QF+ + Y C C+ ++
Sbjct: 484 FCGKCYHPELQKDMLHCSMCKRWVHLECDKPADHELDSQFKEE----YICMYCKHIAVEM 539
Query: 201 RDLE 204
L+
Sbjct: 540 DPLQ 543
>gi|194885797|ref|XP_001976493.1| GG22900 [Drosophila erecta]
gi|190659680|gb|EDV56893.1| GG22900 [Drosophila erecta]
Length = 1481
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 121/288 (42%), Gaps = 41/288 (14%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 74
+++ C +CG +H C L N R S W C C C+ICR+ + K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 272 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331
Query: 135 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 192
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391
Query: 193 CR------GECYQVRD-----LEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSIS 241
C+ G Q D D++ E K++ L G PT D
Sbjct: 392 CKSNSSGPGSSQQAIDSIVLSAMDSLSEQLSLKEI-------ELDPLEGKPTMD------ 438
Query: 242 PYSDDEENGP-----VVLKNEFGRSLKLSLK--GVVDKSPKKVKEHGK 282
P SD+ P V L + GRS K L GV+ + KK GK
Sbjct: 439 PSSDELHKLPTGKKKVCLTSVRGRSGKFVLHRIGVMSQINKKRSTRGK 486
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 91/210 (43%), Gaps = 26/210 (12%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 116
D + + C CD +YH YC +PP + V +G + C T C CG N N L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 647
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 176
C +C + CPVC Y + E ++ C+ C+ W H CD ++ +
Sbjct: 648 E---------CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDTVNAQL 692
Query: 177 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 206
+ D N+ Y+C CR L DA
Sbjct: 693 TID-HYDNNV-YKCFKCRCSTRSTNSLTDA 720
>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
Length = 4025
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 20 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 73 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 185 ENVAD--IGFDCSMCR 198
>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
leucogenys]
Length = 4856
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 891 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 948
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 949 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1006
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1007 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1060
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1061 --IGFDCSMCR 1069
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 348 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 400
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 401 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 450
>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
jacchus]
Length = 4909
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 953 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1010
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1011 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1068
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1069 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1122
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1123 --IGFDCSMCR 1131
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 468
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C CR
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCR 518
>gi|292621658|ref|XP_002664717.1| PREDICTED: hypothetical protein LOC566825 [Danio rerio]
Length = 3750
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 23/206 (11%)
Query: 1 MCRLCFV-GENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCR 55
+C LC G++E ML C+ C + +HR CL + +N++ +W C C+
Sbjct: 1633 VCLLCASKGQHE-------MLFCQVCCEPFHRFCLDPSERPLEENKE-----NWCCRRCK 1680
Query: 56 ICEIC-RRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVP 112
C +C R+ + + C RC YH C P P N P++C +C SCG P
Sbjct: 1681 FCRVCGRKNKESKPLLECERCQNCYHPACLGPNYPKPNKRKKPWVCMTCIRCRSCGV-TP 1739
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 171
G W C C RLF +GNYC +C K Y D++ + M+ C C WVH +C+G
Sbjct: 1740 GKSWDTEWNHDKGLCPDCTRLFDQGNYCTMCFKCYEDNDYDSQMMQCSTCNHWVHAKCEG 1799
Query: 172 ISDEKY-LQFQVDGNLQYRCPTCRGE 196
++D+ Y + + ++ Y C C E
Sbjct: 1800 LTDDLYEILSSLPESVVYSCQPCLKE 1825
>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 4029
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 20 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 72
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 73 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRC 130
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 131 EWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEV 184
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 185 ENVAD--IGFDCSMCR 198
>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 4032
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 92/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 10 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 62
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 63 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATC--AGLRC 120
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ +
Sbjct: 121 EWQNNYTQCAPCASL----SACPVCFRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEV 174
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 175 ESVAD--IGFDCSMCR 188
>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
mulatta]
Length = 4785
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 875 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 932
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 933 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 990
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 991 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1044
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1045 --IGFDCSMCR 1053
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 275 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 331
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 332 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 384
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 385 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 434
>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan
troglodytes]
Length = 4026
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 20 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 77
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 78 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 135
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 136 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 189
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 190 --IGFDCSMCR 198
>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
Length = 4780
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 827 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 884
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 885 SDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 942
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 943 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENIAD 996
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 997 --IGFDCSMCR 1005
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 89/184 (48%), Gaps = 15/184 (8%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 230 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 286
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 287 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 339
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQV 200
C K Y M+ C +C+RWVH +CD +D + QF+ + Y C C+ ++
Sbjct: 340 FCGKCYHPELQKDMLHCSMCKRWVHLECDKPADHELDSQFKEE----YICMYCKHIAVEM 395
Query: 201 RDLE 204
L+
Sbjct: 396 DPLQ 399
>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
Length = 4884
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 957 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1014
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1015 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1072
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1073 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1126
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1127 --IGFDCSMCR 1135
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 90/184 (48%), Gaps = 15/184 (8%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 359 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 415
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 416 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 468
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD-EKYLQFQVDGNLQYRCPTCRGECYQV 200
C K Y M+ C++C+RWVH +CD +D E QF+ + Y C C+ ++
Sbjct: 469 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDHELESQFKEE----YICMYCKHLAAEM 524
Query: 201 RDLE 204
L+
Sbjct: 525 DPLQ 528
>gi|301606679|ref|XP_002932944.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 1 [Xenopus
(Silurana) tropicalis]
Length = 3855
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 99/214 (46%), Gaps = 31/214 (14%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+CF+ + G + C+ C + +HR CL+ + + +W C C+ C +C R
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433
Query: 64 GDPNK----------------FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKH 101
K + C +C +YH C P N + P ++C K
Sbjct: 1434 QQATKESIGRQNTISDMSLKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKC 1490
Query: 102 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDV 160
+C SCGS PG G +W ++ C C +LF KGN+CP+C K Y D + + M+ C
Sbjct: 1491 VRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGK 1550
Query: 161 CQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 193
C RWVH +C+ ++DE Y + + ++ Y C C
Sbjct: 1551 CDRWVHSKCENLTDEMYEILSNLPESVAYTCINC 1584
>gi|328773887|gb|EGF83924.1| hypothetical protein BATDEDRAFT_84646 [Batrachochytrium
dendrobatidis JAM81]
Length = 828
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 96/207 (46%), Gaps = 34/207 (16%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLF---HWSSWKCPSCRICEICRRTGDPNKFMFCRRC 75
+L+C CG K+H C++ +++ L W+C +C++C +C GD +K +FC C
Sbjct: 585 LLNCTQCGTKHHPRCIE--FEDKVLITKVMTFDWRCSNCKLCTVCNNAGDDDKLLFCDTC 642
Query: 76 DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN--------------VPGNGLSVRWF 121
D YH YC +PP + + G +LC + C SC V LS++
Sbjct: 643 DRGYHMYCLNPPLEVLPEGSWLCSECAVCKSCKKRPEKQEGTEDMWRHVVIPPSLSLQEI 702
Query: 122 ----------LG-YTC--CDACGRLFVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHC 167
LG Y C C C F +CP+C+ VY DS+ MVCCD C RWVH
Sbjct: 703 QIKPPPATSALGTYLCTYCTDCYDHFEADRFCPLCIHVYSEDSDDLAMVCCDECDRWVHV 762
Query: 168 QCDG-ISDEKYLQFQVDGNLQYRCPTC 193
CD ++D+ Y + + C C
Sbjct: 763 GCDPELTDDVYQKLVEQEEPAFTCALC 789
>gi|449510125|ref|XP_004176585.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2-like, partial [Taeniopygia guttata]
Length = 4299
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/215 (29%), Positives = 97/215 (45%), Gaps = 14/215 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 738 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 795
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP + V G + C C CG+ P G W
Sbjct: 796 GKASDPSRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCVQCGAASP--GFHCEW 853
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L V CP C + Y + + ++ C C RW+H CD + E+ ++
Sbjct: 854 QNNYTHCAPCASLVV----CPFCREKYVEDDL--LIQCRHCDRWLHAACDSLFTEEEVEQ 907
Query: 181 QVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKD 215
D + C C + Y V+ + E+ + KD
Sbjct: 908 AADEG--FDCSAC--QPYVVKPVPAPSAEMIKAKD 938
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 53/129 (41%), Gaps = 9/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C +G R + C SCG+ +H CL R S W+CP C++C+ +
Sbjct: 126 CSVC-----DGPGELRDLAFCTSCGQHFHGACLDISLTPR---KRSGWQCPQCKVCQNLQ 177
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
D + + C CD YH C P + + + + C C CG G S +W
Sbjct: 178 PGQD-SAMLVCETCDKGYHTSCTEPAAQGLPTTSWKCKNCWVCSDCGQRPAGPVSSCQWS 236
Query: 122 LGYTCCDAC 130
G C C
Sbjct: 237 PGSEVCGDC 245
>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
Length = 4532
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 530 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 587
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 588 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 645
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 646 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 699
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 700 --IGFDCSMCR 708
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 37/60 (61%), Gaps = 2/60 (3%)
Query: 47 SSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK--HTKC 104
+ W+CP C++C+ C+++G+ +K + C CD YH +C P K+V + + C + H +C
Sbjct: 13 AGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKRWVHLEC 72
>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
porcellus]
Length = 4878
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 910 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 967
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 968 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1025
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1026 YTQCAPCASL----SSCPVCYRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1079
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1080 --IGFDCSMCR 1088
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 315 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 371
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CDAC + + N CP
Sbjct: 372 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHSCLVCDACYQQ--QDNLCP 424
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 425 FCGKCYHPELQKDMLHCNICKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 474
>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
[Oryctolagus cuniculus]
Length = 4865
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 910 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKISKVVLSKGWRCLECTVCEACGKA 967
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 968 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1025
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1026 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1079
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1080 --IGFDCSMCR 1088
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 313 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 369
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 370 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 422
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 423 FCGKWYHPELQKDMLHCNMCKRWVHLECDKPTDN-----ELDSQLKEEYICMYCK 472
>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4930
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 953 MCVVCGSFGQGEEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1010
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1011 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1068
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1069 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1122
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1123 --IGFDCSMCR 1131
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ CR++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHT 414
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCNQQ--QDNLCP 467
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D+ ++D L +Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEEYICMYCK 517
>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4931
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 954 MCVVCGSFGQGEEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1011
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1012 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1069
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1070 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1123
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1124 --IGFDCSMCR 1132
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ CR++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHT 414
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCNQQ--QDNLCP 467
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D+ ++D L +Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEEYICMYCK 517
>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Canis lupus familiaris]
Length = 4874
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 924 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 981
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 982 SDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1039
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1040 YTQCAPCASL----SSCPVCCRNYREDDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1093
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1094 --IGFDCSMCR 1102
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 87/185 (47%), Gaps = 16/185 (8%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+ TG +FC C Y+
Sbjct: 327 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKETGKNTFVLFCFTCSLNYNP 383
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFLGYTCCDACGRLFVKGNYC 140
+C P + V + + T+C +C V G S +W CD+C + + N C
Sbjct: 384 FCVSPLVRIVPTNLF-----TQCRNCRICVECGTRSSSQWHHNCLVCDSCYQQ--QDNLC 436
Query: 141 PVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQ 199
P C K Y M+ C++C+RWVH +CD +D + QF+ + Y C C+ +
Sbjct: 437 PFCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDHELDSQFKEE----YICMYCKHIAAE 492
Query: 200 VRDLE 204
+ L+
Sbjct: 493 MDPLQ 497
>gi|301622725|ref|XP_002940678.1| PREDICTED: hypothetical protein LOC100144721 [Xenopus (Silurana)
tropicalis]
Length = 2771
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 99/203 (48%), Gaps = 18/203 (8%)
Query: 1 MCRLCFVGENEGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSCRICE 58
MC LC R R ++L C+ C + +HR CL+ R L + +W C C+ C
Sbjct: 1233 MCLLC-------ASRGRHKLLYCQVCCEPFHRFCLEE--SERPLPNQEGTWCCRRCKFCN 1283
Query: 59 ICRRTGDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGP-YLCPKHTKCHSCGSNVPGN 114
+C + G K + C C YH C P P K SG + C +C SCG PG
Sbjct: 1284 VCGQKGKAKKPLLECELCQTNYHVNCLGPNYPLKAPRSGKGWTCSACIRCRSCGI-APGK 1342
Query: 115 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGIS 173
+ C C L+ KGN+CP+C++ Y +SE + M+ C C +W+H +C+G+S
Sbjct: 1343 DGDLELTEDSKLCSECSTLYDKGNFCPICIRCYEESEYESKMIQCAKCDKWIHSKCEGLS 1402
Query: 174 DEKY-LQFQVDGNLQYRCPTCRG 195
DE Y L + ++ Y CP C G
Sbjct: 1403 DEGYELLSNLPDSVVYTCPPCLG 1425
>gi|195347259|ref|XP_002040171.1| GM16061 [Drosophila sechellia]
gi|194135520|gb|EDW57036.1| GM16061 [Drosophila sechellia]
Length = 1476
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 121/278 (43%), Gaps = 21/278 (7%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 74
+++ C +CG +H C L N R S W C C C+ICR+ + K++ C +
Sbjct: 211 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 265
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 266 CQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 325
Query: 135 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 192
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 326 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 385
Query: 193 CRGECYQVRDLEDAVREL-WRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSDDEENGP 251
C+ + + + D + + L SL+ P E + S+ P SD+ P
Sbjct: 386 CKSNSSGPGSSQQTIDSIVLSAMDSSSEQL--SLKEIELDPLEGKP-SMDPSSDELHKLP 442
Query: 252 -----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 282
V L + GRS K L GV+ + KK GK
Sbjct: 443 AGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 480
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 92/210 (43%), Gaps = 26/210 (12%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 524 ICVMCGSLGIESDSAMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 581
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 116
D + + C CD +YH YC +PP + V +G + C T C CG N N L
Sbjct: 582 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 641
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 176
C +C + CPVC Y + E+ ++ C+ C+ W H CD ++ +
Sbjct: 642 E---------CPSC----TSQSSCPVCKVSYSNGET--IIQCEHCELWAHFHCDTVNAQL 686
Query: 177 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 206
+ D N+ Y+C CR L DA
Sbjct: 687 TID-HYDNNV-YKCFKCRCSTRSTNSLADA 714
>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
harrisii]
Length = 4951
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 1010 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1067
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ PG W
Sbjct: 1068 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGP--RCEWQNN 1125
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E ++ D
Sbjct: 1126 YTQCAPCASL----STCPVCSRNYREEDL--ILQCRQCDRWMHAVCQNLNTEDEVENVAD 1179
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1180 --IGFDCTMCR 1188
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 86/185 (46%), Gaps = 20/185 (10%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+ +G+ +K + C CD YH
Sbjct: 403 CTTCGQHYHGMCLDIAVT---ALKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHT 459
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P +V + + C C CG+ S +W CD C + CP
Sbjct: 460 FCLQPVMDSVPTNGWKCKNCRICAECGTRT-----SSQWHHNCLVCDNCYQP-QDNTACP 513
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCRGECYQ 199
C K Y+ M+ C +C+RW+H +CD +D ++D L+ Y C C+ Q
Sbjct: 514 FCGKCYQPDFQKDMLHCQMCKRWIHIECDKPADT-----ELDSQLKEDYVCMCCK----Q 564
Query: 200 VRDLE 204
+ DLE
Sbjct: 565 LGDLE 569
>gi|133902336|gb|ABO41859.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4137
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 93/198 (46%), Gaps = 10/198 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RR 62
+CF+ + G + C+ C + +H CL + D W +W C CR C +C R+
Sbjct: 1560 VCFLCASSG---NVEFVFCQVCCEPFHLFCLGEAERPHDE-QWENWCCRRCRFCHVCGRK 1615
Query: 63 TGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
+ + C +C +YH C HP ++C K +C SCG+ PG +
Sbjct: 1616 YQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKAWDAQ 1675
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEK-Y 177
W ++ C C + KGN CP+C K Y D + + M+ C C RWVH +C+ ++D+
Sbjct: 1676 WSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTDDMCE 1735
Query: 178 LQFQVDGNLQYRCPTCRG 195
L + N+ Y C C G
Sbjct: 1736 LMSSLPENVVYTCTNCTG 1753
>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
homolog
Length = 4903
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 952 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1009
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1010 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1067
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1068 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1121
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1122 --IGFDCSMCR 1130
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 194
C K Y M+ C++C+RWVH +CD +D+ ++D L+ Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517
>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
Length = 4904
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 953 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 1010
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 1011 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1068
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1069 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1122
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1123 --IGFDCSMCR 1131
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 194
C K Y M+ C++C+RWVH +CD +D+ ++D L+ Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517
>gi|4336749|gb|AAD17932.1| myeloid/lymphoid leukemia 2 [Homo sapiens]
Length = 1010
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 96/183 (52%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 553 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 611
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V+W Y+ C C +L+
Sbjct: 612 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVQWSGDYSLCPRCTQLY 670
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 671 EKGNYCPICTRCYEDNDYESKMMQCAQCDYWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 730
Query: 193 CRG 195
C G
Sbjct: 731 CAG 733
>gi|432909101|ref|XP_004078112.1| PREDICTED: uncharacterized protein LOC101174945 [Oryzias latipes]
Length = 3692
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 96/195 (49%), Gaps = 10/195 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+CF+ ++G ML C+ C + +HR CL+ A+ + +W C C+ C +C +
Sbjct: 1730 VCFLCASKG---QHEMLHCQVCCEPFHRFCLEP-AERPSEENKENWCCRRCKFCHVCGKK 1785
Query: 64 GDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
K + C RC YH C P P N ++C + +C SCG PG + W
Sbjct: 1786 NQLTKPLLECERCQNCYHASCLGPNYPKLNKKRKAWVCMRCIRCKSCGV-TPGKSWEIDW 1844
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-L 178
C C +L+ +GNYCP+C K Y D++ + M+ C C WVH +C+ ++DE Y +
Sbjct: 1845 NHDKGLCPDCSKLYEQGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDLTDELYEI 1904
Query: 179 QFQVDGNLQYRCPTC 193
+ ++ Y C C
Sbjct: 1905 LSSLPESVVYSCRPC 1919
>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
griseus]
Length = 4871
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 935 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKISKVVLSKGWRCLECTVCEACGKA 992
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 993 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 1050
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CP+C + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 1051 YTQCAPCASL----SSCPICCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1104
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1105 --IGFDCSMCR 1113
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 104/224 (46%), Gaps = 22/224 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 342 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 398
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 399 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SAQWHHNCLICDTCYQQ--QDNLCP 451
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVR 201
C K Y M+ C++C+RWVH +CD +D + L Q+ + Y C C+ ++
Sbjct: 452 FCGKCYNPEFQKDMLYCNMCKRWVHLECDKPTDHE-LDSQIKED--YICMYCKHLGAEID 508
Query: 202 DLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSISPYSD 245
L EL + D ++G+ EDE+ + P ++
Sbjct: 509 SLHPG-NELEMPELPTD--------YSSGMEIEDEVLFLDPTAN 543
>gi|24762433|ref|NP_611847.2| lost PHDs of trr [Drosophila melanogaster]
gi|21626677|gb|AAF47094.2| lost PHDs of trr [Drosophila melanogaster]
gi|85861118|gb|ABC86508.1| HL01030p [Drosophila melanogaster]
Length = 1482
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 120/288 (41%), Gaps = 41/288 (14%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRR 74
+++ C +CG +H C L N R S W C C C+ICR+ + K++ C +
Sbjct: 217 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQDSNDTKYVKCEQ 271
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 272 CQKTYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQR 331
Query: 135 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPT 192
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 332 NKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKEQNPDYDYVCPN 391
Query: 193 CR------GECYQVRD-----LEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIFSIS 241
C+ G Q D D+ E K++ L G PT D
Sbjct: 392 CKSNSSGPGSSQQTIDSIVLSAMDSSSEQLSLKEI-------ELDPLEGKPTMD------ 438
Query: 242 PYSDDEENGP-----VVLKNEFGRSLKLSL--KGVVDKSPKKVKEHGK 282
P SD+ P V L + GRS K L GV+ + KK GK
Sbjct: 439 PSSDELHKLPTGKKKVCLTSVRGRSGKFVLHRMGVMSQINKKRSTRGK 486
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 91/210 (43%), Gaps = 26/210 (12%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-------GNGL 116
D + + C CD +YH YC +PP + V +G + C T C CG N N L
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNML 647
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 176
C +C + CPVC Y + E ++ C+ C+ W H CD ++ +
Sbjct: 648 E---------CPSC----TSQSSCPVCKVSYSNGEM--IIQCEHCELWAHFHCDSVNAQL 692
Query: 177 YLQFQVDGNLQYRCPTCRGECYQVRDLEDA 206
+ D N+ Y+C CR L DA
Sbjct: 693 TID-HYDNNV-YKCFKCRCSTRSTNSLADA 720
>gi|161611540|gb|AAI55711.1| mll protein [Xenopus (Silurana) tropicalis]
Length = 2316
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 99/214 (46%), Gaps = 31/214 (14%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+CF+ + G + C+ C + +HR CL+ + + +W C C+ C +C R
Sbjct: 1378 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEERERPSE-DQIENWCCRHCKFCHVCGRQ 1433
Query: 64 GDPNK----------------FMFCRRCDAAYHCYCQHPPHKNVSSGP------YLCPKH 101
K + C +C +YH C P N + P ++C K
Sbjct: 1434 QQATKESIGRQNTISDMSLKQLLECNKCRNSYHPECLGP---NYPTKPTKKKRVWICTKC 1490
Query: 102 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDV 160
+C SCGS PG G +W ++ C C +LF KGN+CP+C K Y D + + M+ C
Sbjct: 1491 VRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESKMMQCGK 1550
Query: 161 CQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 193
C RWVH +C+ ++DE Y + + ++ Y C C
Sbjct: 1551 CDRWVHSKCENLTDEMYEILSNLPESVAYTCINC 1584
>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Oreochromis
niloticus]
Length = 4907
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 87/190 (45%), Gaps = 12/190 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G ++L+C C + YH C+ + L W+C C +CE+C +
Sbjct: 847 MCVVCGSFGKGVEGQLLACAQCAQCYHPYCVNSKITKTML--RKGWRCLECIVCEVCGKA 904
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP++ + C CD +YH YC PP V G + C C CGSN P G W
Sbjct: 905 SDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSP--GFHCEWQNN 962
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L CPVC + + + E ++ C C RWVH C+ + E ++ D
Sbjct: 963 YTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTEDEVEQASD 1016
Query: 184 GNLQYRCPTC 193
+ C +C
Sbjct: 1017 EG--FACTSC 1024
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 77/154 (50%), Gaps = 13/154 (8%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C CG YH CL+ A + W+CP C++C+ CR+ G+ +K + C CD
Sbjct: 230 LLFCTGCGLHYHAACLEIGATP---IQRAGWQCPECKVCQTCRQPGEDSKMLVCDACDKG 286
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN---VPGNGLSVRWFLGYTCCDACGRLFV 135
YH +C P ++ S P+ C + C CG +PG S +WF Y C+ C
Sbjct: 287 YHTFCLQPAMDSLPSDPWKCRRCRVCMVCGVRGLVLPG---SAQWFDNYAVCEGCQHH-- 341
Query: 136 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
+ + C VC K S + + CC +C RWVH +C
Sbjct: 342 RSSICCVCSKAANPSVA--LQCCSMCHRWVHSEC 373
>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
Length = 3396
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 845 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 902
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 903 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 960
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 961 YTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 1014
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 1015 --IGFDCSMCR 1023
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 290 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 346
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 347 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 399
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 194
C K Y M+ C++C+RWVH +CD +D+ ++D L+ Y C C+
Sbjct: 400 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 449
>gi|402865478|ref|XP_003896948.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Papio
anubis]
Length = 1431
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 695 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 752
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 753 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 810
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR+ + ++ C C RW+H C ++ E+ ++ D
Sbjct: 811 YTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAVCQNLNTEEEVENVAD 864
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 865 --IGFDCSMCR 873
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 95 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 151
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 152 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 204
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 205 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDPQLKEEYICMYCK 254
>gi|195149375|ref|XP_002015633.1| GL11176 [Drosophila persimilis]
gi|194109480|gb|EDW31523.1| GL11176 [Drosophila persimilis]
Length = 1486
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 86/183 (46%), Gaps = 12/183 (6%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 73
+++ C +CG +H C L N R S W C C C+ICR+ D N K++ C
Sbjct: 216 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQ-DSNDLKYVKCE 269
Query: 74 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 133
+C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 270 QCQKIYHASCFRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329
Query: 134 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 191
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEQNPDYDYICP 389
Query: 192 TCR 194
C+
Sbjct: 390 NCK 392
Score = 89.7 bits (221), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 90/202 (44%), Gaps = 34/202 (16%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E M++C CG+ YH C + ++ + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHSYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 587
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 112
D + + C CD +YH YC +PP + V SG + C T C CG SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPSGNWKCSFCTLCQKCGLNPTEKSDYGDSNMP 647
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
C +C + C VC Y E ++ C+ C+ W H CD I
Sbjct: 648 E-------------CPSC----TSQSSCSVCRNPYSTGEM--IIQCETCELWSHFLCDSI 688
Query: 173 SDEKYLQFQVDGNLQYRCPTCR 194
+ + +++ D N+ Y+C CR
Sbjct: 689 NVQLTIEY-YDQNV-YKCLKCR 708
>gi|198456152|ref|XP_001360232.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
gi|198135513|gb|EAL24806.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
Length = 1486
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 86/183 (46%), Gaps = 12/183 (6%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 73
+++ C +CG +H C L N R S W C C C+ICR+ D N K++ C
Sbjct: 216 KLIMCSTCGDHFHSTCIGLANLPDTR-----SGWNCARCTKCQICRQQ-DSNDLKYVKCE 269
Query: 74 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 133
+C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 270 QCQKIYHASCFRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 329
Query: 134 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCP 191
KG CP+C K YR + MV C C ++VH CD +D + + N Y CP
Sbjct: 330 RNKGFSCPICQKAYRAASHKEMVKCSWCHKFVHSTCDEEADLTAYHKKKEQNPDYDYICP 389
Query: 192 TCR 194
C+
Sbjct: 390 NCK 392
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 90/202 (44%), Gaps = 34/202 (16%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E M++C CG+ YH C + ++ + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSVMITCAQCGQCYHSYC-ASVKPSKGILQ-KGWRCLDCTVCEGCGKK 587
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 112
D + + C CD +YH YC +PP + V SG + C T C CG SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPSGNWKCSFCTLCQKCGLNPTEKSDYGDSNMP 647
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
C +C + C VC Y E ++ C+ C+ W H CD I
Sbjct: 648 E-------------CPSC----TSQSSCSVCRNPYSTGEM--IIQCETCELWSHFLCDSI 688
Query: 173 SDEKYLQFQVDGNLQYRCPTCR 194
+ + +++ D N+ Y+C CR
Sbjct: 689 NVQLTIEY-YDQNV-YKCLKCR 708
>gi|194754301|ref|XP_001959434.1| GF12873 [Drosophila ananassae]
gi|190620732|gb|EDV36256.1| GF12873 [Drosophila ananassae]
Length = 1486
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 88/183 (48%), Gaps = 12/183 (6%)
Query: 18 RMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFMFCR 73
+++ C +CG +H C L N R S W C C C+ICR D N K++ C
Sbjct: 217 KLIMCSTCGDHFHSTCVGLANLPDTR-----SGWNCARCTKCQICR-VQDSNDLKYVKCE 270
Query: 74 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 133
+C YH C P + + C + C CGS PG G S RW YT CD+C +
Sbjct: 271 QCQKIYHASCLRPVISAIPKYGWKCNRCRVCTDCGSRTPGGGSSSRWHSHYTICDSCYQQ 330
Query: 134 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD--EKYLQFQVDGNLQYRCP 191
KG CP+C K YR + MV C C ++VH CD +D + + +++ + Y CP
Sbjct: 331 RNKGFSCPICQKAYRAASHKEMVKCSWCNKFVHSTCDEEADLTAYHKKKELNPDYDYVCP 390
Query: 192 TCR 194
C+
Sbjct: 391 NCK 393
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 87/202 (43%), Gaps = 34/202 (16%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G E M++C CG+ YH C +R + W+C C +CE C +
Sbjct: 530 ICVMCGSLGIESDSAMITCAQCGQCYHPYC-AGVKPSRGILQ-KGWRCLDCTVCEGCGKK 587
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG-----------SNVP 112
D + + C CD +YH YC +PP + V +G + C T C CG SN+P
Sbjct: 588 NDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKSEFGDSNMP 647
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
C C + C VC Y + E ++ C+ C+ W H CD +
Sbjct: 648 E-------------CPPCA----SQSACNVCKSAYANGEM--IIQCEHCELWSHFLCDTV 688
Query: 173 SDEKYLQFQVDGNLQYRCPTCR 194
+ + + D N+ Y+C CR
Sbjct: 689 NAQLTID-HYDSNI-YKCLKCR 708
>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
mutus]
Length = 4905
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 91/196 (46%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 898 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 950
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ +G
Sbjct: 951 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SSGPRC 1008
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C S E+ +
Sbjct: 1009 EWQNNYTQCAPCASL----SSCPVCCRNYREEDL--ILQCRQCDRWMHAVCQNFSTEEEV 1062
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 1063 ENVAD--IGFDCSLCR 1076
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 90/184 (48%), Gaps = 15/184 (8%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 305 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 361
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 362 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QENLCP 414
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD-EKYLQFQVDGNLQYRCPTCRGECYQV 200
C K Y M+ C++C+RWVH +CD +D E LQ + + Y C C+ ++
Sbjct: 415 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPADHEPDLQLREE----YICTYCKHLAAEM 470
Query: 201 RDLE 204
L+
Sbjct: 471 GPLQ 474
>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
Length = 4897
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G ++L+C C + YH C+ + L W+C C +CE+C
Sbjct: 828 MCVVCGSFGKGVEGQLLACAQCAQCYHPYCVNSKITKTML--RKGWRCLECIVCEVCGEA 885
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP++ + C CD +YH YC PP V G + C C CGSN P G W
Sbjct: 886 SDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSP--GFHCEWQNN 943
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L CPVC + + + E ++ C C RWVH C+ + E ++ D
Sbjct: 944 YTHCGPCASLVT----CPVCRENFMEEEL--LLQCQYCDRWVHAVCESLYTEDEVEQASD 997
Query: 184 GNLQYRCPTC 193
+ C +C
Sbjct: 998 EG--FACTSC 1005
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 79/159 (49%), Gaps = 13/159 (8%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C CG YH CL A + W+CP C++C+ CR+ G+ +K + C C+
Sbjct: 229 LLFCTGCGLHYHATCLDTGATP---ILRAGWQCPECKVCQTCRQPGEDSKMLVCDSCEKG 285
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN---VPGNGLSVRWFLGYTCCDACGRLFV 135
H +C P +V S + C C CG + +PG + +WF YT C+ C
Sbjct: 286 CHTFCLQPAMDSVPSDRWKCRSCRVCMECGVHGLVLPG---TAQWFESYTLCEGCQHH-- 340
Query: 136 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 174
+ + C VC K D+ S + CC +C RW+H +C +++
Sbjct: 341 RSSICCVCSKP--DNPSVSLQCCSLCHRWMHSECSSLTE 377
>gi|312384476|gb|EFR29199.1| hypothetical protein AND_02074 [Anopheles darlingi]
Length = 2401
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 96/210 (45%), Gaps = 12/210 (5%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
CR C + G ++ C CG YH C+ AQ + + W+C SC+ C+ICR
Sbjct: 628 CRQCSALGDVG-----NLIICSLCGDHYHGTCV-GLAQLPGV--RTGWQCNSCKKCQICR 679
Query: 62 RT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ + C CD YH C P ++ + C C CG+ PG G S RW
Sbjct: 680 VPDSSEGRSVACELCDKIYHASCLRPIMTSIPKFGWKCRCCRVCSDCGARTPGAGASSRW 739
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT CD+C + KG CP+C + YR + MV C VC ++VH CD +D
Sbjct: 740 HSHYTVCDSCYQQRNKGFSCPICHRAYRAAAYREMVKCSVCSKFVHSTCDPDADLTVYNG 799
Query: 181 QVDGN--LQYRCPTCRGECYQVRDLEDAVR 208
+ + N +Y C C+ + R L AVR
Sbjct: 800 RKEANPDYEYLCTPCKAAIHSGR-LVAAVR 828
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 88/196 (44%), Gaps = 19/196 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C ICE
Sbjct: 1018 ICVMCGAIGTDQEGC-----LIACTQCGQCYHPYCTN--VKVTKVILQKGWRCLDCTICE 1070
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + D + + C CD +YH YC PP + V G + C C CG+N P G +
Sbjct: 1071 GCGQRNDEARLILCDDCDISYHIYCMDPPLEQVPQGTWKCKWCAICQKCGTNSP--GFNS 1128
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C CP C Y D E ++ C C+RW+HC CD I +E
Sbjct: 1129 GWMNSYTECGPC----ASQTNCPSCNDGYADGEL--IIQCHQCERWLHCACDQIKNEAEA 1182
Query: 179 QFQVDGNLQYRCPTCR 194
+ Y C CR
Sbjct: 1183 ERCA--EEAYNCLICR 1196
>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4455
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 88/197 (44%), Gaps = 19/197 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ N R + W+C C +CE
Sbjct: 1021 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCV-NIKITRVIL-TKGWRCLECTVCE 1073
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C DP + + C CD +YH YC PP V G + C C CGS P G+
Sbjct: 1074 ACGDASDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKWCVWCVQCGSTSP--GVHS 1131
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W Y+ C C L + CP C + Y +E ++ C C RWVH C G+ E +
Sbjct: 1132 DWQRNYSLCGPCCSL----SRCPACQQAY--AEDDLILQCQQCDRWVHATCQGLCTEDEV 1185
Query: 179 QFQVDGNLQYRCPTCRG 195
+ D + C C+
Sbjct: 1186 EVAADEG--FDCSLCKA 1200
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 45/101 (44%), Gaps = 7/101 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
CRLC + G +L C CG YH +CL L W+CP CR+C C
Sbjct: 464 CRLCAGSGDSGG-----LLMCSCCGSCYHGSCLDPPVTPSPLSRVG-WQCPQCRVCRSCS 517
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHT 102
GD + + C RCD AYH +C PP + + C T
Sbjct: 518 LQGD-SGVLLCARCDKAYHAHCLTPPLDDAPHAAWTCKAET 557
>gi|225380774|gb|ACN88688.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4219
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 92/196 (46%), Gaps = 10/196 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RR 62
+CF+ + G + C+ C + +H CL + D W +W C CR C +C R+
Sbjct: 1627 VCFLCASSG---NVEFVFCQVCCEPFHLFCLGEAERPHDE-QWENWCCRRCRFCHVCGRK 1682
Query: 63 TGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
+ + C +C +YH C HP ++C K +C SCG+ PG +
Sbjct: 1683 YQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKAWDAQ 1742
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEK-Y 177
W ++ C C + KGN CP+C K Y D + + M+ C C RWVH +C+ ++D+
Sbjct: 1743 WSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTDDMCE 1802
Query: 178 LQFQVDGNLQYRCPTC 193
L + N+ Y C C
Sbjct: 1803 LMSSLPENVVYTCTNC 1818
>gi|392350034|ref|XP_003750554.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Rattus
norvegicus]
Length = 3894
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 99/197 (50%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1362 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1416
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1417 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1476
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+G+SDE Y
Sbjct: 1477 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSDEMY 1536
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1537 EILSNLPESVAYTCVNC 1553
>gi|348530100|ref|XP_003452549.1| PREDICTED: hypothetical protein LOC100689867 [Oreochromis niloticus]
Length = 2557
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 102/216 (47%), Gaps = 16/216 (7%)
Query: 19 MLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCRICEIC-RRTGDPNKFMFCR 73
M+ C+ C + +H CL + +N++ +W C C+ C +C RR+ + CR
Sbjct: 1183 MIFCQICCEPFHSFCLSPEERPLEENKE-----NWFCRRCKFCHVCGRRSKSTKPVLQCR 1237
Query: 74 RCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACG 131
RC +YH C P P SS P++C +C SCG PG + W C C
Sbjct: 1238 RCQTSYHPSCLGPTYPKPMNSSVPWVCMTCIRCKSCGVT-PGKTWDLTWNHEQDLCPDCT 1296
Query: 132 RLFVKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYR 189
L KGN+C +C K Y DS + + M+ C C W+H +C+GIS+E + L G + +
Sbjct: 1297 SLHKKGNFCTICHKCYEDSIQPSQMLQCSQCSHWIHYRCEGISEELFGLLTSQPGRVDFT 1356
Query: 190 CPTCRGECYQVRDL-EDAVRELWRRKDMADKDLIAS 224
C C L E+ R L R + DL++S
Sbjct: 1357 CSPCSQHQTSHSILKEELQRRLTARVEEVLTDLLSS 1392
>gi|392341954|ref|XP_003754471.1| PREDICTED: histone-lysine N-methyltransferase MLL [Rattus norvegicus]
Length = 3987
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 99/197 (50%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1455 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1509
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1510 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1569
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+G+SDE Y
Sbjct: 1570 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSDEMY 1629
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1630 EILSNLPESVAYTCVNC 1646
>gi|321469512|gb|EFX80492.1| hypothetical protein DAPPUDRAFT_318677 [Daphnia pulex]
Length = 1953
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 90/188 (47%), Gaps = 17/188 (9%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRR------TG-----DPNK 68
L C +CGK YH +C+ ++W+C C++C CR TG D K
Sbjct: 468 LFCVTCGKHYHGSCV---GLGSSPGVRTAWQCNECKVCITCRTPVAQQGTGAEAVTDRTK 524
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 128
+ C CD YH C P N+ + C C CGS PG+G S RW +T CD
Sbjct: 525 MLVCDTCDKNYHPSCVRPLISNIPKLGWKCKNCRVCGDCGSRTPGSGPSSRWHACFTVCD 584
Query: 129 ACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--L 186
+C + KG CP+C K YR S+ M C C+++VH CD +D +Q + D N
Sbjct: 585 SCYQQRNKGVSCPMCGKAYRHSQRE-MSQCTRCRKYVHSGCDPEADRTLVQRKKDMNSDY 643
Query: 187 QYRCPTCR 194
+Y CP C+
Sbjct: 644 EYLCPPCK 651
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 5 CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 64
C + + G ++ R++SC CG+ YH C + + W+C C +CE C
Sbjct: 856 CAMCGSFGLDQEGRLISCAQCGQCYHPFCAN--VKVTKVILQKGWRCLDCTVCEGCGERH 913
Query: 65 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF--L 122
D + + C CD +YH YC PP V G + C C CGSN P GL+ W
Sbjct: 914 DEARLLLCDECDISYHIYCMEPPLDYVPQGNWKCKWCAVCQVCGSNEP--GLNANWTHQA 971
Query: 123 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 182
+ C C L CP C Y + E ++ C C +W+H CD I +E+ +F
Sbjct: 972 NGSLCGPCASL----RQCPSCSSSYNEGEL--IIQCQQCAQWLHAACDLIRNEREAEFCA 1025
Query: 183 DGNLQYRCPTCR 194
+ Y C CR
Sbjct: 1026 EDG--YTCVLCR 1035
>gi|166796317|gb|AAI59185.1| mll4 protein [Xenopus (Silurana) tropicalis]
Length = 1622
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 99/203 (48%), Gaps = 18/203 (8%)
Query: 1 MCRLCFVGENEGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSCRICE 58
MC LC R R ++L C+ C + +HR CL+ R L + +W C C+ C
Sbjct: 36 MCLLC-------ASRGRHKLLYCQVCCEPFHRFCLEE--SERPLPNQEGTWCCRRCKFCN 86
Query: 59 ICRRTGDPNKFMF-CRRCDAAYHCYCQHP--PHKNVSSGP-YLCPKHTKCHSCGSNVPGN 114
+C + G K + C C YH C P P K SG + C +C SCG PG
Sbjct: 87 VCGQKGKAKKPLLECELCQTNYHVNCLGPNYPLKAPRSGKGWTCSACIRCRSCGI-APGK 145
Query: 115 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGIS 173
+ C C L+ KGN+CP+C++ Y +SE + M+ C C +W+H +C+G+S
Sbjct: 146 DGDLELTEDSKLCSECSTLYDKGNFCPICIRCYEESEYESKMIQCAKCDKWIHSKCEGLS 205
Query: 174 DEKY-LQFQVDGNLQYRCPTCRG 195
DE Y L + ++ Y CP C G
Sbjct: 206 DEGYELLSNLPDSVVYTCPPCLG 228
>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Ovis aries]
Length = 4922
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 90/196 (45%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 923 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 975
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G + C C CG+ G
Sbjct: 976 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGPRG 1033
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + CPVC + YR+ + ++ C C RW+H C S E+ +
Sbjct: 1034 EWQNNYTQCAPCASL----SACPVCHRNYREEDL--ILQCRQCDRWMHAVCQNFSTEEEV 1087
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 1088 ENVAD--IGFDCSLCR 1101
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 78/155 (50%), Gaps = 10/155 (6%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 330 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 386
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 387 FCLQPVMKSVPTNGWRCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QENLCP 439
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 176
C K Y M+ C++C+RWVH +CD +D +
Sbjct: 440 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPADHE 474
>gi|170050214|ref|XP_001859681.1| set domain protein [Culex quinquefasciatus]
gi|167871729|gb|EDS35112.1| set domain protein [Culex quinquefasciatus]
Length = 2934
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 91/197 (46%), Gaps = 21/197 (10%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C ICE
Sbjct: 630 ICVMCGAIGTDQEGC-----LIACTQCGQCYHPYCTN--VKVTKVILQKGWRCLDCTICE 682
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + D + + C CD +YH YC PP + V G + C C CG+N P G +
Sbjct: 683 GCGQRNDEGRLILCDDCDISYHTYCMDPPLEQVPQGNWKCKWCAICLKCGTNDP--GYNC 740
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W Y+ C C CP C + Y D E ++ C+ C+RW+HC CD I E
Sbjct: 741 AWLNNYSECGPCASQV----SCPCCGEGYADGEL--IIQCNQCERWLHCGCDQIKSENEA 794
Query: 179 Q-FQVDGNLQYRCPTCR 194
+ DG Y C CR
Sbjct: 795 ERCAEDG---YNCLLCR 808
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 99/226 (43%), Gaps = 17/226 (7%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
CR C + G ++ C CG YH C+ AQ + S W+C SC+ C+ICR
Sbjct: 291 CRQCSALGDVG-----NLMMCSICGDHYHGTCV-GLAQLPGV--RSGWQCGSCKKCQICR 342
Query: 62 RT-GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ + C +CD YH C P ++ + C C CGS PG G S RW
Sbjct: 343 VPDSSEGRTVGCEQCDKIYHASCLRPIMTSIPKYGWKCRCCRICSDCGSRTPGAGASSRW 402
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
++ CD+C + KG CP+C + YR + MV C C ++VH CD +D
Sbjct: 403 HAHFSVCDSCYQQRNKGFSCPICHRAYRAAAHREMVKCSGCNKFVHSTCDAEADLSVYHA 462
Query: 181 QVDGN--LQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIAS 224
+ + N +Y C C+ + R RR D D +++
Sbjct: 463 KKETNPDYEYLCSPCKTAIHSGR------MAAMRRNSSVDDDSMSA 502
>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
Length = 5261
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 90/196 (45%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F +EG +L+C C + YH C+ + L W+C C +CE
Sbjct: 972 MCVVCGSFGRGSEG-----HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCE 1024
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
+C + DP++ + C CD +YH YC PP V G + C C CG+ P G
Sbjct: 1025 VCGKASDPSRLLLCDDCDISYHTYCLDPPLNTVPKGGWKCKWCVCCVQCGAVSP--GFHC 1082
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L CP+C Y + + ++ C C+RW+H CD + E+ +
Sbjct: 1083 EWQNNYTHCAPCASLVT----CPICQVKYVEEDL--LIQCQHCERWMHAVCDNLFTEEEV 1136
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + C +C+
Sbjct: 1137 EQAADEG--FDCTSCQ 1150
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 48/94 (51%), Gaps = 5/94 (5%)
Query: 5 CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 64
C V + G R +L C SCG YH CL+ R S W+C C++C+ CR +G
Sbjct: 234 CMVCDAPG--ELRDLLFCTSCGLHYHGTCLEITVTPRK---RSGWQCHECKVCQTCRLSG 288
Query: 65 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ ++ + C C+ YH YC P ++V + + C
Sbjct: 289 EDSRMLVCEACEKCYHTYCLKPAIESVPADSWKC 322
>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
(Silurana) tropicalis]
Length = 6019
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 90/196 (45%), Gaps = 19/196 (9%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F +EG +L+C C + YH C+ + L W+C C +CE
Sbjct: 809 MCVVCGSFGRGSEG-----HLLACSQCSQCYHPYCVNSRITKVMLL--KGWRCVECIVCE 861
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
+C + DP++ + C CD +YH YC PP V G + C C CG+ P G
Sbjct: 862 VCGKATDPSRLLLCDDCDISYHTYCLDPPLHTVPKGGWKCRWCVSCMQCGAVTP--GFRS 919
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L CPVC Y + + ++ C C+RW+H C+ + E+ +
Sbjct: 920 EWQNNYTHCAPCASLV----SCPVCHLKYLEGDL--LIQCRHCERWLHAVCENLFTEEEV 973
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + C +C+
Sbjct: 974 EQAADEG--FDCSSCQ 987
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 3/125 (2%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C SCG YH CL+ S W+CP C++C+ CR+ G+ + C CD
Sbjct: 233 LLFCTSCGLHYHGTCLEITVSP---LKRSGWQCPECKVCQTCRQPGEDTMMLVCDACDKG 289
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 138
YH +C P + + + + C C CGS +W+ Y+ C C +
Sbjct: 290 YHTFCLKPAIECLPTDSWKCKTCRVCRICGSRTAHMEPGSQWYDNYSVCSKCQEKRNRAE 349
Query: 139 YCPVC 143
C +C
Sbjct: 350 TCVLC 354
>gi|327280514|ref|XP_003224997.1| PREDICTED: hypothetical protein LOC100556600 [Anolis carolinensis]
Length = 2812
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 91/187 (48%), Gaps = 14/187 (7%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK-FMFCRRCD 76
+++ C+ C +H CL++ Q SW C C+ C +C R +K + C RC
Sbjct: 1246 QLVFCQVCCDPFHVFCLEDDEQPLP-EQEESWCCRRCKFCHVCGRKNKASKQLLECERCR 1304
Query: 77 AAYHCYCQHPPHKNVSSGPY------LCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDAC 130
YH C P N + P+ +C +C SCG+ PG W Y+ C AC
Sbjct: 1305 NCYHLACLGP---NYPTKPFRKRKNWVCSACIRCKSCGT-APGKNWDTEWSNDYSLCSAC 1360
Query: 131 GRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQY 188
L KGNYCP+CL Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y
Sbjct: 1361 SVLHDKGNYCPICLHCYEDNDYESKMMQCAKCDHWVHAKCEGLSDEGYEILSNLPESVVY 1420
Query: 189 RCPTCRG 195
C C G
Sbjct: 1421 ACRPCCG 1427
>gi|66811728|ref|XP_640043.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|60468063|gb|EAL66073.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 795
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 87/185 (47%), Gaps = 17/185 (9%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHW--SSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
+++C SC KKYH CL + D + + WKC C+ CE+C +G K +FC CD
Sbjct: 578 LITCSSCSKKYHAKCLNLHQKCIDKYREDPTQWKCTDCKSCELCDDSGHDEKMLFCDVCD 637
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS-VRWFLGYTCCDACGRLFV 135
YH +C PP G + C C C S V N L+ ++W YTCCD+C F
Sbjct: 638 KGYHTFCLTPPLSQTPEGGWRCNDCAFCIHCYSRVDKNSLNKIKWKENYTCCDSC---FS 694
Query: 136 KG-----NYCPVCLKVYRD--SESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQY 188
KG YCP+C +D E + C C + VH C D+ + + + Y
Sbjct: 695 KGFSEKSKYCPICSHSIKDEGEEEDSITTCQYCHKSVHDHC----DQNIIDNLENEHFIY 750
Query: 189 RCPTC 193
+CP C
Sbjct: 751 KCPNC 755
>gi|224083075|ref|XP_002188579.1| PREDICTED: histone-lysine N-methyltransferase MLL [Taeniopygia
guttata]
Length = 3849
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 99/196 (50%), Gaps = 10/196 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+CF+ + G + C+ C + +H+ CL+ + ++ +W C C+ C +C R
Sbjct: 1297 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEESERPQE-DQLENWCCRRCKFCHVCGRQ 1352
Query: 64 GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVR 119
K + C +C +YH C P + + ++C K +C SCGS PG G +
Sbjct: 1353 HQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQ 1412
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY- 177
W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1413 WSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYE 1472
Query: 178 LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1473 ILSNLPESVAYTCINC 1488
>gi|349603659|gb|AEP99439.1| Histone-lysine N-methyltransferase MLL3-like protein, partial
[Equus caballus]
Length = 452
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 89/191 (46%), Gaps = 12/191 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 62 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 119
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
DP + + C CD +YH YC PP + V G + C C CG+ GL W
Sbjct: 120 TDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNN 177
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
YT C C L + CPVC + YR E ++ C C RW+H C +S E+ ++ D
Sbjct: 178 YTQCAPCASL----SSCPVCYRNYR--EEDLILQCRQCDRWMHAVCQNLSTEEEVENVAD 231
Query: 184 GNLQYRCPTCR 194
+ + C CR
Sbjct: 232 --IGFDCSMCR 240
>gi|417414196|gb|JAA53397.1| Putative histone-lysine n-methyltransferase mll, partial [Desmodus
rotundus]
Length = 3966
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1403 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1457
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1458 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1517
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1518 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1577
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1578 EILSNLPESVAYTCVNC 1594
>gi|326933334|ref|XP_003212761.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Meleagris gallopavo]
Length = 3851
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1310 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1364
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1365 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1424
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1425 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1484
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1485 EILSNLPESVAYTCINC 1501
>gi|431908264|gb|ELK11862.1| Histone-lysine N-methyltransferase HRX [Pteropus alecto]
Length = 3459
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 957 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1011
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1012 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1071
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1072 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1131
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1132 EILSNLPESVAYTCVNC 1148
>gi|363742545|ref|XP_417896.3| PREDICTED: histone-lysine N-methyltransferase MLL [Gallus gallus]
Length = 3871
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1314 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1368
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1369 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1428
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1429 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1488
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1489 EILSNLPESVAYTCINC 1505
>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
Length = 2526
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 91/192 (47%), Gaps = 14/192 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC V + G M+SC +C + YH C+ + W+C C ICE C
Sbjct: 442 LCLVCGSIGKGPEASMVSCANCSQTYHTYCVTLHDKMNSAILGRGWRCLDCTICEGCGNG 501
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFL 122
GD K + C CD +YH YC PP ++V SGP+ C ++C C GN L+ +
Sbjct: 502 GDEEKLLLCDECDVSYHVYCMKPPLESVPSGPWRCHWCSRCRRCNHKATSGNDLTPKGL- 560
Query: 123 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 182
C +C L V CP C + Y+ ++ ++ C +C++W H C+ + E+ L+
Sbjct: 561 ----CHSCASLQV----CPCCNRGYQINDK--IIRCSLCKKWQHGACENLHTEEQLEQAA 610
Query: 183 DGNLQYRCPTCR 194
+ RC +CR
Sbjct: 611 QNRM--RCASCR 620
>gi|432105765|gb|ELK31956.1| Histone-lysine N-methyltransferase MLL, partial [Myotis davidii]
Length = 3463
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +HR CL+ R L +W C C+ C +C R
Sbjct: 914 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 968
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 969 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1028
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1029 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1088
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1089 EILSSLPESVAYTCVNC 1105
>gi|355752689|gb|EHH56809.1| hypothetical protein EGM_06289 [Macaca fascicularis]
Length = 3844
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1309 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1363
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1364 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1423
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1424 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1483
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1484 EILSNLPESVAYTCVNC 1500
>gi|334330381|ref|XP_001380704.2| PREDICTED: histone-lysine N-methyltransferase MLL [Monodelphis
domestica]
Length = 3960
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1421 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1475
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1476 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1535
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1536 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1595
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1596 EILSNLPESVAYTCVNC 1612
>gi|119587787|gb|EAW67383.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_d [Homo sapiens]
Length = 4002
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1466 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1520
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1521 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1580
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1581 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1640
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1641 EILSNLPESVAYTCVNC 1657
>gi|355567103|gb|EHH23482.1| hypothetical protein EGK_06957, partial [Macaca mulatta]
Length = 3824
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1288 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1342
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1343 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1402
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1403 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1462
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1463 EILSNLPESVAYTCVNC 1479
>gi|184394|gb|AAA58669.1| HRX [Homo sapiens]
Length = 3969
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|348573849|ref|XP_003472703.1| PREDICTED: histone-lysine N-methyltransferase MLL-like, partial
[Cavia porcellus]
Length = 2799
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +HR CL+ R L +W C C+ C +C R
Sbjct: 269 VCFLCASSG---HVEFVYCQVCCEPFHRFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 323
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 324 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 383
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 384 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 443
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 444 EILSNLPESVAYTCINC 460
>gi|449267369|gb|EMC78314.1| Histone-lysine N-methyltransferase HRX, partial [Columba livia]
Length = 3786
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1280 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--SERPLEDQLENWCCRRCKFCHVCGR 1334
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1335 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1394
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1395 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1454
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1455 EILSNLPESVAYTCINC 1471
>gi|344293012|ref|XP_003418218.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Loxodonta africana]
Length = 3962
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1429 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1483
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1484 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1543
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1544 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1603
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1604 EILSNLPESVAYTCVNC 1620
>gi|114640631|ref|XP_508792.2| PREDICTED: histone-lysine N-methyltransferase MLL [Pan troglodytes]
Length = 3969
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|56550039|ref|NP_005924.2| histone-lysine N-methyltransferase MLL isoform 2 precursor [Homo
sapiens]
gi|146345435|sp|Q03164.5|MLL1_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=CXXC-type zinc finger protein
7; AltName: Full=Lysine N-methyltransferase 2A;
Short=KMT2A; AltName: Full=Trithorax-like protein;
AltName: Full=Zinc finger protein HRX; Contains: RecName:
Full=MLL cleavage product N320; AltName: Full=N-terminal
cleavage product of 320 kDa; Short=p320; Contains:
RecName: Full=MLL cleavage product C180; AltName:
Full=C-terminal cleavage product of 180 kDa; Short=p180
gi|34305635|gb|AAQ63624.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) [Homo sapiens]
Length = 3969
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|410972021|ref|XP_003992459.1| PREDICTED: histone-lysine N-methyltransferase MLL [Felis catus]
Length = 3554
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1023 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1077
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1078 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1137
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1138 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1197
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1198 EILSNLPESVAYTCVNC 1214
>gi|119587784|gb|EAW67380.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_a [Homo sapiens]
Length = 3969
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|402895434|ref|XP_003910832.1| PREDICTED: histone-lysine N-methyltransferase MLL [Papio anubis]
Length = 3968
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1606
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623
>gi|426244626|ref|XP_004016122.1| PREDICTED: histone-lysine N-methyltransferase MLL [Ovis aries]
Length = 3710
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1178 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1232
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1233 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1292
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1293 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1352
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1353 EILSNLPESVAYTCVNC 1369
>gi|301785015|ref|XP_002927929.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Ailuropoda
melanoleuca]
Length = 3981
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1450 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1504
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1505 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1564
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1565 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1624
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1625 EILSNLPESVAYTCVNC 1641
>gi|332208875|ref|XP_003253537.1| PREDICTED: histone-lysine N-methyltransferase MLL [Nomascus
leucogenys]
Length = 3968
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1606
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623
>gi|345799715|ref|XP_536554.3| PREDICTED: histone-lysine N-methyltransferase MLL [Canis lupus
familiaris]
Length = 3829
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1300 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1354
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1355 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1414
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1415 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1474
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1475 EILSNLPESVAYTCVNC 1491
>gi|403263194|ref|XP_003923935.1| PREDICTED: histone-lysine N-methyltransferase MLL [Saimiri
boliviensis boliviensis]
Length = 3985
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1451 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1505
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1506 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1565
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1566 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1625
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1626 EILSNLPESVAYTCVNC 1642
>gi|256081465|ref|XP_002576990.1| myst-related protein [Schistosoma mansoni]
gi|353229452|emb|CCD75623.1| myst-related protein [Schistosoma mansoni]
Length = 1074
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 120/287 (41%), Gaps = 31/287 (10%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L+C CG+ YH C + R + W+C C +CE C T + + + C C+ +
Sbjct: 784 LLACSQCGQCYHSFCAEVPKITRTMIE-KGWRCLDCTVCEGCGGTSNESLLLLCDDCNIS 842
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 138
+H YC PP K V G + C C +CG P GL+ +W Y+ C C L
Sbjct: 843 FHTYCLDPPLKEVPKGGWKCTDCVICTNCGQKDP--GLNGKWHANYSVCAPCASL----T 896
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECY 198
CP+C YR+ E +V C +C RW H CD + E L+ D L Y C CR
Sbjct: 897 TCPICNLAYREEEL--LVRCALCTRWAHANCDQLRTEDELEIATD--LGYNCLLCR---E 949
Query: 199 QVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIF-----SISPYS--------- 244
D+ ++ + A+ +L A G TE+ S+ PYS
Sbjct: 950 LGADIGTGHAQVLAYRQAANGNLGALENLKFGEITENLFADKLPSSLFPYSSTGVASLAR 1009
Query: 245 ---DDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKK 288
DD+ N + F + LS G+ +K K+ N+K
Sbjct: 1010 SQADDDNNSSSDTRQFFMDGVVLSECGLNTIKQALLKIQPKRHTNQK 1056
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 90/215 (41%), Gaps = 39/215 (18%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C CG YH +CL+ Q W+C C+ C IC + D NK + C CD
Sbjct: 325 LLFCTGCGSHYHASCLEPPLQPSPTIRIG-WQCAECKTCLICNESKDENKMLVCDVCDKG 383
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCG----SNVPG--------NGL--SVRWFLGY 124
YH YC PP ++ + C + C CG S + G N L +VRW Y
Sbjct: 384 YHTYCLKPPVSSIPKNGFRCERCRVCSDCGGGRSSTLSGLEGPVAFNNQLNPNVRWHSNY 443
Query: 125 TCCDACGRLFVKGNY-CPVCLKV-----------YRDSESTPMVC----CDVCQRWVHCQ 168
T CD C + N CPVC + YR+ ST +V C C+R VH +
Sbjct: 444 TLCDRCFHAHKRPNSCCPVCERAWRCSLPVPENFYRNPNSTFLVWPGSRCSQCRRMVHAE 503
Query: 169 CDGISDEKYL------QFQVDG--NLQYRCPTCRG 195
CD S++ + + G Y CP CR
Sbjct: 504 CDPSSNQATVSPLSAASEETSGICGTNYVCPVCRA 538
>gi|395848655|ref|XP_003796965.1| PREDICTED: histone-lysine N-methyltransferase MLL [Otolemur
garnettii]
Length = 4062
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1528 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1582
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1583 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1642
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1643 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1702
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1703 EILSNLPESVAYTCVNC 1719
>gi|397498815|ref|XP_003820170.1| PREDICTED: histone-lysine N-methyltransferase MLL [Pan paniscus]
Length = 4202
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1666 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1720
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1721 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1780
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1781 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1840
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1841 EILSNLPESVAYTCVNC 1857
>gi|124486682|ref|NP_001074518.1| histone-lysine N-methyltransferase MLL [Mus musculus]
Length = 3963
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSDEMY 1606
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1607 EILSNLPESVAYTCVNC 1623
>gi|440904942|gb|ELR55394.1| Histone-lysine N-methyltransferase MLL, partial [Bos grunniens mutus]
Length = 3846
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1314 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1368
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1369 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1428
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1429 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1488
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1489 EILSNLPESVAYTCVNC 1505
>gi|157109809|ref|XP_001650834.1| set domain protein [Aedes aegypti]
gi|108878936|gb|EAT43161.1| AAEL005378-PA [Aedes aegypti]
Length = 1458
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 93/196 (47%), Gaps = 19/196 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C ICE
Sbjct: 756 ICVMCGAIGTDQEGC-----LIACTQCGQCYHPYCTN--VKVTKVILQKGWRCLDCTICE 808
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + D + + C CD +YH YC PP ++V G + C C CGS+ PG+ +
Sbjct: 809 GCGQRNDEGRLILCDDCDISYHIYCMDPPLEHVPQGNWKCKWCAICLKCGSSNPGH--NS 866
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W Y+ C C CPVC + Y + E ++ C+ C+RW+HC CD I E
Sbjct: 867 NWLNNYSECGPCASQV----NCPVCAEGYVEGEL--IIQCNTCERWLHCGCDQIKTENDA 920
Query: 179 QFQVDGNLQYRCPTCR 194
+ + Y C CR
Sbjct: 921 ERCAEEG--YNCTLCR 934
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 96/212 (45%), Gaps = 11/212 (5%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT-GDPNKFMFCRRCDA 77
++ C CG YH C+ AQ + + W+C SC+ C+ICR + + C +CD
Sbjct: 429 LMMCSICGDHYHGKCV-GLAQLPGV--RAGWQCSSCKKCQICRVPDSSEGRTVGCEQCDK 485
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 137
YH C P ++ + C C CGS PG G S RW YT CD+C + KG
Sbjct: 486 IYHASCLRPVMTSIPKYGWKCKCCRVCSDCGSRTPGAGASSRWHAHYTVCDSCYQQRNKG 545
Query: 138 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN--LQYRCPTCRG 195
CP+C + YR + MV C C ++VH CD ++ + + N +Y C C+
Sbjct: 546 FSCPICHRAYRAAAHREMVKCSGCNKFVHSTCDPEAELTVYHAKKENNPDYEYLCNPCKA 605
Query: 196 ECYQVRDLEDAVRELWRRKDMADKDLIASLRA 227
+ R + R + D + AS+ +
Sbjct: 606 SLHTGR-----FSAMRRTSSIDDDSMSASMES 632
>gi|2160396|dbj|BAA03407.1| MLL [Homo sapiens]
Length = 1909
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1607
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1608 EILSNLPESVAYTCVNC 1624
>gi|384494147|gb|EIE84638.1| hypothetical protein RO3G_09348 [Rhizopus delemar RA 99-880]
Length = 690
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 88/207 (42%), Gaps = 20/207 (9%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSS--WKCPSCRICEICRRTGDPNKFMFCRRCD 76
++ C C +KYH C N + + S W CP C++C +CR GD + M C CD
Sbjct: 435 LVKCSRCTRKYHPVC-ANLTTPKQVVGAESYPWLCPECKVCFVCRTAGDESTLMICDGCD 493
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN------------VPGNGLSVRWFLGY 124
+H C P ++ G +LC KCH C P +
Sbjct: 494 RGWHTGCCTPKVDHIPEGEWLCQLCAKCHGCNERGMKDESQYTHVAAPKSDKCKYPVYLA 553
Query: 125 TCCDACGRLFVKGNYCPVCLKVY----RDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQ 179
T CD C F + +CPVCLK Y D E MV CD C WVH +CD ++ E+Y
Sbjct: 554 TYCDKCVIDFKEDRFCPVCLKTYSDEENDEEDNEMVACDTCDHWVHTRCDESLTPERYQM 613
Query: 180 FQVDGNLQYRCPTCRGECYQVRDLEDA 206
D + +Y CP C D E A
Sbjct: 614 LCDDESAKYSCPMCEDRIKSTVDTEAA 640
>gi|350588548|ref|XP_003357368.2| PREDICTED: histone-lysine N-methyltransferase MLL [Sus scrofa]
Length = 2525
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 9/183 (4%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRC 75
+ + C+ C + +H+ CL+ R L +W C C+ C +C R K + C +C
Sbjct: 2 KFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKC 59
Query: 76 DAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 132
+YH C P + + ++C K +C SCGS PG G +W ++ C C +
Sbjct: 60 RNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAK 119
Query: 133 LFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRC 190
LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + + ++ Y C
Sbjct: 120 LFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPESVAYTC 179
Query: 191 PTC 193
C
Sbjct: 180 VNC 182
>gi|440792783|gb|ELR13991.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 506
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 92/197 (46%), Gaps = 16/197 (8%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
CR+C + E E A ++ C C +YH +CL+ +N W+C C+ CE C+
Sbjct: 199 CRMCL--KEESAEGA--LIRCTECKDQYHPDCLELKKENIPKMMSFGWRCMHCKKCETCK 254
Query: 62 RTGDPNKFMFCRRC-DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
TGD K R D +H +C PP K G + C + +C SCG G S RW
Sbjct: 255 DTGDEEKARAARAFHDMGFHTFCLSPPLKRPPIGGWFCRECVECKSCGGKTAGKAKSCRW 314
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVC--CDVCQRWVHCQCDGISDEKY 177
GYT C+ C + + YCPVC VY+D ++ P + C C+ VH CDG
Sbjct: 315 HRGYTMCEMCYKRYKHNKYCPVCTLVYQDRDARNPALLRSCVSCRHCVHAGCDG------ 368
Query: 178 LQFQVDGNLQYRCPTCR 194
F Y+CP CR
Sbjct: 369 -NF-AGVTSPYQCPPCR 383
>gi|119578438|gb|EAW58034.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_a [Homo
sapiens]
Length = 4539
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 378 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 435
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 436 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 493
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 494 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 547
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 548 AADEG--FDCVSCQ 559
>gi|428181743|gb|EKX50606.1| hypothetical protein GUITHDRAFT_60438, partial [Guillardia theta
CCMP2712]
Length = 149
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 85/155 (54%), Gaps = 10/155 (6%)
Query: 19 MLSCKSCGKKYHRNC----LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
L C+ CG +H+ C LK + R++ W+CP+CRICE+C+ + ++ + C
Sbjct: 1 FLFCRDCGDSFHKYCFDLTLKIPPEKRNM-----WRCPACRICEVCKGEENWDEMLCCDE 55
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
CD +H YC PP K + + + C + +C SCGS PG S RW YT C +C +
Sbjct: 56 CDRGFHIYCLRPPLKQIPAEGWRCSECVRCLSCGSKTPGPKGSDRWRKDYTLCSSCWVEY 115
Query: 135 VKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
K NYCP+C KV S+ MV CD CQ WVH C
Sbjct: 116 EKKNYCPIC-KVVTSSKDIKMVNCDSCQMWVHVTC 149
>gi|328776663|ref|XP_394941.4| PREDICTED: hypothetical protein LOC411466 [Apis mellifera]
Length = 4678
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 17/177 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C +CE
Sbjct: 362 ICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCTVCE 414
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C D + + C CD +YH YC PP V G + C C +CGSN P G +
Sbjct: 415 GCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--GFNS 472
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 175
W YT C C C C + Y + + ++ C C+RW+HC CD I E
Sbjct: 473 SWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSIKSE 523
>gi|297482744|ref|XP_002693122.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Bos
taurus]
gi|296480196|tpg|DAA22311.1| TPA: myeloid/lymphoid or mixed-lineage leukemia-like [Bos taurus]
Length = 3821
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1289 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1343
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1344 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1403
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1404 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1463
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1464 EILSNLPESVAYTCVNC 1480
>gi|297458806|ref|XP_585092.4| PREDICTED: histone-lysine N-methyltransferase MLL [Bos taurus]
Length = 3826
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1294 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1348
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1349 QHQAAKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1408
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1409 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 1468
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1469 EILSNLPESVAYTCVNC 1485
>gi|359718904|ref|NP_001028448.3| histone-lysine N-methyltransferase MLL2 [Mus musculus]
Length = 5588
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1332 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1389
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1390 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1447
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CPVC Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1448 QNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDEVEQ 1501
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1502 AADEG--FDCVSCQ 1513
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C EG + +L C SCG YH CL R +SW+CP C++C+ CR
Sbjct: 229 CAVC-----EGPGQLCDLLFCTSCGHHYHGACLDTALTARK---RASWQCPECKVCQSCR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|313471390|sp|Q6PDK2.2|MLL2_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=Lysine N-methyltransferase 2D; Short=KMT2D
Length = 5588
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1332 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1389
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1390 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1447
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CPVC Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1448 QNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDEVEQ 1501
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1502 AADEG--FDCVSCQ 1513
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 63/129 (48%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C EG + +L C SCG YH CL R +SW+CP C++C+ CR
Sbjct: 229 CAVC-----EGPGQLCDLLFCTSCGHHYHGACLDTALTARK---RASWQCPECKVCQSCR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
harrisii]
Length = 5047
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 916 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCEVC 973
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 974 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAVSP--GFHCEW 1031
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP C Y + + ++ C C+RW+H C+ + E+ ++
Sbjct: 1032 QNSYTHCGPCASLVT----CPACRAPYVEEDL--LIQCRHCERWMHAGCESLFTEEEVEQ 1085
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1086 AADEG--FDCASCQ 1097
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 63/131 (48%), Gaps = 8/131 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C +G R +L C SCG+ YH CL R + W+CP C++C+ CR
Sbjct: 228 CVVC-----DGLGELRDLLFCTSCGQHYHGACLDTALTARK---RAGWQCPDCKVCQTCR 279
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ + + C CD YH +C P +++ + C C +CG+ + +W+
Sbjct: 280 QPGEDSMMLVCEACDKGYHTFCLKPAIQSLPPDSWKCKTCRVCRACGACPAELDPNCQWY 339
Query: 122 LGYTCCDACGR 132
Y+ C+ C R
Sbjct: 340 ENYSLCERCQR 350
>gi|62088596|dbj|BAD92745.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) variant [Homo sapiens]
Length = 2880
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 344 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 398
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 399 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 458
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 459 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 518
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 519 EILSNLPESVAYTCVNC 535
>gi|119587786|gb|EAW67382.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_c [Homo sapiens]
Length = 3130
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 594 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 648
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 649 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 708
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 709 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 768
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 769 EILSNLPESVAYTCVNC 785
>gi|431901376|gb|ELK08402.1| Histone-lysine N-methyltransferase MLL2 [Pteropus alecto]
Length = 5640
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1377 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCEVC 1434
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1435 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1492
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1493 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHASCESLFTEDDVEQ 1546
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1547 AADEG--FDCVSCQ 1558
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 248 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 299
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + C C +CG+ + WF
Sbjct: 300 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPPHSWKCKACRVCRACGAGSAELNPNSEWF 359
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 360 ENYSLCHRC 368
>gi|281343718|gb|EFB19302.1| hypothetical protein PANDA_017001 [Ailuropoda melanoleuca]
Length = 4932
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 814 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 871
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 872 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 929
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 930 QNSYTHCGPCASLVT----CPICHAPYMEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 983
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 984 AADEG--FDCVSCQ 995
>gi|392355921|ref|XP_002729900.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Rattus
norvegicus]
Length = 5543
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1332 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1389
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1390 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1447
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1448 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDEVEQ 1501
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1502 AADEG--FDCVSCQ 1513
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 157
Y+ C C + V+G+ V +E P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369
>gi|392341685|ref|XP_001062568.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Rattus norvegicus]
Length = 5543
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1332 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1389
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1390 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1447
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1448 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDEVEQ 1501
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1502 AADEG--FDCVSCQ 1513
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 157
Y+ C C + V+G+ V +E P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369
>gi|351695440|gb|EHA98358.1| Histone-lysine N-methyltransferase MLL3 [Heterocephalus glaber]
Length = 4724
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 337 CTTCGQHYHGMCLDIVVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 393
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 394 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLICDTCYQQ--QDNLCP 446
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 447 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDSQLKEEYICMYCK 496
>gi|256081467|ref|XP_002576991.1| myst-related protein [Schistosoma mansoni]
gi|353229451|emb|CCD75622.1| myst-related protein [Schistosoma mansoni]
Length = 914
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 120/287 (41%), Gaps = 31/287 (10%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L+C CG+ YH C + R + W+C C +CE C T + + + C C+ +
Sbjct: 624 LLACSQCGQCYHSFCAEVPKITRTMIE-KGWRCLDCTVCEGCGGTSNESLLLLCDDCNIS 682
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 138
+H YC PP K V G + C C +CG P GL+ +W Y+ C C L
Sbjct: 683 FHTYCLDPPLKEVPKGGWKCTDCVICTNCGQKDP--GLNGKWHANYSVCAPCASL----T 736
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECY 198
CP+C YR+ E +V C +C RW H CD + E L+ D L Y C CR
Sbjct: 737 TCPICNLAYREEEL--LVRCALCTRWAHANCDQLRTEDELEIATD--LGYNCLLCR---E 789
Query: 199 QVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDEIF-----SISPYS--------- 244
D+ ++ + A+ +L A G TE+ S+ PYS
Sbjct: 790 LGADIGTGHAQVLAYRQAANGNLGALENLKFGEITENLFADKLPSSLFPYSSTGVASLAR 849
Query: 245 ---DDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVKEHGKKWLNKK 288
DD+ N + F + LS G+ +K K+ N+K
Sbjct: 850 SQADDDNNSSSDTRQFFMDGVVLSECGLNTIKQALLKIQPKRHTNQK 896
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 90/215 (41%), Gaps = 39/215 (18%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C CG YH +CL+ Q W+C C+ C IC + D NK + C CD
Sbjct: 165 LLFCTGCGSHYHASCLEPPLQPSPTIRIG-WQCAECKTCLICNESKDENKMLVCDVCDKG 223
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCG----SNVPG--------NGL--SVRWFLGY 124
YH YC PP ++ + C + C CG S + G N L +VRW Y
Sbjct: 224 YHTYCLKPPVSSIPKNGFRCERCRVCSDCGGGRSSTLSGLEGPVAFNNQLNPNVRWHSNY 283
Query: 125 TCCDACGRLFVKGN-YCPVCLKV-----------YRDSESTPMVC----CDVCQRWVHCQ 168
T CD C + N CPVC + YR+ ST +V C C+R VH +
Sbjct: 284 TLCDRCFHAHKRPNSCCPVCERAWRCSLPVPENFYRNPNSTFLVWPGSRCSQCRRMVHAE 343
Query: 169 CDGISDEKYL------QFQVDG--NLQYRCPTCRG 195
CD S++ + + G Y CP CR
Sbjct: 344 CDPSSNQATVSPLSAASEETSGICGTNYVCPVCRA 378
>gi|242005679|ref|XP_002423690.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
gi|212506866|gb|EEB10952.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
Length = 3311
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 99/203 (48%), Gaps = 30/203 (14%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS-SWKCPSCRICEICRR 62
LCF+ + G E+ ++ C SC + YH C+ W W CP C +C C +
Sbjct: 922 LCFLCGSSGQEK---LIHCASCCEPYHEFCIDEAQLKLQNNTWKFDWVCPRCTVCFTCGK 978
Query: 63 TGDPNKFMFCRRCDAAYHCYC--------QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 114
T + C +CD +YH C H P + P++C +C SC N
Sbjct: 979 TSGQQ--LKCVKCDNSYHIECVDRVGGRLLHSPDR-----PWVCSICLRCKSC------N 1025
Query: 115 GLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGI 172
G+ V F+G C AC L KGN+CP+C + Y D + + M+ C C+ WVH +C+G+
Sbjct: 1026 GVDVSVFVGNLPLCRACFVLRQKGNFCPLCQRCYNDDDYDSKMMECGQCKCWVHAKCEGL 1085
Query: 173 SDEKY--LQFQVDGNLQYRCPTC 193
SDEKY L F + +++Y C C
Sbjct: 1086 SDEKYQVLSF-LPESVEYVCRMC 1107
>gi|345490044|ref|XP_001603865.2| PREDICTED: LOW QUALITY PROTEIN: hypothetical protein LOC100120205
[Nasonia vitripennis]
Length = 5138
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 81/174 (46%), Gaps = 17/174 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C +CE
Sbjct: 475 ICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCTVCE 527
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C D + + C CD +YH YC PP + V G + C +C +CG+N P G +
Sbjct: 528 GCGERNDEGRLILCDDCDISYHIYCTDPPLECVPQGTWKCKWCAQCQTCGANDP--GFNS 585
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
W YT C C C C + Y E ++ C C+RW+HC CD I
Sbjct: 586 NWQKNYTQCGPCS----SHTACAACNESY--GEGDLIIQCVQCERWLHCMCDAI 633
>gi|344254289|gb|EGW10393.1| Histone-lysine N-methyltransferase MLL2 [Cricetulus griseus]
Length = 4002
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1746 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1803
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1804 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1861
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CPVC Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1862 QNSYTHCGPCASLVT----CPVCHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDEVEQ 1915
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1916 AADEG--FDCVSCQ 1927
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C+ C+ CR
Sbjct: 642 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTAR---KRAGWQCPECKECQACR 693
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 694 KPGNDSKMLVCETCDKGYHTFCLKPPIEELPAHSWKCMTCRVCRACGVGSAELNPNSEWF 753
Query: 122 LGYTCCDAC-----GRLF--VKGNYCPVC 143
Y+ C C G+ F V G PVC
Sbjct: 754 ENYSLCHRCHKAQGGQPFISVAGQRLPVC 782
>gi|86129850|gb|ABC86577.1| myeloid/lymphoid or mixed-lineage leukemia protein [Danio rerio]
Length = 1154
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 92/196 (46%), Gaps = 10/196 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RR 62
+CF+ + G + C+ C + +H CL + D W +W C CR C +C R+
Sbjct: 143 VCFLCASSG---NVEFVFCQVCCEPFHLFCLGEAERPHDE-QWENWCCRRCRFCHVCGRK 198
Query: 63 TGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
+ + C +C +YH C HP ++C K +C SCG+ PG +
Sbjct: 199 YQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKAWDAQ 258
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEK-Y 177
W ++ C C + KGN CP+C K Y D + + M+ C C RWVH +C+ ++D+
Sbjct: 259 WSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTDDMCE 318
Query: 178 LQFQVDGNLQYRCPTC 193
L + N+ Y C C
Sbjct: 319 LMSSLPENVVYTCTNC 334
>gi|327288610|ref|XP_003229019.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Anolis carolinensis]
Length = 3817
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 99/196 (50%), Gaps = 10/196 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+CF+ + G + C+ C + +H+ CL++ + + +W C C+ C +C R
Sbjct: 1336 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEDGERPLE-DQLENWCCRRCKFCHVCGRQ 1391
Query: 64 GDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVR 119
K + C +C +YH C P + + ++C K +C SCG+ PG G +
Sbjct: 1392 HQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGATTPGKGWDAQ 1451
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY- 177
W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 1452 WSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYE 1511
Query: 178 LQFQVDGNLQYRCPTC 193
L + ++ Y C C
Sbjct: 1512 LLSNLPESVAYTCINC 1527
>gi|395744200|ref|XP_002823221.2| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 3 [Pongo
abelii]
Length = 5293
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1101 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1158
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1159 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1216
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1217 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1270
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1271 AADEG--FDCVSCQ 1282
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVC 143
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPISSVAEQHTPVC 369
>gi|397510996|ref|XP_003846168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Pan paniscus]
Length = 5373
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1208 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCEVC 1265
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1266 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1323
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1324 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1377
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1378 AADEG--FDCVSCQ 1389
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVC 143
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPIRSVAEQHTPVC 369
>gi|2358287|gb|AAC51735.1| ALR [Homo sapiens]
Length = 4957
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 79/175 (45%), Gaps = 10/175 (5%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 796 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 853
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 854 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 911
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 175
YT C C L CP+C Y + + ++ C C+RW+H C+ + E
Sbjct: 912 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTE 960
>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
sapiens]
Length = 5265
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1104 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1161
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1162 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1219
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1220 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1273
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1274 AADEG--FDCVSCQ 1285
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC-----GRLF--VKGNYCPVC 143
Y+ C C G+ V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369
>gi|395841650|ref|XP_003793647.1| PREDICTED: uncharacterized protein LOC100944849 [Otolemur garnettii]
Length = 5488
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1337 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1394
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1395 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1452
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1453 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1506
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1507 AADEG--FDCVSCQ 1518
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 62/129 (48%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CTVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG++ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRICRTCGASSAELNPNSEWF 340
Query: 122 LGYTCCDAC 130
++ C C
Sbjct: 341 ENFSLCHRC 349
>gi|297682047|ref|XP_002818745.1| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Pongo
abelii]
Length = 1215
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 83/177 (46%), Gaps = 12/177 (6%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
R+L+C C + YH C+ + + W+C C +CE C + DP + + C CD
Sbjct: 635 RLLACSQCCQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDI 692
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 137
+YH YC PP + V G + C C CG+ GL W YT C C L
Sbjct: 693 SYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL---- 746
Query: 138 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
+ CPVC + YR E ++ C C RW+H C ++ E+ ++ D + + C CR
Sbjct: 747 SSCPVCYRNYR--EEDLILQCRQCDRWMHAVCQNLNTEEEVENVAD--IGFDCSMCR 799
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 21 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 77
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 78 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDSCYQQ--QDNLCP 130
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 131 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTD-----HELDTQLKEEYICMYCK 180
>gi|426372409|ref|XP_004053116.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 2 [Gorilla
gorilla gorilla]
Length = 5284
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1101 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1158
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1159 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1216
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1217 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1270
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1271 AADEG--FDCVSCQ 1282
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVC 143
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPIRSVAEQHTPVC 369
>gi|403297007|ref|XP_003939383.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Saimiri
boliviensis boliviensis]
Length = 5498
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1394 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCEVC 1451
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1452 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1509
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1510 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1563
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1564 AADEG--FDCVSCQ 1575
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVC 143
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHHCHKAQGGQPLSSVAEQHTPVC 369
>gi|301783643|ref|XP_002927255.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Ailuropoda
melanoleuca]
Length = 5483
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1376 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1433
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1434 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1491
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1492 QNSYTHCGPCASLVT----CPICHAPYMEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1545
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1546 AADEG--FDCVSCQ 1557
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC-----GRLF--VKGNYCPVC 143
Y+ C C G+L V PVC
Sbjct: 341 ENYSLCHRCHKAQGGQLVSSVAEQQPPVC 369
>gi|355564192|gb|EHH20692.1| hypothetical protein EGK_03605 [Macaca mulatta]
Length = 5538
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1375 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCEVC 1432
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1433 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1490
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1491 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1544
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1545 AADEG--FDCISCQ 1556
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 77/175 (44%), Gaps = 18/175 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHCQ 168
Y+ C C R V + PVC + +S TP D +V CQ
Sbjct: 341 ENYSLCHRCHRAQGGQPVSSVAEQHTPVCSRFSPPESGDTPTDEPDAL--YVACQ 393
>gi|350583914|ref|XP_003481621.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Sus scrofa]
Length = 5154
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 955 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1012
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1013 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1070
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1071 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1124
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1125 AADEG--FDCVSCQ 1136
>gi|402885854|ref|XP_003919662.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Papio anubis]
Length = 5547
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1349 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCEVC 1406
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1407 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1464
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1465 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1518
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1519 XADEG--FDCISCQ 1530
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 62/131 (47%), Gaps = 8/131 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDACGR 132
Y+ C C R
Sbjct: 341 ENYSLCHRCHR 351
>gi|426372407|ref|XP_004053115.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Gorilla
gorilla gorilla]
Length = 5550
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1367 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1424
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1425 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1482
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1483 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1536
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1537 AADEG--FDCVSCQ 1548
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVC 143
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPIRSVAEQHTPVC 369
>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=ALL1-related protein; AltName: Full=Lysine
N-methyltransferase 2D; Short=KMT2D; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 2
gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
sapiens]
Length = 5537
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1376 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1433
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1434 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1491
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1492 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1545
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1546 AADEG--FDCVSCQ 1557
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC-----GRLF--VKGNYCPVC 143
Y+ C C G+ V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369
>gi|297691727|ref|XP_002823219.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Pongo
abelii]
Length = 5559
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1367 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1424
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1425 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1482
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1483 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1536
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1537 AADEG--FDCVSCQ 1548
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVC 143
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPISSVAEQHTPVC 369
>gi|348580193|ref|XP_003475863.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Cavia
porcellus]
Length = 5577
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1376 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCEVC 1433
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1434 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1491
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1492 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1545
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1546 AADEG--FDCVSCQ 1557
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 59/129 (45%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C M C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELC----NMFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRVCGAGSSELNPNSEWF 340
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|426226681|ref|XP_004007467.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Ovis aries]
Length = 5387
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1433 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1490
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1491 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1548
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1549 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1602
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1603 AADEG--FDCVSCQ 1614
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|410911878|ref|XP_003969417.1| PREDICTED: uncharacterized protein LOC101064190 [Takifugu rubripes]
Length = 2720
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 85/181 (46%), Gaps = 8/181 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCD 76
M+ C+ C + +H CL + R L + +W C C+ C +C RR+ + + CRRC
Sbjct: 996 MIFCQICCEPFHSFCLS--PEERPLKDNKENWCCRRCKFCHVCGRRSKNTKPVLQCRRCQ 1053
Query: 77 AAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
+YH C P P P++C +C SCG PG + W C C L
Sbjct: 1054 TSYHPACLGPTYPKPMNCKIPWVCMTCIRCKSCGV-TPGKSWDLAWNHDEDLCPDCTLLH 1112
Query: 135 VKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 193
KGN+C +C K Y D+ + T M+ C C W+H C+GISDE Y + C C
Sbjct: 1113 NKGNFCTICHKCYDDNMQHTEMIQCSACNHWIHYSCEGISDELYGLVSNQREDSFTCQPC 1172
Query: 194 R 194
R
Sbjct: 1173 R 1173
>gi|194666944|ref|XP_583302.4| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|297474553|ref|XP_002687353.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|296487853|tpg|DAA29966.1| TPA: myeloid/lymphoid or mixed-lineage leukemia 2-like [Bos taurus]
Length = 5503
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1339 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1396
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1397 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1454
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1455 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1508
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1509 AADEG--FDCVSCQ 1520
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|332206905|ref|XP_003252537.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Nomascus leucogenys]
Length = 5407
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1388 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1445
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1446 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1503
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1504 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1557
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1558 AADEG--FDCVSCQ 1569
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVC 143
Y+ C C + V + PVC
Sbjct: 341 ENYSVCHRCHKAQGGQPVSSVAEQHTPVC 369
>gi|297262270|ref|XP_001099471.2| PREDICTED: histone-lysine N-methyltransferase MLL2 [Macaca mulatta]
Length = 5505
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1323 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1380
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1381 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1438
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1439 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1492
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1493 AADEG--FDCISCQ 1504
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 77/175 (44%), Gaps = 18/175 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHCQ 168
Y+ C C R V + PVC + +S TP D +V CQ
Sbjct: 341 ENYSLCHRCHRAQGGQPVSSVAEQHTPVCSRFSPPESGDTPTDEPDAL--YVACQ 393
>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
Length = 5262
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 79/175 (45%), Gaps = 10/175 (5%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1101 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1158
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1159 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1216
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDE 175
YT C C L CP+C Y + + ++ C C+RW+H C+ + E
Sbjct: 1217 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTE 1265
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC-----GRLF--VKGNYCPVC 143
Y+ C C G+ V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQTIRSVAEQHTPVC 369
>gi|345792161|ref|XP_543684.3| PREDICTED: histone-lysine N-methyltransferase MLL2 [Canis lupus
familiaris]
Length = 5552
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1358 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1415
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1416 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1473
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1474 QNSYTHCGPCASLVT----CPICHTPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1527
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1528 AADEG--FDCVSCQ 1539
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 69/149 (46%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC-----GRLF--VKGNYCPVC 143
Y+ C C G+L V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQLIGSVAEQHPPVC 369
>gi|390467630|ref|XP_002807137.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Callithrix jacchus]
Length = 5289
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1361 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLL--KGWRCVECIVCEVC 1418
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1419 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1476
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1477 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1530
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1531 AADEG--FDCVSCQ 1542
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 15/149 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNSNSEWF 340
Query: 122 LGYTCCDACGRL-------FVKGNYCPVC 143
Y+ C C + V + PVC
Sbjct: 341 ENYSLCHRCHKAQGGQPLSSVAEQHTPVC 369
>gi|410964289|ref|XP_003988688.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Felis catus]
Length = 5559
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1361 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1418
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1419 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1476
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1477 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1530
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1531 AADEG--FDCVSCQ 1542
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWF 340
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 341 ENYSLCHRC 349
>gi|149041498|gb|EDL95339.1| myeloid/lymphoid or mixed-lineage leukemia (mapped) [Rattus
norvegicus]
Length = 3725
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 98/200 (49%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1190 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1244
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1245 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1304
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+G+S +
Sbjct: 1305 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSGTED 1364
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1365 EMYEILSNLPESVAYTCVNC 1384
>gi|241687917|ref|XP_002401627.1| mll protein, putative [Ixodes scapularis]
gi|215504524|gb|EEC14018.1| mll protein, putative [Ixodes scapularis]
Length = 1259
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 93/196 (47%), Gaps = 15/196 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+CF+ + G E +L C C + YH CL + L SW CP C+ C C
Sbjct: 806 VCFLCASAGEEE---LLFCTVCCEPYHWFCLDPEEAPQGLDK-ESWCCPRCQTCIACGHR 861
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRW 120
++ + C +C YH C P + + S +LC K +C SCG++ + W
Sbjct: 862 SSVSQLLRCSKCQQTYHTDCLGPGYPSKPSRKKKIWLCVKCIRCKSCGTSSKQSA----W 917
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-- 177
+ C C L KGNYCP+C K Y D + + MV C CQ+W+H +CDGIS+E Y
Sbjct: 918 NFDLSLCQDCMLLREKGNYCPLCEKCYEDDDYESMMVQCSQCQKWIHARCDGISEELYQV 977
Query: 178 LQFQVDGNLQYRCPTC 193
L + L Y C C
Sbjct: 978 LSLLPETEL-YLCRIC 992
>gi|383848022|ref|XP_003699651.1| PREDICTED: uncharacterized protein LOC100881339 [Megachile
rotundata]
Length = 4805
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 80/174 (45%), Gaps = 17/174 (9%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C +G + EGC +++C CG+ YH C + + W+C C +CE
Sbjct: 475 ICVMCGAIGTDQEGC-----LIACAQCGQCYHPYCAN--VKVTKVILQKGWRCLDCTVCE 527
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C D + + C CD +YH YC PP V G + C C +CGSN P G +
Sbjct: 528 GCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDP--GFNS 585
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
W YT C C C C + Y + + ++ C C+RW+HC CD I
Sbjct: 586 SWQKNYTQCGPCA----SHTACISCQEAYNEGDL--IIQCIQCERWLHCACDSI 633
>gi|322792929|gb|EFZ16759.1| hypothetical protein SINV_09310 [Solenopsis invicta]
Length = 549
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 72/222 (32%), Positives = 102/222 (45%), Gaps = 37/222 (16%)
Query: 19 MLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF------- 69
++ C CG+ YH +C L R + W+C SCR+C++CR+ D +K
Sbjct: 223 LVMCSVCGQHYHGSCVGLALLPGVR-----AGWQCVSCRVCQVCRQPEDVSKINVLQNIK 277
Query: 70 ----------MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
M C RCD AYH C P ++ + C C CGS PG GLS R
Sbjct: 278 NIHSKSYTHVMLCERCDKAYHPGCLRPIVTSIPKYGWKCKCCRVCTDCGSRTPGAGLSSR 337
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQ 179
W YT CD+C + KG CP+C K YR + MV C C+++VH CD +D Q
Sbjct: 338 WHSHYTVCDSCYQQRNKGFSCPLCRKAYRAAAYREMVQCHGCKKFVHGTCDPEADPLTYQ 397
Query: 180 FQVDG--NLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADK 219
+ D + +Y C C+ ++ + RRKD D+
Sbjct: 398 QRKDAKPDYEYVCLHCK-----------SIAMVARRKDSIDE 428
>gi|160333334|ref|NP_001103749.1| histone-lysine N-methyltransferase MLL [Danio rerio]
gi|158714185|gb|ABW79914.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4218
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 91/196 (46%), Gaps = 10/196 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RR 62
+CF+ + G + C+ + +H CL + D W +W C CR C +C R+
Sbjct: 1627 VCFLCASSG---NVEFVFCQVRCEPFHLFCLGEAERPHDE-QWENWCCRRCRFCHVCGRK 1682
Query: 63 TGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
+ + C +C +YH C HP ++C K +C SCG+ PG +
Sbjct: 1683 YQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCTKCVRCKSCGATKPGKAWDAQ 1742
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEK-Y 177
W ++ C C + KGN CP+C K Y D + + M+ C C RWVH +C+ ++D+
Sbjct: 1743 WSHDFSLCHDCAKRLTKGNLCPLCNKGYDDDDCDSKMMKCKKCDRWVHAKCESLTDDMCE 1802
Query: 178 LQFQVDGNLQYRCPTC 193
L + N+ Y C C
Sbjct: 1803 LMSSLPENVVYTCTNC 1818
>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
Length = 4785
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 87/174 (50%), Gaps = 15/174 (8%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD+C + + N CP
Sbjct: 348 FCLQPVMKSVPTNGWKCKNCRICVECGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 400
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCR 194
C K Y M+ C++C+RW+H +CD +D++ QF+ + Y C C+
Sbjct: 401 FCGKCYNPELQKDMLHCNMCKRWIHLECDKPTDQELDSQFREE----YICTYCK 450
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 28/50 (56%)
Query: 49 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
W+C C +CE C + DP + + C CD +YH YC PP + V G + C
Sbjct: 822 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKC 871
>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3, partial [Pan paniscus]
Length = 4810
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 275 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 331
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 332 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 384
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 385 FCGKYYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 434
Score = 99.0 bits (245), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 71/146 (48%), Gaps = 10/146 (6%)
Query: 49 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 108
W+C C +CE C + DP + + C CD +YH YC PP + V G + C C CG
Sbjct: 849 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLXPPLQTVPKGGWKCKWCVWCRHCG 908
Query: 109 SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 168
+ GL W YT C C L + CPVC + YR+ + ++ C C RW+H
Sbjct: 909 AT--SAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREEDL--ILQCRQCDRWMHAV 960
Query: 169 CDGISDEKYLQFQVDGNLQYRCPTCR 194
C ++ E+ ++ D + + C CR
Sbjct: 961 CQNLNTEEEVENVAD--IGFDCSMCR 984
>gi|449672214|ref|XP_002156610.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Hydra
magnipapillata]
Length = 686
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/187 (31%), Positives = 89/187 (47%), Gaps = 13/187 (6%)
Query: 13 CERARRM---LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
C+++ M L C SCG+ +H C+ + W+C C++C+ C++ GD K
Sbjct: 246 CQKSDNMQSQLFCTSCGRHFHSYCVDMNIPITPVVRMG-WQCSFCKVCQGCKQPGDEEKM 304
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDA 129
+ C +CD YH YC +PP V + C KC CGS+ PG+G S RW ++ CD
Sbjct: 305 LCCDQCDKGYHIYCLNPPISVVPKSVWKCVSCRKCSDCGSSKPGSGPSCRWHNNFSLCDR 364
Query: 130 CGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYR 189
C + KG CP+C + R + + C C + VH +C +D +Y
Sbjct: 365 CYQQRKKGQSCPICKRAVRLFNNGDAIQCKKCFKCVHGECHS---------PLDDGAEYI 415
Query: 190 CPTCRGE 196
CP C E
Sbjct: 416 CPDCIEE 422
>gi|432114496|gb|ELK36344.1| Histone-lysine N-methyltransferase MLL2 [Myotis davidii]
Length = 3462
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 1363 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1420
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1421 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1478
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1479 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1532
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1533 AADEG--FDCVSCQ 1544
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 61/129 (47%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 261 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTAR---KRAGWQCPECKVCQACR 312
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C SCG+ + WF
Sbjct: 313 KPGNDSKMLVCETCDKGYHTFCLKPPMEELPAHSWKCKACRVCRSCGAGSAELNPNSEWF 372
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 373 ENYSLCYRC 381
>gi|354496911|ref|XP_003510567.1| PREDICTED: histone-lysine N-methyltransferase MLL [Cricetulus
griseus]
Length = 3907
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 97/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1370 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1424
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1425 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1484
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ + DE Y
Sbjct: 1485 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLLDEMY 1544
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1545 EILSNLPESVAYTCVNC 1561
>gi|344249614|gb|EGW05718.1| Histone-lysine N-methyltransferase HRX [Cricetulus griseus]
Length = 3512
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 97/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1164 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1218
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1219 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1278
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ + DE Y
Sbjct: 1279 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLLDEMY 1338
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 1339 EILSNLPESVAYTCVNC 1355
>gi|410046801|ref|XP_003313790.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Pan
troglodytes]
Length = 2476
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 12/194 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
M +C V + G +L+C C + YH C+ + L W+C C +CE+C
Sbjct: 973 MQDMCVVCGSFGRGAEGHLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCEVC 1030
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ DP++ + C CD +YH YC PP V G + C C CG+ P G W
Sbjct: 1031 GQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASP--GFHCEW 1088
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
YT C C L CP+C Y + + ++ C C+RW+H C+ + E ++
Sbjct: 1089 QNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLFTEDDVEQ 1142
Query: 181 QVDGNLQYRCPTCR 194
D + C +C+
Sbjct: 1143 AADEG--FDCVSCQ 1154
>gi|357617693|gb|EHJ70933.1| hypothetical protein KGM_14791 [Danaus plexippus]
Length = 4460
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 90/183 (49%), Gaps = 10/183 (5%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF-----MFCR 73
+++C +CG YH C+ AQ + + W C SCR+C++CR + C
Sbjct: 388 LMTCVTCGGHYHGTCV-GLAQLPGV--RAGWSCRSCRVCQVCRGEAGGGAGGEARAVACE 444
Query: 74 RCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 133
CD YH C P V + C C CG+ PG G S RW YT CD+C +
Sbjct: 445 HCDKLYHAACLRPVMATVPKYGWKCKCCRVCSDCGARSPGAGPSSRWHAHYTVCDSCYQQ 504
Query: 134 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISD-EKYLQFQ-VDGNLQYRCP 191
KG+ CP+C + YR + M+ C C+R+VH CD ++ + Y Q + + + +Y CP
Sbjct: 505 RNKGSCCPLCRRAYRAAAYRDMIRCSACRRYVHGMCDPEAEPQNYKQKKGENSSYEYTCP 564
Query: 192 TCR 194
C+
Sbjct: 565 ICK 567
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 90/197 (45%), Gaps = 22/197 (11%)
Query: 1 MCRLC-FVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C VG ++EGC +++C CG+ YH C+ N ++ + W+C C +CE
Sbjct: 716 LCVMCGAVGTDSEGC-----LIACSQCGQTYHPYCV-NIKVSQVIVSLG-WRCLDCTVCE 768
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C GD + C CD A+H YC P V G + C + +C CG+ +
Sbjct: 769 GCGSRGDEPLLVLCDDCDTAWHTYCARPALAEVPRGAWRCGRCRRCLVCGTRD-----TA 823
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
W YT C C L + C VC + Y D E ++ C C RW+H CD I E
Sbjct: 824 LWCDNYTECAPCASLVM----CCVCSEPYSDGEL--IIQCTACSRWLHAACDSIRSEADA 877
Query: 179 QFQVDGNLQYRCPTCRG 195
+ Y+C CRG
Sbjct: 878 ETCCRAG--YKCTWCRG 892
>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
gorilla]
Length = 4782
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 85/175 (48%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 264 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 320
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 321 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----SSQWHHNCLICDNCYQQ--QDNLCP 373
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL--QYRCPTCR 194
C K Y M+ C++C+RWVH +CD +D ++D L +Y C C+
Sbjct: 374 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDH-----ELDTQLKEEYICMYCK 423
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 72/154 (46%), Gaps = 12/154 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLK-----NWAQNRDLFH-WSSWKCPSCRIC 57
+C V + G R+L+C CG+ YH C+ + + R+ + W+C C +C
Sbjct: 864 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKGNTGFCEFRNQNERFKGWRCLECTVC 923
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 117
E C + DP + + C CD +YH YC PP + V G + C C CG+ GL
Sbjct: 924 EACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLR 981
Query: 118 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE 151
W YT C C L + CPVC + YR+ +
Sbjct: 982 CEWQNNYTQCAPCASL----SSCPVCYRNYREED 1011
>gi|14626491|gb|AAK70213.1| MLL3-like protein [Mus musculus]
Length = 420
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 84/177 (47%), Gaps = 12/177 (6%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
R+L+C CG+ YH C+ + + W+C C +CE C + DP + + C CD
Sbjct: 10 RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDI 67
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 137
+YH YC PP + V G + C C CG+ GL W YT C C L
Sbjct: 68 SYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGAT--SAGLRCEWQNNYTQCAPCASL---- 121
Query: 138 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
+ CPVC + YR E ++ C C RW+H C ++ E+ ++ D + + C CR
Sbjct: 122 SSCPVCCRNYR--EEDLILQCRQCDRWMHAVCQNLNTEEEVENVAD--IGFDCSMCR 174
>gi|328716144|ref|XP_001947369.2| PREDICTED: hypothetical protein LOC100162709 isoform 1 [Acyrthosiphon
pisum]
Length = 1495
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 53/182 (29%), Positives = 79/182 (43%), Gaps = 17/182 (9%)
Query: 2 CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C+LC + ++ C C YH CL + +W+C C+ C C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ D +K +FC CD YH YC + V G + C + C SCG + PG G S +W
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPGPGDS-KW 1398
Query: 121 FLGY-------------TCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHC 167
F + T C C R + KG++CP C++ Y M C+ C R++H
Sbjct: 1399 FYEFKKTEKTGSKVYCRTLCAPCSRSWKKGHFCPNCMRCYPIKNVERMTQCNSCDRYLHS 1458
Query: 168 QC 169
+C
Sbjct: 1459 EC 1460
>gi|1490271|emb|CAA93625.1| ALL-1 protein [Homo sapiens]
Length = 4005
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1466 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1520
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1521 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1580
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1581 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1640
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1641 EMYEILSNLPESVAYTCVNC 1660
>gi|395520196|ref|XP_003764223.1| PREDICTED: histone-lysine N-methyltransferase MLL [Sarcophilus
harrisii]
Length = 3995
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1446 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1500
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1501 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1560
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1561 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1620
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1621 EMYEILSNLPESVAYTCVNC 1640
>gi|119587788|gb|EAW67384.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_e [Homo sapiens]
Length = 3972
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1607
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1608 EMYEILSNLPESVAYTCVNC 1627
>gi|390469747|ref|XP_002754504.2| PREDICTED: histone-lysine N-methyltransferase MLL [Callithrix
jacchus]
Length = 3994
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1467 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1521
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1522 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1581
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1582 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1641
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1642 EMYEILSNLPESVAYTCVNC 1661
>gi|308199413|ref|NP_001184033.1| histone-lysine N-methyltransferase MLL isoform 1 precursor [Homo
sapiens]
Length = 3972
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1433 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1487
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1488 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1547
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1548 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1607
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1608 EMYEILSNLPESVAYTCVNC 1627
>gi|297269329|ref|XP_001093874.2| PREDICTED: histone-lysine N-methyltransferase MLL [Macaca mulatta]
Length = 3986
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1447 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1501
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1502 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1561
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1562 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1621
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1622 EMYEILSNLPESVAYTCVNC 1641
>gi|395743560|ref|XP_002822597.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL [Pongo abelii]
Length = 4012
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1473 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1527
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1528 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1587
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1588 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSGTED 1647
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1648 EMYEILSNLPESVAYTCVNC 1667
>gi|688443|gb|AAA62593.1| All-1 protein, partial [Mus musculus]
Length = 3866
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 97/203 (47%), Gaps = 18/203 (8%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEI 59
+C LC E+ + C+ C + +H+ CL+ R L +W C C+ C +
Sbjct: 1332 VCFLCSSSEHV------EFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHV 1383
Query: 60 CRRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNG 115
C R K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1384 CGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKG 1443
Query: 116 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISD 174
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S
Sbjct: 1444 WDAQWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSG 1503
Query: 175 EKYLQFQVDGNL----QYRCPTC 193
+ +++ NL Y C C
Sbjct: 1504 TEDEMYEILSNLPESVAYTCVNC 1526
>gi|627837|pir||A48205 All-1 protein +GTE form - mouse (fragment)
Length = 3869
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 97/203 (47%), Gaps = 18/203 (8%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEI 59
+C LC E+ + C+ C + +H+ CL+ R L +W C C+ C +
Sbjct: 1335 VCFLCSSSEHV------EFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHV 1386
Query: 60 CRRTGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNG 115
C R K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1387 CGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKG 1446
Query: 116 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISD 174
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S
Sbjct: 1447 WDAQWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSG 1506
Query: 175 EKYLQFQVDGNL----QYRCPTC 193
+ +++ NL Y C C
Sbjct: 1507 TEDEMYEILSNLPESVAYTCVNC 1529
>gi|347968475|ref|XP_563394.4| AGAP002741-PA [Anopheles gambiae str. PEST]
gi|333467986|gb|EAL40845.4| AGAP002741-PA [Anopheles gambiae str. PEST]
Length = 4925
Score = 102 bits (253), Expect = 9e-19, Method: Composition-based stats.
Identities = 72/244 (29%), Positives = 106/244 (43%), Gaps = 53/244 (21%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKN--------------------------- 36
LCF+ + G E ML C C + YH+ C+K+
Sbjct: 1713 LCFLCGSAGLES---MLFCVCCCEPYHQYCVKDEYNLRTGTGTGLDDTGNMSLLDVTLGA 1769
Query: 37 ---WAQNRDLFHWSSWKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHK-NV 91
Q + L +W CP C +C C TG K C++C YH C + +
Sbjct: 1770 SPQQQQEQLLIARYNWMCPRCTVCFSCNMATGAKVK---CQKCAKHYHTTCLGTSKRLHG 1826
Query: 92 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 150
+ P +C +C SCG+ +V F+G C C RL KGNYCP+C K Y D+
Sbjct: 1827 ADRPLICAACLRCKSCGTT------NVTKFIGNLPMCTPCFRLRQKGNYCPLCQKCYEDN 1880
Query: 151 E-STPMVCCDVCQRWVHCQCDGISDEKYLQFQV-DGNLQYRCPTCRGECYQVRDLEDAVR 208
+ M+ C C+RWVH +C+G++DE+Y V N+++ C C + D+
Sbjct: 1881 DFDLKMMECGDCRRWVHARCEGLTDEQYNMLSVLPENIEFVCKKC------AKHSSDSTA 1934
Query: 209 ELWR 212
LWR
Sbjct: 1935 HLWR 1938
>gi|395526071|ref|XP_003765195.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like
[Sarcophilus harrisii]
Length = 1005
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 85/164 (51%), Gaps = 7/164 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G K + C RC
Sbjct: 584 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRSTKHLLECERCRH 642
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG W + C +C +L+
Sbjct: 643 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGA-APGKNWDSEWSGDCSLCPSCTQLY 701
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 177
KGN+CP+C + Y D++ + M+ C C WVH +C+G+SDE Y
Sbjct: 702 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEGY 745
>gi|341940997|sp|P55200.3|MLL1_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=Zinc finger protein HRX;
Contains: RecName: Full=MLL cleavage product N320;
AltName: Full=N-terminal cleavage product of 320 kDa;
Short=p320; Contains: RecName: Full=MLL cleavage product
C180; AltName: Full=C-terminal cleavage product of 180
kDa; Short=p180
Length = 3966
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1432 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1486
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1487 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1546
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1547 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 1606
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1607 EMYEILSNLPESVAYTCVNC 1626
>gi|148693675|gb|EDL25622.1| mCG1547 [Mus musculus]
Length = 3706
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 1172 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 1226
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 1227 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 1286
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 1287 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 1346
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 1347 EMYEILSNLPESVAYTCVNC 1366
>gi|380807935|gb|AFE75843.1| histone-lysine N-methyltransferase MLL4, partial [Macaca mulatta]
Length = 314
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 104 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 162
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +L+
Sbjct: 163 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLY 221
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGNYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 222 EKGNYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 281
Query: 193 CRG 195
C G
Sbjct: 282 CAG 284
>gi|348526824|ref|XP_003450919.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oreochromis
niloticus]
Length = 4517
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 90/199 (45%), Gaps = 17/199 (8%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
LCF+ + G + C+ C + +H CL R L + +W C CR C+ C R
Sbjct: 1646 LCFLCASSG---NVEFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRYCQACGR 1700
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP------YLCPKHTKCHSCGSNVPGNG 115
K + C +C +YH C P N + P ++C K +C SCG+ PG
Sbjct: 1701 QHQKTKQLLECDKCHNSYHPECLGP---NYPTRPTKKKRIWVCTKCVRCKSCGTTKPGKS 1757
Query: 116 LSVRWFLGYTCCDACGRLFVKGNYCPV-CLKVYRDSESTPMVCCDVCQRWVHCQCDGISD 174
+W ++ C C +LF KGN+CP+ D + M+ C C WVH +C+ ++D
Sbjct: 1758 WDAQWSHDFSMCHDCAKLFAKGNFCPLCDKCYDDDDYDSKMMLCGRCNHWVHAKCENLTD 1817
Query: 175 EKYLQFQVDGNLQYRCPTC 193
E Y ++ Y C C
Sbjct: 1818 EMYELLSKPESVAYTCTKC 1836
>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
(Silurana) tropicalis]
Length = 5215
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 81/177 (45%), Gaps = 16/177 (9%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
L C +CG+ YH CL + W+CP C++C+ C+ +GD N+ + C CD Y
Sbjct: 682 LFCTTCGQHYHGMCLDIAVTP---LKRAGWQCPDCKVCQNCKHSGDDNQMLVCDTCDKGY 738
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNY 139
H +C P +V + + C C CG+ S W L CD C + V
Sbjct: 739 HTFCLQPVMDSVPTNGWKCKNCRICTECGTRT-----SSLWHLNCLLCDPCFQQQVSLP- 792
Query: 140 CPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 194
CP+C K + M+ C VC+RW+H C EK + +D L+ Y C C+
Sbjct: 793 CPICDKPLQPELQKDMLHCHVCKRWIHLDC-----EKCTENDIDDQLKEDYACTLCK 844
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 68/146 (46%), Gaps = 10/146 (6%)
Query: 49 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 108
W+C C +CE C + DP + + C CD +YH +C PP + V G + C C +C
Sbjct: 1113 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTFCLDPPLQTVPKGGWKCKWCVSCTNCK 1172
Query: 109 SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQ 168
+ P GL W YT C C L + CPVC + Y + E ++ C C RW H
Sbjct: 1173 AITP--GLRCEWQNNYTQCAPCASL----SACPVCCQNYIEEEL--ILQCRQCIRWSHAS 1224
Query: 169 CDGISDEKYLQFQVDGNLQYRCPTCR 194
C ++ E ++ D + C C+
Sbjct: 1225 CQNLNTEAEVELAADSG--FDCAACK 1248
>gi|553800|gb|AAA92511.1| trithorax, partial [Homo sapiens]
Length = 1012
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 98/197 (49%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 117 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 171
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 172 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 231
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 232 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 291
Query: 178 -LQFQVDGNLQYRCPTC 193
+ + ++ Y C C
Sbjct: 292 EILSNLPESVAYTCVNC 308
>gi|403362853|gb|EJY81162.1| PHD zinc finger-containing protein [Oxytricha trifallax]
Length = 473
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 82/183 (44%), Gaps = 12/183 (6%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
+ L C C K YH C + N +L W+C C+ C+ C + D +K + C CD
Sbjct: 273 KSLKCFRCLKMYHSTCHQP-PLNTELVKRFQWECSDCKTCKNCNQNNDEDKIIICDMCDK 331
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV-----PGNGLSVRWFLG-YTCCDACG 131
A H +C +PP + S + C C SC + GL W G Y C C
Sbjct: 332 AVHIHCLNPPLFQIPSHNWFCKDCVNCLSCDKELGPISQKSQGL---WCEGIYRMCKDCN 388
Query: 132 RLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCP 191
+GN+C VC K Y + + CD CQ W+H CDG EK + + D +Y CP
Sbjct: 389 YQLQQGNFCKVCRKSYSQDSNEDFIQCDECQDWIHAACDGFDSEKLAKMKDDE--KYSCP 446
Query: 192 TCR 194
C+
Sbjct: 447 ICK 449
>gi|339244153|ref|XP_003378002.1| putative PHD finger protein [Trichinella spiralis]
gi|316973126|gb|EFV56753.1| putative PHD finger protein [Trichinella spiralis]
Length = 864
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 118/282 (41%), Gaps = 20/282 (7%)
Query: 1 MCR-LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
CR +C V + G + M++C CG+ YH C N N + H W+C C +CE
Sbjct: 212 FCRDMCVVCGSFGRGQEGHMVACTQCGQCYHTYC-ANVTLNSVIVH-RGWRCLDCTVCEG 269
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL--S 117
C D + C CD +YH YC PP ++ G + C + C CG+ P NG+ S
Sbjct: 270 CGTGDDEQHLLLCDECDVSYHMYCLDPPLDSIPQGAWRCKWCSTCQFCGAT-PPNGMLDS 328
Query: 118 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 177
++ C C L+ C C Y+ E ++ CD+C RW H C+G+ E
Sbjct: 329 IK---NLRACFKCASLYS----CCFCHLQYK--EEDMIILCDICHRWSHANCNGLCAEDI 379
Query: 178 LQFQVDGNLQYRCPTCR-GECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTEDE 236
L+ +D + C CR G+ + + + K + + A P
Sbjct: 380 LKKGLDAG--FICVYCRPGDACSSAAMHFVIEGVLLTK--SGLSTVQWRPKPAASPVCSA 435
Query: 237 IFSISPYSDDEENGPVVLKNEFGRSLKLSLKGVVDKSPKKVK 278
+++ +P + +N +E S + GVV SP K
Sbjct: 436 VYNSTPSFESLQNATADGFSELSTSQLMIQDGVVSASPDTEK 477
>gi|122937787|gb|ABM68621.1| AAEL000054-PA [Aedes aegypti]
Length = 3489
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 117/262 (44%), Gaps = 48/262 (18%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLK---NWAQ------NRDLFHWSS------ 48
LCF+ + G + +L C C + YH+ C+K N Q N L +S
Sbjct: 857 LCFLCGSSGLDE---LLFCVCCCEPYHQYCVKDEYNIRQVSLDDTNVSLLELTSTTMNAG 913
Query: 49 ------------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 94
W CP C +C C TG K C++C YH C + + +
Sbjct: 914 SSPQQQALNRFNWMCPRCTVCYTCNMATGSKVK---CQKCGKNYHTTCLGTSKRLLGADR 970
Query: 95 PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 152
P +C KC SC + +V F+G C C RL KGN+CP+C + Y D++
Sbjct: 971 PLICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEDNDFD 1024
Query: 153 TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLEDAVRE 209
M+ C C+RWVH +C+G++DE+Y + + N+++ C C EC V DAV
Sbjct: 1025 LKMMECGDCKRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNECANV--WRDAVAA 1082
Query: 210 LWRRKDMADKDLIASLRAAAGL 231
++ ++ L++ R A L
Sbjct: 1083 EFKAGLLSVVKLLSKSRQACAL 1104
>gi|74189196|dbj|BAC35712.2| unnamed protein product [Mus musculus]
Length = 814
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 86/175 (49%), Gaps = 17/175 (9%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C CG+ S +W CD C + + N CP
Sbjct: 415 FCLQPVMKSVPTNGWKCKNCRICIECGTRS-----STQWHHNCLICDTCYQQ--QDNLCP 467
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQ--YRCPTCR 194
C K Y M+ C++C+RWVH +CD +D+ ++D L+ Y C C+
Sbjct: 468 FCGKCYHPELQKDMLHCNMCKRWVHLECDKPTDQ-----ELDSQLKEDYICMYCK 517
>gi|156353194|ref|XP_001622959.1| predicted protein [Nematostella vectensis]
gi|156209597|gb|EDO30859.1| predicted protein [Nematostella vectensis]
Length = 634
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 78/163 (47%), Gaps = 12/163 (7%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M CK C + +H CL + D SW C SC C +C G +K + C +C
Sbjct: 1 MFFCKVCSEPFHGFCLDEEPIDED-----SWCCDSCSTCVVC---GQQDKLLMCDKCQRG 52
Query: 79 YHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 135
YH C P + V G ++C + +C CGS G W +T C CG +
Sbjct: 53 YHVDCLGPSYPVVPEGSEDTWICGRCAQCKLCGSKSAGEDPEAVWMHEFTHCYDCGTAWD 112
Query: 136 KGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 177
GNYCP+C K Y D++ + M+ C+ CQ WVH C I+ ++Y
Sbjct: 113 NGNYCPICEKCYSDNDFDSKMMHCNDCQHWVHASCQNINPDEY 155
>gi|348563138|ref|XP_003467365.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Cavia
porcellus]
Length = 2692
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 86/180 (47%), Gaps = 24/180 (13%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 1215 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCLK 1273
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 137
+C +C SCG+ PG V W Y+ C C +L+ KG
Sbjct: 1274 -------------------ICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLYEKG 1313
Query: 138 NYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRG 195
NYCP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C C G
Sbjct: 1314 NYCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGPCAG 1373
>gi|157103255|ref|XP_001647894.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108884726|gb|EAT48951.1| AAEL000054-PA, partial [Aedes aegypti]
Length = 3069
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 117/262 (44%), Gaps = 48/262 (18%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLK---NWAQ------NRDLFHWSS------ 48
LCF+ + G + +L C C + YH+ C+K N Q N L +S
Sbjct: 656 LCFLCGSSGLDE---LLFCVCCCEPYHQYCVKDEYNIRQVSLDDTNVSLLELTSTTMNAG 712
Query: 49 ------------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 94
W CP C +C C TG K C++C YH C + + +
Sbjct: 713 SSPQQQALNRFNWMCPRCTVCYTCNMATGSKVK---CQKCGKNYHTTCLGTSKRLLGADR 769
Query: 95 PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 152
P +C KC SC + +V F+G C C RL KGN+CP+C + Y D++
Sbjct: 770 PLICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEDNDFD 823
Query: 153 TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLEDAVRE 209
M+ C C+RWVH +C+G++DE+Y + + N+++ C C EC V DAV
Sbjct: 824 LKMMECGDCKRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNECANV--WRDAVAA 881
Query: 210 LWRRKDMADKDLIASLRAAAGL 231
++ ++ L++ R A L
Sbjct: 882 EFKAGLLSVVKLLSKSRQACAL 903
>gi|324499811|gb|ADY39929.1| Histone-lysine N-methyltransferase trr [Ascaris suum]
Length = 2347
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 87/191 (45%), Gaps = 11/191 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C + + G + M+SC +C + YH C+ + W+C C +CE C
Sbjct: 376 VCLICGSIGNDIEGTMVSCATCAQSYHTFCVGLHDKLNSTVVKRGWRCLDCTVCEGCGDG 435
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC PP + + G + C C C + +P NG + G
Sbjct: 436 RDESNLLLCDECDISYHIYCLDPPLECIPHGSWRCKWCATCRRCSAQIP-NGTDTQRMEG 494
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
C+ C L CP CL++Y E ++ C C RW+H +C+ I E+ L+ +
Sbjct: 495 L--CETCYSL----RKCPKCLRLYEIGEH--IIKCQHCSRWLHGKCEEICGEEMLEAAAE 546
Query: 184 GNLQYRCPTCR 194
+RC CR
Sbjct: 547 NG--FRCSLCR 555
>gi|170058059|ref|XP_001864757.1| trithorax [Culex quinquefasciatus]
gi|167877298|gb|EDS40681.1| trithorax [Culex quinquefasciatus]
Length = 3165
Score = 99.4 bits (246), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 72/259 (27%), Positives = 113/259 (43%), Gaps = 43/259 (16%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNW---------AQNRDLFHWSS------ 48
LCF+ + G + ML C C + YH+ C+K+ N L +S
Sbjct: 721 LCFLCGSSGLDE---MLFCVCCCEPYHQYCVKDEYNIRHASLDETNISLLELTSTTIVNS 777
Query: 49 -----------WKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG-P 95
W CP C +C C TG K C++C YH C + + + P
Sbjct: 778 SPAQQALNRFNWMCPRCTVCYTCNMATGTKVK---CQKCCKNYHTTCLGTSKRLLGADRP 834
Query: 96 YLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-ST 153
+C KC SC + +V F+G C C RL KGN+CP+C + Y +++
Sbjct: 835 MICAACLKCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNFCPLCQRCYEENDFDL 888
Query: 154 PMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWR 212
M+ C CQRWVH +C+G++DE+Y + + N+++ C C DAV ++
Sbjct: 889 KMMECGDCQRWVHAKCEGLTDEQYNMLSALPENIEFICKKCGKNNESANVWRDAVAAEFK 948
Query: 213 RKDMADKDLIASLRAAAGL 231
++ L++ R A L
Sbjct: 949 AGLLSVVKLLSKSRQACAL 967
>gi|312371947|gb|EFR20005.1| hypothetical protein AND_20789 [Anopheles darlingi]
Length = 4717
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 112/263 (42%), Gaps = 47/263 (17%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLK--------NWAQNRDLFHWS-------- 47
LCF+ + G E ML C C + YH+ C+K N D + S
Sbjct: 1583 LCFLCGSAGLED---MLFCVCCCEPYHQYCVKDEYNLRAGNGGALDDTLNVSLLDVTLGA 1639
Query: 48 --------------SWKCPSCRICEICRR-TGDPNKFMFCRRCDAAYHCYCQHPPHK-NV 91
+W CP C +C C TG K C++C YH C + +
Sbjct: 1640 SPQEQQQQLLLGRFNWMCPRCTVCFSCNMATGSKVK---CQKCAKYYHTTCLGTSKRLHG 1696
Query: 92 SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 150
+ P +C +C SC + +V F+G C C RL KGNYCP+C K Y D+
Sbjct: 1697 ADRPLICADCLRCKSCSTT------NVTKFIGNLPMCTPCFRLRQKGNYCPLCQKCYEDN 1750
Query: 151 E-STPMVCCDVCQRWVHCQCDGISDEKYLQFQV-DGNLQYRCPTCRGECYQVRDLEDAVR 208
+ M+ C C+RWVH +C+G++DE+Y V N+++ C C DAV
Sbjct: 1751 DFDLKMMECGDCRRWVHARCEGLTDEQYNMLSVLPENIEFICKKCGKHNETANMWRDAVA 1810
Query: 209 ELWRRKDMADKDLIASLRAAAGL 231
++ ++ L++ R A L
Sbjct: 1811 AEFKAGLLSVVKLLSKSRQACAL 1833
>gi|328778088|ref|XP_392252.4| PREDICTED: histone-lysine N-methyltransferase trithorax [Apis
mellifera]
Length = 3195
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 96/204 (47%), Gaps = 31/204 (15%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 58
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 780 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 829
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 112
C P + C RC ++H C VS+ PY+C KC SCGS
Sbjct: 830 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGVSARLYSPERPYVCQSCVKCKSCGSE-- 883
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 171
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH QC+G
Sbjct: 884 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSYWVHAQCEG 940
Query: 172 ISDEKY--LQFQVDGNLQYRCPTC 193
+SDE+Y L + D +++ C C
Sbjct: 941 LSDERYQILSYLPD-TIEFTCSQC 963
>gi|121483956|gb|ABM54289.1| MLL [Pan paniscus]
Length = 523
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 91/180 (50%), Gaps = 11/180 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 71 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 125
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 126 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 185
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y
Sbjct: 186 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMY 245
>gi|431895734|gb|ELK05153.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 921
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 79/156 (50%), Gaps = 10/156 (6%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 294 CTTCGQHYHGMCLDVAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 350
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCP 141
+C P K+V + + C C +CG+ S +W CD+C + + N CP
Sbjct: 351 FCLQPVMKSVPTNGWKCKNCRICVACGTRS-----SSQWHHNCLVCDSCYQQ--QDNLCP 403
Query: 142 VCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 177
C K Y M+ C++C+RWVH +CD +D +
Sbjct: 404 FCGKSYHPELQKDMLHCNMCKRWVHLECDKPADHEL 439
>gi|340710026|ref|XP_003393599.1| PREDICTED: hypothetical protein LOC100646252 [Bombus terrestris]
Length = 3530
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 95/204 (46%), Gaps = 31/204 (15%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 58
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 908 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 957
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 112
C P + C RC ++H C VS+ PY+C KC SCGS
Sbjct: 958 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGVSARLYSPERPYVCQNCVKCKSCGSE-- 1011
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 171
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH C+G
Sbjct: 1012 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSYWVHAYCEG 1068
Query: 172 ISDEKY--LQFQVDGNLQYRCPTC 193
ISDE+Y L + D +++ C C
Sbjct: 1069 ISDERYQILSYLPD-TIEFTCSQC 1091
>gi|402592532|gb|EJW86460.1| hypothetical protein WUBG_02629, partial [Wuchereria bancrofti]
Length = 2207
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 88/191 (46%), Gaps = 14/191 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC + + G + M++C +C + YH C+ + W+C C +CE C
Sbjct: 268 LCLICGSIGKDAEGTMVTCVTCSQSYHTYCVGLHDKLNSTIVRRGWRCLDCTVCEGCGDG 327
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC PP + + GP+ C + C CG+ + N + F
Sbjct: 328 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRCGNQI-FNVTDNQNF-- 384
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
C+ C L CP CL++Y ++ ++ C C RW+H +C+ + E+ F+
Sbjct: 385 ---CETCFTL----RKCPKCLRLYEIGDN--IIKCQHCARWLHGKCEELYGEE--MFETA 433
Query: 184 GNLQYRCPTCR 194
+RC CR
Sbjct: 434 SENGFRCSLCR 444
>gi|383861703|ref|XP_003706324.1| PREDICTED: uncharacterized protein LOC100882965 [Megachile rotundata]
Length = 3434
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/204 (31%), Positives = 96/204 (47%), Gaps = 31/204 (15%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 58
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 905 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 954
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 112
C P + C RC ++H C VS+ PY+C KC SCGS
Sbjct: 955 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGVSARLYSPERPYVCQSCVKCKSCGSE-- 1008
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 171
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH +C+G
Sbjct: 1009 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSYWVHARCEG 1065
Query: 172 ISDEKY--LQFQVDGNLQYRCPTC 193
+SDE+Y L + D +++ C C
Sbjct: 1066 LSDERYQILSYLPDS-IEFTCSQC 1088
>gi|350413847|ref|XP_003490133.1| PREDICTED: hypothetical protein LOC100748492 [Bombus impatiens]
Length = 3522
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 95/207 (45%), Gaps = 37/207 (17%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 58
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 908 ICYLC------GSAGREPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 957
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQ---------HPPHKNVSSGPYLCPKHTKCHSCGS 109
C P + C RC ++H C + P + PY+C KC SCGS
Sbjct: 958 SCHLRSGPK--LSCIRCRQSFHHSCLSKSGVSARLYSPER-----PYVCQNCVKCKSCGS 1010
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQ 168
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH
Sbjct: 1011 E----GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSYWVHAY 1065
Query: 169 CDGISDEKY--LQFQVDGNLQYRCPTC 193
C+GISDE+Y L + D +++ C C
Sbjct: 1066 CEGISDERYQILSYLPD-TIEFTCSQC 1091
>gi|47221226|emb|CAG13162.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3783
Score = 96.7 bits (239), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/198 (28%), Positives = 91/198 (45%), Gaps = 13/198 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
LCF+ + G + C+ C + +H CL R L + +W C CR C+ C R
Sbjct: 1449 LCFLCASSG---NVEFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQACGR 1503
Query: 63 TGDPNK--FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLS 117
K + C +C +YH C P H + ++C +C CG+ PG
Sbjct: 1504 QHQKTKQQLLECDKCRNSYHPECLGPSHPTRPTKKKRVWVCNNCVRCKCCGATKPGKSWD 1563
Query: 118 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEK 176
+W ++ C C +LF K N+C +C K Y D E+ M+ C C VH +C+ ++D+
Sbjct: 1564 AQWSHDFSMCHDCAKLFAKRNFCHICTKCYEDDEADAKMIECGRCHHRVHAKCEKLTDDM 1623
Query: 177 Y-LQFQVDGNLQYRCPTC 193
Y L ++ ++ Y C C
Sbjct: 1624 YELLSKLPESVAYTCTKC 1641
>gi|170581736|ref|XP_001895813.1| F/Y-rich N-terminus family protein [Brugia malayi]
gi|158597106|gb|EDP35332.1| F/Y-rich N-terminus family protein [Brugia malayi]
Length = 2144
Score = 96.7 bits (239), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 86/194 (44%), Gaps = 20/194 (10%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC + + G M++C +C + YH C+ + W+C C +CE C
Sbjct: 166 LCLICGSIGKGAEGTMVTCVTCSQSYHTYCVGLHDKLNSTIVRRGWRCLDCTVCEGCGDG 225
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG---SNVPGNGLSVRW 120
D + + C CD +YH YC PP + + GP+ C + C CG SN+ N
Sbjct: 226 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRCGNQISNITDN------ 279
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQF 180
C+ C L CP CL++Y ++ ++ C C RW+H +C+ + E+ F
Sbjct: 280 ---QNFCETCFTL----RKCPKCLRLYEIGDN--IIKCQHCARWLHGKCEELYGEE--MF 328
Query: 181 QVDGNLQYRCPTCR 194
+ +RC CR
Sbjct: 329 ETASENGFRCSLCR 342
>gi|341878859|gb|EGT34794.1| CBN-SET-16 protein [Caenorhabditis brenneri]
Length = 2498
Score = 95.9 bits (237), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 93/204 (45%), Gaps = 17/204 (8%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M++C +C + YH C+ + W+C C ICE C + GD M C CD
Sbjct: 447 MIACLNCAQTYHTYCVLLHEKINSAIMTHGWRCLDCTICEGCGKGGDDKNLMLCDECDVP 506
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV-PGNGLSVRWFLGYTCCDACGRLFVKG 137
YH YC PP + V +G + C ++C C V G+ L+ R C C L
Sbjct: 507 YHTYCLKPPIEKVPTGSWRCQWCSRCRRCNHKVSSGSELTARGL-----CHPCDSL---- 557
Query: 138 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI-SDEKYLQFQVDGNLQYRCPTCRGE 196
+C C + Y+ ++ ++ C C +W H +C+G+ +DE+ Q ++ + RC CR
Sbjct: 558 QHCARCRQGYQLNDK--LIRCSQCNKWEHGRCEGLYTDEQLEQAALN---RMRCAACRPN 612
Query: 197 CYQVRDLEDAVRELWRRKDMADKD 220
+ L D V +W DKD
Sbjct: 613 RIRNNGLSD-VDTVWCDSIALDKD 635
>gi|410910074|ref|XP_003968515.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Takifugu rubripes]
Length = 4478
Score = 95.5 bits (236), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 90/197 (45%), Gaps = 12/197 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
LCF+ + G + C+ C + +H CL R L + +W C CR C+ C R
Sbjct: 1669 LCFLCASSG---NVEFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQACGR 1723
Query: 63 TGDPNKFMF-CRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C HP ++C K +C CG+ PG
Sbjct: 1724 QHQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCNKCVRCKCCGATKPGKSWDA 1783
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF K N C +C K Y D E+ MV C C VH +C+ ++D+ Y
Sbjct: 1784 QWSHDFSMCHDCAKLFAKRNICVLCNKCYEDDEADGKMVECGRCHHRVHAKCEKLTDDMY 1843
Query: 178 -LQFQVDGNLQYRCPTC 193
L ++ ++ Y C C
Sbjct: 1844 ELLSKLPESVAYTCTKC 1860
>gi|392897209|ref|NP_499819.3| Protein SET-16 [Caenorhabditis elegans]
gi|316891988|emb|CAB03348.3| Protein SET-16 [Caenorhabditis elegans]
Length = 2475
Score = 95.5 bits (236), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 94/219 (42%), Gaps = 17/219 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G M++C +C + YH C+ + W+C C +CE C
Sbjct: 427 MCLVCGSIGKGPEGSMVACSNCAQTYHTYCVTLHDKLNSAVVGRGWRCLDCTVCEGCGTG 486
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP-GNGLSVRWFL 122
GD + C CD +YH YC P + GP+ C ++C C GN L+ +
Sbjct: 487 GDEANLLLCDECDVSYHIYCMKPLLDKIPQGPWRCQWCSRCRRCNHKAASGNDLTSQGL- 545
Query: 123 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI-SDEKYLQFQ 181
C C L CP C + Y+ +E ++ C C +W H C+G+ +DE+ Q
Sbjct: 546 ----CFPCASL----RKCPRCERNYQLNEK--LIRCSQCSKWQHGACEGLYTDEQLEQAA 595
Query: 182 VDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 220
+D + RC CR + Q D V +W DKD
Sbjct: 596 ID---RMRCSACRPKRVQPSGFSD-VDTVWCDYVALDKD 630
>gi|307180358|gb|EFN68384.1| Histone-lysine N-methyltransferase trithorax [Camponotus
floridanus]
Length = 3218
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 96/204 (47%), Gaps = 31/204 (15%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 58
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 779 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 828
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 112
C P + C RC ++H C VSS PY+C KC SCGS
Sbjct: 829 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGVSSRLYNPDRPYVCQSCIKCKSCGSE-- 882
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 171
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH +C+G
Sbjct: 883 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSCWVHARCEG 939
Query: 172 ISDEKY--LQFQVDGNLQYRCPTC 193
+SDE+Y L + D +++ C C
Sbjct: 940 LSDERYQILSYLPDS-IEFTCSQC 962
>gi|3309543|gb|AAC41377.1| MLL [Takifugu rubripes]
Length = 4498
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 90/198 (45%), Gaps = 13/198 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
LCF+ + G + C+ C + +H CL R L + +W C CR C+ C R
Sbjct: 1669 LCFLCASSG---NVEFVFCQVCCEPFHLFCLGE--SERPLQEQFENWCCRRCRFCQACGR 1723
Query: 63 TGDPNK--FMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 117
K + C +C +YH C HP ++C K +C CG+ PG
Sbjct: 1724 QHQKTKQQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCNKCVRCKCCGATKPGKSWD 1783
Query: 118 VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEK 176
+W ++ C C +LF K N C +C K Y D E+ MV C C VH +C+ ++D+
Sbjct: 1784 AQWSHDFSMCHDCAKLFAKRNICVLCNKCYEDDEADGKMVECGRCHHRVHAKCEKLTDDM 1843
Query: 177 Y-LQFQVDGNLQYRCPTC 193
Y L ++ ++ Y C C
Sbjct: 1844 YELLSKLPESVAYTCTKC 1861
>gi|393908177|gb|EJD74941.1| F/Y-rich family protein [Loa loa]
Length = 2288
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 14/199 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC + + G + M++C +C + YH C+ + W+C C +CE C
Sbjct: 296 LCLICGSIGKDVEGTMVTCVTCSQSYHTYCVGLHDKLNSTLIKRGWRCLDCTVCEGCGDG 355
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC PP + + GP+ C + C C SN N + F
Sbjct: 356 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRC-SNQISNIADNQHF-- 412
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
C+ C L CP CL+ Y +S ++ C C RW+H +C+ + ++ F+
Sbjct: 413 ---CETCFTL----RRCPKCLRFYEIGDS--IIKCQHCARWLHGKCEELYGDE--MFETA 461
Query: 184 GNLQYRCPTCRGECYQVRD 202
+RC CR + V D
Sbjct: 462 SENGFRCSLCRPQGNVVGD 480
>gi|312071355|ref|XP_003138570.1| F/Y-rich family protein [Loa loa]
Length = 1597
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 14/199 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC + + G + M++C +C + YH C+ + W+C C +CE C
Sbjct: 273 LCLICGSIGKDVEGTMVTCVTCSQSYHTYCVGLHDKLNSTLIKRGWRCLDCTVCEGCGDG 332
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D + + C CD +YH YC PP + + GP+ C + C C SN N + F
Sbjct: 333 HDESNLILCDECDISYHIYCLEPPLERIPHGPWRCKWCSACRRC-SNQISNIADNQHF-- 389
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
C+ C L CP CL+ Y +S ++ C C RW+H +C+ + ++ F+
Sbjct: 390 ---CETCFTL----RRCPKCLRFYEIGDS--IIKCQHCARWLHGKCEELYGDE--MFETA 438
Query: 184 GNLQYRCPTCRGECYQVRD 202
+RC CR + V D
Sbjct: 439 SENGFRCSLCRPQGNVVGD 457
>gi|355732606|gb|AES10757.1| histone-lysine N-methyltransferase MLL4-like protein [Mustela
putorius furo]
Length = 212
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 95/183 (51%), Gaps = 8/183 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 11 LVFCQVCCDPFHPFCLEE-AERPLPQHHDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 69
Query: 78 AYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C +LF
Sbjct: 70 AYHPACLGPSYPTRATRKRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTQLF 128
Query: 135 VKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPT 192
KGN+CP+C + Y D++ + M+ C C WVH +C+G+SDE Y + + ++ Y C
Sbjct: 129 EKGNFCPICTRCYEDNDYESKMMQCAQCDHWVHAKCEGLSDEDYEILSGLPDSVLYTCGP 188
Query: 193 CRG 195
C G
Sbjct: 189 CAG 191
>gi|432880997|ref|XP_004073754.1| PREDICTED: uncharacterized protein LOC101157226 [Oryzias latipes]
Length = 2812
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 85/182 (46%), Gaps = 7/182 (3%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCDA 77
M+ C+ C + +H CL ++ +W C C+ C +C RR+ + CRRC
Sbjct: 1050 MIFCQICCEPFHSFCLTPEECPQE-DSKENWCCRRCKFCHVCGRRSKSAKPVLQCRRCQT 1108
Query: 78 AYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 135
YH C P P S P++C +C SCG PG + W C C L
Sbjct: 1109 CYHPSCLGPTYPKPVNCSLPWVCMTCIRCKSCGVT-PGKTWDLAWNHEQDLCPECTILNK 1167
Query: 136 KGNYCPVCLKVYRD-SESTPMVCCDVCQRWVHCQCDGISDEK-YLQFQVDGNLQYRCPTC 193
KG++C VC K Y D S M+ C C+ W+H +C+G+S+E L + + + C C
Sbjct: 1168 KGHFCTVCQKCYEDGSRPLQMIQCSECRHWIHPKCEGLSEELCGLMSTLPDSGGFTCTPC 1227
Query: 194 RG 195
RG
Sbjct: 1228 RG 1229
>gi|348533938|ref|XP_003454461.1| PREDICTED: hypothetical protein LOC100700132 [Oreochromis niloticus]
Length = 2924
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 69/142 (48%), Gaps = 5/142 (3%)
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPG 113
+C +C G + C RC YH C P P +N ++C T+C SCG PG
Sbjct: 1411 VCFLCASKGQHEPLLECERCQNCYHASCLGPNYPKQNKKRKAWVCMTCTRCKSCGV-TPG 1469
Query: 114 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGI 172
W C C +L+ +GNYCP+C K Y D++ + M+ C C WVH +C+ +
Sbjct: 1470 KSWDTEWNHDKGLCPDCSKLYDQGNYCPICFKCYEDNDYDSQMMQCGTCNHWVHAKCEDL 1529
Query: 173 SDEKY-LQFQVDGNLQYRCPTC 193
+DE Y + + ++ Y C C
Sbjct: 1530 TDELYEILSSLPESVVYSCRPC 1551
>gi|332025910|gb|EGI66066.1| Histone-lysine N-methyltransferase trithorax [Acromyrmex echinatior]
Length = 3452
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 88/186 (47%), Gaps = 28/186 (15%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 58
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 903 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 952
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 112
C P + C RC ++H C VSS PY+C KC SCGS
Sbjct: 953 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGVSSRLYSPDRPYVCQSCIKCKSCGSE-- 1006
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 171
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH +C+G
Sbjct: 1007 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSCWVHARCEG 1063
Query: 172 ISDEKY 177
+SDE+Y
Sbjct: 1064 LSDERY 1069
>gi|60360484|dbj|BAD90486.1| mKIAA4050 protein [Mus musculus]
Length = 762
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 26 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 80
Query: 63 TGDPNK-FMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 81 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 140
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S +
Sbjct: 141 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCESLSGTED 200
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 201 EMYEILSNLPESVAYTCVNC 220
>gi|1042097|gb|AAB34770.1| trx Zinc-finger region homolog [Homo sapiens]
Length = 366
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 92/183 (50%), Gaps = 9/183 (4%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRC 75
+ + C+ C + +H+ CL+ R L +W C C+ C +C R K + C +C
Sbjct: 156 KFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKC 213
Query: 76 DAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 132
+YH C P + + ++C K +C SCGS PG G +W ++ C C +
Sbjct: 214 RNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAK 273
Query: 133 LFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRC 190
LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + + + Y C
Sbjct: 274 LFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPECVAYTC 333
Query: 191 PTC 193
C
Sbjct: 334 VNC 336
>gi|326430870|gb|EGD76440.1| mixed-lineage leukemia protein [Salpingoeca sp. ATCC 50818]
Length = 2027
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 103/264 (39%), Gaps = 24/264 (9%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHW----SSWKCPSCRICEICRRTGDPNKFMFCRR 74
+ C+ C + YHR C + D W W C C+ C C + + + C +
Sbjct: 641 FVHCRVCCQPYHRFCTGDSRLQSDE-RWRDAAEQWMCIDCQTCCACGQAEPRSTLLTCAK 699
Query: 75 CDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGR 132
C H +C P Y+C C CGS PG W + C+ C
Sbjct: 700 CTRHIHSHCAGDGVPQGLKRDDVYMCSACVVCTKCGSTSPGEFKGSTWHCQFELCEECHG 759
Query: 133 LFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQC--DGISDEKYLQFQVDG--NLQ 187
+KGN CP+C K+Y D + TPM CC+ C+RW+H C L + + G +
Sbjct: 760 QTLKGNICPMCDKLYSDDDFDTPMFCCEKCERWLHATCVDPAFKTNMELYYLISGVQTID 819
Query: 188 YRCPTC----RGECYQVRDLEDAVRELWRRK------DMADKDLIASLRAAAGLPTEDEI 237
Y C C +G L A + ++ + DM ++ +L A + P D
Sbjct: 820 YHCQDCVRLSKGVLPDFAALWKATQAAFQERLVGLVEDMKQQEGYTALAAVSPAPVRDHT 879
Query: 238 F--SISPYSDDEENGPVVLKNEFG 259
+ + SDD P +E G
Sbjct: 880 SDDTRTRASDDTRTRPSTGTDEDG 903
>gi|195064789|ref|XP_001996640.1| GH19675 [Drosophila grimshawi]
gi|193892772|gb|EDV91638.1| GH19675 [Drosophila grimshawi]
Length = 3837
Score = 92.4 bits (228), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 66/245 (26%), Positives = 105/245 (42%), Gaps = 48/245 (19%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNC----------------------------LK 35
LCF+ + G + ++ C C + YH+ C +
Sbjct: 1253 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFDDTLMNSLLETSSISAIT 1309
Query: 36 NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG- 94
N A + L +W CP C +C C + + C++C YH C + + +
Sbjct: 1310 NTALTQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGADR 1366
Query: 95 PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-S 152
P +C KC SC + V F+G C AC +L KGN+CP+C K Y D++
Sbjct: 1367 PLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDDNDFD 1420
Query: 153 TPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELW 211
M+ C C +WVH +C+G+SDE+Y L + ++++ C C C D+ E W
Sbjct: 1421 LKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRC----DVSRNKAEEW 1476
Query: 212 RRKDM 216
R+ M
Sbjct: 1477 RQAVM 1481
>gi|322792358|gb|EFZ16342.1| hypothetical protein SINV_07789 [Solenopsis invicta]
Length = 3272
Score = 92.4 bits (228), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 88/186 (47%), Gaps = 28/186 (15%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICE 58
+C LC G ++ C+ C + YH CL+ W + +W CP C IC+
Sbjct: 856 ICYLC------GSAGKEPLIHCQCCCEPYHAFCLEPSEW----NACAQPNWCCPRCTICQ 905
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG------PYLCPKHTKCHSCGSNVP 112
C P + C RC ++H C +SS PY+C +C SCGS
Sbjct: 906 SCHLRSGPK--LSCIRCRQSFHHSCLS--KSGISSRLYSPDRPYVCQSCIRCKSCGSE-- 959
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 171
G++V C C +L +GNYCP+C + Y +++ T M+ C C WVH +C+G
Sbjct: 960 --GVNVH-VGNLPLCSMCFKLRQQGNYCPLCQRCYNENDFDTKMMECSECSCWVHARCEG 1016
Query: 172 ISDEKY 177
+SDE+Y
Sbjct: 1017 LSDERY 1022
>gi|195392284|ref|XP_002054789.1| trx [Drosophila virilis]
gi|194152875|gb|EDW68309.1| trx [Drosophila virilis]
Length = 3822
Score = 92.4 bits (228), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 114/268 (42%), Gaps = 52/268 (19%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 34
LCF+ + G + ++ C C + YH+ C+
Sbjct: 1273 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMNSLLETSNNACAI 1329
Query: 35 ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
N A N+ L +W CP C +C C + + C++C YH C + +
Sbjct: 1330 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1386
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C AC +L KGN+CP+C K Y D
Sbjct: 1387 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1440
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLED-- 205
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C C R+ D
Sbjct: 1441 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRNKADEW 1500
Query: 206 --AVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1501 RQAVMEEFKSSLYSVLKLLSKSRQACAL 1528
>gi|10720313|sp|Q24742.1|TRX_DROVI RecName: Full=Histone-lysine N-methyltransferase trithorax
gi|899254|emb|CAA90349.1| predicted trithorax protein [Drosophila virilis]
Length = 3828
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 107/251 (42%), Gaps = 52/251 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 34
LCF+ + G + ++ C C + YH+ C+
Sbjct: 1253 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMTSLLETSNNACAI 1309
Query: 35 ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
N A N+ L +W CP C +C C + + C++C YH C + +
Sbjct: 1310 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1366
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C AC +L KGN+CP+C K Y D
Sbjct: 1367 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1420
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAV 207
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C C R+ D
Sbjct: 1421 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRNKADE- 1479
Query: 208 RELWRRKDMAD 218
WR+ M +
Sbjct: 1480 ---WRQAVMEE 1487
>gi|195446233|ref|XP_002070689.1| GK10889 [Drosophila willistoni]
gi|194166774|gb|EDW81675.1| GK10889 [Drosophila willistoni]
Length = 3189
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 116/267 (43%), Gaps = 53/267 (19%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1155 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLDTSVN 1208
Query: 48 ---------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVS 92
+W CP C +C C + + C++C YH C + +
Sbjct: 1209 PSTCAPSMNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHATCLGTSKRLLG 1266
Query: 93 SG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 150
+ P +C KC SC + V F+G C AC +L KGNYCP+C K Y D+
Sbjct: 1267 ADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTACFKLRKKGNYCPICQKCYDDN 1320
Query: 151 E-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE-- 204
+ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E ++ E
Sbjct: 1321 DFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSHIKADEWR 1380
Query: 205 DAVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1381 QAVMEEFKSSLYSVLKLLSKSRQACAL 1407
>gi|195109821|ref|XP_001999480.1| GI24532 [Drosophila mojavensis]
gi|193916074|gb|EDW14941.1| GI24532 [Drosophila mojavensis]
Length = 3756
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 105/249 (42%), Gaps = 52/249 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCL----------------------------- 34
LCF+ + G + ++ C C + YH+ C+
Sbjct: 1234 LCFLCGSTGLDP---LIFCACCCEPYHQYCVLDEYNLKHSSFEDTLMNSLLDTSSNACAI 1290
Query: 35 ---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
N A N+ L +W CP C +C C + + C++C YH C + +
Sbjct: 1291 SAATNTALNQ-LTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1347
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C AC +L KGN+CP+C K Y D
Sbjct: 1348 GADRPLICVNCLKCKSCATT------KVSKFVGNLPMCTACFKLRKKGNFCPICQKCYDD 1401
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAV 207
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C C R D
Sbjct: 1402 NDFDLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRCDVSRQKADE- 1460
Query: 208 RELWRRKDM 216
WR+ M
Sbjct: 1461 ---WRQAVM 1466
>gi|47219426|emb|CAG10790.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1776
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 76/160 (47%), Gaps = 8/160 (5%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEIC-RRTGDPNKFMFCRRCD 76
M+ C+ C + +H CL + R L + +W C C+ C +C RR+ + + CRRC
Sbjct: 184 MIFCQICCEPFHSFCL--LPEERPLKDNKENWCCRRCKFCHVCGRRSKNTKPVLQCRRCQ 241
Query: 77 AAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
+YH C P P P++C +C SCG PG + W C C L
Sbjct: 242 TSYHPACLGPTYPKPMNCKIPWVCMTCIRCKSCGV-TPGKTWDLAWNHDEDLCPDCTLLH 300
Query: 135 VKGNYCPVCLKVYRDS-ESTPMVCCDVCQRWVHCQCDGIS 173
KGN+C +C K Y D+ M+ C C W+H C+GIS
Sbjct: 301 KKGNFCTICHKCYDDNMRHAEMIQCSACNHWIHYSCEGIS 340
>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
Length = 4387
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 93/199 (46%), Gaps = 26/199 (13%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+ +LCF+ + G E+ ML C SC + YH C + SW C C C C
Sbjct: 856 LAKLCFLCGSAGREK---MLVCSSCCEWYHVWCAEEAG------GGGSWTCARCVWCAAC 906
Query: 61 RRTGDPNKFMFCRRCDAAYHCYC--QHPP-HKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 117
R + CR C YH C PP H+ S P +C KC SC SN
Sbjct: 907 ARPA---ARLRCRSCARPYHAACLPSAPPDHR--SDWPQICSSCLKCKSCDSN------R 955
Query: 118 VRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDE 175
V F+G C C +L KGNYCP+C YRD++ + M+ C C RWVH C+G+S E
Sbjct: 956 VNKFVGSLPFCRPCFKLRQKGNYCPLCQACYRDNDFDSKMMECGWCARWVHASCEGLSGE 1015
Query: 176 KY-LQFQVDGNLQYRCPTC 193
Y L + +++Y C C
Sbjct: 1016 GYQLLSALPPSIEYICCKC 1034
>gi|167526642|ref|XP_001747654.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773758|gb|EDQ87394.1| predicted protein [Monosiga brevicollis MX1]
Length = 1547
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 77/158 (48%), Gaps = 13/158 (8%)
Query: 49 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHS 106
W+C +C+ C++C R + C C H C P +++ ++C +C
Sbjct: 633 WQCKNCQTCDVCTRIEPTQHLLSCDVCGVHRHAACASAATPSYMLATQRWVCTDCVQCEH 692
Query: 107 CG-SNVPGNGLSVR------WFLGYTCCDACGRLFVKGNYCPVCLKVYR-DSESTPMVCC 158
CG ++V G+ + W + C CG ++GN+CPVC K YR D MV C
Sbjct: 693 CGATDVRGHRPDPKLREEPTWQCDFRLCFDCGLNKLRGNFCPVCGKTYRGDDYDVKMVGC 752
Query: 159 DVCQRWVHCQCDGISDEKY--LQFQVDGNLQYRCPTCR 194
D C RW+H +CD I + +Y L F V ++ Y CP CR
Sbjct: 753 DRCDRWLHAECDDIDEARYHLLTF-VPSSMSYFCPDCR 789
>gi|349803513|gb|AEQ17229.1| putative trx zinc-finger region [Pipa carvalhoi]
Length = 506
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 76/160 (47%), Gaps = 29/160 (18%)
Query: 50 KCPSCRI---CEICRRTGDPNKF-----------MFCRRCDAAYHCYCQHPPHKNVSSGP 95
+CP C++ C +C D KF M C +C +N
Sbjct: 312 QCPGCQVPDDCGVCTNCLDKPKFGGRNIKKQCCKMECNKC-------------RNKKKRV 358
Query: 96 YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STP 154
++C K +C SCGS PG G +W ++ C C +LF KGN+CP+C K Y D + +
Sbjct: 359 WICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCNKCYDDDDYESK 418
Query: 155 MVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 193
M+ C C RWVH +C+ ++DE Y + + ++ Y C C
Sbjct: 419 MMQCGKCDRWVHSKCESLTDEMYEILSNLPESVAYTCINC 458
>gi|351705860|gb|EHB08779.1| Histone-lysine N-methyltransferase HRX [Heterocephalus glaber]
Length = 3899
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 5/134 (3%)
Query: 65 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWF 121
D + + C +C +YH C P + + ++C K +C SCGS PG G +W
Sbjct: 1434 DFKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWS 1493
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQ 179
++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y +
Sbjct: 1494 HDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEIL 1553
Query: 180 FQVDGNLQYRCPTC 193
+ ++ Y C C
Sbjct: 1554 SNLPESVAYTCINC 1567
>gi|308497100|ref|XP_003110737.1| CRE-SET-16 protein [Caenorhabditis remanei]
gi|308242617|gb|EFO86569.1| CRE-SET-16 protein [Caenorhabditis remanei]
Length = 2509
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 96/225 (42%), Gaps = 37/225 (16%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCL--------------------KNWAQNRDL 43
LC V + G M++C +C + YH C+ + + N +
Sbjct: 437 LCLVCGSIGKGVEASMVACSNCAQTYHTYCISLHDKVGSQMIIKMLSTIIFQKFQLNSAV 496
Query: 44 FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTK 103
W+C C CE C GD K + C CD +YH YC PP + + GP+ C ++
Sbjct: 497 I-TRGWRCLDCTFCEGCGAGGDEEKLLLCEECDVSYHMYCIKPPLEAIPKGPWRCQWCSR 555
Query: 104 CHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 162
C C + GN L+ + C +C L C C + Y+ +E ++ C C
Sbjct: 556 CRRCNHKSTSGNDLTSKGL-----CHSCQSL----QSCSRCNRGYQLNEK--IIKCSQCS 604
Query: 163 RWVHCQCDGI-SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDA 206
+W H C+G+ +DE+ Q ++ + RC +CR Q+ +A
Sbjct: 605 KWHHGFCEGLHTDEQLEQAALN---RMRCTSCRPNRAQINGFSEA 646
>gi|426370676|ref|XP_004052287.1| PREDICTED: histone-lysine N-methyltransferase MLL [Gorilla gorilla
gorilla]
Length = 3837
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 68 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 124
+ + C +C +YH C P + + ++C K +C SCGS PG G +W +
Sbjct: 1363 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 1422
Query: 125 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 182
+ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + +
Sbjct: 1423 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 1482
Query: 183 DGNLQYRCPTC 193
++ Y C C
Sbjct: 1483 PESVAYTCVNC 1493
>gi|444725290|gb|ELW65863.1| Histone-lysine N-methyltransferase MLL [Tupaia chinensis]
Length = 3806
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 5/134 (3%)
Query: 65 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWF 121
D + + C +C +YH C P + + ++C K +C SCGS PG G +W
Sbjct: 1396 DFKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWS 1455
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQ 179
++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y +
Sbjct: 1456 HDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEIL 1515
Query: 180 FQVDGNLQYRCPTC 193
+ ++ Y C C
Sbjct: 1516 SNLPESVAYTCVNC 1529
>gi|899268|emb|CAA58584.1| ALL-1 protein [Homo sapiens]
Length = 395
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 88/176 (50%), Gaps = 11/176 (6%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 222 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 276
Query: 63 TGDPNKFMF-CRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C P + + ++C K +C SCGS PG G
Sbjct: 277 QHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 336
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 173
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +S
Sbjct: 337 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLS 392
>gi|195329040|ref|XP_002031219.1| GM25861 [Drosophila sechellia]
gi|194120162|gb|EDW42205.1| GM25861 [Drosophila sechellia]
Length = 3603
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321
Query: 48 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
+W CP C +C C + + C++C YH C + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 204
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493
Query: 205 -DAVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521
>gi|469800|emb|CAA83516.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052593|emb|CAA90513.1| trithorax protein trxII [Drosophila melanogaster]
gi|1311653|gb|AAB35873.1| large trx isoform=trithorax gene product large isoform {alternatively
spliced, exon II-containing isoform} [Drosophila,
embryos, Peptide, 3726 aa]
Length = 3726
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321
Query: 48 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
+W CP C +C C + + C++C YH C + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 204
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493
Query: 205 -DAVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521
>gi|326676507|ref|XP_689347.4| PREDICTED: hypothetical protein LOC560856 [Danio rerio]
Length = 1761
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 98/204 (48%), Gaps = 24/204 (11%)
Query: 1 MCRLCFV-GENEGCERARRMLSCKSCGKKYHRNCL----KNWAQNRDLFHWSSWKCPSCR 55
+C LC G++E M+ C+ C + +H CL + +N++ +W C C+
Sbjct: 1098 VCLLCASKGQHE-------MVYCQMCCEPFHHFCLPPDDRPKEENKE-----NWCCRRCK 1145
Query: 56 ICEIC-RRTGDPNKFMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVP 112
C +C R++ + C+RC YH C P P + ++C +C SCG P
Sbjct: 1146 FCHVCGRKSKQTKPVLQCKRCMYCYHPSCLGPTYPKPVKMNTSWVCMLCIRCKSCGV-TP 1204
Query: 113 GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDG 171
G W C C +L KG++C VCLK Y + E + M+ C C WVH +C+G
Sbjct: 1205 GKSSDTSWNHELELCPDCNKLHSKGDFCTVCLKCYEEHEFDSSMMQCARCAHWVHPKCEG 1264
Query: 172 ISDEKY-LQFQVDG-NLQYRCPTC 193
++D+ + + ++ G +L + C C
Sbjct: 1265 LTDDLHEILCRLRGKSLVFSCAPC 1288
>gi|195501651|ref|XP_002097884.1| GE26459 [Drosophila yakuba]
gi|194183985|gb|EDW97596.1| GE26459 [Drosophila yakuba]
Length = 2853
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 92/191 (48%), Gaps = 16/191 (8%)
Query: 49 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG-PYLCPKHTKCHSC 107
W CP C +C C + + C++C YH C + + + P +C KC SC
Sbjct: 1354 WLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGADRPLICVNCLKCKSC 1411
Query: 108 GSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWV 165
+ V F+G C C +L KGN+CP+C + Y D++ M+ C C +WV
Sbjct: 1412 STT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDDNDFDLKMMECGDCAQWV 1465
Query: 166 HCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE--DAVRELWRRKDMADKD 220
H +C+G+SDE+Y L + ++++ C C R E +++ E AV E ++ +
Sbjct: 1466 HSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEWRQAVMEEFKASLYSVLK 1525
Query: 221 LIASLRAAAGL 231
L++ R A L
Sbjct: 1526 LLSKSRQACAL 1536
>gi|194900731|ref|XP_001979909.1| GG21380 [Drosophila erecta]
gi|190651612|gb|EDV48867.1| GG21380 [Drosophila erecta]
Length = 3741
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 92/191 (48%), Gaps = 16/191 (8%)
Query: 49 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG-PYLCPKHTKCHSC 107
W CP C +C C + + C++C YH C + + + P +C KC SC
Sbjct: 1353 WLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGADRPLICVNCLKCKSC 1410
Query: 108 GSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWV 165
+ V F+G C C +L KGN+CP+C + Y D++ M+ C C +WV
Sbjct: 1411 STT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDDNDFDLKMMECGDCAQWV 1464
Query: 166 HCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE--DAVRELWRRKDMADKD 220
H +C+G+SDE+Y L + ++++ C C R E +++ E AV E ++ +
Sbjct: 1465 HSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSRIKAEEWRQAVMEEFKASLYSVLK 1524
Query: 221 LIASLRAAAGL 231
L++ R A L
Sbjct: 1525 LLSKSRQACAL 1535
>gi|17136556|ref|NP_476769.1| trithorax, isoform D [Drosophila melanogaster]
gi|19550184|ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster]
gi|290457684|sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName:
Full=Lysine N-methyltransferase 2A
gi|10726522|gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster]
gi|23171244|gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster]
Length = 3726
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321
Query: 48 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
+W CP C +C C + + C++C YH C + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 204
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493
Query: 205 -DAVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521
>gi|158818|gb|AAA29025.1| zinc-binding protein [Drosophila melanogaster]
Length = 3759
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1268 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 1321
Query: 48 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
+W CP C +C C + + C++C YH C + +
Sbjct: 1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1379
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1380 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1433
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 204
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1434 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1493
Query: 205 -DAVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1494 RQAVMEEFKASLYSVLKLLSKSRQACAL 1521
>gi|198452207|ref|XP_002137435.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198131831|gb|EDY67993.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 3779
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 112/261 (42%), Gaps = 54/261 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1191 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLETSLS 1244
Query: 48 -------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 94
+W CP C +C C + + C++C YH C + + +
Sbjct: 1245 GAHSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGAD 1302
Query: 95 -PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE- 151
P +C KC SC + V F+G C C +L KGNYCP+C K Y D++
Sbjct: 1303 RPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNYCPICQKCYDDNDF 1356
Query: 152 STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVREL 210
M+ C C +WVH +C+G+SDE+Y L + ++++ C C RD +
Sbjct: 1357 DLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKRCAR-----RDSSRLKADE 1411
Query: 211 WRRKDMADKDLIASLRAAAGL 231
WR+ M ++ ASL + L
Sbjct: 1412 WRQAVM--EEFKASLYSVLKL 1430
>gi|194764639|ref|XP_001964436.1| GF23177 [Drosophila ananassae]
gi|190614708|gb|EDV30232.1| GF23177 [Drosophila ananassae]
Length = 3708
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 117/267 (43%), Gaps = 53/267 (19%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 1266 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEETTLMGSLLETSVT 1319
Query: 48 ---------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVS 92
+W CP C +C C + + C++C YH C + +
Sbjct: 1320 AVGSSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLG 1377
Query: 93 SG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDS 150
+ P +C KC SC + V F+G C C +L KGN+CP+C K Y D+
Sbjct: 1378 ADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTNCFKLRKKGNFCPICQKCYDDN 1431
Query: 151 E-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE-- 204
+ M+ C C++WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1432 DFDLKMMECGDCRQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKMKAEEWR 1491
Query: 205 DAVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1492 QAVMEEFKVSLYSVLKLLSKSRQACAL 1518
>gi|350400841|ref|XP_003485981.1| PREDICTED: hypothetical protein LOC100740971 [Bombus impatiens]
Length = 2805
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 77/188 (40%), Gaps = 23/188 (12%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 2585 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2643
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 119
D +K +FC CD YH YC + V G + C + C +CGS PG S R
Sbjct: 2644 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCVNCGSREPGGINSDRNS 2701
Query: 120 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVY---RDSESTPMVCCDVC 161
W Y T C C +L+ KG YCP C + + R +V C C
Sbjct: 2702 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEVNLVHCSAC 2761
Query: 162 QRWVHCQC 169
+++H C
Sbjct: 2762 DKYLHLGC 2769
>gi|17136558|ref|NP_476770.1| trithorax, isoform B [Drosophila melanogaster]
gi|19550181|ref|NP_599108.1| trithorax, isoform C [Drosophila melanogaster]
gi|62472551|ref|NP_001014621.1| trithorax, isoform E [Drosophila melanogaster]
gi|23171245|gb|AAN13600.1| trithorax, isoform B [Drosophila melanogaster]
gi|23171246|gb|AAN13601.1| trithorax, isoform C [Drosophila melanogaster]
gi|61679333|gb|AAX52951.1| trithorax, isoform E [Drosophila melanogaster]
Length = 3358
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 900 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 953
Query: 48 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
+W CP C +C C + + C++C YH C + +
Sbjct: 954 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1011
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1012 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1065
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 204
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1066 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1125
Query: 205 -DAVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1126 RQAVMEEFKASLYSVLKLLSKSRQACAL 1153
>gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052594|emb|CAA90514.1| trithorax protein trxI [Drosophila melanogaster]
Length = 3358
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 116/268 (43%), Gaps = 54/268 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 900 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHGSFEDTTLMGSLLETTVN 953
Query: 48 ----------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNV 91
+W CP C +C C + + C++C YH C + +
Sbjct: 954 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLL 1011
Query: 92 SSG-PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRD 149
+ P +C KC SC + V F+G C C +L KGN+CP+C + Y D
Sbjct: 1012 GADRPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDD 1065
Query: 150 SE-STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC--RGECYQVRDLE- 204
++ M+ C C +WVH +C+G+SDE+Y L + ++++ C C R E +++ E
Sbjct: 1066 NDFDLKMMECGDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEW 1125
Query: 205 -DAVRELWRRKDMADKDLIASLRAAAGL 231
AV E ++ + L++ R A L
Sbjct: 1126 RQAVMEEFKASLYSVLKLLSKSRQACAL 1153
>gi|390178053|ref|XP_003736554.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859306|gb|EIM52627.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 3474
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 112/261 (42%), Gaps = 54/261 (20%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---------------- 47
LCF+ + G + ++ C C + YH+ C+++ +L H S
Sbjct: 886 LCFLCGSTGLDP---LIFCACCCEPYHQYCVQD---EYNLKHSSFEDTTLMNSLLETSLS 939
Query: 48 -------------SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 94
+W CP C +C C + + C++C YH C + + +
Sbjct: 940 GAHSSLNQLTQRLNWLCPRCTVCYTCNMSSGSK--VKCQKCQKNYHSTCLGTSKRLLGAD 997
Query: 95 -PYLCPKHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSE- 151
P +C KC SC + V F+G C C +L KGNYCP+C K Y D++
Sbjct: 998 RPLICVNCLKCKSCSTT------KVSKFVGNLPMCTGCFKLRKKGNYCPICQKCYDDNDF 1051
Query: 152 STPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTCRGECYQVRDLEDAVREL 210
M+ C C +WVH +C+G+SDE+Y L + ++++ C C RD +
Sbjct: 1052 DLKMMECGDCNQWVHSKCEGLSDEQYNLLSTLPESIEFICKRCAR-----RDSSRLKADE 1106
Query: 211 WRRKDMADKDLIASLRAAAGL 231
WR+ M ++ ASL + L
Sbjct: 1107 WRQAVM--EEFKASLYSVLKL 1125
>gi|383863051|ref|XP_003706996.1| PREDICTED: uncharacterized protein LOC100875915 [Megachile rotundata]
Length = 3343
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 3123 CKMCLKTLNKHG-KVEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 3181
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 119
D +K +FC CD YH YC + V G + C + C +C S PG S R
Sbjct: 3182 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCASKEPGGINSDRNS 3239
Query: 120 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 160
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 3240 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 3298
Query: 161 CQRWVHCQC 169
C +++H C
Sbjct: 3299 CDKYLHLDC 3307
>gi|307199377|gb|EFN80002.1| Supporter of activation of yellow protein [Harpegnathos saltator]
Length = 1532
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 77/188 (40%), Gaps = 23/188 (12%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 1311 CKMCLKVLNKH-NKTEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 1369
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 119
D +K +FC CD YH YC + V G + C + C +CGS PG S R
Sbjct: 1370 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREPGGANSDRNS 1427
Query: 120 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVY---RDSESTPMVCCDVC 161
W Y T C C +L+ KG YCP C + + R +V C C
Sbjct: 1428 VAQWQHEYKKGEKNTRVYVSTLCIPCSKLWRKGRYCPHCSRCHTAQRLDLEPNLVHCSAC 1487
Query: 162 QRWVHCQC 169
+++H C
Sbjct: 1488 DKYLHLDC 1495
>gi|11072099|gb|AAG26335.2| MLL protein [Homo sapiens]
Length = 380
Score = 86.3 bits (212), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 95/200 (47%), Gaps = 15/200 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRR 62
+CF+ + G + C+ C + +H+ CL+ R L +W C C+ C +C R
Sbjct: 123 VCFLCASSG---HVEFVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGR 177
Query: 63 TGDPNKFMF-CRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
K + C +C +YH C +P ++C K +C SCGS PG G
Sbjct: 178 QHRATKQLLECNKCRNSYHLECLGPTYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDA 237
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY 177
+W ++ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C +S +
Sbjct: 238 QWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCGNLSGTED 297
Query: 178 LQFQVDGNL----QYRCPTC 193
+++ NL Y C C
Sbjct: 298 EMYEILSNLPESVAYTCVNC 317
>gi|307177781|gb|EFN66778.1| Supporter of activation of yellow protein [Camponotus floridanus]
Length = 3066
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 79/189 (41%), Gaps = 25/189 (13%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 2845 CKMCLKVLNKH-NKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2903
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 119
D +K +FC CD YH YC + V G + C + C +CGS G S R
Sbjct: 2904 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREAGGANSDRNS 2961
Query: 120 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 160
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 2962 VAQWQHEYKKGEKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAQRLDLEAN-LVHCSA 3020
Query: 161 CQRWVHCQC 169
C +++H +C
Sbjct: 3021 CDKYLHLEC 3029
>gi|357614029|gb|EHJ68865.1| putative PHD finger protein 10 [Danaus plexippus]
Length = 2413
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 81/192 (42%), Gaps = 23/192 (11%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
R L C +C K H +C++ SW+C C+ C C R D +K +FC CD
Sbjct: 2163 RFLVCSNCNAKLHPSCVELGPDTIRKCREYSWQCAECKTCCACSRPADDDKMLFCDLCDR 2222
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN--------------GLSVRWFLG 123
+H YC V +G + C + + C SCGS G G
Sbjct: 2223 GFHIYCVG--LHTVPNGRWHCVECSVCKSCGSRSAGGPGGGPGDWHHQTRRGPGGHKLYS 2280
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSEST-PMVCCDVCQRWVHCQCDGISDEKYLQFQV 182
++ C C R + G YCP+C + + + T +V C +C R +H +C + + +V
Sbjct: 2281 HSLCTPCARAYRIGRYCPLCDRSFIGPKGTMQLVICKLCDRQLHQEC---VKQTTSRLKV 2337
Query: 183 DGNLQYRCPTCR 194
L Y C CR
Sbjct: 2338 ---LDYTCGECR 2346
>gi|256078227|ref|XP_002575398.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
gi|353230389|emb|CCD76560.1| putative mixed-lineage leukemia protein, mll [Schistosoma mansoni]
Length = 3002
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 75/162 (46%), Gaps = 10/162 (6%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
++L C SC + +H C++ + R H+ C +C C++CRR P + C RC
Sbjct: 965 QLLFCVSCAEPFHFYCVERQFRPRRKEHFV---CRNCTTCKVCRR---PASDLRCIRCST 1018
Query: 78 AYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 135
YH C PP K G + CP+ + C CG W C AC +
Sbjct: 1019 GYHPECLADFPPAKTSQRGCWTCPECSVCLHCGVRA-CKPAETTWSYETNKCAACSQAES 1077
Query: 136 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 177
KG+ CP C + Y + + M+ CD CQ W+H C ++ ++Y
Sbjct: 1078 KGDVCPECDRAYLPT-TKQMIQCDSCQLWMHRTCTKLTVDEY 1118
>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
occidentalis]
Length = 2686
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 81/180 (45%), Gaps = 15/180 (8%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDL--FHWSSWKCPSCRICEICR 61
LCFV + + + C C + YH C+ Q RD+ + +W CP C+ C C
Sbjct: 869 LCFVCASS--VKHNPAVYCAMCCEPYHTFCV----QIRDVENINTMNWLCPKCQACFECG 922
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSS---GPYLCPKHTKCHSCGSNVPGNGLSV 118
+ N+ + C C YH C P + + S + C K +C SC +N +
Sbjct: 923 KRDKRNQMLKCNSCREVYHTECIPPVYPSKPSKRKNIWTCIKCIRCKSCETNA---RVMS 979
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVY-RDSESTPMVCCDVCQRWVHCQCDGISDEKY 177
W C C L +GNYCP+C K Y D M+ C C++WVH C+ ++ E+Y
Sbjct: 980 GWNFDLQLCTECVSLRQRGNYCPLCEKCYDADDYDINMIECARCRKWVHASCEELTAEEY 1039
>gi|328785896|ref|XP_003250672.1| PREDICTED: hypothetical protein LOC725681 [Apis mellifera]
Length = 2891
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 2671 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2729
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 119
D +K +FC CD YH YC + V G + C + C +C S PG S R
Sbjct: 2730 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCSSREPGGINSDRNS 2787
Query: 120 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 160
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 2788 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 2846
Query: 161 CQRWVHCQC 169
C +++H C
Sbjct: 2847 CDKYLHLGC 2855
>gi|380029720|ref|XP_003698514.1| PREDICTED: uncharacterized protein LOC100870597 [Apis florea]
Length = 3312
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 25/189 (13%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 3092 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 3150
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 119
D +K +FC CD YH YC + V G + C + C +C S PG S R
Sbjct: 3151 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCSSREPGGINSDRNS 3208
Query: 120 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 160
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 3209 VAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEAN-LVHCSA 3267
Query: 161 CQRWVHCQC 169
C +++H C
Sbjct: 3268 CDKYLHLGC 3276
>gi|444515379|gb|ELV10878.1| Histone-lysine N-methyltransferase MLL2 [Tupaia chinensis]
Length = 3975
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 314 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARK---RAGWQCPECKVCQACR 365
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP + + + + C C +CG + WF
Sbjct: 366 KPGNDSKMLVCEMCDKGYHTFCLKPPMEELPAHSWKCKACRVCRACGVGSAELDPNSEWF 425
Query: 122 LGYTCCDAC 130
Y+ C C
Sbjct: 426 ENYSLCHRC 434
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 59/141 (41%), Gaps = 17/141 (12%)
Query: 54 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 113
C +C R + + C +C YH YC + VSSG C C CG+ PG
Sbjct: 1038 CVVCGSFGRGAE-GHLLACSQCSQCYHPYCVN---SKVSSGLKRC---VSCMQCGAASPG 1090
Query: 114 NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGIS 173
W YT C C L CP+C Y + + ++ C C+RW+H C+ +
Sbjct: 1091 --FHCEWQNSYTHCGPCASLVT----CPICHAPYVEEDL--LIQCRHCERWMHAGCESLF 1142
Query: 174 DEKYLQFQVDGNLQYRCPTCR 194
E ++ D + C +C+
Sbjct: 1143 TEDDVEQAADEG--FDCVSCQ 1161
>gi|332019339|gb|EGI59845.1| PHD finger protein 10 [Acromyrmex echinatior]
Length = 1472
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 79/189 (41%), Gaps = 25/189 (13%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 1252 CKMCLKVLNKH-NKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKTCVQCH 1310
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 119
D +K +FC CD YH YC + V G + C + C +CGS G S R
Sbjct: 1311 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCANCGSREAGGANSDRNS 1368
Query: 120 ---WFLGY------------TCCDACGRLFVKGNYCPVCLKVYR----DSESTPMVCCDV 160
W Y T C C +L+ KG YCP C + + D E+ +V C
Sbjct: 1369 VAQWQHEYKKGEKNTRIYVSTLCVPCSKLWRKGRYCPHCSRCHTAQRLDLEAN-LVHCSA 1427
Query: 161 CQRWVHCQC 169
C +++H +C
Sbjct: 1428 CDKYLHLEC 1436
>gi|405951732|gb|EKC19620.1| Histone-lysine N-methyltransferase MLL4 [Crassostrea gigas]
Length = 4493
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 13/139 (9%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+C LC G ML C C + +H CL+ + ++ H +W C C+ C++C
Sbjct: 945 ICFLC------GSAGKHEMLYCNVCCEPFHEFCLEEEERPHEI-HSDNWCCKKCQSCQVC 997
Query: 61 RRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 117
G N + C +C YH C +P + ++C K KC SCG+ PG+GL+
Sbjct: 998 ---GRQNNLLQCDKCQNTYHPECLGPNYPTKPSKKKNIWICTKCVKCKSCGATTPGSGLA 1054
Query: 118 VRWFLGYTCCDACGRLFVK 136
W + C CG+L K
Sbjct: 1055 ATWMYDFQLCYECGQLMDK 1073
>gi|124111214|gb|ABM91995.1| MLL [Pan troglodytes]
Length = 390
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 68 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 124
+ + C +C +YH C P + + ++C K +C SCGS PG G +W +
Sbjct: 1 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 60
Query: 125 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 182
+ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + +
Sbjct: 61 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 120
Query: 183 DGNLQYRCPTC 193
++ Y C C
Sbjct: 121 PESVAYTCVNC 131
>gi|395755470|ref|XP_003779950.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like, partial
[Pongo abelii]
Length = 225
Score = 82.8 bits (203), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 71/147 (48%), Gaps = 10/147 (6%)
Query: 49 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC- 107
W+C C +CE C + DP + + C CD +YH YC PP + V G + C +++C C
Sbjct: 10 WRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWYSRCVWCR 69
Query: 108 GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHC 167
GL W YT C C L + CPVC + YR+ ++ C C RW+H
Sbjct: 70 HCGATSAGLRCEWQNNYTQCAPCASL----SSCPVCYRNYREDL---ILQCRQCDRWMHA 122
Query: 168 QCDGISDEKYLQFQVDGNLQYRCPTCR 194
C ++ E+ ++ D + + C CR
Sbjct: 123 VCQNLNTEEEVENVAD--IGFDCSMCR 147
>gi|326431279|gb|EGD76849.1| hypothetical protein PTSG_12693 [Salpingoeca sp. ATCC 50818]
Length = 3557
Score = 82.8 bits (203), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 77/183 (42%), Gaps = 17/183 (9%)
Query: 19 MLSCKSCGKKYHRNCLKN------WAQNRDLFHWSSWKCPSCRICEICRRTGDPN--KFM 70
ML C +C +++H C +A N + + W C C C C T M
Sbjct: 783 MLLCMACARRFHPRCAGMQPSSPLFAANEESVR-TCWLCADCTTCSRCDSTASARSRARM 841
Query: 71 FCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDAC 130
CR C +H C P + S Y C +C C N +S W + C C
Sbjct: 842 ACRVCGKEFHPGCAGLPKRPPS---YCCEDCRRCLDCHRT--ANEVS-SWSATFDYCTPC 895
Query: 131 GRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRC 190
+L +GN CPVC YRD ++ MV C+ C WVH +C+G+ D + + Y C
Sbjct: 896 SQLRKRGNVCPVCKVSYRD-DTPDMVLCETCDTWVHAECEGLDDAGLRRLGATDD-PYHC 953
Query: 191 PTC 193
TC
Sbjct: 954 STC 956
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 86/213 (40%), Gaps = 33/213 (15%)
Query: 1 MCRLCF---VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN--RDLFHW--SSWKCPS 53
MC LC VG ++ M+ C +C + +H CL + R +H+ S ++CP
Sbjct: 1038 MCTLCHSAGVGGDDDL-----MVFCDTCCEAFHLGCLLDTVPTTYRHAWHYDFSKFRCPR 1092
Query: 54 CRICEICRRTGDPN------------KFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKH 101
C C+ C R P+ + + C RC H C P H ++ + ++CP+
Sbjct: 1093 CLQCQCCARALAPDLVIGRIINMDTAEHVVCSRCQTICHTECIPPSHPSMDTKHWVCPEC 1152
Query: 102 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDV 160
KC C R+ + T L +CPVC + S + C+
Sbjct: 1153 IKCSMCRRWQAARSSDWRYCMNCT-------LAQDTQHCPVCKDEFEIKLSLQYILQCNA 1205
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 193
C+ VH CD + + + + N Y CPTC
Sbjct: 1206 CKGLVHDHCDPLIKSRRTRKKYIEN-GYECPTC 1237
>gi|403360488|gb|EJY79922.1| Histone-lysine N-methyltransferase [Oxytricha trifallax]
Length = 2438
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
MC LC N LSC CG+ +H CL+ +++ + WKC +C+ CEIC
Sbjct: 1208 MCYLCGSFGN-----GEDFLSCTLCGESFHTYCLQ-LPEDQVSKYQQYWKCLNCKFCEIC 1261
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS--------NVP 112
++C CD A+H +C P K++ + + C + KC CG+ N
Sbjct: 1262 ASATQEAFLLYCDVCDKAFHSFCLKPQLKSIPNCQWKCQECFKCQQCGTKEFFSAKDNEE 1321
Query: 113 GNGLSVRWF---LGYTCCDACGRLFVKGNYCPVCLKVYRDSES 152
L V F ++ C CG+ K ++C +C K DS+S
Sbjct: 1322 KRNLEVTDFEFSQNFSFCYQCGKNEYKKSFCKICQKKTEDSDS 1364
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 93/227 (40%), Gaps = 32/227 (14%)
Query: 20 LSCKSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
L C +C K YH CLK N N L + W+C C C C + + + C++C+A
Sbjct: 579 LKCSNCSKNYHLQCLKLPNIHSNLGL-RETDWRCQDCIRCTNCLSLRNRDSMLICQKCNA 637
Query: 78 AYHCYC------QHPP----HKNVSSGP-------------YLCPKHTKCHSCGSNVPGN 114
+H C Q P K++ G Y C + +C +CGS G
Sbjct: 638 GFHYDCLDQSVKQGVPSISNEKDLKLGIETRKTTYSQLNQFYKCEQCVECENCGSKEAGE 697
Query: 115 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDS----ESTPMVCCDVCQRWVHCQCD 170
+ +W Y C C + +C VC K + D+ + + C C VH CD
Sbjct: 698 KRNNKWSKDYKLCTTCNKKRSNKEFCLVCEKFWPDTKEKQDQLQTIQCVQCLMCVHQDCD 757
Query: 171 GI-SDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDM 216
I + LQ DG+L+Y C CR ++R + + + +L M
Sbjct: 758 RIFKNPNVLQQFSDGSLKYNCQKCRQNV-RLRFISEIIEKLASEDKM 803
>gi|326436266|gb|EGD81836.1| hypothetical protein PTSG_02551 [Salpingoeca sp. ATCC 50818]
Length = 830
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 79/167 (47%), Gaps = 21/167 (12%)
Query: 41 RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
+D+ ++W +CR+ E C T D +FC CD +H YC ++ G ++C
Sbjct: 658 KDIMTGTAW---NCRVFEKCGSTKDEKDILFCDECDRGFHTYCTG--LTSLPRGRWICSH 712
Query: 101 HTKCHSCGSNVPGNGLSVRW------------FLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ C C P + + +W FL T CDAC + + G++CP+CL+++
Sbjct: 713 CSVCDGCNFK-PDDPSTYKWSHFTDRRDNQRRFL-KTFCDACFKKWNTGDFCPICLELF- 769
Query: 149 DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 195
S ++ C +C R VH C+ ++ E F+ G ++ C C G
Sbjct: 770 -EPSADLLECSLCSRLVHKDCEDLTPEDEETFKAQGWGRFACSICSG 815
>gi|449679774|ref|XP_002159618.2| PREDICTED: uncharacterized protein LOC100210554 [Hydra
magnipapillata]
Length = 1172
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 89/211 (42%), Gaps = 43/211 (20%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M+ C C + +H CL + + +W C C+ C +C G + C C
Sbjct: 1 MIFCSLCCEPFHTFCL-----DVEPVFKKNWYCDRCKYCTVC---GMKESLLMCDICHDC 52
Query: 79 YHCYCQHPPHKNVSSG----------------------PYLCPKH----TKCHSC----- 107
YH C P ++ + G Y+ + +C C
Sbjct: 53 YHAECLGPFYQCETEGEDEIWFRWNWLQEKDYNDEHYSAYITKINKAGAVQCTICNKDFY 112
Query: 108 -GSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR-DSESTPMVCCDVCQRWV 165
GS V N S RW +T C+ C + KGNYCPVCL +Y D ++ M+ C C+ WV
Sbjct: 113 YGSEVTMNK-SSRWMQNFTMCEKCDFNWKKGNYCPVCLVLYNDDDDALKMMFCTKCEAWV 171
Query: 166 HCQCDGISDEKY-LQFQVDGNLQYRCPTCRG 195
H +C+GI+++ Y + + ++ Y C C G
Sbjct: 172 HMECEGITEDDYEILADLPDDVPYLCKLCTG 202
>gi|196016261|ref|XP_002117984.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
gi|190579457|gb|EDV19552.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
Length = 183
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 10/146 (6%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+++C CG+ YH C+ + + W+C C +CE C + GD ++ + C CD +
Sbjct: 46 LIACFQCGQSYHHYCVSAKLTRSVIV--NGWRCLDCAVCEGCGKAGDEDRLLLCDECDIS 103
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 138
YH YC +P V G + C + CH CGSN P G++ +W YT C C V
Sbjct: 104 YHTYCLNPQLDKVPEGEWKCHRCVSCHDCGSNFP--GINCQWTSNYTQCGPCASTSV--- 158
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRW 164
CP C K Y + + +V C+ C R+
Sbjct: 159 -CPKCNKKYVNDD--IIVQCNNCDRY 181
>gi|330804473|ref|XP_003290219.1| hypothetical protein DICPUDRAFT_15851 [Dictyostelium purpureum]
gi|325079683|gb|EGC33272.1| hypothetical protein DICPUDRAFT_15851 [Dictyostelium purpureum]
Length = 630
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 83/190 (43%), Gaps = 27/190 (14%)
Query: 19 MLSCKSCGKKYHRNCLK------NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFC 72
++ C C KK+H CL + +N L +WKC C+ CE+C+ D +K + C
Sbjct: 416 LIKCSECQKKFHPQCLGLHQTCVDSIRNNTL----AWKCTDCKNCEVCQNDVDESKIIIC 471
Query: 73 RRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN--GLSVRWFLGYTCCDAC 130
CD +H YC +PP + SG + C C C P N +++W YT CD+C
Sbjct: 472 DVCDKGFHTYCLNPPLSSPPSGGWRCSNCVFCTHCYIR-PENSENKNIKWKDNYTSCDSC 530
Query: 131 GRLFVKG-----NYCPVCLKVYRDSEST--PMVCCDVCQRWVHCQCDGISDEKYLQFQVD 183
F KG YC +C + E C C + VH CD I + +
Sbjct: 531 ---FSKGFSDKSKYCSICRHSLKSEEEEEDSTTQCTYCGKLVHDDCDSIITDNL----EN 583
Query: 184 GNLQYRCPTC 193
+ Y+CP C
Sbjct: 584 EHFIYKCPGC 593
>gi|68070837|ref|XP_677332.1| SET-domain protein [Plasmodium berghei strain ANKA]
gi|56497407|emb|CAH97187.1| SET-domain protein, putative [Plasmodium berghei]
Length = 1439
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 10/159 (6%)
Query: 45 HWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 97
H+ + C C C C ++T + ++ C+ C+ H C +P ++ +
Sbjct: 969 HYKKYICKECYRCIYCCESIYNYKQTPNVANYVICKSCNMVAHGSCCYPNVPDIYLFNWK 1028
Query: 98 CPKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 156
C KC C SN+ + W L CC C + + K N+C +C + Y+ +S V
Sbjct: 1029 CDDCLKCSKCDYSNLCFINYN-EWELHLDCCINCYKEYEKKNFCIICNEKYKVDDSNKWV 1087
Query: 157 CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 195
CDVC+ W+H CD D + ++ Y+CPTCR
Sbjct: 1088 ECDVCKFWIHLSCDKDEDRNIETLAI-KHINYKCPTCRS 1125
>gi|313212234|emb|CBY36242.1| unnamed protein product [Oikopleura dioica]
Length = 906
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 75/176 (42%), Gaps = 14/176 (7%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C +CG H CL + W+C +C+IC+ CR++ D + + C CD
Sbjct: 174 LLFCTTCGAHSHARCLNEGIIVTGEVR-AGWQCYTCKICQQCRKSDDDAQMIICETCDKG 232
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGN 138
+H YC +P +V + C C C GN + + W Y+ C C
Sbjct: 233 WHTYCLNPVMDSVPKDGWSCTNCRNCIEC-----GNKIHLEWQNDYSTCPVCW----SKQ 283
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
C VC K Y++ + ++ C C RW H + + E + N ++C CR
Sbjct: 284 ACSVCNKDYKEDDV--LLKCSECLRWQHALHEQVYSESDADSMAEQN--FKCKLCR 335
>gi|332839578|ref|XP_003313789.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Pan
troglodytes]
Length = 429
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 63/137 (45%), Gaps = 10/137 (7%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+ C SCG YH CL R + W+CP C++C+ CR+ G+ +K + C CD
Sbjct: 293 LFFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQACRKPGNDSKMLVCETCDKG 349
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL----- 133
YH +C PP + + + + C C +CG+ + WF Y+ C C +
Sbjct: 350 YHTFCLKPPMEELPAHSWKCKACRVCRACGAGSAELNPNSEWFENYSLCHRCHKAQGGQP 409
Query: 134 --FVKGNYCPVCLKVYR 148
V + PVC + +R
Sbjct: 410 IRSVAEQHTPVCSRAWR 426
>gi|149032108|gb|EDL87020.1| rCG50635 [Rattus norvegicus]
Length = 609
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 73/156 (46%), Gaps = 15/156 (9%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C +C G E C+ + C SCG YH CL R + W+CP C++C+ CR
Sbjct: 229 CAVC-EGPGELCD----LFFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQACR 280
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+ G+ +K + C CD YH +C PP +++ + + C C +CG+ + WF
Sbjct: 281 KPGNDSKMLVCETCDKGYHTFCLKPPIEDLPAHSWKCKTCRICRACGAGSADLNPNSEWF 340
Query: 122 LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 157
Y+ C C + V+G+ V +E P VC
Sbjct: 341 ENYSLCHRCHK--VQGSQ-----PVISVAEQHPAVC 369
>gi|82596471|ref|XP_726275.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23481616|gb|EAA17840.1| Bromodomain, putative [Plasmodium yoelii yoelii]
Length = 4805
Score = 77.8 bits (190), Expect = 2e-11, Method: Composition-based stats.
Identities = 44/158 (27%), Positives = 71/158 (44%), Gaps = 10/158 (6%)
Query: 45 HWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 97
H+ + C C C C ++T + ++ C+ C+ H C P ++ +
Sbjct: 1337 HYKKYICKECYRCIYCCESIYNYKQTPNVANYVICKSCNMVAHGSCCFPNVPDIYLFNWK 1396
Query: 98 CPKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 156
C KC C SN+ + W L CC C + + K N+C +C + Y+ +S V
Sbjct: 1397 CDDCLKCSKCDYSNLCFINYN-EWELHLDCCINCYKEYEKKNFCIICNEKYKVDDSNKWV 1455
Query: 157 CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
CDVC+ W+H CD D + ++ Y+CPTCR
Sbjct: 1456 ECDVCKFWIHLSCDKDEDRNIETLAIK-HINYKCPTCR 1492
>gi|148672214|gb|EDL04161.1| mCG145001 [Mus musculus]
Length = 630
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 61/126 (48%), Gaps = 5/126 (3%)
Query: 5 CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 64
C V E G + +L C SCG YH CL R + W+CP C++C+ CR+ G
Sbjct: 250 CAVCEGPG--QLCDLLFCTSCGHHYHGACLDTALTARKR---AGWQCPECKVCQSCRKPG 304
Query: 65 DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGY 124
+ +K + C CD YH +C PP +++ + + C C +CG+ + WF Y
Sbjct: 305 NDSKMLVCETCDKGYHTFCLKPPMEDLPAHSWKCKTCRLCRACGAGSAELNPNSEWFENY 364
Query: 125 TCCDAC 130
+ C C
Sbjct: 365 SLCHRC 370
>gi|156374109|ref|XP_001629651.1| predicted protein [Nematostella vectensis]
gi|156216656|gb|EDO37588.1| predicted protein [Nematostella vectensis]
Length = 251
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 78/177 (44%), Gaps = 12/177 (6%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
+++ C CG+ +H C+ + + W+C C +CE C + D + + C CD
Sbjct: 15 QLIVCSQCGQCFHPYCVG--VKVNKMILSKGWRCLDCTLCEGCGKGSDEARLLLCDSCDI 72
Query: 78 AYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKG 137
+YH YC PP + V G + C C CG+ G W YT C C
Sbjct: 73 SYHTYCLDPPLEKVPPGGWKCKWCVSCDDCGATSAGT--QCEWQSNYTQCGPC----ASK 126
Query: 138 NYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
CPVC Y ++ M+ C C RW+H CDG+ E+ + D Y+C CR
Sbjct: 127 TSCPVCNIKYNLNDL--MIQCLHCDRWLHGSCDGLMTEEEVDRAAD--YGYQCLYCR 179
>gi|328716146|ref|XP_003245847.1| PREDICTED: hypothetical protein LOC100162709 isoform 2 [Acyrthosiphon
pisum]
Length = 1426
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 62/148 (41%), Gaps = 4/148 (2%)
Query: 2 CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C+LC + ++ C C YH CL + +W+C C+ C C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
+ D +K +FC CD YH YC + V G + C + C SCG + PG G S +W
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPGPGDS-KW 1398
Query: 121 FLGYTCCDACGRLFVKGNYCPVCLKVYR 148
F + + G C C ++++
Sbjct: 1399 FYEFKKTEKTGSKVYCRTLCAPCSRMHQ 1426
>gi|388854432|emb|CCF52016.1| related to histone acetyltransferase 3 (myst) [Ustilago hordei]
Length = 1215
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 7/104 (6%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN----RDLFHWSSWKCPSCR 55
+C C E+ + + ++SC CG H +CLK W +N R + W+C C+
Sbjct: 83 VCAFCLRTAEHPKGDTPKLLVSCHECGSSGHPSCLK-WGRNPTKVRQALSYD-WRCIECK 140
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
CE+CR GD + MFC +CD +H YC PP G + CP
Sbjct: 141 KCEVCRDKGDDAQLMFCDKCDRGWHLYCLSPPLSKPPKGQWHCP 184
>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
Length = 4664
Score = 75.9 bits (185), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 76/196 (38%), Gaps = 59/196 (30%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG R+L+C CG+ YH C+ + + W+C C +CE
Sbjct: 735 MCVVCGSFGQGAEG-----RLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCE 787
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
C + DP + + C CD +YH YC PP + V G
Sbjct: 788 ACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKG------------------------ 823
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYL 178
G+ C K YR+ + ++ C C RW+H C +S E+ +
Sbjct: 824 ----GWKC------------------KCYREEDL--ILQCRQCDRWMHAVCQNLSTEEEV 859
Query: 179 QFQVDGNLQYRCPTCR 194
+ D + + C CR
Sbjct: 860 ENVAD--IGFDCSMCR 873
>gi|74150118|dbj|BAE24369.1| unnamed protein product [Mus musculus]
Length = 742
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 54/115 (46%), Gaps = 9/115 (7%)
Query: 1 MCRLC--FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
MC +C F EG +L+C C + YH C+ + L W+C C +CE
Sbjct: 633 MCVVCGSFGRGAEG-----HLLACSQCSQCYHPYCVNSKITKVMLLK--GWRCVECIVCE 685
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 113
+C + DP++ + C CD +YH YC PP V G + C C CG+ PG
Sbjct: 686 VCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPG 740
>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 6753
Score = 75.1 bits (183), Expect = 1e-10, Method: Composition-based stats.
Identities = 46/178 (25%), Positives = 78/178 (43%), Gaps = 8/178 (4%)
Query: 23 KSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC-------RRTGDPNKFMFCRRC 75
KS KKY R + + + C C C C ++T + ++ C+ C
Sbjct: 1640 KSKNKKYRRCINYIPSVEHSDITYKKFICKDCYRCIYCCESIYDYKQTPNVANYVICKNC 1699
Query: 76 DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 135
+ H C P ++ + C KC+ C + W L CC C + +
Sbjct: 1700 NMVAHGSCCFPNVPDIYLFNWKCDDCLKCNKCNYSNLCYINYNEWELHLDCCINCYKEYE 1759
Query: 136 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 193
K N+C +C + Y + +S V CDVC+ W+H CD ++ + ++ + N+ Y+CPTC
Sbjct: 1760 KKNFCIMCNEKYDEDDSKKWVQCDVCKFWIHLSCDK-NESRNIETLSNKNIDYKCPTC 1816
>gi|358252877|dbj|GAA50314.1| histone-lysine N-methyltransferase MLL4 [Clonorchis sinensis]
Length = 1769
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 77/180 (42%), Gaps = 27/180 (15%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
++L C SC + +H C++ + R H+ CR C C+ P + C RC A
Sbjct: 497 QLLFCVSCAEPFHFYCVERQFRPRRKDHFI------CRNCTECKECHSPAADLRCIRCSA 550
Query: 78 AYH--CYCQHPPHKNVSSGPYLCPKHTKCHSCGS--------------NVPGNGLSVRWF 121
YH C + P ++V G ++CP C CGS P G R
Sbjct: 551 GYHPSCLSDYAPAQSVHRGNWVCPHCASCVHCGSKPLHKRQEAGTETGKPPNTGGCSRST 610
Query: 122 LGYTC----CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY 177
++ C AC +G+ CP C + Y + + M+ CD CQ W+H C ++ ++Y
Sbjct: 611 TAWSSEPNKCAACCSAEARGDICPECDRAYLPT-TKQMIQCDTCQLWMHRTCTKLTADEY 669
>gi|319411664|emb|CBQ73708.1| related to histone acetyltransferase 3 (myst) [Sporisorium
reilianum SRZ2]
Length = 1223
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 9/105 (8%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQN-----RDLFHWSSWKCPSC 54
+C C + + + ++SC CG H +CL+ W +N + L + W+C C
Sbjct: 87 LCAFCLQTADRPKGDTPKLLISCYECGSSGHPSCLR-WGRNPTKVGKALSY--DWRCIEC 143
Query: 55 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
+ CEICR GD + MFC RCD +H YC PP G + CP
Sbjct: 144 KKCEICRDKGDDAQLMFCDRCDRGWHLYCLSPPLLKPPKGQWHCP 188
>gi|221057732|ref|XP_002261374.1| SET-domain protein [Plasmodium knowlesi strain H]
gi|194247379|emb|CAQ40779.1| SET-domain protein, putative [Plasmodium knowlesi strain H]
Length = 6442
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 70/162 (43%), Gaps = 10/162 (6%)
Query: 46 WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ + C C C C ++T + ++ C+ C+ H C P ++ + C
Sbjct: 1582 YKKYVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1641
Query: 99 PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 157
+ KC+ C SN+ + W CC C + + K N+C +C + Y +S V
Sbjct: 1642 DECLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1700
Query: 158 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQ 199
CDVC+ W+H CD + + + Y+CPTCR +
Sbjct: 1701 CDVCKFWIHLSCDKNENRNIETLSIKS-INYKCPTCRSGSFH 1741
>gi|20977848|gb|AAM33377.1|AF492830_1 CDK6/MLL fusion protein [Homo sapiens]
Length = 182
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 5/131 (3%)
Query: 68 KFMFCRRCDAAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGY 124
+ + C +C +YH C P + + ++C K +C SCGS PG G +W +
Sbjct: 49 QLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDF 108
Query: 125 TCCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKY-LQFQV 182
+ C C +LF KGN+CP+C K Y D + + M+ C C RWVH +C+ +SDE Y + +
Sbjct: 109 SLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNL 168
Query: 183 DGNLQYRCPTC 193
++ Y C C
Sbjct: 169 PESVAYTCVNC 179
>gi|443696184|gb|ELT96955.1| hypothetical protein CAPTEDRAFT_106026 [Capitella teleta]
Length = 319
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 48/95 (50%), Gaps = 3/95 (3%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC V + GC L C SCG+ YH NCL Q + S W+CP C+IC+ CR+
Sbjct: 219 LCIVCDVPGC--ISDQLFCTSCGQHYHGNCLDPPVQVNPVVR-SGWQCPECKICQTCRQP 275
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
GD NK + C CD YH +C P + + C
Sbjct: 276 GDDNKMLVCDTCDKGYHIFCLRPVMTTIPKNGWKC 310
>gi|240952194|ref|XP_002399348.1| zinc finger protein, putative [Ixodes scapularis]
gi|215490554|gb|EEC00197.1| zinc finger protein, putative [Ixodes scapularis]
Length = 379
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 60/125 (48%), Gaps = 5/125 (4%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+C++C + + ++SC CGK H CL + W+C C++C IC
Sbjct: 232 VCKVC--NNVDASKEGEELISCSECGKVGHVTCLDILPEMAVAIKSYRWQCMECKMCNIC 289
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG-NGLSVR 119
T + K MFC RCD YH +C K+V +G ++C +C +CG PG G V+
Sbjct: 290 MATDNEEKMMFCDRCDRGYHSFCVG--MKSVPAGRWICRLCGRCATCGVASPGPEGPRVQ 347
Query: 120 WFLGY 124
W Y
Sbjct: 348 WHHEY 352
>gi|443696185|gb|ELT96956.1| hypothetical protein CAPTEDRAFT_106029, partial [Capitella teleta]
Length = 175
Score = 73.2 bits (178), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 70/148 (47%), Gaps = 8/148 (5%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC + G R++ C CG+ YH C N +R + W+C C +CE C R
Sbjct: 26 LCVSCGSLGANEESRLIVCSQCGQCYHPYC-ANVKLSRIILE-KGWRCLDCTVCEGCGRP 83
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
D ++ + C CD +YH YC PP ++V G + C C +CG+ PG + W
Sbjct: 84 HDESRLILCDECDISYHIYCLDPPLESVPRGTWKCKWCAICVTCGTTAPGTNCA--WQNN 141
Query: 124 YTCCDACGRLFVKGNYCPVCLKVYRDSE 151
Y+ C C + CP+C + Y++ +
Sbjct: 142 YSQCGPCYSTVM----CPLCYRSYKEDD 165
>gi|320163082|gb|EFW39981.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 660
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 74/190 (38%), Gaps = 23/190 (12%)
Query: 21 SCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYH 80
+C SC H +C + W+C +C+ C +C T + + + C CD YH
Sbjct: 196 ACASCNNGGHVSCFHMTKLMGETVQTYGWECSNCKSCAMCNSTENETEMLLCDVCDRGYH 255
Query: 81 CYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL-------SVRWFLGYTC------- 126
C K + G ++C C +C S + G L V W Y
Sbjct: 256 IQCID--LKTMPLGRWVCSLCNACTNCHSKLAGAILPRKEMDRPVDWLTFYATDVQSMKF 313
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C A + + YC +CLKV E + CD C R H CD L + G
Sbjct: 314 CSA-RSSWSQHEYCAICLKVNHPGEKLRLYKCDGCLRQTHPSCDK------LYKRPRGRT 366
Query: 187 QYRCPTCRGE 196
+Y CP CRG+
Sbjct: 367 RYFCPICRGD 376
>gi|389584530|dbj|GAB67262.1| SET domain containing protein [Plasmodium cynomolgi strain B]
Length = 5788
Score = 72.0 bits (175), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 73/162 (45%), Gaps = 10/162 (6%)
Query: 46 WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ + C C C C ++T + ++ C+ C+ H C P ++ + C
Sbjct: 1113 YKKYVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1172
Query: 99 PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 157
KC+ C SN+ + W CC C + + K N+C +C + Y +S V
Sbjct: 1173 DDCLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1231
Query: 158 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQ 199
CDVC+ W+H CD ++ + ++ ++ Y+CPTCR +
Sbjct: 1232 CDVCKFWIHLSCDK-NESRNIETLSIKSINYKCPTCRSGSFH 1272
>gi|428178234|gb|EKX47110.1| hypothetical protein GUITHDRAFT_137718 [Guillardia theta CCMP2712]
Length = 1125
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 46/94 (48%), Gaps = 1/94 (1%)
Query: 102 TKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC-CDV 160
KC+SCG+ PG W YT C CG L K YC VC KV ++ E+ + C V
Sbjct: 715 VKCNSCGAKTPGKKPMDAWKQAYTQCSRCGELHAKKRYCSVCDKVLKEIETEDQILKCSV 774
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
C WVH +CD + + + + + Y CP R
Sbjct: 775 CDLWVHGRCDDLDELSLVGMNLLSSEHYVCPKHR 808
>gi|156101223|ref|XP_001616305.1| SET domain containing protein [Plasmodium vivax Sal-1]
gi|148805179|gb|EDL46578.1| SET domain containing protein [Plasmodium vivax]
Length = 6587
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 69/162 (42%), Gaps = 10/162 (6%)
Query: 46 WSSWKCPSCRICEIC-------RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ + C C C C ++T + ++ C+ C+ H C P ++ + C
Sbjct: 1629 YKKFVCKDCYRCIYCCESIYNYKQTPNIANYVICKTCNMVAHGSCCFPNVPDIYLFNWKC 1688
Query: 99 PKHTKCHSCG-SNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 157
KC+ C SN+ + W CC C + + K N+C +C + Y +S V
Sbjct: 1689 DDCLKCNKCDYSNLCFINYN-EWEFHLDCCINCYKEYEKKNFCIMCNEKYEIDDSNKWVQ 1747
Query: 158 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQ 199
CDVC+ W+H CD + + + Y+CPTCR +
Sbjct: 1748 CDVCKFWIHLSCDKNENRNIETLSIKS-INYKCPTCRSGSFH 1788
>gi|149056303|gb|EDM07734.1| rCG53696 [Rattus norvegicus]
Length = 169
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 69/137 (50%), Gaps = 6/137 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMF-CRRCDA 77
++ C+ C +H CL+ A+ H +W C C+ C +C R G +K + C RC
Sbjct: 29 LVFCQVCCDPFHPFCLEE-AERPLPQHRDTWCCRRCKFCHVCGRKGRGSKHLLECERCRH 87
Query: 78 AYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLF 134
AYH C P + ++ ++C +C SCG+ PG V W Y+ C C L+
Sbjct: 88 AYHPACLGPSYPTRATRRRRHWICSACVRCKSCGAT-PGKNWDVEWSGDYSLCPRCTELY 146
Query: 135 VKGNYCPVCLKVYRDSE 151
KGNYCP+C + Y D++
Sbjct: 147 EKGNYCPICTRCYEDND 163
>gi|449670407|ref|XP_004207258.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Hydra magnipapillata]
Length = 491
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 42/81 (51%), Gaps = 1/81 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHW-SSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
+L CK CG K H +CL A+ + SW+C C+ C IC TGDP+ +FC CD
Sbjct: 209 LLICKDCGNKAHPSCLSYSAELVEQIRSDGSWQCIDCKACIICEGTGDPDTLLFCDACDK 268
Query: 78 AYHCYCQHPPHKNVSSGPYLC 98
YH C P + SG + C
Sbjct: 269 GYHMNCHEPKLTQMPSGKWAC 289
>gi|355702665|gb|AES02007.1| myeloid/lymphoid or mixed-lineage leukemia [Mustela putorius furo]
Length = 167
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 82/161 (50%), Gaps = 8/161 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLF-HWSSWKCPSCRICEICRRTGDPNKFMF-CRRCD 76
+ C+ C + +H+ CL+ R L +W C C+ C +C R K + C +C
Sbjct: 9 FVYCQVCCEPFHKFCLEE--NERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKCR 66
Query: 77 AAYHCYCQHPPHKNVSSGP---YLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRL 133
+YH C P + + ++C K +C SCGS PG G +W ++ C C +L
Sbjct: 67 NSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKL 126
Query: 134 FVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGIS 173
F KGN+CP+C K Y D + + M+ C C RWVH +C+ +S
Sbjct: 127 FAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLS 167
>gi|340719315|ref|XP_003398100.1| PREDICTED: hypothetical protein LOC100644567 [Bombus terrestris]
Length = 2857
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 63/153 (41%), Gaps = 9/153 (5%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C++C N+ + ++ C +C H +C+ +W+C C+ C C
Sbjct: 2697 CKMCLKTLNKHS-KNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCH 2755
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-- 119
D +K +FC CD YH YC + V G + C + C +CGS PG S R
Sbjct: 2756 DPADEDKMLFCDMCDRGYHIYCVG--LRRVPQGRWHCQECAVCVNCGSREPGGINSDRNS 2813
Query: 120 ---WFLGYTCCDACGRLFVKGNYCPVCLKVYRD 149
W Y D R++V C C K+ D
Sbjct: 2814 VAQWQHEYKKGDKNTRVYV-STLCVPCSKLRGD 2845
>gi|321261507|ref|XP_003195473.1| histone acetyltransferase [Cryptococcus gattii WM276]
gi|317461946|gb|ADV23686.1| Histone acetyltransferase, putative [Cryptococcus gattii WM276]
Length = 947
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
M+SC +CG+ H CL R W C C++CE C GD ++ MFC CD
Sbjct: 41 MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKMCEQCEIKGDDSRLMFCDTCDR 100
Query: 78 AYHCYCQHPPHKNVSSGPYLCPK 100
+H YC +PP G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123
>gi|156374107|ref|XP_001629650.1| predicted protein [Nematostella vectensis]
gi|156216655|gb|EDO37587.1| predicted protein [Nematostella vectensis]
Length = 265
Score = 69.3 bits (168), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 1/80 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
ML C SCG+ YH CL + L W+CP C++C+ CR+ GD NK + C CD
Sbjct: 187 MLFCTSCGRHYHGRCLDPAVEITSLVR-MGWQCPDCKVCQGCRQPGDDNKMLVCDVCDRG 245
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH +C PP + + C
Sbjct: 246 YHTFCLDPPMTTIPKTGWKC 265
>gi|328716148|ref|XP_003245848.1| PREDICTED: hypothetical protein LOC100162709 isoform 3 [Acyrthosiphon
pisum]
Length = 1397
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 48/113 (42%), Gaps = 3/113 (2%)
Query: 2 CRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C+LC + ++ C C YH CL + +W+C C+ C C
Sbjct: 1282 CKLCLGTADKNKIGSVEPLIHCSKCLTIYHPTCLDMTLEMVPYIKRYNWQCNECKSCAQC 1341
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 113
+ D +K +FC CD YH YC + V G + C + C SCG + PG
Sbjct: 1342 KEVADEDKMLFCDLCDRGYHIYCVG--LRRVPEGRWHCQECAMCSSCGVSDPG 1392
>gi|405122036|gb|AFR96804.1| Myst4 protein [Cryptococcus neoformans var. grubii H99]
Length = 943
Score = 68.9 bits (167), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
M+SC +CG+ H CL R W C C+ CE C GD ++ MFC CD
Sbjct: 41 MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCEIKGDDSRLMFCDTCDR 100
Query: 78 AYHCYCQHPPHKNVSSGPYLCPK 100
+H YC +PP G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123
>gi|58269200|ref|XP_571756.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57227992|gb|AAW44449.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 940
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
M+SC +CG+ H CL R W C C+ CE C GD ++ MFC CD
Sbjct: 41 MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCAIKGDDSRLMFCDTCDR 100
Query: 78 AYHCYCQHPPHKNVSSGPYLCPK 100
+H YC +PP G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123
>gi|134114447|ref|XP_774152.1| hypothetical protein CNBG4520 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256785|gb|EAL19505.1| hypothetical protein CNBG4520 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 940
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
M+SC +CG+ H CL R W C C+ CE C GD ++ MFC CD
Sbjct: 41 MVSCAACGRSGHPTCLNMLTPKLRKRVMMYDWHCIECKTCEQCAIKGDDSRLMFCDTCDR 100
Query: 78 AYHCYCQHPPHKNVSSGPYLCPK 100
+H YC +PP G + CPK
Sbjct: 101 GWHSYCLNPPLAKPPKGSWHCPK 123
>gi|308801407|ref|XP_003078017.1| trithorax-like (ISS) [Ostreococcus tauri]
gi|116056468|emb|CAL52757.1| trithorax-like (ISS) [Ostreococcus tauri]
Length = 2007
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 53/93 (56%), Gaps = 6/93 (6%)
Query: 104 CHSCGSNVPGN-GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 162
C SCG V G+ GL + G C C +L+ +G +CPVC +V++ + PMV CD C
Sbjct: 180 CGSCG--VTGDEGLKSKNAQGR--CSLCQKLYKEGQFCPVCDRVWQWATGDPMVGCDRCD 235
Query: 163 RWVHCQCDGISDEKYLQFQVDG-NLQYRCPTCR 194
W+H +CD ++ E + + DG L Y CP CR
Sbjct: 236 MWIHRECDALAAEVLDREENDGEELAYECPKCR 268
>gi|71018437|ref|XP_759449.1| hypothetical protein UM03302.1 [Ustilago maydis 521]
gi|46099056|gb|EAK84289.1| hypothetical protein UM03302.1 [Ustilago maydis 521]
Length = 1283
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 4/84 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWS---SWKCPSCRICEICRRTGDPNKFMFCRRC 75
++SC CG H +CLK W + H + +W+C C+ CE+C GD + MFC RC
Sbjct: 215 LISCYECGSSGHPSCLK-WGRKSTKVHKALSYNWRCIECKKCEVCDDKGDDAQLMFCDRC 273
Query: 76 DAAYHCYCQHPPHKNVSSGPYLCP 99
D +H YC P G + CP
Sbjct: 274 DRGWHLYCLTPALSKPPKGQWHCP 297
>gi|47225576|emb|CAG12059.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1216
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 2/83 (2%)
Query: 97 LCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPM 155
+C +C SCG PG + W C C +LF GNYCP+C K Y D++ + M
Sbjct: 1035 VCMTCIRCKSCGV-TPGKSWDIEWNHEKGLCQDCSKLFEMGNYCPICFKCYEDNDYDSQM 1093
Query: 156 VCCDVCQRWVHCQCDGISDEKYL 178
+ C C WVH +C+ +++ ++
Sbjct: 1094 MQCGTCNHWVHAKCEDLTEGSHV 1116
>gi|299749795|ref|XP_002911422.1| histone acetyltransferase mst2 [Coprinopsis cinerea okayama7#130]
gi|298408603|gb|EFI27928.1| histone acetyltransferase mst2 [Coprinopsis cinerea okayama7#130]
Length = 2272
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N E+ +M C CG+ H C++ A D+ W+C C+ICE+C R GD +F
Sbjct: 649 NVRTEQPEQMTHCIECGRSGHPTCMQ-LAHIGDVIRSYPWRCIECKICEVCSRKGDDVRF 707
Query: 70 --------------------MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 109
MFC CD +H YC +PP G + CP+ C
Sbjct: 708 VQLDLLGQTTDNPLLFQEKMMFCDSCDRGWHMYCLNPPMDETPPGKWSCPQ------CSP 761
Query: 110 NVPGNGLSV 118
P G+ +
Sbjct: 762 LFPEQGIPM 770
>gi|118394814|ref|XP_001029767.1| SET domain containing protein [Tetrahymena thermophila]
gi|89284034|gb|EAR82104.1| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 2437
Score = 67.8 bits (164), Expect = 2e-08, Method: Composition-based stats.
Identities = 49/179 (27%), Positives = 75/179 (41%), Gaps = 19/179 (10%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR-RTGDPNKFMFCRRCDAAYH 80
C CG YH C+ NR+ + C +C C IC + D N + C C +H
Sbjct: 603 CFYCGAYYHTQCILQEEDNRN----QHFACDTCEPCSICHGKITDDN--ITCCECKTHFH 656
Query: 81 CYCQHPPHKNVSSGP-----YLCPKHTKCHSCGSNVPG--NGLSVRWFLGYTCCDACGRL 133
C ++++ + C +C C + + NG S + Y C+ C
Sbjct: 657 KKCGFSIAYDMNTSDKQIMRWYCESCVQCCICNNKLSDFQNGYSFKDDQIY--CNDCNEQ 714
Query: 134 FVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI--SDEKYLQFQVDGNLQYRC 190
K YCP+C K + + MV C C W+H CD I D+ Y +++ + QYRC
Sbjct: 715 LQKKEYCPICKKFWSQETNKDMVQC-TCAMWIHRACDPILKDDKLYDEYKNNLRQQYRC 772
Score = 56.6 bits (135), Expect = 5e-05, Method: Composition-based stats.
Identities = 48/230 (20%), Positives = 86/230 (37%), Gaps = 65/230 (28%)
Query: 17 RRMLSCKSCGKKYHRNCL-------------KNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+ +L C SC + YH CL + NR+ W CP C++C++C +
Sbjct: 1131 KSLLFCTSCFESYHPYCLMIPGRQEYFKEKMERAMNNRE------WNCPKCQVCKVCSKG 1184
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPP----HKNVSSGPYLCPKHTKCHSCGSNVPGNG---- 115
+ K +FCR+CDA H C+ +++ + + C C C S +
Sbjct: 1185 PNITKNLFCRKCDAMVHFECEFKDVQVWNESKNELYWQCSDCFNCAKCSSKSLIDESDKQ 1244
Query: 116 --LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTP------------------- 154
+++ + ++ C CG L +C C K + + P
Sbjct: 1245 LMINLDFTDNFSLCYKCGFLDAYYRFCKFCNKYCKKIPTLPKPANQQPLDDLFQEKSVRL 1304
Query: 155 ----------MVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
+ C +CQ+ H +C + +Y+Q Q+ C TC+
Sbjct: 1305 KYLDQYLEEVLYQCKLCQQVYHKKCFEKAYPEYIQ-------QFFCYTCK 1347
>gi|409078326|gb|EKM78689.1| hypothetical protein AGABI1DRAFT_107193 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1494
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 1/81 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M++C CG+ H +C++ ++ D+ WKC C+ CE+C GD + +FC CD
Sbjct: 76 MVTCSECGRSGHPSCME-LSKIGDMIRTYPWKCIECKNCELCGDKGDDERILFCDGCDRG 134
Query: 79 YHCYCQHPPHKNVSSGPYLCP 99
+H C PP + G + CP
Sbjct: 135 WHFDCMQPPINELPEGEWYCP 155
>gi|196002948|ref|XP_002111341.1| hypothetical protein TRIADDRAFT_55230 [Trichoplax adhaerens]
gi|190585240|gb|EDV25308.1| hypothetical protein TRIADDRAFT_55230 [Trichoplax adhaerens]
Length = 879
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+LSC CG H +CLK + W+C C+ C C+ G+P+ +FC CD
Sbjct: 207 LLSCVDCGNSGHPSCLKYSPELTSRVKTEPWQCIECKTCSYCQNAGNPDNLLFCDACDKG 266
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
+H C PP + SG ++C
Sbjct: 267 FHMECLSPPLTGMPSGRWVC 286
>gi|390332561|ref|XP_003723529.1| PREDICTED: uncharacterized protein LOC580929 isoform 2
[Strongylocentrotus purpuratus]
Length = 3278
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 59/128 (46%), Gaps = 8/128 (6%)
Query: 1 MCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 57
+CR C G E C R + +LSC CG H +CLK + ++ S W+C C+ C
Sbjct: 209 VCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKTC 266
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGNG 115
IC + G + + C C+ YH C PP K + G + C P T K G PG
Sbjct: 267 AICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGRP 325
Query: 116 LSVRWFLG 123
+ R F G
Sbjct: 326 GTNRLFRG 333
>gi|390332559|ref|XP_003723528.1| PREDICTED: uncharacterized protein LOC580929 isoform 1
[Strongylocentrotus purpuratus]
Length = 3300
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 59/128 (46%), Gaps = 8/128 (6%)
Query: 1 MCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 57
+CR C G E C R + +LSC CG H +CLK + ++ S W+C C+ C
Sbjct: 209 VCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKTC 266
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGNG 115
IC + G + + C C+ YH C PP K + G + C P T K G PG
Sbjct: 267 AICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGRP 325
Query: 116 LSVRWFLG 123
+ R F G
Sbjct: 326 GTNRLFRG 333
>gi|390332557|ref|XP_786050.3| PREDICTED: uncharacterized protein LOC580929 isoform 3
[Strongylocentrotus purpuratus]
Length = 2843
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 59/128 (46%), Gaps = 8/128 (6%)
Query: 1 MCRLCFVGENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 57
+CR C G E C R + +LSC CG H +CLK + ++ S W+C C+ C
Sbjct: 209 VCRSCH-GTAE-CNREGKPEDLLSCAECGNSAHPSCLKYSSALKERIRLSRWQCAECKTC 266
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-PKHT-KCHSCGSNVPGNG 115
IC + G + + C C+ YH C PP K + G + C P T K G PG
Sbjct: 267 AICSQGGT-KELLVCDACNQGYHASCLKPPLKRIPKGCWRCKPCRTGKMPGMGRRGPGRP 325
Query: 116 LSVRWFLG 123
+ R F G
Sbjct: 326 GTNRLFRG 333
>gi|340381940|ref|XP_003389479.1| PREDICTED: hypothetical protein LOC100636799 [Amphimedon
queenslandica]
Length = 469
Score = 66.2 bits (160), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 78/188 (41%), Gaps = 37/188 (19%)
Query: 72 CRRCDAAYH--CYCQHPPHKNVSSG-PYLCPKHTKCHSCGSNVPGN-------------G 115
C C+ YH C H + + C TKC SC SN+P
Sbjct: 260 CCECNEWYHEDCLLGHTSRPRTDNKRVWKCVTCTKCISCHSNMPNKIDDSGRSLLSSLLA 319
Query: 116 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISD 174
+W GY C C L KGN CPVC + Y D++ + MV C C WVH C+ ++D
Sbjct: 320 TPTQWSDGYCYCSDCIVLKAKGNSCPVCGECYLDNDFDSKMVQCSQCDNWVHSHCENMTD 379
Query: 175 EKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKDLIASLRAAAGLPTE 234
E+Y + DL D+V + R ++ +I+S++ A G
Sbjct: 380 EEY--------------------EILSDLPDSVEYVCRLCIAYNRTMISSIKTALGPIWV 419
Query: 235 DEIFSISP 242
++I ++P
Sbjct: 420 NDINVLTP 427
>gi|299115811|emb|CBN74374.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 3157
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 74/195 (37%), Gaps = 47/195 (24%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWS----SWKCPSCRICEIC--------RRTGD 65
+ L C +C YH CL R++ + W+C C+ C+ C RR GD
Sbjct: 1085 QTLVCSTCLMHYHPGCLDPPMTPREIASHAHSKEEWRCDYCQTCQGCGKGNEDEMRRGGD 1144
Query: 66 PNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 125
P + C +H + P + GP K W
Sbjct: 1145 P---LVCHERGKVHHVHLDAP--EGCDPGPRAQEKI------------------WL---- 1177
Query: 126 CCDACGRLFVKGNYCPVCLKVYRDSES-TPMVCCDVCQRWVHCQCDGISDEKYLQFQVDG 184
C C + + NYCP C Y + ++ + C C+ WVH C+G+S +Y Q VDG
Sbjct: 1178 -CSPCLDKYKERNYCPKCGVTYDEHDNMVQAIGCAACEFWVHASCEGLSTAEY-QMLVDG 1235
Query: 185 -----NLQYRCPTCR 194
+Y CP CR
Sbjct: 1236 KDSWFGAEYLCPVCR 1250
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 56 ICEICRRTG--DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 107
+CE+C T D + + C CD AYH YC P G ++C + C +C
Sbjct: 1411 VCEVCTETAKSDESLLLMCELCDRAYHTYCLTPSTDKPPEGTWICGQCISCTTC 1464
>gi|340376023|ref|XP_003386533.1| PREDICTED: zinc finger protein ubi-d4-like [Amphimedon
queenslandica]
Length = 402
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 48/99 (48%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C + N+ + RM+SC CG+ H +CL+ + W+C C+ C +
Sbjct: 301 CDFCLGDDTLNQKSGKPERMVSCADCGRSGHPSCLQFSPSLAAVVLTYRWQCIECKSCSL 360
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C ++ + ++ +FC CD YH YC PP K G + C
Sbjct: 361 CGKSDNDDQLLFCDDCDRGYHMYCLKPPMKEAPEGSWSC 399
>gi|390349864|ref|XP_003727298.1| PREDICTED: uncharacterized protein LOC100893490 [Strongylocentrotus
purpuratus]
Length = 631
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L CK C K H +C+K Q + S W+C C+ C +C GD + +FC CD
Sbjct: 248 LLVCKDCTIKVHPSCMKYSRQLAERSRLSPWQCIDCKTCHVCNDAGDADTLLFCDSCDKG 307
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH C +P + G ++C
Sbjct: 308 YHMACHNPKVEEKPLGRWVC 327
>gi|356554670|ref|XP_003545667.1| PREDICTED: uncharacterized protein LOC100810450 [Glycine max]
Length = 832
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 40/74 (54%), Gaps = 4/74 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH +CL + Q + H W CPSC IC++C D NK + C CD AYH YC P
Sbjct: 683 KYYHVSCLSS-KQLKSYGH--CWYCPSC-ICQVCLTDKDDNKIVLCDACDHAYHVYCMKP 738
Query: 87 PHKNVSSGPYLCPK 100
P ++ G + C K
Sbjct: 739 PQNSIPKGKWFCIK 752
>gi|426199317|gb|EKV49242.1| hypothetical protein AGABI2DRAFT_177299 [Agaricus bisporus var.
bisporus H97]
Length = 1474
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 1/81 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M++C CG+ H +C++ ++ ++ WKC C+ CE+C GD + +FC CD
Sbjct: 76 MVTCSECGRSGHPSCME-LSKIGEMIRTYPWKCIECKNCELCGDKGDDERILFCDGCDRG 134
Query: 79 YHCYCQHPPHKNVSSGPYLCP 99
+H C PP + G + CP
Sbjct: 135 WHFDCMQPPINELPEGEWYCP 155
>gi|449543136|gb|EMD34113.1| hypothetical protein CERSUDRAFT_141605, partial [Ceriporiopsis
subvermispora B]
Length = 260
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M+SC CG+ H CL + A D+ WKC C+ CEIC D N+ MFC CD
Sbjct: 1 MVSCAECGRSAHPTCL-DLADIGDVMRSYDWKCMECKNCEICHSKEDDNRMMFCDFCDRG 59
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
+H C PP G + C
Sbjct: 60 WHMDCLDPPLSEAPPGKWHC 79
>gi|196010657|ref|XP_002115193.1| hypothetical protein TRIADDRAFT_28636 [Trichoplax adhaerens]
gi|190582576|gb|EDV22649.1| hypothetical protein TRIADDRAFT_28636 [Trichoplax adhaerens]
Length = 167
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 2/88 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C V N+ ++ R++SC CG+ H +CLK D +W+C C+ C I
Sbjct: 70 CDFCLGDVNLNKKTGQSERLISCADCGRSGHPSCLKFTPSLTDTVLMYAWQCIECKSCSI 129
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPP 87
C + + +K +FC CD YH YC PP
Sbjct: 130 CGTSDNDDKLLFCDDCDRGYHMYCLSPP 157
>gi|325192337|emb|CCA26782.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 2128
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 82/198 (41%), Gaps = 26/198 (13%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
LC V + G A +L C C + H C+ + + F +++ CP+C C +C T
Sbjct: 1170 LCVVCASAG--SASSLLFCMDCAQTVHTFCVLSDTSAWNKF--NAFHCPNCLQCRVCDET 1225
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLG 123
+ C +C H C PP + S+ C + +C C + S+ L
Sbjct: 1226 ---EGVLACPKCGKGAHGACLDPPIDDPSA--IYCGECVECKHCQTPGTPRTYSLVPDLC 1280
Query: 124 YTCCD-----ACGRLFVK--GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 176
CC R F K + CP+CL+ E+ + CD C+RW H CD ++
Sbjct: 1281 LQCCSRQERWKSARQFTKPLADKCPICLETCNVKEA---IHCDACERWTHSTCDPMAFR- 1336
Query: 177 YLQFQVDGNLQYRCPTCR 194
D + Y CP+CR
Sbjct: 1337 ------DKDALYVCPSCR 1348
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 60/140 (42%), Gaps = 29/140 (20%)
Query: 56 ICEICRR-TGDPNKFMF-CRRCDAAYHCYCQHPPHKNV--------------SSGPYLCP 99
IC CR + D N+ + C+RC+ +H C PP+ + S P+ C
Sbjct: 740 ICSGCRLPSMDSNQSLLSCQRCNKRFHATCTDPPYVDTITVWDPHDHSELDHISLPFTCA 799
Query: 100 KHTKCHSCGSNVPGNGLSVRWFLGYTC---CDACGRLFVKGNYCPVCLKVYRDSES-TP- 154
C C N N VRW L T C AC L+ +C +C YR E+ TP
Sbjct: 800 DCDVCAGCNQNRTENPW-VRWRLPLTIVSLCGACELLYRSDQFCSIC---YRALEALTPR 855
Query: 155 ----MVCCDVCQRWVHCQCD 170
++ C C+ +VH +C+
Sbjct: 856 KELSLLSCSSCRHFVHPECE 875
>gi|66359990|ref|XP_627173.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
gi|46228588|gb|EAK89458.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
Length = 2244
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 55/130 (42%), Gaps = 3/130 (2%)
Query: 68 KFMFCRRCDAAYHCYC--QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 125
+F+ C C YH C P + C KC CG G W ++
Sbjct: 538 EFVVCGTCGICYHGSCGNSFVPPLLFGGNNFNCSNCCKCIHCGYRDNGFMDYASWDSTFS 597
Query: 126 CCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN 185
C C + F +G +C +C K++ S + CD+C+ WVH CD +E ++F +
Sbjct: 598 SCIRCCKGFERGQFCSICRKIWTSSWEGEWLQCDICKFWVHYDCDKDLNEP-IEFYSNVK 656
Query: 186 LQYRCPTCRG 195
Y CP CR
Sbjct: 657 NLYNCPACRS 666
>gi|443897765|dbj|GAC75104.1| hypothetical protein PANT_14d00040 [Pseudozyma antarctica T-34]
Length = 1176
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 47/104 (45%), Gaps = 7/104 (6%)
Query: 1 MCRLCFVGENEGCERARRML-SCKSCGKKYHRNCLKNWAQN----RDLFHWSSWKCPSCR 55
+C C + E ++L SC CG H CL+ W + R + W+C C+
Sbjct: 91 LCAFCQQPADRPKENTPKLLISCFECGSSGHPACLR-WGRKPTKVRSALSYE-WRCIECK 148
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
CEIC GD + MFC CD +H YC PP G + CP
Sbjct: 149 KCEICCDKGDDAQLMFCDGCDRGWHLYCLSPPLAKPPKGQWQCP 192
>gi|281205681|gb|EFA79870.1| hypothetical protein PPL_06690 [Polysphondylium pallidum PN500]
Length = 486
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 11/109 (10%)
Query: 97 LCPKHTKCHSCGSNVPGNGLSVRWFLGYT---CCDACGRLFVKGNYCPVCLKVYRDSEST 153
+ P+ +C CGS PG G + +W G + C++CG +K CP+C K+Y ++ +
Sbjct: 200 MVPRVNRC-VCGSTTPGKGPTCKWRKGPSGEVLCNSCGLQNMKKPKCPLCGKMYNKNKDS 258
Query: 154 PMVC-------CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 195
C CD C +WV +CDGI D L Y CP CR
Sbjct: 259 STECDSDEWIRCDDCSQWVMTECDGIKDISLYDDTQPNPLHYSCPKCRN 307
>gi|156353196|ref|XP_001622960.1| predicted protein [Nematostella vectensis]
gi|156354354|ref|XP_001623361.1| predicted protein [Nematostella vectensis]
gi|156209598|gb|EDO30860.1| predicted protein [Nematostella vectensis]
gi|156210052|gb|EDO31261.1| predicted protein [Nematostella vectensis]
Length = 132
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 4/113 (3%)
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSG---PYLCPKHTKCHSCGSNVPGNGLSVRWFLGYT 125
+ C +C YH C P + V G ++C + +C CGS G W +T
Sbjct: 1 LLMCDKCQRGYHVDCLGPSYPVVPEGSEDTWICGRCAQCKLCGSKSAGEDPEAVWMHEFT 60
Query: 126 CCDACGRLFVKGNYCPVCLKVYRDSE-STPMVCCDVCQRWVHCQCDGISDEKY 177
C CG + GNYCP+C K Y D++ + M+ C+ CQ WVH C I+ ++Y
Sbjct: 61 HCYDCGTAWDNGNYCPICEKCYSDNDFDSKMMHCNDCQHWVHASCQNINPDEY 113
>gi|390350878|ref|XP_788653.3| PREDICTED: zinc finger protein DPF3-like [Strongylocentrotus
purpuratus]
Length = 418
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 46/99 (46%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ W+C C+ C +
Sbjct: 315 CDFCLGDATENKKTQTPEDLISCSDCGRSGHPTCLQFTDTMIQKVKGYRWQCIECKSCGL 374
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP + G ++C
Sbjct: 375 CGTSDNDDQLLFCDDCDRGYHMYCLNPPMQAPPEGSWIC 413
>gi|340381804|ref|XP_003389411.1| PREDICTED: hypothetical protein LOC100638610 [Amphimedon
queenslandica]
Length = 2366
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 1/100 (1%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C EN ++ ++LSC CG H +CLK + + W C C+ C
Sbjct: 197 ICSFCLGTEENNRDKQYEQLLSCHECGNSGHPSCLKYSKELVEFITAEPWLCLECKKCIY 256
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
C + + + + C CD +H C PP ++ G ++CP
Sbjct: 257 CNASANADDLLICDACDKGFHMVCLDPPISSLPEGRWVCP 296
>gi|391344898|ref|XP_003746731.1| PREDICTED: zinc finger protein DPF3-like [Metaseiulus occidentalis]
Length = 470
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++ M++C CG+ H CL+ + W+C C+ C +
Sbjct: 350 CDFCLGTESENKKTKQPEEMVTCADCGRSGHPTCLQFTDVMTNNVKKYRWQCIECKTCTL 409
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP K G + C
Sbjct: 410 CGTSENDDQMLFCDDCDRGYHMYCLSPPLKEPPEGSWSC 448
>gi|195132645|ref|XP_002010753.1| GI21713 [Drosophila mojavensis]
gi|193907541|gb|EDW06408.1| GI21713 [Drosophila mojavensis]
Length = 2287
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 2/98 (2%)
Query: 15 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
R + C SC +K H +C++ + +W+C C+ C CR + P K ++C +
Sbjct: 1971 RPEAFIRCYSCRRKVHPSCIEMPQRMVGRVRNYNWQCAECKCCIKCRSSQQPGKMLYCEQ 2030
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
CD YH YC K V G + C + + C CG+ P
Sbjct: 2031 CDRGYHIYCLG--IKTVPEGRWSCERCSICMRCGATRP 2066
>gi|260818085|ref|XP_002603915.1| hypothetical protein BRAFLDRAFT_131253 [Branchiostoma floridae]
gi|229289239|gb|EEN59926.1| hypothetical protein BRAFLDRAFT_131253 [Branchiostoma floridae]
Length = 479
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 38/80 (47%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L CK C K H +C++ A S W+C C+ C IC +GD +FC CD
Sbjct: 248 LLICKDCNAKAHPSCMRYSADLARRSRMSPWQCIDCKTCYICDDSGDAETLLFCDACDKG 307
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH C P + G ++C
Sbjct: 308 YHMACHEPAVTHKPLGKWVC 327
>gi|198431091|ref|XP_002124209.1| PREDICTED: similar to monocytic leukemia zinc finger protein [Ciona
intestinalis]
Length = 2554
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 45/99 (45%), Gaps = 1/99 (1%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C+ E + +LSC CG H C+K + S W+C C+ C +
Sbjct: 257 ICSYCYGTAECNKTGKQEELLSCADCGSSGHPICMKLSSDLVPKIRGSRWQCIECKSCRV 316
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C G+ + +FC CD +H C +PP + G ++C
Sbjct: 317 CGSKGNADNLLFCDSCDRGFHMECCNPPLLKMPKGSFIC 355
>gi|290973625|ref|XP_002669548.1| predicted protein [Naegleria gruberi]
gi|284083097|gb|EFC36804.1| predicted protein [Naegleria gruberi]
Length = 370
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 53/100 (53%), Gaps = 9/100 (9%)
Query: 5 CFV--GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRR 62
C+V NEG ++ + ML+CK+CG K H C +N + L + SW+C C+ CE+C+
Sbjct: 32 CYVCHKTNEGTDK-QTMLTCKTCGLKVHAGCYEN--THFSLKYADSWECVDCKKCEVCKD 88
Query: 63 TG--DPNKFMFCRRCDAAYHCYCQHPPHKNVSSG--PYLC 98
+ N+ + C RCD YH C + V G P+ C
Sbjct: 89 ANYREGNEILMCNRCDKGYHQLCCSEKFRTVPEGDKPWFC 128
>gi|84997431|ref|XP_953437.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304433|emb|CAI76812.1| hypothetical protein, conserved [Theileria annulata]
Length = 3595
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 58/134 (43%), Gaps = 12/134 (8%)
Query: 70 MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 128
+ C C + H C +P N+ + C T+C SCG + W L + C
Sbjct: 1141 VVCVSCSISAHRSCCYPMVPNLLFIESWKCDYCTQCISCGYRDHNTADYLNWGLFFFFCL 1200
Query: 129 ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD--- 183
C L + NYC +C KV+ D S V C+ C+ W+H +CD ++ + D
Sbjct: 1201 KCWELLERSNYCGICYKVWTHFDGSSQKWVQCEGCKLWIHIECDDLA-----RLITDCPS 1255
Query: 184 -GNLQYRCPTCRGE 196
N YRC CR E
Sbjct: 1256 SRNQNYRCCICRSE 1269
>gi|405964745|gb|EKC30194.1| Histone acetyltransferase MYST4 [Crassostrea gigas]
Length = 387
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 40/80 (50%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C+ C K H +C+ A S W+C C+ C +C+ +GDP+ +FC CD
Sbjct: 163 ILVCQDCNAKAHPSCMGYNAILARRTLESPWQCIDCKTCTVCQDSGDPDTMLFCDACDKG 222
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH C P ++ G + C
Sbjct: 223 YHMTCHEPAIEDKPQGKWEC 242
>gi|412986144|emb|CCO17344.1| predicted protein [Bathycoccus prasinos]
Length = 1990
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 47/99 (47%), Gaps = 14/99 (14%)
Query: 103 KCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 162
+C SCG + L + F C C ++ +G YCP C KV+ + PM+ CD C+
Sbjct: 223 QCASCGVSCARKELDAKQF-----CLLCAKMHGEGQYCPCCGKVWHYANCGPMIQCDTCE 277
Query: 163 RWVHCQCDGISDEKYLQFQVDG--------NLQYRCPTC 193
WVH CD + E L+ + D + Y CPTC
Sbjct: 278 MWVHDLCDATAAE-ILKKEADALKEGREEEEIPYNCPTC 315
>gi|195345773|ref|XP_002039443.1| GM22724 [Drosophila sechellia]
gi|194134669|gb|EDW56185.1| GM22724 [Drosophila sechellia]
Length = 1889
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 3/112 (2%)
Query: 2 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C +C ++ + + C +C K+ H +C+ A+ +W+C C+ C C
Sbjct: 1568 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPARMVGRVRNYNWQCAGCKCCIKC 1627
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
R + P K +FC +CD YH YC K V G + C + C CG+ P
Sbjct: 1628 RSSQRPGKMLFCEQCDRGYHIYCLG--LKTVPDGRWSCERCCFCVRCGATKP 1677
>gi|209877148|ref|XP_002140016.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209555622|gb|EEA05667.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 2678
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 55/131 (41%), Gaps = 7/131 (5%)
Query: 69 FMFCRRCDAAYHCYCQHP--PHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTC 126
++ C C +H C +P P + C C CG W +T
Sbjct: 697 YVLCNICHVGFHGSCSNPFIPPLIAEKKYFKCSTCCYCSHCGYRDHNFMDYAAWDTTFTS 756
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDG--ISDEKYLQFQVDG 184
C C R F KG YC +C +++ S + CD C+ W+H +CD I+ + L+
Sbjct: 757 CIRCCRGFEKGQYCAICRQIWSSSWDGDWLQCDTCRFWIHTECDTNLINSIEVLK---SP 813
Query: 185 NLQYRCPTCRG 195
++ Y CP CR
Sbjct: 814 SVSYHCPVCRS 824
>gi|71029596|ref|XP_764441.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351395|gb|EAN32158.1| hypothetical protein, conserved [Theileria parva]
Length = 3588
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 59/134 (44%), Gaps = 12/134 (8%)
Query: 70 MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 128
+ C C + H C +P N+ + C T+C SCG + W L + C
Sbjct: 1176 VVCVSCSTSAHRSCCYPMVPNLLFIESWKCDYCTQCISCGYRDITCADYLNWGLFFFFCL 1235
Query: 129 ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD--- 183
C L + NYC +C KV+ D+ S V C+ C+ W+H +CD ++ + D
Sbjct: 1236 KCWELLERSNYCGICYKVWTNFDTSSQKWVQCEGCKLWIHIECDDLA-----RLITDCPS 1290
Query: 184 -GNLQYRCPTCRGE 196
N YRC CR E
Sbjct: 1291 SRNQNYRCLICRSE 1304
>gi|355733003|gb|AES10880.1| myeloid/lymphoid or mixed-lineage leukemia 3 isoform 2 [Mustela
putorius furo]
Length = 102
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 2/95 (2%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRT 63
+C V + G R+L+C CG+ YH C+ + + W+C C +CE C +
Sbjct: 8 MCVVCGSFGQGAEGRLLACSQCGQCYHPYCVS--IKITKVVLSKGWRCLECTVCEACGKA 65
Query: 64 GDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
DP + + C CD +YH YC PP + V G + C
Sbjct: 66 SDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKC 100
>gi|327259541|ref|XP_003214595.1| PREDICTED: zinc finger protein DPF3-like [Anolis carolinensis]
Length = 398
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ A + W+C C+ C +
Sbjct: 282 CDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTANMTEAVKTYQWQCIECKSCSL 341
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 342 CGTSENDDQLLFCDDCDRGYHMYCLNPPVSEPPEGSWSC 380
>gi|320167672|gb|EFW44571.1| MYST histone acetyltransferase 2 [Capsaspora owczarzaki ATCC 30864]
Length = 570
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 3/100 (3%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+C LC ++ C ++ C +C H +CL W+C +C+ C +C
Sbjct: 102 VCALC---QSATCAARDSLIMCSNCSDCAHPSCLNLTKAAAAKVKTYPWRCSNCKTCSVC 158
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
+ G K MFC CD H +C PP K+ S + CP+
Sbjct: 159 DKAGHEKKMMFCITCDRGTHSFCAQPPMKDPSEVAWSCPE 198
>gi|312377713|gb|EFR24474.1| hypothetical protein AND_10892 [Anopheles darlingi]
Length = 539
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++SC CG+ H +CL+ A W+C C+ C I
Sbjct: 430 CDFCLGDARENKKTLEPEELVSCSDCGRSGHPSCLQFTANMIISVRKYRWQCIECKYCTI 489
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 490 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLVSPPEGSWSC 528
>gi|307169876|gb|EFN62385.1| Zinc finger protein ubi-d4 A [Camponotus floridanus]
Length = 528
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 419 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 478
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 479 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 517
>gi|156384761|ref|XP_001633301.1| predicted protein [Nematostella vectensis]
gi|156220369|gb|EDO41238.1| predicted protein [Nematostella vectensis]
Length = 315
Score = 62.4 bits (150), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C V EN+ R +LSC CG+ H +CL+ + W+C C+ C +
Sbjct: 192 CDFCLGDVSENKKSGRPEELLSCSDCGRSGHPSCLQFTPKLTYNVKKYRWQCIECKSCTL 251
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G ++C
Sbjct: 252 CGTSDNDDQLLFCDDCDRGYHMYCLNPPMDKPPEGHWMC 290
>gi|256070756|ref|XP_002571708.1| requim req/dpf2 [Schistosoma mansoni]
gi|350646387|emb|CCD58946.1| requim, req/dpf2, putative [Schistosoma mansoni]
Length = 450
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C E N+ R+ ML C CG+ H +CL+ H W+C C+ C +
Sbjct: 331 CDFCLGDESLNKKTGRSEDMLRCSDCGRFAHFSCLQFTPNMITSVHTYRWQCIECKTCWL 390
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + + +FC CD YH YC +PP G + C
Sbjct: 391 CGTSENDEQMLFCDDCDRGYHMYCLNPPLSEPPEGSWSC 429
>gi|26336851|dbj|BAC32109.1| unnamed protein product [Mus musculus]
Length = 440
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 358 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 414
Query: 82 YCQHPPHKNVSSGPYLC 98
+C P K+V + + C
Sbjct: 415 FCLQPVMKSVPTNGWKC 431
>gi|91094021|ref|XP_967377.1| PREDICTED: similar to d4 CG2682-PB [Tribolium castaneum]
Length = 525
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++SC CG+ H +CL + SW+C C+ C +
Sbjct: 418 CDFCLGDSRENKKTGVMEELVSCSDCGRSGHPSCLLFTENMKISVKKYSWQCIECKCCSV 477
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 478 CGNSDNDDQLLFCDDCDRGYHMYCLSPPLTDPPEGSWSC 516
>gi|195048475|ref|XP_001992534.1| GH24152 [Drosophila grimshawi]
gi|193893375|gb|EDV92241.1| GH24152 [Drosophila grimshawi]
Length = 2464
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 2/93 (2%)
Query: 20 LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAY 79
+ C SC ++ H +C++ + +W+C C+ C CR + P K ++C +CD Y
Sbjct: 2085 IRCYSCRRRVHPSCIEMPQRMVGRVRNYNWQCAECKCCIKCRSSQQPGKMLYCEQCDRGY 2144
Query: 80 HCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
H YC K V G + C + + C CG+ P
Sbjct: 2145 HIYCLG--VKTVPEGRWSCERCSICMRCGATRP 2175
>gi|380014950|ref|XP_003691477.1| PREDICTED: zinc finger protein DPF3-like [Apis florea]
Length = 527
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 418 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 477
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 478 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516
>gi|170059917|ref|XP_001865571.1| zinc-finger protein DPF3 [Culex quinquefasciatus]
gi|167878516|gb|EDS41899.1| zinc-finger protein DPF3 [Culex quinquefasciatus]
Length = 450
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ A ++SC CG+ H CL+ A W+C C+ C +
Sbjct: 342 CDFCLGDARENKKTFEAEELVSCSDCGRSGHPTCLQFTANMIISVRKYRWQCIECKYCTM 401
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 402 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLISPPEGSWSC 440
>gi|357436505|ref|XP_003588528.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355477576|gb|AES58779.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 973
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 47/94 (50%), Gaps = 6/94 (6%)
Query: 104 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 160
C CG ++P N L G C C RL +YC +C KV+ S+S V CD
Sbjct: 368 CDECGLDLPFNMSKKTKDLTPGGQLLCKTCARLMKSKHYCGICKKVWNQSDSGSWVRCDG 427
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
C+ WVH +CD IS + F+ G+ Y CP C+
Sbjct: 428 CKVWVHAECDKISS---ILFKNLGSTDYFCPACK 458
>gi|325181008|emb|CCA15418.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 639
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 30/59 (50%)
Query: 137 GNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRG 195
G YCPVC +VY D ++ VCCD C+ WVH CD L N Y CP C G
Sbjct: 580 GQYCPVCNQVYEDDDAASFVCCDSCEMWVHSACDTDLTPAKLATLAGTNETYICPLCGG 638
>gi|196007802|ref|XP_002113767.1| hypothetical protein TRIADDRAFT_26973 [Trichoplax adhaerens]
gi|190584171|gb|EDV24241.1| hypothetical protein TRIADDRAFT_26973 [Trichoplax adhaerens]
Length = 443
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 52/112 (46%), Gaps = 6/112 (5%)
Query: 2 CRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C +CF E N+ E ++ C CG H CL+ + + W+C C+ C
Sbjct: 324 CGVCFASEYVNDLGE-IEELIKCSQCGSLTHPTCLELTPEMVKVIQTYHWQCMDCKTCTA 382
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV 111
C D +K MFC RCD YH +C ++ SG ++CP T+ HS N
Sbjct: 383 CSDPYDEDKMMFCDRCDRGYHTFCVG--LDSIPSGNWICPSCTQ-HSGNKNA 431
>gi|332030886|gb|EGI70522.1| Zinc finger protein ubi-d4 [Acromyrmex echinatior]
Length = 527
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 418 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 477
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 478 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516
>gi|405965654|gb|EKC31016.1| Histone acetyltransferase MYST4 [Crassostrea gigas]
Length = 2037
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 45/99 (45%), Gaps = 1/99 (1%)
Query: 1 MCRLCFVGENEGCERARR-MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C EN+ + ++SC CG H +CLK + + W+C C+ C
Sbjct: 208 ICCFCLGDENKNRDGVPEDLISCAECGNSGHPSCLKFSPELTETVKKLRWQCIDCKTCSF 267
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+++G + +FC CD +H C PP G + C
Sbjct: 268 CQKSGREDNMLFCDLCDRGFHMECCDPPLSKAPKGKWKC 306
>gi|392564180|gb|EIW57358.1| hypothetical protein TRAVEDRAFT_125931 [Trametes versicolor
FP-101664 SS1]
Length = 270
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 4/101 (3%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M+SC CG+ H +C+ D W+C +C+ C +CRR G+ + C CD
Sbjct: 1 MVSCVDCGRSGHPSCM-GLDNMGDAMRGYDWQCATCKSCSVCRRKGNEASMLICDHCDRG 59
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
+H C PP + G + CP C G P S R
Sbjct: 60 WHMSCFDPPFRAPPEGTWHCP---SCPRVGETFPDQYHSSR 97
>gi|443694019|gb|ELT95254.1| hypothetical protein CAPTEDRAFT_227914 [Capitella teleta]
Length = 675
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 43/83 (51%), Gaps = 3/83 (3%)
Query: 18 RMLSCKSCGKKYH-RNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
+M++C C H CL+ A+ + H SW+C C++C C GD K MFC CD
Sbjct: 577 QMVTCIKCHTPAHPSTCLELSAEMIPIIHTYSWQCMDCKMCAKCNDAGDEEKMMFCDHCD 636
Query: 77 AAYHCYCQHPPHKNVSSGPYLCP 99
+H +C + + +G ++CP
Sbjct: 637 RGFHTFCLG--LRVIPTGRWVCP 657
>gi|195476347|ref|XP_002086095.1| GE11247 [Drosophila yakuba]
gi|194185954|gb|EDW99565.1| GE11247 [Drosophila yakuba]
Length = 497
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 397 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 456
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 457 LLFCDDCDRGYHMYCLSPPLMTPPEGSWSC 486
>gi|198455671|ref|XP_001357517.2| GA15428 [Drosophila pseudoobscura pseudoobscura]
gi|198135345|gb|EAL24641.2| GA15428 [Drosophila pseudoobscura pseudoobscura]
Length = 507
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 407 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 466
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 467 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 496
>gi|269785119|ref|NP_001161515.1| Cer-d4 protein [Saccoglossus kowalevskii]
gi|268054007|gb|ACY92490.1| Cer-d4 [Saccoglossus kowalevskii]
Length = 392
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 2/95 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H +CL+ + W+C C+ C +
Sbjct: 294 CDFCLGDASENKKTGVSEELISCSDCGRSGHPSCLQFTTKMTSNVKKYRWQCIECKSCHL 353
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 94
C + + ++ +FC CD YH YC +PP + G
Sbjct: 354 CGTSDNDDQLLFCDDCDRGYHMYCLNPPMSHPPEG 388
>gi|350413485|ref|XP_003490006.1| PREDICTED: zinc finger protein DPF3-like [Bombus impatiens]
Length = 468
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 359 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 418
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 419 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 457
>gi|307195046|gb|EFN77104.1| Zinc finger protein ubi-d4 A [Harpegnathos saltator]
Length = 534
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 425 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 484
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 485 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 523
>gi|145344711|ref|XP_001416870.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577096|gb|ABO95163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1782
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 5/94 (5%)
Query: 103 KCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 162
+C SCG + S+R C C +L +G +CPVC +V+ S MV CD C+
Sbjct: 211 RCASCGVT---DDESLRSKNSNGLCALCRKLHKEGQFCPVCDRVWHWSAGDAMVGCDRCE 267
Query: 163 RWVHCQCDGISDEKYLQFQ--VDGNLQYRCPTCR 194
W+H +CD ++ E + Q D ++ Y CP CR
Sbjct: 268 MWIHRECDAVAAEVLDREQNGEDEDIPYACPVCR 301
>gi|156380495|ref|XP_001631804.1| predicted protein [Nematostella vectensis]
gi|156218850|gb|EDO39741.1| predicted protein [Nematostella vectensis]
Length = 273
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 36/76 (47%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG H +CLK W+C C+ C +CR GD + +FC CD
Sbjct: 197 LISCADCGNSGHPSCLKYSPALTARVQSEPWQCIECKTCSVCRDAGDADNLLFCDMCDRG 256
Query: 79 YHCYCQHPPHKNVSSG 94
+H C PP + +G
Sbjct: 257 FHMECLDPPMSEMPTG 272
>gi|45382857|ref|NP_989970.1| zinc finger protein DPF3 [Gallus gallus]
gi|18202301|sp|P58270.1|DPF3_CHICK RecName: Full=Zinc finger protein DPF3; AltName: Full=Zinc finger
protein cer-d4
gi|14010362|gb|AAK51968.1|AF362754_1 cer-d4 [Gallus gallus]
Length = 427
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 311 CDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKSCSL 370
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 371 CGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 409
>gi|291221144|ref|XP_002730583.1| PREDICTED: d4-like [Saccoglossus kowalevskii]
Length = 493
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDL-FHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
+L CK C K H +C+ N+ Q + S W+C C+ C +C +GD + +FC CD
Sbjct: 220 LLVCKDCNAKAHPSCM-NYTQELAVRARMSPWQCFDCKTCCVCGDSGDADNLLFCDACDK 278
Query: 78 AYHCYCQHPPHKNVSSGPYLCPK 100
YH C P +G ++C K
Sbjct: 279 GYHMACHTPQILRKPTGKWMCIK 301
>gi|340717364|ref|XP_003397154.1| PREDICTED: zinc finger protein DPF3-like [Bombus terrestris]
Length = 527
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 418 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 477
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 478 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 516
>gi|427778555|gb|JAA54729.1| Putative d4 [Rhipicephalus pulchellus]
Length = 487
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFV--GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C GEN+ + ++SC CG+ H +CL+ W+C C+ C +
Sbjct: 352 CDFCLGDNGENKKTRQPEELVSCSDCGRSAHPSCLQFTPNMTVSVKKYRWQCIECKSCGL 411
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP G + C
Sbjct: 412 CGTSDNDDQLLFCDDCDRGYHMYCLQPPLSEPPEGLWSC 450
>gi|388580883|gb|EIM21195.1| hypothetical protein WALSEDRAFT_38994 [Wallemia sebi CBS 633.66]
Length = 808
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 6/84 (7%)
Query: 19 MLSCKSCGKKYHRNCL---KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRC 75
M++C C H CL QN ++WS C C+ CE+C+ G+ + +FC C
Sbjct: 1 MITCGQCQSSAHPTCLHFTPTLTQNTLSYNWS---CVDCKGCEVCKNKGNEDDIIFCDLC 57
Query: 76 DAAYHCYCQHPPHKNVSSGPYLCP 99
D +H +C +PP +G + CP
Sbjct: 58 DRGWHMHCLNPPMNEPPAGDFACP 81
>gi|37362278|gb|AAQ91267.1| requiem, apoptosis response zinc finger gene [Danio rerio]
Length = 368
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 44/89 (49%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H +CL+ A W+C C+ C +C + + ++
Sbjct: 258 NQKTGQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQCIECKCCNVCGTSENDDQL 317
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP + G + C
Sbjct: 318 LFCDDCDRGYHMYCLSPPMSDPPEGSWSC 346
>gi|345480756|ref|XP_001605917.2| PREDICTED: zinc finger protein DPF3 [Nasonia vitripennis]
Length = 551
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 442 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 501
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 502 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLTSPPEGSWSC 540
>gi|157128953|ref|XP_001661565.1| requim, req/dpf2 [Aedes aegypti]
gi|108872440|gb|EAT36665.1| AAEL011279-PA [Aedes aegypti]
Length = 433
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 43/99 (43%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 325 CDFCLGDARENKKTLEPEELVSCSDCGRSGHPTCLQFTANMIISVRKYRWQCIECKYCTI 384
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP G + C
Sbjct: 385 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLLTPPEGSWSC 423
>gi|449502401|ref|XP_004174505.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein DPF3
[Taeniopygia guttata]
Length = 392
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 276 CDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKSCSL 335
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 336 CGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 374
>gi|62177137|ref|NP_997861.2| D4, zinc and double PHD fingers family 2, like [Danio rerio]
gi|62026699|gb|AAH92130.1| D4, zinc and double PHD fingers family 2, like [Danio rerio]
gi|182892074|gb|AAI65789.1| Dpf2l protein [Danio rerio]
Length = 405
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 44/89 (49%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H +CL+ A W+C C+ C +C + + ++
Sbjct: 295 NQKTGQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQCIECKCCNVCGTSENDDQL 354
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP + G + C
Sbjct: 355 LFCDDCDRGYHMYCLSPPMSDPPEGSWSC 383
>gi|194864252|ref|XP_001970846.1| GG10866 [Drosophila erecta]
gi|190662713|gb|EDV59905.1| GG10866 [Drosophila erecta]
Length = 497
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 397 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 456
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 457 LLFCDDCDRGYHMYCLSPPLMTPPEGSWSC 486
>gi|195148883|ref|XP_002015392.1| GL11042 [Drosophila persimilis]
gi|194109239|gb|EDW31282.1| GL11042 [Drosophila persimilis]
Length = 567
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 467 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 526
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 527 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 556
>gi|328778645|ref|XP_395098.4| PREDICTED: zinc finger protein DPF3-like [Apis mellifera]
Length = 533
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 424 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 483
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 484 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 522
>gi|221120366|ref|XP_002164134.1| PREDICTED: histone acetyltransferase KAT6B-like [Hydra
magnipapillata]
Length = 832
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 50/111 (45%), Gaps = 9/111 (8%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C CG H +C++ + W+C C+ C IC+ G+ +FC CD
Sbjct: 214 LLVCDECGNSGHPSCMQYSKELTARVRQEPWQCMECKKCNICKDQGEAANLLFCDACDKG 273
Query: 79 YHCYCQHPPHKNVSSGPYLC-----PKHTKCHSCGSNVPGNGL----SVRW 120
YH C PP ++ G ++C ++ K S+VP + L V+W
Sbjct: 274 YHMACLDPPLDDMPIGTWICDNCLSERNGKRRRISSSVPASLLLSTPDVKW 324
>gi|162287269|ref|NP_001104639.1| zinc finger protein DPF3 [Danio rerio]
gi|215275221|sp|A9LMC0.1|DPF3_DANRE RecName: Full=Zinc finger protein DPF3
gi|159906528|gb|ABX10892.1| D4 zinc and double PHD fingers family 3 protein [Danio rerio]
Length = 391
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 44/99 (44%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N +A ++SC CG+ H +CL+ W+C C+ C +
Sbjct: 276 CDFCLGDSGSNRKTGQAEELVSCSDCGRSGHPSCLQFTDNMMQAVRTYQWQCIECKSCSL 335
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP G + C
Sbjct: 336 CGTSENDDQLLFCDDCDRGYHMYCLKPPMTQPPEGSWSC 374
>gi|18032212|gb|AAL56647.1|AF217500_1 histone acetyltransferase MOZ2 [Homo sapiens]
Length = 2072
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++A +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|270003134|gb|EEZ99581.1| hypothetical protein TcasGA2_TC001567 [Tribolium castaneum]
Length = 481
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++SC CG+ H +CL + SW+C C+ C +
Sbjct: 374 CDFCLGDSRENKKTGVMEELVSCSDCGRSGHPSCLLFTENMKISVKKYSWQCIECKCCSV 433
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 434 CGNSDNDDQLLFCDDCDRGYHMYCLSPPLTDPPEGSWSC 472
>gi|410059939|ref|XP_003318982.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Pan
troglodytes]
Length = 365
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347
Query: 82 YCQHPPHKNVSSGPYLC 98
+C P K+V + + C
Sbjct: 348 FCLQPVMKSVPTNGWKC 364
>gi|356503907|ref|XP_003520741.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 7/90 (7%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C K++ S+ VCCD C WVH +CD IS + + + N
Sbjct: 359 CKPCAKLIKSKQYCGICKKIWHHSDGGNWVCCDGCNVWVHAECDKISSKHFKDLE---NT 415
Query: 187 QYRCPTCRGECYQVRDLEDAVRELWRRKDM 216
Y CP C+G+ + E + + ++ K+M
Sbjct: 416 DYYCPDCKGK----FNCESSTSQTYKSKNM 441
>gi|194758284|ref|XP_001961392.1| GF11022 [Drosophila ananassae]
gi|190622690|gb|EDV38214.1| GF11022 [Drosophila ananassae]
Length = 812
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 2 CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++SC CG+ H +CL+ A W+C C+ C I
Sbjct: 389 CDFCLGDQRENKKTSMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSI 448
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPP 87
C + + ++ +FC CD YH YC PP
Sbjct: 449 CGTSDNDDQLLFCDDCDRGYHMYCLSPP 476
>gi|354468679|ref|XP_003496779.1| PREDICTED: histone acetyltransferase MYST4 isoform 2 [Cricetulus
griseus]
Length = 2047
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++A +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|14010360|gb|AAK51967.1|AF362753_1 cer-d4 [Gallus gallus]
Length = 378
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 262 CDFCLGGSNMNKKSGRPEELVSCSDCGRSGHPTCLQFTTNMTEAVKTYQWQCIECKSCSL 321
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 322 CGTSENDDQLLFCDDCDRGYHMYCLNPPVFEPPEGSWSC 360
>gi|354468677|ref|XP_003496778.1| PREDICTED: histone acetyltransferase MYST4 isoform 1 [Cricetulus
griseus]
Length = 1756
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++A +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|403276505|ref|XP_003929938.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 371
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 291 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 347
Query: 82 YCQHPPHKNVSSGPYLC 98
+C P K+V + + C
Sbjct: 348 FCLQPVMKSVPTNGWKC 364
>gi|19921648|ref|NP_610163.1| d4, isoform A [Drosophila melanogaster]
gi|16417832|gb|AAL18868.1|AF427473_1 dd4 protein [Drosophila melanogaster]
gi|16198077|gb|AAL13829.1| LD29238p [Drosophila melanogaster]
gi|21626860|gb|AAF57340.2| d4, isoform A [Drosophila melanogaster]
gi|220942560|gb|ACL83823.1| d4-PA [synthetic construct]
gi|220952536|gb|ACL88811.1| d4-PA [synthetic construct]
Length = 497
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 397 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 456
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 457 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 486
>gi|24585823|ref|NP_724404.1| d4, isoform C [Drosophila melanogaster]
gi|7302246|gb|AAF57339.1| d4, isoform C [Drosophila melanogaster]
Length = 495
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 395 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 454
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 455 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 484
>gi|442622301|ref|NP_001260707.1| d4, isoform D [Drosophila melanogaster]
gi|440214084|gb|AGB93242.1| d4, isoform D [Drosophila melanogaster]
Length = 496
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 396 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 455
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 456 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 485
>gi|393908955|gb|EJD75261.1| hypothetical protein LOAG_17567 [Loa loa]
Length = 371
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 1 MCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C LC +N+ + +++SC CG+ H +CLK W+C C+ C
Sbjct: 252 VCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIECKSCA 311
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
IC + + ++ +FC CD +H YC PP G + C
Sbjct: 312 ICGFSDNDDQLLFCDDCDRGFHLYCLRPPLSQAPEGEWSC 351
>gi|383856201|ref|XP_003703598.1| PREDICTED: zinc finger protein ubi-d4-like [Megachile rotundata]
Length = 559
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ + ++SC CG+ H CL+ A W+C C+ C I
Sbjct: 450 CDFCLGDARENKKTGGSEELVSCSDCGRSGHPTCLQFTANMIVSVRKYRWQCIECKCCSI 509
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 510 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLASPPEGSWSC 548
>gi|301769749|ref|XP_002920299.1| PREDICTED: zinc finger protein DPF3-like [Ailuropoda melanoleuca]
Length = 633
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 517 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 576
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 577 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 615
>gi|25396154|pir||A88925 protein F33E11.3 [imported] - Caenorhabditis elegans
Length = 1192
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 46/103 (44%), Gaps = 2/103 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M+ C +C YH C++ + L W C CR+C IC + ++ +FC RCD
Sbjct: 437 MICCATCKIAYHPQCIEMPERMAALVKTYEWSCVDCRLCSICNKPEKEDEIVFCDRCDRG 496
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWF 121
+H YC K + G ++C + + N + +V F
Sbjct: 497 FHTYCVG--LKKLPQGTWICDTYCAIENMKFNRRASAAAVGGF 537
>gi|118344068|ref|NP_001071860.1| zinc finger protein [Ciona intestinalis]
gi|70571572|dbj|BAE06775.1| zinc finger protein [Ciona intestinalis]
Length = 399
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 10/103 (9%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS----SWKCPSCR 55
C C EN+ + ++SC CG+ H CL Q D+ + SW+C C+
Sbjct: 280 CDFCLGDADENKKTGESEELVSCSDCGRSGHPTCL----QFTDIMTMNVKKYSWQCIECK 335
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C +C + + + +FC CD YH YC P +N G ++C
Sbjct: 336 SCHVCGTSDNDEQLLFCDDCDRGYHMYCLQPRMENPPEGSWIC 378
>gi|116008472|ref|NP_724405.2| d4, isoform B [Drosophila melanogaster]
gi|113194576|gb|AAM68376.2| d4, isoform B [Drosophila melanogaster]
Length = 339
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 239 ENKKTNMPEELVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQ 298
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 299 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 328
>gi|268571913|ref|XP_002641182.1| Hypothetical protein CBG09043 [Caenorhabditis briggsae]
Length = 373
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 40/80 (50%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +C+ + S W+C C+ C IC + + +K +FC CD
Sbjct: 277 LVSCHDCGRSGHPSCMSFNQNVTMIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDRG 336
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC +PP + Y C
Sbjct: 337 YHLYCLNPPLEKAPDDEYSC 356
>gi|341879787|gb|EGT35722.1| hypothetical protein CAEBREN_06378 [Caenorhabditis brenneri]
Length = 375
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 2/81 (2%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
++SC CG+ H +C+ N+ QN + + S W+C C+ C IC + + +K +FC CD
Sbjct: 279 LISCHDCGRSGHPSCM-NFNQNVTKIINRSGWQCLECKSCTICGTSENDDKLLFCDDCDR 337
Query: 78 AYHCYCQHPPHKNVSSGPYLC 98
YH YC P + Y C
Sbjct: 338 GYHLYCLRPALEKAPDDEYSC 358
>gi|158292808|ref|XP_314129.4| AGAP005225-PA [Anopheles gambiae str. PEST]
gi|157017167|gb|EAA09477.4| AGAP005225-PA [Anopheles gambiae str. PEST]
Length = 468
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++SC CG+ H +CL+ A W+C C+ C I
Sbjct: 359 CDFCLGDARENKKTFEPEELVSCSDCGRSGHPSCLQFTANMIISVRKYRWQCIECKYCTI 418
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP + G + C
Sbjct: 419 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLVSPPEGSWSC 457
>gi|348523632|ref|XP_003449327.1| PREDICTED: zinc finger protein neuro-d4-like [Oreochromis
niloticus]
Length = 381
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 51/116 (43%), Gaps = 6/116 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 268 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 325
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTKCHSCGSNVPG 113
+ + ++ +FC CD YH YC PP G + LC + K + G PG
Sbjct: 326 GTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSCHLCLRQLKEKASGIEDPG 381
>gi|332842752|ref|XP_001140541.2| PREDICTED: zinc finger protein DPF3 [Pan troglodytes]
Length = 367
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 251 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 310
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 311 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 349
>gi|321460287|gb|EFX71331.1| hypothetical protein DAPPUDRAFT_308933 [Daphnia pulex]
Length = 501
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 43/99 (43%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++SC CG+ H +CL+ A W+C C+ C +
Sbjct: 393 CDFCLGDATENKKSGHPEELVSCADCGRSGHPSCLQFTANMIISVKQYRWQCIECKCCSL 452
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + + +FC CD YH YC PP G + C
Sbjct: 453 CGNSDNDEQLLFCDDCDRGYHMYCLKPPLSEPPEGSWSC 491
>gi|25148780|ref|NP_498281.2| Protein DPFF-1, isoform a [Caenorhabditis elegans]
gi|22096399|sp|Q09477.2|YP99_CAEEL RecName: Full=Uncharacterized zinc finger protein C28H8.9
gi|351058508|emb|CCD65970.1| Protein DPFF-1, isoform a [Caenorhabditis elegans]
Length = 372
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 2/81 (2%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQN-RDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDA 77
++SC CG+ H +CL N+ QN + S W+C C+ C IC + + +K +FC CD
Sbjct: 276 LVSCHDCGRSGHPSCL-NFNQNVTKIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDR 334
Query: 78 AYHCYCQHPPHKNVSSGPYLC 98
YH YC P + Y C
Sbjct: 335 GYHLYCLTPALEKAPDDEYSC 355
>gi|344273540|ref|XP_003408579.1| PREDICTED: zinc finger protein DPF3-like [Loxodonta africana]
Length = 427
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 311 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 370
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 371 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 409
>gi|357607405|gb|EHJ65481.1| putative requim, req/dpf2 [Danaus plexippus]
Length = 513
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ R++SC CG+ H CL+ W+C C+ C +C + + ++
Sbjct: 416 ENKKTGTPERLVSCSDCGRSGHPTCLQFTVNMIVSVRKYRWQCIECKCCSVCGTSDNDDQ 475
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 476 LLFCDDCDRGYHMYCLAPPLDAPPEGSWSC 505
>gi|260837382|ref|XP_002613683.1| hypothetical protein BRAFLDRAFT_250354 [Branchiostoma floridae]
gi|229299071|gb|EEN69692.1| hypothetical protein BRAFLDRAFT_250354 [Branchiostoma floridae]
Length = 809
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 45/101 (44%), Gaps = 5/101 (4%)
Query: 1 MCRLCFVGENEGCER---ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 57
+C C +G E C R A +LSC CG H +CLK Q W+C C+ C
Sbjct: 191 VCSFC-LGTAE-CNRDGQAEELLSCADCGNSGHPSCLKYSPQLTAKVRSMRWQCIDCKTC 248
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C D + +FC CD +H C +PP + G + C
Sbjct: 249 TACENKNDLDNILFCDACDRGFHMKCCNPPLTKMPKGNWEC 289
>gi|148670787|gb|EDL02734.1| D4, zinc and double PHD fingers, family 3, isoform CRA_a [Mus
musculus]
Length = 381
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 260 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 319
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 320 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 358
>gi|403223612|dbj|BAM41742.1| predicted protein [Theileria orientalis strain Shintoku]
Length = 4555
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 3/107 (2%)
Query: 70 MFCRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCD 128
+ C C+ A H C +P N+ + C T+C SCG + W L + C
Sbjct: 1805 VVCVSCNVAAHRGCCNPVVPNLLFIESWKCDSCTQCISCGYRDTTCIEYLNWGLFFFFCL 1864
Query: 129 ACGRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGIS 173
C L + NYC VC KV+ DS + V C+ C+ W+H +CD ++
Sbjct: 1865 KCWELLERSNYCGVCYKVWTNFDSNTQKWVQCEGCKLWIHVECDDLA 1911
>gi|395857461|ref|XP_003801110.1| PREDICTED: zinc finger protein DPF3 [Otolemur garnettii]
Length = 489
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 373 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 432
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 433 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 471
>gi|402876619|ref|XP_003902055.1| PREDICTED: zinc finger protein DPF3-like, partial [Papio anubis]
Length = 277
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 161 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 220
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 221 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 259
>gi|255075355|ref|XP_002501352.1| set domain protein [Micromonas sp. RCC299]
gi|226516616|gb|ACO62610.1| set domain protein [Micromonas sp. RCC299]
Length = 2166
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 39/69 (56%), Gaps = 2/69 (2%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN- 185
C C +L +G YCPVC +V++ + MV CD C WVHC CD + ++ Q G+
Sbjct: 226 CRLCAKLHREGQYCPVCDRVWQWANCPAMVGCDSCDFWVHCACDEPA-RTVMEAQERGDE 284
Query: 186 LQYRCPTCR 194
+ Y CP CR
Sbjct: 285 VDYHCPRCR 293
>gi|357463899|ref|XP_003602231.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355491279|gb|AES72482.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 1053
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 47/99 (47%), Gaps = 12/99 (12%)
Query: 104 CHSCGSNVPGN------GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 157
C +CG +P GL+ G C C RL +YC +C KV S+S V
Sbjct: 389 CEACGLALPYKMSKKIKGLTPN---GQLLCKTCTRLTKSKHYCGICKKVSNHSDSGSWVR 445
Query: 158 CDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 196
CD C+ WVH +CD IS + + Y CPTCRG+
Sbjct: 446 CDGCKVWVHAECDKISSNHFKDLE---TTDYFCPTCRGK 481
>gi|403418283|emb|CCM04983.1| predicted protein [Fibroporia radiculosa]
Length = 1278
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 7 VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDP 66
VGE E M+SC CG+ H +CLK + + W C C+ CEICR+
Sbjct: 130 VGEPE------LMVSCAECGRSGHPSCLK-LVEMSETIRLYPWICSECKNCEICRKKEGE 182
Query: 67 NKFMFCRRCDAAYHCYCQHPPHKNVSSG 94
N+ + C CD +H C PP + G
Sbjct: 183 NRMIMCDFCDRGWHMDCLQPPLVEMPPG 210
>gi|324499809|gb|ADY39928.1| Chromodomain-helicase-DNA-binding protein 3 [Ascaris suum]
Length = 1844
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 51/127 (40%), Gaps = 22/127 (17%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C K YH CL + HWS CPSC
Sbjct: 260 EVCQQGGEIILCDTCPKAYHMVCLDPDMEEAPEGHWS---CPSCEAAGIPQKDEEEEKKV 316
Query: 58 ----EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 113
E CR D + C C ++YH YC +PP V G + CP+ C N P
Sbjct: 317 ATNMEYCRVCKDVGWLLCCDTCPSSYHAYCMNPPLTEVPEGEWSCPR-CLCPEP-KNRPE 374
Query: 114 NGLSVRW 120
LS RW
Sbjct: 375 KVLSWRW 381
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/113 (24%), Positives = 42/113 (37%), Gaps = 25/113 (22%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 116
CE+C++ G+ + C C AYH C P + G + CP C + G +P
Sbjct: 259 CEVCQQGGE---IILCDTCPKAYHMVCLDPDMEEAPEGHWSCP---SCEAAG--IPQKD- 309
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
+ ++ YC VC V ++CCD C H C
Sbjct: 310 ----------EEEEKKVATNMEYCRVCKDV------GWLLCCDTCPSSYHAYC 346
>gi|432924374|ref|XP_004080595.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase
KAT6B-like [Oryzias latipes]
Length = 2014
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ +RA +LSC CG H +CLK W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKRAEELLSCADCGSSGHPSCLKFSPDLTSNVKKLRWQCIECKTCSS 273
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNAEEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313
>gi|194226394|ref|XP_001914899.1| PREDICTED: histone acetyltransferase MYST3 [Equus caballus]
Length = 2012
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK A+ W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSAELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|405972707|gb|EKC37461.1| Zinc finger protein ubi-d4 [Crassostrea gigas]
Length = 591
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 43/90 (47%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ + ++SC CG+ H CL+ A W+C C+ C +C + + ++
Sbjct: 485 ENKKSNQPEELVSCSDCGRSGHPTCLQFTANMIISVKKYPWQCIECKSCGLCGTSDNDDQ 544
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC +PP G + C
Sbjct: 545 LLFCDDCDRGYHMYCLNPPLSEPPEGNWSC 574
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 40/79 (50%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ + ++SC CG+ H CL+ A W+C C+ C +C + + ++
Sbjct: 373 ENKKSNQPEELVSCSDCGRSGHPTCLQFTANMIISVKKYPWQCIECKSCGLCGTSDNDDQ 432
Query: 69 FMFCRRCDAAYHCYCQHPP 87
+FC CD YH YC +PP
Sbjct: 433 LLFCDDCDRGYHMYCLNPP 451
>gi|195567753|ref|XP_002107423.1| GD15571 [Drosophila simulans]
gi|194204830|gb|EDX18406.1| GD15571 [Drosophila simulans]
Length = 770
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 44/94 (46%), Gaps = 2/94 (2%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+ C +C K+ H +C+ + +W+C C+ C CR + P K +FC +CD
Sbjct: 478 FIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKCRSSQRPGKMLFCEQCDRG 537
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
YH YC K V G + C + C CG+ P
Sbjct: 538 YHIYCLG--LKTVPDGRWSCERCCFCVRCGATKP 569
>gi|410916367|ref|XP_003971658.1| PREDICTED: zinc finger protein DPF3-like [Takifugu rubripes]
Length = 391
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 39/84 (46%)
Query: 15 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
+A ++SC CG+ H CL+ W+C C+ C IC + + ++ +FC
Sbjct: 291 QAEELVSCSDCGRSGHPTCLQFTDNMMQAVRTYQWQCIECKSCSICGTSENDDQLLFCDD 350
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLC 98
CD YH YC PP G + C
Sbjct: 351 CDRGYHMYCLKPPMTQPPEGSWSC 374
>gi|308235954|ref|NP_001184101.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
Length = 388
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 42/89 (47%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ +A ++SC CG+ H +CL+ W+C C+ C IC + + ++
Sbjct: 282 NKKTNQAEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQL 341
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 342 LFCDDCDRGYHMYCLSPPMAEPPEGSWSC 370
>gi|395503977|ref|XP_003756337.1| PREDICTED: zinc finger protein DPF3 [Sarcophilus harrisii]
Length = 375
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N+ R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 259 CDFCLGGSSMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 318
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 319 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 357
>gi|345803644|ref|XP_854603.2| PREDICTED: zinc finger protein DPF3 [Canis lupus familiaris]
Length = 322
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 206 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 265
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 266 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 304
>gi|197091705|gb|ACH42085.1| PHD zinc finger protein 10 [Crassostrea gigas]
Length = 367
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 10/119 (8%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+C +C G+N + ++ C C K+ H CL + + W+C C+ C C
Sbjct: 224 VCSICTQGQNSEVKGESDLVVCSECNKEGHPGCLDLTNEMVTVIKTYPWQCMDCKTCVEC 283
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR 119
D +K MFC CD YH +C K++ +G H KC SC P N +V+
Sbjct: 284 MDPYDEDKMMFCDLCDRGYHTFCVG--LKSIPTG------HWKCKSCKG--PENPQTVQ 332
>gi|426234251|ref|XP_004011110.1| PREDICTED: zinc finger protein DPF3 [Ovis aries]
Length = 408
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 292 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 351
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 352 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 390
>gi|410962581|ref|XP_003987847.1| PREDICTED: zinc finger protein DPF3 [Felis catus]
Length = 411
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 295 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 354
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 355 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 393
>gi|429329891|gb|AFZ81650.1| hypothetical protein BEWA_010670 [Babesia equi]
Length = 3609
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 58/131 (44%), Gaps = 12/131 (9%)
Query: 72 CRRCDAAYHCYCQHPPHKNV-SSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDAC 130
C C+ + H C +PP N+ + C ++C SCG + W L + C C
Sbjct: 1097 CVSCNLSAHRNCCNPPVPNLLFIESWKCDWCSQCISCGYRDTNGTEYLHWGLFFLLCLKC 1156
Query: 131 GRLFVKGNYCPVCLKVYR--DSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVD----G 184
K N+C VC KV+ DS + V C+ C+ W+H +CD ++ Q D
Sbjct: 1157 WESLEKNNFCGVCYKVWTNYDSVTQKWVQCEGCKLWIHVECDDLA-----QIITDCPSSR 1211
Query: 185 NLQYRCPTCRG 195
+ YRC CR
Sbjct: 1212 SQNYRCKVCRS 1222
>gi|432891023|ref|XP_004075510.1| PREDICTED: zinc finger protein neuro-d4-like isoform 1 [Oryzias
latipes]
Length = 381
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 3/98 (3%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 268 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 325
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ + ++ +FC CD YH YC PP G + C
Sbjct: 326 GTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 363
>gi|363735536|ref|XP_421609.3| PREDICTED: histone acetyltransferase KAT6B [Gallus gallus]
Length = 2025
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTSNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|344241713|gb|EGV97816.1| Histone acetyltransferase MYST4 [Cricetulus griseus]
Length = 709
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++A +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKAEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|390407656|ref|NP_001254554.1| zinc finger protein DPF3 isoform 1 [Mus musculus]
gi|215274004|sp|P58269.2|DPF3_MOUSE RecName: Full=Zinc finger protein DPF3; AltName:
Full=BRG1-associated factor 45C; Short=BAF45C; AltName:
Full=Zinc finger protein cer-d4
gi|26332973|dbj|BAC30204.1| unnamed protein product [Mus musculus]
Length = 378
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 262 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 321
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 322 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360
>gi|296215435|ref|XP_002754118.1| PREDICTED: zinc finger protein DPF3 isoform 1 [Callithrix jacchus]
gi|332229063|ref|XP_003263707.1| PREDICTED: zinc finger protein DPF3 isoform 2 [Nomascus leucogenys]
gi|397507377|ref|XP_003824173.1| PREDICTED: zinc finger protein DPF3 isoform 3 [Pan paniscus]
gi|215274167|sp|Q92784.3|DPF3_HUMAN RecName: Full=Zinc finger protein DPF3; AltName:
Full=BRG1-associated factor 45C; Short=BAF45C; AltName:
Full=Zinc finger protein cer-d4
gi|60459281|gb|AAX20019.1| DPF3 [Homo sapiens]
Length = 378
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 262 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 321
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 322 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360
>gi|317419460|emb|CBN81497.1| Histone acetyltransferase MYST4 [Dicentrarchus labrax]
Length = 2149
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ +R +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + ++ +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313
>gi|297695458|ref|XP_002824959.1| PREDICTED: zinc finger protein DPF3 isoform 2 [Pongo abelii]
Length = 378
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 262 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 321
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 322 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 360
>gi|317419461|emb|CBN81498.1| Histone acetyltransferase MYST4 [Dicentrarchus labrax]
Length = 1996
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ +R +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + ++ +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313
>gi|356570970|ref|XP_003553655.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C K++ S+ VCCD C WVH +CD IS + + + N
Sbjct: 358 CKPCAKLIKSRQYCGICKKIWHHSDGGNWVCCDGCNVWVHAECDKISSKLFKDLE---NA 414
Query: 187 QYRCPTCRGECYQVRDLEDAVRELWRRKDMA 217
Y CP C+G+ + E + + ++ K+++
Sbjct: 415 DYYCPDCKGK----FNYESSTSQTYKSKNIS 441
>gi|354488945|ref|XP_003506626.1| PREDICTED: zinc finger protein DPF3-like, partial [Cricetulus
griseus]
Length = 367
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 251 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 310
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 311 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 349
>gi|126282822|ref|XP_001375927.1| PREDICTED: zinc finger protein DPF3-like [Monodelphis domestica]
Length = 384
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGE--NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N+ R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 268 CDFCLGGSSMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 327
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 328 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 366
>gi|328701324|ref|XP_001945217.2| PREDICTED: zinc finger protein ubi-d4-like isoform 1 [Acyrthosiphon
pisum]
Length = 521
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 39/80 (48%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H CL+ A W+C C+ C IC + + ++ +FC CD
Sbjct: 432 LISCSDCGRSGHPTCLQFTANMIISVGKYRWQCIECKCCSICGTSDNDDQLLFCDDCDRG 491
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC PP + G + C
Sbjct: 492 YHVYCLTPPLTSPPEGCWSC 511
>gi|194762684|ref|XP_001963464.1| GF20276 [Drosophila ananassae]
gi|190629123|gb|EDV44540.1| GF20276 [Drosophila ananassae]
Length = 2062
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 4/117 (3%)
Query: 2 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C +C ++ + + C SC ++ H +C+ + +W+C C+ C C
Sbjct: 1732 CGVCLRSQHRNARDMPEVFIRCYSCRRRVHPSCIDMPQRMVGRVRNYNWQCSGCKCCIKC 1791
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLS 117
R P K +FC +CD YH YC + V G + C + C CG+ P GLS
Sbjct: 1792 RSNQRPGKMLFCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCMRCGATRP-EGLS 1845
>gi|348508657|ref|XP_003441870.1| PREDICTED: histone acetyltransferase MYST4 [Oreochromis niloticus]
Length = 2141
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ +R +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + ++ +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGTWIC 313
>gi|328701326|ref|XP_003241562.1| PREDICTED: zinc finger protein ubi-d4-like isoform 2 [Acyrthosiphon
pisum]
Length = 458
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 39/80 (48%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H CL+ A W+C C+ C IC + + ++ +FC CD
Sbjct: 369 LISCSDCGRSGHPTCLQFTANMIISVGKYRWQCIECKCCSICGTSDNDDQLLFCDDCDRG 428
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC PP + G + C
Sbjct: 429 YHVYCLTPPLTSPPEGCWSC 448
>gi|355778710|gb|EHH63746.1| hypothetical protein EGM_16777, partial [Macaca fascicularis]
Length = 363
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 252 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 311
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 312 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350
>gi|6002696|gb|AAF00100.1|AF119231_1 histone acetyltransferase MORF beta [Homo sapiens]
Length = 2073
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|6002694|gb|AAF00099.1|AF119230_1 histone acetyltransferase MORF alpha [Homo sapiens]
Length = 1890
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|426365193|ref|XP_004049670.1| PREDICTED: histone acetyltransferase KAT6B [Gorilla gorilla
gorilla]
Length = 2072
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|341888296|gb|EGT44231.1| hypothetical protein CAEBREN_09890 [Caenorhabditis brenneri]
gi|341902393|gb|EGT58328.1| hypothetical protein CAEBREN_07005 [Caenorhabditis brenneri]
Length = 454
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 3/89 (3%)
Query: 11 EGCERAR-RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
+ CER M+ C +C YH +C++ + + W C CR+C +C + G+ N+
Sbjct: 339 DSCERVGGEMVCCATCNIAYHPHCIEMPERMAMIVKTYEWSCVDCRVCSVCHKPGEENEV 398
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC RCD +H C K+ G ++C
Sbjct: 399 VFCDRCDRGFHNSCVG--LKSTPIGSWIC 425
>gi|397483738|ref|XP_003813054.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Pan paniscus]
Length = 2075
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|100816397|ref|NP_036462.2| histone acetyltransferase KAT6B isoform 1 [Homo sapiens]
gi|143811424|sp|Q8WYB5.3|KAT6B_HUMAN RecName: Full=Histone acetyltransferase KAT6B; AltName:
Full=Histone acetyltransferase MOZ2; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4;
AltName: Full=Monocytic leukemia zinc finger
protein-related factor
gi|119574944|gb|EAW54559.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_c [Homo sapiens]
Length = 2073
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|395820450|ref|XP_003783579.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Otolemur
garnettii]
Length = 1880
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|449505049|ref|XP_002192975.2| PREDICTED: histone acetyltransferase KAT6B [Taeniopygia guttata]
Length = 1842
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|397483742|ref|XP_003813056.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Pan paniscus]
Length = 1892
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|291404131|ref|XP_002718449.1| PREDICTED: MYST histone acetyltransferase (monocytic leukemia) 4
isoform 2 [Oryctolagus cuniculus]
Length = 1774
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|355693412|gb|EHH28015.1| hypothetical protein EGK_18348, partial [Macaca mulatta]
Length = 368
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 252 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 311
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 312 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350
>gi|348573139|ref|XP_003472349.1| PREDICTED: zinc finger protein DPF3-like [Cavia porcellus]
Length = 369
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 253 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 312
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 313 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 351
>gi|374349205|ref|NP_001243397.1| histone acetyltransferase KAT6B isoform 2 [Homo sapiens]
gi|119574942|gb|EAW54557.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_a [Homo sapiens]
Length = 1890
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|332834457|ref|XP_003312688.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Pan troglodytes]
Length = 2070
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|432100457|gb|ELK29089.1| Histone acetyltransferase MYST4 [Myotis davidii]
Length = 2022
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|440907612|gb|ELR57740.1| Zinc finger protein DPF3, partial [Bos grunniens mutus]
Length = 368
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 252 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 311
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 312 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 350
>gi|403298004|ref|XP_003939830.1| PREDICTED: histone acetyltransferase KAT6B [Saimiri boliviensis
boliviensis]
Length = 2051
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|390472131|ref|XP_002807481.2| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Callithrix jacchus]
Length = 2066
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|402880392|ref|XP_003903787.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Papio anubis]
Length = 1887
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|348506563|ref|XP_003440828.1| PREDICTED: zinc finger protein DPF3-like [Oreochromis niloticus]
Length = 391
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 39/84 (46%)
Query: 15 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
+A ++SC CG+ H CL+ W+C C+ C +C + + ++ +FC
Sbjct: 291 QAEELVSCSDCGRSGHPTCLQFTENMMQAVRTYQWQCIECKSCSLCGTSENDDQLLFCDD 350
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLC 98
CD YH YC PP G + C
Sbjct: 351 CDRGYHMYCLKPPMTQPPEGSWSC 374
>gi|432891025|ref|XP_004075511.1| PREDICTED: zinc finger protein neuro-d4-like isoform 2 [Oryzias
latipes]
Length = 371
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 3/98 (3%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 258 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 315
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ + ++ +FC CD YH YC PP G + C
Sbjct: 316 GTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 353
>gi|341942234|sp|Q8BRB7.3|KAT6B_MOUSE RecName: Full=Histone acetyltransferase KAT6B; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4;
AltName: Full=Protein querkopf
Length = 1872
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|297301091|ref|XP_002805720.1| PREDICTED: histone acetyltransferase MYST4-like isoform 1 [Macaca
mulatta]
Length = 1893
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|291404129|ref|XP_002718448.1| PREDICTED: MYST histone acetyltransferase (monocytic leukemia) 4
isoform 1 [Oryctolagus cuniculus]
Length = 2065
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|195479715|ref|XP_002100999.1| GE17369 [Drosophila yakuba]
gi|194188523|gb|EDX02107.1| GE17369 [Drosophila yakuba]
Length = 2002
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 2 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1696 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKC 1755
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1756 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCMRCGATKP 1805
>gi|187957110|gb|AAI50619.1| MYST histone acetyltransferase (monocytic leukemia) 4 [Homo
sapiens]
Length = 2073
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|73953062|ref|XP_536397.2| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Canis lupus
familiaris]
Length = 2090
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|6002686|gb|AAF00095.1| histone acetyltransferase MORF [Homo sapiens]
gi|20521021|dbj|BAA20837.2| KIAA0383 [Homo sapiens]
gi|152012887|gb|AAI50271.1| MYST4 protein [Homo sapiens]
gi|168267336|dbj|BAG09724.1| MYST histone acetyltransferase 4 [synthetic construct]
Length = 1781
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|417406735|gb|JAA50012.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
Length = 1778
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|338719785|ref|XP_001489567.3| PREDICTED: zinc finger protein DPF3-like [Equus caballus]
Length = 415
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 299 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 358
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 359 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 397
>gi|297467918|ref|XP_872746.3| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Bos taurus]
gi|297491533|ref|XP_002698931.1| PREDICTED: histone acetyltransferase KAT6B [Bos taurus]
gi|296472060|tpg|DAA14175.1| TPA: MYST histone acetyltransferase (monocytic leukemia) 4 [Bos
taurus]
Length = 2054
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|301773210|ref|XP_002922022.1| PREDICTED: histone acetyltransferase MYST4-like [Ailuropoda
melanoleuca]
gi|281342250|gb|EFB17834.1| hypothetical protein PANDA_010953 [Ailuropoda melanoleuca]
Length = 2063
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|148235385|ref|NP_001079117.1| zinc finger protein ubi-d4 A [Xenopus laevis]
gi|18203564|sp|Q9W638.1|REQUA_XENLA RecName: Full=Zinc finger protein ubi-d4 A; AltName: Full=Apoptosis
response zinc finger protein A; AltName: Full=Protein
requiem A; Short=xReq A
gi|4808462|dbj|BAA77574.1| Requiem protein [Xenopus laevis]
Length = 388
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 43/89 (48%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H +CL+ A W+C C+ C IC + + ++
Sbjct: 282 NKKTNQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQCIECKCCNICGTSENDDQL 341
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 342 LFCDDCDRGYHMYCLVPPVAEPPEGSWSC 370
>gi|417406752|gb|JAA50020.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
Length = 1807
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|355782819|gb|EHH64740.1| hypothetical protein EGM_18047 [Macaca fascicularis]
Length = 2069
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|355562477|gb|EHH19071.1| hypothetical protein EGK_19714 [Macaca mulatta]
Length = 2077
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|338716911|ref|XP_003363544.1| PREDICTED: histone acetyltransferase MYST4 isoform 3 [Equus
caballus]
Length = 1878
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|440893247|gb|ELR46092.1| Histone acetyltransferase MYST4 [Bos grunniens mutus]
Length = 2054
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|395820446|ref|XP_003783577.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Otolemur
garnettii]
Length = 2062
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|297301093|ref|XP_002805721.1| PREDICTED: histone acetyltransferase MYST4-like isoform 2 [Macaca
mulatta]
Length = 1784
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|374349207|ref|NP_001243398.1| histone acetyltransferase KAT6B isoform 3 [Homo sapiens]
gi|119574943|gb|EAW54558.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_b [Homo sapiens]
Length = 1781
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|397483740|ref|XP_003813055.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Pan paniscus]
Length = 1783
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|332244078|ref|XP_003271198.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Nomascus leucogenys]
Length = 2055
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 196 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 255
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 256 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 295
>gi|426255808|ref|XP_004021540.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Ovis aries]
Length = 1760
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|338716908|ref|XP_003363543.1| PREDICTED: histone acetyltransferase MYST4 isoform 2 [Equus
caballus]
Length = 1769
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|194042830|ref|XP_001928984.1| PREDICTED: histone acetyltransferase MYST4 [Sus scrofa]
Length = 2065
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|402880388|ref|XP_003903785.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Papio anubis]
Length = 2070
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|348576162|ref|XP_003473856.1| PREDICTED: histone acetyltransferase MYST4-like isoform 1 [Cavia
porcellus]
Length = 2053
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|21391996|gb|AAM48352.1| LD10526p [Drosophila melanogaster]
Length = 1843
Score = 59.3 bits (142), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 2 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1532 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1591
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1592 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1641
>gi|395741628|ref|XP_002820847.2| PREDICTED: histone acetyltransferase KAT6B, partial [Pongo abelii]
Length = 1870
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 8 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 67
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 68 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 107
>gi|395820448|ref|XP_003783578.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Otolemur
garnettii]
Length = 1771
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|6716789|gb|AAF26744.1|AF222800_1 histone acetyltransferase querkopf [Mus musculus]
Length = 1763
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|74184716|dbj|BAE27963.1| unnamed protein product [Mus musculus]
Length = 1763
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|149689991|ref|XP_001504001.1| PREDICTED: histone acetyltransferase MYST4 isoform 1 [Equus
caballus]
Length = 2061
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|110556652|ref|NP_059507.2| histone acetyltransferase KAT6B [Mus musculus]
gi|327365366|ref|NP_001192170.1| histone acetyltransferase KAT6B [Mus musculus]
gi|148669523|gb|EDL01470.1| mCG123147, isoform CRA_a [Mus musculus]
Length = 1763
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|410975403|ref|XP_003994122.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Felis catus]
Length = 2078
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|126272817|ref|XP_001366112.1| PREDICTED: histone acetyltransferase MYST4 [Monodelphis domestica]
Length = 2045
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|431904096|gb|ELK09518.1| Histone acetyltransferase MYST4 [Pteropus alecto]
Length = 1926
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNRDKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|68565919|sp|Q8WML3.1|KAT6B_MACFA RecName: Full=Histone acetyltransferase KAT6B; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4
gi|17025966|dbj|BAB72094.1| histone acetyltransferase MORF [Macaca fascicularis]
Length = 1784
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|395501560|ref|XP_003755161.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Sarcophilus
harrisii]
Length = 1862
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|426255810|ref|XP_004021541.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Ovis aries]
Length = 1869
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|402880390|ref|XP_003903786.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Papio anubis]
Length = 1778
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|426255806|ref|XP_004021539.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Ovis aries]
Length = 2052
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|45550083|ref|NP_608334.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|442616987|ref|NP_001259718.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|442616993|ref|NP_001259721.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
gi|62901062|sp|Q9VWF2.3|SAYP_DROME RecName: Full=Supporter of activation of yellow protein; AltName:
Full=Protein enhancer of yellow 3
gi|45447061|gb|AAF48990.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|257153436|gb|ACV44475.1| LD27440p [Drosophila melanogaster]
gi|440216955|gb|AGB95558.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|440216958|gb|AGB95561.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
Length = 2006
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 2 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1695 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1754
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1755 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1804
>gi|348576164|ref|XP_003473857.1| PREDICTED: histone acetyltransferase MYST4-like isoform 2 [Cavia
porcellus]
Length = 1762
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|395501556|ref|XP_003755159.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Sarcophilus
harrisii]
Length = 1753
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|194893051|ref|XP_001977800.1| GG18040 [Drosophila erecta]
gi|190649449|gb|EDV46727.1| GG18040 [Drosophila erecta]
Length = 1982
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 4/125 (3%)
Query: 2 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1683 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKC 1742
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRW 120
R + P K ++C +CD YH YC + V G + C + C CG+ P GL+
Sbjct: 1743 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCVCIRCGATKP-EGLAHVA 1799
Query: 121 FLGYT 125
L T
Sbjct: 1800 ALSQT 1804
>gi|395501558|ref|XP_003755160.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Sarcophilus
harrisii]
Length = 2045
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
finger protein HRX) (ALL-1) (Trithorax-like protein)
(Lysine N-methyltransferase 2A) (CXXC-type zinc finger
protein 7) [Ciona intestinalis]
Length = 3406
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 67/179 (37%), Gaps = 60/179 (33%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHW---SSWKCPSCRICEICRRTGDPNKFMFCRRC 75
++ C C + YH C + N D+ ++W C C+ C +C G P + CRRC
Sbjct: 1041 LVHCACCCEPYHPFCAEPDFLNTDVLAQMKRNTWICRKCQCCHVC---GHPKNLLVCRRC 1097
Query: 76 DAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFV 135
+Y H++C G + P N S
Sbjct: 1098 KKSY---------------------HSEC--LGPSYPTNCYS------------------ 1116
Query: 136 KGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKY-LQFQVDGNLQYRCPTC 193
D ES M+ C C+RWVH +C+ +S E Y L + N+ Y+CP C
Sbjct: 1117 -----------DEDYESR-MIQCSGCKRWVHSKCESLSVEMYTLLAHMPNNVTYKCPDC 1163
>gi|442616989|ref|NP_001259719.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
gi|440216956|gb|AGB95559.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
Length = 2012
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 2 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1701 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1760
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1761 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1810
>gi|442616991|ref|NP_001259720.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
gi|440216957|gb|AGB95560.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
Length = 2011
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)
Query: 2 CRLCFVGENEGC-ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C +C ++ + + C +C K+ H +C+ + +W+C C+ C C
Sbjct: 1700 CGVCLRSQHRNARDMPEAFIRCYTCRKRVHPSCVDMPPRMVGRVRNYNWQCAGCKCCIKC 1759
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
R + P K ++C +CD YH YC + V G + C + C CG+ P
Sbjct: 1760 RSSQRPGKMLYCEQCDRGYHIYCLG--LRTVPDGRWSCERCCFCMRCGATKP 1809
>gi|195027395|ref|XP_001986568.1| GH21439 [Drosophila grimshawi]
gi|193902568|gb|EDW01435.1| GH21439 [Drosophila grimshawi]
Length = 504
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ W+C C+ C IC + + ++
Sbjct: 404 ENKKTNMPEELVSCSDCGRSGHPSCLQFTPNMIISVKRYRWQCIECKYCSICGTSDNDDQ 463
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 464 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 493
>gi|327276821|ref|XP_003223166.1| PREDICTED: histone acetyltransferase MYST4-like [Anolis
carolinensis]
Length = 2024
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCLELTTNVKALRWQCIECKTCSA 273
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 313
>gi|195382643|ref|XP_002050039.1| GJ20409 [Drosophila virilis]
gi|194144836|gb|EDW61232.1| GJ20409 [Drosophila virilis]
Length = 490
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ W+C C+ C IC + + ++
Sbjct: 390 ENKKTNMPEELVSCSDCGRSGHPSCLQFTPNMIISVKRYRWQCIECKYCSICGTSDNDDQ 449
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 450 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 479
>gi|308497276|ref|XP_003110825.1| hypothetical protein CRE_04507 [Caenorhabditis remanei]
gi|308242705|gb|EFO86657.1| hypothetical protein CRE_04507 [Caenorhabditis remanei]
Length = 372
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 38/80 (47%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL + S W+C C+ C IC + + +K +FC CD
Sbjct: 276 LVSCHDCGRSGHPSCLSFNENVTKIIKRSGWQCLECKSCTICGTSENDDKLLFCDDCDRG 335
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P + Y C
Sbjct: 336 YHLYCLRPALEKAPDDEYSC 355
>gi|410895483|ref|XP_003961229.1| PREDICTED: histone acetyltransferase KAT6B-like [Takifugu rubripes]
Length = 2123
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNRDKQPEELLSCADCGSSGHPSCLKFSPELTSNVKRLRWQCIECKTCSS 273
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + ++ +FC CD +H C +PP + G ++C
Sbjct: 274 CRIQGKNADEMLFCDSCDRGFHMECCNPPLSRMPKGTWIC 313
>gi|297298207|ref|XP_002805197.1| PREDICTED: hypothetical protein LOC694878 [Macaca mulatta]
Length = 472
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 356 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 415
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 416 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 454
>gi|115533182|ref|NP_001041113.1| Protein PHF-10, isoform b [Caenorhabditis elegans]
gi|351062484|emb|CCD70456.1| Protein PHF-10, isoform b [Caenorhabditis elegans]
Length = 447
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 2/80 (2%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M+ C +C YH C++ + L W C CR+C IC + ++ +FC RCD
Sbjct: 345 MICCATCKIAYHPQCIEMPERMAALVKTYEWSCVDCRLCSICNKPEKEDEIVFCDRCDRG 404
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
+H YC K + G ++C
Sbjct: 405 FHTYCVG--LKKLPQGTWIC 422
>gi|351714578|gb|EHB17497.1| Histone acetyltransferase MYST4 [Heterocephalus glaber]
Length = 2108
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 217 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 276
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 277 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 316
>gi|344274631|ref|XP_003409118.1| PREDICTED: histone acetyltransferase MYST4 [Loxodonta africana]
Length = 1878
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|260830577|ref|XP_002610237.1| hypothetical protein BRAFLDRAFT_245816 [Branchiostoma floridae]
gi|229295601|gb|EEN66247.1| hypothetical protein BRAFLDRAFT_245816 [Branchiostoma floridae]
Length = 354
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 43/98 (43%), Gaps = 2/98 (2%)
Query: 2 CRLCFVGENEGCERA--RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C E E + ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 255 CDFCLGDEKENKKSGVPEELISCSDCGRSGHPTCLQFTPHMTESVKKYRWQCIECKSCHL 314
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYL 97
C + + ++ +FC CD YH YC PP + G L
Sbjct: 315 CGTSENDDQLLFCDDCDRGYHMYCLSPPLSSPPEGKTL 352
>gi|297459745|ref|XP_001254780.2| PREDICTED: zinc finger protein DPF3 [Bos taurus]
gi|297479887|ref|XP_002691098.1| PREDICTED: zinc finger protein DPF3 [Bos taurus]
gi|296483064|tpg|DAA25179.1| TPA: Zinc finger protein DPF3-like [Bos taurus]
Length = 474
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 358 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 417
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 418 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 456
>gi|355755784|gb|EHH59531.1| hypothetical protein EGM_09668 [Macaca fascicularis]
Length = 340
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 227 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 284
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 285 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 330
>gi|344243487|gb|EGV99590.1| Zinc finger protein DPF3 [Cricetulus griseus]
Length = 408
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 2/88 (2%)
Query: 2 CRLCFVGEN--EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 251 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 310
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPP 87
C + + ++ +FC CD YH YC +PP
Sbjct: 311 CGTSENDDQLLFCDDCDRGYHMYCLNPP 338
>gi|37998957|dbj|BAD00088.1| chimeric MOZ-ASXH2 fusion protein [Homo sapiens]
Length = 2228
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|195425644|ref|XP_002061104.1| GK10624 [Drosophila willistoni]
gi|194157189|gb|EDW72090.1| GK10624 [Drosophila willistoni]
Length = 515
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 43/99 (43%), Gaps = 2/99 (2%)
Query: 2 CRLCFVG--ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C EN+ ++SC CG+ H +CL+ W+C C+ C I
Sbjct: 406 CDFCLGDQRENKKTNMPEELVSCSDCGRSGHPSCLQFTPNMIISVKRYRWQCIECKYCSI 465
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP G + C
Sbjct: 466 CGTSDNDDQLLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 504
>gi|431909715|gb|ELK12873.1| Zinc finger protein neuro-d4 [Pteropus alecto]
Length = 425
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 312 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 369
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 370 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 415
>gi|189526911|ref|XP_697383.3| PREDICTED: hypothetical protein LOC568932 [Danio rerio]
Length = 2011
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ +R +LSC CG H +CLK A W+C C+ C
Sbjct: 215 ICSFCLGTKESNRDKRPEELLSCADCGSSGHPSCLKFSADLTANVKALRWQCIECKTCSS 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + ++ +FC CD +H C PP + G ++C
Sbjct: 275 CQIQGKNADEMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|345305893|ref|XP_001506182.2| PREDICTED: histone acetyltransferase MYST4 isoform 1
[Ornithorhynchus anatinus]
Length = 2066
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + +FC CD +H C PP + G ++C
Sbjct: 275 CRIQGKNAENMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|403293045|ref|XP_003937534.1| PREDICTED: zinc finger protein neuro-d4 [Saimiri boliviensis
boliviensis]
Length = 293
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 180 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 237
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 238 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 283
>gi|356551207|ref|XP_003543969.1| PREDICTED: uncharacterized protein LOC100786712 [Glycine max]
Length = 525
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH CL + + + + W CPSC IC++C D +K + C CD AYH YC P
Sbjct: 376 KYYHVRCL---SSKQLKSYGNCWYCPSC-ICQVCLTDKDDDKIVLCDGCDHAYHIYCMKP 431
Query: 87 PHKNVSSGPYLCPK 100
P ++ G + C K
Sbjct: 432 PQNSIPKGKWFCIK 445
>gi|4808460|dbj|BAA77573.1| Requiem protein [Xenopus laevis]
Length = 290
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 43/99 (43%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C N+ + ++SC CG+ H +CL+ W+C C+ C I
Sbjct: 174 CDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNI 233
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP G + C
Sbjct: 234 CGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 272
>gi|356522510|ref|XP_003529889.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 989
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 126 CCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGN 185
CC C +L YC +C +++ S+ VCCD C WVH +CD IS + + + N
Sbjct: 360 CCKYCSKLRKSKQYCGICKRIWHHSDGGNWVCCDGCNVWVHAECDKISSKVFKDLE---N 416
Query: 186 LQYRCPTCRGE 196
Y CP C+G+
Sbjct: 417 TDYYCPDCKGK 427
>gi|242000110|ref|XP_002434698.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
gi|215498028|gb|EEC07522.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
Length = 1882
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 14/101 (13%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 59
E C++ ++ C +C + YH CL+ + WS CP C E
Sbjct: 421 EVCQQGGEIILCDTCPRAYHLVCLEPELEEPPEGKWS---CPHCEGEGIQEQEEDEHMEF 477
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
CR D + + C C AA+H +C +PP KNV +G + CP+
Sbjct: 478 CRVCKDGGELLCCDSCPAAFHTFCLNPPLKNVPTGKWNCPR 518
>gi|301780976|ref|XP_002925892.1| PREDICTED: zinc finger protein neuro-d4-like isoform 1 [Ailuropoda
melanoleuca]
gi|410983100|ref|XP_003997881.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Felis catus]
Length = 414
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 301 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 358
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 359 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404
>gi|324512595|gb|ADY45214.1| Zinc finger protein [Ascaris suum]
Length = 382
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 1 MCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C LC EN+ +++SC CG+ H +CLK + W+C C+ C
Sbjct: 264 ICDLCLGDRNENKKTSLPEQLVSCHDCGRSGHPSCLKFTENMITSTNKYGWQCIECKSCA 323
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
IC + + ++ +FC CD +H YC P G + C
Sbjct: 324 ICGTSDNDDQLLFCDDCDRGFHLYCLRPRLATAPEGEWSC 363
>gi|205830430|ref|NP_001128627.1| zinc finger protein neuro-d4 isoform a [Homo sapiens]
gi|297704629|ref|XP_002829197.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Pongo abelii]
gi|395847017|ref|XP_003796183.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Otolemur
garnettii]
gi|402905395|ref|XP_003915505.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Papio anubis]
gi|387540168|gb|AFJ70711.1| zinc finger protein neuro-d4 isoform a [Macaca mulatta]
Length = 414
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 301 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 358
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 359 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404
>gi|410053854|ref|XP_001162157.3| PREDICTED: zinc finger protein neuro-d4 [Pan troglodytes]
Length = 519
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 406 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 463
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 464 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 509
>gi|301765980|ref|XP_002918393.1| PREDICTED: histone acetyltransferase MYST3-like [Ailuropoda
melanoleuca]
Length = 1702
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|443693752|gb|ELT95039.1| hypothetical protein CAPTEDRAFT_19718 [Capitella teleta]
Length = 421
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 43/90 (47%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ A W+C C+ C +C + + ++
Sbjct: 315 ENKKTGMKEELVSCADCGRSGHPSCLQFTANMIISVKKYPWQCIECKSCGLCGTSDNDDQ 374
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC +PP G + C
Sbjct: 375 LLFCDDCDRGYHMYCLNPPLSEPPEGSWSC 404
>gi|56790323|ref|NP_001007153.1| D4, zinc and double PHD fingers family 2 [Danio rerio]
gi|54035542|gb|AAH83281.1| D4, zinc and double PHD fingers family 2 [Danio rerio]
gi|182890270|gb|AAI65788.1| Dpf2 protein [Danio rerio]
Length = 400
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 42/89 (47%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H +CL+ W+C C+ C IC + + ++
Sbjct: 290 NQKTGQSEELVSCSDCGRSGHPSCLQFTPIMMAAVKTYRWQCIECKCCNICGTSENDDQL 349
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 350 LFCDDCDRGYHMYCLSPPMSVPPEGSWSC 378
>gi|380798715|gb|AFE71233.1| zinc finger protein neuro-d4 isoform a, partial [Macaca mulatta]
Length = 407
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 294 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 351
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 352 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 397
>gi|195122592|ref|XP_002005795.1| GI20660 [Drosophila mojavensis]
gi|193910863|gb|EDW09730.1| GI20660 [Drosophila mojavensis]
Length = 492
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ ++SC CG+ H +CL+ W+C C+ C IC + + ++
Sbjct: 392 ENKKTNMPEELVSCSDCGRSGHPSCLQFTPNMIISVKRYRWQCIECKYCSICGTSDNDDQ 451
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 452 LLFCDDCDRGYHMYCLSPPLVTPPEGSWSC 481
>gi|432877939|ref|XP_004073268.1| PREDICTED: zinc finger protein ubi-d4-like [Oryzias latipes]
Length = 407
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 41/89 (46%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H CL+ W+C C+ C +C + + ++
Sbjct: 296 NQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQCIECKCCNVCGTSENDDQL 355
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 356 LFCDDCDRGYHMYCLTPPMTEPPEGSWSC 384
>gi|195399353|ref|XP_002058285.1| GJ15575 [Drosophila virilis]
gi|194150709|gb|EDW66393.1| GJ15575 [Drosophila virilis]
Length = 2162
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 6/112 (5%)
Query: 5 CFVGENEGCERARRM----LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C V + AR M + C SC ++ H +C++ + +W+C C+ C C
Sbjct: 1896 CGVCQRTQHRNARNMPEAFIRCYSCRRRVHPSCIEMPHRMVGRVRNYNWQCADCKCCIKC 1955
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+ P K ++C +CD YH YC K V G + C + + C CG+ P
Sbjct: 1956 GSSQQPGKMLYCEQCDRGYHIYCLG--VKTVPEGRWSCERCSICMRCGATRP 2005
>gi|119577164|gb|EAW56760.1| D4, zinc and double PHD fingers family 1, isoform CRA_a [Homo
sapiens]
Length = 425
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 312 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 369
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 370 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 415
>gi|417406854|gb|JAA50068.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
Length = 2010
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|354482358|ref|XP_003503365.1| PREDICTED: histone acetyltransferase MYST3 [Cricetulus griseus]
Length = 1992
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSTELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|45383492|ref|NP_989662.1| zinc finger protein ubi-d4 [Gallus gallus]
gi|18202299|sp|P58268.1|REQU_CHICK RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName: Full=Protein
requiem
gi|14010356|gb|AAK51965.1|AF362751_1 requiem [Gallus gallus]
Length = 405
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 38/80 (47%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 306 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 365
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC PP G + C
Sbjct: 366 YHMYCLTPPMSEPPEGSWSC 385
>gi|224120768|ref|XP_002318412.1| SET domain protein [Populus trichocarpa]
gi|222859085|gb|EEE96632.1| SET domain protein [Populus trichocarpa]
Length = 1078
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 104 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 160
C CG+++P + G C C RL ++C +C KV+ S+S V CD
Sbjct: 407 CEGCGTSLPLKPAKKIKGTSPGGQLLCKTCARLTKSKHFCGICKKVWNHSDSGSWVRCDG 466
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 196
C+ WVH +CD IS + F+ G Y CP C+ +
Sbjct: 467 CKVWVHAECDKISSNR---FKDLGGTDYYCPACKAK 499
>gi|348518782|ref|XP_003446910.1| PREDICTED: zinc finger protein ubi-d4-like [Oreochromis niloticus]
Length = 399
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 41/84 (48%)
Query: 15 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
++ ++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC
Sbjct: 294 QSEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNMCGTSENDDQLLFCDD 353
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLC 98
CD YH YC +PP G + C
Sbjct: 354 CDRGYHMYCLNPPMSEPPEGSWSC 377
>gi|410913627|ref|XP_003970290.1| PREDICTED: zinc finger protein ubi-d4-like [Takifugu rubripes]
Length = 407
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 41/89 (46%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H CL+ W+C C+ C +C + + ++
Sbjct: 296 NQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQCIECKCCNVCGTSENDDQL 355
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 356 LFCDDCDRGYHMYCLSPPMTEPPEGSWSC 384
>gi|334312613|ref|XP_001373063.2| PREDICTED: histone acetyltransferase MYST3 [Monodelphis domestica]
Length = 1951
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
Length = 2227
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 14/101 (13%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 59
E C++ ++ C +C K YH CL ++ WS CP+C E
Sbjct: 460 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 516
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
CR D + + C C +AYH +C +PP ++ G + CP+
Sbjct: 517 CRICKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGDWRCPR 557
>gi|432897021|ref|XP_004076387.1| PREDICTED: zinc finger protein ubi-d4-like [Oryzias latipes]
Length = 399
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 41/84 (48%)
Query: 15 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
++ ++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC
Sbjct: 294 QSEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNMCGTSENDDQLLFCDD 353
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLC 98
CD YH YC +PP G + C
Sbjct: 354 CDRGYHMYCLNPPMSEPPEGSWSC 377
>gi|168045004|ref|XP_001774969.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673716|gb|EDQ60235.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 293
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 77/159 (48%), Gaps = 22/159 (13%)
Query: 569 VLQSLPKDSKPP---LRLKFRK-PNLE------NQNSQVSQPEEEKSLIKGQRSKRKRPS 618
+L S P P LRLK +K P E +++++ S+ + E ++ Q SK+KR +
Sbjct: 142 LLHSSPGSDSPVTRRLRLKIKKQPAREVVIPTTSKDNKASEHDGELAIKAHQTSKKKRTA 201
Query: 619 PFTEKTLFNEDEDAAQSNQDSLM--SEIMDANWILKKLGKDAIGKRVEVHQQSDNSWHKG 676
+A S Q L D +WIL++LG DAI KRVEV DN W+KG
Sbjct: 202 --------TPQRNAEGSTQRRLAIGDATDDDSWILQRLGTDAISKRVEVFWPIDNIWYKG 253
Query: 677 VVTDTVEGTSTLSITLDDSRVKTLELGKQGVRFVPQKQK 715
+ ++ + DD +TLE GK+ VR + +K
Sbjct: 254 TIVAVF--STQFCVDYDDGDQETLEFGKEKVRLLSTTKK 290
>gi|355697899|gb|EHH28447.1| Histone acetyltransferase MYST3 [Macaca mulatta]
Length = 2099
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 304 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 363
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 364 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 403
>gi|395507492|ref|XP_003758058.1| PREDICTED: histone acetyltransferase KAT6A [Sarcophilus harrisii]
Length = 1993
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|149410700|ref|XP_001509833.1| PREDICTED: histone acetyltransferase MYST3-like isoform 2
[Ornithorhynchus anatinus]
Length = 2003
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|335289596|ref|XP_003355927.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Sus scrofa]
gi|115527907|gb|AAI25154.1| DPF1 protein [Homo sapiens]
gi|119577167|gb|EAW56763.1| D4, zinc and double PHD fingers family 1, isoform CRA_d [Homo
sapiens]
gi|208966118|dbj|BAG73073.1| D4, zinc and double PHD fingers family 1 [synthetic construct]
Length = 387
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 274 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 331
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 332 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377
>gi|348514482|ref|XP_003444769.1| PREDICTED: zinc finger protein ubi-d4-like [Oreochromis niloticus]
Length = 405
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 41/89 (46%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H CL+ W+C C+ C +C + + ++
Sbjct: 294 NQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQCIECKCCNVCGTSENDDQL 353
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 354 LFCDDCDRGYHMYCLTPPMTEPPEGSWSC 382
>gi|345781619|ref|XP_003432152.1| PREDICTED: histone acetyltransferase KAT6A [Canis lupus familiaris]
Length = 2017
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|426359475|ref|XP_004046999.1| PREDICTED: histone acetyltransferase KAT6A [Gorilla gorilla
gorilla]
Length = 2005
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|392333207|ref|XP_003752828.1| PREDICTED: histone acetyltransferase KAT6B-like [Rattus norvegicus]
Length = 1855
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELVSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|13431818|sp|Q9QX66.2|DPF1_MOUSE RecName: Full=Zinc finger protein neuro-d4; AltName:
Full=BRG1-associated factor 45B; Short=BAF45B; AltName:
Full=D4, zinc and double PHD fingers family 1
Length = 387
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 274 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 331
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 332 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377
>gi|432099934|gb|ELK28828.1| Histone acetyltransferase MYST3 [Myotis davidii]
Length = 1861
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|7305309|ref|NP_038902.1| zinc finger protein neuro-d4 [Mus musculus]
gi|6649546|gb|AAF21455.1|U48238_1 zinc finger protein neuro-d4 [Mus musculus]
gi|30481687|gb|AAH52348.1| D4, zinc and double PHD fingers family 1 [Mus musculus]
gi|148692120|gb|EDL24067.1| neuronal d4 domain family member [Mus musculus]
Length = 388
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 275 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 332
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 333 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378
>gi|116003927|ref|NP_001070323.1| zinc finger protein neuro-d4 [Bos taurus]
gi|115305354|gb|AAI23606.1| D4, zinc and double PHD fingers family 1 [Bos taurus]
Length = 387
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 274 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 331
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 332 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377
>gi|205830434|ref|NP_001128628.1| zinc finger protein neuro-d4 isoform c [Homo sapiens]
gi|395751088|ref|XP_002829195.2| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Pongo abelii]
gi|395751090|ref|XP_003779218.1| PREDICTED: zinc finger protein neuro-d4 [Pongo abelii]
gi|395847019|ref|XP_003796184.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Otolemur
garnettii]
gi|395847023|ref|XP_003796186.1| PREDICTED: zinc finger protein neuro-d4 isoform 4 [Otolemur
garnettii]
gi|402905397|ref|XP_003915506.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Papio anubis]
gi|402905399|ref|XP_003915507.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Papio anubis]
gi|410983102|ref|XP_003997882.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Felis catus]
gi|193787694|dbj|BAG52900.1| unnamed protein product [Homo sapiens]
Length = 332
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 219 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 276
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 277 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 322
>gi|440901050|gb|ELR52053.1| Histone acetyltransferase MYST3 [Bos grunniens mutus]
Length = 1923
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|347800720|ref|NP_001099199.2| zinc finger protein neuro-d4 [Rattus norvegicus]
Length = 387
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 274 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 331
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 332 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 377
>gi|324512933|gb|ADY45341.1| PHD finger protein 10 [Ascaris suum]
Length = 481
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 52/119 (43%), Gaps = 25/119 (21%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNC--LKNWAQNRDLFHWSSWKCPSCRICEI 59
C +C G + +MLSC +C K H +C L N L + W C C+ C +
Sbjct: 351 CSIC------GVTNSAQMLSCATCSTKVHPDCAGLPERVVNVALNYM--WSCIECKKCTV 402
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC---QHPPHKNVSSGPYLCPK-------HTKCHSCG 108
C + + + MFC RCD YH +C PP +G ++C +KC+ C
Sbjct: 403 CEKPDNEDAMMFCDRCDRGYHTFCVGLSAPP-----TGTWVCTNFCADQTIQSKCNKCS 456
>gi|332241000|ref|XP_003269676.1| PREDICTED: histone acetyltransferase KAT6A [Nomascus leucogenys]
Length = 2004
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|326932701|ref|XP_003212452.1| PREDICTED: histone acetyltransferase MYST3-like [Meleagris
gallopavo]
Length = 1981
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|426256598|ref|XP_004023426.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
[Ovis aries]
Length = 1931
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|119918267|ref|XP_874495.2| PREDICTED: histone acetyltransferase KAT6A [Bos taurus]
gi|297491293|ref|XP_002698753.1| PREDICTED: histone acetyltransferase KAT6A [Bos taurus]
gi|296472344|tpg|DAA14459.1| TPA: MYST histone acetyltransferase (monocytic leukemia) 3 [Bos
taurus]
Length = 2018
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|431902227|gb|ELK08728.1| Histone acetyltransferase MYST3 [Pteropus alecto]
Length = 1731
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|126010798|gb|AAI33649.1| DPF1 protein [Bos taurus]
Length = 388
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 275 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 332
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 333 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378
>gi|118101408|ref|XP_424402.2| PREDICTED: histone acetyltransferase KAT6A [Gallus gallus]
Length = 1981
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|410915434|ref|XP_003971192.1| PREDICTED: zinc finger protein ubi-d4-like [Takifugu rubripes]
Length = 398
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 41/84 (48%)
Query: 15 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
++ ++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC
Sbjct: 293 QSEELVSCSDCGRSGHPSCLQFTPIMMAAVKTYRWQCIECKCCNMCGTSENDDQLLFCDD 352
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLC 98
CD YH YC +PP G + C
Sbjct: 353 CDRGYHMYCLNPPMSEPPEGSWSC 376
>gi|296222095|ref|XP_002757039.1| PREDICTED: histone acetyltransferase KAT6A [Callithrix jacchus]
Length = 2003
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|395857485|ref|XP_003801122.1| PREDICTED: histone acetyltransferase KAT6A [Otolemur garnettii]
Length = 2002
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|403303692|ref|XP_003942458.1| PREDICTED: histone acetyltransferase KAT6A [Saimiri boliviensis
boliviensis]
Length = 1968
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|348557736|ref|XP_003464675.1| PREDICTED: histone acetyltransferase MYST3 [Cavia porcellus]
Length = 2016
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 207 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 266
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 267 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 306
>gi|397505596|ref|XP_003846081.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
[Pan paniscus]
Length = 2002
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|114619920|ref|XP_001140373.1| PREDICTED: histone acetyltransferase KAT6A isoform 1 [Pan
troglodytes]
gi|332826020|ref|XP_003311745.1| PREDICTED: histone acetyltransferase KAT6A [Pan troglodytes]
Length = 2002
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|1517914|gb|AAC50662.1| monocytic leukaemia zinc finger protein [Homo sapiens]
Length = 2004
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|297682771|ref|XP_002819083.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
[Pongo abelii]
Length = 2010
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|150378463|ref|NP_001092882.1| histone acetyltransferase KAT6A [Homo sapiens]
gi|150378493|ref|NP_006757.2| histone acetyltransferase KAT6A [Homo sapiens]
gi|150378543|ref|NP_001092883.1| histone acetyltransferase KAT6A [Homo sapiens]
gi|215274095|sp|Q92794.2|KAT6A_HUMAN RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
AltName: Full=Monocytic leukemia zinc finger protein;
AltName: Full=Runt-related transcription factor-binding
protein 2; AltName: Full=Zinc finger protein 220
gi|119583643|gb|EAW63239.1| MYST histone acetyltransferase (monocytic leukemia) 3, isoform
CRA_a [Homo sapiens]
gi|208965270|dbj|BAG72649.1| MYST histone acetyltransferase (monocytic leukemia) 3 [synthetic
construct]
gi|225000792|gb|AAI72379.1| MYST histone acetyltransferase (monocytic leukemia) 3 [synthetic
construct]
Length = 2004
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|224082358|ref|XP_002306661.1| predicted protein [Populus trichocarpa]
gi|222856110|gb|EEE93657.1| predicted protein [Populus trichocarpa]
Length = 524
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 37/73 (50%), Gaps = 4/73 (5%)
Query: 26 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 85
GK YH CL N + + + W CPSC +C C D +K + C CD AYH YC
Sbjct: 377 GKYYHVRCLTN---RQLILYGPRWYCPSC-LCRGCLTDKDDDKIVLCDGCDHAYHLYCMI 432
Query: 86 PPHKNVSSGPYLC 98
PP +V G + C
Sbjct: 433 PPRISVPKGKWFC 445
>gi|226498206|ref|NP_001147779.1| LOC100281389 [Zea mays]
gi|195613724|gb|ACG28692.1| PHD-finger family protein [Zea mays]
gi|219885501|gb|ACL53125.1| unknown [Zea mays]
gi|413921539|gb|AFW61471.1| PHD-finger family [Zea mays]
Length = 558
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 50/105 (47%), Gaps = 15/105 (14%)
Query: 1 MCRLCFVGENEGCERARRMLSCKS--CGKK-YHRNCLKNWA----QNRDLFHWSSWKCPS 53
+C+LC E+E ++ + C C K YH CLK + R+L W CPS
Sbjct: 419 LCKLCGTCEDEN----KKFVVCGHGYCSFKFYHALCLKESQIASEKQRNL---KCWYCPS 471
Query: 54 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C +C C + D K + C CD AYH YC PP +V G + C
Sbjct: 472 C-LCRRCFKNKDDEKIVLCDGCDEAYHTYCMDPPRSSVPRGKWFC 515
>gi|149056396|gb|EDM07827.1| neuronal d4 domain family member, isoform CRA_b [Rattus norvegicus]
gi|166796914|gb|AAI59416.1| D4, zinc and double PHD fingers family 1 [Rattus norvegicus]
Length = 332
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 219 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 276
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 277 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 322
>gi|402878102|ref|XP_003902742.1| PREDICTED: histone acetyltransferase KAT6A [Papio anubis]
Length = 2010
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|147904561|ref|NP_001081346.1| zinc finger protein ubi-d4 B [Xenopus laevis]
gi|47682308|gb|AAH70839.1| LOC397786 protein [Xenopus laevis]
gi|52078456|gb|AAH82478.1| LOC397786 protein [Xenopus laevis]
Length = 387
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 43/99 (43%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C N+ + ++SC CG+ H +CL+ W+C C+ C I
Sbjct: 271 CDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNI 330
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP G + C
Sbjct: 331 CGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 369
>gi|440894955|gb|ELR47273.1| Zinc finger protein neuro-d4, partial [Bos grunniens mutus]
Length = 383
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 270 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 327
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 328 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 373
>gi|353523840|ref|NP_001088070.2| PHD finger protein 10 [Xenopus laevis]
gi|296439270|sp|Q63ZP1.2|PHF10_XENLA RecName: Full=PHD finger protein 10
Length = 506
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G++ + R+ R++ C C H +CL A+ + W+C C+ C I
Sbjct: 387 ICGICLKGKDANKKGRSERLIHCSQCDNSGHPSCLDMSAELVAVIKKYPWQCMECKTCII 446
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 447 CGQPHHEEEMMFCDTCDRGYHTFC 470
>gi|380792697|gb|AFE68224.1| histone acetyltransferase KAT6B, partial [Macaca mulatta]
Length = 1077
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|241640825|ref|XP_002409304.1| requim, req/dpf2, putative [Ixodes scapularis]
gi|215501331|gb|EEC10825.1| requim, req/dpf2, putative [Ixodes scapularis]
Length = 379
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 43/91 (47%)
Query: 8 GENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPN 67
GEN+ + ++SC CG+ H +CL+ W+C C+ C +C + + +
Sbjct: 270 GENKKTRQPEELVSCSDCGRSAHPSCLQFTPNMTVSVKKYRWQCIECKSCGLCGTSDNDD 329
Query: 68 KFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ +FC CD YH YC PP G + C
Sbjct: 330 QLLFCDDCDRGYHMYCLTPPLSEPPEGLWSC 360
>gi|302141752|emb|CBI18955.3| unnamed protein product [Vitis vinifera]
Length = 795
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 4/74 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH++CL + + W CPSC +C C D K + C CD AYH YC +P
Sbjct: 665 KYYHKSCLTS---TELRMYGPCWYCPSC-LCRACLTDRDDEKIILCDGCDHAYHIYCMNP 720
Query: 87 PHKNVSSGPYLCPK 100
P ++ G + C K
Sbjct: 721 PRTSIPRGKWFCRK 734
>gi|358338934|dbj|GAA39327.2| zinc finger protein ubi-d4, partial [Clonorchis sinensis]
Length = 331
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 36/80 (45%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
ML C CG+ H CL+ A W+C C+ C +C + + + +FC CD
Sbjct: 190 MLRCSDCGRCAHFTCLQFTANMVSSVRTYRWQCIECKTCWLCGTSENDEQMLFCDDCDRG 249
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC PP G + C
Sbjct: 250 YHMYCLSPPLSEPPEGSWSC 269
>gi|449488248|ref|XP_004176107.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6A
[Taeniopygia guttata]
Length = 2010
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|344281584|ref|XP_003412558.1| PREDICTED: histone acetyltransferase MYST3 [Loxodonta africana]
Length = 2011
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|255544976|ref|XP_002513549.1| trithorax, putative [Ricinus communis]
gi|223547457|gb|EEF48952.1| trithorax, putative [Ricinus communis]
Length = 1018
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 104 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 160
C CG ++P + G C C +L +YC +C K++ S+S V CD
Sbjct: 351 CEGCGVSLPFKLSKKMKSSITGGQFLCKTCAKLTKLKHYCGICKKIWNHSDSGSWVRCDG 410
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 196
C+ WVH +CD IS+ + F+ G Y CP C+ +
Sbjct: 411 CKVWVHAECDKISNSR---FKDLGATDYYCPACKAK 443
>gi|158300661|ref|XP_320523.4| AGAP012009-PA [Anopheles gambiae str. PEST]
gi|157013268|gb|EAA00692.4| AGAP012009-PA [Anopheles gambiae str. PEST]
Length = 2037
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 14/101 (13%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 59
E C++ ++ C +C K YH CL ++ WS CP+C E
Sbjct: 417 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 473
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
CR D + + C C +AYH +C +PP ++ G + CP+
Sbjct: 474 CRVCKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGEWRCPR 514
>gi|195333469|ref|XP_002033414.1| GM20421 [Drosophila sechellia]
gi|194125384|gb|EDW47427.1| GM20421 [Drosophila sechellia]
Length = 2123
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 1633 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 1687
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 1688 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 1738
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 1633 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 1692
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 1693 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 1725
>gi|124487239|ref|NP_001074618.1| histone acetyltransferase KAT6A [Mus musculus]
gi|148700926|gb|EDL32873.1| mCG13090 [Mus musculus]
Length = 2003
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|161076538|ref|NP_523701.3| toutatis, isoform A [Drosophila melanogaster]
gi|157400284|gb|AAF58638.3| toutatis, isoform A [Drosophila melanogaster]
Length = 2999
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2509 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2563
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2564 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2614
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2509 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2568
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 2569 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2601
>gi|68565903|sp|Q8BZ21.2|KAT6A_MOUSE RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
AltName: Full=Monocytic leukemia zinc finger homolog;
AltName: Full=Monocytic leukemia zinc finger protein
Length = 2003
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|417410666|gb|JAA51801.1| Putative phd zn-finger protein, partial [Desmodus rotundus]
Length = 433
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G+ G +A ++ C CG H +CL + + W+C C+ C +
Sbjct: 314 LCGICLKGKESGRRGKAESLIHCSQCGNSGHPSCLDMTTELVSMIKTYPWQCMECKTCIV 373
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 374 CGQPHHEEEMMFCDVCDRGYHTFC 397
>gi|327286450|ref|XP_003227943.1| PREDICTED: histone acetyltransferase MYST3-like [Anolis
carolinensis]
Length = 2017
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 209 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 268
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 269 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 308
>gi|297299302|ref|XP_001094798.2| PREDICTED: histone acetyltransferase MYST3 [Macaca mulatta]
Length = 1905
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 73/191 (38%), Gaps = 18/191 (9%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSV 118
CR G + + +FC CD +H C PP + G L +C + G+
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGNTLA-----VEACSGEL---GVEA 319
Query: 119 RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQR----WVHCQCDGISD 174
W + C A VKG C +R V +C++ W+ GI
Sbjct: 320 GWHMS-ACVGAFRGAKVKGG-CGQRGTSHRSHSGQTCVVVVMCKQLYWNWISTFTSGIRL 377
Query: 175 EKYLQFQVDGN 185
E+ Q DGN
Sbjct: 378 EE--QLAADGN 386
>gi|47225244|emb|CAG09744.1| unnamed protein product [Tetraodon nigroviridis]
Length = 393
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 41/89 (46%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H CL+ W+C C+ C +C + + ++
Sbjct: 285 NQKTGQSEELVSCSDCGRSGHPTCLQFTPVMMAAVKTYRWQCIECKCCNVCGTSENDDQL 344
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 345 LFCDDCDRGYHMYCLSPPMTEPPEGSWSC 373
>gi|356518627|ref|XP_003527980.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
max]
Length = 1067
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 46/96 (47%), Gaps = 6/96 (6%)
Query: 104 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 160
C +CG ++P L G C C RL +YC +C KV+ S+S V CD
Sbjct: 403 CEACGLSLPYKMLKKTKDSSPGGQFLCRTCARLTKSKHYCGICKKVWNHSDSGSWVRCDG 462
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 196
C+ WVH +CD IS + + Y CPTC+ +
Sbjct: 463 CKVWVHAECDKISSNLFKNLE---GTDYYCPTCKAK 495
>gi|118344120|ref|NP_001071881.1| zinc finger protein [Ciona intestinalis]
gi|70571741|dbj|BAE06812.1| zinc finger protein [Ciona intestinalis]
Length = 667
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 2/83 (2%)
Query: 16 ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRC 75
A ++ C C H +CL+ + + +W+C C+ C IC + MFC RC
Sbjct: 416 AEELIKCSQCDNHGHPSCLEMSVEQVSVIETYNWQCMECKTCTICSMPHREDLMMFCDRC 475
Query: 76 DAAYHCYCQHPPHKNVSSGPYLC 98
D YH +C + + SG + C
Sbjct: 476 DRGYHTFCVS--LRAIPSGVWAC 496
>gi|18203563|sp|Q9W636.2|REQUB_XENLA RecName: Full=Zinc finger protein ubi-d4 B; AltName: Full=Apoptosis
response zinc finger protein B; AltName: Full=Protein
requiem B; Short=xReq B
Length = 366
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 43/99 (43%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C N+ + ++SC CG+ H +CL+ W+C C+ C I
Sbjct: 250 CDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNI 309
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP G + C
Sbjct: 310 CGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 348
>gi|56001099|dbj|BAD72833.1| monocytic leukemia zinc finger protein [Rattus norvegicus]
Length = 1991
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 201 ICSFCLGTKEQNREKKPEDLISCADCGNSGHPSCLKFSPELTVRVRALRWQCIECKTCSS 260
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 261 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 300
>gi|442623365|ref|NP_001260899.1| toutatis, isoform G [Drosophila melanogaster]
gi|440214304|gb|AGB93432.1| toutatis, isoform G [Drosophila melanogaster]
Length = 3094
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2604 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2658
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2659 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2709
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 53 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 105
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2600 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2659
Query: 106 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
CG + P + + CD C R + Y P LKV R
Sbjct: 2660 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2696
>gi|351707582|gb|EHB10501.1| Histone acetyltransferase MYST3 [Heterocephalus glaber]
Length = 2068
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTIRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLIRMPKGMWIC 307
>gi|213972547|ref|NP_001094040.1| histone acetyltransferase KAT6A [Rattus norvegicus]
gi|68565633|sp|Q5TKR9.2|KAT6A_RAT RecName: Full=Histone acetyltransferase KAT6A; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 3; Short=MYST-3;
AltName: Full=Monocytic leukemia zinc finger homolog;
AltName: Full=Monocytic leukemia zinc finger protein
gi|149057780|gb|EDM09023.1| MYST histone acetyltransferase (monocytic leukemia) 3 [Rattus
norvegicus]
Length = 1998
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEDLISCADCGNSGHPSCLKFSPELTVRVRALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|161076540|ref|NP_001097270.1| toutatis, isoform E [Drosophila melanogaster]
gi|157400285|gb|ABV53763.1| toutatis, isoform E [Drosophila melanogaster]
Length = 3131
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2641 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2695
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2696 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2746
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 53 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 105
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2637 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2696
Query: 106 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
CG + P + + CD C R + Y P LKV R
Sbjct: 2697 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2733
>gi|33869836|gb|AAH21191.1| DPF1 protein, partial [Homo sapiens]
Length = 414
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 301 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 358
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 359 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 404
>gi|194883931|ref|XP_001976050.1| GG22641 [Drosophila erecta]
gi|190659237|gb|EDV56450.1| GG22641 [Drosophila erecta]
Length = 3148
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2650 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2704
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2705 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2755
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 53 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 105
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2646 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2705
Query: 106 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
CG + P + + CD C R + Y P LKV R
Sbjct: 2706 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2742
>gi|195582482|ref|XP_002081057.1| GD25895 [Drosophila simulans]
gi|194193066|gb|EDX06642.1| GD25895 [Drosophila simulans]
Length = 2944
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2454 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2508
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2509 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2559
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2454 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2513
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 2514 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2546
>gi|52354758|gb|AAH82869.1| LOC494765 protein [Xenopus laevis]
Length = 417
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G++ + R+ R++ C C H +CL A+ + W+C C+ C I
Sbjct: 298 ICGICLKGKDANKKGRSERLIHCSQCDNSGHPSCLDMSAELVAVIKKYPWQCMECKTCII 357
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 358 CGQPHHEEEMMFCDTCDRGYHTFC 381
>gi|291409041|ref|XP_002720841.1| PREDICTED: MYST histone acetyltransferase 2-like [Oryctolagus
cuniculus]
Length = 1806
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|270013653|gb|EFA10101.1| hypothetical protein TcasGA2_TC012280 [Tribolium castaneum]
Length = 559
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 52/119 (43%), Gaps = 13/119 (10%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+C +C V G RM++C+ C H +CL ++ ++W+CP C+ C IC
Sbjct: 290 VCSVCHVQNKMGPND--RMVACRECT---HYSCLNGDDMMLRMYPDNTWQCPHCKTCVIC 344
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHP-PHKNVSSGP--YLC-----PKHTKCHSCGSNV 111
T D C C AYH C P H+ P +LC P+ K + SN+
Sbjct: 345 FETSDAGYLTVCAVCADAYHAGCHQPRIHEKFIKPPAKWLCINCEMPEELKINEIQSNI 403
>gi|442623363|ref|NP_001260898.1| toutatis, isoform F [Drosophila melanogaster]
gi|440214303|gb|AGB93431.1| toutatis, isoform F [Drosophila melanogaster]
Length = 3058
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2568 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2622
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2623 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2673
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 53 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 105
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2564 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2623
Query: 106 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
CG + P + + CD C R + Y P LKV R
Sbjct: 2624 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2660
>gi|74197305|dbj|BAC32253.2| unnamed protein product [Mus musculus]
Length = 933
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|12642598|gb|AAK00302.1|AF314193_1 Toutatis [Drosophila melanogaster]
Length = 3109
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2590 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2644
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2645 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2695
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 53 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 105
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2586 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2645
Query: 106 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
CG + P + + CD C R + Y P LKV R
Sbjct: 2646 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2682
>gi|195485690|ref|XP_002091194.1| GE13512 [Drosophila yakuba]
gi|194177295|gb|EDW90906.1| GE13512 [Drosophila yakuba]
Length = 3129
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2636 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2690
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2691 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2741
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 53 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 105
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2632 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2691
Query: 106 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
CG + P + + CD C R + Y P LKV R
Sbjct: 2692 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2728
>gi|170036699|ref|XP_001846200.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
gi|167879513|gb|EDS42896.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
Length = 1982
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 46/102 (45%), Gaps = 15/102 (14%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI------------CE 58
E C++ ++ C +C K YH CL+ ++ WS CP+C E
Sbjct: 403 EVCQQGGEIILCDTCPKAYHLVCLEPELEDTPEGKWS---CPTCEADGGVAEDDDDEHQE 459
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
CR D + + C C +AYH +C PP ++ G + CP+
Sbjct: 460 FCRICKDGGELLCCDMCPSAYHTFCLTPPLDDIPDGDWRCPR 501
>gi|194752946|ref|XP_001958780.1| GF12391 [Drosophila ananassae]
gi|190620078|gb|EDV35602.1| GF12391 [Drosophila ananassae]
Length = 3047
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2542 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2596
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 2597 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 2647
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2542 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2601
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 2602 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2634
>gi|149470805|ref|XP_001506848.1| PREDICTED: zinc finger protein ubi-d4-like [Ornithorhynchus
anatinus]
Length = 425
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 36/76 (47%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 290 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 349
Query: 79 YHCYCQHPPHKNVSSG 94
YH YC PP G
Sbjct: 350 YHMYCLTPPMSEPPEG 365
>gi|6648954|gb|AAF21305.1|AF108133_1 neuro-d4 [Mus musculus]
Length = 323
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 210 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAPVRTYRWQCIECKSCSLC 267
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 268 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 313
>gi|148745647|gb|AAI42660.1| MYST3 protein [Homo sapiens]
Length = 1149
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|426235049|ref|XP_004011503.1| PREDICTED: PHD finger protein 10 [Ovis aries]
Length = 451
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C CG H +CL A+ + W+C C+ C +
Sbjct: 332 LCGICLKGKESSKRGKAEPLVHCSQCGNSGHPSCLDMPAELVSMIKTYPWQCMECKACIV 391
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 392 CGQPHHEEEMMFCDVCDRGYHTFC 415
>gi|148669525|gb|EDL01472.1| mCG123147, isoform CRA_c [Mus musculus]
Length = 938
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 226 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 285
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 286 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 325
>gi|449455758|ref|XP_004145618.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Cucumis
sativus]
Length = 1073
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C +C RL +YC +C K++ S+S V CD C+ WVH +CD IS F+ G+
Sbjct: 434 CKSCTRLTNSKHYCGICKKIWNHSDSGSWVRCDGCKVWVHAECDKISSN---LFKDLGST 490
Query: 187 QYRCPTCRGE 196
Y CPTC+ +
Sbjct: 491 DYFCPTCKAK 500
>gi|55726215|emb|CAH89880.1| hypothetical protein [Pongo abelii]
Length = 1275
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|355779658|gb|EHH64134.1| Histone acetyltransferase MYST3 [Macaca fascicularis]
Length = 2276
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 444 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 503
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 504 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 543
>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
echinatior]
Length = 1304
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 76/184 (41%), Gaps = 54/184 (29%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK--CPSCRICEI 59
C LC E EG +R R ++ +CGK YH NCL +W Q+ HW + CP +C
Sbjct: 606 CFLC--NEREG-DRIRCIVP--ACGKHYHSNCLLSWPQS----HWQGGRLTCPY-HVCHT 655
Query: 60 C-------RRTGDPNKFMF-CRRCDAAYHCYCQHPPHKNV--SSGPYLCPKHTKCHSCGS 109
C +R+ PN+ M C RC ++YH P +V ++ +CPKH K
Sbjct: 656 CSSDNPQDKRSRAPNEKMARCVRCPSSYHASTLCLPAGSVILTANQIICPKHYK------ 709
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
P L+ W C C R ++CCD C H +C
Sbjct: 710 -APHPPLNAAW------CFLCTR-------------------GGSLICCDTCPTSFHLEC 743
Query: 170 DGIS 173
GI+
Sbjct: 744 LGIN 747
>gi|348562787|ref|XP_003467190.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4-like
[Cavia porcellus]
Length = 358
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 245 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 302
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 303 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 348
>gi|312081277|ref|XP_003142959.1| hypothetical protein LOAG_07378 [Loa loa]
Length = 147
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 1 MCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C LC +N+ + +++SC CG+ H +CLK W+C C+ C
Sbjct: 28 VCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIECKSCA 87
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
IC + + ++ +FC CD +H YC PP G + C
Sbjct: 88 ICGFSDNDDQLLFCDDCDRGFHLYCLRPPLSQAPEGEWSC 127
>gi|74228562|dbj|BAE25366.1| unnamed protein product [Mus musculus]
Length = 1291
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|255564717|ref|XP_002523353.1| trithorax, putative [Ricinus communis]
gi|223537441|gb|EEF39069.1| trithorax, putative [Ricinus communis]
Length = 1057
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 35/68 (51%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C K++ S+ VCCD C WVH +CD IS + + + N
Sbjct: 424 CKHCAKLRKSKQYCGICKKIWHHSDGGNWVCCDGCNVWVHAECDNISRKLFKDLE---NF 480
Query: 187 QYRCPTCR 194
Y CP CR
Sbjct: 481 DYYCPDCR 488
>gi|63146269|gb|AAH95974.1| Myst4 protein, partial [Mus musculus]
Length = 828
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|410912120|ref|XP_003969538.1| PREDICTED: PHD finger protein 10-like [Takifugu rubripes]
Length = 493
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 1 MCRLCFVGENEGCERAR--RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C +C G E +R R ++ C C H +CL + + SW+C C+ C
Sbjct: 370 ICGICQKG-REANKRGRPEALIHCSQCDNSGHPSCLDMSGELVSVIQTYSWQCMECKTCT 428
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYC 83
+C++ ++ MFC +CD YH +C
Sbjct: 429 VCQQPHHEDEMMFCDKCDRGYHTFC 453
>gi|154757359|gb|AAI51762.1| MYST4 protein [Bos taurus]
Length = 349
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 214 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 273
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 274 CRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 313
>gi|148232579|ref|NP_001090745.1| D4, zinc and double PHD fingers family 1 [Xenopus (Silurana)
tropicalis]
gi|120537304|gb|AAI29026.1| dpf1 protein [Xenopus (Silurana) tropicalis]
Length = 383
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 3/98 (3%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 270 CDFCLGGAKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCILC 327
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ + ++ +FC CD YH YC PP G + C
Sbjct: 328 GTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 365
>gi|339522307|gb|AEJ84318.1| D4 zinc and double PHD fingers family 2 [Capra hircus]
Length = 391
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 43/98 (43%), Gaps = 6/98 (6%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ + ++SC CG+ H +CL+ W+C C+ C IC + + ++
Sbjct: 283 NKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQL 342
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSC 107
+FC CD YH YC P G + +CH C
Sbjct: 343 LFCDACDRGYHMYCLTPSMSEPPEGSW------RCHLC 374
>gi|54020946|ref|NP_001005717.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
gi|49522323|gb|AAH75306.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
Length = 428
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%)
Query: 9 ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
EN+ M+SC CG+ H +CL+ W+C C+ C +C + + ++
Sbjct: 321 ENKKTGSKEEMVSCADCGRSGHPSCLQFSPNMIISVKKYPWQCIECKSCGLCGTSDNDDQ 380
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 381 LLFCDDCDRGYHMYCLKPPLSEPPEGSWSC 410
>gi|359492251|ref|XP_002284634.2| PREDICTED: uncharacterized protein LOC100247132 [Vitis vinifera]
Length = 386
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 4/74 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH++CL + + W CPSC +C C D K + C CD AYH YC +P
Sbjct: 256 KYYHKSCLTSTELR---MYGPCWYCPSC-LCRACLTDRDDEKIILCDGCDHAYHIYCMNP 311
Query: 87 PHKNVSSGPYLCPK 100
P ++ G + C K
Sbjct: 312 PRTSIPRGKWFCRK 325
>gi|392353369|ref|XP_003751480.1| PREDICTED: histone acetyltransferase KAT6B-like [Rattus norvegicus]
Length = 1640
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELVSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 315
>gi|6850865|emb|CAB71104.1| putative protein [Arabidopsis thaliana]
Length = 902
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C +++ S+ VCCD C WVH +CD I++E++ + + +
Sbjct: 338 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHNN-- 395
Query: 187 QYRCPTCR 194
Y CP C+
Sbjct: 396 -YYCPDCK 402
>gi|338710071|ref|XP_001916345.2| PREDICTED: zinc finger protein neuro-d4-like [Equus caballus]
Length = 205
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 92 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 149
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC CD YH YC PP G + LC +H K
Sbjct: 150 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 195
>gi|358336360|dbj|GAA54889.1| histone acetyltransferase MYST3 [Clonorchis sinensis]
Length = 1190
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 30/68 (44%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
L CK CG + H CL W + + S W+C C+ C +C+ + C CD
Sbjct: 808 FLICKDCGLRAHPTCLDYWPELTERARQSPWQCTDCKTCTVCQNKQITTDLLVCDACDKG 867
Query: 79 YHCYCQHP 86
+H C P
Sbjct: 868 FHIECHVP 875
>gi|357631309|gb|EHJ78887.1| hypothetical protein KGM_12125 [Danaus plexippus]
Length = 501
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 41/86 (47%), Gaps = 2/86 (2%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+C +C + + G R++ C+ C K H +CL++ + ++W+CP C+ C +C
Sbjct: 271 VCTVCLIQKTRGSND--RLVECRDCNNKAHLSCLQSGSGILKPRPDNTWQCPHCKTCVVC 328
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHP 86
T D C C +YH C P
Sbjct: 329 CETNDAGILTVCSICSDSYHALCHTP 354
>gi|195150317|ref|XP_002016101.1| GL10676 [Drosophila persimilis]
gi|194109948|gb|EDW31991.1| GL10676 [Drosophila persimilis]
Length = 3244
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2873 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2927
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2928 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIARAP 2978
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 53 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 105
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2869 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2928
Query: 106 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
CG + P + + CD C R + Y P LKV R
Sbjct: 2929 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2965
>gi|308488788|ref|XP_003106588.1| hypothetical protein CRE_15947 [Caenorhabditis remanei]
gi|308253938|gb|EFO97890.1| hypothetical protein CRE_15947 [Caenorhabditis remanei]
Length = 452
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 4/92 (4%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M+ C C YH C++ + L W C CR+C IC + N+ +FC +CD
Sbjct: 352 MICCSVCQIVYHPRCIEMPDRMAALVRTYEWSCVDCRVCSICNKPEKENEIVFCDKCDRG 411
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN 110
+H +C K++ G ++C T C N
Sbjct: 412 FHTFCVG--LKSLPRGTWIC--DTYCSETNRN 439
>gi|198457110|ref|XP_001360553.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
gi|198135863|gb|EAL25128.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
Length = 3214
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2720 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2774
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2775 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCIARAP 2825
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2720 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2779
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 2780 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2812
>gi|29387208|gb|AAH48199.1| MYST4 protein, partial [Homo sapiens]
gi|33874216|gb|AAH14143.1| MYST4 protein, partial [Homo sapiens]
Length = 325
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTTNVKALRWQCIECKTCSA 274
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CRVQGRNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 314
>gi|74208866|dbj|BAE21185.1| unnamed protein product [Mus musculus]
Length = 1148
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|195436452|ref|XP_002066182.1| GK22224 [Drosophila willistoni]
gi|194162267|gb|EDW77168.1| GK22224 [Drosophila willistoni]
Length = 3148
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2657 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2711
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2712 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2762
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2657 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2716
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 2717 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2749
>gi|321470558|gb|EFX81534.1| hypothetical protein DAPPUDRAFT_347174 [Daphnia pulex]
Length = 1890
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 45/108 (41%), Gaps = 12/108 (11%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C G+ E ++L C C K YH C + N W ++C R C
Sbjct: 1611 CQFCHSGDKED-----QLLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQRNC 1665
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCH 105
+C + G+ + C +C AYH C PP V G +LC CH
Sbjct: 1666 IVCGKPGNKTISVLCDQCPKAYHIECLQPPLAKVPRGKWLC---VLCH 1710
Score = 45.8 bits (107), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 39/100 (39%), Gaps = 17/100 (17%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK-------HTKCHSCGS 109
C+ C ++ + C CD YH YC PP N+ G + C + C CG
Sbjct: 1611 CQFCHSGDKEDQLLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQRNCIVCGK 1670
Query: 110 NVPGN-GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
PGN +SV CD C + + P KV R
Sbjct: 1671 --PGNKTISV-------LCDQCPKAYHIECLQPPLAKVPR 1701
>gi|224132822|ref|XP_002321418.1| SET domain protein [Populus trichocarpa]
gi|222868414|gb|EEF05545.1| SET domain protein [Populus trichocarpa]
Length = 1070
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 123 GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQV 182
G C C RL ++C +C KV+ S+S CD C+ W+H +CD IS F+
Sbjct: 427 GQFLCKKCARLTKSKHFCGICKKVWNHSDSGSWARCDGCKVWIHAECDRISSN---HFKD 483
Query: 183 DGNLQYRCPTCRGE 196
G + Y CPTC+ +
Sbjct: 484 LGGIDYYCPTCKAK 497
>gi|145332921|ref|NP_001078326.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
gi|332646730|gb|AEE80251.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
Length = 982
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C +++ S+ VCCD C WVH +CD I++E++ + + +
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408
Query: 187 QYRCPTCR 194
Y CP C+
Sbjct: 409 NYYCPDCK 416
>gi|195027235|ref|XP_001986489.1| GH20497 [Drosophila grimshawi]
gi|193902489|gb|EDW01356.1| GH20497 [Drosophila grimshawi]
Length = 3415
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2877 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2931
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2932 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2982
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 41/103 (39%), Gaps = 13/103 (12%)
Query: 53 SCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCH 105
S + C+ C + +K + C CD YH YC P N+ G + C KC
Sbjct: 2873 SLQNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCI 2932
Query: 106 SCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
CG + P + + CD C R + Y P LKV R
Sbjct: 2933 VCGGHRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2969
>gi|327276253|ref|XP_003222884.1| PREDICTED: zinc finger protein neuro-d4-like [Anolis carolinensis]
Length = 388
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 3/98 (3%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 275 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 332
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ + ++ +FC CD YH YC PP G + C
Sbjct: 333 GTSENDDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSC 370
>gi|145339751|ref|NP_191733.3| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
gi|259016183|sp|Q9M364.2|ATX3_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX3; AltName:
Full=Protein SET DOMAIN GROUP 14; AltName:
Full=Trithorax-homolog protein 3; Short=TRX-homolog
protein 3
gi|225898735|dbj|BAH30498.1| hypothetical protein [Arabidopsis thaliana]
gi|332646729|gb|AEE80250.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
Length = 1018
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C +++ S+ VCCD C WVH +CD I++E++ + + +
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408
Query: 187 QYRCPTCR 194
Y CP C+
Sbjct: 409 NYYCPDCK 416
>gi|110742931|dbj|BAE99361.1| trithorax 3 [Arabidopsis thaliana]
Length = 1018
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C +++ S+ VCCD C WVH +CD I++E++ + + +
Sbjct: 352 CKHCSKLRKSNQYCGICKRIWHPSDDGDWVCCDGCDVWVHAECDNITNERFKELEHN--- 408
Query: 187 QYRCPTCR 194
Y CP C+
Sbjct: 409 NYYCPDCK 416
>gi|195123885|ref|XP_002006432.1| GI21040 [Drosophila mojavensis]
gi|193911500|gb|EDW10367.1| GI21040 [Drosophila mojavensis]
Length = 2976
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2620 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2674
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2675 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2725
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2620 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2679
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 2680 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2712
>gi|356518577|ref|XP_003527955.1| PREDICTED: uncharacterized protein LOC100795906 [Glycine max]
Length = 646
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 4/74 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH CL N+ + W CPSC +C +C D ++ + C CD AYH YC P
Sbjct: 499 KYYHVRCL---TINQLKSYGHCWYCPSC-LCRVCLTDQDDDRIVLCDGCDHAYHIYCMKP 554
Query: 87 PHKNVSSGPYLCPK 100
P ++ G + C K
Sbjct: 555 PRTSIPRGNWFCRK 568
>gi|403165473|ref|XP_003325473.2| hypothetical protein PGTG_07306 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165738|gb|EFP81054.2| hypothetical protein PGTG_07306 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1108
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 2/96 (2%)
Query: 19 MLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK-FMFCRRCD 76
M+SC CG+ H +C++ N + W C CR C C + GD ++ + C CD
Sbjct: 294 MVSCWECGQSGHFSCMELNNLTIKSHAKSYPWLCLECRRCHGCDKKGDDDQNMLLCAVCD 353
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+H C +PP + V SG + CP + C +P
Sbjct: 354 RGWHGECLNPPLRTVPSGDFTCPFDHQSTQCIPPLP 389
>gi|356573885|ref|XP_003555086.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
max]
Length = 1003
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C RL +YC +C K++ S+S V CD C+ WVH +CD IS F+ G
Sbjct: 364 CKTCARLTKSKHYCGICKKIWNYSDSGSWVRCDGCKVWVHAECDKISSN---LFKNLGGS 420
Query: 187 QYRCPTCR 194
Y CPTC+
Sbjct: 421 DYFCPTCK 428
>gi|4808454|dbj|BAA77570.1| Requiem protein [Xenopus laevis]
Length = 386
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 41/89 (46%), Gaps = 2/89 (2%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ ++SC CG+ H +CL+ A W+C C+ C IC + N
Sbjct: 282 NKKTNQSEELVSCSDCGRSGHPSCLQFTAVMMAAVKTYRWQCIECKCCNICGTSE--NDL 339
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC PP G + C
Sbjct: 340 LFCDDCDRGYHMYCLVPPVAEPPEGSWSC 368
>gi|195382825|ref|XP_002050129.1| GJ21968 [Drosophila virilis]
gi|194144926|gb|EDW61322.1| GJ21968 [Drosophila virilis]
Length = 3086
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 2585 CQFCTSGENED-----KLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 2639
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C + P
Sbjct: 2640 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCITRAP 2690
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 2585 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 2644
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 2645 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 2677
>gi|356507582|ref|XP_003522543.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine
max]
Length = 1035
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 45/96 (46%), Gaps = 6/96 (6%)
Query: 104 CHSCGSNVPGNGLSVRWFL---GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 160
C +CG ++P L G C C RL +YC +C KV+ S+S V CD
Sbjct: 370 CEACGLSLPYKMLKKTKDSSPGGQFLCKTCARLTKSKHYCGICKKVWNHSDSGSWVRCDG 429
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 196
C+ WVH +CD I + + Y CPTC+ +
Sbjct: 430 CKVWVHAECDKICSNLFKNLE---GTDYYCPTCKAK 462
>gi|26331782|dbj|BAC29621.1| unnamed protein product [Mus musculus]
Length = 1010
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|443690042|gb|ELT92280.1| hypothetical protein CAPTEDRAFT_224752 [Capitella teleta]
Length = 1892
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 1/99 (1%)
Query: 1 MCRLCFVGENEGCERA-RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C +++ + +++SC CG H +CLK + W+C C+ C +
Sbjct: 210 LCSFCLGADDKNRDGVPEQLISCADCGNCGHPSCLKFSDSLVERVGHMRWQCIECKKCSL 269
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C TG + +FC CD H C PP + G ++C
Sbjct: 270 CGETGKEDNMLFCDACDRGIHMECCIPPLTSAPEGKWVC 308
>gi|301121094|ref|XP_002908274.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103305|gb|EEY61357.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 634
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 9/72 (12%)
Query: 133 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDG-------ISD--EKYLQFQVD 183
L +G YCPVC +VY D + VCCD C+ WVH CD + D E + +
Sbjct: 562 LRAQGQYCPVCNEVYEDDDQNTFVCCDSCELWVHGACDPSLTPYVVLLDMMESIIAAMAN 621
Query: 184 GNLQYRCPTCRG 195
+Y CP C G
Sbjct: 622 TEDKYICPLCAG 633
>gi|336370765|gb|EGN99105.1| hypothetical protein SERLA73DRAFT_160636 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1506
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 28/51 (54%)
Query: 49 WKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
WKC C+ CE+CR GD + +FC CD +H C PP + G + CP
Sbjct: 9 WKCLECKNCEVCREKGDDERILFCDFCDRGWHMDCLQPPLQESPPGKWHCP 59
>gi|358349267|ref|XP_003638660.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
gi|355504595|gb|AES85798.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
Length = 1149
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 44/98 (44%), Gaps = 14/98 (14%)
Query: 104 CHSCGSNVPGNGLS-------VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 156
C SCG +P + FL C C +L YC +C K++ S+ V
Sbjct: 283 CASCGLMLPCKTMKKVKDSSHAPQFL----CKHCVKLRKSKQYCGICKKIWHHSDGGNWV 338
Query: 157 CCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
CCD C WVH +CD IS E + + N Y CP C+
Sbjct: 339 CCDGCNVWVHAECDKISTEHFKDLE---NTDYYCPDCK 373
>gi|159164819|pdb|2YSM|A Chain A, Solution Structure Of The First And Second Phd Domain
From MyeloidLYMPHOID OR MIXED-Lineage Leukemia Protein
3 Homolog
Length = 111
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 22 CKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHC 81
C +CG+ YH CL + W+CP C++C+ C+++G+ +K + C CD YH
Sbjct: 25 CTTCGQHYHGMCLDIAVTP---LKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHT 81
Query: 82 YCQHPPHKNVSSGPYLC 98
+C P K+V + + C
Sbjct: 82 FCLQPVMKSVPTNGWKC 98
>gi|71895933|ref|NP_001025643.1| PHD finger protein 10 [Xenopus (Silurana) tropicalis]
gi|60550967|gb|AAH91612.1| PHD finger protein 10 [Xenopus (Silurana) tropicalis]
Length = 430
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 50/115 (43%), Gaps = 10/115 (8%)
Query: 1 MCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G++ + R+ R++ C C H +CL + + W+C C+ C I
Sbjct: 311 ICGICLKGKDANKKGRSERLIHCSQCDNSGHPSCLDMSPELVTVIKKYPWQCMECKTCII 370
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 114
C + + MFC CD YH +C + SG ++C C NVP
Sbjct: 371 CGQPHHEEEMMFCDTCDRGYHTFCVG--LGALPSGRWIC-------DCCQNVPST 416
>gi|443725765|gb|ELU13216.1| hypothetical protein CAPTEDRAFT_167868 [Capitella teleta]
Length = 236
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 11/100 (11%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C C K H +C+ + S W+C C+ C +C GDP+ +FC CD
Sbjct: 34 LLICTDCQAKAHPSCMDYSSDLARRARRSPWQCIDCKTCCLCEDAGDPDAMLFCDACDKG 93
Query: 79 YHCYCQHPPHKNVSSGPYLC-----------PKHTKCHSC 107
YH C P ++ +G ++C P+ T C SC
Sbjct: 94 YHMSCHSPVIEDKPTGKWVCSRCCQEIEADAPETTFCGSC 133
>gi|395535344|ref|XP_003769687.1| PREDICTED: PHD finger protein 10 [Sarcophilus harrisii]
Length = 598
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
MC +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 479 MCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSIIKTYPWQCMECKTCII 538
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 539 CGQPHHEEEMMFCDVCDRGYHTFC 562
>gi|326923554|ref|XP_003208000.1| PREDICTED: histone acetyltransferase MYST4-like [Meleagris
gallopavo]
Length = 2028
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 5/103 (4%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCG---KKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 56
+C C E+ ++ +LSC CG K H +CLK + W+C C+
Sbjct: 215 ICSFCLGTKESNREKKPEELLSCADCGSSGKLEHPSCLKFCPELTSNVKALRWQCIECKT 274
Query: 57 CEICRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C CR G + + +FC CD +H C PP + G ++C
Sbjct: 275 CSACRIQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKGMWIC 317
>gi|26333367|dbj|BAC30401.1| unnamed protein product [Mus musculus]
Length = 461
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|148744463|gb|AAI42960.1| MYST3 protein [Homo sapiens]
Length = 815
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|26344145|dbj|BAC35729.1| unnamed protein product [Mus musculus]
Length = 435
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
Length = 1983
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 14/101 (13%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 59
E C++ ++ C +C K YH CL ++ WS CP+C E
Sbjct: 382 EVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWS---CPTCEAEGPADEDDDEHQEF 438
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
CR D + + C C +AYH +C PP ++ G + CP+
Sbjct: 439 CRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPR 479
>gi|242014022|ref|XP_002427697.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212512132|gb|EEB14959.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 639
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 51/109 (46%), Gaps = 3/109 (2%)
Query: 1 MCRLC-FVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C++C F + + R SC CGKK H C+++ Q F S W+C C+ C
Sbjct: 390 LCKICSFNIDFKSSRNLDRWTSCHFCGKKAHVTCIQDTEQ-WTRFKLSKWQCRDCKNCST 448
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 108
C+ + C CD AYH C + K+ S+ + C K ++ S G
Sbjct: 449 CKNKFSDGDLIVCGLCDDAYHLTCAN-VKKSKSNQKWFCNKCSRVFSDG 496
>gi|327291384|ref|XP_003230401.1| PREDICTED: zinc finger protein ubi-d4-like, partial [Anolis
carolinensis]
Length = 178
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 39/84 (46%)
Query: 15 RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
+ ++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC
Sbjct: 75 QPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDD 134
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLC 98
CD YH YC PP G + C
Sbjct: 135 CDRGYHMYCLTPPMSEPPEGSWSC 158
>gi|328875267|gb|EGG23632.1| hypothetical protein DFA_05766 [Dictyostelium fasciculatum]
Length = 1603
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 30/132 (22%), Positives = 56/132 (42%), Gaps = 15/132 (11%)
Query: 1 MCRLCFVGENEGCE----RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI 56
+C++C G+ ++ C CG+ +H C+ + + +WKC C+
Sbjct: 857 LCKVCLSGDVPSVVGKSFVPSTLICCVDCGEVFHTFCIGLPEEVASVIDRLTWKCADCKC 916
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGP----YLCPKHTKCHSCG--SN 110
C +C + + + C RCD +H YC + + P ++CP +K S G +
Sbjct: 917 CSVCMALDNEDLLLICDRCDLGFHTYC-----AGLDALPEEDDWVCPSCSKIQSNGDEQD 971
Query: 111 VPGNGLSVRWFL 122
V + +W L
Sbjct: 972 VKVETVKEKWLL 983
>gi|432875795|ref|XP_004072911.1| PREDICTED: histone acetyltransferase KAT6A-like [Oryzias latipes]
Length = 1964
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 236 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTARVKALWWQCIECKTCSS 295
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 296 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 335
>gi|4808456|dbj|BAA77571.1| Requiem protein [Xenopus laevis]
Length = 198
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 43/99 (43%), Gaps = 2/99 (2%)
Query: 2 CRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C N+ + ++SC CG+ H +CL+ W+C C+ C I
Sbjct: 82 CDFCLGDSNTNKKSNQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNI 141
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC PP G + C
Sbjct: 142 CGTSENDDQLLFCDDCDRGYHMYCLSPPVAEPPEGSWSC 180
>gi|301090958|ref|XP_002895674.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
gi|262097084|gb|EEY55136.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
Length = 2943
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/262 (22%), Positives = 86/262 (32%), Gaps = 88/262 (33%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNR----DLFHWSSWKCPSCRICEICRRTG---------- 64
+ C CG+ +H C+ + R D + W+CP+C++CEIC + G
Sbjct: 1593 FIFCVDCGEGFHSFCVSGMSAARLEDSDQLR-AYWRCPNCKMCEICGQPGAVCGAESSAR 1651
Query: 65 ---------DPNK-------------FMFCRRCDAAYHCYCQHPPHK---------NVSS 93
PN + C CD +H C P K SS
Sbjct: 1652 VPATAGNVDSPNTDTETSLELAKTESLLLCGHCDRGFHGSCLVPAIKLPLNPKKRDGTSS 1711
Query: 94 GPYL-CPKHTKCHSCGSN--------VPGNGLSV-------------------------- 118
P + C C +C S+ P + L
Sbjct: 1712 SPVIYCASCVSCVNCKSSREYLDSDVAPNDHLDALERTYSYEQDKCLHCHNRKEREVQAL 1771
Query: 119 ---RWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI--- 172
L DA R CP+C + + D++ ++ CD C+RWVH CD +
Sbjct: 1772 RERTRLLTEVWMDAARRSKKDAEKCPLCRRKW-DADLEELMQCDACERWVHPPCDDLLKK 1830
Query: 173 SDEKYLQFQVDGNLQYRCPTCR 194
++Y D N Y C CR
Sbjct: 1831 EPKRYQTLVSDPNAVYVCAACR 1852
Score = 46.2 bits (108), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 47/138 (34%), Gaps = 29/138 (21%)
Query: 62 RTGDPNK--FMFCRRCDAAYHCYCQHPPHKNVS--------------SGPYLCPKHTKCH 105
R GD + + C +CD +H C PPH +S P++C T C
Sbjct: 1063 RRGDAGEEELLACAQCDNQFHATCCDPPHAPLSLVSPDDGDVLVADLKTPFVCSDCTSCA 1122
Query: 106 SC----GSNVPGNGLSVRW------FLGYTCCDACGRLFVKGNYCPVCLKVYRDSE---S 152
C S RW C C + +C VC V D + S
Sbjct: 1123 GCRCRKSDEARAEEPSPRWSQWRLPLQTAALCTTCIPYYKANRFCGVCNLVLDDEQLATS 1182
Query: 153 TPMVCCDVCQRWVHCQCD 170
++ C C W+H C+
Sbjct: 1183 VDLLTCATCHHWIHADCE 1200
>gi|297267436|ref|XP_002808108.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein ubi-d4-like
[Macaca mulatta]
Length = 391
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|154152087|ref|NP_001093826.1| zinc finger protein ubi-d4 [Bos taurus]
gi|296218741|ref|XP_002755572.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Callithrix
jacchus]
gi|426252022|ref|XP_004019718.1| PREDICTED: zinc finger protein ubi-d4 [Ovis aries]
gi|118582243|gb|ABL07500.1| zinc-finger protein ubi-d4 [Capra hircus]
gi|151557067|gb|AAI49970.1| DPF2 protein [Bos taurus]
gi|152941218|gb|ABS45046.1| D4, zinc and double PHD fingers family 2 [Bos taurus]
gi|296471617|tpg|DAA13732.1| TPA: D4, zinc and double PHD fingers family 2 [Bos taurus]
gi|417400089|gb|JAA47013.1| Putative transcription factor requiem/neuro-d4 [Desmodus rotundus]
Length = 391
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|291414421|ref|XP_002723450.1| PREDICTED: D4, zinc and double PHD fingers family 2-like
[Oryctolagus cuniculus]
Length = 388
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 289 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 348
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 349 YHMYCLTPSMSEPPEGSWSC 368
>gi|395544822|ref|XP_003774305.1| PREDICTED: zinc finger protein ubi-d4 [Sarcophilus harrisii]
Length = 423
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 40/89 (44%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ + ++SC CG+ H +CL+ W+C C+ C IC + + ++
Sbjct: 315 NKKTGQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQL 374
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC P G + C
Sbjct: 375 LFCDDCDRGYHMYCLTPSMSEPPEGSWSC 403
>gi|347966735|ref|XP_001689318.2| AGAP001877-PA [Anopheles gambiae str. PEST]
gi|333469922|gb|EDO63223.2| AGAP001877-PA [Anopheles gambiae str. PEST]
Length = 2382
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 51/115 (44%), Gaps = 5/115 (4%)
Query: 1 MCRLCFVGENEG-CERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C EN+ + + C C +K H +C+ + W+C C++C
Sbjct: 2153 LCAVCMGPENKNKYSKPELFVRCTRCRRKAHPSCIGMSSVMYKRVQQYKWQCSECKLCMK 2212
Query: 60 CRR--TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
C R +K ++C +CD YH C+ +N+ G + C T C CG+ P
Sbjct: 2213 CNRQPAAIDSKMVYCDQCDRGYHLACKG--LRNLPEGRWHCNICTICGLCGAQTP 2265
>gi|390470774|ref|XP_003734353.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Callithrix
jacchus]
Length = 405
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 306 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 365
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 366 YHMYCLTPSMSEPPEGSWSC 385
>gi|30584805|gb|AAP36655.1| Homo sapiens requiem, apoptosis response zinc finger gene
[synthetic construct]
gi|61370771|gb|AAX43549.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
Length = 392
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|72015501|ref|XP_785947.1| PREDICTED: uncharacterized protein LOC580820 [Strongylocentrotus
purpuratus]
Length = 1065
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 45/103 (43%), Gaps = 9/103 (8%)
Query: 1 MCRLCFVGENEGCERA-RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C LC + ++ C C H +CL+ + W+C C+ C
Sbjct: 847 ICGLCLKDRRSNTKGVPENLVHCSQCDNSGHPSCLEMNDELVATIKTYPWQCMECKTCSQ 906
Query: 60 CRRTGDP---NKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
C GDP +K MFC +CD YH +C ++ +G +LCP
Sbjct: 907 C---GDPTHEDKMMFCDKCDRGYHTFCVG--LTDIPTGNWLCP 944
>gi|410922269|ref|XP_003974605.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase
KAT6A-like [Takifugu rubripes]
Length = 2234
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 43/100 (43%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E +R ++SC CG H +CLK + W+C C+ C
Sbjct: 229 ICSFCLGTKEQNRDKRPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 288
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + +FC CD +H C PP + G ++C
Sbjct: 289 CQDQGKNAENMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328
>gi|431910278|gb|ELK13351.1| Zinc finger protein ubi-d4 [Pteropus alecto]
Length = 391
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|224066781|ref|XP_002302212.1| predicted protein [Populus trichocarpa]
gi|222843938|gb|EEE81485.1| predicted protein [Populus trichocarpa]
Length = 604
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 36/73 (49%), Gaps = 4/73 (5%)
Query: 26 GKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 85
GK YH CL Q H W CPSC +C +C D +K + C CD AYH YC
Sbjct: 458 GKYYHVRCLTT-RQIDSCGH--RWYCPSC-LCRVCITDRDDDKIVLCDGCDHAYHLYCMI 513
Query: 86 PPHKNVSSGPYLC 98
PP +V G + C
Sbjct: 514 PPRISVPKGKWFC 526
>gi|26389386|dbj|BAC25728.1| unnamed protein product [Mus musculus]
Length = 803
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 2/100 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKQPEELVSCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
CR G + + +FC CD +H C PP + G ++C
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 307
>gi|168001639|ref|XP_001753522.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695401|gb|EDQ81745.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 428
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 48/99 (48%), Gaps = 22/99 (22%)
Query: 113 GNGLSVRWFL----------GYTCCDACGRLFVKGNYCPVCLKVYRDSESTPM-----VC 157
G G + RW + T C+AC F +G YCP C+++YR+ + V
Sbjct: 72 GKGGTCRWHVRNYGSQKDPKHVTLCNACKINFDQGKYCPFCVQIYREKDPDSFDGKEWVG 131
Query: 158 CD--VCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
CD C+RWVH +C+ IS QVD Y CP+CR
Sbjct: 132 CDNRTCRRWVHVECE-ISGGN----QVDSTAFYLCPSCR 165
>gi|5454004|ref|NP_006259.1| zinc finger protein ubi-d4 [Homo sapiens]
gi|350534556|ref|NP_001233580.1| zinc finger protein ubi-d4 [Pan troglodytes]
gi|73983120|ref|XP_866588.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Canis lupus
familiaris]
gi|332250193|ref|XP_003274238.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Nomascus
leucogenys]
gi|397516924|ref|XP_003828671.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Pan paniscus]
gi|402892867|ref|XP_003909628.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Papio anubis]
gi|403293482|ref|XP_003937745.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Saimiri
boliviensis boliviensis]
gi|410974412|ref|XP_003993641.1| PREDICTED: zinc finger protein ubi-d4 [Felis catus]
gi|426369135|ref|XP_004051552.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Gorilla gorilla
gorilla]
gi|2842711|sp|Q92785.2|REQU_HUMAN RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName:
Full=BRG1-associated factor 45D; Short=BAF45D; AltName:
Full=D4, zinc and double PHD fingers family 2; AltName:
Full=Protein requiem
gi|2121234|gb|AAB58307.1| requiem homolog [Homo sapiens]
gi|2529705|gb|AAB81203.1| requiem [Homo sapiens]
gi|15928853|gb|AAH14889.1| D4, zinc and double PHD fingers family 2 [Homo sapiens]
gi|28144169|gb|AAO26041.1| requiem, apoptosis response zinc finger gene [Homo sapiens]
gi|30582275|gb|AAP35364.1| requiem, apoptosis response zinc finger gene [Homo sapiens]
gi|61361059|gb|AAX41982.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
gi|61361064|gb|AAX41983.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
gi|119594781|gb|EAW74375.1| D4, zinc and double PHD fingers family 2, isoform CRA_a [Homo
sapiens]
gi|119594782|gb|EAW74376.1| D4, zinc and double PHD fingers family 2, isoform CRA_a [Homo
sapiens]
gi|123983164|gb|ABM83323.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|123997873|gb|ABM86538.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|158257320|dbj|BAF84633.1| unnamed protein product [Homo sapiens]
gi|208967739|dbj|BAG72515.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|343962233|dbj|BAK62704.1| zinc-finger protein ubi-d4 [Pan troglodytes]
gi|355566318|gb|EHH22697.1| Protein requiem [Macaca mulatta]
gi|355751970|gb|EHH56090.1| Protein requiem [Macaca fascicularis]
gi|380815318|gb|AFE79533.1| zinc finger protein ubi-d4 [Macaca mulatta]
gi|410218232|gb|JAA06335.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410249592|gb|JAA12763.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410288496|gb|JAA22848.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410336195|gb|JAA37044.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
Length = 391
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|311247329|ref|XP_003122585.1| PREDICTED: zinc finger protein ubi-d4 [Sus scrofa]
Length = 391
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|149725409|ref|XP_001492666.1| PREDICTED: zinc finger protein ubi-d4 [Equus caballus]
Length = 391
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|47227720|emb|CAG09717.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2476
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E +R ++SC CG H +CLK + W+C C+ C
Sbjct: 441 ICSFCLGTKEQNRDKRPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 500
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 501 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 540
>gi|444724505|gb|ELW65108.1| Zinc finger protein ubi-d4 [Tupaia chinensis]
Length = 412
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 313 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 372
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 373 YHMYCLTPSMSEPPEGSWSC 392
>gi|197098008|ref|NP_001127678.1| zinc finger protein ubi-d4 [Pongo abelii]
gi|332250195|ref|XP_003274239.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Nomascus
leucogenys]
gi|397516926|ref|XP_003828672.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Pan paniscus]
gi|402892869|ref|XP_003909629.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Papio anubis]
gi|403293484|ref|XP_003937746.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Saimiri
boliviensis boliviensis]
gi|426369137|ref|XP_004051553.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Gorilla gorilla
gorilla]
gi|56403615|emb|CAI29608.1| hypothetical protein [Pongo abelii]
Length = 405
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 306 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 365
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 366 YHMYCLTPSMSEPPEGSWSC 385
>gi|395852334|ref|XP_003798694.1| PREDICTED: zinc finger protein ubi-d4 [Otolemur garnettii]
Length = 391
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|440907399|gb|ELR57553.1| Zinc finger protein ubi-d4, partial [Bos grunniens mutus]
Length = 380
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 281 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 340
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 341 YHMYCLTPSMSEPPEGSWSC 360
>gi|427797535|gb|JAA64219.1| Putative histone acetyltransferase myst family, partial
[Rhipicephalus pulchellus]
Length = 2019
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 40/95 (42%), Gaps = 7/95 (7%)
Query: 16 ARRMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
A +LSC SC H +CLK N L W+C CR+C C + + C
Sbjct: 248 AEELLSCHSCTLSAHPSCLKHNKELALVLLSSRKWQCSQCRMCSRCGNKKEGEHLLCCEV 307
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 109
CD+ +H C PP G + KC SC S
Sbjct: 308 CDSHFHLRCLKPPLLKAPKGSW------KCTSCSS 336
>gi|432091134|gb|ELK24346.1| Zinc finger protein ubi-d4 [Myotis davidii]
Length = 405
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 306 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 365
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 366 YHMYCLTPSMSEPPEGSWSC 385
>gi|334324322|ref|XP_001381625.2| PREDICTED: PHD finger protein 10-like [Monodelphis domestica]
Length = 662
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
MC +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 543 MCGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSIIKTYPWQCMECKTCII 602
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 603 CGQPHHEEEMMFCDVCDRGYHTFC 626
>gi|427797307|gb|JAA64105.1| Putative histone acetyltransferase myst family, partial
[Rhipicephalus pulchellus]
Length = 2011
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 40/95 (42%), Gaps = 7/95 (7%)
Query: 16 ARRMLSCKSCGKKYHRNCLK-NWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRR 74
A +LSC SC H +CLK N L W+C CR+C C + + C
Sbjct: 248 AEELLSCHSCTLSAHPSCLKHNKELALVLLSSRKWQCSQCRMCSRCGNKKEGEHLLCCEV 307
Query: 75 CDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGS 109
CD+ +H C PP G + KC SC S
Sbjct: 308 CDSHFHLRCLKPPLLKAPKGSW------KCTSCSS 336
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 43/101 (42%), Gaps = 14/101 (13%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI-----------CEI 59
E C++ ++ C +C + YH CL + WS CP C E
Sbjct: 355 EVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGKWS---CPHCEAEGTQEQDDDEHNEF 411
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 412 CRLCKDGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPR 452
>gi|348522233|ref|XP_003448630.1| PREDICTED: histone acetyltransferase MYST3-like [Oreochromis
niloticus]
Length = 2258
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 243 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTARVKALWWQCIECKTCSS 302
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 303 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 342
>gi|344295884|ref|XP_003419640.1| PREDICTED: zinc finger protein ubi-d4 [Loxodonta africana]
Length = 391
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C IC + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|405951463|gb|EKC19373.1| Bromodomain adjacent to zinc finger domain protein 2B [Crassostrea
gigas]
Length = 2317
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 43/114 (37%), Gaps = 34/114 (29%)
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 115
+C++CRR + + + C CD YH YC P N+ G + C+ C S G
Sbjct: 2033 LCQLCRRDDNEAQLLLCDGCDQGYHTYCFKPKMDNIPDGDWY------CYECISKATGE- 2085
Query: 116 LSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
CC CG+ + +V CD+C R +H C
Sbjct: 2086 ---------PCCVVCGKRMGR------------------IVECDLCPRAIHLDC 2112
Score = 42.4 bits (98), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 48/117 (41%), Gaps = 24/117 (20%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC-------PS 53
+C+LC +NE ++L C C + YH C K N W ++C P
Sbjct: 2033 LCQLCRRDDNEA-----QLLLCDGCDQGYHTYCFKPKMDNIPDGDWYCYECISKATGEPC 2087
Query: 54 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSN 110
C +C +R G + + C C A H C +PP + P+ C +C +N
Sbjct: 2088 CVVC--GKRMG---RIVECDLCPRAIHLDCLNPPLPRM-------PRKWVCPACTAN 2132
>gi|290976199|ref|XP_002670828.1| predicted protein [Naegleria gruberi]
gi|284084391|gb|EFC38084.1| predicted protein [Naegleria gruberi]
Length = 349
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 42/74 (56%), Gaps = 10/74 (13%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSES----TPMVCCDVCQRWVHCQCDGISDEKYLQFQV 182
C+ACG + KG++C C ++Y++S++ P + CD C RWVH C E+ F++
Sbjct: 36 CNACGLHYKKGHFCIYCNQIYKESDADDKEEPWIGCDSCHRWVHQNC-----ERQNGFEI 90
Query: 183 DGNLQYRCPTCRGE 196
N Y CP CR +
Sbjct: 91 KPN-GYLCPCCRNQ 103
>gi|74219112|dbj|BAE26697.1| unnamed protein product [Mus musculus]
Length = 391
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 40/89 (44%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ + ++SC CG+ H +CL+ W+C C+ C +C + + ++
Sbjct: 283 NKKTRQPEELVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQL 342
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC P G + C
Sbjct: 343 LFCDDCDRGYHMYCLTPSMSEPPEGSWSC 371
>gi|328872173|gb|EGG20540.1| hypothetical protein DFA_00401 [Dictyostelium fasciculatum]
Length = 436
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 51/114 (44%), Gaps = 8/114 (7%)
Query: 107 CGSNVPGNGLSVRWFLG---YTCCDACGRLFVKGNYCPVCLKVYRDSE----STPMVCCD 159
CGS PG G + +W G C++CG +K C +C VY E S + CD
Sbjct: 278 CGSTTPGKGPTCKWRKGPNGEVLCNSCGLQNMKKPKCLLCGIVYNSKEAMASSISWIRCD 337
Query: 160 VCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWR 212
C++WV +CD G+ D L Y CP CR + + + + R L +
Sbjct: 338 DCKQWVMSKCDSGMGDISLYDDSNPNPLHYSCPKCRTDPSKPKTTRNNHRSLLK 391
>gi|328716042|ref|XP_003245819.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Acyrthosiphon pisum]
Length = 2002
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 45/104 (43%), Gaps = 20/104 (19%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 56
E C++ ++ C +C + YH CL ++ WS CP C
Sbjct: 378 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCESEGGQEQEEDEHQEF 434
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
C +C+ G+ + C C AAYH +C PP +V G + CP+
Sbjct: 435 CRVCKDGGE---LLCCDSCPAAYHTFCLSPPITDVPDGDWKCPR 475
>gi|118402055|ref|XP_001033347.1| PHD-finger family protein [Tetrahymena thermophila]
gi|89287695|gb|EAR85684.1| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 510
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 6/121 (4%)
Query: 77 AAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVK 136
+ ++ YC P KN+SS P + TK S L + T CD C + + +
Sbjct: 165 SCFNQYC--PKDKNISSVPSPQQESTKKQSVNGQYKQAKLE-KTGEKVTFCDLCCQRYQE 221
Query: 137 GNYCPVCLKVYRDS---ESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTC 193
++C C +VY D + + CD C RW H C+ +K N+QY CP C
Sbjct: 222 KHFCYYCQQVYFDDYNIDDKEWILCDTCDRWCHLHCEEEKIKKAFSSSQSENVQYDCPRC 281
Query: 194 R 194
R
Sbjct: 282 R 282
>gi|118344196|ref|NP_001071920.1| zinc finger protein [Ciona intestinalis]
gi|92081536|dbj|BAE93315.1| zinc finger protein [Ciona intestinalis]
Length = 257
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 34/82 (41%)
Query: 17 RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCD 76
ML CK C K H +C+K + W+C C+ C C D +FC CD
Sbjct: 34 EEMLFCKDCDAKAHPSCMKYSSTLAAQALSYPWQCVECKTCSSCFTARDGASILFCDGCD 93
Query: 77 AAYHCYCQHPPHKNVSSGPYLC 98
AYH C P G +LC
Sbjct: 94 KAYHMLCHEPEVITKPEGKWLC 115
>gi|351701967|gb|EHB04886.1| Zinc finger protein ubi-d4 [Heterocephalus glaber]
Length = 601
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 502 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 561
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 562 YHMYCLTPSMSEPPEGSWSC 581
>gi|426388550|ref|XP_004060697.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4
[Gorilla gorilla gorilla]
Length = 388
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 6/106 (5%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 275 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 332
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
+ + ++ +FC D YH YC PP G + LC +H K
Sbjct: 333 GTSENDDQLLFCDDSDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 378
>gi|354505054|ref|XP_003514587.1| PREDICTED: zinc finger protein ubi-d4-like [Cricetulus griseus]
gi|344258641|gb|EGW14745.1| Zinc finger protein ubi-d4 [Cricetulus griseus]
Length = 391
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|297821052|ref|XP_002878409.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324247|gb|EFH54668.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1002
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C +++ S+ VCCD C WVH CD I++E++ + + +
Sbjct: 355 CKHCSKLRKFNQYCGICKRIWHPSDDGDWVCCDGCNVWVHAGCDNITNERFKELEHN--- 411
Query: 187 QYRCPTCR 194
Y CP C+
Sbjct: 412 NYYCPDCK 419
>gi|6755314|ref|NP_035392.1| zinc finger protein ubi-d4 [Mus musculus]
gi|2500148|sp|Q61103.1|REQU_MOUSE RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName:
Full=BRG1-associated factor 45D; Short=BAF45D; AltName:
Full=D4, zinc and double PHD fingers family 2; AltName:
Full=Protein requiem
gi|1167972|gb|AAC52783.1| ubi-d4 [Mus musculus]
gi|12836275|dbj|BAB23583.1| unnamed protein product [Mus musculus]
gi|15215228|gb|AAH12709.1| D4, zinc and double PHD fingers family 2 [Mus musculus]
gi|74184334|dbj|BAE25702.1| unnamed protein product [Mus musculus]
gi|74201274|dbj|BAE26098.1| unnamed protein product [Mus musculus]
gi|74201435|dbj|BAE26153.1| unnamed protein product [Mus musculus]
gi|74206142|dbj|BAE23543.1| unnamed protein product [Mus musculus]
gi|148701237|gb|EDL33184.1| D4, zinc and double PHD fingers family 2 [Mus musculus]
Length = 391
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|157817959|ref|NP_001101986.1| zinc finger protein ubi-d4 [Rattus norvegicus]
gi|149062118|gb|EDM12541.1| D4, zinc and double PHD fingers family 2 (predicted) [Rattus
norvegicus]
Length = 391
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|432852854|ref|XP_004067418.1| PREDICTED: PHD finger protein 10-like [Oryzias latipes]
Length = 442
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ R ++ C C H +CL ++ + W+C C+ C +
Sbjct: 319 ICGICQKGKESNKKGRPEALIHCSQCDNSGHPSCLDMSSELVSVIQTYRWQCMECKTCTV 378
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C++ ++ MFC +CD YH +C
Sbjct: 379 CQQPHHEDEMMFCDKCDRGYHTFC 402
>gi|359490859|ref|XP_002268525.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
vinifera]
Length = 1094
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 10/120 (8%)
Query: 104 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 160
C CG +P + V G C C RL YC +C K+ S+S V CD
Sbjct: 422 CDGCGLRIPLKSTKKMKVLTPKGRFLCKTCDRLLKSKQYCGICKKMQNQSDSGTWVRCDG 481
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 220
C+ WVH +C IS + F+ G Y CP C+ + + E + E W+ K +K+
Sbjct: 482 CKVWVHAECGKISSK---LFKNLGATDYYCPACKAK----FNFELSDSERWQPKVKCNKN 534
>gi|170595283|ref|XP_001902318.1| Hypothetical C28H8.9 in chromosome III [Brugia malayi]
gi|158590068|gb|EDP28834.1| Hypothetical C28H8.9 in chromosome III, putative [Brugia malayi]
Length = 149
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 1 MCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C LC +N+ + +++SC CG+ H +CLK W+C C+ C
Sbjct: 30 VCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIECKSCA 89
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
IC + + ++ +FC CD +H YC PP G + C
Sbjct: 90 ICGFSDNDDQLLFCDDCDRGFHLYCLRPPLPQAPEGEWSC 129
>gi|327272024|ref|XP_003220786.1| PREDICTED: PHD finger protein 10-like [Anolis carolinensis]
Length = 489
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 39/84 (46%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL A+ + W+C C+ C I
Sbjct: 370 ICGICLKGKESNKKGKAEALIHCSQCENSGHPSCLDMSAELVAIIKTYPWQCMECKTCII 429
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 430 CGQPHHEEEMMFCDLCDRGYHTFC 453
>gi|302144034|emb|CBI23139.3| unnamed protein product [Vitis vinifera]
Length = 1018
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 10/120 (8%)
Query: 104 CHSCGSNVP---GNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV 160
C CG +P + V G C C RL YC +C K+ S+S V CD
Sbjct: 352 CDGCGLRIPLKSTKKMKVLTPKGRFLCKTCDRLLKSKQYCGICKKMQNQSDSGTWVRCDG 411
Query: 161 CQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGECYQVRDLEDAVRELWRRKDMADKD 220
C+ WVH +C IS + F+ G Y CP C+ + + E + E W+ K +K+
Sbjct: 412 CKVWVHAECGKISSK---LFKNLGATDYYCPACKAK----FNFELSDSERWQPKVKCNKN 464
>gi|293332508|ref|NP_001169841.1| uncharacterized protein LOC100383733 [Zea mays]
gi|224031939|gb|ACN35045.1| unknown [Zea mays]
gi|413941582|gb|AFW74231.1| hypothetical protein ZEAMMB73_231911 [Zea mays]
Length = 555
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 9/104 (8%)
Query: 1 MCRLCFVGENEGCERARRMLSCKS--CGKK-YHRNCLKN-WAQNRDLFHWSSWKCPSCRI 56
+C+ C E+E R+ + C C K YH CLK + + W CPSC +
Sbjct: 417 LCKNCGTCEDED----RKFMVCGHGLCSFKFYHVLCLKERQIASEKQKNLKCWYCPSC-L 471
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
C C + D K + C CD AYH YC PP ++V G + C +
Sbjct: 472 CRRCFKDKDDEKIVLCDGCDEAYHIYCMDPPCESVPRGKWFCTR 515
>gi|345316943|ref|XP_001509649.2| PREDICTED: PHD finger protein 10, partial [Ornithorhynchus
anatinus]
Length = 468
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 349 ICGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTTELVSMIKTYPWQCMECKTCII 408
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 409 CGQPHHEEEMMFCDVCDRGYHTFC 432
>gi|344306723|ref|XP_003422034.1| PREDICTED: PHD finger protein 10-like [Loxodonta africana]
Length = 533
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C +
Sbjct: 414 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTVELVSMIKTYPWQCMECKTCIV 473
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + ++ MFC CD YH +C
Sbjct: 474 CGQPHHEDEMMFCDVCDRGYHTFC 497
>gi|13938144|gb|AAH07188.1| Dpf2 protein, partial [Mus musculus]
Length = 351
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 252 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 311
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 312 YHMYCLTPSMSEPPEGSWSC 331
>gi|357130254|ref|XP_003566765.1| PREDICTED: uncharacterized protein LOC100821699 [Brachypodium
distachyon]
Length = 918
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 9/101 (8%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCG---KKYHRNCLKN-WAQNRDLFHWSSWKCPSCRIC 57
C++C E++ +R L C K YH CLK+ ++ W CPSC +C
Sbjct: 762 CKMCGTPEDDD----KRFLICGHSHCPYKYYHIRCLKSKQIASKVQRDKPCWYCPSC-LC 816
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+C GD + + C CD AYH YC P +V G + C
Sbjct: 817 RVCLSDGDDEQTILCDGCDEAYHLYCMTPRRTSVPKGKWYC 857
>gi|297679669|ref|XP_002817646.1| PREDICTED: PHD finger protein 10 [Pongo abelii]
Length = 498
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 379 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 438
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 439 CGQPHHEEEMMFCDMCDRGYHTFC 462
>gi|187469691|gb|AAI66788.1| Dpf2 protein [Rattus norvegicus]
Length = 390
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 291 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 350
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 351 YHMYCLTPSMSEPPEGSWSC 370
>gi|402591828|gb|EJW85757.1| Dpf2 protein [Wuchereria bancrofti]
Length = 149
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 1 MCRLCF--VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICE 58
+C LC +N+ + +++SC CG+ H +CLK W+C C+ C
Sbjct: 30 VCDLCLGDCNQNKKTMKPEQLISCHDCGRSGHPSCLKFTDNMLTSTGKYGWQCIECKSCA 89
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
IC + + ++ +FC CD +H YC PP G + C
Sbjct: 90 ICGFSDNDDQLLFCDDCDRGFHLYCLRPPLPQAPEGEWSC 129
>gi|345778329|ref|XP_532272.3| PREDICTED: PHD finger protein 10 isoform 1 [Canis lupus familiaris]
Length = 410
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|320167424|gb|EFW44323.1| jumonji [Capsaspora owczarzaki ATCC 30864]
Length = 2147
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%)
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
+CE+C R D +K + C CD YH YC HPP V G + CP
Sbjct: 388 VCEVCLRPDDESKIILCDSCDHGYHVYCLHPPLPRVPDGDWYCP 431
>gi|348680969|gb|EGZ20785.1| hypothetical protein PHYSODRAFT_298767 [Phytophthora sojae]
Length = 606
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 32/61 (52%), Gaps = 1/61 (1%)
Query: 133 LFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPT 192
L G YCPVC +VY D + + VCCD C+ WVH CD S Y+ ++D T
Sbjct: 539 LRALGQYCPVCSEVYEDDDQSTFVCCDSCELWVHGACDP-SLTPYVAVKIDRATTLISNT 597
Query: 193 C 193
C
Sbjct: 598 C 598
>gi|390367174|ref|XP_003731194.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Strongylocentrotus purpuratus]
Length = 2202
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 46/102 (45%), Gaps = 15/102 (14%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR------------ICE 58
E C++ ++ C +C K +H CL + WS CP+C E
Sbjct: 352 EVCQQGGEIILCDTCPKAFHLVCLDPELETAPEGKWS---CPNCEGEGIPEPEPADEHME 408
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
CR D + + C +C ++YH +C +PP + + ++CP+
Sbjct: 409 FCRVCHDGGELLCCEQCPSSYHIFCLNPPLRKIPDDDWVCPR 450
>gi|338722861|ref|XP_001499514.3| PREDICTED: PHD finger protein 10 [Equus caballus]
Length = 451
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E +A ++ C C H +CL + + W+C C+ C I
Sbjct: 332 LCGICLKGKETNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 391
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 392 CGQPHHEEEMMFCDVCDRGYHTFC 415
>gi|323450933|gb|EGB06812.1| hypothetical protein AURANDRAFT_28864 [Aureococcus anophagefferens]
Length = 266
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 2/97 (2%)
Query: 14 ERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC--RRTGDPNKFMF 71
+R +L C CG+ H C + ++W+CP+C++CE+C + D ++ ++
Sbjct: 66 KRGGELLFCVDCGEACHAMCASTPIDSMSDAARATWRCPNCKVCELCGESKVDDESRLLY 125
Query: 72 CRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 108
C CD AYH C P G ++C C CG
Sbjct: 126 CDLCDKAYHLDCVTPKLDVAPPGRWICGLCVTCRHCG 162
>gi|302393562|ref|NP_001116784.3| histone acetyltransferase MYST3 [Danio rerio]
Length = 2247
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 230 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 289
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 290 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 329
>gi|190339720|gb|AAI63677.1| MYST histone acetyltransferase (monocytic leukemia) 3 [Danio rerio]
Length = 2246
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 229 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 288
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 289 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328
>gi|348564966|ref|XP_003468275.1| PREDICTED: zinc finger protein ubi-d4-like [Cavia porcellus]
Length = 391
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 292 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 351
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 352 YHMYCLTPSMSEPPEGSWSC 371
>gi|296439269|sp|Q4V7A6.2|PHF10_RAT RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
factor 45a; Short=BAF45a
Length = 497
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 378 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 437
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 438 CGQPHHEEEMMFCDVCDRGYHTFC 461
>gi|298708138|emb|CBJ30479.1| myst-related protein [Ectocarpus siliculosus]
Length = 1620
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 15/101 (14%)
Query: 12 GC---ERARRMLSCKS--CGKKYHRNCLKNWAQNRDLFHWSSWKCPSC-----RI-CEIC 60
GC +R +L C CG +YH CL W W CP C R+ C +C
Sbjct: 560 GCMQNDRPTEILQCDGPMCGLEYHYGCLDPPLDKVPSSKW--WYCPDCVRTDNRVGCRVC 617
Query: 61 RRTGDPNKFMFC--RRCDAAYHCYCQHPPHKNVSSGPYLCP 99
+ D +K + C C+ +H YC PP K V G + CP
Sbjct: 618 KVDVDYDKLLKCDGPGCELEWHTYCLKPPVKTVPKGDFFCP 658
>gi|291190717|ref|NP_001167047.1| Zinc finger protein ubi-d4 [Salmo salar]
gi|223647844|gb|ACN10680.1| Zinc finger protein ubi-d4 [Salmo salar]
Length = 402
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 40/89 (44%)
Query: 10 NEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKF 69
N+ ++ + SC CG+ H +CL+ W+C C+ C IC + + ++
Sbjct: 292 NQKTGQSEELQSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNICGTSENDDQL 351
Query: 70 MFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+FC CD YH YC P G + C
Sbjct: 352 LFCDDCDRGYHMYCLSPAMAEPPEGSWSC 380
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480
>gi|12841710|dbj|BAB25323.1| unnamed protein product [Mus musculus]
Length = 410
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTVELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|402868771|ref|XP_003898462.1| PREDICTED: PHD finger protein 10 [Papio anubis]
Length = 498
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 379 LCGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTCII 438
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 439 CGQPHHEEEMMFCDVCDRGYHTFC 462
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 376 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 475
>gi|294868766|ref|XP_002765684.1| bromodomain-containing protein, putative [Perkinsus marinus ATCC
50983]
gi|239865763|gb|EEQ98401.1| bromodomain-containing protein, putative [Perkinsus marinus ATCC
50983]
Length = 1071
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%)
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
YCPVCL+ + MV CD C+ WVH +CD + ++ D +++Y CP CR
Sbjct: 353 YCPVCLRAWSTVWCDDMVQCDGCEFWVHAKCDNFTCKEQFTELTDKDVKYFCPICR 408
>gi|229594235|ref|XP_001024908.3| PHD-finger family protein [Tetrahymena thermophila]
gi|225566987|gb|EAS04663.3| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 425
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 1/90 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L CK+C K +H C + + + W C C++C C + N+ + C CD
Sbjct: 279 ILVCKNCNKSFHAECC-DPPLEKGIVSKYDWFCTECKLCIACNKNTKENELLMCDCCDRP 337
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 108
+H C P ++ G + C KC CG
Sbjct: 338 FHMSCLEPARTDIPEGRWFCKDCEKCPCCG 367
>gi|413950797|gb|AFW83446.1| hypothetical protein ZEAMMB73_198866 [Zea mays]
Length = 870
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 7/99 (7%)
Query: 98 CPKHTKCHSCGSNVPGNGLS-VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 156
C + +C SCG+ P + + + + C C R+ YC +CLK + V
Sbjct: 372 CRRVLQCESCGNCFPNKDTNKMVYVMEQLACRLCARILALKKYCGICLKNLQHKYGGRRV 431
Query: 157 CCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 194
CC C+ WVH +CD S+ K LQ + +Y CP CR
Sbjct: 432 CCHGCESWVHAECDENCSNLKDLQ-----DKKYHCPYCR 465
>gi|294948371|ref|XP_002785717.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
ATCC 50983]
gi|239899765|gb|EER17513.1| mixed-lineage leukemia protein, mll, putative [Perkinsus marinus
ATCC 50983]
Length = 1340
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%)
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
YCPVCL+ + MV CD C+ WVH +CD + ++ D +++Y CP CR
Sbjct: 164 YCPVCLRAWSTVWCDDMVQCDGCEFWVHAKCDNFTCKEQFTELTDKDVKYFCPICR 219
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 381 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 480
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 372 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 471
>gi|3540206|gb|AAC34356.1| Hypothetical protein [Arabidopsis thaliana]
Length = 1250
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 36/72 (50%), Gaps = 4/72 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH CL + + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 1107 KYYHIRCLTS---RQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 1162
Query: 87 PHKNVSSGPYLC 98
P ++V +G + C
Sbjct: 1163 PCESVPNGEWFC 1174
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 382 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 481
>gi|449445828|ref|XP_004140674.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
gi|449487413|ref|XP_004157614.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
Length = 1055
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC VC K++ S+ VCCD C WVH +CD IS + F+ +
Sbjct: 415 CKHCHKLRQSKQYCGVCKKIWHHSDGGNWVCCDGCNVWVHAECDKISSK---LFKDLAHS 471
Query: 187 QYRCPTCR 194
+Y CP C+
Sbjct: 472 EYYCPDCK 479
>gi|6648956|gb|AAF21306.1|AF108134_1 ubi-d4/requiem [Mus musculus]
Length = 380
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 281 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 340
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 341 YHMYCLTPSMSEPPEGSWSC 360
>gi|169146772|emb|CAQ13473.1| novel protein similar to human D4, zinc and double PHD fingers
family 1 (DPF1) [Danio rerio]
Length = 127
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 44/98 (44%), Gaps = 3/98 (3%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C C G + GC ++SC CG+ H +CL+ W+C C+ C +C
Sbjct: 14 CDFCLGGSKKTGCPE--DLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 71
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ + ++ +FC CD YH YC PP G + C
Sbjct: 72 GTSENDDQLLFCDDCDRGYHMYCLSPPMSEPPEGSWSC 109
>gi|194328734|ref|NP_060758.2| PHD finger protein 10 isoform a [Homo sapiens]
gi|296439276|sp|Q8WUB8.3|PHF10_HUMAN RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
factor 45a; Short=BAF45a; AltName: Full=XAP135
gi|119567827|gb|EAW47442.1| PHD finger protein 10, isoform CRA_a [Homo sapiens]
Length = 498
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 379 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 438
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 439 CGQPHHEEEMMFCDMCDRGYHTFC 462
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 372 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 471
>gi|15292405|gb|AAK93471.1| LP06732p [Drosophila melanogaster]
gi|220947368|gb|ACL86227.1| tou-PB [synthetic construct]
gi|220956830|gb|ACL90958.1| tou-PB [synthetic construct]
Length = 683
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 47/117 (40%), Gaps = 17/117 (14%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC----PSCRIC 57
C+ C GENE ++L C C K YH C K N W ++C + R C
Sbjct: 193 CQFCTSGENE-----DKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKC 247
Query: 58 EIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
+C R K ++C C AYH C PP V G + CH C S P
Sbjct: 248 IVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWY------CHGCISRAP 298
Score = 46.6 bits (109), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC-------PKHTKCHSCGS 109
C+ C + +K + C CD YH YC P N+ G + C KC CG
Sbjct: 193 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGG 252
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYR 148
+ P + + CD C R + Y P LKV R
Sbjct: 253 HRPSPVGKMIY------CDLCPRAYHADCYIPPLLKVPR 285
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 43/101 (42%), Gaps = 14/101 (13%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR-----------ICEI 59
E C++ ++ C +C + YH C + WS CP C E
Sbjct: 329 EVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWS---CPHCEGEGIKEQEEDDHMEF 385
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
CR D + + C C +AYH +C +PP K + G + CP+
Sbjct: 386 CRVCKDGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPR 426
>gi|431904613|gb|ELK09995.1| PHD finger protein 10 [Pteropus alecto]
Length = 451
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 332 LCGICLKGKESNKKGKAESLIHCSQCDSSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 391
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 392 CGQPHHEEEMMFCDVCDRGYHTFC 415
>gi|387542916|gb|AFJ72085.1| PHD finger protein 10 isoform a [Macaca mulatta]
Length = 498
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 379 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTCII 438
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 439 CGQPHHEEEMMFCDVCDRGYHTFC 462
>gi|190337311|gb|AAI63678.1| Myst3 protein [Danio rerio]
Length = 2247
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 230 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKTCSS 289
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 290 CQDQGKNADNMLFCDSCDRGFHMECCDPPLMRMPKGMWIC 329
>gi|300508320|pdb|2KWJ|A Chain A, Solution Structures Of The Double Phd Fingers Of Human
Transcriptional Protein Dpf3 Bound To A Histone Peptide
Containing Acetylation At Lysine 14
gi|300508322|pdb|2KWK|A Chain A, Solution Structures Of The Double Phd Fingers Of Human
Transcriptional Protein Dpf3b Bound To A H3 Peptide Wild
Type
gi|300508324|pdb|2KWN|A Chain A, Solution Structure Of The Double Phd (Plant Homeodomain)
Fingers Of Human Transcriptional Protein Dpf3b Bound To
A Histone H4 Peptide Containing Acetylation At Lysine 16
gi|300508326|pdb|2KWO|A Chain A, Solution Structure Of The Double Phd (Plant Homeodomain)
Fingers Of Human Transcriptional Protein Dpf3b Bound To
A Histone H4 Peptide Containing N-Terminal Acetylation
At Serine 1
Length = 114
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 45/99 (45%), Gaps = 2/99 (2%)
Query: 2 CRLCFVGENEGCERAR--RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
C C G N + R ++SC CG+ H CL+ + W+C C+ C +
Sbjct: 4 CDFCLGGSNMNKKSGRPEELVSCADCGRSGHPTCLQFTLNMTEAVKTYKWQCIECKSCIL 63
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C + + ++ +FC CD YH YC +PP G + C
Sbjct: 64 CGTSENDDQLLFCDDCDRGYHMYCLNPPVAEPPEGSWSC 102
>gi|255522851|ref|NP_077212.3| PHD finger protein 10 [Mus musculus]
Length = 497
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 378 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 437
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 438 CGQPHHEEEMMFCDVCDRGYHTFC 461
>gi|67078518|ref|NP_001019918.1| PHD finger protein 10 [Rattus norvegicus]
gi|66910931|gb|AAH98049.1| PHD finger protein 10 [Rattus norvegicus]
Length = 410
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|209882276|ref|XP_002142575.1| PHD-finger domain-containing protein [Cryptosporidium muris RN66]
gi|209558181|gb|EEA08226.1| PHD-finger domain-containing protein [Cryptosporidium muris RN66]
Length = 305
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 44/96 (45%), Gaps = 1/96 (1%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+ C C K YH C + + + S W C C C +CR++G + + C C+ A
Sbjct: 132 FIECSICKKSYHLTCCDPIIEKVSI-NNSKWICSDCNGCIVCRKSGREDYQVLCDVCNRA 190
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 114
+H YC +P +V G ++C C C N+ N
Sbjct: 191 FHIYCLYPTLDSVPQGIWICDDCYVCAFCQGNIKYN 226
>gi|194328736|ref|NP_579866.2| PHD finger protein 10 isoform b [Homo sapiens]
Length = 496
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 377 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 436
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 437 CGQPHHEEEMMFCDMCDRGYHTFC 460
>gi|74183205|dbj|BAE22542.1| unnamed protein product [Mus musculus]
Length = 331
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 42/96 (43%), Gaps = 2/96 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E+ ++ +LSC CG H +CLK + W+C C+ C
Sbjct: 216 ICSFCLGTKESNREKKPEELLSCADCGSSGHPSCLKFCPELTANVKALRWQCIECKTCSA 275
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSG 94
CR G + + +FC CD +H C PP + G
Sbjct: 276 CRVQGKNADNMLFCDSCDRGFHMECCDPPLSRMPKG 311
>gi|328871667|gb|EGG20037.1| hypothetical protein DFA_07153 [Dictyostelium fasciculatum]
Length = 1433
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/113 (22%), Positives = 53/113 (46%), Gaps = 18/113 (15%)
Query: 2 CRLCFVGENEGCERA----------RRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKC 51
C +CF+ + G ++ +++C +C + +H++C+ + + + + S W C
Sbjct: 716 CVICFLSTSGGNKKFYHQKTKKNQNTTLVTCFACERSFHQDCITDQPNSNN--NNSEWYC 773
Query: 52 P-----SCRI-CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+C++ C +C++ + F+ C +C YH YC P V P+ C
Sbjct: 774 SIDCSMTCQVRCNVCQKGDHEDSFVLCDKCSDGYHIYCLSPQLSEVPYDPWEC 826
>gi|1083466|pir||A55302 probable transcription factor requiem - mouse
gi|606661|gb|AAA64637.1| Requiem [Mus musculus]
Length = 371
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ W+C C+ C +C + + ++ +FC CD
Sbjct: 272 LVSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQCIECKCCNLCGTSENDDQLLFCDDCDRG 331
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC P G + C
Sbjct: 332 YHMYCLTPSMSEPPEGSWSC 351
>gi|297842501|ref|XP_002889132.1| hypothetical protein ARALYDRAFT_339887 [Arabidopsis lyrata subsp.
lyrata]
gi|297334973|gb|EFH65391.1| hypothetical protein ARALYDRAFT_339887 [Arabidopsis lyrata subsp.
lyrata]
Length = 1160
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH CL + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 1017 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 1072
Query: 87 PHKNVSSGPYLC 98
P ++V +G + C
Sbjct: 1073 PCESVPNGEWFC 1084
>gi|237839305|ref|XP_002368950.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
gi|211966614|gb|EEB01810.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
Length = 551
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 72/186 (38%), Gaps = 30/186 (16%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C C + +H +C + N +L W C C+ CE C+ + + + C CD A
Sbjct: 347 LLVCFRCRQSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRA 405
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL--------SVRWF--------- 121
YH C PP + V G + C +C C + + S+R
Sbjct: 406 YHMDCMEPPVEEVPDGTWFCADCGRCACCDRRLSDEKILDPHSCVGSMRRLCFDCKERHR 465
Query: 122 ---------LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTP---MVCCDVCQRWVHCQC 169
LG + DA + + C VC+K E P V CD+C++ VH C
Sbjct: 466 RGKRSRLSRLGSSQGDAGTHSAKRTSLCDVCVKSLCACEGKPPKMRVACDLCKQVVHADC 525
Query: 170 DGISDE 175
+ E
Sbjct: 526 ARLPQE 531
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 376 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 475
>gi|341942257|sp|Q9D8M7.4|PHF10_MOUSE RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
factor 45a; Short=BAF45a
Length = 497
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 378 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 437
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 438 CGQPHHEEEMMFCDVCDRGYHTFC 461
>gi|383850174|ref|XP_003700672.1| PREDICTED: uncharacterized protein LOC100875893 [Megachile
rotundata]
Length = 659
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 39/85 (45%), Gaps = 7/85 (8%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C LC E+ +++C+ C + H +C+ + + SSW+C C+ C IC
Sbjct: 429 CSLC------AKEKQETLVACRDCTVRAHPSCIYS-PEEMIQKAGSSWQCERCKSCTICC 481
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHP 86
T D C CD AYH YC P
Sbjct: 482 ETSDAGPLATCFTCDEAYHYYCHTP 506
>gi|380792751|gb|AFE68251.1| PHD finger protein 10 isoform a, partial [Macaca mulatta]
Length = 480
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 379 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTCII 438
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 439 CGQPHHEEEMMFCDVCDRGYHTFC 462
>gi|221507890|gb|EEE33477.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
VEG]
Length = 546
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 72/186 (38%), Gaps = 30/186 (16%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C C + +H +C + N +L W C C+ CE C+ + + + C CD A
Sbjct: 342 LLVCFRCRQSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRA 400
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL--------SVRWF--------- 121
YH C PP + V G + C +C C + + S+R
Sbjct: 401 YHMDCMEPPVEEVPDGTWFCADCGRCACCDRRLSDEKILDPHSCVGSMRRLCFDCKERHR 460
Query: 122 ---------LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTP---MVCCDVCQRWVHCQC 169
LG + DA + + C VC+K E P V CD+C++ VH C
Sbjct: 461 RGKRSRLSRLGSSQGDAGTHSAKRTSLCDVCVKSLCACEGKPPKMRVACDLCKQVVHADC 520
Query: 170 DGISDE 175
+ E
Sbjct: 521 ARLPQE 526
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 366 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 422
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 423 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 465
>gi|12805463|gb|AAH02206.1| PHD finger protein 10 [Mus musculus]
gi|148688526|gb|EDL20473.1| PHD finger protein 10, isoform CRA_b [Mus musculus]
Length = 408
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 289 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 348
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 349 CGQPHHEEEMMFCDVCDRGYHTFC 372
>gi|149047117|gb|EDL99837.1| PHD finger protein 10 [Rattus norvegicus]
Length = 449
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 330 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 389
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 390 CGQPHHEEEMMFCDVCDRGYHTFC 413
>gi|302843023|ref|XP_002953054.1| hypothetical protein VOLCADRAFT_105771 [Volvox carteri f.
nagariensis]
gi|300261765|gb|EFJ45976.1| hypothetical protein VOLCADRAFT_105771 [Volvox carteri f.
nagariensis]
Length = 2579
Score = 53.5 bits (127), Expect = 4e-04, Method: Composition-based stats.
Identities = 33/110 (30%), Positives = 48/110 (43%), Gaps = 15/110 (13%)
Query: 3 RLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRR 62
R+C + + +S +SCG++ N + N R LF CP +C
Sbjct: 1249 RVCVGAQQHPPWDSAEEMSIRSCGQRSRCNIIHNLL--RLLF-----SCP-----RVCWL 1296
Query: 63 TGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
D N+ + C CD YHCYC PP V +G + CP C + G +P
Sbjct: 1297 DEDKNRILLCDGCDGEYHCYCVEPPLLEVPAGAWFCP---SCTARGLGLP 1343
>gi|380792753|gb|AFE68252.1| PHD finger protein 10 isoform b, partial [Macaca mulatta]
Length = 478
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 377 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTCII 436
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 437 CGQPHHEEEMMFCDVCDRGYHTFC 460
>gi|410960397|ref|XP_004001392.1| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 10 [Felis catus]
Length = 440
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C +
Sbjct: 321 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCIV 380
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 381 CGQPHHEEEMMFCDVCDRGYHTFC 404
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 387 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 443
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 444 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 486
>gi|148688525|gb|EDL20472.1| PHD finger protein 10, isoform CRA_a [Mus musculus]
Length = 469
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 350 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 409
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 410 CGQPHHEEEMMFCDVCDRGYHTFC 433
>gi|355569151|gb|EHH25368.1| hypothetical protein EGK_21333 [Macaca mulatta]
Length = 410
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 368 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 424
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 425 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 467
>gi|313233623|emb|CBY09794.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 43/98 (43%), Gaps = 1/98 (1%)
Query: 2 CRLCFVGENEGCERARRMLSCKS-CGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
C LC G + E ++ C C + H C+ A +W+C C+ C C
Sbjct: 225 CDLCSNGPDTSSEDMSMLVKCSGPCKRLTHPYCVNLPANIVKNVSTYAWECQDCKHCSKC 284
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+ +K +FC CD H YC +PP KN SG + C
Sbjct: 285 GLDENDDKLLFCDDCDRGVHLYCLNPPLKNAPSGRWTC 322
>gi|7023354|dbj|BAA91934.1| unnamed protein product [Homo sapiens]
gi|48146663|emb|CAG33554.1| PHF10 [Homo sapiens]
gi|82571445|gb|AAI10324.1| PHD finger protein 10 [Homo sapiens]
gi|261858284|dbj|BAI45664.1| PHD finger protein 10 [synthetic construct]
Length = 410
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDMCDRGYHTFC 374
>gi|332825485|ref|XP_518861.3| PREDICTED: PHD finger protein 10 [Pan troglodytes]
Length = 451
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 332 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 391
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 392 CGQPHHEEEMMFCDMCDRGYHTFC 415
>gi|332263993|ref|XP_003281033.1| PREDICTED: PHD finger protein 10 [Nomascus leucogenys]
Length = 451
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 332 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 391
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 392 CGQPHHEEEMMFCDMCDRGYHTFC 415
>gi|18088065|gb|AAH20954.1| PHD finger protein 10 [Homo sapiens]
gi|123981058|gb|ABM82358.1| PHD finger protein 10 [synthetic construct]
gi|123995863|gb|ABM85533.1| PHD finger protein 10 [synthetic construct]
Length = 408
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 289 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 348
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 349 CGQPHHEEEMMFCDMCDRGYHTFC 372
>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
Length = 1093
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 72/184 (39%), Gaps = 54/184 (29%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWK--CPSCRICEI 59
C LC E EG +R R ++ +CGK YH CL W Q+ HW + CP +C
Sbjct: 552 CFLC--NEREG-DRIRCIVP--ACGKHYHSKCLIPWPQS----HWQGGRLTCPY-HVCHT 601
Query: 60 C-------RRTGDPN-KFMFCRRCDAAYHC--YCQHPPHKNVSSGPYLCPKHTKCHSCGS 109
C R+ PN K C RC ++YH C + +++ +CPKH K
Sbjct: 602 CSSDNPQNNRSRAPNEKVAKCVRCPSSYHASALCLPAGSEILTASQIICPKHYK------ 655
Query: 110 NVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
P L+ W C C R ++CCD C H +C
Sbjct: 656 -APHPPLNAAW------CFLCTR-------------------GGSLICCDTCPTSFHLEC 689
Query: 170 DGIS 173
GI+
Sbjct: 690 LGIN 693
>gi|195447676|ref|XP_002071320.1| GK25190 [Drosophila willistoni]
gi|194167405|gb|EDW82306.1| GK25190 [Drosophila willistoni]
Length = 2262
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 40/86 (46%), Gaps = 7/86 (8%)
Query: 2 CRLCFVGENEGCERARRM----LSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC 57
C +C ++ AR M + C SC + H +C++ + +W+C C+ C
Sbjct: 1869 CGVCLRNQHRN---ARNMPEAFIRCYSCRRNVHPSCIEMPQRMLGRVRNYNWQCAECKCC 1925
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYC 83
CRR K ++C +CD YH YC
Sbjct: 1926 IKCRRRQKEGKMLYCEQCDRGYHIYC 1951
>gi|119567828|gb|EAW47443.1| PHD finger protein 10, isoform CRA_b [Homo sapiens]
Length = 449
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 330 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 389
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 390 CGQPHHEEEMMFCDMCDRGYHTFC 413
>gi|45382851|ref|NP_989971.1| zinc finger protein neuro-d4 [Gallus gallus]
gi|18202298|sp|P58267.1|DPF1_CHICK RecName: Full=Zinc finger protein neuro-d4; AltName: Full=D4, zinc
and double PHD fingers family 1
gi|14010358|gb|AAK51966.1|AF362752_1 neuro-d4 [Gallus gallus]
Length = 380
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 36/80 (45%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
M++C CG+ H +CL+ W+C C+ C +C + + +FC CD
Sbjct: 283 MIACADCGRAGHPSCLQFTLAMAAAARSYRWQCIECKNCSLCGSAENDEQLLFCDDCDRG 342
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC PP G + C
Sbjct: 343 YHMYCISPPVAEPPEGTWSC 362
>gi|357135761|ref|XP_003569477.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like
[Brachypodium distachyon]
Length = 1037
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 45/94 (47%), Gaps = 7/94 (7%)
Query: 103 KCHSCGSNVPG-NGLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVC 161
+C SCG+ P + + + + C C R+ YC +CLK ++ VCC C
Sbjct: 381 QCESCGNCFPNKDSNKMVYVMEQLACKHCARILRSKEYCGICLKSWQHKCGRRWVCCHGC 440
Query: 162 QRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 194
+ W+H +CD SD K LQ + Y CP CR
Sbjct: 441 ESWIHAECDKKCSDLKDLQ-----DKSYFCPYCR 469
>gi|255558536|ref|XP_002520293.1| DNA binding protein, putative [Ricinus communis]
gi|223540512|gb|EEF42079.1| DNA binding protein, putative [Ricinus communis]
Length = 510
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH CL N N + W CPSC +C C D ++ + C CD AYH YC P
Sbjct: 378 KYYHVRCLTN---NLLKSYGPRWYCPSC-LCRTCFVDRDDDQIVLCDGCDHAYHMYCMSP 433
Query: 87 PHKNVSSGPYLC 98
P ++ G + C
Sbjct: 434 PRTSIPRGKWFC 445
>gi|426355212|ref|XP_004045024.1| PREDICTED: PHD finger protein 10 [Gorilla gorilla gorilla]
Length = 451
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 332 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELISMIKTYPWQCMECKTCII 391
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 392 CGQPHHEEEMMFCDMCDRGYHTFC 415
>gi|281351742|gb|EFB27326.1| hypothetical protein PANDA_014721 [Ailuropoda melanoleuca]
Length = 397
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C +
Sbjct: 278 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCIV 337
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 338 CGQPHHEEEMMFCDVCDRGYHTFC 361
>gi|303286287|ref|XP_003062433.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455950|gb|EEH53252.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 450
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDV--CQRWVHCQCDGISDEKYLQFQVDG 184
C C +L +G +CP C KV++ + MV CD C+ WVH CD + E + +
Sbjct: 85 CALCAKLHKEGQFCPACDKVWQWANCPAMVGCDAPGCEFWVHASCDARAKE-VMDAPENE 143
Query: 185 NLQYRCPTCRGECYQVRDL 203
+++Y CP C + + +++
Sbjct: 144 DIEYHCPRCVDKAERAKEI 162
>gi|13487236|gb|AAK27451.1|AF338735_1 hypothetical PHD zinc finger protein XAP135 [Homo sapiens]
Length = 410
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDMCDRGYHTFC 374
>gi|195174305|ref|XP_002027919.1| GL27102 [Drosophila persimilis]
gi|194115608|gb|EDW37651.1| GL27102 [Drosophila persimilis]
Length = 2142
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 2/94 (2%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+ C SC ++ H +C+ + +W+C C+ C C+ + P K ++C +CD
Sbjct: 1837 FIRCYSCRQRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKCKSSQRPGKMLYCEQCDRG 1896
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
YH YC K V + C + C CG+ P
Sbjct: 1897 YHIYCLG--LKTVPDERWSCERCCICMRCGAVKP 1928
>gi|156392562|ref|XP_001636117.1| predicted protein [Nematostella vectensis]
gi|156223217|gb|EDO44054.1| predicted protein [Nematostella vectensis]
Length = 229
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
C++CRR GD K + C CD +H YC PP K++ G + CP
Sbjct: 1 CKLCRRKGDAEKMLLCDACDRGHHMYCLKPPIKHIPEGNWFCP 43
>gi|403305903|ref|XP_003943488.1| PREDICTED: PHD finger protein 10 [Saimiri boliviensis boliviensis]
Length = 410
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|355749042|gb|EHH53525.1| hypothetical protein EGM_14185 [Macaca fascicularis]
Length = 410
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTVELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|390462274|ref|XP_002747237.2| PREDICTED: PHD finger protein 10 [Callithrix jacchus]
Length = 451
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 332 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 391
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 392 CGQPHHEEEMMFCDVCDRGYHTFC 415
>gi|51969394|dbj|BAD43389.1| unnamed protein product [Arabidopsis thaliana]
gi|51969560|dbj|BAD43472.1| unnamed protein product [Arabidopsis thaliana]
gi|51969870|dbj|BAD43627.1| unnamed protein product [Arabidopsis thaliana]
Length = 522
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH CL + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434
Query: 87 PHKNVSSGPYLC 98
P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446
>gi|17569817|ref|NP_510140.1| Protein CHD-3 [Caenorhabditis elegans]
gi|6165993|sp|Q22516.2|CHD3_CAEEL RecName: Full=Chromodomain-helicase-DNA-binding protein 3 homolog;
Short=CHD-3
gi|3879819|emb|CAA91810.1| Protein CHD-3 [Caenorhabditis elegans]
gi|11095331|gb|AAG29837.1| CHD-3 [Caenorhabditis elegans]
Length = 1787
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 43/109 (39%), Gaps = 25/109 (22%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------- 55
E C + ++ C +C + YH C+ +N + W CP C
Sbjct: 269 EVCNQDGELMLCDTCTRAYHVACID---ENMEQPPEGDWSCPHCEEHGPDVLIVEEEPAK 325
Query: 56 ----ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
C IC+ T + + C C ++YH YC PP + G + CP+
Sbjct: 326 ANMDYCRICKETSN---ILLCDTCPSSYHAYCIDPPLTEIPEGEWSCPR 371
>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
Length = 1279
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 65/158 (41%), Gaps = 12/158 (7%)
Query: 4 LCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCP--SCRICEICR 61
LCF E G + S ++CGKKYHR C+ N R +S+KCP C C +
Sbjct: 726 LCFACEQPGGLEGLQTCSVRNCGKKYHRACISN--NPRAALKDNSFKCPLHKCANCTYPQ 783
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKC--HSCGSNVPGNGLSVR 119
+ P + C RC AYH C + ++ LCPKH H+ + G R
Sbjct: 784 ASTYP--LVRCIRCPIAYHTCCVPAGCLHENAIYLLCPKHQPVEKHAKSNICLACGDGGR 841
Query: 120 WFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVC 157
F CCD C + + V SE +P C
Sbjct: 842 LF----CCDTCPAAYHQECLKDVLALTGTPSEDSPWYC 875
>gi|290992402|ref|XP_002678823.1| predicted protein [Naegleria gruberi]
gi|284092437|gb|EFC46079.1| predicted protein [Naegleria gruberi]
Length = 457
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 5/80 (6%)
Query: 1 MCRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEIC 60
+C++C E+ + + C SC +H C ++ + H +W C C+IC C
Sbjct: 313 LCKIC-----NSFEQETKFIQCLSCNSYFHTFCYTPNLEHLEQVHKDNWLCSDCKICLKC 367
Query: 61 RRTGDPNKFMFCRRCDAAYH 80
R+ + +FC CD +H
Sbjct: 368 RKGPNEGTLVFCDYCDCGFH 387
>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
Length = 2221
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 43/108 (39%), Gaps = 17/108 (15%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 54
C+ C G+NE ++L C C K YH C K +N W C C
Sbjct: 1937 CQFCHSGDNED-----KLLLCDGCDKGYHTYCFKPKMEN---IPEGDWYCHECMNKATGE 1988
Query: 55 RICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
R C +C + + + + C C AYH C HP V G + C K
Sbjct: 1989 RNCIVCGKKSSTSGTRLILCELCPRAYHTDCIHPIMHKVPRGKWYCSK 2036
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 26/113 (23%), Positives = 40/113 (35%), Gaps = 30/113 (26%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 116
C+ C + +K + C CD YH YC P +N+ G + CH C + G
Sbjct: 1937 CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWY------CHECMNKATGE-- 1988
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
C CG+ + T ++ C++C R H C
Sbjct: 1989 --------RNCIVCGK--------------KSSTSGTRLILCELCPRAYHTDC 2019
>gi|189239425|ref|XP_001814901.1| PREDICTED: similar to Toutatis [Tribolium castaneum]
Length = 2075
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 43/108 (39%), Gaps = 17/108 (15%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 54
C+ C G+NE ++L C C K YH C K +N W C C
Sbjct: 1791 CQFCHSGDNED-----KLLLCDGCDKGYHTYCFKPKMEN---IPEGDWYCHECMNKATGE 1842
Query: 55 RICEIC--RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
R C +C + + + + C C AYH C HP V G + C K
Sbjct: 1843 RNCIVCGKKSSTSGTRLILCELCPRAYHTDCIHPIMHKVPRGKWYCSK 1890
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 26/113 (23%), Positives = 40/113 (35%), Gaps = 30/113 (26%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 116
C+ C + +K + C CD YH YC P +N+ G + CH C + G
Sbjct: 1791 CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWY------CHECMNKATGE-- 1842
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQC 169
C CG+ + T ++ C++C R H C
Sbjct: 1843 --------RNCIVCGK--------------KSSTSGTRLILCELCPRAYHTDC 1873
>gi|42563280|ref|NP_177849.2| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|95147302|gb|ABF57286.1| At1g77250 [Arabidopsis thaliana]
gi|332197833|gb|AEE35954.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 522
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH CL + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434
Query: 87 PHKNVSSGPYLC 98
P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446
>gi|47118082|gb|AAT11171.1| monocytic leukemia zinc finger protein [Danio rerio]
Length = 2246
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 1 MCRLCF-VGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C E ++ ++SC CG H +CLK + W+C C+ C
Sbjct: 229 ICSFCLGTKEQNRDKKPEELISCADCGNSGHPSCLKFSPELTVRVKALWWQCIECKSCSS 288
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ G + + +FC CD +H C PP + G ++C
Sbjct: 289 CQDQGKNADNMLFCDSCDRGFHMECCDPPLTRMPKGMWIC 328
>gi|348561411|ref|XP_003466506.1| PREDICTED: PHD finger protein 10-like [Cavia porcellus]
Length = 614
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 495 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMSVELVSMIKTYPWQCMECKTCII 554
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 555 CGQPHHEEEMMFCDVCDRGYHTFC 578
>gi|51969444|dbj|BAD43414.1| unnamed protein product [Arabidopsis thaliana]
Length = 522
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 27 KKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHP 86
K YH CL + H W C SC +C C D +K + C CD AYH YC P
Sbjct: 379 KYYHIRCL---TSRQIKLHGVRWYCSSC-LCRNCLTDKDDDKIVLCDGCDDAYHIYCMRP 434
Query: 87 PHKNVSSGPYLC 98
P ++V +G + C
Sbjct: 435 PCESVPNGEWFC 446
>gi|198471111|ref|XP_002133666.1| GA22685 [Drosophila pseudoobscura pseudoobscura]
gi|198145773|gb|EDY72293.1| GA22685 [Drosophila pseudoobscura pseudoobscura]
Length = 2132
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 2/94 (2%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+ C SC ++ H +C+ + +W+C C+ C C+ + P K ++C +CD
Sbjct: 1827 FIRCYSCRQRVHPSCIDMPQRMVGRVRNYNWQCAGCKCCIKCKSSQRPGKMLYCEQCDRG 1886
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVP 112
YH YC K V + C + C CG+ P
Sbjct: 1887 YHIYCLG--LKTVPDERWSCERCCICMRCGAVKP 1918
>gi|198437529|ref|XP_002126456.1| PREDICTED: similar to Bromodomain adjacent to zinc finger domain
protein 1A (ATP-utilizing chromatin assembly and
remodeling factor 1) (hACF1) (ATP-dependent
chromatin-remodeling protein) (Williams syndrome
transcription factor-related chromatin-remodeling fa...
[Ciona intestinalis]
Length = 1458
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 24/43 (55%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
C ICRR GD K + C CD +H YC P K V SG + CP
Sbjct: 1178 CRICRRKGDGEKMLLCDNCDRGHHMYCLRPALKIVPSGDWFCP 1220
>gi|222618974|gb|EEE55106.1| hypothetical protein OsJ_02868 [Oryza sativa Japonica Group]
Length = 1032
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 7/115 (6%)
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-WFLGYTCCDACGRLFVKGNYC 140
+ + P N + P + +C SCG+ P S+ + + C C ++ YC
Sbjct: 355 FFEVPMDGNTTGQPARYKRALQCESCGNCFPNKDPSMMVYVMEQLACRQCAKILRSKEYC 414
Query: 141 PVCLKVYRDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 194
VCLK ++ VCC C+ WVH +CD S+ K L+ + Y CP CR
Sbjct: 415 GVCLKSWQHKCGGRWVCCHGCESWVHAECDKKCSNLKDLR-----DNSYFCPYCR 464
>gi|84000081|ref|NP_001033141.1| PHD finger protein 10 [Bos taurus]
gi|122136994|sp|Q2T9V9.1|PHF10_BOVIN RecName: Full=PHD finger protein 10; AltName: Full=BRG1-associated
factor 45a; Short=BAF45a
gi|83405479|gb|AAI11244.1| PHD finger protein 10 [Bos taurus]
gi|296483819|tpg|DAA25934.1| TPA: PHD finger protein 10 [Bos taurus]
Length = 410
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 291 LCGICLKGKESSRRGKAEPLVHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDVCDRGYHTFC 374
>gi|218188776|gb|EEC71203.1| hypothetical protein OsI_03117 [Oryza sativa Indica Group]
Length = 1012
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 7/115 (6%)
Query: 82 YCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVR-WFLGYTCCDACGRLFVKGNYC 140
+ + P N + P + +C SCG+ P S+ + + C C ++ YC
Sbjct: 365 FFEVPMDGNTTGQPARYKRALQCESCGNCFPNKDPSMMVYVMEQLACRQCAKILRSKEYC 424
Query: 141 PVCLKVYRDSESTPMVCCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 194
VCLK ++ VCC C+ WVH +CD S+ K L+ + Y CP CR
Sbjct: 425 GVCLKSWQHKCGGRWVCCHGCESWVHAECDKKCSNLKDLR-----DNSYFCPYCR 474
>gi|440893739|gb|ELR46406.1| PHD finger protein 10, partial [Bos grunniens mutus]
Length = 469
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 350 LCGICLKGKESSRRGKAEPLVHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCII 409
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 410 CGQPHHEEEMMFCDVCDRGYHTFC 433
>gi|196014713|ref|XP_002117215.1| hypothetical protein TRIADDRAFT_61269 [Trichoplax adhaerens]
gi|190580180|gb|EDV20265.1| hypothetical protein TRIADDRAFT_61269 [Trichoplax adhaerens]
Length = 1478
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
C ICRR GD + C CD +H YC PP ++ +G + CP
Sbjct: 1141 CRICRRKGDAELMLLCDECDRGHHTYCLRPPLNSIPAGNWYCP 1183
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRIC------------- 57
E C++ ++ C +C + YH CL+ D W CP C
Sbjct: 382 EVCQQGGEIILCDTCPRAYHLVCLE---PELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPR 481
>gi|326915596|ref|XP_003204100.1| PREDICTED: PHD finger protein 10-like, partial [Meleagris
gallopavo]
Length = 361
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 242 ICGICLKGKESNKKGKAEALIHCSQCDNSGHPSCLDMTPELVAMIKTYPWQCMECKTCII 301
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 302 CGQPHHEEEMMFCDVCDRGYHTFC 325
>gi|221483410|gb|EEE21729.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
GT1]
Length = 556
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 71/186 (38%), Gaps = 30/186 (16%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
+L C C +H +C + N +L W C C+ CE C+ + + + C CD A
Sbjct: 352 LLVCFRCRHSHHASCC-DPPLNFELVTRYPWHCADCKRCECCQLNTNEEQMLICDACDRA 410
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL--------SVRWF--------- 121
YH C PP + V G + C +C C + + S+R
Sbjct: 411 YHMDCMEPPVEEVPDGTWFCADCGRCACCDRRLSDEKILDPHSCVGSMRRLCFDCKERHR 470
Query: 122 ---------LGYTCCDACGRLFVKGNYCPVCLKVYRDSESTP---MVCCDVCQRWVHCQC 169
LG + DA + + C VC+K E P V CD+C++ VH C
Sbjct: 471 RGKRSRLSRLGSSQGDAGTHSAKRTSLCDVCVKSLCACEGKPPKMRVACDLCKQVVHADC 530
Query: 170 DGISDE 175
+ E
Sbjct: 531 ARLPQE 536
>gi|348508478|ref|XP_003441781.1| PREDICTED: tyrosine-protein kinase BAZ1B-like [Oreochromis niloticus]
Length = 1521
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 6/63 (9%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 116
C++CRR GD K + C C+ A+H +C P V +G +LCP +C V G
Sbjct: 1202 CKVCRRKGDDEKLILCDECNKAFHLFCLRPALYRVPNGEWLCP------ACQPTVARRGS 1255
Query: 117 SVR 119
VR
Sbjct: 1256 RVR 1258
>gi|297596335|ref|NP_001042415.2| Os01g0218900 [Oryza sativa Japonica Group]
gi|56784089|dbj|BAD81418.1| unknown protein [Oryza sativa Japonica Group]
gi|222617993|gb|EEE54125.1| hypothetical protein OsJ_00898 [Oryza sativa Japonica Group]
gi|255673003|dbj|BAF04329.2| Os01g0218900 [Oryza sativa Japonica Group]
Length = 405
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 5/96 (5%)
Query: 100 KHTKCHSCGSNVPGNGLSVRWFLGY-TCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCC 158
K C CG+ +P S + G C C +L YC +C K++ ++ VCC
Sbjct: 306 KSPGCDICGNRLPCKIASKKKQAGERLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCC 365
Query: 159 DVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
D CQ WVH +C D+ ++ + N Y CP C+
Sbjct: 366 DECQIWVHVEC----DQTCIKMEDLENADYFCPDCK 397
>gi|344257401|gb|EGW13505.1| PHD finger protein 10 [Cricetulus griseus]
Length = 329
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C I
Sbjct: 210 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSIIKTYPWQCMECKTCII 269
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 270 CGQPHHEEEMMFCDVCDRGYHTFC 293
>gi|256070387|ref|XP_002571524.1| zinc finger protein [Schistosoma mansoni]
Length = 1690
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
C ICRR D + + C C+ A+H YC PP K V +G + CP
Sbjct: 1253 CRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCP 1295
>gi|145534470|ref|XP_001452979.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420679|emb|CAK85582.1| unnamed protein product [Paramecium tetraurelia]
Length = 286
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 40/69 (57%), Gaps = 4/69 (5%)
Query: 127 CDACGRLFVKGNYCPVCLKVY--RDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDG 184
CD C +L+ KGN+C C +VY D E+ V CD CQ+W H C+ + + +Q + +
Sbjct: 87 CDKCSKLYNKGNFCDFCEQVYGSYDDEAV-WVQCDSCQKWNHIVCEQKNRNQNIQIEFET 145
Query: 185 NLQYRCPTC 193
+ QY C TC
Sbjct: 146 S-QYHCLTC 153
>gi|345493038|ref|XP_003426985.1| PREDICTED: hypothetical protein LOC100678755 isoform 1 [Nasonia
vitripennis]
gi|345493040|ref|XP_003426986.1| PREDICTED: hypothetical protein LOC100678755 isoform 2 [Nasonia
vitripennis]
Length = 728
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 14/107 (13%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C LC + E + +C+ C + H +C+ + + H +SW+C C+ C +C
Sbjct: 499 CSLCSKDKQEA------LTACRDCTVRAHPSCIYTPEEIMNKTH-TSWQCERCKTCVVCY 551
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCG 108
T + + C CD A+H C H P VS + CH C
Sbjct: 552 ETSEAGPLVACYSCDDAFHYTC-HTPRIPVSKAKW------NCHECS 591
>gi|195503632|ref|XP_002098733.1| GE10528 [Drosophila yakuba]
gi|194184834|gb|EDW98445.1| GE10528 [Drosophila yakuba]
Length = 1441
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 80/218 (36%), Gaps = 75/218 (34%)
Query: 1 MCRLCFVGENEGC-----------------ERAR-------RMLSCKS--CGKKYHRNCL 34
+C C VGE EGC E A ++L+C CGK++H +C
Sbjct: 859 VCHECNVGEPEGCVICHQVESPAVPSTPMKEEAPSHIPIEDKLLTCSQPLCGKRFHTSCC 918
Query: 35 KNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRRCDAAYH--CYC 83
K W Q H S +CP +C C + DP +K C RC A YH +C
Sbjct: 919 KYWPQANSSKH--SARCPR-HVCHTC-VSDDPSGKFQQLGSSKLAKCVRCPATYHQDSHC 974
Query: 84 QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVC 143
+ +++ +CP+H N+ V Y C VKG
Sbjct: 975 IPAGTQMLNATHIICPRH--------NIAKADAHVNVLWCYIC--------VKGGE---- 1014
Query: 144 LKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 178
+VCC+ C VH C I ++E Y+
Sbjct: 1015 -----------LVCCETCPIAVHAHCRNIPIKTNENYI 1041
>gi|350645335|emb|CCD59958.1| zinc finger protein, putative [Schistosoma mansoni]
Length = 1690
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
C ICRR D + + C C+ A+H YC PP K V +G + CP
Sbjct: 1253 CRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCP 1295
>gi|291414252|ref|XP_002723376.1| PREDICTED: PHD finger protein 10 [Oryctolagus cuniculus]
Length = 626
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C +
Sbjct: 507 ICGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMSTELVSMIKTYPWQCMECKTCIV 566
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 567 CGQPHHEEEMMFCDVCDRGYHTFC 590
>gi|405959089|gb|EKC25157.1| Bromodomain adjacent to zinc finger domain protein 1A [Crassostrea
gigas]
Length = 1488
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
C ICRR GD + + C +CD +H YC P K+V G + CP
Sbjct: 1132 CRICRRKGDAEQMLLCDKCDRGHHMYCLKPRLKHVPKGDWFCP 1174
>gi|380029159|ref|XP_003698249.1| PREDICTED: uncharacterized protein LOC100865213 [Apis florea]
Length = 659
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 7/85 (8%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C LC E+ +++C+ C + H +C+ + + S+W+C C+ C IC
Sbjct: 429 CSLC------AKEKQENLVACRDCTVRAHPSCIYS-PEEMIQKAGSNWQCERCKSCTICC 481
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHP 86
T D C CD AYH YC P
Sbjct: 482 ETSDAGPLATCFTCDEAYHYYCHTP 506
>gi|328785548|ref|XP_003250614.1| PREDICTED: hypothetical protein LOC100576266 [Apis mellifera]
Length = 659
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 7/85 (8%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C LC E+ +++C+ C + H +C+ + + S+W+C C+ C IC
Sbjct: 429 CSLC------AKEKQENLVACRDCTVRAHPSCIYS-PEEMIQKAGSNWQCERCKSCTICC 481
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHP 86
T D C CD AYH YC P
Sbjct: 482 ETSDAGPLATCFTCDEAYHYYCHTP 506
>gi|357616639|gb|EHJ70297.1| hypothetical protein KGM_09919 [Danaus plexippus]
Length = 1569
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 47/119 (39%), Gaps = 19/119 (15%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSC------- 54
C+ C G+NE ++L C C K YH C K + W W+C +
Sbjct: 1256 CQFCLSGDNED-----QLLLCDGCDKGYHTYCFKPRMEKIPDGDWYCWECVNKARGGSRE 1310
Query: 55 RICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 113
R+C +C + + C C AYH C +PP + G + C + C S P
Sbjct: 1311 RVCIVCGGAAR-GRALPCALCVRAYHLDCHYPPLTKMPRGKWYCSQ------CASRAPA 1362
>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Nasonia vitripennis]
Length = 1382
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 67/178 (37%), Gaps = 47/178 (26%)
Query: 5 CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 64
CFV ER + S +CGK YH +CLK+W Q + + CP IC C
Sbjct: 663 CFVCHERDGERTK--CSILACGKHYHPDCLKSWPQCQ--WQGGRLTCPH-HICHTCASDN 717
Query: 65 DPN--------KFMFCRRCDAAYHCYCQHPPHKN--VSSGPYLCPKHTKCHSCGSNVPGN 114
N KF C +C + YH P + ++ +CPKH K S+ P
Sbjct: 718 PQNSHPRSAGEKFAKCVKCPSTYHASISCLPAGSTILTGSQIVCPKHYK----SSHPP-- 771
Query: 115 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
V +C +C +E ++CCD C H +C GI
Sbjct: 772 --------------------VNATWCFLC------TEGGSLICCDTCPTSFHLECLGI 803
>gi|345481883|ref|XP_001605650.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Nasonia vitripennis]
Length = 2009
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 13/100 (13%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRI----------CEIC 60
E C++ ++ C +C + YH CL+ + WS CP C E C
Sbjct: 373 EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS---CPHCENDGALEDDDEHMEFC 429
Query: 61 RRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
R D + + C C +AYH +C +PP + G + CP+
Sbjct: 430 RVCKDGGELLCCDSCTSAYHTHCLNPPLTEIPDGDWKCPR 469
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 20/104 (19%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 56
E C++ ++ C +C + YH CL ++ WS CP C
Sbjct: 377 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCENEGPAEQDDDEHQEF 433
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
C IC+ G+ + C C +AYH +C +PP + G + CP+
Sbjct: 434 CRICKDGGE---LLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPR 474
>gi|392575621|gb|EIW68754.1| hypothetical protein TREMEDRAFT_63213 [Tremella mesenterica DSM
1558]
Length = 2086
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 32/61 (52%), Gaps = 6/61 (9%)
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNG 115
+CE+CR G P+K + C +CD YH YC PP K + + + C SC PGNG
Sbjct: 470 VCEVCRSGGAPDKMLLCDKCDCGYHIYCLDPPLKGLPAY-----EEWYCTSCLLG-PGNG 523
Query: 116 L 116
Sbjct: 524 F 524
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 20/104 (19%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR--------------I 56
E C++ ++ C +C + YH CL ++ WS CP C
Sbjct: 380 EVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWS---CPHCENEGPAEQDDDEHQEF 436
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
C IC+ G+ + C C +AYH +C +PP + G + CP+
Sbjct: 437 CRICKDGGE---LLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPR 477
>gi|218187759|gb|EEC70186.1| hypothetical protein OsI_00918 [Oryza sativa Indica Group]
Length = 405
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 5/96 (5%)
Query: 100 KHTKCHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCC 158
K C CG+ +P S + G C C +L YC +C K++ ++ VCC
Sbjct: 306 KLPGCDICGNRLPCKIASKKKQAGERLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCC 365
Query: 159 DVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
D CQ WVH +C D+ ++ + N Y CP C+
Sbjct: 366 DECQIWVHVEC----DQTCIKMEDLENADYFCPDCK 397
>gi|218188422|gb|EEC70849.1| hypothetical protein OsI_02356 [Oryza sativa Indica Group]
Length = 1226
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 49/103 (47%), Gaps = 11/103 (10%)
Query: 1 MCRLCFVGENEGCERARRMLSC---KSCGKKYHRNCLK--NWAQNRDLFHWSSWKCPSCR 55
+C++C E E+ +R L C K YH +CLK A ++ L W CPSC
Sbjct: 1074 LCKMCGNPE----EKDKRFLVCGHTHCLYKYYHISCLKATQIASDKQLDK-PCWYCPSC- 1127
Query: 56 ICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
+C +C D + + C CD AYH YC P ++ G + C
Sbjct: 1128 LCRVCHSDRDDDLTILCDGCDEAYHLYCITPRRTSIPKGKWYC 1170
>gi|115435312|ref|NP_001042414.1| Os01g0218800 [Oryza sativa Japonica Group]
gi|56784088|dbj|BAD81417.1| putative trithorax 3 [Oryza sativa Japonica Group]
gi|113531945|dbj|BAF04328.1| Os01g0218800 [Oryza sativa Japonica Group]
Length = 991
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 104 CHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 162
C SCG+ VP + G C C +L YC +C K++ ++ VCCD CQ
Sbjct: 342 CDSCGNRVPPKIAKKKKQAGEQLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCCDECQ 401
Query: 163 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
WVH +CD + + N Y CP C+
Sbjct: 402 IWVHVECDLTC----INMEDLENADYFCPDCK 429
>gi|218187758|gb|EEC70185.1| hypothetical protein OsI_00917 [Oryza sativa Indica Group]
Length = 991
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 104 CHSCGSNVPGNGLSVRWFLG-YTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQ 162
C SCG+ VP + G C C +L YC +C K++ ++ VCCD CQ
Sbjct: 342 CDSCGNRVPPKIAKKKKQAGEQLLCRHCDKLLQSKQYCGICKKIWHHTDGGNWVCCDECQ 401
Query: 163 RWVHCQCDGISDEKYLQFQVDGNLQYRCPTCR 194
WVH +CD + + N Y CP C+
Sbjct: 402 IWVHVECDLTC----INMEDLENADYFCPDCK 429
>gi|348534080|ref|XP_003454531.1| PREDICTED: PHD finger protein 10-like [Oreochromis niloticus]
Length = 491
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/84 (25%), Positives = 39/84 (46%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVGENEGCE-RARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G+ + + ++ C C H +CL + + W+C C+ C +
Sbjct: 368 ICGICQKGKEANKKGKPEALIHCSECENSGHPSCLDMSEELVSMIQTYRWQCMECKTCTV 427
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C++ ++ MFC CD YH +C
Sbjct: 428 CQQPHHEDEMMFCDMCDRGYHTFC 451
>gi|195432472|ref|XP_002064247.1| GK19801 [Drosophila willistoni]
gi|194160332|gb|EDW75233.1| GK19801 [Drosophila willistoni]
Length = 478
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 40/93 (43%), Gaps = 5/93 (5%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++CKSC +K H CL +N + +KC CR C C G + C C
Sbjct: 160 FITCKSCMQKCHFACLPLNFENLTMAR-KKYKCEKCRYCSYCNSKGKEILIILCSSCVDG 218
Query: 79 YHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNV 111
YH C +PP + L + KCH C +N
Sbjct: 219 YHFECHNPPL----NASILDDREWKCHKCDTNA 247
>gi|224067978|ref|XP_002302628.1| SET domain protein [Populus trichocarpa]
gi|222844354|gb|EEE81901.1| SET domain protein [Populus trichocarpa]
Length = 667
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 35/68 (51%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C +L YC +C K + S+ VCCD C WVH +CD IS + + + ++
Sbjct: 30 CKHCAKLRKSKQYCGICKKTWHHSDGGNWVCCDGCNVWVHAECDNISSKLFKDME---DI 86
Query: 187 QYRCPTCR 194
Y CP C+
Sbjct: 87 DYYCPDCK 94
>gi|449444070|ref|XP_004139798.1| PREDICTED: uncharacterized protein LOC101205573 [Cucumis sativus]
Length = 574
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 27 KKYHRNCL-KNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQH 85
K YH CL K ++ D + W CPSC +C C D +K + C CD +H YC
Sbjct: 434 KCYHTRCLTKKQLKSYD----ACWYCPSC-LCRACLINQDDDKIVLCDGCDHGFHIYCMR 488
Query: 86 PPHKNVSSGPYLCPK 100
PP + G + C K
Sbjct: 489 PPLAAIPKGKWFCSK 503
>gi|297803296|ref|XP_002869532.1| hypothetical protein ARALYDRAFT_913734 [Arabidopsis lyrata subsp.
lyrata]
gi|297315368|gb|EFH45791.1| hypothetical protein ARALYDRAFT_913734 [Arabidopsis lyrata subsp.
lyrata]
Length = 1024
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C RL + C +C K+ +S V CD C+ W+H +CD ISD+ G
Sbjct: 384 CKPCSRLTKSKHICGICKKIRNHLDSQSWVRCDGCKIWIHAECDQISDKHLKDL---GET 440
Query: 187 QYRCPTCRGE 196
Y CPTCR +
Sbjct: 441 DYYCPTCRAK 450
>gi|432892838|ref|XP_004075862.1| PREDICTED: tyrosine-protein kinase BAZ1B-like [Oryzias latipes]
Length = 1572
Score = 52.4 bits (124), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 6/63 (9%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 116
C++CRR GD K + C C+ A+H +C P + +G +LCP +C V G
Sbjct: 1183 CKVCRRKGDDEKLILCDECNKAFHLFCLRPALYRIPTGEWLCP------ACQPTVARRGS 1236
Query: 117 SVR 119
+R
Sbjct: 1237 RLR 1239
>gi|195356293|ref|XP_002044613.1| GM11100 [Drosophila sechellia]
gi|194132317|gb|EDW53891.1| GM11100 [Drosophila sechellia]
Length = 95
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 39/80 (48%)
Query: 19 MLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNKFMFCRRCDAA 78
++SC CG+ H +CL+ A W+C C+ C IC + + ++ +FC CD
Sbjct: 5 LVSCSDCGRSGHPSCLQFTANMIISVKRYRWQCIECKYCSICGTSDNDDQLLFCDDCDRG 64
Query: 79 YHCYCQHPPHKNVSSGPYLC 98
YH YC PP G + C
Sbjct: 65 YHMYCLSPPLVTPPEGSWSC 84
>gi|356560272|ref|XP_003548417.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 954
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 139 YCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRCPTCRGE 196
YC +C +++ S+ VCCD C WVH +CD IS + + + N Y CP C+G+
Sbjct: 335 YCGICKRIWHHSDGGNWVCCDGCNVWVHAECDKISSKLFKDLE---NTDYYCPDCKGK 389
>gi|350396306|ref|XP_003484507.1| PREDICTED: hypothetical protein LOC100744391 [Bombus impatiens]
Length = 658
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 7/85 (8%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C LC E+ +++C+ C + H +C+ + + S+W+C C+ C IC
Sbjct: 428 CSLC------AKEKQETLVACRDCTVRAHPSCIYS-PEEMIQKAGSNWQCERCKSCTICC 480
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHP 86
T D C CD AYH YC P
Sbjct: 481 ETSDAGPLATCFTCDDAYHYYCHTP 505
>gi|340722214|ref|XP_003399503.1| PREDICTED: hypothetical protein LOC100648836 [Bombus terrestris]
Length = 659
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 7/85 (8%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C LC E+ +++C+ C + H +C+ + + S+W+C C+ C IC
Sbjct: 429 CSLC------AKEKQETLVACRDCTVRAHPSCIYS-PEEMVQKAGSNWQCERCKSCTICC 481
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHP 86
T D C CD AYH YC P
Sbjct: 482 ETSDAGPLATCFTCDDAYHYYCHTP 506
>gi|301779690|ref|XP_002925264.1| PREDICTED: PHD finger protein 10-like [Ailuropoda melanoleuca]
Length = 636
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + + W+C C+ C +
Sbjct: 517 LCGICLKGKESNKKGKAESLIHCSQCDNSGHPSCLDMTMELVSMIKTYPWQCMECKTCIV 576
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 577 CGQPHHEEEMMFCDVCDRGYHTFC 600
>gi|62896783|dbj|BAD96332.1| PHD finger protein 10 isoform a variant [Homo sapiens]
Length = 410
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 1 MCRLCFVG-ENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C +C G E+ +A ++ C C H +CL + W+C C+ C I
Sbjct: 291 ICGICLKGKESNKKGKAESLIHCSQCENSGHPSCLDMTMELVSTIKTYPWQCMECKTCII 350
Query: 60 CRRTGDPNKFMFCRRCDAAYHCYC 83
C + + MFC CD YH +C
Sbjct: 351 CGQPHHEEEMMFCDMCDRGYHTFC 374
>gi|340501484|gb|EGR28266.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 956
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 74/184 (40%), Gaps = 25/184 (13%)
Query: 18 RMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTGDPNK-FMFCRRCD 76
+ C+ C YH C+++ ++ C CR C IC G NK + C C
Sbjct: 159 KFTVCQYCQLYYHAQCVQDQ---------ENFVCEQCRPCTICY--GKINKDNITCCECK 207
Query: 77 AAYHCYCQHPPHKNV-----SSGPYL-CPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDAC 130
+ +H C ++ G L C KC CG + ++ C+ C
Sbjct: 208 SRFHKKCGFSIAHDLRIVDKQYGVKLYCESCVKCCMCGVKLLNAVNGYQFQDDQIYCNEC 267
Query: 131 GRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNLQYRC 190
+ K YCPVC + ++ + MV C CQ W+H +CD + Y + + Y C
Sbjct: 268 IQQLEKKEYCPVCRQFWQKECTKDMVQCS-CQMWIHKECDP-HLKNYKEQAI-----YHC 320
Query: 191 PTCR 194
P CR
Sbjct: 321 PNCR 324
>gi|449672238|ref|XP_002165676.2| PREDICTED: histone acetyltransferase KAT6B-like [Hydra
magnipapillata]
Length = 522
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 40/98 (40%), Gaps = 6/98 (6%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C+ C NE + C +C H +CL+ + W+C C+ C C
Sbjct: 3 CKFCEADSNE------EQIQCATCKGMCHPSCLELPKHIIPVVRTYDWQCNDCKYCYGCH 56
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
+ + +FC RCD YH YC P K G + CP
Sbjct: 57 DIENEKQILFCDRCDRGYHMYCIKPKMKKKPKGDWFCP 94
>gi|168057192|ref|XP_001780600.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667966|gb|EDQ54583.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2546
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPG 113
C +C D M C +CDA YH YC +PP + V G + CP +C + PG
Sbjct: 1164 CRVCGVDEDYESIMLCDKCDAEYHTYCLNPPLEKVPEGTWFCP---ECVALDKGFPG 1217
>gi|195352984|ref|XP_002042990.1| GM16309 [Drosophila sechellia]
gi|194127055|gb|EDW49098.1| GM16309 [Drosophila sechellia]
Length = 1418
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 80/218 (36%), Gaps = 75/218 (34%)
Query: 1 MCRLCFVGENEGC-----------------ERAR-------RMLSCKS--CGKKYHRNCL 34
+C C VGE EGC E A ++L+C CGK++H +C
Sbjct: 846 VCHECNVGEPEGCVICHQVESPAVPSTPRKEDAPSHTPIEDKLLTCSQPVCGKRFHTSCC 905
Query: 35 KNWAQNRDLFHWSSWKCPSCRICEICRRTGDP---------NKFMFCRRCDAAYH--CYC 83
K W Q H S +CP +C C + DP +K C RC A YH +C
Sbjct: 906 KYWPQASSSKH--SARCPR-HVCHTC-VSNDPSGRFQQLGSSKLAKCVRCPATYHQDSHC 961
Query: 84 QHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGLSVRWFLGYTCCDACGRLFVKGNYCPVC 143
+ +++ +CP+H N+ V Y C VKG
Sbjct: 962 IPAGTQMLNATNIICPRH--------NIAKADAHVNVLWCYIC--------VKGGE---- 1001
Query: 144 LKVYRDSESTPMVCCDVCQRWVHCQCDGI---SDEKYL 178
+VCC+ C VH C I ++E Y+
Sbjct: 1002 -----------LVCCETCPIAVHAHCRNIPIKTNENYI 1028
>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Nasonia vitripennis]
Length = 1317
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 67/178 (37%), Gaps = 47/178 (26%)
Query: 5 CFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICRRTG 64
CFV ER + S +CGK YH +CLK+W Q + + CP IC C
Sbjct: 598 CFVCHERDGERTK--CSILACGKHYHPDCLKSWPQCQ--WQGGRLTCPH-HICHTCASDN 652
Query: 65 DPN--------KFMFCRRCDAAYHCYCQHPPHKN--VSSGPYLCPKHTKCHSCGSNVPGN 114
N KF C +C + YH P + ++ +CPKH K S+ P
Sbjct: 653 PQNSHPRSAGEKFAKCVKCPSTYHASISCLPAGSTILTGSQIVCPKHYK----SSHPP-- 706
Query: 115 GLSVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGI 172
V +C +C +E ++CCD C H +C GI
Sbjct: 707 --------------------VNATWCFLC------TEGGSLICCDTCPTSFHLECLGI 738
>gi|145518149|ref|XP_001444952.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412385|emb|CAK77555.1| unnamed protein product [Paramecium tetraurelia]
Length = 678
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 66/165 (40%), Gaps = 25/165 (15%)
Query: 12 GCERARRMLSCKSCGKKYHRNCLKN---WAQNRDLFHWSSWKCPSCRICEICRRTGDPNK 68
G E +L C++C K YH C N + Q R + +W C +C C+ C + G N
Sbjct: 460 GYEFLDNLLMCENCNKTYHFYCQINNSQYHQQRVMKSLQNWTCNNCVRCKECDKYGQKND 519
Query: 69 FMFCRRCDAAYHCYCQHPPHKNVSSGP--YLCPKHTKCHSCGS--------------NVP 112
+FC C+ YH C + G + C KC +C +
Sbjct: 520 -LFCCNCNEFYHFQCVFNNFIAPTDGLDYWKCKNCFKCANCQTTKLFGPELLSRIKPTTT 578
Query: 113 GNG--LSVRWFLGYTCCDACGRLFVKGNYCPVC---LKVYRDSES 152
N +S+ +F + C +CG +C C +++Y D +S
Sbjct: 579 SNTVHISIEYFQNFQYCLSCGLDVAGFRFCQFCEEYIQIYNDQQS 623
>gi|66356556|ref|XP_625456.1| 2x PHD domain containing protein [Cryptosporidium parvum Iowa II]
gi|46226407|gb|EAK87407.1| 2x PHD domain containing protein [Cryptosporidium parvum Iowa II]
Length = 933
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 23/43 (53%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
CE+CR + C RCD YH YC PP +V SG + CP
Sbjct: 272 CEVCRLNDHEEVLLLCDRCDRGYHTYCLDPPLDSVPSGEWFCP 314
>gi|297792715|ref|XP_002864242.1| hypothetical protein ARALYDRAFT_918421 [Arabidopsis lyrata subsp.
lyrata]
gi|297310077|gb|EFH40501.1| hypothetical protein ARALYDRAFT_918421 [Arabidopsis lyrata subsp.
lyrata]
Length = 1049
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 127 CDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEKYLQFQVDGNL 186
C C RL C +C K++ +S V CD C+ W+H CD IS + F+ G
Sbjct: 409 CKLCSRLTKPKQVCGICKKIWNHLDSQSWVRCDGCKVWIHSACDQISHK---HFKDLGET 465
Query: 187 QYRCPTCR 194
Y CPTCR
Sbjct: 466 DYYCPTCR 473
>gi|242053849|ref|XP_002456070.1| hypothetical protein SORBIDRAFT_03g029850 [Sorghum bicolor]
gi|241928045|gb|EES01190.1| hypothetical protein SORBIDRAFT_03g029850 [Sorghum bicolor]
Length = 1051
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 45/99 (45%), Gaps = 7/99 (7%)
Query: 98 CPKHTKCHSCGSNVPGNGLS-VRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMV 156
C + +C SCG+ P + + + C C + YC +CLK + V
Sbjct: 390 CRRALQCESCGNCFPNKDTNKMVHVMEQLACRLCAGILALKKYCGICLKSLQHKYGGRWV 449
Query: 157 CCDVCQRWVHCQCD-GISDEKYLQFQVDGNLQYRCPTCR 194
CC C+ WVH +CD S+ K LQ + YRCP CR
Sbjct: 450 CCHGCESWVHAECDENCSNLKDLQ-----DNSYRCPYCR 483
>gi|390478964|ref|XP_002762152.2| PREDICTED: zinc finger protein neuro-d4 [Callithrix jacchus]
Length = 364
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 48/116 (41%), Gaps = 16/116 (13%)
Query: 2 CRLCFVGENE-GCERARRMLSCKSCGKKYHRNCLK---NWAQNRDLFHWSSWKCPSCRIC 57
C C G + GC ++SC CG+ H +CL+ N + W +C SC +C
Sbjct: 241 CDFCLGGSKKTGC--PEDLISCADCGRSGHPSCLQFTVNMTAAVRTYRWQCIECKSCSLC 298
Query: 58 EICRRTGDP-------NKFMFCRRCDAAYHCYCQHPPHKNVSSGPY---LCPKHTK 103
G ++ +FC CD YH YC PP G + LC +H K
Sbjct: 299 GTSENDGASWAGLTPQDQLLFCDDCDRGYHMYCLSPPMAEPPEGSWSCHLCLRHLK 354
>gi|67590829|ref|XP_665508.1| KIAA1453 protein [Cryptosporidium hominis TU502]
gi|54656232|gb|EAL35279.1| KIAA1453 protein [Cryptosporidium hominis]
Length = 933
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 23/43 (53%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
CE+CR + C RCD YH YC PP +V SG + CP
Sbjct: 272 CEVCRLNDHEEVLLLCDRCDRGYHTYCLDPPLDSVPSGEWFCP 314
>gi|332025406|gb|EGI65573.1| Atherin [Acromyrmex echinatior]
Length = 719
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 43/92 (46%), Gaps = 8/92 (8%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEICR 61
C LC E+ +++C+ C + H +C+ + + + S+W+C C+ C +C
Sbjct: 485 CSLC------AKEKQEALVACRDCTVRAHPSCIYSPEEMLQKAN-SNWQCERCKTCTVCC 537
Query: 62 RTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSS 93
T D C CD AYH +C HP + S
Sbjct: 538 ETSDAGPLATCYNCDDAYHYFC-HPSRVTIKS 568
>gi|410956350|ref|XP_003984805.1| PREDICTED: histone acetyltransferase KAT6A-like, partial [Felis
catus]
Length = 302
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 40/89 (44%), Gaps = 2/89 (2%)
Query: 1 MCRLCFVGENEGCER-ARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCRICEI 59
+C C + + E+ ++SC CG H +CLK + W+C C+ C
Sbjct: 208 ICSFCLGTKEQNREKKPEELISCADCGNSGHPSCLKFSPELTVRVKALRWQCIECKTCSS 267
Query: 60 CRRTG-DPNKFMFCRRCDAAYHCYCQHPP 87
CR G + + +FC CD +H C PP
Sbjct: 268 CRDQGKNADNMLFCDSCDRGFHMECCDPP 296
>gi|167526880|ref|XP_001747773.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773877|gb|EDQ87513.1| predicted protein [Monosiga brevicollis MX1]
Length = 807
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%)
Query: 48 SWKCPSCRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCP 99
+WKC C+ C C + G+ ++ +FC CDAA H YC P V + CP
Sbjct: 281 AWKCIRCKTCMRCHKKGNADQLLFCDGCDAAIHTYCCRPKLNGVPDSDFYCP 332
>gi|324504913|gb|ADY42117.1| Histone acetyltransferase MYST2 [Ascaris suum]
Length = 788
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 51/116 (43%), Gaps = 12/116 (10%)
Query: 2 CRLCFVGENEGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWS---SWKCPSCRICE 58
C +C G+++G +R C SC YH CL+ A++ L + W CP C C
Sbjct: 238 CHVC-KGDSDGLKR------CCSCRVLYHLQCLEYSAEHASLIAETRKDDWLCPKCTFCT 290
Query: 59 ICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGN 114
+C + + C CD AYH C+ PH +S + K C C S P N
Sbjct: 291 VCAEYISDRENVQCLMCDRAYHGACR--PHAEGNSESFDPTKPFYCPECASKKPCN 344
>gi|291241106|ref|XP_002740458.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 1294
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C ICRR GD + + C CD +H YC PP K++ SG + C
Sbjct: 778 CRICRRKGDAERMLLCDGCDRGHHMYCLKPPVKSIPSGDWYC 819
Score = 42.7 bits (99), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 14/42 (33%), Positives = 24/42 (57%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C+ C + G + + C C +AYH C +PP K + +G ++C
Sbjct: 911 CDECAKCGREGQLILCETCPSAYHLKCANPPLKKIPAGKWIC 952
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 28/113 (24%), Positives = 43/113 (38%), Gaps = 27/113 (23%)
Query: 8 GENEGCERARR---MLSCKSCGKKYHRNCLKNWAQNRDLFHWSS--WKCPSCRI------ 56
G ++ C R RR ++ C SC +H +C+ + L W C C +
Sbjct: 999 GHSDRCARCRRGGELILCDSCPLSFHLDCV-----DPPLLGVPPDIWLCQLCVLEAESSP 1053
Query: 57 -----------CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLC 98
C++C R + + C C A+H C PP V SG + C
Sbjct: 1054 LEGCSDGTDSHCDVCARCYKHGQLILCDVCPLAFHLRCTDPPLLKVPSGKWTC 1106
Score = 39.3 bits (90), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 35/144 (24%), Positives = 55/144 (38%), Gaps = 31/144 (21%)
Query: 57 CEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPKHTKCHSCGSNVPGNGL 116
C CRR G+ + C C ++H C PP V +LC C + P G
Sbjct: 1004 CARCRRGGE---LILCDSCPLSFHLDCVDPPLLGVPPDIWLC---QLCVLEAESSPLEG- 1056
Query: 117 SVRWFLGYTCCDACGRLFVKGNYCPVCLKVYRDSESTPMVCCDVCQRWVHCQCDGISDEK 176
C D ++C VC + Y+ + ++ CDVC H +C +D
Sbjct: 1057 ---------CSDGTD------SHCDVCARCYKHGQ---LILCDVCPLAFHLRC---TDPP 1095
Query: 177 YLQFQVDGNLQYRCPTCRGECYQV 200
L+ + ++ C C +C V
Sbjct: 1096 LLKVP---SGKWTCQICVKDCQPV 1116
>gi|380020464|ref|XP_003694103.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Apis florea]
Length = 1964
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 44/103 (42%), Gaps = 16/103 (15%)
Query: 11 EGCERARRMLSCKSCGKKYHRNCLKNWAQNRDLFHWSSWKCPSCR-------------IC 57
E C++ ++ C +C + YH CL+ + WS CP C
Sbjct: 372 EVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWS---CPHCEGEGIAGAAEDDDEHM 428
Query: 58 EICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
E CR D + + C C +AYH +C +PP + G + CP+
Sbjct: 429 EFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPR 471
>gi|219880763|gb|ACL51656.1| jumonji AT-rich interactive domain 1D [Callithrix jacchus]
Length = 1508
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%)
Query: 54 CRICEICRRTGDPNKFMFCRRCDAAYHCYCQHPPHKNVSSGPYLCPK 100
C +C+IC R + +K +FC CD YH +C PP + G + CPK
Sbjct: 308 CYVCQICSRGDEDDKLLFCDGCDDCYHIFCLLPPLPEIPRGIWRCPK 354
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,817,498,168
Number of Sequences: 23463169
Number of extensions: 524098497
Number of successful extensions: 1482237
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2209
Number of HSP's successfully gapped in prelim test: 2751
Number of HSP's that attempted gapping in prelim test: 1462956
Number of HSP's gapped (non-prelim): 16664
length of query: 719
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 569
effective length of database: 8,839,720,017
effective search space: 5029800689673
effective search space used: 5029800689673
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 81 (35.8 bits)