BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002995
(860 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255573673|ref|XP_002527758.1| phd finger protein, putative [Ricinus communis]
gi|223532845|gb|EEF34619.1| phd finger protein, putative [Ricinus communis]
Length = 1103
Score = 1437 bits (3721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 697/866 (80%), Positives = 772/866 (89%), Gaps = 9/866 (1%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWYSG VVGY SE+ RHHV+Y DGD+EDL++SNE+IKFYIS+EEM+ L L+FSI + D D
Sbjct: 239 DWYSGCVVGYTSETKRHHVEYQDGDKEDLVISNEKIKFYISREEMEQLNLTFSIKSADGD 298
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
YDYDEMV LAA LDDCQ+LEPGDIIWAKLTGHAMWPAIVVD+SLIG+ KGLNKISG RS
Sbjct: 299 CYDYDEMVALAAVLDDCQDLEPGDIIWAKLTGHAMWPAIVVDQSLIGERKGLNKISGERS 358
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFARI KQVISFLKGLLSSFHLKC+KP FT+SLEEAK+YLSEQKLPRRML
Sbjct: 359 VFVQFFGTHDFARIKPKQVISFLKGLLSSFHLKCRKPHFTRSLEEAKMYLSEQKLPRRML 418
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QLQN++ AD +++ S+DEGS S E+C +ER+Q L + SPYV GDLQI+SLGKIV
Sbjct: 419 QLQNSMNADSCKSASSEDEGSSDSSEDCIDNERIQRILRGLETSPYVIGDLQIISLGKIV 478
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
KDSEYFQ+DRFIWPEGYTA+RKFTS+ DP C YKMEVLRD ESKIRPLFRVTLDNGEQ
Sbjct: 479 KDSEYFQNDRFIWPEGYTALRKFTSVTDPSACTIYKMEVLRDAESKIRPLFRVTLDNGEQ 538
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
GSTP CW KI +IR+ Q + SD FSAEG E+ +SGSDMFGFSNPEVMKLI GL+
Sbjct: 539 IRGSTPCACWDKIYRRIRKLQYSASDGFSAEGVVERFYKSGSDMFGFSNPEVMKLIKGLS 598
Query: 363 KSRPTSKSSLCKLTS-KYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
KSR SK S+CKLTS +Y+DLP GYRPVRVDWKDLDKC+VCHMDEEY+NNLFLQCDKCRM
Sbjct: 599 KSRLYSKMSICKLTSERYQDLPVGYRPVRVDWKDLDKCNVCHMDEEYENNLFLQCDKCRM 658
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGELEPV+GVLW CNLCRPGAP+ PPCCLCPV+GGAMKPTTDGRWAHLACAIW
Sbjct: 659 MVHARCYGELEPVDGVLWYCNLCRPGAPD-SPPCCLCPVIGGAMKPTTDGRWAHLACAIW 717
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL+D+KRMEPIDGLNR++KDRWKLLCSICGV+YGACIQCSN TCRVAYHPLCARAA
Sbjct: 718 IPETCLSDIKRMEPIDGLNRINKDRWKLLCSICGVAYGACIQCSNNTCRVAYHPLCARAA 777
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYI 601
GLCVELEDE+RL+LLS+D+D EDQCIRLLSFCK+HKQP N+R +ER+ ++T R DYI
Sbjct: 778 GLCVELEDEERLHLLSVDDDVEDQCIRLLSFCKRHKQPSNERPVTEERIGRITHRYSDYI 837
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
PP NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ+ SG TLPS
Sbjct: 838 PPCNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQHESSGITLPS 897
Query: 662 IRVIGSKFSFSL------HRDAP-NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKH 714
V GS+FS +L DAP N +SMA+KY++M++TFRKRLAFGKSGIHGFGIFAKH
Sbjct: 898 NGVEGSRFSSNLQWLKTSQLDAPNNIISMAEKYEYMRQTFRKRLAFGKSGIHGFGIFAKH 957
Query: 715 PHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI 774
PHRAGDMVIEYTGELVRP IADRREHFIYNSLVGAGTYMFRI+DERVIDATRAGSIAHLI
Sbjct: 958 PHRAGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRINDERVIDATRAGSIAHLI 1017
Query: 775 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGV 834
NHSCEPNCYSRVISVNGDEHIIIFAKRDIK+WEELTYDYRFFSIDEQLACYCGFPRCRGV
Sbjct: 1018 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1077
Query: 835 VNDTEAEEQVAKLYAPRSELIDWRGD 860
VND EAEEQVAKLYAPR+ELID++G+
Sbjct: 1078 VNDIEAEEQVAKLYAPRNELIDFKGE 1103
>gi|224065044|ref|XP_002301643.1| SET domain protein [Populus trichocarpa]
gi|222843369|gb|EEE80916.1| SET domain protein [Populus trichocarpa]
Length = 1014
Score = 1384 bits (3583), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 672/866 (77%), Positives = 748/866 (86%), Gaps = 24/866 (2%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWYSG VVG+ S++NR++++Y DGD+EDL+LSNE++KF+IS EEM+ L LS + + D D
Sbjct: 165 DWYSGRVVGHISDTNRYNIEYEDGDKEDLMLSNEKVKFFISGEEMERLNLSVCVKSTDGD 224
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
Y+EMVVLAASLDDCQ+LEPGDIIWAKLTGHAMWPAIVVD +LIGD+KG++K GG S
Sbjct: 225 RNYYNEMVVLAASLDDCQDLEPGDIIWAKLTGHAMWPAIVVDGALIGDHKGISKNIGGGS 284
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
I VQFFGTHDFARI KQ ISFLKGLLSSFHLKCK+PRFT+SLEEAK+YLSEQKL RRML
Sbjct: 285 ISVQFFGTHDFARIKPKQAISFLKGLLSSFHLKCKQPRFTRSLEEAKMYLSEQKLSRRML 344
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QLQN ++AD E++ S DEGS SGE+C +D +Q L LGKIV
Sbjct: 345 QLQNGMKADSCESA-SSDEGSTDSGEDCMQDGGIQRILAR---------------LGKIV 388
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
KDSE+FQD+RFIWPEGYTA+RKFTS+ DP V YKMEVLRD ESKIRPLFRVTLDNGE+
Sbjct: 389 KDSEHFQDNRFIWPEGYTALRKFTSIKDPNVRMIYKMEVLRDAESKIRPLFRVTLDNGEE 448
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
GSTP CW KI KIR+ Q+ S+ FSAE E+ +SGSDMFGFSNPEV+KL+ GL+
Sbjct: 449 INGSTPDACWDKIYRKIRKMQDGNSNGFSAESGGERKLKSGSDMFGFSNPEVIKLLKGLS 508
Query: 363 KSRPTSKSSLCKLTS-KYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
KS +SK S CKLTS +Y+ +P GYRPVRVDWKDLDKC+VCHMDEEY+NNLFLQCDKCRM
Sbjct: 509 KSIHSSKLSTCKLTSERYQGIPVGYRPVRVDWKDLDKCNVCHMDEEYENNLFLQCDKCRM 568
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGELEPV+GVLWLCNLCRPGAP PPPCCLCPV+GGAMKPTTDGRWAHLACAIW
Sbjct: 569 MVHARCYGELEPVDGVLWLCNLCRPGAPNSPPPCCLCPVIGGAMKPTTDGRWAHLACAIW 628
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL+DVKRMEPIDG +R++KDRWKLLCSICGV+YGACIQCSN TCRVAYHPLCARAA
Sbjct: 629 IPETCLSDVKRMEPIDGQSRINKDRWKLLCSICGVAYGACIQCSNNTCRVAYHPLCARAA 688
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYI 601
GLCVELEDEDRL LLSLDEDD DQCIRLLSFCKKH+QP NDR+ DER+ ++ RRC DYI
Sbjct: 689 GLCVELEDEDRLYLLSLDEDDADQCIRLLSFCKKHRQPSNDRMVTDERVGRIPRRCSDYI 748
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
PP NPSGCAR+EPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGY Q+ SG T+ S
Sbjct: 749 PPCNPSGCARTEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYSQHESSGCTIAS 808
Query: 662 IRVIGSKFSFSLHR------DAP-NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKH 714
+I S FS SL R AP N LSMA+KY+HM++TFRKRLAFGKSGIHGFGIFAKH
Sbjct: 809 NGLIKSVFSSSLQRLKASRLSAPSNILSMAEKYQHMRQTFRKRLAFGKSGIHGFGIFAKH 868
Query: 715 PHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI 774
PHRAGDMVIEYTGELVRP IADRRE FIYNSLVGAGTYMFRIDD+RVIDATRAGSIAHLI
Sbjct: 869 PHRAGDMVIEYTGELVRPPIADRRERFIYNSLVGAGTYMFRIDDKRVIDATRAGSIAHLI 928
Query: 775 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGV 834
NHSCEPNCYSRVISVNGDEHIIIFAKRDIK+WEELTYDYRFFSI+E+LACYCGFPRCRGV
Sbjct: 929 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIEEKLACYCGFPRCRGV 988
Query: 835 VNDTEAEEQVAKLYAPRSELIDWRGD 860
VNDTEAEEQVAKLYAPRSEL DW+G+
Sbjct: 989 VNDTEAEEQVAKLYAPRSELTDWKGE 1014
>gi|225433774|ref|XP_002268621.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Vitis
vinifera]
Length = 1084
Score = 1353 bits (3502), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 644/866 (74%), Positives = 742/866 (85%), Gaps = 8/866 (0%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
+WY G ++GYD E+NRH VKY DGD+E+LILS+E+IKFY+S+E+M L LS S+ ++D+D
Sbjct: 219 EWYRGCIIGYDLEANRHQVKYNDGDKEELILSSEKIKFYVSREDMQHLNLSLSVRSLDSD 278
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
DYDEMVVLAAS +DCQ+ EPGDIIWAKLTGHAMWPAIVVDES+I + KGLNKIS +S
Sbjct: 279 DIDYDEMVVLAASWNDCQDHEPGDIIWAKLTGHAMWPAIVVDESIIHNRKGLNKISKEKS 338
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+PVQFFG+HDFAR+ KQV FLKGLLSSFHLKC KP F QSL E+K YLSEQKL +RML
Sbjct: 339 LPVQFFGSHDFARVKTKQVTPFLKGLLSSFHLKCTKPHFHQSLVESKAYLSEQKLSKRML 398
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
++Q DD E+ +DE SG++C DER++ L + P+ GDLQ++ LGKIV
Sbjct: 399 RMQKLTEDDDCESMSGEDEKRTDSGDDCIGDERVKRKLDDLIKFPFEIGDLQVIRLGKIV 458
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
KDS+ FQ + FI PEGYTA+RKFTS+ DP +C YKMEVLRD ESKI+PLFRVTLDNGEQ
Sbjct: 459 KDSDRFQVEGFICPEGYTAMRKFTSITDPSLCALYKMEVLRDAESKIQPLFRVTLDNGEQ 518
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
F GSTPS+CW+KI +IR+ QN+ SD SAEG AEK++ESG DMFGFSNPE+ +L+ L+
Sbjct: 519 FQGSTPSSCWNKIFRRIRKMQNSASDGSSAEGGAEKLNESGFDMFGFSNPEIFRLVQELS 578
Query: 363 KSRPTSKSSLCK-LTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
S+ +SK S+ K ++ +Y+DL GYRPVRVDWKDLDKCSVCHMDEEY+NNLFLQCDKCRM
Sbjct: 579 TSKISSKFSMSKSISRRYQDLSSGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRM 638
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGELEPV+GVLWLC LC PGAP+ PPPCCLCPV GGAMKPTTDGRWAHLACAIW
Sbjct: 639 MVHARCYGELEPVDGVLWLCKLCGPGAPDSPPPCCLCPVTGGAMKPTTDGRWAHLACAIW 698
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL+D+K MEPIDGL+R++KDRWKLLCSICGVSYGACIQCSN+TCRVAYHPLCARAA
Sbjct: 699 IPETCLSDIKTMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNSTCRVAYHPLCARAA 758
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYI 601
GLCVELEDEDRL+L+S+++D++DQCIRLLSFCKKH+QP N+R A DER+ QV R C +Y
Sbjct: 759 GLCVELEDEDRLHLISVEDDEDDQCIRLLSFCKKHRQPSNERTAFDERIGQVARECSNYN 818
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
PPSNPSGCAR+EPYN+FGRRGRKEPEALAAASLKRLFV+N+PYLVGGYCQ+ GN L S
Sbjct: 819 PPSNPSGCARTEPYNHFGRRGRKEPEALAAASLKRLFVDNRPYLVGGYCQHESLGNPLSS 878
Query: 662 IRVIGSKFSFSLHR------DAP-NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKH 714
+ GSKFSF + DAP + LSM +KY +M+ETFRKRLAFGKSGIHGFGIFAK
Sbjct: 879 SALSGSKFSFRNQKIKASQLDAPKSILSMVEKYNYMRETFRKRLAFGKSGIHGFGIFAKQ 938
Query: 715 PHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI 774
PHRAGDMVIEYTGELVRPSIADRRE IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 939 PHRAGDMVIEYTGELVRPSIADRRERLIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI 998
Query: 775 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGV 834
NHSCEPNCYSRVIS NGD+HIIIFAKRDIK+WEELTYDYRFFSIDEQLACYCGFPRCRGV
Sbjct: 999 NHSCEPNCYSRVISFNGDDHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1058
Query: 835 VNDTEAEEQVAKLYAPRSELIDWRGD 860
VND +AEE++AK YAPRSELI W G+
Sbjct: 1059 VNDIDAEERMAKRYAPRSELIGWIGE 1084
>gi|224128834|ref|XP_002320433.1| SET domain protein [Populus trichocarpa]
gi|222861206|gb|EEE98748.1| SET domain protein [Populus trichocarpa]
Length = 1050
Score = 1350 bits (3494), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 653/874 (74%), Positives = 736/874 (84%), Gaps = 54/874 (6%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
+WYSG VVG+ +++NR++++Y DGD+EDLI+SNE++KF+IS EEM+ L L+ S+ + D D
Sbjct: 212 EWYSGRVVGHIADTNRYNIEYEDGDKEDLIISNEKVKFFISHEEMERLNLTVSVKSTDGD 271
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
YDY+EMVVLAASLDDCQ+L+PGDIIWAK+TGHAMWPAIVVDE+LIG++KGL+K GGRS
Sbjct: 272 RYDYNEMVVLAASLDDCQDLDPGDIIWAKVTGHAMWPAIVVDEALIGNHKGLSKNIGGRS 331
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFAR YLSEQKLPRRML
Sbjct: 332 VSVQFFGTHDFAR-----------------------------------YLSEQKLPRRML 356
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QLQN ++AD +++ S+DEGS SGE+C +D ++ L +G SPYV GDLQI+SLGKIV
Sbjct: 357 QLQNGMKADSCDSASSEDEGSTDSGEDCIQDGGIRRILARLGTSPYVIGDLQIISLGKIV 416
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
KDSE+FQDDRFIWPEGYTA+RKFTS+ DP V YKMEVLRD ESKIRPLFRVTLDNGE+
Sbjct: 417 KDSEHFQDDRFIWPEGYTALRKFTSIKDPNVHMMYKMEVLRDAESKIRPLFRVTLDNGEE 476
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
GSTP+ CW KI KIR+ Q++TS+ FS EG +I +SGS+MFGFSNPEV+KLI GL+
Sbjct: 477 IKGSTPAACWDKIYRKIRKMQDSTSNGFSTEGGVGRILKSGSEMFGFSNPEVIKLIKGLS 536
Query: 363 KSRPTSKSSLCKLTS-KYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
KSR +SK S+CKL+S +Y+ +P GYRPVRVDWKDLDKC+VCHMDEEY+NNLFLQCDKCRM
Sbjct: 537 KSRHSSKLSMCKLSSERYQGIPVGYRPVRVDWKDLDKCNVCHMDEEYENNLFLQCDKCRM 596
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGELEPV+GVLWLCNLCRPGAP+ PPCCLCPV+GGAMKPTTDGRWAHLACAIW
Sbjct: 597 MVHARCYGELEPVDGVLWLCNLCRPGAPDSTPPCCLCPVIGGAMKPTTDGRWAHLACAIW 656
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL+DVKRMEPIDGLNR++KDRWKLLCSICGV+YGACIQCSN CRVAYHPLCARAA
Sbjct: 657 IPETCLSDVKRMEPIDGLNRINKDRWKLLCSICGVAYGACIQCSNNACRVAYHPLCARAA 716
Query: 542 GLCVE-----------LEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERL 590
GLCVE LEDEDRL LLSLDEDD DQCIRLLSFCKKH+QP N+R+ DER+
Sbjct: 717 GLCVEVFFIKHSEICTLEDEDRLYLLSLDEDDADQCIRLLSFCKKHRQPSNERVVTDERV 776
Query: 591 VQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYC 650
Q+ RRC DYIPP N SGCAR+EPYNYFGRRGRKEPE LAAASLKRLFVENQPYLVGGY
Sbjct: 777 GQIPRRCSDYIPPCNLSGCARTEPYNYFGRRGRKEPEVLAAASLKRLFVENQPYLVGGYS 836
Query: 651 QNGLSGNTLPSIRVIGSKFSFSLHR------DAP-NFLSMADKYKHMKETFRKRLAFGKS 703
Q+ SG TL S +I S FS SL R DAP N LSMA+KY+HM+ TFRKRLAFGKS
Sbjct: 837 QHESSGCTLASNGLINSGFSSSLQRLRASQLDAPSNILSMAEKYQHMRHTFRKRLAFGKS 896
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
GIHGFGIFAKHPHRAGDMVIEYTGELVRP IADRREHFIYNSLVGAGTYMFRIDD+RVID
Sbjct: 897 GIHGFGIFAKHPHRAGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDKRVID 956
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLA 823
ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK+WEELTYDYRFFSI+E+LA
Sbjct: 957 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIEEKLA 1016
Query: 824 CYCGFPRCRGVVNDTEAEEQVAKLYAPRSELIDW 857
CYCGF RCRGVVNDTEAEEQVAKLYAPRSEL DW
Sbjct: 1017 CYCGFSRCRGVVNDTEAEEQVAKLYAPRSELTDW 1050
>gi|297745169|emb|CBI39161.3| unnamed protein product [Vitis vinifera]
Length = 1068
Score = 1335 bits (3454), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 638/866 (73%), Positives = 734/866 (84%), Gaps = 24/866 (2%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
+WY G ++GYD E+NRH VKY DGD+E+LILS+E+IKFY+S+E+M L LS S+ ++D+D
Sbjct: 219 EWYRGCIIGYDLEANRHQVKYNDGDKEELILSSEKIKFYVSREDMQHLNLSLSVRSLDSD 278
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
DYDEMVVLAAS +DCQ+ EPGDIIWAKLTGHAMWPAIVVDES+I + KGLNKIS +S
Sbjct: 279 DIDYDEMVVLAASWNDCQDHEPGDIIWAKLTGHAMWPAIVVDESIIHNRKGLNKISKEKS 338
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+PVQFFG+HDFAR+ KQV FLKGLLSSFHLKC KP F QSL E+K YLSEQKL +RML
Sbjct: 339 LPVQFFGSHDFARVKTKQVTPFLKGLLSSFHLKCTKPHFHQSLVESKAYLSEQKLSKRML 398
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
++Q DD E+ +DE SG++C DER++ L + P+ GDLQ++ LGKIV
Sbjct: 399 RMQKLTEDDDCESMSGEDEKRTDSGDDCIGDERVKRKLDDLIKFPFEIGDLQVIRLGKIV 458
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
KDS+ FQ + FI PEGYTA+RKFTS+ DP +C YKMEVLRD ESKI+PLFRVTLDNGEQ
Sbjct: 459 KDSDRFQVEGFICPEGYTAMRKFTSITDPSLCALYKMEVLRDAESKIQPLFRVTLDNGEQ 518
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
F GSTPS+CW+KI +IR+ QN+ SD SAEG AEK++ESG DMFGFSNPE+ +L+ L+
Sbjct: 519 FQGSTPSSCWNKIFRRIRKMQNSASDGSSAEGGAEKLNESGFDMFGFSNPEIFRLVQELS 578
Query: 363 KSRPTSKSSLCK-LTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
S+ +SK S+ K ++ +Y+DL GYRPVRVDWKDLDKCSVCHMDEEY+NNLFLQCDKCRM
Sbjct: 579 TSKISSKFSMSKSISRRYQDLSSGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRM 638
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGELEPV+GVLWLC LC PGAP+ PPPCCLCPV GGAMKPTTDGRWAHLACAIW
Sbjct: 639 MVHARCYGELEPVDGVLWLCKLCGPGAPDSPPPCCLCPVTGGAMKPTTDGRWAHLACAIW 698
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL+D+K MEPIDGL+R++KDRWKLLCSICGVSYGACIQCSN+TCRVAYHPLCARAA
Sbjct: 699 IPETCLSDIKTMEPIDGLSRINKDRWKLLCSICGVSYGACIQCSNSTCRVAYHPLCARAA 758
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYI 601
GLCVELEDEDRL+L+S+++D++DQCIRLLSFCKKH+QP N+R A DER+ QV R C +Y
Sbjct: 759 GLCVELEDEDRLHLISVEDDEDDQCIRLLSFCKKHRQPSNERTAFDERIGQVARECSNYN 818
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
PPSNPSGCAR+EPYN+FGRRGRKEPEALAAASLKRLFV+N+PYLVGGYC
Sbjct: 819 PPSNPSGCARTEPYNHFGRRGRKEPEALAAASLKRLFVDNRPYLVGGYC----------- 867
Query: 662 IRVIGSKFSFSLHR------DAP-NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKH 714
SKFSF + DAP + LSM +KY +M+ETFRKRLAFGKSGIHGFGIFAK
Sbjct: 868 -----SKFSFRNQKIKASQLDAPKSILSMVEKYNYMRETFRKRLAFGKSGIHGFGIFAKQ 922
Query: 715 PHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI 774
PHRAGDMVIEYTGELVRPSIADRRE IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI
Sbjct: 923 PHRAGDMVIEYTGELVRPSIADRRERLIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI 982
Query: 775 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGV 834
NHSCEPNCYSRVIS NGD+HIIIFAKRDIK+WEELTYDYRFFSIDEQLACYCGFPRCRGV
Sbjct: 983 NHSCEPNCYSRVISFNGDDHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGFPRCRGV 1042
Query: 835 VNDTEAEEQVAKLYAPRSELIDWRGD 860
VND +AEE++AK YAPRSELI W G+
Sbjct: 1043 VNDIDAEERMAKRYAPRSELIGWIGE 1068
>gi|449441169|ref|XP_004138356.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Cucumis
sativus]
Length = 1036
Score = 1283 bits (3320), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 638/862 (74%), Positives = 737/862 (85%), Gaps = 7/862 (0%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY G VVGY+SE++ HH++Y DGD EDL+LSNE++KF+IS EEM L L+F +++VD+D
Sbjct: 177 WYCGRVVGYNSETSCHHIEYEDGDREDLVLSNEKVKFHISGEEMQTLNLNFGVDSVDSDA 236
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSI 123
YDY+EM+VLAA+LDDC E EPGDI+WAKLTGHAMWPAI+VDESLIGD KGL ISGGR++
Sbjct: 237 YDYNEMLVLAATLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNISGGRTV 296
Query: 124 PVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRMLQ 183
PVQFFGTHDFARI VKQ ISFLKGLLS FH KCKKP F +SLEEAK+YLSEQKLP MLQ
Sbjct: 297 PVQFFGTHDFARIKVKQAISFLKGLLSFFHQKCKKPHFMRSLEEAKMYLSEQKLPPSMLQ 356
Query: 184 LQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIVK 243
LQN I DD ++ ++EG+ SGE C + + SP+ GDL+I+SLGKIVK
Sbjct: 357 LQNGIEVDDFASASGEEEGTTDSGEECLNEGGGVRCALNGYRSPFKVGDLEIISLGKIVK 416
Query: 244 DSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQF 303
DS+YFQ+D +WPEGYTAVRKF+SL DP VC Y+MEVLRD ESK RPLFRVTLDNGEQF
Sbjct: 417 DSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYRMEVLRDFESKFRPLFRVTLDNGEQF 476
Query: 304 TGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTK 363
GS+PS CW+KI ++++ Q+ + D S E E + +SGSDMFGFSNP+V KLI G++K
Sbjct: 477 KGSSPSACWNKIYKRMKKIQHTS--DASTETKGEFVYKSGSDMFGFSNPDVKKLIQGISK 534
Query: 364 SRPTSKSSLCKLTSK-YRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMM 422
S +S SL K+ SK Y+D P GYRPVRVDWKDLDKCSVCHMDEEY+NNLFLQCDKCRMM
Sbjct: 535 SGLSSSRSLSKVASKKYKDFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRMM 594
Query: 423 VHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWI 482
VHARCYGELEPV+GV+WLCNLCRPG+P+ PPPCCLCPV+GGAMKPTTDGRWAHLACAIWI
Sbjct: 595 VHARCYGELEPVDGVIWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACAIWI 654
Query: 483 PETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAG 542
PETCL+D+K+MEPIDGLNR++KDRWKLLCSICGVSYGACIQCSN TC VAYHPLCARAAG
Sbjct: 655 PETCLSDIKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAAG 714
Query: 543 LCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIP 602
LCVELE++DRL+LL+ DED+EDQCIRLLSFCKKH+ P N+RL ++R+ Q ++C +Y P
Sbjct: 715 LCVELEEDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCSNYTP 774
Query: 603 PSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
P NPSGCAR+EPYNYF RRGRK PEA+AAA+LKRLFVENQPY+ GY Q+ LSGN LPS
Sbjct: 775 PCNPSGCARTEPYNYFERRGRKAPEAVAAAALKRLFVENQPYIASGYSQHLLSGNLLPSS 834
Query: 663 RVIGSKFSF----SLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRA 718
V+G KFS + D N LS+A+KYK M+ETFRKRLAFGKSGIHGFGIFAKHPHRA
Sbjct: 835 GVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPHRA 894
Query: 719 GDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 778
GDMVIEYTGE+VRP IADRRE FIYN LVGAGTYMFRIDDERVIDATRAGSIAHLINHSC
Sbjct: 895 GDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 954
Query: 779 EPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDT 838
EPNCYSRVISVNGDEHIIIFAKRDIK+WEELTYDYRFFSIDEQLACYCG+PRCRGVVNDT
Sbjct: 955 EPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGVVNDT 1014
Query: 839 EAEEQVAKLYAPRSELIDWRGD 860
+ EE+V+KL+ R++L+DWRG+
Sbjct: 1015 DEEERVSKLHVSRTDLVDWRGE 1036
>gi|449496792|ref|XP_004160228.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ATX2-like [Cucumis sativus]
Length = 1095
Score = 1277 bits (3305), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 636/863 (73%), Positives = 736/863 (85%), Gaps = 8/863 (0%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY G VVGY+SE++ HH++Y DGD EDL+LSNE++KF+IS EEM L L+F +++VD+D
Sbjct: 235 WYCGRVVGYNSETSCHHIEYEDGDREDLVLSNEKVKFHISGEEMQTLNLNFGVDSVDSDA 294
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSI 123
YDY+EM+VLAA+LDDC E EPGDI+WAKLTGHAMWPAI+VDESLIGD KGL ISGGR++
Sbjct: 295 YDYNEMLVLAATLDDCLEPEPGDIVWAKLTGHAMWPAIIVDESLIGDRKGLRNISGGRTV 354
Query: 124 PVQFFGTHDFARINV-KQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
PVQFFGTHDFAR + KQ ISFLKGLLS FH KCKKP F +SLEEAK+YLSEQKLP ML
Sbjct: 355 PVQFFGTHDFARFDXSKQAISFLKGLLSXFHQKCKKPHFMRSLEEAKMYLSEQKLPPSML 414
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QLQN I DD ++ ++EG+ SGE C + + SP+ GDL+I+SLGKIV
Sbjct: 415 QLQNGIEVDDFASASGEEEGTTDSGEECLNEGGGVRCALNGYRSPFKVGDLEIISLGKIV 474
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
KDS+YFQ+D +WPEGYTAVRKF+SL DP VC Y+MEVLRD ESK RPLFRVTLDNGEQ
Sbjct: 475 KDSKYFQNDGSVWPEGYTAVRKFSSLTDPNVCTLYRMEVLRDFESKFRPLFRVTLDNGEQ 534
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
F GS+PS CW+KI ++++ Q+ + D S E E + +SGSDMFGFSNP+V KLI G++
Sbjct: 535 FKGSSPSACWNKIYKRMKKIQHTS--DASTETKGEFVYKSGSDMFGFSNPDVKKLIQGIS 592
Query: 363 KSRPTSKSSLCKLTSK-YRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
KS +S SL K+ SK Y+D P GYRPVRVDWKDLDKCSVCHMDEEY+NNLFLQCDKCRM
Sbjct: 593 KSGLSSSRSLSKVASKKYKDFPIGYRPVRVDWKDLDKCSVCHMDEEYENNLFLQCDKCRM 652
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGELEPV+GV+WLCNLCRPG+P+ PPPCCLCPV+GGAMKPTTDGRWAHLACAIW
Sbjct: 653 MVHARCYGELEPVDGVIWLCNLCRPGSPDCPPPCCLCPVIGGAMKPTTDGRWAHLACAIW 712
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL+D+K+MEPIDGLNR++KDRWKLLCSICGVSYGACIQCSN TC VAYHPLCARAA
Sbjct: 713 IPETCLSDIKKMEPIDGLNRINKDRWKLLCSICGVSYGACIQCSNNTCYVAYHPLCARAA 772
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYI 601
GLCVELE++DRL+LL+ DED+EDQCIRLLSFCKKH+ P N+RL ++R+ Q ++C +Y
Sbjct: 773 GLCVELEEDDRLHLLAADEDEEDQCIRLLSFCKKHRPPSNERLMAEDRIGQAGQQCSNYT 832
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
PP NPSGCAR+EPYNYF RRGRK PEA+AAA+LKRLFVENQPY+ GY Q+ LSGN LPS
Sbjct: 833 PPCNPSGCARTEPYNYFERRGRKAPEAVAAAALKRLFVENQPYIASGYSQHLLSGNLLPS 892
Query: 662 IRVIGSKFSF----SLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHR 717
V+G KFS + D N LS+A+KYK M+ETFRKRLAFGKSGIHGFGIFAKHPHR
Sbjct: 893 SGVLGMKFSLQHLKTCQLDPRNILSVAEKYKFMRETFRKRLAFGKSGIHGFGIFAKHPHR 952
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
AGDMVIEYTGE+VRP IADRRE FIYN LVGAGTYMFRIDDERVIDATRAGSIAHLINHS
Sbjct: 953 AGDMVIEYTGEIVRPPIADRRERFIYNLLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 1012
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVND 837
CEPNCYSRVISVNGDEHIIIFAKRDIK+WEELTYDYRFFSIDEQLACYCG+PRCRGVVND
Sbjct: 1013 CEPNCYSRVISVNGDEHIIIFAKRDIKRWEELTYDYRFFSIDEQLACYCGYPRCRGVVND 1072
Query: 838 TEAEEQVAKLYAPRSELIDWRGD 860
T+ EE+V+KL+ R++L+DWRG+
Sbjct: 1073 TDEEERVSKLHVSRTDLVDWRGE 1095
>gi|342209882|gb|AEL16988.1| trithorax-like protein [Phaseolus vulgaris]
Length = 1089
Score = 1264 bits (3270), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 631/867 (72%), Positives = 720/867 (83%), Gaps = 17/867 (1%)
Query: 5 YSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSI---NNVDN 61
Y+G+V YD E+ HHVKY DGDEE+LILSNE I+F++S++E+ LKL+F+ NNV +
Sbjct: 229 YTGYVRSYDKETKIHHVKYDDGDEENLILSNENIRFHVSRDEVKHLKLNFAKVRDNNVSD 288
Query: 62 DGYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGR 121
YD +EM+ +AASLDDCQ+ EPGDIIWAKLTGHAMWPA+V+DESL + KGL I GGR
Sbjct: 289 --YDVEEMLAMAASLDDCQDFEPGDIIWAKLTGHAMWPAVVLDESLARNCKGLKMILGGR 346
Query: 122 SIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRM 181
S+PVQFFGTHDFAR+ V+QV SFL GLL+ H KCKK F + LEEAK YLSEQKLP M
Sbjct: 347 SVPVQFFGTHDFARVRVQQVKSFLSGLLTDLHSKCKKHSFIEGLEEAKRYLSEQKLPLEM 406
Query: 182 LQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
++LQ AD+ N +D G SG+ C ER L ++ PYV GDLQILSLGKI
Sbjct: 407 IELQKRCTADNCNNVSGEDGGCTDSGDECLNGERTLMALENVETFPYVVGDLQILSLGKI 466
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
VKDS F+D +FIWPEGYTAVRKFTS+ DP V YKMEVLRD ESK+RPLFRVT+D GE
Sbjct: 467 VKDSA-FRDGKFIWPEGYTAVRKFTSVTDPEVFAPYKMEVLRDPESKVRPLFRVTVDGGE 525
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGL 361
QF G+TPS CWS++ KI++ + N S+ AEG EK ESGSDMFGFSNP+V+KLI GL
Sbjct: 526 QFNGNTPSACWSEVFKKIKKMEKNASEGTLAEGGVEKGYESGSDMFGFSNPKVLKLIKGL 585
Query: 362 TKSRPTSKSSLCKLTS-KYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCR 420
+KS+ +SK+S CKL S ++ +LP GYR V ++W DLDKC+VCHMDEEY+NNLFLQCDKCR
Sbjct: 586 SKSKISSKNSFCKLGSQRHNNLPLGYRQVHINWFDLDKCNVCHMDEEYENNLFLQCDKCR 645
Query: 421 MMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAI 480
MMVHARCYGELEPVNGVLWLCNLCR GAP PP LCP++GGAMKPTTDGRWAHLACA+
Sbjct: 646 MMVHARCYGELEPVNGVLWLCNLCRSGAPPPPCC--LCPLIGGAMKPTTDGRWAHLACAM 703
Query: 481 WIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARA 540
WIPETCL DVKRMEPIDGL+R+SKDRWKLLCSICGVSYGACIQCSN++CRVAYHPLCARA
Sbjct: 704 WIPETCLADVKRMEPIDGLSRISKDRWKLLCSICGVSYGACIQCSNSSCRVAYHPLCARA 763
Query: 541 AGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDY 600
AGLCVELE+EDRL LLS+D DDEDQCIRLLSFCKKH+QP N+ D+R+V+V C DY
Sbjct: 764 AGLCVELENEDRLYLLSVD-DDEDQCIRLLSFCKKHRQPSNEHSVADDRIVRVAGLCSDY 822
Query: 601 IPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLP 660
PP NPSGCARSEPY+YFGRRGRKEPEALAAASLKRLFVENQPY+VGGYCQ+G + P
Sbjct: 823 EPPPNPSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYIVGGYCQHGALNDPEP 882
Query: 661 SIRVIGSKFSFSLHR-------DAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
S R + SKF S R + N LS+++KYK+M+ETFRKRLAFGKS IHGFGIFAK
Sbjct: 883 SGRGVCSKFFCSQQRLRTSLIDTSNNILSISEKYKYMRETFRKRLAFGKSRIHGFGIFAK 942
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
HPH+ GDMVIEYTGELVRP IADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL
Sbjct: 943 HPHKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 1002
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRG 833
INHSC PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE+L+CYCGFP+CRG
Sbjct: 1003 INHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDERLSCYCGFPKCRG 1062
Query: 834 VVNDTEAEEQVAKLYAPRSELIDWRGD 860
+VNDTEAEE+ A LYAPR ELIDWRG+
Sbjct: 1063 IVNDTEAEERAATLYAPRRELIDWRGE 1089
>gi|356568903|ref|XP_003552647.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Glycine max]
Length = 1088
Score = 1240 bits (3209), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 628/868 (72%), Positives = 710/868 (81%), Gaps = 19/868 (2%)
Query: 5 YSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSI---NNVDN 61
Y G+V YD E+ HHVKY DGDEE LIL+NE I+F++S++EM LKL+F+ NNV +
Sbjct: 228 YIGYVRSYDKETKIHHVKYDDGDEESLILANENIRFHVSRDEMKHLKLNFAKVRDNNVSD 287
Query: 62 DGYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGR 121
YD +EM+ LAASLDDCQ+ EPGDIIWAKLTGHAMWPA+V+DESL + KGL GGR
Sbjct: 288 --YDVEEMLALAASLDDCQDFEPGDIIWAKLTGHAMWPAVVLDESLASNCKGLKMFLGGR 345
Query: 122 SIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRM 181
S+PVQFFGTHDFAR+ ++QV SFL GLL+ H KCKK F + LEEAK YLSEQKLP M
Sbjct: 346 SVPVQFFGTHDFARVRLQQVKSFLSGLLTDLHSKCKKHSFIEGLEEAKRYLSEQKLPSEM 405
Query: 182 LQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
L+L+ ADD N +D G SG++C D L +I PY GDLQILSLGKI
Sbjct: 406 LELRKRCTADDCNNISGEDGGCTDSGDDCLNDNGTLTALETIETFPYEVGDLQILSLGKI 465
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
V+DS F+D RFIWPEGYTAVRKFTS+ DP+V YKMEVLRD ESK+RPLFRVT++ GE
Sbjct: 466 VEDSA-FRDGRFIWPEGYTAVRKFTSVTDPKVSAPYKMEVLRDPESKVRPLFRVTVEGGE 524
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGL 361
QF G TPS CW+++ KI++ + + S+ A G EK ESGSDMFGFSNP V+KLI GL
Sbjct: 525 QFNGYTPSACWNEVYEKIKKMEKDASEGTVA-GGEEKSYESGSDMFGFSNPIVLKLIRGL 583
Query: 362 TKSRPTSKSSLCKLTSKY--RDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKC 419
+KS+ +SK+S CKL S+ +LP GYR V ++W DLDKC+VCHMDEEY+NNLFLQCDKC
Sbjct: 584 SKSKISSKNSTCKLGSRRYNNNLPLGYRQVHINWFDLDKCNVCHMDEEYENNLFLQCDKC 643
Query: 420 RMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACA 479
RMMVHARCYGELEPVNGVLWLCNLCR GAP PP LCP++GGAMKPTTDGRWAHLACA
Sbjct: 644 RMMVHARCYGELEPVNGVLWLCNLCRSGAPPPPCC--LCPLIGGAMKPTTDGRWAHLACA 701
Query: 480 IWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCAR 539
+WIPETCL DVKRMEPIDGL+R+SKDRWKLLCSICGVSYGACIQCSN +CRVAYHPLCAR
Sbjct: 702 MWIPETCLADVKRMEPIDGLSRISKDRWKLLCSICGVSYGACIQCSNNSCRVAYHPLCAR 761
Query: 540 AAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCD 599
AAGLCVELE+EDRL LLS+D DDEDQCIRLLSFCKKH+QP N+ DER+V+V C D
Sbjct: 762 AAGLCVELENEDRLYLLSVD-DDEDQCIRLLSFCKKHRQPSNEPSVADERMVRVAGLCSD 820
Query: 600 YIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
Y PP N SGCARSEPY+YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ+G N
Sbjct: 821 YEPPPNLSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQHGSLNNLE 880
Query: 660 PSIRVIGSKFSFSLHR------DAPN-FLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
PS R + SKF S R D N LS+++KY +M+ETFRKRLAFGKS IHGFGIFA
Sbjct: 881 PSGRGVCSKFFCSQQRLRTSLVDTSNSILSISEKYTYMRETFRKRLAFGKSRIHGFGIFA 940
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
KH ++ GDMVIEYTGELVRP IADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH
Sbjct: 941 KHAYKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 1000
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCR 832
LINHSC PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE+LACYCGFP+CR
Sbjct: 1001 LINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDERLACYCGFPKCR 1060
Query: 833 GVVNDTEAEEQVAKLYAPRSELIDWRGD 860
G+VNDTEAEE+ A LYAPRSELIDWRG+
Sbjct: 1061 GIVNDTEAEERAATLYAPRSELIDWRGE 1088
>gi|356526623|ref|XP_003531916.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Glycine max]
Length = 1088
Score = 1236 bits (3198), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 621/868 (71%), Positives = 713/868 (82%), Gaps = 18/868 (2%)
Query: 5 YSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSI---NNVDN 61
Y G+V YD E+ HHVKY DGDEE+LI++NE I+F++S++E+ LKL+F+ NNV +
Sbjct: 227 YIGYVRSYDKETKIHHVKYDDGDEENLIIANENIRFHVSRDELKHLKLNFAKVRDNNVSD 286
Query: 62 DGYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGR 121
Y+ +EM+ LAASLDDCQ+ EPGDIIWAKLTGHAMWPA+V+DESL + KGL GG
Sbjct: 287 --YNVEEMLALAASLDDCQDFEPGDIIWAKLTGHAMWPAVVLDESLASNCKGLKMFLGGS 344
Query: 122 SIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRM 181
S+PVQFFGTHDFAR+ ++QV SFL GLL+ H KCKK F + LEEAK YLSEQKLP M
Sbjct: 345 SVPVQFFGTHDFARVRLQQVKSFLSGLLTDLHSKCKKHSFIEGLEEAKRYLSEQKLPLEM 404
Query: 182 LQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
L+LQ AD N +D G SG++C + L ++ PY GDLQILSLGKI
Sbjct: 405 LELQKRCTADGCNNVSGEDGGCSDSGDDCLNGKGTLMALETVENFPYEVGDLQILSLGKI 464
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
VKDS F+D RFIWPEGYTAVRKFTS+ DP+V YKMEVLRD ESK+RPLFRVT++ GE
Sbjct: 465 VKDSA-FRDGRFIWPEGYTAVRKFTSVTDPKVSAPYKMEVLRDPESKLRPLFRVTVEGGE 523
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGL 361
QF G TPS W+++ KI++ + + S+ AEG EK ESGSDMFGFSNP+V+KLI GL
Sbjct: 524 QFNGYTPSASWNEVYEKIKKMEKDASEGTVAEGGEEKGYESGSDMFGFSNPKVLKLIQGL 583
Query: 362 TKSRPTSKSSLCKLTSKY--RDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKC 419
+KS+ +SK+S+CKL S+ +LP GYR V ++W DLDKC+VCHMDEEY+NNLFLQCDKC
Sbjct: 584 SKSKISSKNSICKLGSRRCNNNLPLGYRQVHINWFDLDKCNVCHMDEEYENNLFLQCDKC 643
Query: 420 RMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACA 479
RMMVHARCYGELEPVNGVLWLCNLCR GAP PP LCP++GGAMKPTTDGRWAHLACA
Sbjct: 644 RMMVHARCYGELEPVNGVLWLCNLCRSGAPPPPCC--LCPLIGGAMKPTTDGRWAHLACA 701
Query: 480 IWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCAR 539
+WIPETCL DVKRMEPIDG++R+SKDRW+LLCSICGVSYGACIQCSN +CRVAYHPLCAR
Sbjct: 702 MWIPETCLADVKRMEPIDGMSRISKDRWRLLCSICGVSYGACIQCSNNSCRVAYHPLCAR 761
Query: 540 AAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCD 599
AAGLCVELE+EDRL LLS+D DDEDQCIRLLSFCKKH+QP N+ D+R+V+V+ C D
Sbjct: 762 AAGLCVELENEDRLYLLSVD-DDEDQCIRLLSFCKKHRQPSNEHSVADDRIVRVSGLCSD 820
Query: 600 YIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
Y PP NPSGCARSEPY+YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ+G N
Sbjct: 821 YEPPPNPSGCARSEPYDYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQHGSLNNLE 880
Query: 660 PSIRVIGSKFSFSLHR------DAPN-FLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
PS R + SKF S R D N LS+++KY +M+ETFRKRLAFGKS IHGFGIFA
Sbjct: 881 PSGRGVCSKFFCSQQRLRTSRIDTSNSILSISEKYTYMRETFRKRLAFGKSRIHGFGIFA 940
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
KHP++ GDMVIEYTGELVRP IADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH
Sbjct: 941 KHPYKGGDMVIEYTGELVRPPIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 1000
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCR 832
LINHSC PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE+L CYCGFP+CR
Sbjct: 1001 LINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDERLPCYCGFPKCR 1060
Query: 833 GVVNDTEAEEQVAKLYAPRSELIDWRGD 860
G+VNDTEAEE+ A LYAPRSELIDWRG+
Sbjct: 1061 GIVNDTEAEERAATLYAPRSELIDWRGE 1088
>gi|240254018|ref|NP_172074.6| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
gi|240254020|ref|NP_001077464.4| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
gi|257096236|sp|P0CB22.1|ATX2_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX2; AltName:
Full=Protein SET DOMAIN GROUP 30; AltName:
Full=Trithorax-homolog protein 2; Short=TRX-homolog
protein 2
gi|332189779|gb|AEE27900.1| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
gi|332189780|gb|AEE27901.1| histone-lysine N-methyltransferase ATX2 [Arabidopsis thaliana]
Length = 1083
Score = 1214 bits (3140), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 597/856 (69%), Positives = 683/856 (79%), Gaps = 12/856 (1%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY G +VGY+ E+ H VKY DGD E+L L E+IKF IS+++M+ L + F N+V DG
Sbjct: 235 WYPGSIVGYNVETKHHIVKYGDGDGEELALRREKIKFLISRDDMELLNMKFGTNDVVVDG 294
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLN-KISGGRS 122
DYDE+V+LAAS ++CQ+ EP DIIWAKLTGHAMWPAI+VDES+I KGLN KISGGRS
Sbjct: 295 QDYDELVILAASFEECQDFEPRDIIWAKLTGHAMWPAIIVDESVIVKRKGLNNKISGGRS 354
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFARI VKQ +SFLKGLLS LKCK+PRF +++EEAK+YL E KLP RM
Sbjct: 355 VLVQFFGTHDFARIQVKQAVSFLKGLLSRSPLKCKQPRFEEAMEEAKMYLKEYKLPGRMD 414
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QLQ D E S +E S SG++ KD + +G + GDLQI++LG+IV
Sbjct: 415 QLQKVADTDCSERINSGEEDSSNSGDDYTKDGEVWLRPTELGDCLHRIGDLQIINLGRIV 474
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
DSE+F+D + WPEGYTA RKF SL DP YKMEVLRD ESK RP+FRVT ++GEQ
Sbjct: 475 TDSEFFKDSKHTWPEGYTATRKFISLKDPNASAMYKMEVLRDAESKTRPVFRVTTNSGEQ 534
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
F G TPS CW+KI +I++ Q SD+ G E + ESG+DMFGFSNPEV KLI GL
Sbjct: 535 FKGDTPSACWNKIYNRIKKIQI-ASDNPDVLG--EGLHESGTDMFGFSNPEVDKLIQGLL 591
Query: 363 KSRPTSKSSLCKLTS-KYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
+SRP SK S K +S KY+D P GYRPVRV+WKDLDKC+VCHMDEEY+NNLFLQCDKCRM
Sbjct: 592 QSRPPSKVSQRKYSSGKYQDHPTGYRPVRVEWKDLDKCNVCHMDEEYENNLFLQCDKCRM 651
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVH RCYG+LEP NG+LWLCNLCRP A + PP CCLCPVVGGAMKPTTDGRWAHLACAIW
Sbjct: 652 MVHTRCYGQLEPHNGILWLCNLCRPVALDIPPRCCLCPVVGGAMKPTTDGRWAHLACAIW 711
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL DVK+MEPIDG+ +VSKDRWKLLCSICGVSYGACIQCSN TCRVAYHPLCARAA
Sbjct: 712 IPETCLLDVKKMEPIDGVKKVSKDRWKLLCSICGVSYGACIQCSNNTCRVAYHPLCARAA 771
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYI 601
GLCVEL DEDRL LLS+D+D+ DQCIRLLSFCK+H+Q N L E +++ +Y+
Sbjct: 772 GLCVELADEDRLFLLSMDDDEADQCIRLLSFCKRHRQTSNYHLET-EYMIKPAHNIAEYL 830
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
PP NPSGCAR+EPYNY GRRGRKEPEALA AS KRLFVENQPY+VGGY ++ S
Sbjct: 831 PPPNPSGCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIVGGYSRHEFSTYE--- 887
Query: 662 IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDM 721
R+ GSK S N LSMA+KY MKET+RKRLAFGKSGIHGFGIFAK PHRAGDM
Sbjct: 888 -RIYGSKMSQIT--TPSNILSMAEKYTFMKETYRKRLAFGKSGIHGFGIFAKLPHRAGDM 944
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEYTGELVRP IAD+REH IYNS+VGAGTYMFRID+ERVIDATR GSIAHLINHSCEPN
Sbjct: 945 VIEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATRTGSIAHLINHSCEPN 1004
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAE 841
CYSRVISVNGDEHIIIFAKRD+ +WEELTYDYRFFSIDE+LACYCGFPRCRGVVNDTEAE
Sbjct: 1005 CYSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYCGFPRCRGVVNDTEAE 1064
Query: 842 EQVAKLYAPRSELIDW 857
E+ A ++A R EL +W
Sbjct: 1065 ERQANIHASRCELKEW 1080
>gi|30685011|ref|NP_850170.1| histone-lysine N-methyltransferase ATX1 [Arabidopsis thaliana]
gi|95147534|sp|Q9C5X4.2|ATX1_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX1; AltName:
Full=Protein SET DOMAIN GROUP 27; AltName:
Full=Trithorax-homolog protein 1; Short=TRX-homolog
protein 1
gi|330253475|gb|AEC08569.1| histone-lysine N-methyltransferase ATX1 [Arabidopsis thaliana]
Length = 1062
Score = 1189 bits (3077), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 575/867 (66%), Positives = 691/867 (79%), Gaps = 35/867 (4%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY G +VGY +E R+ VKY DG +ED++ E IKF +S+EEM+ L L F +NV DG
Sbjct: 221 WYEGSIVGYSAERKRYTVKYRDGCDEDIVFDREMIKFLVSREEMELLHLKFCTSNVTVDG 280
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLN-KISGGRS 122
DYDEMVVLAA+LD+CQ+ EPGDI+WAKL GHAMWPA++VDES+IG+ KGLN K+SGG S
Sbjct: 281 RDYDEMVVLAATLDECQDFEPGDIVWAKLAGHAMWPAVIVDESIIGERKGLNNKVSGGGS 340
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFARI VKQ ISF+KGLLS HLKCK+PRF + ++EAK+YL +LP RM
Sbjct: 341 LLVQFFGTHDFARIKVKQAISFIKGLLSPSHLKCKQPRFEEGMQEAKMYLKAHRLPERMS 400
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISP-------YVFGDLQI 235
QLQ + D + + S +EG+ SG + D G + + P ++ GDL I
Sbjct: 401 QLQKGADSVDSDMANSTEEGN--SGGDLLND-------GEVWLRPTEHVDFRHIIGDLLI 451
Query: 236 LSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRV 295
++LGK+V DS++F+D+ IWPEGYTA+RKFTSL D YKMEVLRD E+K PLF V
Sbjct: 452 INLGKVVTDSQFFKDENHIWPEGYTAMRKFTSLTDHSASALYKMEVLRDAETKTHPLFIV 511
Query: 296 TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVM 355
T D+GEQF G TPS CW+KI +I++ QN+ S + E+++ SG+DMFG SNPEV+
Sbjct: 512 TADSGEQFKGPTPSACWNKIYNRIKKVQNSDSPNI----LGEELNGSGTDMFGLSNPEVI 567
Query: 356 KLILGLTKSRPTSKSSLCKLT-SKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFL 414
KL+ L+KSRP+S S+CK + ++++ P GYRPVRVDWKDLDKC+VCHMDEEY+NNLFL
Sbjct: 568 KLVQDLSKSRPSSHVSMCKNSLGRHQNQPTGYRPVRVDWKDLDKCNVCHMDEEYENNLFL 627
Query: 415 QCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWA 474
QCDKCRMMVHA+CYGELEP +G LWLCNLCRPGAP+ PP CCLCPVVGGAMKPTTDGRWA
Sbjct: 628 QCDKCRMMVHAKCYGELEPCDGALWLCNLCRPGAPDMPPRCCLCPVVGGAMKPTTDGRWA 687
Query: 475 HLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYH 534
HLACAIWIPETCL+DVK+MEPIDG+N+VSKDRWKL+C+ICGVSYGACIQCSN +CRVAYH
Sbjct: 688 HLACAIWIPETCLSDVKKMEPIDGVNKVSKDRWKLMCTICGVSYGACIQCSNNSCRVAYH 747
Query: 535 PLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVT 594
PLCARAAGLCVELE++ +S++ ++ DQCIR+LSFCK+H+Q L ++R+ T
Sbjct: 748 PLCARAAGLCVELEND-----MSVEGEEADQCIRMLSFCKRHRQTSTACLGSEDRIKSAT 802
Query: 595 RRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGL 654
+ +Y+PP NPSGCAR+EPYN FGRRGRKEPEALAAAS KRLFVENQPY++GGY + L
Sbjct: 803 HKTSEYLPPPNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGGYSR--L 860
Query: 655 SGNTLPSIRVIGSKFSFSLHRDAP-NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
+T SI GSK S + P N LSMA+KY++M+ET+RKRLAFGKSGIHGFGIFAK
Sbjct: 861 EFSTYKSIH--GSKVS---QMNTPSNILSMAEKYRYMRETYRKRLAFGKSGIHGFGIFAK 915
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
PHRAGDM+IEYTGELVRPSIAD+RE IYNS+VGAGTYMFRIDDERVIDATR GSIAHL
Sbjct: 916 LPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRTGSIAHL 975
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRG 833
INHSC PNCYSRVI+VNGDEHIIIFAKR I +WEELTYDYRFFSI E+L+C CGFP CRG
Sbjct: 976 INHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGCRG 1035
Query: 834 VVNDTEAEEQVAKLYAPRSELIDWRGD 860
VVNDTEAEEQ AK+ PR +LIDW +
Sbjct: 1036 VVNDTEAEEQHAKICVPRCDLIDWTAE 1062
>gi|12659210|gb|AAK01237.1|AF329273_1 trithorax-like protein 1 [Arabidopsis thaliana]
Length = 1062
Score = 1189 bits (3075), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 574/867 (66%), Positives = 691/867 (79%), Gaps = 35/867 (4%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY G +VGY +E R+ VKY DG +ED++ E IKF +S+EEM+ L L F +NV DG
Sbjct: 221 WYEGSIVGYSAERKRYTVKYRDGCDEDIVFDREMIKFLVSREEMELLHLKFCTSNVTVDG 280
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLN-KISGGRS 122
DYDEMVVLAA+LD+CQ+ EPGDI+WAKL GH+MWPA++VDES+IG+ KGLN K+SGG S
Sbjct: 281 RDYDEMVVLAATLDECQDFEPGDIVWAKLAGHSMWPAVIVDESIIGERKGLNNKVSGGGS 340
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFARI VKQ ISF+KGLLS HLKCK+PRF + ++EAK+YL +LP RM
Sbjct: 341 LLVQFFGTHDFARIKVKQAISFIKGLLSPSHLKCKQPRFEEGMQEAKMYLKAHRLPERMS 400
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISP-------YVFGDLQI 235
QLQ + D + + S +EG+ SG + D G + + P ++ GDL I
Sbjct: 401 QLQKGADSVDSDMANSTEEGN--SGGDLLND-------GEVWLRPTEHVDFRHIIGDLLI 451
Query: 236 LSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRV 295
++LGK+V DS++F+D+ IWPEGYTA+RKFTSL D YKMEVLRD E+K PLF V
Sbjct: 452 INLGKVVTDSQFFKDENHIWPEGYTAMRKFTSLTDHSASALYKMEVLRDAETKTHPLFIV 511
Query: 296 TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVM 355
T D+GEQF G TPS CW+KI +I++ QN+ S + E+++ SG+DMFG SNPEV+
Sbjct: 512 TADSGEQFKGPTPSACWNKIYNRIKKVQNSDSPNI----LGEELNGSGTDMFGLSNPEVI 567
Query: 356 KLILGLTKSRPTSKSSLCKLT-SKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFL 414
KL+ L+KSRP+S S+CK + ++++ P GYRPVRVDWKDLDKC+VCHMDEEY+NNLFL
Sbjct: 568 KLVQDLSKSRPSSHVSMCKNSLGRHQNQPTGYRPVRVDWKDLDKCNVCHMDEEYENNLFL 627
Query: 415 QCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWA 474
QCDKCRMMVHA+CYGELEP +G LWLCNLCRPGAP+ PP CCLCPVVGGAMKPTTDGRWA
Sbjct: 628 QCDKCRMMVHAKCYGELEPCDGALWLCNLCRPGAPDMPPQCCLCPVVGGAMKPTTDGRWA 687
Query: 475 HLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYH 534
HLACAIWIPETCL+DVK+MEPIDG+N+VSKDRWKL+C+ICGVSYGACIQCSN +CRVAYH
Sbjct: 688 HLACAIWIPETCLSDVKKMEPIDGVNKVSKDRWKLMCTICGVSYGACIQCSNNSCRVAYH 747
Query: 535 PLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVT 594
PLCARAAGLCVELE++ +S++ ++ DQCIR+LSFCK+H+Q L ++R+ T
Sbjct: 748 PLCARAAGLCVELEND-----MSVEGEEADQCIRMLSFCKRHRQTSTACLGSEDRIKSAT 802
Query: 595 RRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGL 654
+ +Y+PP NPSGCAR+EPYN FGRRGRKEPEALAAAS KRLFVENQPY++GGY + L
Sbjct: 803 HKTSEYLPPPNPSGCARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGGYSR--L 860
Query: 655 SGNTLPSIRVIGSKFSFSLHRDAP-NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
+T SI GSK S + P N LSMA+KY++M+ET+RKRLAFGKSGIHGFGIFAK
Sbjct: 861 EFSTYKSIH--GSKVS---QMNTPSNILSMAEKYRYMRETYRKRLAFGKSGIHGFGIFAK 915
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
PHRAGDM+IEYTGELVRPSIAD+RE IYNS+VGAGTYMFRIDDERVIDATR GSIAHL
Sbjct: 916 LPHRAGDMMIEYTGELVRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRTGSIAHL 975
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRG 833
INHSC PNCYSRVI+VNGDEHIIIFAKR I +WEELTYDYRFFSI E+L+C CGFP CRG
Sbjct: 976 INHSCVPNCYSRVITVNGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGCRG 1035
Query: 834 VVNDTEAEEQVAKLYAPRSELIDWRGD 860
VVNDTEAEEQ AK+ PR +LIDW +
Sbjct: 1036 VVNDTEAEEQHAKICVPRCDLIDWTAE 1062
>gi|297826607|ref|XP_002881186.1| hypothetical protein ARALYDRAFT_902198 [Arabidopsis lyrata subsp.
lyrata]
gi|297327025|gb|EFH57445.1| hypothetical protein ARALYDRAFT_902198 [Arabidopsis lyrata subsp.
lyrata]
Length = 1066
Score = 1185 bits (3066), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 581/864 (67%), Positives = 690/864 (79%), Gaps = 30/864 (3%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY G +VGY +E RH KY DG +ED+IL E IKF +S+EEM+ L L ++V G
Sbjct: 221 WYEGSIVGYSAERKRHTAKYRDGYDEDIILDREMIKFLVSREEMELLHLKLCTSDVTVGG 280
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLN-KISGGRS 122
DYDEMVVLAA+L+ CQ+ E GDI+WAKL GHAMWPA++VDES+IG+ KGLN K+SGG S
Sbjct: 281 RDYDEMVVLAATLEGCQDFETGDIVWAKLAGHAMWPAVIVDESVIGERKGLNNKVSGGGS 340
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFARI KQ ISF+KGLLS HLKCK+PRF + ++EAK+YL E +LP RM
Sbjct: 341 LLVQFFGTHDFARIKEKQAISFIKGLLSPSHLKCKQPRFEEGMQEAKMYLKEHRLPERMS 400
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISP-------YVFGDLQI 235
QLQ D E + S +EG+LG + F D G + + P + GDLQI
Sbjct: 401 QLQKGADVD-SEIANSTEEGNLGG--DLFND-------GEVWLRPTEHVDFLHTIGDLQI 450
Query: 236 LSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRV 295
++LGK+V DS++F+D+ IWPEGYTA+RKFTSL D V YKMEVLRD ESK RPLFRV
Sbjct: 451 INLGKVVTDSQFFKDENHIWPEGYTAMRKFTSLKDHSVSALYKMEVLRDAESKTRPLFRV 510
Query: 296 TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVM 355
T D+GEQF G TPS CW+KI +IR+ QN+T DD + G E+++ SG+DMFG SNPEV+
Sbjct: 511 TADSGEQFKGFTPSACWNKIYNRIRKVQNST-DDPNVLG--EELNGSGTDMFGLSNPEVI 567
Query: 356 KLILGLTKSRPTSKSSLCKLT-SKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFL 414
KL+ L+KSRP+S S+CK + +++ GYRPVRVDWKDLDKC+VC+MDEEY+NNLFL
Sbjct: 568 KLVQDLSKSRPSSDVSMCKYSLGRHQSQATGYRPVRVDWKDLDKCNVCYMDEEYENNLFL 627
Query: 415 QCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWA 474
QCDKCRMMVHA+CYGELEP +G LWLCNLCRPGAP+ PP CCLCP+VGGAMK TTDGRWA
Sbjct: 628 QCDKCRMMVHAKCYGELEPCDGALWLCNLCRPGAPDIPPRCCLCPLVGGAMKQTTDGRWA 687
Query: 475 HLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYH 534
HLACAIWIPETCL+DVK+MEPIDG+N+VSKDRWKL+C+ICGVSYGACIQCSN +CRVAYH
Sbjct: 688 HLACAIWIPETCLSDVKKMEPIDGVNKVSKDRWKLMCTICGVSYGACIQCSNNSCRVAYH 747
Query: 535 PLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVT 594
PLCARAAGLCVELE+EDRL L S+++++ DQCIR+LSFCK+H+Q L + R+ VT
Sbjct: 748 PLCARAAGLCVELENEDRLFLQSVEDEEADQCIRMLSFCKRHRQTSTACLGSENRIKSVT 807
Query: 595 RRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGL 654
++ +Y+PP NPSG AR+EPYN FGRRGRKEPEALAAAS KRLFVENQPY++GGY +
Sbjct: 808 QKTSEYLPPPNPSGSARTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGGYSKIEF 867
Query: 655 SGNTLPSIRVIGSKFSFSLHRDAP-NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
S T I GSK S + P N LSMA+KY++MKET+RKRLAFGKSGIHGFGIFAK
Sbjct: 868 S--TYECIH--GSKVS---QMNTPSNILSMAEKYRYMKETYRKRLAFGKSGIHGFGIFAK 920
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
PH+AGDM+IEYTGELVRPSIAD+RE IYNS+VGAGTYMFRIDD+RVIDATRAGSIAHL
Sbjct: 921 LPHKAGDMMIEYTGELVRPSIADKRERLIYNSMVGAGTYMFRIDDKRVIDATRAGSIAHL 980
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRG 833
INHSC PNCYSRVI+VNG+EHIIIFAKRDI WEELTYDYRFFSI E+L+C CGF CRG
Sbjct: 981 INHSCVPNCYSRVITVNGEEHIIIFAKRDIPIWEELTYDYRFFSIGERLSCSCGFQGCRG 1040
Query: 834 VVNDTEAEEQVAKLYAPRSELIDW 857
VVNDTEAE Q +K+ PR ELIDW
Sbjct: 1041 VVNDTEAEAQQSKICVPRCELIDW 1064
>gi|297843378|ref|XP_002889570.1| hypothetical protein ARALYDRAFT_311665 [Arabidopsis lyrata subsp.
lyrata]
gi|297335412|gb|EFH65829.1| hypothetical protein ARALYDRAFT_311665 [Arabidopsis lyrata subsp.
lyrata]
Length = 1206
Score = 1132 bits (2929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 569/877 (64%), Positives = 654/877 (74%), Gaps = 69/877 (7%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY G +VGY+ E+ H VKY DGD E+L L E+IKF IS++EM+ L + F N+V G
Sbjct: 238 WYPGSIVGYNVETKHHIVKYGDGDGEELALRREKIKFLISRDEMELLNMKFGTNDVAVHG 297
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLN-KISGGRS 122
DYDE+V+LAAS +DC + EP DIIWAKLTGHAMWPAI+VDES+I KGLN K SGGRS
Sbjct: 298 QDYDELVILAASFEDCHDFEPRDIIWAKLTGHAMWPAIIVDESVIVKRKGLNNKASGGRS 357
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFAR YL E KLP RM
Sbjct: 358 VLVQFFGTHDFAR-----------------------------------YLKEYKLPGRMD 382
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QLQ + E + S +E S SG++ KD + IG + GDLQI++LG+IV
Sbjct: 383 QLQKVADTECSERTNSGEEDSSNSGDDYTKDGEVWLQPTGIGDCLHRIGDLQIMNLGRIV 442
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
DSE+F+D + WPEGYTA+RKF SL DP YKMEVLRD ESK RP+FRVT ++GEQ
Sbjct: 443 TDSEFFKDSKHTWPEGYTAIRKFISLKDPDASAMYKMEVLRDAESKNRPVFRVTTNSGEQ 502
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
F G TPS CW+KI +I++ Q+ + +G E + ESG+DMFGFSNPEV KLI GL
Sbjct: 503 FKGDTPSACWNKIYNRIKKIQSASD---GPDGLGEGLHESGTDMFGFSNPEVDKLIQGLL 559
Query: 363 KSRPTSKSSLCKLTS-KYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
+SRP SK S K +S KY+D P GYRPVRVDWKDLDKC+VCHMDEEY+NNLFLQCDKCRM
Sbjct: 560 QSRPPSKVSQRKYSSGKYQDHPTGYRPVRVDWKDLDKCNVCHMDEEYENNLFLQCDKCRM 619
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPV-----VGGAMKPTTDGRWAHL 476
MVH RCYG+LEP +G+LWLCNLCRPGA + PP CCLCPV GGAMKPTTDGRWAHL
Sbjct: 620 MVHTRCYGQLEPRDGILWLCNLCRPGALDIPPRCCLCPVPTKYLSGGAMKPTTDGRWAHL 679
Query: 477 ACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPL 536
ACAIWIPETCL DVK+MEPIDG+N+V+KDRWKLLCSICGVSYGACIQCSN++CRVAYHPL
Sbjct: 680 ACAIWIPETCLLDVKKMEPIDGVNKVNKDRWKLLCSICGVSYGACIQCSNSSCRVAYHPL 739
Query: 537 CARAAGLCVE-----------------LEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQP 579
CARAAGLCVE L DEDRL LLS+D+D+ DQCIRLLSFCK+H+Q
Sbjct: 740 CARAAGLCVEVLSYPTGEYKILRSFIYLADEDRLFLLSMDDDEADQCIRLLSFCKRHRQT 799
Query: 580 LNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFV 639
ND L E +++ +Y+PP NPSGCAR+EPYNY GRRGRKEPEALA S KRLFV
Sbjct: 800 SNDHLET-EYMIKPAHNIAEYLPPPNPSGCARTEPYNYLGRRGRKEPEALAGPSSKRLFV 858
Query: 640 ENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLA 699
ENQPY+VGGY ++ + R+ GSK S + N LSMA+KY MKET+RKRLA
Sbjct: 859 ENQPYIVGGYSRHEFATYE----RIYGSKMSQII--TPSNILSMAEKYTFMKETYRKRLA 912
Query: 700 FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDE 759
FGKSGIHGFGIFAK PHRAGDMVIEYTGELVRP IAD+REH IYNS+VGAGTYMFRID+E
Sbjct: 913 FGKSGIHGFGIFAKLPHRAGDMVIEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNE 972
Query: 760 RVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID 819
RVIDATR GSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRD+ +WEELTYDYRFFSID
Sbjct: 973 RVIDATRTGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSID 1032
Query: 820 EQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELID 856
E+LACYCGFPRCRGVVNDTEAEEQ A ++A RS D
Sbjct: 1033 ERLACYCGFPRCRGVVNDTEAEEQQANIHASRSSEYD 1069
>gi|6850313|gb|AAF29390.1|AC009999_10 Contains similarity to MLL protein from Fugu rubripes gb|AF036382,
and contains a PWWP PF|00855 and a SET PF|00856 domain
[Arabidopsis thaliana]
Length = 1193
Score = 1126 bits (2912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/866 (65%), Positives = 650/866 (75%), Gaps = 62/866 (7%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY G +VGY+ E+ H VKY DGD E+L L E+IKF IS+++M+ L + F N+V DG
Sbjct: 232 WYPGSIVGYNVETKHHIVKYGDGDGEELALRREKIKFLISRDDMELLNMKFGTNDVVVDG 291
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLN-KISGGRS 122
DYDE+V+LAAS ++CQ+ EP DIIWAKLTGHAMWPAI+VDES+I KGLN KISGGRS
Sbjct: 292 QDYDELVILAASFEECQDFEPRDIIWAKLTGHAMWPAIIVDESVIVKRKGLNNKISGGRS 351
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFAR YL E KLP RM
Sbjct: 352 VLVQFFGTHDFAR-----------------------------------YLKEYKLPGRMD 376
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QLQ D E S +E S SG++ KD + +G + GDLQI++LG+IV
Sbjct: 377 QLQKVADTDCSERINSGEEDSSNSGDDYTKDGEVWLRPTELGDCLHRIGDLQIINLGRIV 436
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
DSE+F+D + WPEGYTA RKF SL DP YKMEVLRD ESK RP+FRVT ++GEQ
Sbjct: 437 TDSEFFKDSKHTWPEGYTATRKFISLKDPNASAMYKMEVLRDAESKTRPVFRVTTNSGEQ 496
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
F G TPS CW+KI +I++ Q SD+ G E + ESG+DMFGFSNPEV KLI GL
Sbjct: 497 FKGDTPSACWNKIYNRIKKIQI-ASDNPDVLG--EGLHESGTDMFGFSNPEVDKLIQGLL 553
Query: 363 KSRPTSKSSLCKLTS-KYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
+SRP SK S K +S KY+D P GYRPVRV+WKDLDKC+VCHMDEEY+NNLFLQCDKCRM
Sbjct: 554 QSRPPSKVSQRKYSSGKYQDHPTGYRPVRVEWKDLDKCNVCHMDEEYENNLFLQCDKCRM 613
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVV-----GGAMKPTTDGRWAHL 476
MVH RCYG+LEP NG+LWLCNLCRP A + PP CCLCPV GGAMKPTTDGRWAHL
Sbjct: 614 MVHTRCYGQLEPHNGILWLCNLCRPVALDIPPRCCLCPVPTKYLPGGAMKPTTDGRWAHL 673
Query: 477 ACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPL 536
ACAIWIPETCL DVK+MEPIDG+ +VSKDRWKLLCSICGVSYGACIQCSN TCRVAYHPL
Sbjct: 674 ACAIWIPETCLLDVKKMEPIDGVKKVSKDRWKLLCSICGVSYGACIQCSNNTCRVAYHPL 733
Query: 537 CARAAGLCVE----------LEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV 586
CARAAGLCVE L DEDRL LLS+D+D+ DQCIRLLSFCK+H+Q N L
Sbjct: 734 CARAAGLCVEVLSYPTGDHKLADEDRLFLLSMDDDEADQCIRLLSFCKRHRQTSNYHLET 793
Query: 587 DERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLV 646
E +++ +Y+PP NPSGCAR+EPYNY GRRGRKEPEALA AS KRLFVENQPY+V
Sbjct: 794 -EYMIKPAHNIAEYLPPPNPSGCARTEPYNYLGRRGRKEPEALAGASSKRLFVENQPYIV 852
Query: 647 GGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIH 706
GGY ++ S R+ GSK S N LSMA+KY MKET+RKRLAFGKSGIH
Sbjct: 853 GGYSRHEFSTYE----RIYGSKMSQIT--TPSNILSMAEKYTFMKETYRKRLAFGKSGIH 906
Query: 707 GFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATR 766
GFGIFAK PHRAGDMVIEYTGELVRP IAD+REH IYNS+VGAGTYMFRID+ERVIDATR
Sbjct: 907 GFGIFAKLPHRAGDMVIEYTGELVRPPIADKREHLIYNSMVGAGTYMFRIDNERVIDATR 966
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYC 826
GSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRD+ +WEELTYDYRFFSIDE+LACYC
Sbjct: 967 TGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDVAKWEELTYDYRFFSIDERLACYC 1026
Query: 827 GFPRCRGVVNDTEAEEQVAKLYAPRS 852
GFPRCRGVVNDTEAEE+ A ++A RS
Sbjct: 1027 GFPRCRGVVNDTEAEERQANIHASRS 1052
>gi|357502739|ref|XP_003621658.1| Histone-lysine N-methyltransferase ATX2, partial [Medicago
truncatula]
gi|355496673|gb|AES77876.1| Histone-lysine N-methyltransferase ATX2, partial [Medicago
truncatula]
Length = 791
Score = 1088 bits (2814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 555/773 (71%), Positives = 622/773 (80%), Gaps = 17/773 (2%)
Query: 95 HAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHL 154
+AMWPA+V+DESL KGL GGRS+PVQFFGTHDFAR+ V+QV SFL GLL+ H
Sbjct: 29 YAMWPAVVLDESLASSCKGLKTFIGGRSVPVQFFGTHDFARVRVQQVKSFLSGLLTDLHS 88
Query: 155 KCKKPRFTQSLEEAKVYLSEQKLPRRMLQLQNAIRADDGENSWSQDEGSLGSGENCFKDE 214
KCKKP F + LEEAK YLS QKLP M++L+ D N +D G SGE+ D+
Sbjct: 89 KCKKPSFIEGLEEAKRYLSAQKLPLGMIELRKRY---DCNNVSGEDGGCSDSGEDYVSDK 145
Query: 215 RLQGTLGSIGISPYVFGDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVC 274
L I PY GDLQIL+LGKIV DS F+D R IWPEGYTAVRKFTS+ DP V
Sbjct: 146 GAWAALQKIDTFPYEVGDLQILNLGKIVGDSTAFRDGRSIWPEGYTAVRKFTSVTDPEVS 205
Query: 275 NSYKMEVLRDTESKIRPLFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEG 334
YKMEVLRD ES+ RPLFRVT+D GEQF G PSTCW+++ +I++ + S+ +
Sbjct: 206 VPYKMEVLRDPESRFRPLFRVTVDGGEQFNGYKPSTCWNQVYERIKKLKKAVSEGSVVDS 265
Query: 335 AAEKISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWK 394
E ESGSDMFGFSNPEV KLI GL+KS+ + K S CK S+ LP GYR V ++W
Sbjct: 266 VVESGYESGSDMFGFSNPEVAKLIKGLSKSKVSLKKSSCKSGSR---LPVGYRQVHINWF 322
Query: 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPP 454
DLDKC+VCHMDEEY+NNLFLQCDKCRMMVHARCYGE EPVNGVLWLCNLCR GAP PP
Sbjct: 323 DLDKCNVCHMDEEYENNLFLQCDKCRMMVHARCYGEHEPVNGVLWLCNLCRSGAPPPPCC 382
Query: 455 CCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSIC 514
LCP++GGAMKPTTDGRWAHLACA+WIPETCL DVKRMEPIDGL R+SKDRWKLLCSIC
Sbjct: 383 --LCPLIGGAMKPTTDGRWAHLACAMWIPETCLADVKRMEPIDGLRRISKDRWKLLCSIC 440
Query: 515 GVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCK 574
GVSYGACIQCSN +CRVAYHPLCARAAGLCVELE+EDRL LLS+D DDEDQCIRLLSFCK
Sbjct: 441 GVSYGACIQCSNNSCRVAYHPLCARAAGLCVELENEDRLYLLSVD-DDEDQCIRLLSFCK 499
Query: 575 KHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASL 634
KH+QP +D DER VQV +C DY PP NPSGCARSEPY+YFGRRGRKEPEALAA+SL
Sbjct: 500 KHRQPSHDHSVADER-VQVIGQCSDYEPPPNPSGCARSEPYDYFGRRGRKEPEALAASSL 558
Query: 635 KRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHR-------DAPNFLSMADKY 687
KRLFVENQPYLVGGYCQ+GL ++ PS R + SKF S R A + L++A+KY
Sbjct: 559 KRLFVENQPYLVGGYCQHGLLNDSEPSGRGVCSKFFCSEQRLRTSMVDAADSILTVAEKY 618
Query: 688 KHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLV 747
K+M ETFRK+LAFGKS IHGFGIFAKHP++ GDMVIEYTGELVRP IADRRE FIYNSLV
Sbjct: 619 KYMSETFRKQLAFGKSRIHGFGIFAKHPYKGGDMVIEYTGELVRPPIADRRERFIYNSLV 678
Query: 748 GAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWE 807
GAGTYMFRIDDERVIDATRAGSIAHLINHSC PNCYSRVISVNGDEHIIIFAKRDIKQWE
Sbjct: 679 GAGTYMFRIDDERVIDATRAGSIAHLINHSCAPNCYSRVISVNGDEHIIIFAKRDIKQWE 738
Query: 808 ELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELIDWRGD 860
ELTYDYRFFSIDE+L+CYCGFP+CRGVVNDTEAEE+ YAPRSEL+DW+G+
Sbjct: 739 ELTYDYRFFSIDERLSCYCGFPKCRGVVNDTEAEERAGTRYAPRSELVDWKGE 791
>gi|115478080|ref|NP_001062635.1| Os09g0134500 [Oryza sativa Japonica Group]
gi|47848462|dbj|BAD22318.1| trithorax-like [Oryza sativa Japonica Group]
gi|113630868|dbj|BAF24549.1| Os09g0134500 [Oryza sativa Japonica Group]
gi|215704333|dbj|BAG93767.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 1022
Score = 1056 bits (2730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/858 (61%), Positives = 640/858 (74%), Gaps = 25/858 (2%)
Query: 2 QDWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDN 61
+DWY G + GY+ + +H VKY DG+ EDL L++ERIKF IS EEM C L F I+N++
Sbjct: 184 EDWYKGSITGYNEATKKHSVKYDDGESEDLNLADERIKFSISSEEMKCRNLKFGISNLNK 243
Query: 62 DGYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGR 121
GYD E++ LA SL D Q L+PGD++WAKLTGHAMWPA+VVDES + + L +
Sbjct: 244 RGYD--ELLALAVSLHDYQGLDPGDLVWAKLTGHAMWPAVVVDESNVPANRALKPGRLDQ 301
Query: 122 SIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRM 181
SI VQFFGTHDFARI +KQ + FL GLLSS HLKCK+ RF +SLEEAK +L Q LP M
Sbjct: 302 SILVQFFGTHDFARIKLKQAVPFLNGLLSSLHLKCKQARFYRSLEEAKEFLCTQLLPENM 361
Query: 182 LQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
LQLQ ++ + + ++D + S +N +D+ + ++P G+L++ LG+I
Sbjct: 362 LQLQKSMEKGSSDANSNKD---VHSCDNLSEDKTAESGGDYDEMTPIELGNLRVSKLGRI 418
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
V DS+YF + + IWPEGYTA RKF S+ DP V YKMEVLR+++ K RPLFRVT ++G
Sbjct: 419 VTDSDYFHNKKHIWPEGYTAFRKFRSVKDPHVVILYKMEVLRNSDIKARPLFRVTSEDGT 478
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE-SGSDMFGFSNPEVMKLILG 360
Q GSTP+TCW +I +++E Q N A G + + SGS MFGFSNP++ +LI
Sbjct: 479 QIDGSTPNTCWKEIYCRLKEKQRNV-----ASGLDRDVCQGSGSYMFGFSNPQIRQLIQE 533
Query: 361 LTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCR 420
L +R S K D GYR V V+WKDLD CSVC MDEEY++NLFLQCDKCR
Sbjct: 534 LPNAR-----SCLKYFENAGDTFRGYRAVHVNWKDLDYCSVCDMDEEYEDNLFLQCDKCR 588
Query: 421 MMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAI 480
MMVHARCYGELEP+NGVLWLCNLCRP AP P CCLCPV GGAMKPTTDGRWAHLACAI
Sbjct: 589 MMVHARCYGELEPLNGVLWLCNLCRPEAPRVSPRCCLCPVTGGAMKPTTDGRWAHLACAI 648
Query: 481 WIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARA 540
WIPETCL DVKRMEPIDGL+R++KDRWKLLCSICGV+YGACIQCS+ TCRVAYHPLCARA
Sbjct: 649 WIPETCLKDVKRMEPIDGLSRINKDRWKLLCSICGVAYGACIQCSHPTCRVAYHPLCARA 708
Query: 541 AGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQ-VTRRCCD 599
A LCVELED+D+++L+ LDE DED CIRLLS+CKKH+QP +R +++ L + D
Sbjct: 709 ADLCVELEDDDKIHLMLLDE-DEDPCIRLLSYCKKHRQPSTERPSLESNLAKPAVVVQTD 767
Query: 600 YIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
+P PSGCAR+EPYN GRRG+K+P+ +A AS+KRL+VEN PY+V G+CQN + + +
Sbjct: 768 AVP---PSGCARTEPYNIHGRRGQKQPQVMATASVKRLYVENMPYIVSGFCQNRVGHDAI 824
Query: 660 PS-IRVIGSKFSFSLHRDA-PNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHR 717
I+ +G F H++A N SM +KYK MK TFR+RLAFGKS IHGFG+FAK H+
Sbjct: 825 SEPIQSVG--FLDVAHQEAVGNVSSMIEKYKSMKATFRRRLAFGKSRIHGFGVFAKVSHK 882
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
AGDM+IEY GELVRP I+D RE IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS
Sbjct: 883 AGDMMIEYIGELVRPPISDIRERRIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 942
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVND 837
CEPNCYSRVISV GDEHIIIFAKRDI WEELTYDYRF S D++L CYCGFP+CRGVVND
Sbjct: 943 CEPNCYSRVISVLGDEHIIIFAKRDINPWEELTYDYRFVSSDQRLPCYCGFPKCRGVVND 1002
Query: 838 TEAEEQVAKLYAPRSELI 855
EAE Q AK+ RSEL
Sbjct: 1003 VEAEGQSAKIRVNRSELF 1020
>gi|222641096|gb|EEE69228.1| hypothetical protein OsJ_28463 [Oryza sativa Japonica Group]
Length = 1057
Score = 1039 bits (2687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/891 (59%), Positives = 640/891 (71%), Gaps = 58/891 (6%)
Query: 2 QDWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDN 61
+DWY G + GY+ + +H VKY DG+ EDL L++ERIKF IS EEM C L F I+N++
Sbjct: 186 EDWYKGSITGYNEATKKHSVKYDDGESEDLNLADERIKFSISSEEMKCRNLKFGISNLNK 245
Query: 62 DGYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGR 121
GYD E++ LA SL D Q L+PGD++WAKLTGHAMWPA+VVDES + + L +
Sbjct: 246 RGYD--ELLALAVSLHDYQGLDPGDLVWAKLTGHAMWPAVVVDESNVPANRALKPGRLDQ 303
Query: 122 SIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRM 181
SI VQFFGTHDFARI +KQ + FL GLLSS HLKCK+ RF +SLEEAK +L Q LP M
Sbjct: 304 SILVQFFGTHDFARIKLKQAVPFLNGLLSSLHLKCKQARFYRSLEEAKEFLCTQLLPENM 363
Query: 182 LQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
LQLQ ++ + + ++D + S +N +D+ + ++P G+L++ LG+I
Sbjct: 364 LQLQKSMEKGSSDANSNKD---VHSCDNLSEDKTAESGGDYDEMTPIELGNLRVSKLGRI 420
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
V DS+YF + + IWPEGYTA RKF S+ DP V YKMEVLR+++ K RPLFRVT ++G
Sbjct: 421 VTDSDYFHNKKHIWPEGYTAFRKFRSVKDPHVVILYKMEVLRNSDIKARPLFRVTSEDGT 480
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE-SGSDMFGFSNPEVMKLILG 360
Q GSTP+TCW +I +++E Q N A G + + SGS MFGFSNP++ +LI
Sbjct: 481 QIDGSTPNTCWKEIYCRLKEKQRNV-----ASGLDRDVCQGSGSYMFGFSNPQIRQLIQE 535
Query: 361 LTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCR 420
L +R S K D GYR V V+WKDLD CSVC MDEEY++NLFLQCDKCR
Sbjct: 536 LPNAR-----SCLKYFENAGDTFRGYRAVHVNWKDLDYCSVCDMDEEYEDNLFLQCDKCR 590
Query: 421 MMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAI 480
MMVHARCYGELEP+NGVLWLCNLCRP AP P CCLCPV GGAMKPTTDGRWAHLACAI
Sbjct: 591 MMVHARCYGELEPLNGVLWLCNLCRPEAPRVSPRCCLCPVTGGAMKPTTDGRWAHLACAI 650
Query: 481 WIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARA 540
WIPETCL DVKRMEPIDGL+R++KDRWKLLCSICGV+YGACIQCS+ TCRVAYHPLCARA
Sbjct: 651 WIPETCLKDVKRMEPIDGLSRINKDRWKLLCSICGVAYGACIQCSHPTCRVAYHPLCARA 710
Query: 541 AGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQ-VTRRCCD 599
A LCVELED+D+++L+ LDE DED CIRLLS+CKKH+QP +R +++ L + D
Sbjct: 711 ADLCVELEDDDKIHLMLLDE-DEDPCIRLLSYCKKHRQPSTERPSLESNLAKPAVVVQTD 769
Query: 600 YIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
+P PSGCAR+EPYN GRRG+K+P+ +A AS+KRL+VEN PY+V G+CQN + + +
Sbjct: 770 AVP---PSGCARTEPYNIHGRRGQKQPQVMATASVKRLYVENMPYIVSGFCQNRVGHDAI 826
Query: 660 PS-IRVIGSKFSFSLHRDA-PNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHR 717
I+ +G F H++A N SM +KYK MK TFR+RLAFGKS IHGFG+FAK H+
Sbjct: 827 SEPIQSVG--FLDVAHQEAVGNVSSMIEKYKSMKATFRRRLAFGKSRIHGFGVFAKVSHK 884
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLV------------------------------ 747
AGDM+IEY GELVRP I+D RE IYNSLV
Sbjct: 885 AGDMMIEYIGELVRPPISDIRERRIYNSLVPEGYMWKTLVKLIGLFMEFNLKALMGVIIT 944
Query: 748 ---GAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
GAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV GDEHIIIFAKRDI
Sbjct: 945 TFLGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVLGDEHIIIFAKRDIN 1004
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELI 855
WEELTYDYRF S D++L CYCGFP+CRGVVND EAE Q AK+ RSEL
Sbjct: 1005 PWEELTYDYRFVSSDQRLPCYCGFPKCRGVVNDVEAEGQSAKIRVNRSELF 1055
>gi|218201707|gb|EEC84134.1| hypothetical protein OsI_30479 [Oryza sativa Indica Group]
Length = 1057
Score = 1036 bits (2678), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 529/891 (59%), Positives = 639/891 (71%), Gaps = 58/891 (6%)
Query: 2 QDWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDN 61
+DWY G + GY+ + +H VKY DG+ EDL L++ERIKF IS EEM C L F I+N++
Sbjct: 186 EDWYKGSITGYNEATKKHSVKYDDGESEDLNLADERIKFSISSEEMKCRNLKFGISNLNK 245
Query: 62 DGYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGR 121
GYD E++ LA SL D Q L+PGD++WAKLTGHAMWPA+VVDES + + L +
Sbjct: 246 RGYD--ELLALAVSLHDYQGLDPGDLVWAKLTGHAMWPAVVVDESNVPANRALKPGRLDQ 303
Query: 122 SIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRM 181
SI VQFFGTHDFARI +KQ + FL GLLSS HLKCK+ RF +SLEEAK +L Q LP M
Sbjct: 304 SILVQFFGTHDFARIKLKQAVPFLNGLLSSLHLKCKQARFYRSLEEAKEFLCTQLLPENM 363
Query: 182 LQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
LQLQ ++ + + ++D + S +N +D+ + ++ G+L++ LG+I
Sbjct: 364 LQLQKSMEKGSSDANSNKD---VHSCDNLSEDKTAESGGDYDEMTQIELGNLRVSKLGRI 420
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
V DS+YF + + IWPEGYTA RKF S+ DP V YKMEVLR+++ K RPLFRVT ++G
Sbjct: 421 VTDSDYFHNKKHIWPEGYTAFRKFRSVKDPHVVILYKMEVLRNSDIKARPLFRVTSEDGT 480
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE-SGSDMFGFSNPEVMKLILG 360
Q GSTP+TCW +I +++E Q N A G + + SGS MFGFSNP++ +LI
Sbjct: 481 QIDGSTPNTCWKEIYCRLKEKQRNV-----ASGLDRDVCQGSGSYMFGFSNPQIRQLIQE 535
Query: 361 LTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCR 420
L +R S K D GYR V V+WKDLD CSVC MDEEY++NLFLQCDKCR
Sbjct: 536 LPNAR-----SCLKYFENAGDTFRGYRAVHVNWKDLDYCSVCDMDEEYEDNLFLQCDKCR 590
Query: 421 MMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAI 480
MMVHARCYGELEP+NGVLWLCNLCRP AP P CCLCPV GGAMKPTTDGRWAHLACAI
Sbjct: 591 MMVHARCYGELEPLNGVLWLCNLCRPEAPRVSPRCCLCPVTGGAMKPTTDGRWAHLACAI 650
Query: 481 WIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARA 540
WIPETCL DVKRMEPIDGL+R++KDRWKLLCSICGV+YGACIQCS+ TCRVAYHPLCARA
Sbjct: 651 WIPETCLKDVKRMEPIDGLSRINKDRWKLLCSICGVAYGACIQCSHPTCRVAYHPLCARA 710
Query: 541 AGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQ-VTRRCCD 599
A LCVELED+D+++L+ LDE DED CIRLLS+CKKH+QP +R +++ L + D
Sbjct: 711 ADLCVELEDDDKIHLMLLDE-DEDPCIRLLSYCKKHRQPSTERPSLESNLAKPAVVVQTD 769
Query: 600 YIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
+P PSGCAR+EPYN GRRG+K+P+ +A AS+KRL+VEN PY+V G+CQN + + +
Sbjct: 770 AVP---PSGCARTEPYNIHGRRGQKQPQVMATASVKRLYVENMPYIVSGFCQNRVGHDAI 826
Query: 660 PS-IRVIGSKFSFSLHRDA-PNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHR 717
I+ +G F H++A N SM +KYK MK TFR+RLAFGKS IHGFG+FAK H+
Sbjct: 827 SEPIQSVG--FLDVAHQEAVGNVSSMIEKYKSMKATFRRRLAFGKSRIHGFGVFAKVSHK 884
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLV------------------------------ 747
AGDM+IEY GELVRP I+D RE IYNSLV
Sbjct: 885 AGDMMIEYIGELVRPPISDIRERRIYNSLVPEGYMWKTLVKLIGLFMEFNLKALMGVIIT 944
Query: 748 ---GAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
GAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV GDEHIIIFAKRDI
Sbjct: 945 TFLGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVLGDEHIIIFAKRDIN 1004
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELI 855
WEELTYDYRF S D++L CYCGFP+CRGVVND EAE Q AK+ RSEL
Sbjct: 1005 PWEELTYDYRFVSSDQRLPCYCGFPKCRGVVNDVEAEGQSAKIRVNRSELF 1055
>gi|357150782|ref|XP_003575574.1| PREDICTED: histone-lysine N-methyltransferase ATX2-like [Brachypodium
distachyon]
Length = 1055
Score = 1032 bits (2669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 515/856 (60%), Positives = 631/856 (73%), Gaps = 25/856 (2%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWY G + GY + +H VKY DG+ EDL L++ERI+F IS EEM CL L F ++N+D
Sbjct: 219 DWYKGSITGYKEVTKKHSVKYDDGEAEDLTLADERIRFTISSEEMKCLNLKFGMSNMDKK 278
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
G+D E++ LA S D Q L+PGD++WAK+TGHAMWPA+VVDES + + L ++ +S
Sbjct: 279 GHD--ELLALALSFHDYQGLDPGDLVWAKITGHAMWPAVVVDESNVPANRALKEVRLDQS 336
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
+ VQFFGTHDFARI +KQ + FL GLLSS HLKCK+ RF + LEEAK +L Q+LP ML
Sbjct: 337 VLVQFFGTHDFARIKLKQAVPFLNGLLSSLHLKCKQARFCRGLEEAKEFLLTQQLPESML 396
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QLQ +++DG + QD+ ++G+ +N ++R ++ G+L + LG+IV
Sbjct: 397 QLQ---KSNDGSDVNCQDD-TIGAYDN-LPEDRATENGDDEEMTQIELGNLCVSKLGRIV 451
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
DS++F + IWPEGYTA RKFTS+ DP + SYKMEVLR++++K RPLFRV ++G Q
Sbjct: 452 SDSDHFHNKMHIWPEGYTAFRKFTSVKDPHLVTSYKMEVLRNSDTKARPLFRVIAEDGLQ 511
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
GSTP+ CW +I +I++ Q D ++E SGSDMFGFSNP++ +LI L
Sbjct: 512 IDGSTPNACWKEIYRRIKKKQC----DVASESEGNVCQRSGSDMFGFSNPQIRQLIQELP 567
Query: 363 KSRPTSKSSLC-KLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
+R LC K D GYR V V+WKDLD C+VC MDEEY++NLFLQCDKCRM
Sbjct: 568 NAR------LCLKYFENGGDTLRGYRAVNVNWKDLDYCNVCDMDEEYEDNLFLQCDKCRM 621
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGELEP++G LWLCNLCRPGAP P CCLCPV GGAMKPTTDGRWAHLACAIW
Sbjct: 622 MVHARCYGELEPLDGKLWLCNLCRPGAPRVSPRCCLCPVTGGAMKPTTDGRWAHLACAIW 681
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL DVKRMEPIDGL+R++KDRWKL+CSIC V+YGACIQCS+ TCRVAYHPLCARAA
Sbjct: 682 IPETCLKDVKRMEPIDGLSRINKDRWKLICSICTVAYGACIQCSHPTCRVAYHPLCARAA 741
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYI 601
LCVELED+D+++L+ LDE DED CIRLLS+CKKH+QP + R ++ L +
Sbjct: 742 DLCVELEDDDKIHLMLLDE-DEDLCIRLLSYCKKHRQPSSKRPSLKSDLGNPAMVVQTDV 800
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN--GLSGNTL 659
S SGCAR+EPYN+ RRG+++P+ A AS+KRL+VEN+PY+V GYCQN G +
Sbjct: 801 ASS--SGCARTEPYNFHRRRGQQQPQVTATASVKRLYVENRPYIVSGYCQNRGGCDTSCE 858
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
P V S +L N SM +KYK MK TFR+RLAFGKS IHGFG+FAK H+AG
Sbjct: 859 PIQSVCPS--DAALQEVVVNVSSMVEKYKRMKATFRRRLAFGKSRIHGFGVFAKVAHKAG 916
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
DM+IEY GELVRP ++D RE IYNSLVGAGTYMFRIDDERVIDATR+GSIAHLINHSCE
Sbjct: 917 DMMIEYIGELVRPPVSDIREGRIYNSLVGAGTYMFRIDDERVIDATRSGSIAHLINHSCE 976
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTE 839
PNCYSRVISV GDEHIIIFAKRDI WEELTYDYRF S D++L CYCGFP+CRGVVND E
Sbjct: 977 PNCYSRVISVLGDEHIIIFAKRDIAPWEELTYDYRFVSNDQRLPCYCGFPKCRGVVNDVE 1036
Query: 840 AEEQVAKLYAPRSELI 855
AE Q A + RS+L
Sbjct: 1037 AEVQSANIKVTRSDLF 1052
>gi|332077887|gb|AED99887.1| Trx1 [Hordeum vulgare]
Length = 1029
Score = 1020 bits (2638), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/856 (60%), Positives = 631/856 (73%), Gaps = 17/856 (1%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWY G + Y + +H VKY DG+ EDL L+NERI+F IS EEM CL L F +N+D
Sbjct: 190 DWYKGSITAYTELTKKHSVKYDDGEAEDLTLANERIQFSISSEEMKCLNLKFGTSNLDKQ 249
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
GYD E++ LA S D Q L+P D++WAK+TGHAMWPA++VDES + + L I +S
Sbjct: 250 GYD--ELLALAVSFHDYQGLDPDDLVWAKITGHAMWPAVIVDESNVPSSRALKPIRLDQS 307
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
I VQFFGTHDFARI KQ + FL GLLSS HLKCK+ RF++ LEEAK +L Q+LP ML
Sbjct: 308 ILVQFFGTHDFARIKSKQAVPFLSGLLSSLHLKCKQTRFSRGLEEAKEFLLTQQLPENML 367
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QL+ +I +DG + QD+ +GS +N +ER + ++ G+L++ +LG+IV
Sbjct: 368 QLRKSIE-NDGSDVNGQDD-VIGSCDN-LSEERAEDNGEDAEMTQIELGNLRVSNLGRIV 424
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
DS++F + IWPEGYTA RKFTS+ DP + SYKMEVLR+++ K RPLFRV ++G Q
Sbjct: 425 SDSDHFHNKMHIWPEGYTAFRKFTSVNDPHLVTSYKMEVLRNSDIKARPLFRVISEDGLQ 484
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
GSTP+ CW +I +I+ Q N + + EG SGSDMFGFSNP++ +LI L
Sbjct: 485 IDGSTPNACWKEIYCRIKAKQCNVASEL--EGNV--CQRSGSDMFGFSNPQIRQLIQEL- 539
Query: 363 KSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMM 422
P ++S L + L GYR V V+WKDLD CSVC MDEEY++NLFLQCDKCRMM
Sbjct: 540 ---PNARSCLKYFENGGETL-RGYRAVHVNWKDLDFCSVCDMDEEYEDNLFLQCDKCRMM 595
Query: 423 VHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWI 482
VHARCYGELEP+NGVLWLCNLCRPGAP P CCLCPV GGAMKPTTDGRWAHLACAIWI
Sbjct: 596 VHARCYGELEPLNGVLWLCNLCRPGAPRVSPRCCLCPVTGGAMKPTTDGRWAHLACAIWI 655
Query: 483 PETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAG 542
PETCL DVKRMEPIDGL++++KDRWKLLCSIC V+YG CIQCS+ TCRVAYHPLCARAA
Sbjct: 656 PETCLKDVKRMEPIDGLSKINKDRWKLLCSICTVAYGVCIQCSHPTCRVAYHPLCARAAD 715
Query: 543 LCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIP 602
LCVELED+D+++L+ L E+DED CIRLLS+CKKH+QP +R +++ L +
Sbjct: 716 LCVELEDDDKIHLILL-EEDEDPCIRLLSYCKKHRQPSTERPSLESDLGNPAQLV--QTD 772
Query: 603 PSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
++ SGCAR+EPYN+ RRG+++P+ A AS+KRL+VEN PY+V GYCQN + +T
Sbjct: 773 AASSSGCARTEPYNFHRRRGQQQPQVTATASVKRLYVENMPYIVSGYCQNKVGCDTSCEP 832
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ + + N SMA+KYK MK TFRKRLAFGKS IHGFG+FAK H+AGDM+
Sbjct: 833 IQSVALLDAASQEASVNVSSMAEKYKSMKATFRKRLAFGKSRIHGFGVFAKVGHKAGDMM 892
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GELVRP I+D RE IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC
Sbjct: 893 IEYIGELVRPPISDLRERRIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 952
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEE 842
YSRVISV GDEHIIIFAKRDI WEELTYDYRF S +++L CYCGFP+CRGVVND EAE
Sbjct: 953 YSRVISVLGDEHIIIFAKRDIDPWEELTYDYRFVSNEQRLPCYCGFPKCRGVVNDVEAEV 1012
Query: 843 QVAKLYAPRSELIDWR 858
Q K+ RSEL +
Sbjct: 1013 QSTKIRVTRSELFQQK 1028
>gi|326489957|dbj|BAJ94052.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1081
Score = 1020 bits (2637), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/856 (60%), Positives = 631/856 (73%), Gaps = 17/856 (1%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWY G + Y + +H VKY DG+ EDL L+NERI+F IS EEM CL L F +N+D
Sbjct: 242 DWYKGSITAYTELTKKHSVKYDDGEAEDLTLANERIQFSISSEEMKCLNLKFGTSNLDKQ 301
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
GYD E++ LA S D Q L+P D++WAK+TGHAMWPA++VDES + + L I +S
Sbjct: 302 GYD--ELLALAVSFHDYQGLDPDDLVWAKITGHAMWPAVIVDESNVPSSRALKPIRLDQS 359
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
I VQFFGTHDFARI KQ + FL GLLSS HLKCK+ RF++ LEEAK +L Q+LP ML
Sbjct: 360 ILVQFFGTHDFARIKSKQAVPFLSGLLSSLHLKCKQTRFSRGLEEAKEFLLTQQLPENML 419
Query: 183 QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKIV 242
QL+ +I +DG + QD+ +GS +N +ER + ++ G+L++ +LG+IV
Sbjct: 420 QLRKSIE-NDGSDVNGQDD-VIGSCDN-LSEERAEDNGEDAEMTQIELGNLRVSNLGRIV 476
Query: 243 KDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQ 302
DS++F + IWPEGYTA RKFTS+ DP + SYKMEVLR+++ K RPLFRV ++G Q
Sbjct: 477 SDSDHFHNKMHIWPEGYTAFRKFTSVNDPHLVTSYKMEVLRNSDIKARPLFRVISEDGLQ 536
Query: 303 FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLT 362
GSTP+ CW +I +I+ Q N + + EG SGSDMFGFSNP++ +LI L
Sbjct: 537 IDGSTPNACWKEIYCRIKAKQCNVASEL--EGNV--CQRSGSDMFGFSNPQIRQLIQEL- 591
Query: 363 KSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMM 422
P ++S L + L GYR V V+WKDLD CSVC MDEEY++NLFLQCDKCRMM
Sbjct: 592 ---PNARSCLKYFENGGETL-RGYRAVHVNWKDLDFCSVCDMDEEYEDNLFLQCDKCRMM 647
Query: 423 VHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWI 482
VHARCYGELEP+NGVLWLCNLCRPGAP P CCLCPV GGAMKPTTDGRWAHLACAIWI
Sbjct: 648 VHARCYGELEPLNGVLWLCNLCRPGAPRVSPRCCLCPVTGGAMKPTTDGRWAHLACAIWI 707
Query: 483 PETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAG 542
PETCL DVKRMEPIDGL++++KDRWKLLCSIC V+YG CIQCS+ TCRVAYHPLCARAA
Sbjct: 708 PETCLKDVKRMEPIDGLSKINKDRWKLLCSICTVAYGVCIQCSHPTCRVAYHPLCARAAD 767
Query: 543 LCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIP 602
LCVELED+D+++L+ L E+DED CIRLLS+CKKH+QP +R +++ L +
Sbjct: 768 LCVELEDDDKIHLILL-EEDEDPCIRLLSYCKKHRQPSTERPSLESDLGNPAQLV--QTD 824
Query: 603 PSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
++ SGCAR+EPYN+ RRG+++P+ A AS+KRL+VEN PY+V GYCQN + +T
Sbjct: 825 AASSSGCARTEPYNFHRRRGQQQPQVTATASVKRLYVENMPYIVSGYCQNKVGCDTSCEP 884
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ + + N SMA+KYK MK TFRKRLAFGKS IHGFG+FAK H+AGDM+
Sbjct: 885 IQSVALLDAASQEASVNVSSMAEKYKSMKATFRKRLAFGKSRIHGFGVFAKVGHKAGDMM 944
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GELVRP I+D RE IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC
Sbjct: 945 IEYIGELVRPPISDLRERRIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 1004
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEE 842
YSRVISV GDEHIIIFAKRDI WEELTYDYRF S +++L CYCGFP+CRGVVND EAE
Sbjct: 1005 YSRVISVLGDEHIIIFAKRDIDPWEELTYDYRFVSNEQRLPCYCGFPKCRGVVNDVEAEV 1064
Query: 843 QVAKLYAPRSELIDWR 858
Q K+ RSEL +
Sbjct: 1065 QSTKIRVTRSELFQQK 1080
>gi|414884804|tpg|DAA60818.1| TPA: putative trithorax-like family protein isoform 1 [Zea mays]
gi|414884805|tpg|DAA60819.1| TPA: putative trithorax-like family protein isoform 2 [Zea mays]
Length = 976
Score = 949 bits (2453), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/856 (56%), Positives = 593/856 (69%), Gaps = 73/856 (8%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWY G + Y+ + +H VKY DG+ EDL L++ERIKF+IS EEM L L I+N D
Sbjct: 188 DWYKGSIAWYNEATKKHSVKYDDGEAEDLSLADERIKFFISSEEMKSLNLKLGISNQDKK 247
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
G+D E++ LA S D Q L+PGD++WAK+TGHAMWPA+VVDES + + L I +S
Sbjct: 248 GHD--ELLALAVSFHDYQGLDPGDLVWAKITGHAMWPAVVVDESNVPATRALKSIRLDQS 305
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
I VQFFGTHDFARI +KQ + FL GL+SS HLKCK+ F++SLEEAK +L Q+LP ML
Sbjct: 306 ILVQFFGTHDFARIKLKQAVPFLNGLVSSLHLKCKQASFSRSLEEAKEFLHTQQLPEIML 365
Query: 183 QLQNAIRADDGE-NSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
QL+ +++ D + NS+ EG + S N + LQ + ++ G+L++ +LG+I
Sbjct: 366 QLRKSVQHDVSDVNSY---EGKVDSCSNLSEALALQNGEDA-EMTQIELGNLRVSNLGRI 421
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
V DS++F + + IWPEGYTA RK+ S+ DP YKMEVLR++++K RPLFRV ++G
Sbjct: 422 VSDSDHFHNKKDIWPEGYTAFRKYMSIEDPHAVTLYKMEVLRNSDTKARPLFRVISEDGV 481
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGL 361
Q GSTP+ CW +I +I+E Q E SGS MFGFSNP++ +LI L
Sbjct: 482 QIDGSTPNACWKEIYRRIKEKQ-------CTELERNVCQSSGSYMFGFSNPQIRQLIQEL 534
Query: 362 TKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
+R S K D GYR V+V+WKDLD C+VC MDEEY++NLFLQCDKCRM
Sbjct: 535 PNAR-----SCLKYFENGGDTILGYRAVQVNWKDLDYCNVCDMDEEYEDNLFLQCDKCRM 589
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGEL+ ++G LWLCNLCRPGAP P CCLCPV GGAMKPTTDGRWAHLACAIW
Sbjct: 590 MVHARCYGELKQLDGGLWLCNLCRPGAPRVSPKCCLCPVTGGAMKPTTDGRWAHLACAIW 649
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL DVKRMEPIDGL+R++KDRWKL+CSICGVSYG CIQCS+ TCRVAYHPLCARAA
Sbjct: 650 IPETCLKDVKRMEPIDGLSRINKDRWKLVCSICGVSYGVCIQCSHPTCRVAYHPLCARAA 709
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYI 601
LC+E+ D
Sbjct: 710 DLCIEVVQTDM------------------------------------------------- 720
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
++ SGCAR+EPYN RRG+K+P+ +A ASLKRL+VEN+P++V GYCQN + S
Sbjct: 721 --ASSSGCARTEPYNLHRRRGQKQPQVVATASLKRLYVENRPHIVSGYCQNRVGNTCGES 778
Query: 662 IRVIGSKFSFSLHRDA-PNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGD 720
I+ G S ++ ++A N SM +KY +K TFRKRL FGKS IHG+G+FAK H+AGD
Sbjct: 779 IQPFG--LSDAVQQEAIGNVSSMVEKYTIVKATFRKRLTFGKSKIHGYGVFAKVAHKAGD 836
Query: 721 MVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEP 780
M++EY GE+VRP I+D RE IYNSLVGAGTYMFRIDDERVIDATR GSIA LINHSCEP
Sbjct: 837 MMVEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSCEP 896
Query: 781 NCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEA 840
NCYSR I++ GDEHIIIFAKRDI WEELTYDYRFFS D++L C+CGFP+CRGVVND EA
Sbjct: 897 NCYSRAITILGDEHIIIFAKRDIDPWEELTYDYRFFSSDQRLPCFCGFPKCRGVVNDVEA 956
Query: 841 EEQVAKLYAPRSELID 856
EEQ AK+ RSEL +
Sbjct: 957 EEQSAKIRVKRSELFE 972
>gi|414884806|tpg|DAA60820.1| TPA: putative trithorax-like family protein isoform 1 [Zea mays]
gi|414884807|tpg|DAA60821.1| TPA: putative trithorax-like family protein isoform 2 [Zea mays]
Length = 982
Score = 942 bits (2436), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/818 (58%), Positives = 595/818 (72%), Gaps = 32/818 (3%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWY G + Y+ + +H VKY DG+ EDL L++ERIKF+IS EEM L L I+N D
Sbjct: 188 DWYKGSIAWYNEATKKHSVKYDDGEAEDLSLADERIKFFISSEEMKSLNLKLGISNQDKK 247
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
G+D E++ LA S D Q L+PGD++WAK+TGHAMWPA+VVDES + + L I +S
Sbjct: 248 GHD--ELLALAVSFHDYQGLDPGDLVWAKITGHAMWPAVVVDESNVPATRALKSIRLDQS 305
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
I VQFFGTHDFARI +KQ + FL GL+SS HLKCK+ F++SLEEAK +L Q+LP ML
Sbjct: 306 ILVQFFGTHDFARIKLKQAVPFLNGLVSSLHLKCKQASFSRSLEEAKEFLHTQQLPEIML 365
Query: 183 QLQNAIRADDGE-NSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
QL+ +++ D + NS+ EG + S N + LQ + ++ G+L++ +LG+I
Sbjct: 366 QLRKSVQHDVSDVNSY---EGKVDSCSNLSEALALQNGEDA-EMTQIELGNLRVSNLGRI 421
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
V DS++F + + IWPEGYTA RK+ S+ DP YKMEVLR++++K RPLFRV ++G
Sbjct: 422 VSDSDHFHNKKDIWPEGYTAFRKYMSIEDPHAVTLYKMEVLRNSDTKARPLFRVISEDGV 481
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGL 361
Q GSTP+ CW +I +I+E Q E SGS MFGFSNP++ +LI L
Sbjct: 482 QIDGSTPNACWKEIYRRIKEKQ-------CTELERNVCQSSGSYMFGFSNPQIRQLIQEL 534
Query: 362 TKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
+R S K D GYR V+V+WKDLD C+VC MDEEY++NLFLQCDKCRM
Sbjct: 535 PNAR-----SCLKYFENGGDTILGYRAVQVNWKDLDYCNVCDMDEEYEDNLFLQCDKCRM 589
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIW 481
MVHARCYGEL+ ++G LWLCNLCRPGAP P CCLCPV GGAMKPTTDGRWAHLACAIW
Sbjct: 590 MVHARCYGELKQLDGGLWLCNLCRPGAPRVSPKCCLCPVTGGAMKPTTDGRWAHLACAIW 649
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
IPETCL DVKRMEPIDGL+R++KDRWKL+CSICGVSYG CIQCS+ TCRVAYHPLCARAA
Sbjct: 650 IPETCLKDVKRMEPIDGLSRINKDRWKLVCSICGVSYGVCIQCSHPTCRVAYHPLCARAA 709
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQP-LNDRLAVDE---RLVQVTRRC 597
LC+ELE++D+++ + LDE DED CIRLLS+CKKH+QP +R +++ +QV +
Sbjct: 710 DLCIELENDDKIHHMYLDE-DEDPCIRLLSYCKKHRQPSAAERPSLESDPPEPIQVVQ-- 766
Query: 598 CDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGN 657
++ SGCAR+EPYN RRG+K+P+ +A ASLKRL+VEN+P++V GYCQN +
Sbjct: 767 ---TDMASSSGCARTEPYNLHRRRGQKQPQVVATASLKRLYVENRPHIVSGYCQNRVGNT 823
Query: 658 TLPSIRVIGSKFSFSLHRDA-PNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPH 716
SI+ G S ++ ++A N SM +KY +K TFRKRL FGKS IHG+G+FAK H
Sbjct: 824 CGESIQPFG--LSDAVQQEAIGNVSSMVEKYTIVKATFRKRLTFGKSKIHGYGVFAKVAH 881
Query: 717 RAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINH 776
+AGDM++EY GE+VRP I+D RE IYNSLVGAGTYMFRIDDERVIDATR GSIA LINH
Sbjct: 882 KAGDMMVEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINH 941
Query: 777 SCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
SCEPNCYSR I++ GDEHIIIFAKRDI WEELTYDYR
Sbjct: 942 SCEPNCYSRAITILGDEHIIIFAKRDIDPWEELTYDYR 979
>gi|47848463|dbj|BAD22319.1| trithorax 1 (ATX-1) (TRX1)-like [Oryza sativa Japonica Group]
Length = 873
Score = 803 bits (2074), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/706 (57%), Positives = 509/706 (72%), Gaps = 25/706 (3%)
Query: 2 QDWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDN 61
+DWY G + GY+ + +H VKY DG+ EDL L++ERIKF IS EEM C L F I+N++
Sbjct: 184 EDWYKGSITGYNEATKKHSVKYDDGESEDLNLADERIKFSISSEEMKCRNLKFGISNLNK 243
Query: 62 DGYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGR 121
GYD E++ LA SL D Q L+PGD++WAKLTGHAMWPA+VVDES + + L +
Sbjct: 244 RGYD--ELLALAVSLHDYQGLDPGDLVWAKLTGHAMWPAVVVDESNVPANRALKPGRLDQ 301
Query: 122 SIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRM 181
SI VQFFGTHDFARI +KQ + FL GLLSS HLKCK+ RF +SLEEAK +L Q LP M
Sbjct: 302 SILVQFFGTHDFARIKLKQAVPFLNGLLSSLHLKCKQARFYRSLEEAKEFLCTQLLPENM 361
Query: 182 LQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFGDLQILSLGKI 241
LQLQ ++ + + ++D + S +N +D+ + ++P G+L++ LG+I
Sbjct: 362 LQLQKSMEKGSSDANSNKD---VHSCDNLSEDKTAESGGDYDEMTPIELGNLRVSKLGRI 418
Query: 242 VKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGE 301
V DS+YF + + IWPEGYTA RKF S+ DP V YKMEVLR+++ K RPLFRVT ++G
Sbjct: 419 VTDSDYFHNKKHIWPEGYTAFRKFRSVKDPHVVILYKMEVLRNSDIKARPLFRVTSEDGT 478
Query: 302 QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE-SGSDMFGFSNPEVMKLILG 360
Q GSTP+TCW +I +++E Q N A G + + SGS MFGFSNP++ +LI
Sbjct: 479 QIDGSTPNTCWKEIYCRLKEKQRNV-----ASGLDRDVCQGSGSYMFGFSNPQIRQLIQE 533
Query: 361 LTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCR 420
L +R S K D GYR V V+WKDLD CSVC MDEEY++NLFLQCDKCR
Sbjct: 534 LPNAR-----SCLKYFENAGDTFRGYRAVHVNWKDLDYCSVCDMDEEYEDNLFLQCDKCR 588
Query: 421 MMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAI 480
MMVHARCYGELEP+NGVLWLCNLCRP AP P CCLCPV GGAMKPTTDGRWAHLACAI
Sbjct: 589 MMVHARCYGELEPLNGVLWLCNLCRPEAPRVSPRCCLCPVTGGAMKPTTDGRWAHLACAI 648
Query: 481 WIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARA 540
WIPETCL DVKRMEPIDGL+R++KDRWKLLCSICGV+YGACIQCS+ TCRVAYHPLCARA
Sbjct: 649 WIPETCLKDVKRMEPIDGLSRINKDRWKLLCSICGVAYGACIQCSHPTCRVAYHPLCARA 708
Query: 541 AGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQ-VTRRCCD 599
A LCVELED+D+++L+ LDE DED CIRLLS+CKKH+QP +R +++ L + D
Sbjct: 709 ADLCVELEDDDKIHLMLLDE-DEDPCIRLLSYCKKHRQPSTERPSLESNLAKPAVVVQTD 767
Query: 600 YIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
+P PSGCAR+EPYN GRRG+K+P+ +A AS+KRL+VEN PY+V G+CQN + + +
Sbjct: 768 AVP---PSGCARTEPYNIHGRRGQKQPQVMATASVKRLYVENMPYIVSGFCQNRVGHDAI 824
Query: 660 PS-IRVIGSKFSFSLHRDA-PNFLSMADKYKHMKETFRKRLAFGKS 703
I+ +G F H++A N SM +KYK MK TFR+RLAFGKS
Sbjct: 825 SEPIQSVG--FLDVAHQEAVGNVSSMIEKYKSMKATFRRRLAFGKS 868
>gi|168057166|ref|XP_001780587.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
gi|162667953|gb|EDQ54570.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
Length = 902
Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/873 (48%), Positives = 546/873 (62%), Gaps = 42/873 (4%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
+WY G + Y+ ++ +H + Y D + E + +S ER K +S EE L+ + +
Sbjct: 41 EWYPGVIHDYNPQTKKHRIDYRDNEMEMVSVSKERFKLKLSPEEWVDLETATGEQPLIRS 100
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
D E+V LA +++D ++L GD++WAK+ G MWPA V+DE D+ + G+
Sbjct: 101 QPDPVELVALATAVEDQEKLVHGDLVWAKVKGWPMWPAFVMDE----DHAAACGMDPGKK 156
Query: 123 --IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRR 180
+P+QFFG++D R + K+++ F KGL+ FH KCK+ F Q LEE + YL E KLP
Sbjct: 157 GMVPLQFFGSYDHCRFSYKKLVIFSKGLMMKFHTKCKRVVFVQGLEEVERYLKECKLPDS 216
Query: 181 MLQL-------QNAI--RADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVFG 231
M L AI + + E +W + E + K + + + P G
Sbjct: 217 MSHLLDDGDGVHGAIERQRQEIEENWPEPEPDVQETGGEVK-RMTRRRVKRVVTFPLQLG 275
Query: 232 DLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRP 291
L ILSLG +V+DSE+F D ++IWPEGYT+VR F S DP YKM+V RD ++ P
Sbjct: 276 ALTILSLGNVVRDSEHFHDQQYIWPEGYTSVRSFPSAKDPDAFVDYKMQVFRDPLTQSLP 335
Query: 292 LFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSN 351
FR T + G +P CW K ++R+ + + S + MFGFSN
Sbjct: 336 TFRCTPADDTPVEGPSPYACWKKAFHRLRKAHKKVGKIPELDKRMQ--FRSAAHMFGFSN 393
Query: 352 PEVMKLILGLTKSRPTSKSSLCKLTSKYRD----LPGGYRPVRVDWKDLDKCSVCHMDEE 407
P V KLI L +R +K + + D LP GY+PV + WK LD+C+VC++DEE
Sbjct: 394 PRVSKLIQALPGARACTKFANWSVNPPNEDVEAVLPAGYKPVEISWKHLDRCTVCYLDEE 453
Query: 408 YQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKP 467
Y +NL LQCDKCR+MVH CYGELE +G LWLCNLCRP AP+ PPCCLCPV GA+K
Sbjct: 454 YVDNLLLQCDKCRIMVHMNCYGELELPDGDLWLCNLCRPDAPKTRPPCCLCPVTSGALKK 513
Query: 468 TTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNT 527
TTDGRWAHL CA+WIPETCL DVKRMEP+DG+N +SK+RW+L CSIC V YGACI+CS
Sbjct: 514 TTDGRWAHLMCAMWIPETCLVDVKRMEPVDGINAISKERWRLTCSICNVPYGACIRCSVN 573
Query: 528 TCRVAYHPLCARAAGLCVE-LEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQP--LNDRL 584
+C+ A+HPLCAR+AGL +E LE++ ++N E D +RLLS+C+KHKQ LN +
Sbjct: 574 SCKTAFHPLCARSAGLYMEVLEEKLQVN----GETD----LRLLSYCRKHKQSTRLNCEV 625
Query: 585 AVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPY 644
A+ + C Y PP SGCARSEPYN RRGR+EPEALAAA KRLFVEN PY
Sbjct: 626 ALPTPCTKTD--CLTYQPPVTSSGCARSEPYNAAARRGRREPEALAAALAKRLFVENLPY 683
Query: 645 LVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPN--FLSMADKYKHMKETFRKRLAFGK 702
V G N LP I + S + PN LSM+DK++ MK + +RLAFGK
Sbjct: 684 RV-----TGCRKNPLPKIASVSSNGNMWSMHWKPNVEILSMSDKFRRMKSSLSQRLAFGK 738
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IHG G+F K H A DM+IEY GE+VRP IAD RE Y+SLVGAGTYMFRIDDERV+
Sbjct: 739 SAIHGMGVFTKQVHYANDMIIEYAGEVVRPVIADIRERRCYDSLVGAGTYMFRIDDERVV 798
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT AG+IAHLINHSCEPNCYSR ++ +G++ IIIFAKR+I+ EELTYDYRF S DE L
Sbjct: 799 DATHAGTIAHLINHSCEPNCYSRTVTASGEDRIIIFAKRNIEVGEELTYDYRFMSKDEVL 858
Query: 823 ACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELI 855
CYCG CRG VN + + KL P SELI
Sbjct: 859 TCYCGCAGCRGSVNVVDGDGDSTKLSVPLSELI 891
>gi|168029907|ref|XP_001767466.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
gi|162681362|gb|EDQ67790.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
Length = 924
Score = 780 bits (2014), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/894 (47%), Positives = 544/894 (60%), Gaps = 62/894 (6%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWY G + Y+ ++ H + Y D + E +++S ER K +S EE L+ S ++ +
Sbjct: 41 DWYPGVIHDYNPQTKMHRIDYNDNEIEMVLISKERCKLKLSPEEWVELETSTRDQALNRN 100
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
E V LA +++D ++L GD++WAK+ G MWPA V+DE D+ + G+
Sbjct: 101 HLGPVEKVTLANTVEDQEKLVHGDLVWAKVKGWPMWPAFVMDE----DHAAACGMEPGKK 156
Query: 123 --IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRR 180
+P+QFFG++D R + K+++ F KGL+ FH KCK+ F Q LEE + YL E KLP
Sbjct: 157 GMVPLQFFGSYDHCRFSYKKLVIFSKGLMMKFHTKCKRVVFVQGLEEVERYLKECKLPDS 216
Query: 181 MLQLQNAIRADDGEN------SWSQDEGSLGSGENCFKDERLQGTLGSIGIS-------- 226
M L DDG+ Q+ D + G G
Sbjct: 217 MSHL-----LDDGDGVHGVIERQRQEIEENWPEPEPEPDVQEGGGEGKRMRRRRVKRVVA 271
Query: 227 -PYVFGDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDT 285
P G L ILSLG +V+DSE+F D ++IWPEGYTAVR F S DP YKM+V RD
Sbjct: 272 FPLQLGALTILSLGNVVRDSEHFHDQQYIWPEGYTAVRSFPSAKDPDAFVEYKMQVFRDP 331
Query: 286 ESKIRPLFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSD 345
+ P FR T + G +P CW K ++R+ + + + S +
Sbjct: 332 IIQSLPTFRCTPGDDTPVEGPSPYLCWKKAFQRLRKAHMKMGKNPELDKRMQ--YRSAAH 389
Query: 346 MFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRD----LPGGYRPVRVDWKDLDKCSV 401
MFGFSNP V KLI L +R +K + + + D LP GY+PV + WK LD+C+V
Sbjct: 390 MFGFSNPRVSKLIQALPGARACTKFTNWTVIAPDEDVEAVLPAGYKPVHISWKHLDRCTV 449
Query: 402 CHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVV 461
C++DEEY +NL LQCDKCR+MVH CYGELE +G LWLCNLCRP AP+ PPCCLCPV
Sbjct: 450 CYLDEEYVDNLLLQCDKCRIMVHMNCYGELELPDGDLWLCNLCRPDAPKTRPPCCLCPVT 509
Query: 462 GGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGAC 521
GGA+K T DGRWAHL CA+WIPETCL DVKRMEP+DG+N +SK+RW+L CS+C V YGAC
Sbjct: 510 GGALKKTIDGRWAHLMCAMWIPETCLVDVKRMEPVDGINAISKERWRLTCSVCKVPYGAC 569
Query: 522 IQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLN 581
I+CS +C+ A+HPLCAR+AGL +E+ +E L ++ + E +RLLS+C++HKQ +
Sbjct: 570 IKCSVNSCKTAFHPLCARSAGLYMEVLEEK----LQVNGETE---LRLLSYCRRHKQSTS 622
Query: 582 DRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVEN 641
V + + C Y PP SGCARSEPYN RRGR+EPEALAAA KRLFVEN
Sbjct: 623 PTCDVAQPIPCAKTDCLSYQPPLTSSGCARSEPYNAAARRGRREPEALAAALAKRLFVEN 682
Query: 642 QPYLVGGYCQN-----------GLSGNTL--PSIRVIGSKFSFSLHRDAPN-------FL 681
PY V G QN G G L PS IG + L P L
Sbjct: 683 LPYRVTGCRQNPPLKIGDCSSKGNMGYMLWEPSKGTIGGE---DLPVSTPTKSGEDVQVL 739
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
SM+DK++ MK + +RL FGKS IHG G+F K H A DM+IEY GE+VRP IAD RE
Sbjct: 740 SMSDKFRRMKSSLSQRLVFGKSAIHGMGVFTKQVHYANDMIIEYAGEVVRPVIADIRERR 799
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y+SLVGAGTYMFRIDDERV+DATRAGSIAHLINHSCEPNCYSR ++ +G++ IIIFAKR
Sbjct: 800 FYDSLVGAGTYMFRIDDERVVDATRAGSIAHLINHSCEPNCYSRTVTASGEDRIIIFAKR 859
Query: 802 DIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELI 855
+I+ EELTYDYRF S DE L CYCG CRG VN +++ KL P SELI
Sbjct: 860 NIEIGEELTYDYRFMSKDEVLTCYCGCAGCRGSVNVVDSDGDPTKLSVPLSELI 913
>gi|27754617|gb|AAO22754.1| putative trithorax protein 1 [Arabidopsis thaliana]
Length = 479
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/491 (74%), Positives = 423/491 (86%), Gaps = 14/491 (2%)
Query: 372 LCKLT-SKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE 430
+CK + ++++ P GYRPVRVDWKDLDKC+VCHMDEEY+NNLFLQCDKCRMMVHA+CYGE
Sbjct: 1 MCKNSLGRHQNQPTGYRPVRVDWKDLDKCNVCHMDEEYENNLFLQCDKCRMMVHAKCYGE 60
Query: 431 LEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDV 490
LEP +G LWLCNLCRPGAP+ PP CCLCPVVGGAMKPTTDGRWAHLACAIWIPETCL+DV
Sbjct: 61 LEPCDGALWLCNLCRPGAPDMPPRCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLSDV 120
Query: 491 KRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDE 550
K+MEPIDG+N+VSKDRWKL+C+ICGVSYGACIQCSN +CRVAYHPLCARAAGLCVELE++
Sbjct: 121 KKMEPIDGVNKVSKDRWKLMCTICGVSYGACIQCSNNSCRVAYHPLCARAAGLCVELEND 180
Query: 551 DRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCA 610
+S++ ++ DQCIR+LSFCK+H+Q L ++R+ T + +Y+PP NPSGCA
Sbjct: 181 -----MSVEGEEADQCIRMLSFCKRHRQTSTACLGSEDRIKSATHKTSEYLPPPNPSGCA 235
Query: 611 RSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFS 670
R+EPYN FGRRGRKEPEALAAAS KRLFVENQPY++GGY + L +T SI GSK S
Sbjct: 236 RTEPYNCFGRRGRKEPEALAAASSKRLFVENQPYVIGGYSR--LEFSTYKSIH--GSKVS 291
Query: 671 FSLHRDAP-NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGEL 729
+ P N LSMA+KY++M+ET+RKRLAFGKSGIHGFGIFAK PHRAGDM+IEYTGEL
Sbjct: 292 ---QMNTPSNILSMAEKYRYMRETYRKRLAFGKSGIHGFGIFAKLPHRAGDMMIEYTGEL 348
Query: 730 VRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV 789
VRPSIAD+RE IYNS+VGAGTYMFRIDDERVIDATR GSIAHLINHSC PNCYSRVI+V
Sbjct: 349 VRPSIADKREQLIYNSMVGAGTYMFRIDDERVIDATRTGSIAHLINHSCVPNCYSRVITV 408
Query: 790 NGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYA 849
NGDEHIIIFAKR I +WEELTYDYRFFSI E+L+C CGFP CRGVVNDTEAEEQ AK+
Sbjct: 409 NGDEHIIIFAKRHIPKWEELTYDYRFFSIGERLSCSCGFPGCRGVVNDTEAEEQHAKICV 468
Query: 850 PRSELIDWRGD 860
PR +LIDW +
Sbjct: 469 PRCDLIDWTAE 479
>gi|302786940|ref|XP_002975241.1| hypothetical protein SELMODRAFT_415365 [Selaginella moellendorffii]
gi|300157400|gb|EFJ24026.1| hypothetical protein SELMODRAFT_415365 [Selaginella moellendorffii]
Length = 1184
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/897 (45%), Positives = 551/897 (61%), Gaps = 53/897 (5%)
Query: 2 QDWYSGFVVGYDSESNRHHVKYVDG-------------------DEEDLILSNERIKFYI 42
+ W + VV YD S +H V ++ G E LS ERI FYI
Sbjct: 293 RKWLAASVVQYDQLSKKHQVLHLSGLGLITLTNASGPQVHYDIQGEVWQFLSRERIAFYI 352
Query: 43 SQEEMDCLKLSF-SINNVDNDGYDYDEMVVLAASLDDCQEL-EPGDIIWAKLTGHAMWPA 100
S +E L S+ ++++ +E++V AAS + L + GDI+WA+ G+ MWPA
Sbjct: 353 SLDEKMRLITSYDERSSLEERKAKAEELLVFAASTESYDRLFQHGDIVWAETPGYPMWPA 412
Query: 101 IVVDESLIGDYKGLN-KISGGRSIPVQFFGTHDFAR-----INVKQVISFLKGLLSSFHL 154
V+D+ + N +IS G +PV +FGT++ AR I + +++ F KG+L +HL
Sbjct: 413 FVMDDQ---HARLCNLEISTG-ELPVHYFGTYESARSTCFRIIIGRIVKFSKGILDDYHL 468
Query: 155 KCKKPRFTQSLEEAKVYLSEQKLPRRMLQLQNAIRADDGENSWSQDEGSLGSGENCFKDE 214
KC + F + LEE Y EQKLP M LQ + + S E E+ DE
Sbjct: 469 KCSRISFDRGLEEVDRYYREQKLPPGMTYLQEEVVTTLTQAIHSDSEEKSEKDEDFVGDE 528
Query: 215 R---LQGTLGSIGISPYVFGDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADP 271
R ++ +L SI P G LQ+L+LGKIVKDSEYF D +IW EGYT++RKF S+ DP
Sbjct: 529 RKHKVRKSLESISKCPIKLGSLQVLNLGKIVKDSEYFHDQHYIWTEGYTSIRKFFSIKDP 588
Query: 272 RVCNSYKMEVLRDTESKIRPLFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFS 331
YKME+LR+ ++ PLFRVT D+GEQ G + + CW KI ++ + + +
Sbjct: 589 TKTVIYKMEILRNPNARTMPLFRVTPDDGEQVEGPSAAACWKKILNQLHRARKKLNLN-G 647
Query: 332 AEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDL----PGGYR 387
+ SG+ MFG SNP++ +LI L +R SK + + + + DL P G++
Sbjct: 648 QHDKKRQFYRSGAAMFGLSNPQISRLIQALPYARVCSKFKVWREATTFEDLDAILPAGFK 707
Query: 388 PVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPG 447
V V+WK LD CSVC +EEY+ N+ LQCDKCRM+VH CYG LEP G WLCNLC
Sbjct: 708 HVEVEWKHLDHCSVCDTNEEYEGNILLQCDKCRMLVHLNCYGVLEP-PGDSWLCNLCDSN 766
Query: 448 APEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRW 507
AP+ PPCCLCP+ GGAMK TTDGRW HLACA+WIPET D+ RMEPI+G++ V+K+RW
Sbjct: 767 APKRSPPCCLCPIKGGAMKRTTDGRWVHLACALWIPETSCVDMDRMEPIEGISSVNKERW 826
Query: 508 KLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVE-LEDEDRLNLLSLDEDDEDQC 566
KL C+IC V YGACIQC++ CRV+YH LCARAAG C + LE R + + ++
Sbjct: 827 KLTCTICSVPYGACIQCADHHCRVSYHALCARAAGFCTKVLEGLRRKRNRTTGVQEVERS 886
Query: 567 IRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPS-NPSGCARSEPYNYFGRRGRKE 625
++L+S+CKKH ++ +++ DY S NPSG ARSEPYN RRGR+E
Sbjct: 887 VQLVSYCKKH---MHSKMSTKATFDTFATHEDDYAYASHNPSGSARSEPYNVAIRRGRRE 943
Query: 626 PEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG----SKFSFSLHRDAPN-- 679
P+ +AA KR F+ N+P++V G +N P+ IG F R AP+
Sbjct: 944 PDHYSAALAKRAFLVNRPHIVTGCLRNPYKRVKRPAPVPIGVSKVQPFLTIRPRSAPDGR 1003
Query: 680 --FLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADR 737
L++++++ M+ + +RL FGKS IHG+G+FAK PH AGDMVIEY GE++RP++AD
Sbjct: 1004 QVALTVSERFYQMRASLHRRLTFGKSAIHGWGLFAKEPHGAGDMVIEYAGEIIRPTVADV 1063
Query: 738 REHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIII 797
RE YNSLVGAGTYMF ID+ERV+DATRAGSIAHLINHSCEPNCYSRV++ NG E I+I
Sbjct: 1064 REKRCYNSLVGAGTYMFCIDNERVVDATRAGSIAHLINHSCEPNCYSRVVTTNGKERIVI 1123
Query: 798 FAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSEL 854
FAK+DI +E+TYDYRF SI +QL C+CG CRG+VN + E++ ++ +EL
Sbjct: 1124 FAKQDIAGGDEVTYDYRFTSIGDQLPCHCGTAGCRGIVNVMDEEDESGRILVRTAEL 1180
>gi|308044237|ref|NP_001182990.1| uncharacterized protein LOC100501309 [Zea mays]
gi|238008654|gb|ACR35362.1| unknown [Zea mays]
Length = 531
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/591 (59%), Positives = 416/591 (70%), Gaps = 66/591 (11%)
Query: 267 SLADPRVCNSYKMEVLRDTESKIRPLFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNT 326
S+ DP YKMEVLR++++K RPLFRV ++G Q GSTP+ CW +I +I+E Q
Sbjct: 2 SIEDPHAVTLYKMEVLRNSDTKARPLFRVISEDGVQIDGSTPNACWKEIYRRIKEKQ--- 58
Query: 327 SDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGGY 386
E SGS MFGFSNP++ +LI L P ++S L K D GY
Sbjct: 59 ----CTELERNVCQSSGSYMFGFSNPQIRQLIQEL----PNARSCL-KYFENGGDTILGY 109
Query: 387 RPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446
R V+V+WKDLD C+VC MDEEY++NLFLQCDKCRMMVHARCYGEL+ ++G LWLCNLCRP
Sbjct: 110 RAVQVNWKDLDYCNVCDMDEEYEDNLFLQCDKCRMMVHARCYGELKQLDGGLWLCNLCRP 169
Query: 447 GAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDR 506
GAP P CCLCPV GGAMKPTTDGRWAHLACAIWIPETCL DVKRMEPIDGL+R++KDR
Sbjct: 170 GAPRVSPKCCLCPVTGGAMKPTTDGRWAHLACAIWIPETCLKDVKRMEPIDGLSRINKDR 229
Query: 507 WKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQC 566
WKL+CSICGVSYG CIQCS+ TCRVAYHPLCARAA LC+E+ D
Sbjct: 230 WKLVCSICGVSYGVCIQCSHPTCRVAYHPLCARAADLCIEVVQTDM-------------- 275
Query: 567 IRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEP 626
++ SGCAR+EPYN RRG+K+P
Sbjct: 276 -------------------------------------ASSSGCARTEPYNLHRRRGQKQP 298
Query: 627 EALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDA-PNFLSMAD 685
+ +A ASLKRL+VEN+P++V GYCQN + SI+ G S ++ ++A N SM +
Sbjct: 299 QVVATASLKRLYVENRPHIVSGYCQNRVGNTCGESIQPFG--LSDAVQQEAIGNVSSMVE 356
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
KY +K TFRKRL FGKS IHG+G+FAK H+AGDM++EY GE+VRP I+D RE IYNS
Sbjct: 357 KYTIVKATFRKRLTFGKSKIHGYGVFAKVAHKAGDMMVEYIGEIVRPPISDTRERRIYNS 416
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
LVGAGTYMFRIDDERVIDATR GSIA LINHSCEPNCYSR I++ GDEHIIIFAKRDI
Sbjct: 417 LVGAGTYMFRIDDERVIDATRVGSIARLINHSCEPNCYSRAITILGDEHIIIFAKRDIDP 476
Query: 806 WEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELID 856
WEELTYDYRFFS D++L C+CGFP+CRGVVND EAEEQ AK+ RSEL +
Sbjct: 477 WEELTYDYRFFSSDQRLPCFCGFPKCRGVVNDVEAEEQSAKIRVKRSELFE 527
>gi|302821061|ref|XP_002992195.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
gi|300139962|gb|EFJ06692.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
Length = 1052
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 343/884 (38%), Positives = 476/884 (53%), Gaps = 134/884 (15%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCL--------KLSFS 55
WY+G + Y+ S +H + YVD E L L NE++ F + +E L K+
Sbjct: 270 WYTGAIHEYNPASKKHRIDYVDNQMEWLCLRNEKVHFQMGIKEEKVLCAAGCDEYKMRLG 329
Query: 56 INNVDNDGYDYDEMVVLAASLDDCQE-LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGL 114
+ DE+ +LA ++ +E + G ++WAK+ G MWPA+V+DE + GL
Sbjct: 330 V----------DELALLANEMESFEEQISHGTVVWAKVKGWPMWPALVLDEEH-AEKCGL 378
Query: 115 NKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSE 174
+ + VQFFG+ DFAR+N ++++F KG+ +H KCK+P F Q L E + YL +
Sbjct: 379 ERPLKKSTFAVQFFGSCDFARLNNDKIVTFSKGVQLKYHSKCKRPAFDQGLREVESYLKQ 438
Query: 175 QKLPRRM---LQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGI---SPY 228
+LP +M L++Q+ R D G S + D E DER T S+ P
Sbjct: 439 GRLPEKMAQMLEIQSKSRNDVGRASKTSD------AEEYIDDEREHPTKTSMEDLMGYPL 492
Query: 229 VFGDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G L+ILSLG++ DS +F + +IWP G+++VR D ++MEV R
Sbjct: 493 DLGSLKILSLGEVAFDSAHFHNKHYIWPVGFSSVRVLPFFKDLEKLVEHRMEVTRCKSDP 552
Query: 289 IRPLFRVTLDNG---EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE-SGS 344
P+F VT+ + QF GSTP CW K+ ++ D + A SG+
Sbjct: 553 AIPVFHVTVADTISRLQFEGSTPGDCWKKVYKEL--------DRLKEKPAKTHFKRLSGA 604
Query: 345 DMFGFSNPEVMKLILGLTKSRPTSK----SSLCKLTSKYRDLPGGYRPVRVDWKDLDKCS 400
MFGF+NP V KLI L +R S + + P GY+P+ V WK L++C+
Sbjct: 605 SMFGFNNPRVFKLIKELPNARECSNFMDWDKVHSSKKSKQAGPSGYKPINVMWKHLERCN 664
Query: 401 VCHMDE---EYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
VC++DE Q+++ ++ E++
Sbjct: 665 VCYLDEAGIRRQSSVAVR--------------EMQ------------------------- 685
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
+ GGAMK TTDGRWAHL CA+WIPETCL DVK+MEPIDG++ + K+RWKL CS+C VS
Sbjct: 686 --IDGGAMKKTTDGRWAHLTCAMWIPETCLVDVKKMEPIDGVSSIHKERWKLTCSVCRVS 743
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
+GACIQ ++D+D D ++ ++LLSFC KH
Sbjct: 744 HGACIQ-----------------------IQDDD--------GDGLEKPLKLLSFCSKH- 771
Query: 578 QPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRL 637
N+R A V ++ + + + N SGC+R EPYN RR R+EPE A ++ KR
Sbjct: 772 ---NNR-ASSHVDVHLSSQESEVV--ENSSGCSRCEPYNPVSRRRRQEPEGKALSTSKRS 825
Query: 638 FVENQPYLVGGYCQNGL---SGNTLPSIRVIGSKFSFSLHRDAPN-FLSMADKYKHMKET 693
FVE PYLV + + G S+ + SF R LSM+ + + M+ T
Sbjct: 826 FVEKTPYLVANFERKAKLVKIGKAYKSVFGTVGQGSFLSDRSTEEPVLSMSARLESMRST 885
Query: 694 FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYM 753
RL FGKSGIHG+G+FAK H+AGDM+ EY GE+VR +IAD RE Y+SLVGAGTYM
Sbjct: 886 MHDRLTFGKSGIHGWGVFAKQIHKAGDMMAEYAGEIVRSNIADIRERRHYDSLVGAGTYM 945
Query: 754 FRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY 813
FRIDDERV+DAT GS+AHLINHSCEPNCYSR+I+V+ + IIIFAKRDI WEELTYDY
Sbjct: 946 FRIDDERVVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDY 1005
Query: 814 RFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELIDW 857
RF S +L C CG +CRG V ++ ++ A RS L W
Sbjct: 1006 RFASKGAELVCNCGALKCRGSVRHSDDSNNEGRILALRSSLRSW 1049
>gi|168027193|ref|XP_001766115.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
gi|162682758|gb|EDQ69174.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
Length = 514
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 285/479 (59%), Positives = 337/479 (70%), Gaps = 19/479 (3%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
LP GY+PV V WK LD+C+VC++DEEY NNL LQCDKC MMVH CYGE E +G LWLC
Sbjct: 39 LPAGYKPVDVLWKHLDRCTVCYLDEEYDNNLLLQCDKCHMMVHMDCYGEQELPDGDLWLC 98
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNR 501
NLC AP+P PPCCLCP+ GGAMK TTDGRWAHL CA+WIPETC DVKRMEPI G+
Sbjct: 99 NLCELDAPKPRPPCCLCPITGGAMKKTTDGRWAHLMCAMWIPETCFVDVKRMEPIHGIKA 158
Query: 502 VSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDED 561
VSK+RW+L C +C V YGACIQC +C A+HPLCAR+AGLC+EL++E D
Sbjct: 159 VSKERWRLTCVVCKVLYGACIQCPVRSCTTAFHPLCARSAGLCMELQEEKHKKYGKSD-- 216
Query: 562 DEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRR 621
+RLL++C+KHKQP N V +R+ V C Y PP N SGCARSEPYN RR
Sbjct: 217 -----MRLLAYCRKHKQPTNSTCEVAQRIPHVMTDCMSYCPPLNSSGCARSEPYNAAARR 271
Query: 622 GRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFS---FSLHRDAP 678
GR+EP+ALAAA KRLFVEN PY V G QN P +V G +SLH +A
Sbjct: 272 GRREPDALAAALAKRLFVENLPYSVTGCRQN-------PPPKVAGYTNGGSLWSLHWEAS 324
Query: 679 --NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIAD 736
+ LSM+DK++ MK + +RL FGKS IHG G+F K H A DM+IEY GE+VRP +AD
Sbjct: 325 KGSILSMSDKFRRMKGSLSQRLVFGKSAIHGMGVFTKRVHYANDMIIEYAGEVVRPVVAD 384
Query: 737 RREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHII 796
RE Y+SLVGAGTYMFRIDDERV+DAT GSIAHLINHSCEPNCYSR ++ +G++ II
Sbjct: 385 SRERRHYDSLVGAGTYMFRIDDERVVDATTTGSIAHLINHSCEPNCYSRTVTASGEDRII 444
Query: 797 IFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELI 855
IFAKRD++ EELTYDYRF S E L CYCG CRG VN + + KL P S+L
Sbjct: 445 IFAKRDLEIGEELTYDYRFMSKTEVLTCYCGSAGCRGSVNVQDEDGDPTKLCIPLSDLF 503
>gi|302800676|ref|XP_002982095.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
gi|300150111|gb|EFJ16763.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
Length = 1045
Score = 562 bits (1449), Expect = e-157, Method: Compositional matrix adjust.
Identities = 339/881 (38%), Positives = 472/881 (53%), Gaps = 135/881 (15%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCL--------KLSFS 55
WY+G + Y+ S +H + YVD E L L NE++ F + +E L K+
Sbjct: 270 WYTGAIHEYNPASKKHRIDYVDNQMEWLCLRNEKVHFQMGIKEEKVLCAAGCDDYKMRLG 329
Query: 56 INNVDNDGYDYDEMVVLAASLDDCQE-LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGL 114
+ DE+ +LA ++ +E + G ++WAK+ G MWPA+V+DE + GL
Sbjct: 330 V----------DELALLANEMESFEEQISHGTVVWAKVKGWPMWPALVLDEEH-AEKCGL 378
Query: 115 NKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSE 174
+ + VQFFG+ DFAR+N ++++F KG+ +H KCK+P F Q L E ++
Sbjct: 379 ERPLKKSTFAVQFFGSCDFARLNNDKIVTFSKGVQLKYHSKCKRPAFDQGLREGRL---P 435
Query: 175 QKLPRRMLQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGT---LGSIGISPYVFG 231
+K+ + ML++Q+ R D G+ S + D E DER T + + P G
Sbjct: 436 EKMAQ-MLEIQSKSRNDVGKASKTSD------AEEYIDDEREHPTKTRMEDLMGYPLDLG 488
Query: 232 DLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRP 291
L+ILSLG++ DS +F + +IWP G+++VR D ++MEV R P
Sbjct: 489 SLKILSLGEVAFDSAHFHNKHYIWPVGFSSVRVLPFFKDLEKLVEHRMEVTRCKSDPAIP 548
Query: 292 LFRVTLDNG---EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE-SGSDMF 347
+F VT+ + QF GSTP CW K+ ++ D + A SG+ MF
Sbjct: 549 VFHVTVADTISRLQFEGSTPGDCWKKVYKEL--------DRLKEKPAKTHFKRLSGASMF 600
Query: 348 GFSNPEVMKLILGLTKSRPTSK----SSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCH 403
GF+NP V KLI L +R S + + P GY+P+ V WK L++C+VC+
Sbjct: 601 GFNNPRVFKLIKELPNARECSNFMDWDKVHSSKKSKQTGPSGYKPINVMWKHLERCNVCY 660
Query: 404 MDE---EYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPV 460
+DE Q+++ ++ E++ +
Sbjct: 661 LDEAGIRGQSSVAVR--------------EMQ---------------------------I 679
Query: 461 VGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGA 520
GGAMK TTDGRWAHL CA+WIPETCL DVK+MEPIDG++ + K+RWKL CS+C VS+GA
Sbjct: 680 DGGAMKKTTDGRWAHLTCAMWIPETCLVDVKKMEPIDGVSTIHKERWKLTCSVCRVSHGA 739
Query: 521 CIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPL 580
CIQ ++D+D D ++ ++LLSFC KH
Sbjct: 740 CIQ-----------------------IQDDD--------GDGLEKPLKLLSFCSKH---- 764
Query: 581 NDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVE 640
N+R + V V + N SGC+R EPYN RR R+EPE A ++ KR FVE
Sbjct: 765 NNRASSH---VDVHLSSQESEVAENSSGCSRCEPYNPVSRRRRQEPEGKALSTSKRSFVE 821
Query: 641 NQPYLVGGYCQNGL---SGNTLPSIRVIGSKFSFSLHRDAPN-FLSMADKYKHMKETFRK 696
PYLV + + G S+ + SF R LSM+ + + M+ T
Sbjct: 822 KTPYLVANFERKAKLVKIGKAYKSVFGTVGQGSFLSDRSTEEPVLSMSARLESMRSTMHD 881
Query: 697 RLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRI 756
RL FGKSGIHG+G+FAK H+AGDM+ EY GE+VR +IAD RE Y+SLVGAGTYMFRI
Sbjct: 882 RLTFGKSGIHGWGVFAKQIHKAGDMMAEYAGEIVRSNIADIREQRHYDSLVGAGTYMFRI 941
Query: 757 DDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFF 816
DDERV+DAT GS+AHLINHSCEPNCYSR+I+V+ + IIIFAKRDI WEELTYDYRF
Sbjct: 942 DDERVVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFA 1001
Query: 817 SIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELIDW 857
S +L C CG +CRG V ++ ++ A RS L W
Sbjct: 1002 SKGAELVCNCGALKCRGSVRHSDDSNNEGRILALRSSLRSW 1042
>gi|311788758|gb|ADQ12920.1| trithorax [Solanum lycopersicum]
Length = 873
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 273/357 (76%), Positives = 302/357 (84%), Gaps = 7/357 (1%)
Query: 510 LCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRL 569
LCSIC V YGACIQCSN CRVAYHPLCARAAG CVELEDEDRL+L+ +D+D+ DQCIRL
Sbjct: 518 LCSICSVPYGACIQCSNPVCRVAYHPLCARAAGFCVELEDEDRLHLIPMDDDELDQCIRL 577
Query: 570 LSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEAL 629
LSFCKKH+ N+R AVDE + Q DY+PP NPSGCARSEPYNYFGRRGRKEPE L
Sbjct: 578 LSFCKKHRAVSNERPAVDECVGQKACEYSDYVPPPNPSGCARSEPYNYFGRRGRKEPEVL 637
Query: 630 AAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHR------DAPNFLSM 683
AASLKRL+VEN+PYLVGG+ Q+ S NTL S GSK +F L + + + +SM
Sbjct: 638 TAASLKRLYVENRPYLVGGHSQHDQSSNTLSS-SCAGSKHTFDLQKLRCSQLTSRSIVSM 696
Query: 684 ADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIY 743
+KY +MKET +RLAFGKSGIHGFGIFAK P +AGDMVIEYTGELVRP IADRREH IY
Sbjct: 697 VEKYNYMKETLGQRLAFGKSGIHGFGIFAKLPQKAGDMVIEYTGELVRPPIADRREHLIY 756
Query: 744 NSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDI 803
NSLVGAGTYMFRIDD+RVIDATRAGSIAHLINHSCEPNCYSRVISVN +HIIIF+KRDI
Sbjct: 757 NSLVGAGTYMFRIDDQRVIDATRAGSIAHLINHSCEPNCYSRVISVNSIDHIIIFSKRDI 816
Query: 804 KQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELIDWRGD 860
+QWEELTYDYRF SIDEQLACYCGFPRCRGVVNDTEAEE++AKLYAPRSELIDW G+
Sbjct: 817 EQWEELTYDYRFLSIDEQLACYCGFPRCRGVVNDTEAEERMAKLYAPRSELIDWEGE 873
Score = 239 bits (611), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 120/206 (58%), Positives = 149/206 (72%), Gaps = 1/206 (0%)
Query: 240 KIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRPLFRVTLDN 299
KIV+DSE F+D+ FIWPEGYTAVRK S+ DP V SYKMEVLRD + + RPLFRVT D+
Sbjct: 249 KIVEDSELFRDEEFIWPEGYTAVRKLPSVTDPSVRVSYKMEVLRDPDFRTRPLFRVTSDS 308
Query: 300 GEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLIL 359
EQF GS+PS CW+K+ ++R+ Q + D+ + +E+ SGS MFGFS+PE+ KLI
Sbjct: 309 REQFKGSSPSACWNKVYKQMRKTQVDNFDESISSRKSERTFGSGSHMFGFSHPEISKLIK 368
Query: 360 GLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKC 419
L+KS+ +K SL +SK +D KC+VCHMDEEY+NNLFLQCDKC
Sbjct: 369 ELSKSKILAK-SLKLASSKNQDXXXXXXXXXXXXXXXXKCNVCHMDEEYENNLFLQCDKC 427
Query: 420 RMMVHARCYGELEPVNGVLWLCNLCR 445
RMMVHARCYGE EP++GVLWLCNLCR
Sbjct: 428 RMMVHARCYGEREPMDGVLWLCNLCR 453
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 80/142 (56%), Positives = 93/142 (65%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDG 63
WY+G + GY+SE+ RHHVKYVDGDEEDL+LSNERIKF ++
Sbjct: 19 WYTGRITGYNSETGRHHVKYVDGDEEDLLLSNERIKFSVTLXXXXXXXXXXXXXXXXXXX 78
Query: 64 YDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSI 123
LEPGDIIWAKLTGHAMWPAIV+DES G KGLNK+SG +S+
Sbjct: 79 XXXXXXXXXXXXXXXXXXLEPGDIIWAKLTGHAMWPAIVLDESCAGGCKGLNKVSGEKSV 138
Query: 124 PVQFFGTHDFARINVKQVISFL 145
VQFFGTHDFAR+ +KQVISFL
Sbjct: 139 LVQFFGTHDFARVKLKQVISFL 160
>gi|302785193|ref|XP_002974368.1| hypothetical protein SELMODRAFT_101022 [Selaginella moellendorffii]
gi|300157966|gb|EFJ24590.1| hypothetical protein SELMODRAFT_101022 [Selaginella moellendorffii]
Length = 508
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 264/508 (51%), Positives = 342/508 (67%), Gaps = 19/508 (3%)
Query: 360 GLTKSRPTSKSSLCKLTSKYRDL----PGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQ 415
L +R SK + + + + DL P G++ V V+WK LD CSVC +EEY+ N+ LQ
Sbjct: 3 ALPYARVCSKFKVWREATTFEDLDAILPAGFKHVEVEWKHLDHCSVCDTNEEYEGNILLQ 62
Query: 416 CDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAH 475
CDKCRM+VH CYG LEP G WLCNLC AP+ PPCCLCP+ GGAMK TTDGRW H
Sbjct: 63 CDKCRMLVHLNCYGVLEP-PGDSWLCNLCDSNAPKRSPPCCLCPIKGGAMKRTTDGRWVH 121
Query: 476 LACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHP 535
LACA+WIPET D+ RMEPI+G++ V+K+RWKL C+IC V YGACIQC++ CRV+YH
Sbjct: 122 LACALWIPETSCVDMDRMEPIEGISSVNKERWKLTCTICSVPYGACIQCADHHCRVSYHA 181
Query: 536 LCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTR 595
LCARAAG C + R + E + ++L+S+CKKH ++ +++
Sbjct: 182 LCARAAGFCTKGLRRKRNRTTGVQEVERS--VQLVSYCKKH---MHSKMSTKATFDTFAT 236
Query: 596 RCCDYIPPS-NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGL 654
DY S NPSG ARSEPYN RRGR+EP+ +AA KR F+ N+P++V G +N
Sbjct: 237 HEDDYAYASHNPSGSARSEPYNVAIRRGRREPDHYSAALAKRAFLVNRPHIVTGCLRNPY 296
Query: 655 SGNTLPSIRVIG----SKFSFSLHRDAPN----FLSMADKYKHMKETFRKRLAFGKSGIH 706
P+ IG F R AP+ L++++++ M+ + +RL FGKS IH
Sbjct: 297 KRVKRPAPVPIGVSKVQPFLTIRPRSAPDGRQVALTVSERFYQMRASLHRRLTFGKSAIH 356
Query: 707 GFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATR 766
G+G+FAK PH AGDMVIEY GE++RP++AD RE YNSLVGAGTYMF ID+ERV+DATR
Sbjct: 357 GWGLFAKEPHGAGDMVIEYAGEIIRPTVADVREKRCYNSLVGAGTYMFCIDNERVVDATR 416
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYC 826
AGSIAHLINHSCEPNCYSRV++ NG E I+IFAK+DI +E+TYDYRF SI +QL C+C
Sbjct: 417 AGSIAHLINHSCEPNCYSRVVTTNGKERIVIFAKQDIAGGDEVTYDYRFTSIGDQLPCHC 476
Query: 827 GFPRCRGVVNDTEAEEQVAKLYAPRSEL 854
G CRG+VN + E++ ++ SEL
Sbjct: 477 GTAGCRGIVNVMDEEDESGRILVRASEL 504
>gi|384252564|gb|EIE26040.1| hypothetical protein COCSUDRAFT_13072, partial [Coccomyxa
subellipsoidea C-169]
Length = 859
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 300/874 (34%), Positives = 431/874 (49%), Gaps = 93/874 (10%)
Query: 4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQ------EEMDCLKLSFSIN 57
W+ G V Y+ + +H V+Y DGD E ++++ R++ +S EE+ L +
Sbjct: 16 WFLGSVSSYNPDDGKHEVRYEDGDMESILMATSRVRLEVSAGETLQAEELRTLAEQQAAA 75
Query: 58 NVDNDGYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKI 117
+ D D + + Q G+++WA+ G WP++V+ D L
Sbjct: 76 AAGSRPDDQDPASNAPEAGE--QTFGLGEVVWAREKGWPAWPSVVITLESARDLSPLRPC 133
Query: 118 SGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPR--FTQSLEEAKVYLSEQ 175
GG + V FFGT++ I ++ GL H+KCK+ F ++L EA YL
Sbjct: 134 KGG--VTVWFFGTYEMGCIKPAEIAGLRTGLDQELHVKCKRAVKVFRRALHEAYTYLQAS 191
Query: 176 KLP-------RRMLQLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPY 228
LP R + + + D + SW D G+ D L+ LG
Sbjct: 192 LLPYAELIFSRVSMIRTDQMILDSNDESWFADADEAPDGDFTGADFPLK--LGK------ 243
Query: 229 VFGDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
+L + SLG+I + F + +++WP GY+A+R T R+ ++ DT
Sbjct: 244 ---NLTVNSLGRIEFLRQAFHNKKYLWPLGYSAMRTETLPNGKRMACHCQILSSPDT--- 297
Query: 289 IRPLFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE-SGSDMF 347
+ P+FRVT + G P W ++ +A GA + + SG MF
Sbjct: 298 LAPIFRVTPEGGVPVEADHPGRAWKEV--------------LAATGAERRAAGLSGPRMF 343
Query: 348 GFSNPEVMKLILGLTKS-RPTSKSSLC--KLTSKYRDLPGGYRPVRVDWKDLDKCSVCHM 404
G NP + ++I L + R S + + + LP G R V D CSVC
Sbjct: 344 GLDNPAIARMIQALPHAGRCASFDAWLGERPPCEPLRLPAGIRAVPADRAVQGVCSVCGE 403
Query: 405 DEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGA 464
+EE N LQCD CR VH CYG P G LWLC++C+ G P P C LCPV GG
Sbjct: 404 EEETDANHLLQCDGCREFVHMDCYGVGAPPEGRLWLCDVCKLG-PSRAPACALCPVEGGL 462
Query: 465 MKPTTDGRWAHLACAIWIPETCL-TDVKRMEPIDGLNRVSK--DRWKLLCSICGVSYGAC 521
+K TT GRW H AC +W+PET + DV +DGL + K R L C++C +YGAC
Sbjct: 463 LKRTTCGRWVHSACTLWVPETAIDCDVGL---VDGLQYIPKACHRLPLSCAVCSQAYGAC 519
Query: 522 IQCS-NTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH---- 576
IQC+ + +C ++HPLCARAAGLC+ + E LS +RL+ +C +H
Sbjct: 520 IQCAGHRSCCASFHPLCARAAGLCMRVWREG--TALSAG-------LRLMCYCPRHTALL 570
Query: 577 -KQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLK 635
L R+++ + +R I + C+R PYN+ RRG + P+A+ AA K
Sbjct: 571 ESSTLKARMSIPPTPLPPSRALRAQI---QTAPCSRCIPYNHELRRGHRAPDAIVAALAK 627
Query: 636 RLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFR 695
R FV PYLV ++ P++ + + S+A+++K M+ T
Sbjct: 628 RAFVRATPYLVT------MARTQPPALTGAAAARGPAPGGR--AVQSLAERFKEMRRTVT 679
Query: 696 KRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFR 755
RL GKS IHG+G F K P DM++EY GEL+R +AD RE Y+ LVGAGTY+F
Sbjct: 680 ARLTCGKSAIHGWGAFTKVPAAESDMLVEYMGELLRRPVADARERRTYDRLVGAGTYVFG 739
Query: 756 IDDERVIDATRAGSIAHLINHSCEPNCYSRVISV------NGDEHIIIFAKRDIKQWEEL 809
+ DE V+DATR G++AHL+NHSCEPN YSR +SV +H++IFAKR I EEL
Sbjct: 740 LSDELVVDATRKGNMAHLLNHSCEPNSYSRTVSVRCPDTGTLSDHVVIFAKRAIAAGEEL 799
Query: 810 TYDYR---FFSIDEQLACYCGFPRCRGVVNDTEA 840
TYDYR S +L C CG CRG VN +A
Sbjct: 800 TYDYRRAPSHSTPHKLPCNCGAATCRGFVNLPKA 833
>gi|4582457|gb|AAD24841.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 178
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 146/170 (85%), Positives = 163/170 (95%)
Query: 378 KYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGV 437
++++ P GYRPVRVDWKDLDKC+VCHMDEEY+NNLFLQCDKCRMMVHA+CYGELEP +G
Sbjct: 8 RHQNQPTGYRPVRVDWKDLDKCNVCHMDEEYENNLFLQCDKCRMMVHAKCYGELEPCDGA 67
Query: 438 LWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPID 497
LWLCNLCRPGAP+ PP CCLCPVVGGAMKPTTDGRWAHLACAIWIPETCL+DVK+MEPID
Sbjct: 68 LWLCNLCRPGAPDMPPRCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLSDVKKMEPID 127
Query: 498 GLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVEL 547
G+N+VSKDRWKL+C+ICGVSYGACIQCSN +CRVAYHPLCARAAGLCVE+
Sbjct: 128 GVNKVSKDRWKLMCTICGVSYGACIQCSNNSCRVAYHPLCARAAGLCVEV 177
>gi|4582456|gb|AAD24840.1| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
Length = 186
Score = 313 bits (802), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 149/182 (81%), Positives = 164/182 (90%), Gaps = 1/182 (0%)
Query: 679 NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
N LSMA+KY++M+ET+RKRLAFGKSGIHGFGIFAK PHRAGDM+IEYTGELVRPSIAD+R
Sbjct: 6 NILSMAEKYRYMRETYRKRLAFGKSGIHGFGIFAKLPHRAGDMMIEYTGELVRPSIADKR 65
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E IYNS+ GAGTYMFRIDDERVIDATR GSIAHLINHSC PNCYSRVI+VNGDEHIIIF
Sbjct: 66 EQLIYNSM-GAGTYMFRIDDERVIDATRTGSIAHLINHSCVPNCYSRVITVNGDEHIIIF 124
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELIDWR 858
AKR I +WEELTYDYRFFSI E+L+C CGFP CRGVVNDTEAEEQ AK+ PR +LIDW
Sbjct: 125 AKRHIPKWEELTYDYRFFSIGERLSCSCGFPGCRGVVNDTEAEEQHAKICVPRCDLIDWT 184
Query: 859 GD 860
+
Sbjct: 185 AE 186
>gi|4582458|gb|AAD24842.1| hypothetical protein [Arabidopsis thaliana]
Length = 514
Score = 299 bits (765), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 169/370 (45%), Positives = 222/370 (60%), Gaps = 75/370 (20%)
Query: 16 SNRHHVKYV-DGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDGYDYDEMVVLAA 74
++R K+V DG +ED++ E IKF +S+EEM+ L L F +NV DG DYDEMVVLAA
Sbjct: 186 ASRSAKKWVRDGCDEDIVFDREMIKFLVSREEMELLHLKFCTSNVTVDGRDYDEMVVLAA 245
Query: 75 SLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGL-NKISGGRSIPVQFFGTHDF 133
+LD+CQ+ EPGDI+WAKL GHAMWPA++VDES+IG+ KGL NK+SGG S+ VQFFGTHDF
Sbjct: 246 TLDECQDFEPGDIVWAKLAGHAMWPAVIVDESIIGERKGLNNKVSGGGSLLVQFFGTHDF 305
Query: 134 ARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRMLQLQNAIRADDG 193
AR YL +LP RM QLQ + D
Sbjct: 306 AR-----------------------------------YLKAHRLPERMSQLQKGADSVDS 330
Query: 194 ENSWSQDEGSLGSGENCFKDERLQGTLGSIGISP-------YVFGDLQILSL-------- 238
+ + S +EG+ SG + D G + + P ++ GDL I++L
Sbjct: 331 DMANSTEEGN--SGGDLLND-------GEVWLRPTEHVDFRHIIGDLLIINLVGYRKTPM 381
Query: 239 ----------GKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
GK+V DS++F+D+ IWPEGYTA+RKFTSL D YKMEVLRD E+K
Sbjct: 382 IIYRRMTCLTGKVVTDSQFFKDENHIWPEGYTAMRKFTSLTDHSASALYKMEVLRDAETK 441
Query: 289 IRPLFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFG 348
PLF VT D+GEQF G TPS CW+KI +I++ QN+ S + E+++ SG+DMFG
Sbjct: 442 THPLFIVTADSGEQFKGPTPSACWNKIYNRIKKVQNSDSPNI----LGEELNGSGTDMFG 497
Query: 349 FSNPEVMKLI 358
SNPEV+KL+
Sbjct: 498 LSNPEVIKLV 507
>gi|303286289|ref|XP_003062434.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226455951|gb|EEH53253.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 491
Score = 295 bits (755), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 175/455 (38%), Positives = 257/455 (56%), Gaps = 24/455 (5%)
Query: 385 GYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNL 443
GY +RV W +D+C+VC D ++ + + C+ C + VH CYG E P + V WLC
Sbjct: 56 GYEIIRVQWS-VDRCAVCDDDRDFDFDQLVTCEGCAISVHQSCYGIPEIPDDAVGWLCAA 114
Query: 444 CRP--GAPEPPPPCCLCPVVGGAMKPTTD-GRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C G P CCLCPV GGA+KPTT GRW H AC WIPET + DV M+PID ++
Sbjct: 115 CEHTGGVVSETPLCCLCPVEGGALKPTTKPGRWCHSACCQWIPETTVLDVDTMQPIDQID 174
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ ++RW+LLC++C +GA IQC + C +AYHPLCARA+GL +E L D
Sbjct: 175 TIQRERWELLCTVCKQRHGAKIQCDHPGCYLAYHPLCARASGLFME------ARLGEDDG 228
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGR 620
+DED + ++S+C +H +R A + R P +P+G R R
Sbjct: 229 EDEDSPLMMVSYCHRHCLVDTERAAFARGRSRQKR------PWRSPAGRRRRRRKRRTRR 282
Query: 621 RGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNF 680
R R + S KR FVE PY+ + L+ T ++ V + + ++ +
Sbjct: 283 RKRPKTTRPPPRSGKRAFVELIPYVAENAAR--LAAKTSGAVAVAAMEIAAAIVDEGKG- 339
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
+ ++ + + +R FGKS IHG+G+ AK P +AG MVIE+ GE+V+P +AD RE
Sbjct: 340 --VRERMEEAHDFLHERFTFGKSNIHGWGLIAKKPVKAGSMVIEFRGEIVKPHVADLREK 397
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
++ + Y+ + D++ VID T G+IA NHSC PN Y++++SV+G HII FA+
Sbjct: 398 AYDDANIDC--YLLKADEKTVIDTTMRGNIARFTNHSCNPNMYTKIVSVDGSNHIIFFAR 455
Query: 801 RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
D++ EE+TYDYRF + ++ CYCG CRG +
Sbjct: 456 VDVQPGEEMTYDYRFDAESGKVPCYCGAHNCRGFL 490
>gi|302761298|ref|XP_002964071.1| hypothetical protein SELMODRAFT_142359 [Selaginella moellendorffii]
gi|300167800|gb|EFJ34404.1| hypothetical protein SELMODRAFT_142359 [Selaginella moellendorffii]
Length = 564
Score = 293 bits (750), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 188/529 (35%), Positives = 276/529 (52%), Gaps = 72/529 (13%)
Query: 337 EKISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDL 396
+++ ESG+ G+++ V KL + + C L Y PV V+W
Sbjct: 79 QRMIESGAPGIGYNSHNVRKLAM--------KDFTAC--------LQEPYTPVIVNWTS- 121
Query: 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCC 456
++C+VC E+Y+ N + C++C++ VH CYG W+C +C+ P+ CC
Sbjct: 122 ERCAVCRWVEDYEYNKIIICNRCQLAVHEECYGVKASEISSSWVCRVCQ--TPDIERECC 179
Query: 457 LCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICG 515
LCPV GGA+KP+ G W H+ CA + PE D+ +MEP GL V R+ +CS+C
Sbjct: 180 LCPVKGGALKPSDIPGFWVHVTCAWFTPEVAFKDINKMEPAVGLQGVDTTRFSQVCSLCE 239
Query: 516 VSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKK 575
+GACIQC T CR YHP+CA AG +EL+ + N+ S R+LS+C
Sbjct: 240 QVHGACIQC--TKCRTTYHPMCASRAGYHMELQISKKKNVPS---------TRMLSYCAA 288
Query: 576 HKQPLND---RLAVDERLVQVTRRCCDYIP-PSNPSGCARSE------PYNYFGR----- 620
HK P D + + ++ T+ +P P+ C +E N R
Sbjct: 289 HKTPSPDAFLQYNITQKKPPATKF--PLVPVPTEEVPCVETERLGCLLAANELSRSLEIS 346
Query: 621 -----RGRKEPEALAAA------SLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKF 669
R R P A A S K+ Y V GY +N L + + S+R
Sbjct: 347 FEKFKRWRMLPAARCQAYNPDDTSHKKPTGAGIAYRVMGYTKNSL--DWISSLRDY---- 400
Query: 670 SFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGEL 729
L D +SM ++ +++T + R+ FGKSGIHG+G+FA+ + G+MV+EY GE
Sbjct: 401 ---LEPDHGEVMSMKERLSFLQKTEKTRVCFGKSGIHGWGLFARRYIKEGEMVVEYRGER 457
Query: 730 VRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV 789
+R S+AD RE L G Y+F+I +E VIDAT G+I LINHSCEPNCY+R++SV
Sbjct: 458 IRRSVADLREKRY--CLEGKHCYLFKISEEVVIDATENGNIGRLINHSCEPNCYARIVSV 515
Query: 790 --NGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
G+ HI++ A++D+ EELTYDY+F D+++ C CG RCR +N
Sbjct: 516 EGEGESHIVLIARKDVSVGEELTYDYQFDKEDKKVLCLCGSSRCRRFMN 564
>gi|223974115|gb|ACN31245.1| unknown [Zea mays]
Length = 214
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 139/212 (65%), Positives = 167/212 (78%), Gaps = 3/212 (1%)
Query: 604 SNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIR 663
++ SGCAR+EPYN RRG+K+P+ +A ASLKRL+VEN+P++V GYCQN + SI+
Sbjct: 2 ASSSGCARTEPYNLHRRRGQKQPQVVATASLKRLYVENRPHIVSGYCQNRVGNTCGESIQ 61
Query: 664 VIGSKFSFSLHRDA-PNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
G S ++ ++A N SM +KY +K TFRKRL FGKS IHG+G+FAK H+AGDM+
Sbjct: 62 PFG--LSDAVQQEAIGNVSSMVEKYTIVKATFRKRLTFGKSKIHGYGVFAKVAHKAGDMM 119
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
+EY GE+VRP I+D RE IYNSLVGAGTYMFRIDDERVIDATR GSIA LINHS EPNC
Sbjct: 120 VEYIGEIVRPPISDTRERRIYNSLVGAGTYMFRIDDERVIDATRVGSIARLINHSREPNC 179
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
YSR I++ GDEHIIIFAKRDI WEELTYDYR
Sbjct: 180 YSRAITILGDEHIIIFAKRDIDPWEELTYDYR 211
>gi|302787360|ref|XP_002975450.1| hypothetical protein SELMODRAFT_174869 [Selaginella moellendorffii]
gi|300157024|gb|EFJ23651.1| hypothetical protein SELMODRAFT_174869 [Selaginella moellendorffii]
Length = 582
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 185/537 (34%), Positives = 274/537 (51%), Gaps = 70/537 (13%)
Query: 337 EKISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDL 396
+++ ESG+ G+++ V KL + + C L Y PV V+W
Sbjct: 79 QRMIESGAPGIGYNSHNVRKLAV--------KDFTAC--------LQEPYTPVIVNWTS- 121
Query: 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCC 456
++C+VC E+Y+ N + C++C++ VH CYG W+C +C+ P+ CC
Sbjct: 122 ERCAVCRWVEDYEYNKIIICNRCQLAVHEECYGVKASEISSSWVCRVCQ--TPDIERECC 179
Query: 457 LCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICG 515
LCPV GGA+KP+ G W H+ CA + PE D+ +MEP GL V R+ C +C
Sbjct: 180 LCPVKGGALKPSDIPGFWVHVTCAWFTPEVAFKDINKMEPAVGLQGVDTTRFSQACVVCE 239
Query: 516 VSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLN-----LLSLDEDDEDQCIRL- 569
+GACIQC T CR YHP+CA AG +EL+ + N +LS + C+R+
Sbjct: 240 QVHGACIQC--TKCRTTYHPMCASRAGYHMELQISKKKNVPSTRMLSYCAAHKLSCLRIN 297
Query: 570 ---LSFCKKHKQPLND---RLAVDERLVQVTRRCCDYIPPSNPSGCARSE------PYNY 617
LS + P D + + ++ T+ +P + C +E N
Sbjct: 298 FTSLSTFNSCRTPSPDAFLQYNITQKKPPATKFPLAPVP-TEEVPCVETERLGCLLAANE 356
Query: 618 FGR----------RGRKEPEALAAA------SLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
R R R P A A S K+ Y V GY +N L + + S
Sbjct: 357 LSRSLEISFEKFKRWRMLPAARCQAYNPDDTSHKKPTGAGIAYRVMGYTKNSL--DWISS 414
Query: 662 IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDM 721
+R L D +SM ++ +++T + R+ FGKSGIHG+G+FA+ + G+M
Sbjct: 415 LRDY-------LEPDHGEVMSMKERLSFLQKTEKTRVCFGKSGIHGWGLFARRYIKEGEM 467
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
V+EY GE +R S+AD RE L G Y+F+I +E VIDAT G+I LINHSCEPN
Sbjct: 468 VVEYRGERIRRSVADLREK--RYCLEGKHCYLFKISEEVVIDATENGNIGRLINHSCEPN 525
Query: 782 CYSRVISV--NGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
CY+R++SV G+ HI++ A++D+ EELTYDY+F D+++ C CG RCR +N
Sbjct: 526 CYARIVSVEGEGESHIVLIARKDVSVGEELTYDYQFDKEDKKVLCLCGSSRCRRFMN 582
>gi|145349623|ref|XP_001419228.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579459|gb|ABO97521.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 495
Score = 272 bits (695), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 175/474 (36%), Positives = 238/474 (50%), Gaps = 63/474 (13%)
Query: 396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPC 455
++KC VC E+ ++ +QCD+C ++VH CYG G WLC C G PP C
Sbjct: 49 VEKCDVCDSVREFDQDVLVQCDECMILVHMGCYGVTTAPTGGRWLCRACELGL-RTPPRC 107
Query: 456 CLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICG 515
LCP VGGAMKPT GRW H+ CA+W T EPI+G+N V + K C++C
Sbjct: 108 ALCPNVGGAMKPTLCGRWCHVVCALWAECTFAHPDGVAEPIEGVNMVPAESLKATCAVCE 167
Query: 516 VSYGACIQCSNTT-CRVAYHPLCARAA------------------------GLCVELEDE 550
SYGAC QC T C+ A+H CAR A G L +
Sbjct: 168 QSYGACAQCMGTKKCQKAFHVYCARDAECGYIAHSRTVAQLKQAGIRKFIVGYEQPLRNT 227
Query: 551 DRLNLLSLDEDDE---DQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPS 607
D L S + R S KK Q R VD R V+ D + P
Sbjct: 228 DTLLFPSCPACANWRGRKRKRRASTPKKRTQTPKTRPTVDSREVE------DKDEDAKPL 281
Query: 608 GCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGS 667
CA+ +P + R P+ ++ PYLV G + L +L ++ +
Sbjct: 282 QCAKFDPLGAYARALTVSPK------------DSIPYLVTGARTSRLESFSLRAVALA-- 327
Query: 668 KFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTG 727
D P L+ ++++ MK T RL GKS IHG+G+FAK H G+M+I+Y G
Sbjct: 328 --------DPPRNLN--ERFERMKATISDRLTLGKSYIHGYGLFAKRAHARGEMIIDYVG 377
Query: 728 ELVRPSIADRREHFIYNSLVGAGTYMFRI--DDERV-IDATRAGSIAHLINHSCEPNCYS 784
E+VRP +AD RE +Y++ G GTY+F + DD+ V +DAT AG++A+L NHSC PN +S
Sbjct: 378 EIVRPVVADIRERDVYDTCFGNGTYIFALGGDDQPVRLDATCAGNLANLANHSCAPNAHS 437
Query: 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDT 838
R + D HI +FA R+I+ EE+ Y+YR D+ L C CG CRGVVN T
Sbjct: 438 RQVYAANDNHICLFASRNIQPGEEILYEYR-LGADQTLRCNCGAANCRGVVNFT 490
>gi|302798240|ref|XP_002980880.1| hypothetical protein SELMODRAFT_178454 [Selaginella moellendorffii]
gi|300151419|gb|EFJ18065.1| hypothetical protein SELMODRAFT_178454 [Selaginella moellendorffii]
Length = 749
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 160/468 (34%), Positives = 238/468 (50%), Gaps = 32/468 (6%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y+PV W ++C+VC E+Y N + C++C++ VH CYG W+C
Sbjct: 301 LEAAYKPVVPIWTP-ERCAVCRWIEDYDVNKIIICNRCQLAVHEECYGVKASEISGSWVC 359
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPTT-DGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPTT G W H+ CA +I E D MEP G+
Sbjct: 360 RGCE--TPDAVRDCCLCPVKGGALKPTTIKGLWVHITCAWFIHEVTFKDEVAMEPAAGIT 417
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
R+ R++ C++C +G CIQC+ C+ YHP+CA AG +E++ +S +
Sbjct: 418 RIDLMRFREACAVCKQIHGVCIQCNK--CKTLYHPMCALRAGYHMEVQ-------ISYKK 468
Query: 561 DDEDQCIRLLSFCKKHKQPLND-----------RLAVDERLVQVTRRCCDYIPPSNPSGC 609
+ + R++++C HK P D L ER S P
Sbjct: 469 NGSFE-TRMITYCATHKAPKPDSYIQYTTDKGVSLGKKERFSNGQSSKSSSTFNSFPKKL 527
Query: 610 ARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKF 669
+ R K P A + ++ + Y G S ++ SI + F
Sbjct: 528 CDDLEKEFQNFRRSKNPAARCQVYGQNMYYKRSNRSPIAYRVMGYSHHSTESIDAL-RDF 586
Query: 670 SFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGEL 729
H D + M + +++T + R+ FGKSGIHG+G+FA+ G++V+EY GE
Sbjct: 587 PELEHED---LVDMKTRLSFLEKTEKSRVCFGKSGIHGWGLFARRTIEEGEIVVEYRGEQ 643
Query: 730 VRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV 789
VR S+AD RE + G Y+F+I +E VIDAT G+I LINHSC P+CY+R++ V
Sbjct: 644 VRRSVADLREKRYRDQ--GKDCYLFKISEEIVIDATEKGNIGRLINHSCSPSCYARILCV 701
Query: 790 NGDE-HIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+G+E I++ AKR++ EELTYDY+F D+++ C CG CR +N
Sbjct: 702 DGEESRIVLIAKRNVAAGEELTYDYQFEEEDKKVPCLCGSDSCRQYMN 749
>gi|359489550|ref|XP_002278335.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
vinifera]
Length = 1073
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 161/473 (34%), Positives = 235/473 (49%), Gaps = 42/473 (8%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PV W ++C++C E++ N + C++C++ VH CYG + W+C
Sbjct: 625 LKEKYEPVHAKW-TTERCAICRWVEDWDYNKMIICNRCQIAVHQECYGARNVKDFTSWVC 683
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPT +G W H+ CA + PE + ++MEP G+
Sbjct: 684 RACE--TPDAKRECCLCPVKGGALKPTDVEGLWVHVTCAWFRPEVAFLNDEKMEPAVGIL 741
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
R+ + +C IC ++G+C QC C +H +CA AG +EL E
Sbjct: 742 RIPSTSFLKVCVICKQTHGSCTQCCK--CATYFHAMCASRAGYSMELH---------CGE 790
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGR 620
+ Q + LS+C H+ P D V V R N R + +
Sbjct: 791 KNGRQITKKLSYCAVHRAPNAD-------TVLVVRTPSGVFSARNRQNQKRDQSFRGSRL 843
Query: 621 RGRKEPEALAAASLK------------RLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSK 668
+ PE + +L+ R+F + + G + L G S+ I
Sbjct: 844 VSSRRPELPVSLALETNELEPLSAGRCRVFKRSINNVGAGAIFHRLMGPRHHSLDAIDGL 903
Query: 669 FSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGE 728
+ D F S ++ H++ T R+ FGKSGIHG+G+FA+ + G+MVIEY GE
Sbjct: 904 SLYKELEDPQTFSSFKERLYHLQRTENHRVCFGKSGIHGWGLFARRSIQEGEMVIEYRGE 963
Query: 729 LVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVIS 788
VR S+AD RE L G Y+F+I +E VIDAT G+IA LINHSC PNCY+R++S
Sbjct: 964 QVRRSVADLRE--AKYRLEGKDCYLFKISEEVVIDATNKGNIARLINHSCFPNCYARIMS 1021
Query: 789 VNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
V +E I++ AK ++ +ELTYDY F DE ++ C CG P CR +N
Sbjct: 1022 VGDEESRIVLIAKINVSAGDELTYDY-LFDPDERDESKVPCLCGAPNCRKFMN 1073
>gi|449445828|ref|XP_004140674.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
gi|449487413|ref|XP_004157614.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Cucumis
sativus]
Length = 1055
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 164/476 (34%), Positives = 246/476 (51%), Gaps = 54/476 (11%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y P+ W ++C+VC E+++ N + C++C++ VH CYG + + W+C C
Sbjct: 609 YEPIYAKW-TTERCAVCRWVEDWEENKIIICNRCQVAVHQECYGAKDIHDFTSWVCRACE 667
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT +G W H+ CA + PE + ++MEP G+ R+
Sbjct: 668 --TPDTSRECCLCPVKGGALKPTDAEGLWVHVTCAWFRPEVVFLNHEKMEPAVGIYRIPS 725
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ + C IC S+G+C QC C +H +CA AG +EL+ E+
Sbjct: 726 NSFLKKCVICKQSHGSCTQCCK--CATYFHTMCASRAGYFMELQ---------CSEEKGR 774
Query: 565 QCIRLLSFCKKHKQPLNDRLAV---------DERLVQVTRRC-------CDYIPPSNPSG 608
Q R L +C H+ P D + V L+Q + C I + S
Sbjct: 775 QITRKLIYCAVHRAPNPDAVVVVRSPSGVFSGRNLLQKQKGCYRGSRLVTSKIEEQSKSS 834
Query: 609 CARSEPYN-YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGS 667
+ + + Y R R A ++ KR+ + QP + N P +I
Sbjct: 835 ASETNDFEPYSAARCR----AYVRSNDKRVEGQRQPIF-----HRLMGPNHHPLDEII-- 883
Query: 668 KFSFSLHR---DAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIE 724
S S R D +F S ++ ++++ T + R+ FGKSGIHG+G+FA+ + G+MV+E
Sbjct: 884 --SLSTRREGADPKSFSSFKERLQYLQRTEKDRVCFGKSGIHGWGLFARRNVQEGEMVVE 941
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y GE VR S+AD RE L G Y+F+I +E VIDAT G+IA LINHSC PNCY+
Sbjct: 942 YRGEQVRRSVADLRE--ARYQLEGKDCYLFKISEEVVIDATEKGNIARLINHSCMPNCYA 999
Query: 785 RVISVNGDE-HIIIFAKRDIKQWEELTYDYRF--FSIDE-QLACYCGFPRCRGVVN 836
R++SV +E I++ AK ++ EELTYDY F +DE ++ C+C P CR +N
Sbjct: 1000 RIMSVGDNESRIVLIAKTNVAAGEELTYDYLFDPDELDELKVPCHCNAPNCRKFMN 1055
>gi|302755933|ref|XP_002961390.1| hypothetical protein SELMODRAFT_164635 [Selaginella moellendorffii]
gi|300170049|gb|EFJ36650.1| hypothetical protein SELMODRAFT_164635 [Selaginella moellendorffii]
Length = 749
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 160/468 (34%), Positives = 238/468 (50%), Gaps = 32/468 (6%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y+PV W ++C+VC E+Y N + C++C++ VH CYG W+C
Sbjct: 301 LEAAYKPVVPIWTP-ERCAVCRWIEDYDVNKIIICNRCQLAVHEECYGVKASEISGSWVC 359
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPTT-DGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPTT G W H+ CA +I E D MEP G+
Sbjct: 360 RGCE--TPDAVRDCCLCPVKGGALKPTTIKGLWVHITCAWFIHEVTFKDEVAMEPAAGIT 417
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
R+ R++ C++C +G CIQC+ C+ YHP+CA AG +E++ +S +
Sbjct: 418 RIDLMRFREACAVCKQIHGVCIQCNK--CKTLYHPMCALRAGYHMEVQ-------ISYKK 468
Query: 561 DDEDQCIRLLSFCKKHKQPLND-----------RLAVDERLVQVTRRCCDYIPPSNPSGC 609
+ + R++++C HK P D L ER S P
Sbjct: 469 NGSFE-TRMITYCATHKAPKPDSYIQYTTDKGVSLGKKERFSNGQSSKSSSTFNSFPKKL 527
Query: 610 ARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKF 669
+ R K P A + ++ + Y G S ++ SI + F
Sbjct: 528 CDDLEKEFQNFRRSKNPAARCQVYGQNMYYKRSNRSPIAYRVMGYSHHSTESIDAL-RDF 586
Query: 670 SFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGEL 729
H D + M + +++T + R+ FGKSGIHG+G+FA+ G++V+EY GE
Sbjct: 587 PELEHED---LVDMKTRLSFLEKTEKSRVCFGKSGIHGWGLFARRTIEEGEIVVEYRGEQ 643
Query: 730 VRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV 789
VR S+AD RE + G Y+F+I +E VIDAT G+I LINHSC P+CY+R++ V
Sbjct: 644 VRRSVADLREKRYRDQ--GKDCYLFKISEEIVIDATEKGNIGRLINHSCSPSCYARILCV 701
Query: 790 NGDE-HIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+G+E I++ AKR++ EELTYDY+F D+++ C CG CR +N
Sbjct: 702 DGEESRIVLIAKRNVAAGEELTYDYQFEEEDKKVPCLCGSDSCRQYMN 749
>gi|449455758|ref|XP_004145618.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Cucumis
sativus]
Length = 1073
Score = 258 bits (660), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 167/494 (33%), Positives = 255/494 (51%), Gaps = 53/494 (10%)
Query: 365 RPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVH 424
RP+ K KL + ++ Y PV W ++C+VC E++ N + C++C++ VH
Sbjct: 611 RPSMKERRQKLLTFLQE---KYEPVYAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVH 666
Query: 425 ARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIP 483
CYG + W+C +C P+ CCLCPV GGA+KPT D W H+ CA + P
Sbjct: 667 QECYGARNVRDITSWVCKVCE--TPDVKRECCLCPVKGGALKPTDVDTLWVHVTCAWFRP 724
Query: 484 ETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
E ++MEP G+ + + + +C IC +G+C+QC C YH +CA AG
Sbjct: 725 EVSFASDEKMEPALGILSIPSNSFVKICVICKQIHGSCMQCCK--CSTYYHAMCASRAGY 782
Query: 544 CVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQVT 594
C+EL E + Q +++S+C H+ P D + + + L+Q
Sbjct: 783 CMELH---------CLEKNGRQITKMVSYCAYHRAPNPDTVLIIQTPLGVFSTKSLLQNK 833
Query: 595 RRCCDYIPPSN------PSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQP-YLVG 647
+R + SN S + EP++ + K + S+K+ VE + V
Sbjct: 834 KRAGSRLISSNRKEIEEVSEASELEPFSAARCQVYKR-----STSVKKRTVEGAVIHKVM 888
Query: 648 GYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHG 707
G C + L +R + + F+ + + F S D+ H++ T R+ FG+SGIHG
Sbjct: 889 GPCHHPLK-----ELRNLNT-FNLPMVEEPKIFSSFRDRLYHLQRTENDRVCFGRSGIHG 942
Query: 708 FGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRA 767
+G+FA+ + G+MV+EY GE VR ++AD RE L G Y+F+I +E V+DAT
Sbjct: 943 WGLFARRNIQEGEMVLEYRGEQVRRTVADLRE--ARYRLAGKDCYLFKISEEVVVDATDK 1000
Query: 768 GSIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QL 822
G+IA LINHSC PNCY+R++SV DE I++ AK ++ EELTYDY F DE ++
Sbjct: 1001 GNIARLINHSCMPNCYARIMSVGDDESRIVLIAKANVPAGEELTYDY-LFDPDEPDEFKV 1059
Query: 823 ACYCGFPRCRGVVN 836
C C P CR +N
Sbjct: 1060 PCLCKAPNCRKFMN 1073
>gi|297745446|emb|CBI40526.3| unnamed protein product [Vitis vinifera]
Length = 1003
Score = 258 bits (660), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 160/469 (34%), Positives = 234/469 (49%), Gaps = 42/469 (8%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV W ++C++C E++ N + C++C++ VH CYG + W+C C
Sbjct: 559 YEPVHAKW-TTERCAICRWVEDWDYNKMIICNRCQIAVHQECYGARNVKDFTSWVCRACE 617
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT +G W H+ CA + PE + ++MEP G+ R+
Sbjct: 618 --TPDAKRECCLCPVKGGALKPTDVEGLWVHVTCAWFRPEVAFLNDEKMEPAVGILRIPS 675
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ +C IC ++G+C QC C +H +CA AG +EL E +
Sbjct: 676 TSFLKVCVICKQTHGSCTQCCK--CATYFHAMCASRAGYSMELH---------CGEKNGR 724
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRK 624
Q + LS+C H+ P D V V R N R + + +
Sbjct: 725 QITKKLSYCAVHRAPNAD-------TVLVVRTPSGVFSARNRQNQKRDQSFRGSRLVSSR 777
Query: 625 EPEALAAASLK------------RLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFS 672
PE + +L+ R+F + + G + L G S+ I +
Sbjct: 778 RPELPVSLALETNELEPLSAGRCRVFKRSINNVGAGAIFHRLMGPRHHSLDAIDGLSLYK 837
Query: 673 LHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRP 732
D F S ++ H++ T R+ FGKSGIHG+G+FA+ + G+MVIEY GE VR
Sbjct: 838 ELEDPQTFSSFKERLYHLQRTENHRVCFGKSGIHGWGLFARRSIQEGEMVIEYRGEQVRR 897
Query: 733 SIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGD 792
S+AD RE L G Y+F+I +E VIDAT G+IA LINHSC PNCY+R++SV +
Sbjct: 898 SVADLRE--AKYRLEGKDCYLFKISEEVVIDATNKGNIARLINHSCFPNCYARIMSVGDE 955
Query: 793 E-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
E I++ AK ++ +ELTYDY F DE ++ C CG P CR +N
Sbjct: 956 ESRIVLIAKINVSAGDELTYDY-LFDPDERDESKVPCLCGAPNCRKFMN 1003
>gi|449528994|ref|XP_004171486.1| PREDICTED: histone-lysine N-methyltransferase ATX5-like, partial
[Cucumis sativus]
Length = 588
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 167/494 (33%), Positives = 255/494 (51%), Gaps = 53/494 (10%)
Query: 365 RPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVH 424
RP+ K KL + ++ Y PV W ++C+VC E++ N + C++C++ VH
Sbjct: 126 RPSMKERRQKLLTFLQE---KYEPVYAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVH 181
Query: 425 ARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIP 483
CYG + W+C +C P+ CCLCPV GGA+KPT D W H+ CA + P
Sbjct: 182 QECYGARNVRDITSWVCKVCE--TPDVKRECCLCPVKGGALKPTDVDTLWVHVTCAWFRP 239
Query: 484 ETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
E ++MEP G+ + + + +C IC +G+C+QC C YH +CA AG
Sbjct: 240 EVSFASDEKMEPALGILSIPSNSFVKICVICKQIHGSCMQCCK--CSTYYHAMCASRAGY 297
Query: 544 CVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQVT 594
C+EL E + Q +++S+C H+ P D + + + L+Q
Sbjct: 298 CMELH---------CLEKNGRQITKMVSYCAYHRAPNPDTVLIIQTPLGVFSTKSLLQNK 348
Query: 595 RRCCDYIPPSN------PSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQP-YLVG 647
+R + SN S + EP++ + K + S+K+ VE + V
Sbjct: 349 KRAGSRLISSNRKEIEEVSEASELEPFSAARCQVYKR-----STSVKKRTVEGAVIHKVM 403
Query: 648 GYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHG 707
G C + L +R + + F+ + + F S D+ H++ T R+ FG+SGIHG
Sbjct: 404 GPCHHPLK-----ELRNLNT-FNLPMVEEPKIFSSFRDRLYHLQRTENDRVCFGRSGIHG 457
Query: 708 FGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRA 767
+G+FA+ + G+MV+EY GE VR ++AD RE L G Y+F+I +E V+DAT
Sbjct: 458 WGLFARRNIQEGEMVLEYRGEQVRRTVADLRE--ARYRLAGKDCYLFKISEEVVVDATDK 515
Query: 768 GSIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QL 822
G+IA LINHSC PNCY+R++SV DE I++ AK ++ EELTYDY F DE ++
Sbjct: 516 GNIARLINHSCMPNCYARIMSVGDDESRIVLIAKANVPAGEELTYDY-LFDPDEPDEFKV 574
Query: 823 ACYCGFPRCRGVVN 836
C C P CR +N
Sbjct: 575 PCLCKAPNCRKFMN 588
>gi|356573885|ref|XP_003555086.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
Length = 1003
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 161/494 (32%), Positives = 247/494 (50%), Gaps = 39/494 (7%)
Query: 358 ILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCD 417
++ + +P+ K KL + ++ Y PV W ++C+VC E++ N + C+
Sbjct: 534 VVSVKPKKPSFKERKQKLLTFLQE---KYEPVCAKW-TTERCAVCRWVEDWDYNKIIICN 589
Query: 418 KCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHL 476
+C++ VH CYG + W+C C P+ CCLCPV GGA+KPT D W H+
Sbjct: 590 RCQIAVHQECYGARNVQDFTSWVCKACE--TPDIKQECCLCPVKGGALKPTDVDTLWVHV 647
Query: 477 ACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPL 536
CA + PE ++MEP G+ + + + +C IC +G+C QC C +H +
Sbjct: 648 TCAWFRPEVSFASDEKMEPALGILSIPLNSFVKICVICKEIHGSCTQCCK--CSTYFHAM 705
Query: 537 CARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------D 587
CA AG +EL E + Q R++S+C H+ P D +++
Sbjct: 706 CASRAGYRMELH---------CMEKNGKQTTRMVSYCAYHRAPNPDTVSIMQTPLGVIST 756
Query: 588 ERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVG 647
+ L+Q R+ + S + P R EP + A + R + G
Sbjct: 757 KSLLQTKRKAGSRLISSKRIKVEDTSP----AENTRHEPFSAARCRIYRRTNHTKKRAAG 812
Query: 648 GYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHG 707
+ + G+ + I S + + P F S ++ H++ T +R+ FG+SGIHG
Sbjct: 813 EAIAHHVRGHYHHPLDAIQSLNADRMVDKPPAFSSFRERLHHLQRTENERVCFGRSGIHG 872
Query: 708 FGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRA 767
+G+FA+ + G+MV+EY GE VR SIAD RE L G Y+F+I +E V+DAT
Sbjct: 873 WGLFARQNIQEGEMVLEYRGEQVRRSIADLRE--ARYRLEGKDCYLFKISEEVVVDATDK 930
Query: 768 GSIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QL 822
G+IA LINHSC PNCY+R++SV DE I++ AK D+ +ELTYDY F DE ++
Sbjct: 931 GNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTDVSTGDELTYDY-LFDPDEPDEFKV 989
Query: 823 ACYCGFPRCRGVVN 836
C C CR +N
Sbjct: 990 PCLCKASNCRKFMN 1003
>gi|30696333|ref|NP_200155.2| histone-lysine N-methyltransferase ATX5 [Arabidopsis thaliana]
gi|75244456|sp|Q8GZ42.1|ATX5_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX5; AltName:
Full=Protein SET DOMAIN GROUP 29; AltName:
Full=Trithorax-homolog protein 5; Short=TRX-homolog
protein 5
gi|26449542|dbj|BAC41897.1| putative trithorax 5 TX5 [Arabidopsis thaliana]
gi|29029074|gb|AAO64916.1| At5g53430 [Arabidopsis thaliana]
gi|332008970|gb|AED96353.1| histone-lysine N-methyltransferase ATX5 [Arabidopsis thaliana]
Length = 1043
Score = 255 bits (652), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 167/493 (33%), Positives = 247/493 (50%), Gaps = 50/493 (10%)
Query: 365 RPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVH 424
RP+ K +L S R+ Y PV V W ++C+VC E++ N + C++C++ VH
Sbjct: 580 RPSIKQRKQRLLSFLRE---KYEPVNVKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVH 635
Query: 425 ARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIP 483
CYG + W+C C PE CCLCPV GGA+KPT + W H+ CA + P
Sbjct: 636 QECYGTRNVRDFTSWVCKACE--TPEIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQP 693
Query: 484 ETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
E C ++MEP G+ + + +C IC +G+C QC C YH +CA AG
Sbjct: 694 EVCFASEEKMEPALGILSIPSSNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGY 751
Query: 544 CVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQVT 594
R+ L L+++ Q +++S+C H+ P D + + + LVQ
Sbjct: 752 --------RMELHCLEKNGR-QITKMVSYCSYHRAPNPDTVLIIQTPSGVFSAKSLVQNK 802
Query: 595 RRCCDYIPPSN-----PSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGY 649
++ + +N S + P + F R S KR E P+ GG
Sbjct: 803 KKSGTRLILANREEIEESAAEDTIPIDPFSS-ARCRLYKRTVNSKKRTKEEGIPHYTGGL 861
Query: 650 CQNGLSG-NTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGF 708
+ + TL + R + + +F S ++ H++ T +R+ FG+SGIHG+
Sbjct: 862 RHHPSAAIQTLNAFRHVAE--------EPKSFSSFRERLHHLQRTEMERVCFGRSGIHGW 913
Query: 709 GIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAG 768
G+FA+ + G+MV+EY GE VR IAD RE G Y+F+I +E V+DAT G
Sbjct: 914 GLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRRE--GKDCYLFKISEEVVVDATEKG 971
Query: 769 SIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLA 823
+IA LINHSC PNCY+R++SV DE I++ AK + EELTYDY F DE ++
Sbjct: 972 NIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDY-LFDPDEPDEFKVP 1030
Query: 824 CYCGFPRCRGVVN 836
C C P CR +N
Sbjct: 1031 CLCKSPNCRKFMN 1043
>gi|8843772|dbj|BAA97320.1| unnamed protein product [Arabidopsis thaliana]
Length = 1040
Score = 255 bits (651), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 167/493 (33%), Positives = 247/493 (50%), Gaps = 50/493 (10%)
Query: 365 RPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVH 424
RP+ K +L S R+ Y PV V W ++C+VC E++ N + C++C++ VH
Sbjct: 577 RPSIKQRKQRLLSFLRE---KYEPVNVKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVH 632
Query: 425 ARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIP 483
CYG + W+C C PE CCLCPV GGA+KPT + W H+ CA + P
Sbjct: 633 QECYGTRNVRDFTSWVCKACE--TPEIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQP 690
Query: 484 ETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
E C ++MEP G+ + + +C IC +G+C QC C YH +CA AG
Sbjct: 691 EVCFASEEKMEPALGILSIPSSNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGY 748
Query: 544 CVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQVT 594
R+ L L+++ Q +++S+C H+ P D + + + LVQ
Sbjct: 749 --------RMELHCLEKNGR-QITKMVSYCSYHRAPNPDTVLIIQTPSGVFSAKSLVQNK 799
Query: 595 RRCCDYIPPSN-----PSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGY 649
++ + +N S + P + F R S KR E P+ GG
Sbjct: 800 KKSGTRLILANREEIEESAAEDTIPIDPFSS-ARCRLYKRTVNSKKRTKEEGIPHYTGGL 858
Query: 650 CQNGLSG-NTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGF 708
+ + TL + R + + +F S ++ H++ T +R+ FG+SGIHG+
Sbjct: 859 RHHPSAAIQTLNAFRHVAE--------EPKSFSSFRERLHHLQRTEMERVCFGRSGIHGW 910
Query: 709 GIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAG 768
G+FA+ + G+MV+EY GE VR IAD RE G Y+F+I +E V+DAT G
Sbjct: 911 GLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRRE--GKDCYLFKISEEVVVDATEKG 968
Query: 769 SIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLA 823
+IA LINHSC PNCY+R++SV DE I++ AK + EELTYDY F DE ++
Sbjct: 969 NIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASCEELTYDY-LFDPDEPDEFKVP 1027
Query: 824 CYCGFPRCRGVVN 836
C C P CR +N
Sbjct: 1028 CLCKSPNCRKFMN 1040
>gi|302144034|emb|CBI23139.3| unnamed protein product [Vitis vinifera]
Length = 1018
Score = 255 bits (651), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 159/470 (33%), Positives = 238/470 (50%), Gaps = 36/470 (7%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C
Sbjct: 570 LQEKYEPVHARW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARNVRDFTSWVC 628
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPT + W H+ CA + PE + ++MEP G+
Sbjct: 629 RACE--TPDVERECCLCPVKGGALKPTDIETLWVHVTCAWFQPEVSFSSDEKMEPAVGIL 686
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ + + +C IC +G+C QC C YH +CA AG R+ L SL +
Sbjct: 687 SIPSNSFIKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGY--------RMELHSLVK 736
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQVTRRCCDYIPPSNPSGCAR 611
+ Q +++S+C H+ P D + + + L+Q ++ + SN +
Sbjct: 737 NGR-QITKMVSYCAYHRAPNPDTVLIIQTPLGVFSTKSLIQNKKKSGSRLISSNRIELQQ 795
Query: 612 SEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSF 671
EP + A + R N V + + G S+ I S F
Sbjct: 796 IPTVET----DEFEPFSAARCRIFRRSKSNTKRTVEEAIAHQVKGPFHHSLSAIESLNIF 851
Query: 672 SLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVR 731
+ NF + ++ H++ T R+ FG+SGIHG+G+FA+ + GDMV+EY GE VR
Sbjct: 852 REVEEPKNFSTFRERLYHLQRTENDRVCFGRSGIHGWGLFARQAIQEGDMVLEYRGEQVR 911
Query: 732 PSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG 791
SIAD RE + L G Y+F+I +E V+DAT G+IA LINHSC PNCY+R++SV
Sbjct: 912 RSIADMRE--VRYRLEGKDCYLFKISEEVVVDATDKGNIARLINHSCAPNCYARIMSVGD 969
Query: 792 DE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
DE I++ AK ++ +ELTYDY F DE ++ C C P CR +N
Sbjct: 970 DESRIVLIAKTNVAAGDELTYDY-LFDPDEPDECKVPCLCKAPNCRKFMN 1018
>gi|297792715|ref|XP_002864242.1| hypothetical protein ARALYDRAFT_918421 [Arabidopsis lyrata subsp.
lyrata]
gi|297310077|gb|EFH40501.1| hypothetical protein ARALYDRAFT_918421 [Arabidopsis lyrata subsp.
lyrata]
Length = 1049
Score = 253 bits (646), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 167/493 (33%), Positives = 246/493 (49%), Gaps = 50/493 (10%)
Query: 365 RPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVH 424
RP+ K +L S R+ Y PV V W ++C+VC E++ N + C++C++ VH
Sbjct: 586 RPSIKQRKQRLLSFLRE---KYEPVNVKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVH 641
Query: 425 ARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIP 483
CYG + W+C C PE CCLCPV GGA+KPT + W H+ CA + P
Sbjct: 642 QECYGTRNVRDFTSWVCKACE--TPEIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQP 699
Query: 484 ETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
E C ++MEP G+ + + +C IC +G+C QC C YH +CA AG
Sbjct: 700 EVCFASEEKMEPALGILSIPSSNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGY 757
Query: 544 CVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQVT 594
R+ L L+++ Q +++S+C H+ P D + + + LVQ
Sbjct: 758 --------RMELHCLEKNGR-QITKMVSYCSYHRAPNPDTVLIIQTPSGVFSAKSLVQNK 808
Query: 595 RRCCDYIPPSN-----PSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGY 649
++ + +N S + P + F R S KR E P+ GG
Sbjct: 809 KKTGSRLILANREEVEESAAEDTIPIDPFSS-ARCRLYKRTVNSKKRTKEEGIPHHKGGP 867
Query: 650 CQNGLSG-NTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGF 708
+ + TL + R + + +F S ++ H++ T R+ FG+SGIHG+
Sbjct: 868 RHHPSAAIQTLNAFRHVAE--------EPKSFSSFRERLHHLQRTEMDRVCFGRSGIHGW 919
Query: 709 GIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAG 768
G+FA+ + G+MV+EY GE VR IAD RE G Y+F+I +E V+DAT G
Sbjct: 920 GLFARRNIQEGEMVLEYRGEQVRGIIADLREARYRRE--GKDCYLFKISEEVVVDATEKG 977
Query: 769 SIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLA 823
+IA LINHSC PNCY+R++SV DE I++ AK + EELTYDY F DE ++
Sbjct: 978 NIARLINHSCMPNCYARIMSVGDDESRIVLIAKTTVASGEELTYDY-LFDPDEPDEFKVP 1036
Query: 824 CYCGFPRCRGVVN 836
C C P CR +N
Sbjct: 1037 CLCKSPNCRKFMN 1049
>gi|357127667|ref|XP_003565500.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like
[Brachypodium distachyon]
Length = 972
Score = 251 bits (640), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 156/469 (33%), Positives = 234/469 (49%), Gaps = 50/469 (10%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C C
Sbjct: 536 YSPVYAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARVVQDFTNWVCRACE 594
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT D W H+ CA + P+ + MEP G+ +
Sbjct: 595 --VPQQKRQCCLCPVKGGALKPTEIDQLWVHITCAWYQPKVSFPFEETMEPATGILNIPS 652
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ +K C IC +GAC QC C YH +CA AG C+EL+ +R +
Sbjct: 653 EYFKRTCVICKQMHGACTQCYK--CSTFYHTMCASRAGYCMELQYTER---------NGS 701
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYF------ 618
Q +++S+C H P D + + + P G ++ N
Sbjct: 702 QVTKMVSYCAFHSTPNPDNVLIVK----------------TPEGVFSTKLLNQNNEKQSP 745
Query: 619 GRRGRKEPEALAAASLKRLFVENQPYLV-------GGYCQNGLSGNTLPSIRVIGSKFSF 671
R ++ +A A + V PY + G + ++G S +I S+
Sbjct: 746 ARLDKENHQAFPAKNSDCPAVRCLPYEMLKNKKEQGEAIPHRITGPQHHSQDLIEGLNSY 805
Query: 672 SLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVR 731
F + ++ +++T KR++ G+SG+HG+G+FA + G MVIEY G+ VR
Sbjct: 806 MDQNGDQTFSTFKERLHFLQKTENKRVSCGRSGVHGWGLFAARKIQEGQMVIEYRGDQVR 865
Query: 732 PSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG 791
S+AD RE + Y+F+I ++ VIDAT G+IA LINHSC PNCY+R++SV
Sbjct: 866 QSVADLREARYHRE--NKDCYLFKISEDVVIDATERGNIARLINHSCMPNCYARIVSVGD 923
Query: 792 DE-HIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
D+ II+ AKRD+ EELTYDY+F S D ++ C C P CRG +N
Sbjct: 924 DKSQIILIAKRDVSAGEELTYDYKFDRDESEDRKVLCLCKAPNCRGYMN 972
>gi|359490859|ref|XP_002268525.2| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Vitis
vinifera]
Length = 1094
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 161/474 (33%), Positives = 244/474 (51%), Gaps = 46/474 (9%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C C
Sbjct: 644 YEPVHARWT-TERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARNVRDFTSWVCRACE 702
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT + W H+ CA + PE + ++MEP G+ +
Sbjct: 703 --TPDVERECCLCPVKGGALKPTDIETLWVHVTCAWFQPEVSFSSDEKMEPAVGILSIPS 760
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ + +C IC +G+C QC C YH +CA AG R+ L SL ++
Sbjct: 761 NSFIKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGY--------RMELHSLVKNGR- 809
Query: 565 QCIRLLSFCKKHKQPLNDRLAV---------DERLVQVTRRCCDYIPPSN-------PSG 608
Q +++S+C H+ P D + + + L+Q ++ + SN P+
Sbjct: 810 QITKMVSYCAYHRAPNPDTVLIIQTPLGVFSTKSLIQNKKKSGSRLISSNRIELQQIPT- 868
Query: 609 CARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSG-NTLPSIRVIGS 667
++ + F R + ++ KR E + V G + LS +L R +
Sbjct: 869 -VETDEFEPFSA-ARCRIFRRSKSNTKRTVEEAIAHQVKGPFHHSLSAIESLNIFREVEE 926
Query: 668 KFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTG 727
+FS R+ L + H++ T R+ FG+SGIHG+G+FA+ + GDMV+EY G
Sbjct: 927 PKNFSTFRERLYHLQVNF---HLQRTENDRVCFGRSGIHGWGLFARQAIQEGDMVLEYRG 983
Query: 728 ELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVI 787
E VR SIAD RE + L G Y+F+I +E V+DAT G+IA LINHSC PNCY+R++
Sbjct: 984 EQVRRSIADMRE--VRYRLEGKDCYLFKISEEVVVDATDKGNIARLINHSCAPNCYARIM 1041
Query: 788 SVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
SV DE I++ AK ++ +ELTYDY F DE ++ C C P CR +N
Sbjct: 1042 SVGDDESRIVLIAKTNVAAGDELTYDY-LFDPDEPDECKVPCLCKAPNCRKFMN 1094
>gi|356518627|ref|XP_003527980.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
Length = 1067
Score = 249 bits (635), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 162/482 (33%), Positives = 239/482 (49%), Gaps = 69/482 (14%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C C
Sbjct: 624 YEPVHAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARNVRDFTSWVCKACE 682
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT D W H+ CA + PE ++MEP G+ +
Sbjct: 683 --TPDIKRECCLCPVKGGALKPTDVDTLWVHVTCAWFRPEVSFASDEKMEPALGILSIPS 740
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ + +C IC +G+C QC C +H +CA AG R+ L L+++ +
Sbjct: 741 NSFVKICVICKQIHGSCTQCCK--CSTYFHAMCASRAGY--------RMELHCLEKNGK- 789
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDE--------------------RLVQVTRRCCDYIPPS 604
Q +++S+C H+ P D + + + RL+ RR D P
Sbjct: 790 QTTKMVSYCAYHRAPNPDTVLIMQTPLGVISTKSLLQTKKKSGSRLISSNRRKQDDTPVD 849
Query: 605 NPSGCARSEPYN-----YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
N EP++ F R + A A R + G Y + +L
Sbjct: 850 N----TEHEPFSAARCRIFQRTNHTKKRAADEAVSHR--------VRGPYHHPLDAIESL 897
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
+ RV+ +FS S ++ H++ T R+ FG+SGIHG+G+FA+ + G
Sbjct: 898 NTHRVVHEPQAFS---------SFRERLYHLQRTENDRVCFGRSGIHGWGLFARRNIQEG 948
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
DMV+EY GE VR SIAD RE L G Y+F+I +E V+DAT G+IA LINHSC
Sbjct: 949 DMVLEYRGEQVRRSIADLRE--ARYRLEGKDCYLFKISEEVVVDATDKGNIARLINHSCM 1006
Query: 780 PNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGV 834
PNCY+R++SV DE I++ AK ++ +ELTYDY F DE ++ C C P CR
Sbjct: 1007 PNCYARIMSVGDDESRIVLIAKTNVVAGDELTYDY-LFDPDEPEENKVPCLCKAPNCRKY 1065
Query: 835 VN 836
+N
Sbjct: 1066 MN 1067
>gi|145332921|ref|NP_001078326.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
gi|332646730|gb|AEE80251.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
Length = 982
Score = 249 bits (635), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 161/487 (33%), Positives = 245/487 (50%), Gaps = 69/487 (14%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PVR W ++C+VC E+++ N + C++C++ VH CYG + + W+C
Sbjct: 533 LEEKYEPVRAKWT-TERCAVCRWVEDWEENKMIICNRCQVAVHQECYGVSKSQDLTSWVC 591
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KP+ +G W H+ CA + PE + + MEP GL
Sbjct: 592 RACE--TPDIERDCCLCPVKGGALKPSDVEGLWVHVTCAWFRPEVGFLNHENMEPAVGLF 649
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
++ + + +C+IC ++G+C+ C C +H +CA AG +EL E
Sbjct: 650 KIPANSFLKVCTICKQTHGSCVHCCK--CATHFHAMCASRAGYNMELH---------CLE 698
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGC--ARSEPYNYF 618
+ Q R +C H++P D + V PSG +R+ N +
Sbjct: 699 KNGVQRTRKSVYCSFHRKPDPDSVVV----------------VHTPSGVFGSRNLLQNQY 742
Query: 619 GR---------RGRKEP--------------EALAAASLKRLFVENQPYLVGGYCQNGLS 655
GR + K P ++L+AA + N + + L
Sbjct: 743 GRAKGSRLVLTKKMKLPGFQTQTQAEQSRVFDSLSAARCRIYSRSNTKKIDLEAISHRLK 802
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
G + S+ I + SF +F S ++ KH++ T R+ FGKSGIHG+G+FA+
Sbjct: 803 GPSHHSLSAIENLNSFK----EADFTSFRERLKHLQRTENFRVCFGKSGIHGWGLFARKS 858
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
+ G+M+IEY G VR S+AD RE + G Y+F+I +E VIDAT +G+IA LIN
Sbjct: 859 IQEGEMIIEYRGVKVRRSVADLREANYRSQ--GKDCYLFKISEEIVIDATDSGNIARLIN 916
Query: 776 HSCEPNCYSRVISVNG--DEHIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFP 829
HSC PNCY+R++S+ D I++ AK ++ EELTYDY F +DE ++ C C P
Sbjct: 917 HSCMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDY-LFEVDESEEIKVPCLCKAP 975
Query: 830 RCRGVVN 836
CR +N
Sbjct: 976 NCRKFMN 982
>gi|255544976|ref|XP_002513549.1| trithorax, putative [Ricinus communis]
gi|223547457|gb|EEF48952.1| trithorax, putative [Ricinus communis]
Length = 1018
Score = 249 bits (635), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 162/502 (32%), Positives = 254/502 (50%), Gaps = 39/502 (7%)
Query: 353 EVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGG---YRPVRVDWKDLDKCSVCHMDEEYQ 409
++ +L ++P ++S+ + K G Y PV W ++C+VC E++
Sbjct: 538 QLAELHARAVSTKPPKRASIKERKQKLLAFLQGIKKYEPVYAKW-TTERCAVCRWVEDWD 596
Query: 410 NNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT- 468
N + C++C++ VH CYG + W+C C P+ CCLCPV GGA+KPT
Sbjct: 597 YNKIIICNRCQIAVHQECYGARNVQDFTSWVCKACE--TPDVKRECCLCPVKGGALKPTD 654
Query: 469 TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTT 528
+ W H+ CA + PE ++MEP G+ + + + +C IC +G+C QCS
Sbjct: 655 VETLWVHVTCAWFQPEVSFASDEKMEPALGILSIPSNSFVKICVICKQIHGSCTQCSK-- 712
Query: 529 CRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV-- 586
C YH +CA AG R+ L L+++ Q +++S+C H+ P D + +
Sbjct: 713 CSTYYHAMCASRAGY--------RMELHCLEKNGR-QTTKMVSYCAYHRAPNPDTVLIIQ 763
Query: 587 -------DERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFV 639
+ LVQ +R + S+ + E + + A KR+
Sbjct: 764 TPVGVFSAKSLVQNKKRAGTRLISSSR---VKLEELSTEETTEAEPLSAARCRVFKRV-S 819
Query: 640 ENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLA 699
N+ + L+G + +I S +F + + +F S ++ H++ T R+
Sbjct: 820 NNKKRTEEEAISHRLTGPCNHPLGIIQSLNAFRVVEEPKSFSSFRERLYHLQRTENDRVC 879
Query: 700 FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDE 759
FG+SGIHG+G+FA+ + G+MV+EY GE VR SIAD RE L G Y+F+I +E
Sbjct: 880 FGRSGIHGWGLFARRNIQEGEMVLEYRGEQVRRSIADLRE--ARYRLEGKDCYLFKISEE 937
Query: 760 RVIDATRAGSIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSI 818
V+DAT G+IA LINHSC PNCY+R++SV DE I++ AK ++ +ELTYDY F
Sbjct: 938 VVVDATDKGNIARLINHSCMPNCYARIMSVGDDESRIVLIAKTNVSAGDELTYDY-LFDP 996
Query: 819 DE----QLACYCGFPRCRGVVN 836
DE ++ C C P CR +N
Sbjct: 997 DEPDEFKVPCLCKAPNCRQFMN 1018
>gi|42567196|ref|NP_194520.3| histone-lysine N-methyltransferase ATX4 [Arabidopsis thaliana]
gi|229488104|sp|Q9SUE7.3|ATX4_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX4; AltName:
Full=Protein SET DOMAIN GROUP 16; AltName:
Full=Trithorax-homolog protein 4; Short=TRX-homolog
protein 4; Short=Trithorax 4
gi|332660008|gb|AEE85408.1| histone-lysine N-methyltransferase ATX4 [Arabidopsis thaliana]
Length = 1027
Score = 249 bits (635), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 162/482 (33%), Positives = 242/482 (50%), Gaps = 60/482 (12%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C
Sbjct: 579 LSETYEPVNAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARHVRDFTSWVC 637
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPT + W H+ CA + PE C ++MEP G+
Sbjct: 638 KACE--RPDIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQPEVCFASEEKMEPAVGIL 695
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ + +C IC +G+C QC C YH +CA AG R+ L L++
Sbjct: 696 SIPSTNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGY--------RMELHCLEK 745
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAVDE--------------------RLVQVTRRCCDY 600
+ + Q +++S+C H+ P D + + + RL+ + R D
Sbjct: 746 NGQ-QITKMVSYCAYHRAPNPDNVLIIQTPSGAFSAKSLVQNKKKGGSRLISLIRED-DE 803
Query: 601 IPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSG-NTL 659
P N C +P++ R K S KR+ E P+ G + + TL
Sbjct: 804 APAENTITC---DPFSAARCRVFKR----KINSKKRIEEEAIPHHTRGPRHHASAAIQTL 856
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
+ R + + +F S ++ H++ T R+ FG+SGIHG+G+FA+ + G
Sbjct: 857 NTFRHVPE--------EPKSFSSFRERLHHLQRTEMDRVCFGRSGIHGWGLFARRNIQEG 908
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MV+EY GE VR SIAD RE VG Y+F+I +E V+DAT G+IA LINHSC
Sbjct: 909 EMVLEYRGEQVRGSIADLREARYRR--VGKDCYLFKISEEVVVDATDKGNIARLINHSCT 966
Query: 780 PNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGV 834
PNCY+R++SV +E I++ AK ++ EELTYDY F DE ++ C C P CR
Sbjct: 967 PNCYARIMSVGDEESRIVLIAKANVAVGEELTYDY-LFDPDEAEELKVPCLCKAPNCRKF 1025
Query: 835 VN 836
+N
Sbjct: 1026 MN 1027
>gi|224132822|ref|XP_002321418.1| SET domain protein [Populus trichocarpa]
gi|222868414|gb|EEF05545.1| SET domain protein [Populus trichocarpa]
Length = 1070
Score = 248 bits (633), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 161/489 (32%), Positives = 245/489 (50%), Gaps = 39/489 (7%)
Query: 363 KSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMM 422
+ RP+ K KL + L G Y PV W ++C+VC E++ N + C++C++
Sbjct: 606 QKRPSIKERKQKLLA---FLQGIYDPVFTKW-TTERCAVCRWVEDWDYNKIIICNRCQIA 661
Query: 423 VHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIW 481
VH CYG + W+C C P+ CCLCPV GGA+KPT + W H+ CA +
Sbjct: 662 VHQECYGARNVQDFTSWVCKACE--TPDVRRECCLCPVKGGALKPTDVESLWVHVTCAWF 719
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
PE ++MEP G+ + + + +C IC +G+C QC C YH +CA A
Sbjct: 720 QPEVSFASDEKMEPALGILSIPSNSFVKICVICQQIHGSCTQCCK--CSTYYHAMCASRA 777
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQ 592
G R+ L L+++ Q R++S+C H+ P D + + + LVQ
Sbjct: 778 GY--------RMELHCLEKNGR-QTTRMISYCACHRAPNPDTVLIIQTPAGVFSAKSLVQ 828
Query: 593 VTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN 652
+ + SN + EP + A + + N+ +
Sbjct: 829 NKKSAGTRLISSNRIKLEEES----MEEATKSEPHSAARCRVFKRVNSNKKRTEEEAIYH 884
Query: 653 GLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
L+ I S +F + + +F S ++ H++ T R+ FG+SGIHG+G+FA
Sbjct: 885 RLTRPCHHPFLEIQSLNAFRVVEEPKSFSSFRERLYHLQRTENDRVCFGRSGIHGWGLFA 944
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
+ + G+MV+EY GE VR SIAD RE + L G Y+F+I +E V+DAT G+IA
Sbjct: 945 RRNIQEGEMVLEYRGEQVRGSIADLRE--VRYRLEGKDCYLFKISEEVVVDATDKGNIAR 1002
Query: 773 LINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCG 827
LINHSC PNCY+R++SV +E I++ AK ++ +ELTYDY F DE ++ C C
Sbjct: 1003 LINHSCMPNCYARIMSVGDNESRIVLIAKTNVPAGDELTYDY-LFDPDEPDEFKVPCLCK 1061
Query: 828 FPRCRGVVN 836
P CR +N
Sbjct: 1062 APNCRKFMN 1070
>gi|224120768|ref|XP_002318412.1| SET domain protein [Populus trichocarpa]
gi|222859085|gb|EEE96632.1| SET domain protein [Populus trichocarpa]
Length = 1078
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 162/476 (34%), Positives = 243/476 (51%), Gaps = 50/476 (10%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C C
Sbjct: 628 YEPVYAKWT-TERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARNVQDFTSWVCKACE 686
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT + W H+ CA + PE ++MEP G+ +
Sbjct: 687 --TPDIKRECCLCPVKGGALKPTDVETLWVHVTCAWFRPEVSFASDEKMEPALGILSIPS 744
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ + +C IC +G+C QC C YH +CA AG R+ L L+++
Sbjct: 745 NSFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGY--------RMELHCLEKNGR- 793
Query: 565 QCIRLLSFCKKHKQPLNDRLAV---------DERLVQVTRRCCDYIPPSNPSGCARSEPY 615
Q +++S+C H+ P D + + + LVQ +R + SN +
Sbjct: 794 QTTKMISYCAYHRAPNLDTVLIIQTPVGVFSAKNLVQNKKRAGTRLISSNRT------KL 847
Query: 616 NYFGRRGRKEPEALAAA----------SLKRLFVENQPYLVGGYCQNGLSG-NTLPSIRV 664
E E+L+AA + KR E + + C + L +L + RV
Sbjct: 848 EEVSTEEATESESLSAARCRVFKRVNNNKKRTEEEAISHRLTRPCHHPLGEIQSLNAFRV 907
Query: 665 IGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIE 724
+ SFS R+ +L + H+ +T R+ FG+SGIHG+G+FA+ + G+MV+E
Sbjct: 908 VEEPKSFSSFRERLYYLQAS---LHLSKTENDRVCFGRSGIHGWGLFARRNIQEGEMVLE 964
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y GE VR SIAD RE L G Y+F+I +E V+DAT G+IA LINHSC PNCY+
Sbjct: 965 YRGEQVRGSIADLRE--ARYRLEGKDCYLFKISEEVVVDATDKGNIARLINHSCMPNCYA 1022
Query: 785 RVISVNGDE-HIIIFAKRDIKQWEELTYDYRF--FSIDE-QLACYCGFPRCRGVVN 836
R++SV +E I++ AK ++ +ELTYDY F DE ++ C C P CR +N
Sbjct: 1023 RIMSVGDNESRIVLIAKTNVSAGDELTYDYLFDPNEPDEFKVPCLCKAPNCRKYMN 1078
>gi|297821052|ref|XP_002878409.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324247|gb|EFH54668.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1002
Score = 246 bits (627), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 164/500 (32%), Positives = 250/500 (50%), Gaps = 78/500 (15%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PVR W ++C+VC E+++ N + C++C++ VH CYG + + W+C
Sbjct: 536 LEEKYEPVRAKWT-TERCAVCRWVEDWEENKMIICNRCQVAVHQECYGVSKSQDLTSWVC 594
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KP+ +G W H+ CA + PE + + MEP GL
Sbjct: 595 RACE--TPDIERECCLCPVKGGALKPSDVEGLWVHVTCAWFRPEVGFLNHENMEPAVGLF 652
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
++ + + +C+IC ++G+C+ C C +H +CA AG +EL E
Sbjct: 653 KIPVNSFLKVCTICKQTHGSCVHCCK--CATHFHAMCASRAGYNMELH---------CLE 701
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGC--ARSEPYNYF 618
+ Q R +C H++P D + V PSG +R+ N +
Sbjct: 702 KNGVQRTRKSVYCSFHRKPDPDSVVV----------------VHTPSGVFGSRNLLQNQY 745
Query: 619 GR---------RGRKEP--------------EALAAASLKRLFVENQPYLVGGYCQNGLS 655
GR + K P ++L+AA + N + + L
Sbjct: 746 GRTKGSRLVLTKKMKLPGFETQTQAEQSRVFDSLSAARCRIYSRSNTKKIDLEAISHRLK 805
Query: 656 GNTLPSIRVIGSKFSF--SLHRDAP-----------NFLSMADKYKHMKETFRKRLAFGK 702
G + S+ I ++ SF S AP +F S ++ KH++ T R+ FGK
Sbjct: 806 GPSHHSLGEIENRNSFKASFSFRAPFMSMFCFLGEADFTSFRERLKHLQRTENFRVCFGK 865
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
SGIHG+G+FA++ + G+M+IEY G VR S+AD RE + G Y+F+I +E VI
Sbjct: 866 SGIHGWGLFARNSIQEGEMIIEYRGVKVRRSVADLREANYRSQ--GKDCYLFKISEEIVI 923
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNG--DEHIIIFAKRDIKQWEELTYDYRFFSIDE 820
DAT +G+IA LINHSC PNCY+R++S+ D I++ AK ++ EELTYDY F +DE
Sbjct: 924 DATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAKTNVSAGEELTYDY-LFEVDE 982
Query: 821 ----QLACYCGFPRCRGVVN 836
++ C C P CR +N
Sbjct: 983 SEEIKVPCLCKAPNCRKFMN 1002
>gi|356522510|ref|XP_003529889.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 989
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 159/479 (33%), Positives = 241/479 (50%), Gaps = 63/479 (13%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV V W ++C+VC E++++N + C++C++ VH CYG + W+C +C
Sbjct: 546 YEPVNVKW-TTERCAVCRWVEDWEDNKIIICNRCQIAVHQECYGAKNVQDFTSWVCRVCE 604
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT + W H+ CA + P+ + + MEP G+ ++
Sbjct: 605 --TPDVERECCLCPVKGGALKPTDVEMLWVHVTCAWFRPQVVFQNHEAMEPAMGILKIPP 662
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ + C IC S+G+CI C C +H +CA AG +EL E +
Sbjct: 663 NSFVKTCVICKQSHGSCISCCK--CSTYFHVMCASRAGYTMELHSM---------EKNGT 711
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPS---NPSGCARSEPYNY---- 617
Q R L +C H+ P D + V + + + P + N GC R
Sbjct: 712 QVTRKLIYCAIHRVPNPDSVLVVHTPLGI------FSPRTSLQNQKGCFRGSRLILSKNI 765
Query: 618 -FGRRGRKEPEALAAASLKRLFVENQP----------YLVGGYCQNGLSGNTLPSIRVIG 666
E + + S R V + +L+GG PS+ +G
Sbjct: 766 ELNESSTTENDLVEPLSAARCRVYRRSPNKRADVPIIHLLGG-----------PSLHSLG 814
Query: 667 SKFSFSLHRDAPN---FLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 723
+ ++ +DA F S ++ H+ ET + R+ FGKSGIHG+G+FA+ + G+MV+
Sbjct: 815 AITQLNIFKDADESKVFTSFKERLHHLWETEKFRVCFGKSGIHGWGLFARRDIQEGEMVV 874
Query: 724 EYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 783
EY G VR S+AD RE + G Y+F+I +E V+DAT G+IA LINHSC PNCY
Sbjct: 875 EYRGVHVRRSVADLREEKYRSE--GKDCYLFKISEEVVVDATNRGNIARLINHSCMPNCY 932
Query: 784 SRVISVNGDE--HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
+R++S+ GD+ I++ AK ++ EELTYDY F DE ++ C C P CR +N
Sbjct: 933 ARIMSL-GDQGSRIVLIAKTNVSAGEELTYDY-LFDPDERDELKVPCLCKAPNCRRFMN 989
>gi|297803296|ref|XP_002869532.1| hypothetical protein ARALYDRAFT_913734 [Arabidopsis lyrata subsp.
lyrata]
gi|297315368|gb|EFH45791.1| hypothetical protein ARALYDRAFT_913734 [Arabidopsis lyrata subsp.
lyrata]
Length = 1024
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 162/479 (33%), Positives = 241/479 (50%), Gaps = 53/479 (11%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C
Sbjct: 575 LSETYEPVNAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARHVRDFTSWVC 633
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPT + W H+ CA + PE C ++MEP G+
Sbjct: 634 KACE--RPDIKRECCLCPVKGGALKPTDVETLWVHVTCAWFQPEVCFASEEKMEPAVGIL 691
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ + +C IC +G+C QC C YH +CA AG R+ L L++
Sbjct: 692 SIPSTNFVKICVICKQIHGSCTQCCK--CSTYYHAMCASRAGY--------RMELHCLEK 741
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQVTR----RCCDYIPPSNPS 607
+ + Q +++S+C H+ P D + + + LVQ + R I N
Sbjct: 742 NGQ-QITKMVSYCAYHRAPNPDNVLIIQTPSGAFSAKSLVQNKKKGGSRLISSIREDNEE 800
Query: 608 GCARS----EPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSG-NTLPSI 662
A +P++ R K S KR+ E P+ G + + TL +
Sbjct: 801 SPAEDTITRDPFSAARCRVFKR----KINSKKRIEEEAIPHHTRGPRHHPSAAIQTLNTF 856
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
R + + +F S ++ H++ T R+ FG+SGIHG+G+FA+ + G+MV
Sbjct: 857 RHVPE--------EPKSFSSFRERLHHLQRTEMDRVCFGRSGIHGWGLFARRNIQEGEMV 908
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
+EY GE VR SIAD RE VG Y+F+I +E V+DAT G+IA LINHSC PNC
Sbjct: 909 LEYRGEQVRGSIADLREARYRR--VGKDCYLFKISEEVVVDATDKGNIARLINHSCTPNC 966
Query: 783 YSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
Y+R++SV +E I++ AK ++ EELTYDY F DE ++ C C P CR +N
Sbjct: 967 YARIMSVGDEESRIVLIAKANVAVGEELTYDY-LFDPDEAEELKVPCLCKAPNCRKFMN 1024
>gi|255564717|ref|XP_002523353.1| trithorax, putative [Ricinus communis]
gi|223537441|gb|EEF39069.1| trithorax, putative [Ricinus communis]
Length = 1057
Score = 244 bits (624), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 166/473 (35%), Positives = 240/473 (50%), Gaps = 41/473 (8%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C
Sbjct: 608 LQEKYDPVYAKW-TTERCAVCRWVEDWDVNKIIICNRCQIAVHQECYGVKNIQDLTSWVC 666
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KP+ + W H+ CA + PE + ++MEP G+
Sbjct: 667 RACE--TPDVMRECCLCPVKGGALKPSDIEMLWVHVTCAWFRPEVAFLNHEKMEPATGIF 724
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
R+ + C IC ++G+CIQC C +H +CA AG +EL E
Sbjct: 725 RIPSTTFLKSCVICSQTHGSCIQCCK--CATYFHAMCASRAGYFMELH---------CIE 773
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPS---NPSGC-ARSEPYN 616
+ Q + L++C H+ P D + V V + S +GC S +
Sbjct: 774 KNGIQVTKKLAYCAVHRTPSVDSVVVVRSPTGV------FAARSLLQKQNGCFGGSRLIS 827
Query: 617 YFGRRGRKEP--------EALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSK 668
Y G G EP E L++A + N+ G + L G S+ I S
Sbjct: 828 YQGMEGLPEPSTSETNEFEPLSSARCRAYKRTNKKRAEGEPIFHRLMGLRHHSLDAISSL 887
Query: 669 FSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGE 728
+ D+ F S ++ H+++T R+ FGKSGIHG+G+FA+ + G+MVIEY GE
Sbjct: 888 STHKEMDDSLVFSSFKERLCHLQKTECHRVCFGKSGIHGWGLFARRNIQEGEMVIEYRGE 947
Query: 729 LVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVIS 788
VR SIAD RE L G Y+F+I +E VIDAT G+IA LINHSC PNCY+R++S
Sbjct: 948 QVRRSIADLRESRY--RLEGKDCYLFKISEEVVIDATNKGNIARLINHSCMPNCYARIMS 1005
Query: 789 VNGDEH-IIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
V E+ I++ AK ++ +ELTYDY F DE ++ C C P CR +N
Sbjct: 1006 VGDVENRIVLIAKTNVSAADELTYDY-LFDPDEHDDLKVPCLCRAPNCRKFMN 1057
>gi|168050289|ref|XP_001777592.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671077|gb|EDQ57635.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 907
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 165/520 (31%), Positives = 255/520 (49%), Gaps = 42/520 (8%)
Query: 338 KISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLD 397
++ +F + + +GL P + K + L Y PV W +
Sbjct: 409 RVKNQEQPLFAWLKDMLSNGAIGLAFEGPDLVPTRTKEQALLATLDQPYEPVDTTWTS-E 467
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
+C+VC M E++ +N + C++C++ VH CYG + W+C C P+ CCL
Sbjct: 468 RCAVCRMVEDWHHNKMIICNRCQIAVHEECYGVRASDSVGSWVCRACE--TPDVERECCL 525
Query: 458 CPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV 516
CPV GGA+KP+ T W H+ CA ++PE ++ +MEP +GL V ++ C IC
Sbjct: 526 CPVRGGALKPSNTANLWVHVTCAWFVPEVKFKNIVKMEPAEGLTNVHLSTFQQKCGICKQ 585
Query: 517 SYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+G C+ C++ CR ++H +CA + +E++ + L + ++ FC H
Sbjct: 586 VHGVCVSCAHKNCRRSFHIMCAFKSCYHMEMKAVTKNGL---------EMTQMNLFCSIH 636
Query: 577 KQPLNDR---LAVDERLVQVTRRCCDYIPPSNPS------GCARSEPYNYFGRRGRKEPE 627
+ P D L + V R D PS G S N R
Sbjct: 637 RTPNPDMHLMLKSPSGKIPVKR---DAAHPSGSESVVSLFGGYASGTGNTSLEHTRSVTN 693
Query: 628 ALAAASLK-RLFVENQPYLVGG----YCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLS 682
++AA L + Y G Y +G+S NT I I S F++ D +
Sbjct: 694 SVAARCLPYSTYDAKAKYKKRGKAIAYRVSGVSWNT---IENINSLRDFAV-LDQRKVFT 749
Query: 683 MADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI 742
++ K +++T +K + FGKS IHG+G+F++ + G+MVIEY GE VR S+AD RE I
Sbjct: 750 NEERLKFLQKTEKKTVCFGKSAIHGWGLFSRRAIQEGEMVIEYRGERVRGSVADLRE--I 807
Query: 743 YNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVN-----GDEHIII 797
G Y+F+I++E VIDAT G+IA LINHSC+P+CY++++ GD I++
Sbjct: 808 RYHKEGKDCYLFKINEEIVIDATDKGNIARLINHSCDPSCYAKILDFQRDDGEGDSRIVL 867
Query: 798 FAKRDIKQWEELTYDYRFFSIDEQ-LACYCGFPRCRGVVN 836
A++ I EELTY+YRF D Q + C CG CR +N
Sbjct: 868 IARKYIAAGEELTYNYRFDEEDTQKVPCLCGSSNCRKYMN 907
>gi|356560272|ref|XP_003548417.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 954
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 156/469 (33%), Positives = 239/469 (50%), Gaps = 43/469 (9%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV V W ++C+VC E++++N + C +C++ VH CYG + + W+C +C
Sbjct: 511 YEPVNVKW-TTERCAVCRWVEDWEDNKIIICSRCQIAVHQECYGAKKVQDFTSWVCRVCE 569
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT + W H+ CA + P+ + + MEP G+ ++
Sbjct: 570 --TPDVERECCLCPVKGGALKPTDVEMLWVHVTCAWFRPQVVFQNHEAMEPATGILKIPP 627
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ + C IC S+G+CI C C +H +CA AG +EL E +
Sbjct: 628 NSFVKTCVICEQSHGSCIACCK--CSTYFHVMCASRAGYTMELHSM---------EKNGT 676
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPS---NPSGCARSE-------- 613
Q + L +C H+ P D + V + + + P + N GC R
Sbjct: 677 QITKKLIYCAIHRVPNPDSVLVVHTPLGI------FSPRTSLQNQKGCFRGSRLISSKNI 730
Query: 614 PYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSL 673
N + E L+AA R++ + + L G +L S+ I F
Sbjct: 731 ELNESSTTEKDIVEPLSAARC-RVYQRSPNKRADVPIIHLLRGPSLHSLGAITQLNHFKD 789
Query: 674 HRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPS 733
++ F S ++ H+ E + R+ FGKSGIHG+G+FA+ + G+MV+EY G VR S
Sbjct: 790 ADESKVFTSFKERLHHLWEMEKFRVCFGKSGIHGWGLFARRDIQEGEMVVEYRGVHVRRS 849
Query: 734 IADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDE 793
+ D RE + G Y+F+I +E V+DAT +G+IA LINHSC PNCY+R++S+ GD+
Sbjct: 850 VTDLREEKYRSE--GKDCYLFKISEEVVVDATNSGNIARLINHSCMPNCYARIMSM-GDQ 906
Query: 794 --HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
I++ AK ++ EELTYDY F DE ++ C C P CR +N
Sbjct: 907 GSRIVLIAKTNVSAGEELTYDY-LFDPDERDELKVPCLCKAPNCRRFMN 954
>gi|356507582|ref|XP_003522543.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Glycine max]
Length = 1035
Score = 242 bits (617), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 160/490 (32%), Positives = 238/490 (48%), Gaps = 76/490 (15%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C
Sbjct: 587 LQEKYEPVHAKW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARNVRDFTSWVC 645
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPT D W H+ CA + PE ++MEP G+
Sbjct: 646 KACE--RPDIKRECCLCPVKGGALKPTDVDTLWVHVTCAWFRPEVSFASDEKMEPALGIL 703
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ + + +C IC +G+C QC C +H +CA AG R+ L L++
Sbjct: 704 SIPSNSFVKICVICKQIHGSCTQCCK--CSTYFHAMCASRAGY--------RMELHCLEK 753
Query: 561 DDEDQCIRLLSFCKKHKQP--------------------LNDRLAVDERLVQVTRRCCDY 600
+ + Q +++S+C H+ P L + RL+ +R+ D
Sbjct: 754 NGK-QTTKMVSYCAYHRAPNPDTVLIMQTPLGVISTKSLLQTKKKTGSRLISSSRKKQDD 812
Query: 601 IPPSNPSGCARSEPYN-----YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
P N EP++ F R + A A R+ G
Sbjct: 813 SPVDN----TEHEPFSAARCRIFQRTNHTKKRAADEAVSHRV--------------RGPY 854
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPN----FLSMADKYKHMKETFRKRLAFGKSGIHGFGIF 711
+ L +I S + HR + F S ++ H++ T +R+ FG+SGIH +G+F
Sbjct: 855 HHPLDAIE------SLNTHRQVVHEPQAFSSFRERLYHLQRTENERVCFGRSGIHEWGLF 908
Query: 712 AKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIA 771
A+ + GDMV+EY GE VR SIAD RE L G Y+F+I +E V+DAT G+IA
Sbjct: 909 ARRNIQEGDMVLEYRGEQVRRSIADLRE--ARYRLEGKDCYLFKISEEVVVDATDKGNIA 966
Query: 772 HLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYC 826
LINHSC PNCY+R++SV +E I++ AK ++ +ELTYDY F DE ++ C C
Sbjct: 967 RLINHSCMPNCYARIMSVGDEESRIVLIAKTNVAAGDELTYDY-LFDPDEPEENKVPCLC 1025
Query: 827 GFPRCRGVVN 836
P CR +N
Sbjct: 1026 KAPNCRKFMN 1035
>gi|218187758|gb|EEC70185.1| hypothetical protein OsI_00917 [Oryza sativa Indica Group]
Length = 991
Score = 241 bits (616), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 163/485 (33%), Positives = 242/485 (49%), Gaps = 71/485 (14%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L +RPV W ++C+VC E++ N + C++C++ VH CYG + + W+C
Sbjct: 548 LANSFRPVNARW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARDVQDFTNWVC 606
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPT D W H+ CA + P+ + MEP G+
Sbjct: 607 RACE--LPKQKRECCLCPVKGGALKPTDIDQLWVHVTCAWFQPKVSFPVDETMEPAMGIL 664
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ + +K C IC +GAC QC C YH +CA AG +EL+ E
Sbjct: 665 SIPSEYFKKACVICKQMHGACTQCYK--CSTYYHAMCASRAGYRMELQ---------YSE 713
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAV---------------------DERLVQVTRRCCD 599
+ R++S+C H P D + + RLV+ +
Sbjct: 714 KNGRNITRMVSYCAFHSTPDPDNVLIVKTPEGVFSTKFLPQNNEKQSGTRLVR-KENLQE 772
Query: 600 YIPPSNPSGC--ARSEPYNYFGRRGRKEP-EALAAASLKRLFVENQPYLVGGYCQNGLSG 656
+ P+ S C AR PY + +KEP EA+A + G
Sbjct: 773 KVLPAKISDCPAARCLPYEML--KNKKEPGEAIAHRIM------------------GPRH 812
Query: 657 NTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPH 716
++ SI + + +D +F + ++ +++++ KR++ G+SGIHG+G+FA
Sbjct: 813 HSQESIEGLNACMD---QKDEKSFATFRERLRYLQKIENKRVSCGRSGIHGWGLFAAKKI 869
Query: 717 RAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINH 776
+ G MVIEY G+ VR S+AD RE + Y+F+I ++ V+DAT G+IA LINH
Sbjct: 870 QEGQMVIEYRGDQVRRSVADLREARYHRE--KKDCYLFKISEDVVVDATEKGNIARLINH 927
Query: 777 SCEPNCYSRVISVNGDE--HIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGFPRC 831
SC PNCY+R++SV GDE II+ AKRD+ EELTYDY F S D ++ C C P C
Sbjct: 928 SCMPNCYARIMSV-GDEKSQIILIAKRDVSAGEELTYDYLFDPDESEDCRVPCLCKAPNC 986
Query: 832 RGVVN 836
RG +N
Sbjct: 987 RGYMN 991
>gi|242051571|ref|XP_002454931.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
gi|241926906|gb|EES00051.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
Length = 993
Score = 241 bits (616), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 223/838 (26%), Positives = 361/838 (43%), Gaps = 147/838 (17%)
Query: 85 GDIIWAKLTGHA-MWPAIVVDESLIGDYKGLNKISGGRSIPVQFFG------THDFARIN 137
GD++WA+ + WPA+V+D LN G ++ V FFG D+ +
Sbjct: 217 GDVVWARSGKKSPTWPALVIDPMQHAPEVVLNSCVPG-ALCVMFFGYSANGHGRDYGWVK 275
Query: 138 VKQVISFLKGLLSSFH----LKCKKPRFTQSLEEAKVYLSEQKL----------PRRMLQ 183
+ F+ L F K + +F ++EEA +L+E+ PR+ +
Sbjct: 276 QGMIFPFVD-YLDRFQGQPLYKLRPSKFRAAIEEA--FLAERGFFDLEMDGVCSPRKSVN 332
Query: 184 LQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLG-----------SIGISPYVFGD 232
Q+ + + Q+E + + + C D ++ G L S G
Sbjct: 333 KQS-------DPNGFQEEAASNNEQECQSDNQVVGKLALSCDSCGNRLPSKVSKKRKQGG 385
Query: 233 LQIL--SLGKIVKDSEYFQDDRFIW-----------------PEGYTAVRKFTSLADPRV 273
Q+L K+++ +Y + IW P+ + ++ T A ++
Sbjct: 386 EQVLCRHCEKLLQSKQYCGICKKIWHHTDGGNWDLEKADYYCPDCKS--KRKTVPATEKM 443
Query: 274 CNSYKMEVLRDTESKIRPLFRVTLDNGEQFTGSTPST--CWSKIC----MKIREGQNNTS 327
S E T+ K+ + V + E C K C M + E + +T
Sbjct: 444 NTSNSSECASTTKEKLTAMIAVCCNGEEALYVPEKHMILCQCKSCKERMMSLNEWEKHTG 503
Query: 328 DDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSR--PTSKSSLCKLTSKYRDLPGG 385
+ + +G + + + G +KS K L +L +
Sbjct: 504 SRKKNWKMSIRQKSTGEPLINLLD----DIPCGASKSSNPGIKKEELLQLQAN------A 553
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV W ++C+VC E++ N + C++C++ VH CYG + WLC C
Sbjct: 554 YSPVCAKWT-TERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARAVQDLTTWLCRACE 612
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT +G W H+ CA + P+ + MEP G+ +
Sbjct: 613 --FPQQKRECCLCPVKGGALKPTDIEGLWVHVTCAWFQPKVSFPVEETMEPAMGIMSIPV 670
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ +K C IC +GAC QC C YH +CA A +EL+ +R +
Sbjct: 671 EYFKKTCVICKQIHGACTQCHK--CSTYYHAMCASRAAYRMELQCSERSGRHT------- 721
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDE---------------------RLVQVTRRCCDYIPP 603
I+++S+C H P D + + + RLV+ + + P
Sbjct: 722 --IKMVSYCDFHSTPDPDNVLIVKTPEGVFSTKFFLQNGEKQTAARLVRKENHQ-EKVLP 778
Query: 604 SNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVG-GYCQNGLSGNTLPSI 662
S S C ++ Y + +KEP A + +Q ++ G C +
Sbjct: 779 SIVSDCLVAKCLTYEMSKYKKEPGEAIAHRIMGPRHHSQDFIDGLNACMD---------- 828
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+D +F + ++ ++++T R++ G+SGIHG+G+FA + G MV
Sbjct: 829 -----------RKDDQSFSTFKERLGYLQKTENLRVSCGRSGIHGWGLFAARNIQEGQMV 877
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE VR +AD RE + Y+F+I ++ VIDAT G+IA LINHSC PNC
Sbjct: 878 IEYRGEQVRRCVADLREAQYHRE--KKDCYLFKISEDVVIDATDKGNIARLINHSCMPNC 935
Query: 783 YSRVISVNGDEH-IIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
Y+R+++V+GD + II+ AKRD+ EELTYDY F S D ++ C C P CRG +N
Sbjct: 936 YARIMTVSGDRNQIILIAKRDVSAGEELTYDYLFDPDESEDCKVPCLCKAPNCRGYMN 993
>gi|414875701|tpg|DAA52832.1| TPA: hypothetical protein ZEAMMB73_838843 [Zea mays]
Length = 971
Score = 241 bits (615), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 157/478 (32%), Positives = 236/478 (49%), Gaps = 65/478 (13%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV W ++C+VC E++ N + C++C++ VH CYG + WLC C
Sbjct: 532 YSPVCAKWT-TERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGACAVQDLTTWLCRACE 590
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
+P+ CCLCPV GGA+KPT DG W H+ CA + P+ + MEP G+ +
Sbjct: 591 --SPQQKRECCLCPVKGGALKPTDIDGLWVHVTCAWFQPKVSFPVDETMEPAMGIMSIPV 648
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ +K C IC +GAC QC C YH +CA AG C+EL+ +R
Sbjct: 649 EYFKKTCVICKQMHGACTQCYK--CSTYYHAICASRAGYCMELQCSERSG---------R 697
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDE---------------------RLVQVTRRCCDYIPP 603
+++S+C H P D + + + RLV+ +PP
Sbjct: 698 HITKMVSYCDFHSTPDPDNVLIVKTPEGVFSTKFFLQNGEKQTAARLVRKENHQEKVLPP 757
Query: 604 SNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVG-GYCQNGLSGNTLPSI 662
S C ++ Y + +K+P A + +Q ++ G C +
Sbjct: 758 II-SDCLVAKCLTYEMSKYKKQPGEAIAHRIMGPRHHSQDFIDGLNACMD---------- 806
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+D +F + ++ +++++T R++ G+SGIHG+G+FA + G MV
Sbjct: 807 -----------RKDDQSFSTFKERLRYLQKTENLRVSCGRSGIHGWGLFAARNIQEGQMV 855
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE VR +AD RE + Y+F+I ++ VIDAT G+IA LINHSC PNC
Sbjct: 856 IEYRGEQVRRCVADLREAQYHRE--KKDCYLFKISEDVVIDATDKGNIARLINHSCMPNC 913
Query: 783 YSRVISVNGD-EHIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
Y+R+++V+ D II+ AKRD+ EELTYDY F S D ++ C C P CRG +N
Sbjct: 914 YARIMTVSDDRSEIILIAKRDVSAGEELTYDYLFDPDESEDCRVPCLCKAPNCRGYMN 971
>gi|145339751|ref|NP_191733.3| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
gi|259016183|sp|Q9M364.2|ATX3_ARATH RecName: Full=Histone-lysine N-methyltransferase ATX3; AltName:
Full=Protein SET DOMAIN GROUP 14; AltName:
Full=Trithorax-homolog protein 3; Short=TRX-homolog
protein 3
gi|225898735|dbj|BAH30498.1| hypothetical protein [Arabidopsis thaliana]
gi|332646729|gb|AEE80250.1| histone-lysine N-methyltransferase ATX3 [Arabidopsis thaliana]
Length = 1018
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 166/520 (31%), Positives = 256/520 (49%), Gaps = 99/520 (19%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PVR W ++C+VC E+++ N + C++C++ VH CYG + + W+C
Sbjct: 533 LEEKYEPVRAKWT-TERCAVCRWVEDWEENKMIICNRCQVAVHQECYGVSKSQDLTSWVC 591
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KP+ +G W H+ CA + PE + + MEP GL
Sbjct: 592 RACE--TPDIERDCCLCPVKGGALKPSDVEGLWVHVTCAWFRPEVGFLNHENMEPAVGLF 649
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
++ + + +C+IC ++G+C+ C C +H +CA AG +EL E
Sbjct: 650 KIPANSFLKVCTICKQTHGSCVHCCK--CATHFHAMCASRAGYNMELH---------CLE 698
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGC--ARSEPYNYF 618
+ Q R +C H++P D + + PSG +R+ N +
Sbjct: 699 KNGVQRTRKSVYCSFHRKPDPDSV----------------VVVHTPSGVFGSRNLLQNQY 742
Query: 619 GR---------RGRKEP--------------EALAAA-------SLKRLFVENQPYLVGG 648
GR + K P ++L+AA S ++ +E + + G
Sbjct: 743 GRAKGSRLVLTKKMKLPGFQTQTQAEQSRVFDSLSAARCRIYSRSNTKIDLEAISHRLKG 802
Query: 649 YCQNGLSG-NTLPSIRV--------------IGSKFSFSL-----------HRDAPNFLS 682
+ LS L S + +G+ FS L H++A +F S
Sbjct: 803 PSHHSLSAIENLNSFKASFSFRAPFMSVFCFLGATFSEYLRKILISIYLVTHQEA-DFTS 861
Query: 683 MADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI 742
++ KH++ T R+ FGKSGIHG+G+FA+ + G+M+IEY G VR S+AD RE
Sbjct: 862 FRERLKHLQRTENFRVCFGKSGIHGWGLFARKSIQEGEMIIEYRGVKVRRSVADLREANY 921
Query: 743 YNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG--DEHIIIFAK 800
+ G Y+F+I +E VIDAT +G+IA LINHSC PNCY+R++S+ D I++ AK
Sbjct: 922 RSQ--GKDCYLFKISEEIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAK 979
Query: 801 RDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
++ EELTYDY F +DE ++ C C P CR +N
Sbjct: 980 TNVAAGEELTYDY-LFEVDESEEIKVPCLCKAPNCRKFMN 1018
>gi|110742931|dbj|BAE99361.1| trithorax 3 [Arabidopsis thaliana]
Length = 1018
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 166/520 (31%), Positives = 256/520 (49%), Gaps = 99/520 (19%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PVR W ++C+VC E+++ N + C++C++ VH CYG + + W+C
Sbjct: 533 LEEKYEPVRAKWT-TERCAVCRWVEDWEENKMIICNRCQVAVHQECYGVSKSQDLTSWVC 591
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KP+ +G W H+ CA + PE + + MEP GL
Sbjct: 592 RACE--TPDIERDCCLCPVKGGALKPSDVEGLWVHVTCAWFRPEVGFLNHENMEPAVGLF 649
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
++ + + +C+IC ++G+C+ C C +H +CA AG +EL E
Sbjct: 650 KIPANSFLKVCTICKQTHGSCVHCCK--CATHFHAMCASRAGYNMELH---------CLE 698
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGC--ARSEPYNYF 618
+ Q R +C H++P D + + PSG +R+ N +
Sbjct: 699 KNGVQRTRKSVYCSFHRKPDPDSV----------------VVVHTPSGVFGSRNLLQNQY 742
Query: 619 GR---------RGRKEP--------------EALAAA-------SLKRLFVENQPYLVGG 648
GR + K P ++L+AA S ++ +E + + G
Sbjct: 743 GRAKGSRLVLTKKMKLPGFQTQTQAEQSRVFDSLSAARCRIYSRSNTKIDLEAISHRLKG 802
Query: 649 YCQNGLSG-NTLPSIRV--------------IGSKFSFSL-----------HRDAPNFLS 682
+ LS L S + +G+ FS L H++A +F S
Sbjct: 803 PSHHSLSAIENLNSFKASFSFRAPFMSVFCFLGATFSEYLRKILISIYLVTHQEA-DFTS 861
Query: 683 MADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI 742
++ KH++ T R+ FGKSGIHG+G+FA+ + G+M+IEY G VR S+AD RE
Sbjct: 862 FRERLKHLQRTENFRVCFGKSGIHGWGLFARKSIQEGEMIIEYRGVKVRRSVADLREANY 921
Query: 743 YNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG--DEHIIIFAK 800
+ G Y+F+I +E VIDAT +G+IA LINHSC PNCY+R++S+ D I++ AK
Sbjct: 922 RSQ--GKDCYLFKISEEIVIDATDSGNIARLINHSCMPNCYARIVSMGDGEDNRIVLIAK 979
Query: 801 RDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
++ EELTYDY F +DE ++ C C P CR +N
Sbjct: 980 TNVAAGEELTYDY-LFEVDESGEIKVPCLCKAPNCRKFMN 1018
>gi|357463899|ref|XP_003602231.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355491279|gb|AES72482.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 1053
Score = 239 bits (610), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 154/476 (32%), Positives = 239/476 (50%), Gaps = 49/476 (10%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PV W ++C+VC E++ N + C++C++ VH CYG + W+C
Sbjct: 606 LKEKYEPVYAKWT-TERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGAKNVRDFTSWVC 664
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KP D W H+ CA + PE ++MEP G+
Sbjct: 665 KACE--TPDIKRECCLCPVKGGALKPADIDTLWVHVTCAWFRPEVSFASDEKMEPALGIL 722
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ + + +C IC +G+C QC C +H +CA AG R+ L L +
Sbjct: 723 SIPSNSFVKICVICKQIHGSCTQCCK--CSTYFHAMCASRAGY--------RMELHCLKK 772
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQVTRRCCDYIPPS------N 605
+ + Q +++S+C H+ P D + + + L+Q + I + N
Sbjct: 773 NGK-QTTKMVSYCAYHRAPNPDNVLILQTPLGVISTKSLLQKRKVGSRLISSARIEKEDN 831
Query: 606 PSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVI 665
P +P++ R + + KR E +L G+ + L +T+ S+
Sbjct: 832 PIDITELDPFSA----ARCQIFKRTNHTRKRAADEAIFHLARGHSHHPL--DTIQSLNTY 885
Query: 666 GSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEY 725
+ + + F S ++ H++ T R+ FG+SGIHG+G+FA+ + G+MV+EY
Sbjct: 886 RA-----VVEEPQAFASFRERLYHLQRTENGRVCFGRSGIHGWGLFARRNIQEGEMVLEY 940
Query: 726 TGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSR 785
GE VR S+AD RE G Y+F+I +E V+DAT G+IA LINHSC PNCY+R
Sbjct: 941 RGEQVRRSVADLREARYRAE--GKDCYLFKISEEVVVDATDKGNIARLINHSCMPNCYAR 998
Query: 786 VISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
++SV DE I++ AK ++ +ELTYDY F DE ++ C C P CR +N
Sbjct: 999 IMSVGDDESRIVLIAKTNVSAGDELTYDY-LFDPDEPDEFKVPCMCKAPNCRKFMN 1053
>gi|326519903|dbj|BAK03876.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1025
Score = 239 bits (609), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 160/472 (33%), Positives = 239/472 (50%), Gaps = 52/472 (11%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y V+V W ++C+VC E++ N + C++C+M VH CYG + W+C C
Sbjct: 585 YDHVQVKWT-TERCAVCRWVEDWDYNKIVICNRCQMAVHQECYGVSGKQDFTSWICRACE 643
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
PE CCLCPV GGA+KPT D W H+ CA + P+ + MEP G+ ++
Sbjct: 644 --TPEQKRECCLCPVKGGALKPTNVDNLWVHVTCAWFQPQVAFDSDELMEPAIGILSMAP 701
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ +C IC +G+C QC C YH CA AG R+ L SL+++ +
Sbjct: 702 LLFMKMCVICRQIHGSCTQCYR--CSTYYHATCASRAGY--------RMELHSLEKNGK- 750
Query: 565 QCIRLLSFCKKHKQPLNDRLAV---------DERLVQ------VTRRCCDYIPPSNPSGC 609
Q +++S+C H++P D + + +RLVQ +R +P + S
Sbjct: 751 QSTKMISYCAHHRRPNPDNVLIIQTPAGTFSSKRLVQSSGTIAASRLIRKDLPNDSASEV 810
Query: 610 ARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKF 669
E N R R LKR + V GYCQ+ I S
Sbjct: 811 ETIE--NLCSARCR----VYVRKDLKRSREAAMAHRVRGYCQHRWDE--------IDSLN 856
Query: 670 SFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGEL 729
RD +F + ++ +++++T R+ FG+SGIH +G+FA+ + G+MV+EY GE
Sbjct: 857 PRREERDPESFSTFTERLRYLQKTEHSRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQ 916
Query: 730 VRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV 789
VR S+AD RE + G Y+F+I +E V+DAT G++A LINHSC PN Y+R++SV
Sbjct: 917 VRRSVADLRE--AQYRVQGKDCYLFKISEEVVVDATDKGNVARLINHSCTPNFYARIMSV 974
Query: 790 NGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
D+ I++ AKR++ EELTYDY F DE ++ C C CR +N
Sbjct: 975 GHDQSRIVLIAKRNVSAGEELTYDY-LFDPDEADDCKVPCLCQTADCRKFMN 1025
>gi|356503907|ref|XP_003520741.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 146/467 (31%), Positives = 237/467 (50%), Gaps = 43/467 (9%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV +W ++C++C E++++N + C++C++ VH CYG + W+C C
Sbjct: 546 YEPVYANWT-TERCAICRWVEDWEDNKIIICNRCQIAVHQECYGAKNVQDFTSWVCRACE 604
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT + W H+ CA + PE + K MEP G+ ++
Sbjct: 605 --TPDVERECCLCPVKGGALKPTDVEMLWVHVTCAWFQPEVLFQNDKAMEPASGIFKIPP 662
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ + C IC S+G+C C C +H +CA G +EL ++ L ++
Sbjct: 663 NSFSKTCVICKQSHGSCTSCCK--CATYFHVMCASRKGYSMELHSTEKNGTLITEK---- 716
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRK 624
L +C +H+ P + V +V +P+ R+ + G R
Sbjct: 717 -----LIYCARHRVPNPESGLVVHTPQEVF----------SPTTSLRNHLGSIRGSRLVS 761
Query: 625 EPEALAAASLKRLFVENQPYLVGGYCQNGL----SGNTLPSIRVIGSKFSFSL------- 673
++ E +P L C+ + + +P I ++G SL
Sbjct: 762 SEIMELPKTVNSEINEIEP-LSAARCRVFIRPSRKKDEVPIIHLLGRTNLHSLSAITQLN 820
Query: 674 -HRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRP 732
++DA F S+ ++ H+++T +++ GKSGIHG+G+FA+ G+MV+EY GE +R
Sbjct: 821 SNKDAQVFSSLKERLHHLQKTENQKVCLGKSGIHGWGLFARRDLEEGEMVVEYRGEQLRR 880
Query: 733 SIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGD 792
SI D RE + G Y F+I +E VIDAT G+IA LINHSC PNC++R++ ++
Sbjct: 881 SITDLREAQYRSE--GKDCYFFKISEEVVIDATDKGNIARLINHSCMPNCFARIVPLSDQ 938
Query: 793 EH-IIIFAKRDIKQWEELTYDYRF--FSIDEQLACYCGFPRCRGVVN 836
E+ I++ AK ++ EELTY+Y F +E++ C C P C G +N
Sbjct: 939 ENRIVLIAKTNVSAGEELTYNYSFDDERDEEKVVCRCKAPNCSGFMN 985
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 49/105 (46%), Gaps = 16/105 (15%)
Query: 85 GDIIWAKL-TGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFG------THDFARIN 137
GDI+WAK H WPAIV+D L L+ G ++ V FFG D+A +
Sbjct: 201 GDIVWAKCGKKHPAWPAIVIDPLLQAPKSVLSCCVPG-ALCVMFFGYSKNGKQRDYAWVK 259
Query: 138 VKQVISFLKGL-----LSSFHLKCKKPRFTQSLEEAKVYLSEQKL 177
+ FL+ + S H KCK F +L+EA YL E +
Sbjct: 260 QGMIFPFLEFMDRFPGQSQLH-KCKASDFNGALDEA--YLVENGI 301
>gi|115435312|ref|NP_001042414.1| Os01g0218800 [Oryza sativa Japonica Group]
gi|56784088|dbj|BAD81417.1| putative trithorax 3 [Oryza sativa Japonica Group]
gi|113531945|dbj|BAF04328.1| Os01g0218800 [Oryza sativa Japonica Group]
Length = 991
Score = 238 bits (606), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 162/485 (33%), Positives = 241/485 (49%), Gaps = 71/485 (14%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L +RPV W ++C+VC E++ N + C++C++ VH CYG + + W+C
Sbjct: 548 LANSFRPVNARW-TTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARDVQDFTNWVC 606
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPT D W H+ CA + P+ + MEP G+
Sbjct: 607 RACE--LPKQKRECCLCPVKGGALKPTDIDQLWVHVTCAWFQPKVSFPVDETMEPAMGIL 664
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ + +K C IC +GAC QC C YH +CA AG +EL+ E
Sbjct: 665 SIPSEYFKKACVICKQMHGACTQCYK--CSTYYHAMCASRAGYRMELQ---------YSE 713
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAV---------------------DERLVQVTRRCCD 599
+ R++S+C H P D + + RLV+ +
Sbjct: 714 KNGRNITRMVSYCAFHSTPDPDNVLIVKTPEGVFSTKFLPQNNEKQSGTRLVR-KENLQE 772
Query: 600 YIPPSNPSGC--ARSEPYNYFGRRGRKEP-EALAAASLKRLFVENQPYLVGGYCQNGLSG 656
+ P+ S C AR PY + +KEP EA+A + G
Sbjct: 773 KVLPAKISDCPAARCLPYEML--KNKKEPGEAIAHRIM------------------GPRH 812
Query: 657 NTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPH 716
++ SI + + +D +F + ++ +++++ KR++ G+SGIHG+G+FA
Sbjct: 813 HSQESIEGLNACMD---QKDEKSFATFRERLRYLQKIENKRVSCGRSGIHGWGLFAAKKI 869
Query: 717 RAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINH 776
+ G MVIEY G+ VR S+AD RE + Y+F+I ++ V+DAT G+IA LINH
Sbjct: 870 QEGQMVIEYRGDQVRRSVADLREARYHRE--KKDCYLFKISEDVVVDATEKGNIARLINH 927
Query: 777 SCEPNCYSRVISVNGDE--HIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGFPRC 831
SC PNCY+R++SV GDE II+ AKRD+ EELTYDY F S D ++ C C C
Sbjct: 928 SCMPNCYARIMSV-GDEKSQIILIAKRDVSAGEELTYDYLFDPDESEDCRVPCLCKALNC 986
Query: 832 RGVVN 836
RG +N
Sbjct: 987 RGYMN 991
>gi|357135761|ref|XP_003569477.1| PREDICTED: histone-lysine N-methyltransferase ATX4-like [Brachypodium
distachyon]
Length = 1037
Score = 237 bits (605), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 158/472 (33%), Positives = 232/472 (49%), Gaps = 52/472 (11%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV V W ++C+VC E++ N + C++C++ VH CYG + W+C C
Sbjct: 597 YDPVHVKW-TTERCAVCRWVEDWDYNKIVICNRCQIAVHQECYGVRGRQDFTSWVCRACE 655
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
PE CCLCPV GGA+KPT D W H+ACA + P+ + MEP G+ ++
Sbjct: 656 --TPEQKRECCLCPVKGGALKPTNVDNLWVHVACAWFQPQVAFASDELMEPAIGILSIAP 713
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ +C IC +G+C QC C YH +CA AG +EL E +
Sbjct: 714 LLFMKMCVICRQIHGSCTQCYR--CSTYYHAICASRAGYQMELH---------CLEKNGK 762
Query: 565 QCIRLLSFCKKHKQPLND---------------RLAVDERLVQVTRRCCDYIPPSNPSGC 609
Q + +S+C H+ P D RL R V +R +P + S
Sbjct: 763 QTTKKISYCAHHRTPNPDNVLIIQTPAGTFSSKRLVKSSRTVAASRLIRKDLPKDSASKV 822
Query: 610 ARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKF 669
+E + R R LKR + V G CQ+ I S
Sbjct: 823 ETTE--SLCAARCR----VYVRKELKRSREAAMAHRVRGSCQHRWDE--------IDSLN 868
Query: 670 SFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGEL 729
RD +F + ++ ++++T R+ FG+SGIH +G+FA+ + G+MV+EY GE
Sbjct: 869 PPRAERDPESFSTFKERLHYLQKTEHSRVCFGRSGIHRWGLFARRDIQEGEMVLEYRGEQ 928
Query: 730 VRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV 789
VR S+AD RE + G Y+F+I +E V+DAT G++A LINHSC PN Y+R++SV
Sbjct: 929 VRRSVADLRE--AQYRVQGKDCYLFKISEEVVVDATDKGNVARLINHSCTPNFYARIMSV 986
Query: 790 NGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
D+ I++ AKR+++ EELTYDY F DE ++ C C CR +N
Sbjct: 987 GHDQSRIVLIAKRNVRAGEELTYDY-LFDPDEADDCKVPCLCQTADCRKFMN 1037
>gi|356570970|ref|XP_003553655.1| PREDICTED: histone-lysine N-methyltransferase ATX3-like [Glycine
max]
Length = 985
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 150/463 (32%), Positives = 237/463 (51%), Gaps = 35/463 (7%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y PV +W ++C++C E++++N + C++C++ VH CYG + W+C +C
Sbjct: 546 YEPVYANWI-TERCAICRWVEDWEDNKIIICNRCQIAVHQECYGAKNVQDFTSWVCRVCE 604
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
P+ CCLCPV GGA+KPT + W H+ CA + PE + K MEP G+ ++
Sbjct: 605 --NPDVERECCLCPVKGGALKPTDVEMLWVHVTCAWFRPEVIFQNDKAMEPASGILKIPP 662
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ + C IC S+G+C C C +H +CA G +EL ++ L ++
Sbjct: 663 NSFSKTCVICKQSHGSCTSC--CKCATYFHVMCASRKGYSMELHSTEKNGTLITNK---- 716
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNY------- 617
L +C H+ P + V +V ++P RS +
Sbjct: 717 -----LIYCAMHRVPNPESGLVVHTPNEVFSSTTSL--QNHPGSIRRSRLVSSEIVVLPE 769
Query: 618 FGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDA 677
E E L+AA R+F+ G + L G L S+ I + ++DA
Sbjct: 770 SANSEINEIEPLSAARC-RVFIRPSRKKDGVPIIHLLGGPNLHSLSAITQ---LNSNKDA 825
Query: 678 PNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADR 737
F S+ ++ H+++T +++ GKSGIHG+G+FA+ G+MV+EY GE +R SI D
Sbjct: 826 QVFSSLKERLHHLQKTENQKVCLGKSGIHGWGLFARRDLEEGEMVVEYRGEQLRRSITDL 885
Query: 738 REHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDE--HI 795
RE + G Y F+I++E VIDAT G+IA LINHSC PNC++R++ +GD+ I
Sbjct: 886 REAQYRSE--GKDCYFFKINEEVVIDATDKGNIARLINHSCMPNCFARIVP-SGDQKNRI 942
Query: 796 IIFAKRDIKQWEELTYDYRF--FSIDEQLACYCGFPRCRGVVN 836
++ AK ++ EELTY+Y F +E++ C C P C G +N
Sbjct: 943 VLIAKTNVSAGEELTYNYSFDDERDEEKVPCRCKAPNCSGFMN 985
>gi|222617992|gb|EEE54124.1| hypothetical protein OsJ_00897 [Oryza sativa Japonica Group]
Length = 1585
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 157/470 (33%), Positives = 233/470 (49%), Gaps = 69/470 (14%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L +RPV W ++C+VC E++ N + C++C++ VH CYG + + W+C
Sbjct: 456 LANSFRPVNARWTT-ERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARDVQDFTNWVC 514
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KPT D W H+ CA + P+ + MEP G+
Sbjct: 515 RACE--LPKQKRECCLCPVKGGALKPTDIDQLWVHVTCAWFQPKVSFPVDETMEPAMGIL 572
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
+ + +K C IC +GAC QC C YH +CA AG +EL+ E
Sbjct: 573 SIPSEYFKKACVICKQMHGACTQCYK--CSTYYHAMCASRAGYRMELQ---------YSE 621
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAV---------------------DERLVQVTRRCCD 599
+ R++S+C H P D + + RLV+ +
Sbjct: 622 KNGRNITRMVSYCAFHSTPDPDNVLIVKTPEGVFSTKFLPQNNEKQSGTRLVR-KENLQE 680
Query: 600 YIPPSNPSGC--ARSEPYNYFGRRGRKEP-EALAAASLKRLFVENQPYLVGGYCQNGLSG 656
+ P+ S C AR PY + +KEP EA+A + G
Sbjct: 681 KVLPAKISDCPAARCLPYEML--KNKKEPGEAIAHRIM------------------GPRH 720
Query: 657 NTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPH 716
++ SI + + +D +F + ++ +++++ KR++ G+SGIHG+G+FA
Sbjct: 721 HSQESIEGLNACMD---QKDEKSFATFRERLRYLQKIENKRVSCGRSGIHGWGLFAAKKI 777
Query: 717 RAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINH 776
+ G MVIEY G+ VR S+AD RE + Y+F+I ++ V+DAT G+IA LINH
Sbjct: 778 QEGQMVIEYRGDQVRRSVADLREARYHRE--KKDCYLFKISEDVVVDATEKGNIARLINH 835
Query: 777 SCEPNCYSRVISVNGDE--HIIIFAKRDIKQWEELTYDYRFFSIDEQLAC 824
SC PNCY+R++SV GDE II+ AKRD+ EELTYDY F DE C
Sbjct: 836 SCMPNCYARIMSV-GDEKSQIILIAKRDVSAGEELTYDY-LFDPDESEDC 883
>gi|222618974|gb|EEE55106.1| hypothetical protein OsJ_02868 [Oryza sativa Japonica Group]
Length = 1032
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 161/494 (32%), Positives = 243/494 (49%), Gaps = 51/494 (10%)
Query: 364 SRPTSKSSLCKLTSKYRDLPGG-YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMM 422
+ P +SS K DL Y V V W ++C+VC E++ N + C++C++
Sbjct: 569 TNPAKRSSQKMKKQKLLDLLSEPYDTVNVKWT-TERCAVCRWVEDWDYNKIVICNRCQIA 627
Query: 423 VHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIW 481
VH CYG + W+C C P+ CCLCPV GGA+KPT D W H+ CA +
Sbjct: 628 VHQECYGVRGKQDFTSWVCRACE--TPDQKRECCLCPVKGGALKPTNVDNLWVHVTCAWF 685
Query: 482 IPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAA 541
P+ + MEP G+ ++ + +C IC +G+C QC C YH +CA A
Sbjct: 686 QPQVAFASDELMEPAIGILNITPLLFMKMCVICRQIHGSCTQCYR--CSTYYHAICASRA 743
Query: 542 GLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQ 592
G R+ + L+++ + Q +S+C H+ P D + + ++LVQ
Sbjct: 744 GY--------RMEIRCLEKNGK-QTTNKISYCAHHRAPNPDNVLIIQTPAGTISSKKLVQ 794
Query: 593 ------VTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLV 646
+R +P + S SE N R R LKR + V
Sbjct: 795 SNGTVAASRLIRKDLPKDSVSEVEISE--NLSAARCR----FYVKKELKRSREGAIAHRV 848
Query: 647 GGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIH 706
G CQ+ I S RD +F + ++ ++++T R+ FG+SGIH
Sbjct: 849 RGSCQHRWDE--------IDSLNPPREERDPESFSTFKERLHYLQKTEHTRVCFGRSGIH 900
Query: 707 GFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATR 766
+G+FA+ + G+MV+EY GE VR S+AD RE + G Y+F+I +E V+DAT
Sbjct: 901 RWGLFARRGIQEGEMVLEYRGEQVRRSVADLREE--QYRVQGKDCYLFKISEEVVVDATD 958
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRF---FSIDEQL 822
G++A LINHSC PNCY+R++SV DE I++ AK+++ EELTYDY F + D ++
Sbjct: 959 KGNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKNVSAGEELTYDYLFDPDEADDRKV 1018
Query: 823 ACYCGFPRCRGVVN 836
C C CR +N
Sbjct: 1019 PCLCQTANCRKFMN 1032
>gi|242044112|ref|XP_002459927.1| hypothetical protein SORBIDRAFT_02g016800 [Sorghum bicolor]
gi|241923304|gb|EER96448.1| hypothetical protein SORBIDRAFT_02g016800 [Sorghum bicolor]
Length = 473
Score = 233 bits (595), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 182/312 (58%), Gaps = 49/312 (15%)
Query: 3 DWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDND 62
DWY G + Y+ + +H VKY DG+ EDL L++ERIKF IS EEM L L F I+N D
Sbjct: 187 DWYKGSITWYNEATKKHSVKYDDGEAEDLSLADERIKFSISSEEMKSLNLKFGISNQDKK 246
Query: 63 GYDYDEMVVLAASLDDCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRS 122
G+D E++ LA S D Q L+PGD++WAK+TGHAMWPA+VVDES + + L I +S
Sbjct: 247 GHD--ELLALAVSFHDYQGLDPGDLVWAKITGHAMWPAVVVDESNVPATRALKSIRLDQS 304
Query: 123 IPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRFTQSLEEAKVYLSEQKLPRRML 182
I VQFFGTHDFAR +LS Q+LP ML
Sbjct: 305 ILVQFFGTHDFAR-----------------------------------FLSTQQLPEIML 329
Query: 183 QLQNAIRADDGENSWSQDE----GSLGSGENCFKDERLQGT---LGSIGISPYVFGDLQI 235
QL+ +++ D + +D+ G+L E +Q T LG++ +S VF I
Sbjct: 330 QLRKSVQHDGSVVNSYEDKVNPCGNLSEDLAVQNGEDVQMTQIELGNLRVSNLVFI-YDI 388
Query: 236 LSL----GKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIRP 291
L++ G+IV DS++F + R IWPEGYTA RK+ S+ADP SYKMEVLR++++K RP
Sbjct: 389 LNIADKNGRIVSDSDHFHNKRDIWPEGYTAFRKYMSIADPHSVTSYKMEVLRNSDTKARP 448
Query: 292 LFRVTLDNGEQF 303
LFRV ++G Q
Sbjct: 449 LFRVISEDGVQL 460
>gi|242053849|ref|XP_002456070.1| hypothetical protein SORBIDRAFT_03g029850 [Sorghum bicolor]
gi|241928045|gb|EES01190.1| hypothetical protein SORBIDRAFT_03g029850 [Sorghum bicolor]
Length = 1051
Score = 233 bits (593), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 163/499 (32%), Positives = 245/499 (49%), Gaps = 53/499 (10%)
Query: 360 GLTKSRPTSKSSLCKLTSKYRDLPGG-YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDK 418
G+ + P+ + S K DL Y PV V W ++C+VC E++ N + C++
Sbjct: 584 GVCPTNPSKRLSQKMKKQKLIDLLNDPYDPVNVKW-TTERCAVCRWVEDWDYNKIVICNR 642
Query: 419 CRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLA 477
C++ VH CYG + W+C C PE CCLCPV GGA+KPT D W H+
Sbjct: 643 CQIAVHQECYGVTGKQDFTSWVCRACE--KPEQKRECCLCPVKGGALKPTNVDNLWVHIT 700
Query: 478 CAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLC 537
CA + P+ + MEP G+ + + +C IC +G+C QC C YH +C
Sbjct: 701 CAWFQPQVAFASDELMEPAVGILNIQPLLFMKMCVICKQIHGSCTQCYR--CSTYYHAIC 758
Query: 538 ARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DE 588
A AG +EL+ E + Q + +S+C +H+ P D + + +
Sbjct: 759 ASRAGYRMELQ---------CLEKNGKQTTKKISYCAQHRSPNPDNVLIIQTPAGTFSSK 809
Query: 589 RLVQ------VTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQ 642
+LVQ +R IP P SE N R R LKR
Sbjct: 810 KLVQSNGKVAASRLIRKDIPLDLPLEVEISE--NLSAARCR----IYVRKDLKRSREGAI 863
Query: 643 PYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGK 702
+ V G CQ+ I ++ RD +F + ++ ++++T R+ FG+
Sbjct: 864 AHRVRGPCQHRWD-----EIDILNPPRE---ERDPESFCTFKERLCYLQKTEHGRVCFGR 915
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
SGIH +G+FA+ + G+MV+EY GE VR S+AD RE + G Y+F+I +E V+
Sbjct: 916 SGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREEKY--RVQGKDCYLFKISEEVVV 973
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE- 820
DAT G++A LINHSC PNCY+R++SV DE I++ A++++ +ELTYDY F DE
Sbjct: 974 DATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIARKNVCAGDELTYDY-LFDPDEA 1032
Query: 821 ---QLACYCGFPRCRGVVN 836
++ C C CR +N
Sbjct: 1033 DERKVPCLCQTANCRKFMN 1051
>gi|293333544|ref|NP_001169542.1| uncharacterized protein LOC100383419 [Zea mays]
gi|193735372|gb|ACF20188.1| histone methylase [Zea mays]
gi|224030013|gb|ACN34082.1| unknown [Zea mays]
Length = 641
Score = 228 bits (582), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 164/499 (32%), Positives = 243/499 (48%), Gaps = 53/499 (10%)
Query: 360 GLTKSRPTSKSSLCKLTSKYRDLPGG-YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDK 418
G+ + P +SS K DL Y PV V W ++C+VC E++ N + C++
Sbjct: 174 GVCPTNPPKRSSQKMKKQKLIDLLNDPYDPVNVKW-TTERCAVCRWVEDWDYNKIVICNR 232
Query: 419 CRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLA 477
C++ VH CYG + W+C C PE CCLCPV GGA+KPT D W H+
Sbjct: 233 CQIAVHQECYGVRGKQDFTSWVCRACE--KPEQKRECCLCPVKGGALKPTNVDNLWVHVT 290
Query: 478 CAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLC 537
CA + + + MEP G+ + + +C IC +G+C QC C YH +C
Sbjct: 291 CAWFQSQVAFASDELMEPAIGILNIQPLLFMKMCVICKQIHGSCTQCYR--CSTYYHAIC 348
Query: 538 ARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DE 588
A AG R+ L L+++ Q + +S+C +H+ P D + + +
Sbjct: 349 ASRAGY--------RMELHCLEKNGR-QTTKKISYCAQHRSPNPDNVLIIQTPAGTFSSK 399
Query: 589 RLVQ------VTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQ 642
+LV +R IP PS SE N R R LKR
Sbjct: 400 KLVHSNGKVAASRLIRKDIPLDLPSEVKISE--NLSAARCR----VYVRKDLKRSREGTI 453
Query: 643 PYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGK 702
+ V G CQ+ I + RD +F + ++ ++++T R+ FG+
Sbjct: 454 VHHVRGPCQHRWEE--------IDNLNPPREERDPESFCTFKERLCYLQKTEHGRVCFGR 505
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
SGIH +G+FA+ + G+MV+EY GE VR S+AD RE + G Y+F+I +E V+
Sbjct: 506 SGIHRWGLFARRDIQEGEMVLEYRGEQVRRSVADLREEKY--RVQGKDCYLFKISEEVVV 563
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE- 820
DAT G++A LINHSC PNCY+R++SV DE I++ AK+ + +ELTYDY F DE
Sbjct: 564 DATDKGNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKKVCAGDELTYDY-LFDPDEA 622
Query: 821 ---QLACYCGFPRCRGVVN 836
++ C C CR +N
Sbjct: 623 DERKVPCLCQTANCRKFMN 641
>gi|159470003|ref|XP_001693149.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158277407|gb|EDP03175.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 1708
Score = 222 bits (565), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 159/522 (30%), Positives = 250/522 (47%), Gaps = 70/522 (13%)
Query: 80 QELEPGDIIWAKLTGHAMWPAIVV--DESLIGDYKGLNKISGGRSIPVQFFGTHDF---- 133
Q L PG+++WA++ A WPAIV+ +E+ G +++ + V+FFG +
Sbjct: 628 QVLRPGEVVWAQVRRSAPWPAIVITREEAEREGLVGGSRLGHNHQVFVRFFGDYTVFALP 687
Query: 134 -----ARI-NVKQVISFLKGLLSSFHLKCKKP----------RFTQSLEEAKVYLSEQKL 177
ARI + V+ FL GL +H + RF ++L E + Y++E +L
Sbjct: 688 AISAGARIPDRGAVVPFLSGLQLGWHARRGAAGGAASSAHAQRFLRALFELRAYMTEGEL 747
Query: 178 PRRML-------QLQNAIRADDGENSWSQDEGSLGSGENCFKDERLQGTLGSIGISPYVF 230
PR M+ + + +++ + G L G++ + V
Sbjct: 748 PRGMIPPNFDEDEDEEEDGSEEIATVAGGRGSAAARGRGGAGTTALDPLAGAVQLPFAVG 807
Query: 231 GDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKIR 290
L+IL+LG++V S +F D+++I+P G+ A R S A + MEV+ +E +R
Sbjct: 808 PKLRILALGEVVWLSRHFHDEKYIYPLGFKAERMMGSGASGGREVMHVMEVI--SEDGVR 865
Query: 291 PLFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFS 350
PLFR+T + + T T S + +D A G A + +G+D+FG S
Sbjct: 866 PLFRITPEGKQPVTADTASKTMRALF----------EEDVRARGRA--FARTGADLFGLS 913
Query: 351 NPEVMKLILGLTKSR---------------------PTSKSSLCKLTSKYRDLPGGYRPV 389
N V L+ L + P ++ L ++ LP G V
Sbjct: 914 NARVAALVRALPGAERCERFANWPADMEKPPPPTLTPYEEAQRRALYARALRLPEGVVGV 973
Query: 390 RVDWKDL-DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLCR-- 445
+ +C VC DEE +++ L+CD CR +VH RCYG + P G LWLC++C+
Sbjct: 974 PQTKAGMCFECEVCGEDEESSDDVKLECDMCRCVVHTRCYGVTQPPPPGALWLCDVCQMH 1033
Query: 446 -PGAPEP-PPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVS 503
G PE PPC LCPV+GGA K T G + HL CA+W P +V +EP++G+ +
Sbjct: 1034 ATGLPEELSPPCELCPVLGGARKRTESGGYVHLLCALWTPGVTFGNVDTLEPVEGVAKAV 1093
Query: 504 KDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCV 545
+ R L CS+C +GACIQC+ C A+HP+CAR AG+ +
Sbjct: 1094 QSRASLRCSLCSQMHGACIQCAGDRCYTAFHPMCAREAGMAL 1135
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/292 (40%), Positives = 160/292 (54%), Gaps = 33/292 (11%)
Query: 571 SFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSN---PSGCARSEPYNYFGRRGRKEPE 627
SF ++ + RL E V+ P++ P+G AR+ P + EPE
Sbjct: 1381 SFVRQLPYLVTGRLQHSEEQVRALAPIAIGAKPTSCPRPTGSARARPDGHV------EPE 1434
Query: 628 AL------------AAASLKRLFVENQP----YLVGGYCQNGLSGNTLPSIRVIGS-KFS 670
A A A+ L V Q +VGG + ++ PS +
Sbjct: 1435 ASSLQPIAGLPRQDAEATASMLPVSTQTPVSGAVVGGCKAAAEAEDSGPSAEAVEHLPLE 1494
Query: 671 FSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELV 730
A S+A++Y M+ T RLA GKS IHG+G FAK PH+ GDM+IEY GEL+
Sbjct: 1495 LLAAATASGARSVAERYWAMRATVSVRLAAGKSAIHGWGAFAKVPHKRGDMLIEYAGELI 1554
Query: 731 RPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVN 790
RP ++D RE +YN LVG GTY+F ++ ++ IDAT+AG++AHL+NHSC+PNCYSR I++
Sbjct: 1555 RPVVSDVREKRMYNDLVGCGTYIFSLNGQQHIDATKAGNMAHLLNHSCDPNCYSRAITLT 1614
Query: 791 ------GDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+H+II AKRD++ WEELTYDYRF S E L C CG CR +VN
Sbjct: 1615 DPLTGATTDHVIITAKRDLQPWEELTYDYRFNSAVE-LPCNCGAASCRLLVN 1665
>gi|224067978|ref|XP_002302628.1| SET domain protein [Populus trichocarpa]
gi|222844354|gb|EEE81901.1| SET domain protein [Populus trichocarpa]
Length = 667
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 149/488 (30%), Positives = 226/488 (46%), Gaps = 71/488 (14%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR 445
Y P+ W ++CSVC E++ +N + C++C++ VH CYG + + W+C
Sbjct: 214 YEPIYAKWTS-ERCSVCRWVEDWDDNKIIICNRCQIAVHQECYGAINVQDFASWVCR--- 269
Query: 446 PGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSK 504
GA+KP+ + W H+ CA + PE + ++MEP G+ R+
Sbjct: 270 ---------------AWGALKPSDIEKLWVHVICAWFQPEVGFLNHEKMEPATGILRIPS 314
Query: 505 DRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
+ C IC +YG+C QC C +H CA AG +EL ++ +
Sbjct: 315 TSFIKRCVICKQTYGSCTQCCK--CATYFHATCASRAGYFMELNCTEKSGM--------- 363
Query: 565 QCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRK 624
Q L +C H++P D + V + N +GC R +
Sbjct: 364 QVTEKLIYCAVHRKPNPDSVVVVRTPSGIFS---GRSFLQNRNGCLRGSRLVSSKKVELP 420
Query: 625 EP--------EALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRD 676
+P E ++AA + N G + L G S+ I S ++ D
Sbjct: 421 DPSTRESNDFEPVSAAKCRAFKRTNYKVSEGEPIFHRLMGPRHDSLHSIISLSTYKETGD 480
Query: 677 APNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIAD 736
+ F S ++ H+++T R+ FGKSGIHG+G+FA+ + G+MVIEY GE VR S+AD
Sbjct: 481 STVFSSFKERLCHLQKTENHRVCFGKSGIHGWGLFARRNIQEGEMVIEYRGEKVRRSVAD 540
Query: 737 RRE-------------------HFIYNSLVGA----GTYMFRIDDERVIDATRAGSIAHL 773
RE + + + GA +F+I +E VIDAT G+IA L
Sbjct: 541 LREARYRLEGKDCYVHHKGFESQDLLSGIEGAMGQLNLPLFKISEEVVIDATNKGNIARL 600
Query: 774 INHSCEPNCYSRVISVNGDEH-IIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGF 828
INHSC PNCY+R++SV E+ I++ AK D+ ELTYDY F DE ++ C C
Sbjct: 601 INHSCMPNCYARIMSVGDVENRIVLIAKTDVSAGNELTYDY-LFDPDERDDLKVPCLCKA 659
Query: 829 PRCRGVVN 836
P CR +N
Sbjct: 660 PNCRKFMN 667
>gi|147843065|emb|CAN83298.1| hypothetical protein VITISV_008292 [Vitis vinifera]
Length = 268
Score = 202 bits (515), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 110/219 (50%), Positives = 137/219 (62%), Gaps = 35/219 (15%)
Query: 97 MWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKC 156
MWPAIVVDES+I + KGLNKIS +S+PVQFFG+HDFAR
Sbjct: 1 MWPAIVVDESIIHNRKGLNKISKEKSLPVQFFGSHDFAR--------------------- 39
Query: 157 KKPRFTQSLEEAKVYLSEQKLPRRMLQLQNAIRADDGENSWSQDEGSLGSGENCFKDERL 216
YLSEQKL +RML++Q DD E+ +DE SG++C DER+
Sbjct: 40 --------------YLSEQKLSKRMLRMQKLTEDDDCESMSGEDEKRTDSGDDCIGDERV 85
Query: 217 QGTLGSIGISPYVFGDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNS 276
+ L + P+ GDLQ++ LGKIVKDS+ FQ + FI PEGYTA+RKFTS+ DP +C
Sbjct: 86 KRKLDDLIKFPFEIGDLQVIRLGKIVKDSDRFQVEGFICPEGYTAMRKFTSITDPSLCAL 145
Query: 277 YKMEVLRDTESKIRPLFRVTLDNGEQFTGSTPSTCWSKI 315
YKMEVLRD ESKI+PLFRVTLDNGEQ S + W ++
Sbjct: 146 YKMEVLRDAESKIQPLFRVTLDNGEQSFMSERNPFWKRV 184
>gi|357436505|ref|XP_003588528.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
gi|355477576|gb|AES58779.1| Histone-lysine N-methyltransferase ATX5 [Medicago truncatula]
Length = 973
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 137/451 (30%), Positives = 211/451 (46%), Gaps = 51/451 (11%)
Query: 346 MFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMD 405
M ++ + + RP+SK KL + +D Y PV W ++C+VC
Sbjct: 553 MLQLADFHAKVAVSSVKPKRPSSKERKHKLLTFLQD---KYEPVCAKW-TTERCAVCRWV 608
Query: 406 EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAM 465
E++ N + C +C++ VH CYG + W+C C P+ CCLCPV GGA+
Sbjct: 609 EDWDYNKIIICVRCQIAVHQECYGARNVRDFTSWVCKACE--TPDITRECCLCPVKGGAL 666
Query: 466 KPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQC 524
KPT W H+ CA + PE ++MEP G+ + + + +C IC +G+C QC
Sbjct: 667 KPTDIHPLWVHVTCAWFRPEVSFASDEKMEPALGILSIPSNSFVKICVICKQIHGSCTQC 726
Query: 525 SNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRL 584
C +H +CA AG +EL E + Q R++ +C H+ P D +
Sbjct: 727 --VKCSTYFHVMCASRAGYRMELHS---------SEKNGKQTTRMVCYCAYHRAPNPDTV 775
Query: 585 AVDER---------LVQVTRRCCDYIPPSN--------PSGCARSEPYNYFGRRGRKEPE 627
+ + L+Q R+ + SN P +P++ R
Sbjct: 776 LIIQTPHGVISTKILIQNKRKVGSRLISSNRIKVEDTPPEDNTEHDPFSA----ARCRVF 831
Query: 628 ALAAASLKRLFVENQPYLVGGYCQNGLSG-NTLPSIRVIGSKFSFSLHRDAPNFLSMADK 686
+ KR E P+ V G+ + L L S +V+ +FS S ++
Sbjct: 832 VRTNHTKKRAADEAIPHKVRGHYHHPLDAIQRLNSSKVVDESQTFS---------SFRER 882
Query: 687 YKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSL 746
H++ T +R+ FG+SGIHG+G+FA+ + G+MV+EY GE VR S+AD RE L
Sbjct: 883 LHHLQRTENERVCFGRSGIHGWGLFARRTIQEGEMVLEYRGEQVRRSVADLRE--ARYKL 940
Query: 747 VGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
Y+F+I +E V+DAT G+IA LINHS
Sbjct: 941 ERKDCYLFKISEEVVVDATDKGNIARLINHS 971
>gi|302849081|ref|XP_002956071.1| histone H3 methyltransferase/Trithorax [Volvox carteri f.
nagariensis]
gi|300258576|gb|EFJ42811.1| histone H3 methyltransferase/Trithorax [Volvox carteri f.
nagariensis]
Length = 2029
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 94/166 (56%), Positives = 122/166 (73%), Gaps = 12/166 (7%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S+A++Y M+ T +RLA GKS IHG+G FAK PH+ GDM+IEY GEL+RPS++D RE
Sbjct: 1715 SVAERYAAMRATVGQRLAAGKSAIHGWGAFAKVPHKRGDMLIEYAGELIRPSVSDVREKR 1774
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVN------GDEHI 795
+YN LVG GTY+F ++D++ IDATRAG++AHL+NHSC+PNCYSR I++ +H+
Sbjct: 1775 MYNKLVGCGTYIFTLNDDQHIDATRAGNMAHLLNHSCDPNCYSRTITLTDPVTRATSDHV 1834
Query: 796 IIFAK-----RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
II AK RDI+ WEELTYDYRF S E+L C CG CR +VN
Sbjct: 1835 IITAKVCVLWRDIEAWEELTYDYRFNS-SEELPCNCGAATCRLLVN 1879
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 152/533 (28%), Positives = 236/533 (44%), Gaps = 102/533 (19%)
Query: 80 QELEPGDIIWAKLTGHAMWPAIVVDESLIGDYK--GLNKISGGRSIPVQFFGTHDFARIN 137
Q L PG+++WA+ A V GD+ GL + G IP +
Sbjct: 772 QVLRPGEVVWAQF-------AFV---RFFGDHTVLGLPVVQPGARIP------------D 809
Query: 138 VKQVISFLKGLLSSFHLK--------------CKKPRFTQSLEEAKVYLSEQKLPRRML- 182
V+ FL GL +H++ RF +++ E + Y++E +LPR M+
Sbjct: 810 RGGVLPFLAGLELGWHVRGVAGAGAAAGGAGGSHTQRFLRAMFELRTYMTEGELPRGMIP 869
Query: 183 -------------QLQNAIRADDGENSWSQDEGS---------LGSGENCFKDERLQGTL 220
+ Q RA D S LG + E T
Sbjct: 870 PNYDEDDEGEDDDEEQQQHRAGSASGRGCGDARSTRRLSVARALGGEGRASQHEAAGTTA 929
Query: 221 GSIGIS-PYVFGD-LQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYK 278
+ I+ P G L++L LG++V S +F D+++I+P GYTA R S A +
Sbjct: 930 AAQDITLPLKLGSKLRVLRLGEVVWLSRWFHDEKYIYPLGYTADRLMVSGASGGREVRHV 989
Query: 279 MEVLRDTESKIRPLFRVTLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEK 338
MEV+ + +RP+FR+T + + T + + +D G A
Sbjct: 990 MEVVASADG-VRPVFRITPEGRLPVSADTATRAMRAL----------FEEDDRVRGRA-- 1036
Query: 339 ISESGSDMFGFSNPEVMKLILGLTKSR---------PTSKSSLCKLT-----------SK 378
+++G+D+FG ++ V L+ L + + L LT ++
Sbjct: 1037 FAKTGADLFGLTHQRVAALLRSLPGAERCERFANWPDQDRPPLPPLTHLEEVQRRAMYAR 1096
Query: 379 YRDLPGGYRPV-RVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGV 437
LP G + V +C VC D E +NL +CD CR +VH+RCYG +P +G
Sbjct: 1097 ALHLPSGVHAIPEVKTGMCFECEVCGEDMESPDNLKFECDLCRCVVHSRCYGVKQPPHGA 1156
Query: 438 LWLCNLCRPGAPEPP----PPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRM 493
LWLC++C+ A P PPC LCPV G MK T G + H+ CA+W P D+ +
Sbjct: 1157 LWLCDVCQLHAAGLPRDRSPPCELCPVASGPMKQTDAGGYVHVLCAVWTPGVTFGDLDSL 1216
Query: 494 EPIDGLNRVSKDRWKLLCSICGVSYGACIQCS-NTTCRVAYHPLCARAAGLCV 545
EP++G+ + ++R L C +C +GACIQC+ + C A+HP+CAR AGL +
Sbjct: 1217 EPVEGVAKAIQNRASLRCFLCKQQHGACIQCAGDARCYTAFHPMCAREAGLAM 1269
>gi|242044114|ref|XP_002459928.1| hypothetical protein SORBIDRAFT_02g016820 [Sorghum bicolor]
gi|241923305|gb|EER96449.1| hypothetical protein SORBIDRAFT_02g016820 [Sorghum bicolor]
Length = 166
Score = 189 bits (480), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 89/112 (79%), Positives = 97/112 (86%), Gaps = 3/112 (2%)
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
L+GAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSR I++ GDEHIIIFAKRDI
Sbjct: 53 LMGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRAITILGDEHIIIFAKRDIDP 112
Query: 806 WEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSELIDW 857
WEELTYDYRFFS D++L CYCGFP+CRGVVND EAEE AK+ R + I W
Sbjct: 113 WEELTYDYRFFSSDQRLPCYCGFPKCRGVVNDVEAEEHAAKI---RVQHIGW 161
>gi|307105682|gb|EFN53930.1| hypothetical protein CHLNCDRAFT_25365, partial [Chlorella
variabilis]
Length = 188
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 97/173 (56%), Positives = 121/173 (69%), Gaps = 6/173 (3%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S A +Y T +R+ GKSGIHG+G FAK H DMVIEY GELVRPS++D RE
Sbjct: 4 SEAARYAECVATLGQRVTIGKSGIHGWGAFAKRRHAEHDMVIEYVGELVRPSVSDLREAR 63
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y+ +VGAGTY+FR++ +DATRAG++AH++NHSC+PNC+SR I V +H+IIFAK+
Sbjct: 64 CYDDMVGAGTYVFRLNKALCVDATRAGNLAHMLNHSCDPNCFSRTIRVV--DHVIIFAKK 121
Query: 802 DIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYAPRSEL 854
DI+ EELTYDYRF +EQL C CG CRG VN+ E QV A RSEL
Sbjct: 122 DIEVAEELTYDYRFCG-EEQLPCNCGAANCRGRVNEKPPEGQV---LARRSEL 170
>gi|218188776|gb|EEC71203.1| hypothetical protein OsI_03117 [Oryza sativa Indica Group]
Length = 1012
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 145/493 (29%), Positives = 224/493 (45%), Gaps = 79/493 (16%)
Query: 364 SRPTSKSSLCKLTSKYRDL-PGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMM 422
+ P +SS K DL Y V V W ++C+VC E++ N + C++C++
Sbjct: 579 TNPAKRSSQKMKKQKLLDLLSEPYDTVNVKW-TTERCAVCRWVEDWDYNKIVICNRCQIA 637
Query: 423 VHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWI 482
VH CYG + W+C C T D +
Sbjct: 638 VHQECYGVRGKQDFTSWVCRACE----------------------TPDQK---------R 666
Query: 483 PETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAG 542
P+ + MEP G+ ++ + +C IC +G+C QC C YH +CA AG
Sbjct: 667 PQVAFASDELMEPAIGILNITPLLFMKMCVICRQIHGSCTQCYR--CSTYYHAICASRAG 724
Query: 543 LCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAV---------DERLVQ- 592
R+ + L+++ + Q +S+C H+ P D + + ++LVQ
Sbjct: 725 Y--------RMEIRCLEKNGK-QTTNKISYCAHHRAPNPDNVLIIQTPAGTISSKKLVQS 775
Query: 593 -----VTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVG 647
+R +P + S SE N R R LKR + V
Sbjct: 776 NGTVAASRLIRKDLPKDSVSEVEISE--NLSAARCR----FYVKKELKRSREGAIAHRVR 829
Query: 648 GYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHG 707
G CQ+ I S RD +F + ++ ++++T R+ FG+SGIH
Sbjct: 830 GSCQHRWDE--------IDSLNPPREERDPESFSTFKERLHYLQKTEHTRVCFGRSGIHR 881
Query: 708 FGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRA 767
+G+FA+ + G+MV+EY GE VR S+AD RE + G Y+F+I +E V+DAT
Sbjct: 882 WGLFARRGIQEGEMVLEYRGEQVRRSVADLREE--QYRVQGKDCYLFKISEEVVVDATDK 939
Query: 768 GSIAHLINHSCEPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRF---FSIDEQLA 823
G++A LINHSC PNCY+R++SV DE I++ AK+++ EELTYDY F + D ++
Sbjct: 940 GNVARLINHSCTPNCYARIMSVGHDESRIVLIAKKNVSAGEELTYDYLFDPDEADDRKVP 999
Query: 824 CYCGFPRCRGVVN 836
C C CR +N
Sbjct: 1000 CLCQTANCRKFMN 1012
>gi|168012124|ref|XP_001758752.1| histone-lysine N-methyltransferase-like protein [Physcomitrella
patens subsp. patens]
gi|162689889|gb|EDQ76258.1| histone-lysine N-methyltransferase-like protein [Physcomitrella
patens subsp. patens]
Length = 2373
Score = 180 bits (456), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 128/383 (33%), Positives = 187/383 (48%), Gaps = 52/383 (13%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+WAH+ C +W+P T ++ M D ++RV+ R K LCS+C GAC+ C C
Sbjct: 2025 QWAHMVCTLWMPGTRCLNMGTMGAFD-VSRVTNSRRKALCSMCQKKGGACMHCRVPKCST 2083
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLV 591
+H CA GL L+ E + +D +Q + C+ H L E ++
Sbjct: 2084 PFHVWCAHEKGL---LQSE-------IVKDGSNQ-VGFFGRCQNHGDFSGTEL---EDVM 2129
Query: 592 QVTRRCCDYIPPSNPSGCARSE-----PYNYFGRRGRKEPEALAAASLKRLFVENQPYLV 646
+ D + ++ CAR+E +G +GR E A A + Q L
Sbjct: 2130 KE-----DSLSSTSGEVCARTEVNLISTALVYGYKGRPSLEERANA-------QQQMTLE 2177
Query: 647 GGYCQNGLSGNTLPSIRVIGSKFSFS-LHRDAPNFLSMADKYKHMKETFR-------KRL 698
G C +R+ G K S LH+ A S+A K+ H KE R K+L
Sbjct: 2178 G--CTAVTPEQVAAWLRISGRKLSTRRLHKSA----SLAMKFDH-KEYLRFKQKKGWKKL 2230
Query: 699 AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE---HFIYNSLVGAGTYMFR 755
A KS IH G++ ++V+EY GE+V +AD+RE H Y+FR
Sbjct: 2231 AVYKSIIHALGLYTTDFIAEREVVVEYVGEIVGHRVADKREVEYHSRKRLQYQGACYLFR 2290
Query: 756 IDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF 815
ID E++IDATR G IA +NHSC PNC ++VI V + +I FAKR+I EE+TYDY+F
Sbjct: 2291 IDTEQIIDATRNGGIARFVNHSCSPNCVAKVICVENLKKVIFFAKRNIDAGEEVTYDYKF 2350
Query: 816 F--SIDEQLACYCGFPRCRGVVN 836
+ +++ C+CG P CRG +N
Sbjct: 2351 NYDEVGDKIPCFCGTPECRGTLN 2373
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 48/126 (38%), Gaps = 12/126 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N L C +C + VH CYG + G W C C+ P C LC
Sbjct: 1826 CDVCGIFSSASYNKLLCCARCPVKVHQACYGVPKVPKGP-WFCRTCKSRVIN--PICVLC 1882
Query: 459 PVVGGAMKPTTDGRWAHLA-CAIW-------IPETCLTDVKRMEPIDGLNRVSKDRWKLL 510
GGAM R L W +PE ++ P+ +N + R +L
Sbjct: 1883 GYGGGAMTRVHKARDFCLGLLQTWRDAKGESVPEP-FSEKNSETPLSAVNNIKFSRGRLS 1941
Query: 511 CSICGV 516
+ GV
Sbjct: 1942 AGLTGV 1947
>gi|412986144|emb|CCO17344.1| predicted protein [Bathycoccus prasinos]
Length = 1990
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 86/195 (44%), Positives = 121/195 (62%), Gaps = 9/195 (4%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLC 444
Y+ + + W +D+C+VC + ++ + + C+ C++ VH CYG E P V WLC C
Sbjct: 782 YKSIHIAWT-VDRCAVCDDERDFDFDQLITCEGCQVSVHQSCYGVHEIPDQAVGWLCRAC 840
Query: 445 RP--GAPEPPPPCCLCPVVGGAMKPTT-DGRWAHLACAIWIPETCLTDVKRMEPIDGLNR 501
G P CCLCPV+GGA+KPTT DG WAH AC WIPET + D++ MEPID +
Sbjct: 841 EHTGGVVSETPKCCLCPVIGGALKPTTVDGVWAHSACCQWIPETTVLDIETMEPIDNIAA 900
Query: 502 VSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDED 561
+ ++RW+LLC+IC G +QC + C +AYHPLCAR AGL ++ DE + DED
Sbjct: 901 IQRERWELLCTICKQRCGTKVQCCHPGCFLAYHPLCARGAGLFMDQGDE----YGNADED 956
Query: 562 DEDQCIRLLSFCKKH 576
ED + L+S+C +H
Sbjct: 957 PEDDTMHLISYCHRH 971
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 88/246 (35%), Positives = 121/246 (49%), Gaps = 30/246 (12%)
Query: 615 YNYFGRRGRKEPEALAAASL-------KRLFVENQPYLVGGYCQNGLSGNTLPS------ 661
Y G + K + L A +L KR +V QPY+V G + GL + S
Sbjct: 1745 YRRLGVQPPKSTDLLKAKNLGPPPKTGKRQYVGLQPYIVRG-TRGGLHRALVTSRSYTPE 1803
Query: 662 -----------IRVIGSKFSFSLHRDAP---NFLSMADKYKHMKETFRKRLAFGKSGIHG 707
R +G + S+ P LS+ ++ T +KRL FGKS IHG
Sbjct: 1804 EWREKYIKSQAARKLGGEHGASVPNKGPLEGTGLSLKERMLECTNTVKKRLTFGKSAIHG 1863
Query: 708 FGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRA 767
+G+ +K P +AG MVI Y GE VR IAD RE G Y+ R DD V+D T
Sbjct: 1864 WGLISKVPIKAGAMVIIYRGEAVRTPIADLREARYERD--GTDCYLLRADDHTVVDCTNM 1921
Query: 768 GSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCG 827
G+IA NHSC+PN Y+++I G+ H+ F + D+ E+TY+YRF D ++ CYC
Sbjct: 1922 GNIARFTNHSCDPNMYTKIIKSGGEHHVCFFTRVDVPAGTEMTYNYRFEIEDGKVPCYCA 1981
Query: 828 FPRCRG 833
CRG
Sbjct: 1982 SHNCRG 1987
>gi|405951732|gb|EKC19620.1| Histone-lysine N-methyltransferase MLL4 [Crassostrea gigas]
Length = 4493
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 95/235 (40%), Positives = 134/235 (57%), Gaps = 24/235 (10%)
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
PP NPSGC RSEPY RK + + +L+ Y + S
Sbjct: 4283 PPENPSGCIRSEPYET-----RKPFDIFS-------------FLMSQYREMPGSKERKND 4324
Query: 662 IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDM 721
+ ++ + D P MA +++ +KE R+ + +S IHG G++ K G+M
Sbjct: 4325 VEMVHKSARRATSMDLP----MAMRFRKLKEHAREAVGVYRSHIHGRGLYCKRNIDEGEM 4380
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEY+GE++R S+ D+RE + G G YMFRIDD VIDAT G+ A INHSCEPN
Sbjct: 4381 VIEYSGEVIRGSLTDKREKYYEGK--GIGCYMFRIDDYDVIDATLHGNAARFINHSCEPN 4438
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
CYS+VI+V+G +HI+IFA + IK+ EELTYDY+F + ++ C CG +CR +N
Sbjct: 4439 CYSKVINVDGKKHIVIFAMKSIKRGEELTYDYKFPIEEVKIPCTCGAKKCRRYLN 4493
>gi|195109821|ref|XP_001999480.1| GI24532 [Drosophila mojavensis]
gi|193916074|gb|EDW14941.1| GI24532 [Drosophila mojavensis]
Length = 3756
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 102/237 (43%), Positives = 133/237 (56%), Gaps = 44/237 (18%)
Query: 609 CARSEPYN---------YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY+ + R RK+P ++FV QP S N L
Sbjct: 3555 CARCEPYSSRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3593
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3594 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3641
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INHSCE
Sbjct: 3642 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHSCE 3699
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3700 PNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3756
>gi|10720313|sp|Q24742.1|TRX_DROVI RecName: Full=Histone-lysine N-methyltransferase trithorax
gi|899254|emb|CAA90349.1| predicted trithorax protein [Drosophila virilis]
Length = 3828
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 102/237 (43%), Positives = 133/237 (56%), Gaps = 44/237 (18%)
Query: 609 CARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY ++ R RK+P ++FV QP S N L
Sbjct: 3627 CARCEPYVSRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3665
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3666 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3713
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INHSCE
Sbjct: 3714 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHSCE 3771
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3772 PNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3828
>gi|195392284|ref|XP_002054789.1| trx [Drosophila virilis]
gi|194152875|gb|EDW68309.1| trx [Drosophila virilis]
Length = 3822
Score = 172 bits (436), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 102/237 (43%), Positives = 133/237 (56%), Gaps = 44/237 (18%)
Query: 609 CARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY ++ R RK+P ++FV QP S N L
Sbjct: 3621 CARCEPYVSRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3659
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3660 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3707
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INHSCE
Sbjct: 3708 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHSCE 3765
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3766 PNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3822
>gi|242005679|ref|XP_002423690.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
gi|212506866|gb|EEB10952.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
Length = 3311
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 100/234 (42%), Positives = 136/234 (58%), Gaps = 33/234 (14%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALA--AASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
NPSGCAR+E + RKE + A A+ ++L P L + TL SI
Sbjct: 3109 NPSGCARTEKFT-----NRKEYDMFAWLASKHRKL-----PKLTES------AEETLCSI 3152
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
R N L MA KY+++KET + + +S IHG G+F AG+MV
Sbjct: 3153 R-------------RANNLPMAMKYRNLKETSKLYVGVYRSQIHGRGLFCLREIEAGEMV 3199
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++D+RE Y + G G YMFRIDD V+DAT G+ A INHSCEPNC
Sbjct: 3200 IEYAGEVIRANLSDKREK--YYTEKGIGCYMFRIDDHFVVDATMKGNAARFINHSCEPNC 3257
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
YSRV+ + G +HI+IFA R I EELTYDY+F DE+++C+C +C+ +N
Sbjct: 3258 YSRVVDILGKKHIVIFALRKINIMEELTYDYKFPFEDEKISCHCLSKKCKKYLN 3311
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 63/154 (40%), Gaps = 43/154 (27%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDGL-----NR 501
C +C VVG + P GR W H CA+W E E IDG +
Sbjct: 1487 CVMCKVVGDGL-PGHTGRLLYCGQNEWVHCNCALWSGEV-------FEEIDGSLQNVHSA 1538
Query: 502 VSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDED 561
+S+ R + C C + GA I C +C+ YH CA+ G C ++D+
Sbjct: 1539 ISRSR-SIKCPECNLK-GASIGCCARSCQETYHFSCAKKLG-CAFMDDKTM--------- 1586
Query: 562 DEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTR 595
+C H + +N++L +E+ ++ R
Sbjct: 1587 ----------YCLAHLKDVNNKLVQNEKEFEIRR 1610
>gi|390178053|ref|XP_003736554.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859306|gb|EIM52627.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 3474
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/241 (42%), Positives = 134/241 (55%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
N CAR EPY ++ R RK+P ++FV QP S
Sbjct: 3269 NAYDCARCEPYATRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------S 3307
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
N L R GS L MA KY+ +KET++ + +S IHG G++
Sbjct: 3308 DNELVPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKD 3355
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A IN
Sbjct: 3356 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFIN 3413
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSCEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +
Sbjct: 3414 HSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYL 3473
Query: 836 N 836
N
Sbjct: 3474 N 3474
>gi|195064789|ref|XP_001996640.1| GH19675 [Drosophila grimshawi]
gi|193892772|gb|EDV91638.1| GH19675 [Drosophila grimshawi]
Length = 3837
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 82/156 (52%), Positives = 108/156 (69%), Gaps = 2/156 (1%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA KY+ +KET++ + +S IHG G++ AG+MVIEY GEL+R ++ D+RE
Sbjct: 3684 LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRER 3743
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMF+IDD V+DAT G+ A INHSCEPNCYS+V+ + G +HIIIFA
Sbjct: 3744 Y-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFAL 3801
Query: 801 RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3802 RRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3837
>gi|198452207|ref|XP_002137435.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198131831|gb|EDY67993.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 3779
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 103/241 (42%), Positives = 134/241 (55%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
N CAR EPY ++ R RK+P ++FV QP S
Sbjct: 3574 NAYDCARCEPYATRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------S 3612
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
N L R GS L MA KY+ +KET++ + +S IHG G++
Sbjct: 3613 DNELVPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKD 3660
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A IN
Sbjct: 3661 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFIN 3718
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSCEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +
Sbjct: 3719 HSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYL 3778
Query: 836 N 836
N
Sbjct: 3779 N 3779
>gi|168007105|ref|XP_001756249.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
patens]
gi|162692759|gb|EDQ79115.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
patens]
Length = 1900
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 179/384 (46%), Gaps = 37/384 (9%)
Query: 460 VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYG 519
V +K + +WAH+ C +W+P T ++ M D ++ V K+LCSIC G
Sbjct: 1547 TVRSVLKDSGAKQWAHMVCTLWMPGTRCLNMGTMGVFD-VSGVKVSLRKMLCSICRRKGG 1605
Query: 520 ACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQP 579
ACIQC C +H LCA GL L+ E + D + C H
Sbjct: 1606 ACIQCRVPKCSTPFHVLCAHEKGL---LQSETVM--------DGSNQVGFFGKCLNHGDS 1654
Query: 580 LNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFV 639
+ L + L +V R P+ CAR+E K + A +R+ +
Sbjct: 1655 SVNEL---DDLTKVGLR------PARREVCARTEISESV--YTEKYLKTFLWAFHQRMTL 1703
Query: 640 ENQPYLVGGYCQNGLSGNT--LPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR 697
E + L + +P+ R++ S ++ D +L K K
Sbjct: 1704 EGAMAVTPEQVAAALRIDVRKMPTRRLLKPTNS-AMKCDHREYLRFKQKRGWTK------ 1756
Query: 698 LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE---HFIYNSLVGAGTYMF 754
L KSGIH G++ G++V+EY GE+V +AD+RE H Y+F
Sbjct: 1757 LGVYKSGIHALGLYTTDFIAEGEVVVEYVGEIVGSRVADKREAEYHSGKRLQYQGACYLF 1816
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID E++IDATR G IA +NHSC PNC ++VI V + ++ FAKRDI EE+TYDY+
Sbjct: 1817 RIDTEQIIDATRKGGIARFVNHSCSPNCVAKVICVENLKKVVFFAKRDIYAGEEVTYDYK 1876
Query: 815 FF--SIDEQLACYCGFPRCRGVVN 836
F + +++ C+CG P CRG +N
Sbjct: 1877 FNCDEVGDKIPCFCGTPECRGTLN 1900
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 55/136 (40%), Gaps = 13/136 (9%)
Query: 363 KSRPT-SKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRM 421
K++PT SK + K R + + + + + +KC VC N L C +C +
Sbjct: 1324 KAKPTFSKEMMGSYLKKERSV-ASFEIKKPNIHERNKCDVCGAFTSVSYNKLLCCSRCPV 1382
Query: 422 MVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGR-WAHLACAI 480
VH CYG + G W C C+ C LC GGAM R + H +
Sbjct: 1383 KVHQACYGVPKIPKGP-WSCRTCKFKIKN--SICVLCGYGGGAMTRVHKARSFCHGLLQV 1439
Query: 481 W-------IPETCLTD 489
W +P++ L D
Sbjct: 1440 WRDMKDENMPDSSLAD 1455
>gi|255075355|ref|XP_002501352.1| set domain protein [Micromonas sp. RCC299]
gi|226516616|gb|ACO62610.1| set domain protein [Micromonas sp. RCC299]
Length = 2166
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 97/219 (44%), Positives = 133/219 (60%), Gaps = 19/219 (8%)
Query: 377 SKYRDLPGG----YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE 432
SKY L G Y+ VRV W +D+C+VC D ++ + + CD C + VH CYG E
Sbjct: 687 SKYPALGGAKKRSYQMVRVQWS-VDRCAVCDDDRDFDFDQLVTCDGCGISVHQSCYGIPE 745
Query: 433 -PVNGVLWLCNLCR--PGAPEPPPPCCLCPVVGGAMKPTTD-GRWAHLACAIWIPETCLT 488
P + V +LCN C G P C LCPV GGA+KPTT G W H AC WIPET +
Sbjct: 746 IPDDAVGFLCNACEHTGGDTSETPLCVLCPVEGGALKPTTKPGVWCHSACCQWIPETTVV 805
Query: 489 DVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
DV RMEPID ++ + ++RW+LLC++C GA IQC + C ++YHPLCARAAGL +E
Sbjct: 806 DVDRMEPIDQIHTIQRERWELLCTVCKQRMGAKIQCDHPGCYLSYHPLCARAAGLFME-- 863
Query: 549 DEDRLNLLSLDED-DEDQCIRLLSFCKKHKQPLNDRLAV 586
+LD+D DED ++++S+C +H + +R A+
Sbjct: 864 -------ANLDDDGDEDSPLQMVSYCHRHCRVDTERAAL 895
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 91/141 (64%), Gaps = 2/141 (1%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
R+RL FGKS IHG+G+ AK +AG MV+EY GE +RPS+AD RE + G Y+
Sbjct: 2027 RERLTFGKSNIHGWGLIAKQFLKAGSMVVEYVGERLRPSVADLREKVYERT--GHDVYLL 2084
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
DD+ VID T GSIA NHSC PN Y+++++++GD I F + D+ +ELTY+YR
Sbjct: 2085 AADDKTVIDTTVKGSIARFTNHSCTPNMYTKLVALDGDSRIFFFTRIDVHPGQELTYNYR 2144
Query: 815 FFSIDEQLACYCGFPRCRGVV 835
F + ++ CYCG CRG +
Sbjct: 2145 FDAESGKVPCYCGANNCRGFL 2165
>gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052594|emb|CAA90514.1| trithorax protein trxI [Drosophila melanogaster]
Length = 3358
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 132/237 (55%), Gaps = 44/237 (18%)
Query: 609 CARSEPYN---------YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY+ + R RK+P ++FV QP S N L
Sbjct: 3157 CARCEPYSNRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3195
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3196 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3243
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INH CE
Sbjct: 3244 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHCCE 3301
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3302 PNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3358
>gi|158818|gb|AAA29025.1| zinc-binding protein [Drosophila melanogaster]
Length = 3759
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 132/237 (55%), Gaps = 44/237 (18%)
Query: 609 CARSEPYN---------YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY+ + R RK+P ++FV QP S N L
Sbjct: 3558 CARCEPYSNRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3596
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3597 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3644
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INH CE
Sbjct: 3645 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHCCE 3702
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3703 PNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3759
>gi|122937787|gb|ABM68621.1| AAEL000054-PA [Aedes aegypti]
Length = 3489
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 132/241 (54%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
NP G AR EPY ++ R RK+P + A S+ + + S
Sbjct: 3284 NPYGAARCEPYSSRSEYDMFSWLASRHRKQPMPVVAQSIDDTVIPRR-----------GS 3332
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
G+ LP MA +Y+ +KET ++ + +S IHG G+F
Sbjct: 3333 GSNLP----------------------MAMRYRTLKETSKESVGVYRSHIHGRGLFCNRD 3370
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+ID+ V+DAT G+ A IN
Sbjct: 3371 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDEHFVVDATMRGNAARFIN 3428
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSCEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F D ++ C CG +CR +
Sbjct: 3429 HSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDVKIPCSCGSKKCRKYL 3488
Query: 836 N 836
N
Sbjct: 3489 N 3489
>gi|17136558|ref|NP_476770.1| trithorax, isoform B [Drosophila melanogaster]
gi|19550181|ref|NP_599108.1| trithorax, isoform C [Drosophila melanogaster]
gi|62472551|ref|NP_001014621.1| trithorax, isoform E [Drosophila melanogaster]
gi|23171245|gb|AAN13600.1| trithorax, isoform B [Drosophila melanogaster]
gi|23171246|gb|AAN13601.1| trithorax, isoform C [Drosophila melanogaster]
gi|61679333|gb|AAX52951.1| trithorax, isoform E [Drosophila melanogaster]
Length = 3358
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 132/237 (55%), Gaps = 44/237 (18%)
Query: 609 CARSEPYN---------YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY+ + R RK+P ++FV QP S N L
Sbjct: 3157 CARCEPYSNRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3195
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3196 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3243
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INH CE
Sbjct: 3244 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHCCE 3301
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3302 PNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3358
>gi|469800|emb|CAA83516.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052593|emb|CAA90513.1| trithorax protein trxII [Drosophila melanogaster]
gi|1311653|gb|AAB35873.1| large trx isoform=trithorax gene product large isoform {alternatively
spliced, exon II-containing isoform} [Drosophila,
embryos, Peptide, 3726 aa]
Length = 3726
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 132/237 (55%), Gaps = 44/237 (18%)
Query: 609 CARSEPYN---------YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY+ + R RK+P ++FV QP S N L
Sbjct: 3525 CARCEPYSNRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3563
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3564 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3611
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INH CE
Sbjct: 3612 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHCCE 3669
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3670 PNCYSKVVDILGHKHIIIFAVRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3726
>gi|157103255|ref|XP_001647894.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108884726|gb|EAT48951.1| AAEL000054-PA, partial [Aedes aegypti]
Length = 3069
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 132/241 (54%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
NP G AR EPY ++ R RK+P + A S+ + + S
Sbjct: 2864 NPYGAARCEPYSSRSEYDMFSWLASRHRKQPMPVVAQSIDDTVIPRRG-----------S 2912
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
G+ LP MA +Y+ +KET ++ + +S IHG G+F
Sbjct: 2913 GSNLP----------------------MAMRYRTLKETSKESVGVYRSHIHGRGLFCNRD 2950
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+ID+ V+DAT G+ A IN
Sbjct: 2951 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDEHFVVDATMRGNAARFIN 3008
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSCEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F D ++ C CG +CR +
Sbjct: 3009 HSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDVKIPCSCGSKKCRKYL 3068
Query: 836 N 836
N
Sbjct: 3069 N 3069
>gi|17136556|ref|NP_476769.1| trithorax, isoform D [Drosophila melanogaster]
gi|19550184|ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster]
gi|290457684|sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName:
Full=Lysine N-methyltransferase 2A
gi|10726522|gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster]
gi|23171244|gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster]
Length = 3726
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 132/237 (55%), Gaps = 44/237 (18%)
Query: 609 CARSEPYN---------YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY+ + R RK+P ++FV QP S N L
Sbjct: 3525 CARCEPYSNRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3563
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3564 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3611
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INH CE
Sbjct: 3612 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHCCE 3669
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 3670 PNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 3726
>gi|291240989|ref|XP_002740398.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like,
partial [Saccoglossus kowalevskii]
Length = 357
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 99/238 (41%), Positives = 136/238 (57%), Gaps = 23/238 (9%)
Query: 599 DYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNT 658
D P NPSGCAR+E RR + + A+ + L V +
Sbjct: 143 DEEPVVNPSGCARAE---VIARRSKFDMFNFLASKHRSLVVRT---------------DN 184
Query: 659 LPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRA 718
+ I K S R L MA +++H+++T ++ + +SGIHG G+F K P A
Sbjct: 185 MDDEEEIQHK---SARRATSMDLPMAMRFRHLQKTSKEAVGVYRSGIHGRGLFCKRPIDA 241
Query: 719 GDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 778
+MVIEY G ++R S+ D+RE + Y+S G G YMFRIDD V+DAT G+ A INHSC
Sbjct: 242 AEMVIEYAGMVIRSSLTDKREKY-YDS-KGIGCYMFRIDDYDVVDATMHGNAARFINHSC 299
Query: 779 EPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EPNC+SRVI V+G +HI+IFA R I EELTYDY+F +E++ C CG +CR +N
Sbjct: 300 EPNCFSRVIQVDGKKHIVIFASRKIMPGEELTYDYKFPFEEEKIPCTCGSKKCRKHLN 357
>gi|194764639|ref|XP_001964436.1| GF23177 [Drosophila ananassae]
gi|190614708|gb|EDV30232.1| GF23177 [Drosophila ananassae]
Length = 3708
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 81/156 (51%), Positives = 108/156 (69%), Gaps = 2/156 (1%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA KY+ +KET++ + +S IHG G++ AG+MVIEY GEL+R ++ D+RE
Sbjct: 3555 LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRER 3614
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMF+IDD V+DAT G+ A INHSCEPNCYS+V+ + G +HIIIFA
Sbjct: 3615 Y-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFAL 3672
Query: 801 RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
R I Q EELTYDY+F +E++ C CG RCR +N
Sbjct: 3673 RRIVQGEELTYDYKFPFEEEKIPCSCGSKRCRKYLN 3708
>gi|15292119|gb|AAK93328.1| LD39445p [Drosophila melanogaster]
Length = 751
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 102/241 (42%), Positives = 133/241 (55%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPYN---------YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
N CAR EPY+ + R RK+P ++FV QP S
Sbjct: 546 NAYDCARCEPYSNRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------S 584
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
N L R GS L MA KY+ +KET++ + +S IHG G++
Sbjct: 585 DNELVPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKD 632
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A IN
Sbjct: 633 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFIN 690
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
H CEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +
Sbjct: 691 HCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYL 750
Query: 836 N 836
N
Sbjct: 751 N 751
>gi|194900731|ref|XP_001979909.1| GG21380 [Drosophila erecta]
gi|190651612|gb|EDV48867.1| GG21380 [Drosophila erecta]
Length = 3741
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/237 (42%), Positives = 132/237 (55%), Gaps = 44/237 (18%)
Query: 609 CARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL 659
CAR EPY ++ R RK+P ++FV QP S N L
Sbjct: 3540 CARCEPYASRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------SDNEL 3578
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
R GS L MA KY+ +KET++ + +S IHG G++ AG
Sbjct: 3579 VPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAG 3626
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A INH CE
Sbjct: 3627 EMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHCCE 3684
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
PNCYS+V+ + G +HIIIFA R I Q EELTYDY+F +E++ C CG RCR +N
Sbjct: 3685 PNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKIPCSCGSKRCRKYLN 3741
>gi|195145308|ref|XP_002013638.1| GL23289 [Drosophila persimilis]
gi|194102581|gb|EDW24624.1| GL23289 [Drosophila persimilis]
Length = 293
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 103/241 (42%), Positives = 134/241 (55%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
N CAR EPY ++ R RK+P ++FV QP S
Sbjct: 88 NAYDCARCEPYATRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------S 126
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
N L R GS L MA KY+ +KET++ + +S IHG G++
Sbjct: 127 DNELVPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKD 174
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A IN
Sbjct: 175 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFIN 232
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSCEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F DE++ C CG RCR +
Sbjct: 233 HSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYL 292
Query: 836 N 836
N
Sbjct: 293 N 293
>gi|195446231|ref|XP_002070688.1| GK10891 [Drosophila willistoni]
gi|194166773|gb|EDW81674.1| GK10891 [Drosophila willistoni]
Length = 447
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 102/241 (42%), Positives = 134/241 (55%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
N CAR EPY ++ R RK+P ++FV QP S
Sbjct: 242 NAYDCARCEPYATRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------S 280
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
N L R GS L MA KY+ +KET++ + +S IHG G++
Sbjct: 281 DNELVPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKD 328
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A IN
Sbjct: 329 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFIN 386
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSCEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F +E++ C CG RCR +
Sbjct: 387 HSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKIPCSCGSKRCRKYL 446
Query: 836 N 836
N
Sbjct: 447 N 447
>gi|347968475|ref|XP_563394.4| AGAP002741-PA [Anopheles gambiae str. PEST]
gi|333467986|gb|EAL40845.4| AGAP002741-PA [Anopheles gambiae str. PEST]
Length = 4925
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 94/241 (39%), Positives = 132/241 (54%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
NP G AR EPY ++ R RK+P + A ++ + + S
Sbjct: 4720 NPHGTARCEPYGTRSEYDMFSWLASRHRKQPMPIVAQNIDDTIIPRR-----------GS 4768
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
G+ LP MA +Y+ +KE+ ++ + +S IHG G+F
Sbjct: 4769 GSNLP----------------------MAMRYRTLKESSKESVGVYRSHIHGRGLFCNRD 4806
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+ID+ V+DAT G+ A IN
Sbjct: 4807 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDENFVVDATMRGNAARFIN 4864
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSCEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F D ++ C CG +CR +
Sbjct: 4865 HSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDVKIPCSCGSKKCRKYL 4924
Query: 836 N 836
N
Sbjct: 4925 N 4925
>gi|301622725|ref|XP_002940678.1| PREDICTED: hypothetical protein LOC100144721 [Xenopus (Silurana)
tropicalis]
Length = 2771
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 132/244 (54%), Gaps = 43/244 (17%)
Query: 604 SNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGL 654
+NPSGCARSE Y N+ + R PE G C+
Sbjct: 2560 TNPSGCARSEVYVRKSTFDMFNFLASQHRTLPEI-------------------GPCEEEE 2600
Query: 655 SGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKH 714
L S R R L MA +++H+K+T ++ + +S IHG G+F K
Sbjct: 2601 EEVQLKSTR-----------RATSLDLPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKR 2649
Query: 715 PHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI 774
AG+MVIEY+G ++R + D+RE F Y+S G G YMFRIDD V+DAT G+ A I
Sbjct: 2650 NIDAGEMVIEYSGIVIRSVLTDKREKF-YDS-KGIGCYMFRIDDFDVVDATMHGNAARFI 2707
Query: 775 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRCR 832
NHSCEPNCYSRVI V G +HI+IFA R I + EELTYDY+F D +L C CG +CR
Sbjct: 2708 NHSCEPNCYSRVIHVEGQKHIVIFALRSIYRGEELTYDYKFPIEDASNKLPCNCGAKKCR 2767
Query: 833 GVVN 836
+N
Sbjct: 2768 RFLN 2771
>gi|427794953|gb|JAA62928.1| hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 1557
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 77/155 (49%), Positives = 109/155 (70%), Gaps = 2/155 (1%)
Query: 683 MADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
++D K + FRK+ L F KS IH +G+FA P A +MVIEY G++VRP +ADRRE F
Sbjct: 1404 ISDLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQMVRPIMADRREQF 1463
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + +Y+FR+D E +IDAT+ G++A INHSC PNCY++VI+V G + I+I++K+
Sbjct: 1464 -YTQIGIGSSYLFRVDVETIIDATKCGNLARFINHSCNPNCYAKVITVEGQKKIVIYSKQ 1522
Query: 802 DIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I EE+TYDY+F DE++ C CG P+CRG +N
Sbjct: 1523 PINVNEEITYDYKFPLEDEKIVCLCGAPQCRGFLN 1557
>gi|166796317|gb|AAI59185.1| mll4 protein [Xenopus (Silurana) tropicalis]
Length = 1622
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 102/244 (41%), Positives = 132/244 (54%), Gaps = 43/244 (17%)
Query: 604 SNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGL 654
+NPSGCARSE Y N+ + R PE G C+
Sbjct: 1411 TNPSGCARSEVYVRKSTFDMFNFLASQHRTLPEI-------------------GPCEEEE 1451
Query: 655 SGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKH 714
L S R R L MA +++H+K+T ++ + +S IHG G+F K
Sbjct: 1452 EEVQLKSTR-----------RATSLDLPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKR 1500
Query: 715 PHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLI 774
AG+MVIEY+G ++R + D+RE F Y+S G G YMFRIDD V+DAT G+ A I
Sbjct: 1501 NIDAGEMVIEYSGIVIRSVLTDKREKF-YDS-KGIGCYMFRIDDFDVVDATMHGNAARFI 1558
Query: 775 NHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRCR 832
NHSCEPNCYSRVI V G +HI+IFA R I + EELTYDY+F D +L C CG +CR
Sbjct: 1559 NHSCEPNCYSRVIHVEGQKHIVIFALRSIYRGEELTYDYKFPIEDASNKLPCNCGAKKCR 1618
Query: 833 GVVN 836
+N
Sbjct: 1619 RFLN 1622
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 54/124 (43%), Gaps = 25/124 (20%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVA 532
W H+ CAIW E + ++ + V++ R +L C C GA + C +TC
Sbjct: 445 WTHINCAIWSAEVFEENDGSLKNVHAA--VARGR-QLRCDNCQ-KIGATVGCCLSTCHSN 500
Query: 533 YHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQ 592
+H +CARA+ C +D+ ++ FC KH + L D V+E
Sbjct: 501 FHFMCARASRCC--FQDDKKM------------------FCSKHTK-LLDGTPVEEDCFD 539
Query: 593 VTRR 596
V RR
Sbjct: 540 VLRR 543
>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
occidentalis]
Length = 2686
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 91/232 (39%), Positives = 137/232 (59%), Gaps = 23/232 (9%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NPSGCAR EP++ G+R + + + + + P + ++
Sbjct: 2478 NPSGCARMEPFD--GQRSDHDMFSWLTS-----YHRHPPRFSSPTSDHDVTPTC------ 2524
Query: 665 IGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIE 724
SK SL D P+ + +Y+H+KET ++ +SGIHG G++ K G+M+IE
Sbjct: 2525 --SKRQTSL--DLPDGM----RYRHLKETTKQVAGVYRSGIHGRGLYCKKDIAKGEMIIE 2576
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y GE++R S+ DRRE + G G YMFR+D++ V+DAT G+ A INHSC+PNCYS
Sbjct: 2577 YAGEVIRASLCDRREKYYEGR--GLGCYMFRMDNDEVVDATVKGNAARFINHSCDPNCYS 2634
Query: 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
++I+V+ +HI+I+A R+I+ EELTYDY+F D++L C CG RCR +N
Sbjct: 2635 KMITVDNKKHIVIYALREIRTGEELTYDYKFPIEDDKLHCTCGSRRCRKFMN 2686
>gi|145344711|ref|XP_001416870.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577096|gb|ABO95163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1782
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 86/195 (44%), Positives = 118/195 (60%), Gaps = 14/195 (7%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLC 444
Y V+V W +D+C+VC + ++ + + C+ C + VH CYG + P + V WLC C
Sbjct: 613 YEIVQVSWS-VDRCAVCDDERDFDFDQLITCEACAVTVHQSCYGVPDIPDDTVGWLCRSC 671
Query: 445 R--PGAPEPPPPCCLCPVVGGAMKPTT-DGRWAHLACAIWIPETCLTDVKRMEPIDGLNR 501
GA P CCLCPV GGA+KPTT WAH AC WIPET + D++RMEPID +
Sbjct: 672 EHTGGAVSETPLCCLCPVAGGALKPTTIPSLWAHSACCQWIPETTVLDIERMEPIDNIAN 731
Query: 502 VSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDED 561
+ K+RW LLC++C GA IQC + C +AYHPLCARA GL + + D D
Sbjct: 732 IQKERWSLLCTVCKQRMGAKIQCCHPGCYIAYHPLCARATGLYM---------DANDDGD 782
Query: 562 DEDQCIRLLSFCKKH 576
D++ ++LLS+C +H
Sbjct: 783 DDESPLQLLSYCHRH 797
Score = 149 bits (375), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 80/213 (37%), Positives = 117/213 (54%), Gaps = 17/213 (7%)
Query: 635 KRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFS-FSLHRDAPNF------------- 680
KRL+V PY+V G + + S + +F+ R A
Sbjct: 1570 KRLYVGLTPYIVRG-SRGAFKRELIASKKYTPEEFAAVQAERKARGVNGASDDKVVEQHG 1628
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L+M +K + M ++ RL F KS IHG+G+ AK H+AG +V ++ GE R ++AD RE
Sbjct: 1629 LTMREKLEQMTASYSDRLTFAKSNIHGWGLVAKVFHKAGSIVTQFKGETCRSTVADLRET 1688
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F ++ G Y+ + DD+ V+D T G+ A NHSC PN YS+++ V+ HII FA+
Sbjct: 1689 FYEDN--GVDCYLLKQDDDTVVDCTFQGNFARFTNHSCNPNMYSKIVKVDDANHIIFFAR 1746
Query: 801 RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRG 833
D++ EELTY+YRF S D ++ CYCG CRG
Sbjct: 1747 TDVRPGEELTYNYRFESEDGKVPCYCGADNCRG 1779
>gi|383849246|ref|XP_003700256.1| PREDICTED: uncharacterized protein LOC100875701 [Megachile rotundata]
Length = 1503
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 96/244 (39%), Positives = 144/244 (59%), Gaps = 20/244 (8%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKR---LFVENQPYLVGGYCQNGLSGNTLPSIR 663
SG AR+E Y R + + + A S++R + N PY G NG NT +
Sbjct: 1266 SGSARTEGYYKVDVREKAKHKHHYAQSIQRSNDVEDSNGPYAGGDGTMNGPKNNT----K 1321
Query: 664 VIGSKFSFSLHRDAPN----FLSM------ADKYKHMKETFRKR-LAFGKSGIHGFGIFA 712
+ K +L R+A + L+ +D K + FRK+ L F KSGIH +G+FA
Sbjct: 1322 TLTGKMQ-ALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFA 1380
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
P A +MVIEY G++VRP +AD RE + +G+ +Y+FRID + +IDAT+ G++A
Sbjct: 1381 MEPIAADEMVIEYVGQMVRPVVADLRESQYEATGIGS-SYLFRIDLDTIIDATKCGNLAR 1439
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCR 832
INHSC PNCY++VI++ + I+I++K+ I EE+TYDY+F D+++ C CG P+CR
Sbjct: 1440 FINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCR 1499
Query: 833 GVVN 836
G +N
Sbjct: 1500 GTLN 1503
>gi|308801407|ref|XP_003078017.1| trithorax-like (ISS) [Ostreococcus tauri]
gi|116056468|emb|CAL52757.1| trithorax-like (ISS) [Ostreococcus tauri]
Length = 2007
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 85/195 (43%), Positives = 116/195 (59%), Gaps = 14/195 (7%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLC 444
Y V+V W +D+C+VC + ++ + + C+ C + VH CYG + P + V WLC C
Sbjct: 571 YDVVKVSW-SVDRCAVCDDERDFDYDQLITCEACAVTVHQSCYGVPDIPDDTVGWLCRAC 629
Query: 445 RP--GAPEPPPPCCLCPVVGGAMKPTT-DGRWAHLACAIWIPETCLTDVKRMEPIDGLNR 501
GA P CCLCPV GGA+KPTT WAH AC WIPET + D++RMEPID +
Sbjct: 630 EHTGGAVSETPLCCLCPVEGGALKPTTIPSLWAHSACCQWIPETTVLDIERMEPIDNIAN 689
Query: 502 VSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDED 561
+ K+RW LLC++C GA IQC + C +AYHPLCARA GL ++ D+ + L
Sbjct: 690 IQKERWTLLCTVCKQRMGAKIQCCHPGCYIAYHPLCARATGLYMDANDDGEDDDSPL--- 746
Query: 562 DEDQCIRLLSFCKKH 576
+LLS+C +H
Sbjct: 747 ------QLLSYCHRH 755
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/202 (38%), Positives = 111/202 (54%), Gaps = 17/202 (8%)
Query: 635 KRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFS-FSLHRDA-------------PNF 680
KRL+VE PY+V + + S + +F+ R A
Sbjct: 1528 KRLYVELTPYIVRAHAARS-KAKLIASKKYTPDEFAALQAERKARGSAGASEDKIVEQQG 1586
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
++M +K +HM T+ RL F KS IHG+G+ AK H+AG +V E+ GE R ++AD RE
Sbjct: 1587 MTMREKLEHMTATYSDRLTFCKSNIHGWGLLAKTAHKAGSIVTEFKGETCRSTVADMRET 1646
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
G Y+ + DD+ V+D T G++A NHSC PN YS+++ V+G+ HII FA+
Sbjct: 1647 AYEEE--GVDCYLLKQDDDTVVDCTFQGNLARFTNHSCNPNMYSKIVKVDGENHIIFFAR 1704
Query: 801 RDIKQWEELTYDYRFFSIDEQL 822
DIK EELTY+YRF S D ++
Sbjct: 1705 NDIKAGEELTYNYRFESEDGKV 1726
>gi|326676474|ref|XP_003200588.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Danio
rerio]
Length = 820
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 103/277 (37%), Positives = 143/277 (51%), Gaps = 55/277 (19%)
Query: 571 SFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPY---------NYFGRR 621
S C+KH+ + +E L P NP+GCAR+E Y N+ +
Sbjct: 588 SLCQKHRFRFHQHEIPEEEL------------PENPNGCARAEVYVRKSTFDMFNFLASQ 635
Query: 622 GRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFL 681
R+ PE+ C + L S R R L
Sbjct: 636 HRQLPESRP-------------------CDDDEDDIMLKSSR-----------RATSTEL 665
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
+A +++H+++T ++ + +S IHG G+F K AG+MVIEY G ++R + D+RE +
Sbjct: 666 PVAMRFRHLEKTSKEAVGVYRSAIHGRGLFCKRNIEAGEMVIEYAGNVIRAVLTDKREKY 725
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y+S G G YMFRIDD V+DAT G+ A INHSC+PNCYSRVI+V G +HI+IFA R
Sbjct: 726 -YDSK-GIGCYMFRIDDFDVVDATMHGNAARFINHSCDPNCYSRVINVEGQKHIVIFALR 783
Query: 802 DIKQWEELTYDYRFF--SIDEQLACYCGFPRCRGVVN 836
I + EELTYDY+F D +L C CG RCR +N
Sbjct: 784 KIYRGEELTYDYKFPIEDADNKLHCNCGARRCRRFLN 820
>gi|340721798|ref|XP_003399301.1| PREDICTED: hypothetical protein LOC100643294 [Bombus terrestris]
Length = 1502
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 95/244 (38%), Positives = 144/244 (59%), Gaps = 20/244 (8%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKR---LFVENQPYLVGGYCQNGLSGNTLPSIR 663
SG AR+E Y R + + + A S++R + N PY G NG NT +
Sbjct: 1265 SGSARTEGYYKVDVREKAKHKHHYAQSIQRSNDVEDSNGPYAGGDSIMNGPKNNT----K 1320
Query: 664 VIGSKFSFSLHRDAPN----FLSM------ADKYKHMKETFRKR-LAFGKSGIHGFGIFA 712
+ K +L R+A + L+ +D K + FRK+ L F KSGIH +G+FA
Sbjct: 1321 ALTGKMQ-ALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFA 1379
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
P A +MVIEY G+++RP +AD RE + +G+ +Y+FRID + +IDAT+ G++A
Sbjct: 1380 MEPIAADEMVIEYVGQMIRPVVADLRESQYEATGIGS-SYLFRIDLDTIIDATKCGNLAR 1438
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCR 832
INHSC PNCY++VI++ + I+I++K+ I EE+TYDY+F D+++ C CG P+CR
Sbjct: 1439 FINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCR 1498
Query: 833 GVVN 836
G +N
Sbjct: 1499 GTLN 1502
>gi|410905295|ref|XP_003966127.1| PREDICTED: uncharacterized protein LOC101073293 [Takifugu rubripes]
Length = 3463
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 99/245 (40%), Positives = 137/245 (55%), Gaps = 43/245 (17%)
Query: 603 PSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNG 653
P NPSGCARSE Y N+ + R+ P+ +G Y
Sbjct: 3251 PVNPSGCARSEMYLRKSTFDIFNFLASQHRQLPD------------------IGPY---- 3288
Query: 654 LSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
+ + + ++ + SL L MA +++H+++T ++ + +S IHG G+F K
Sbjct: 3289 --DDEEDEVLLKSTRRATSLE------LPMAMRFRHLEKTSKEAVGVYRSAIHGRGLFCK 3340
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
AG+MVIEY G ++R + D+RE + Y+S G G YMFRIDD V+DAT G+ A
Sbjct: 3341 RNIEAGEMVIEYAGIVIRSVLTDKREKY-YDS-KGIGCYMFRIDDFDVVDATMHGNAARF 3398
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRC 831
INHSCEPNCYSRVI+V G +HI+IFA R I + EELTYDY+F D +L C CG RC
Sbjct: 3399 INHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYDYKFPIEDASNKLNCNCGARRC 3458
Query: 832 RGVVN 836
R +N
Sbjct: 3459 RRFLN 3463
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 45/94 (47%), Gaps = 13/94 (13%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDR 506
C LC G +KP GR WAH+ C++W E + + + + V++ R
Sbjct: 2155 CSLCQKYGD-LKPNEAGRLLYLGQNEWAHVNCSLWSAEVFEEENGSL--LHVHSAVARGR 2211
Query: 507 WKLLCSICGVSYGACIQCSNTTCRVAYHPLCARA 540
+ C C + GA + C T+C+ YH +CAR+
Sbjct: 2212 L-MRCERCNKT-GATVGCCLTSCQSNYHFMCARS 2243
>gi|292621658|ref|XP_002664717.1| PREDICTED: hypothetical protein LOC566825 [Danio rerio]
Length = 3750
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/245 (39%), Positives = 135/245 (55%), Gaps = 43/245 (17%)
Query: 603 PSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNG 653
P NPSGCARSE Y N+ + R+ P+ L+ E + ++
Sbjct: 3538 PVNPSGCARSEVYLRKSTFDMFNFLASQHRQLPDT-------DLYDEEEDEVL------- 3583
Query: 654 LSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
+ ++ + SL L MA +++H++ T ++ + +S IHG G+F K
Sbjct: 3584 ----------LKSTRRATSLE------LPMAMRFRHLERTSKEAVGVYRSAIHGRGLFCK 3627
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
AG+MVIEY+G ++R + D+RE + G G YMFRIDD V+DAT G+ A
Sbjct: 3628 RNIEAGEMVIEYSGIVIRSVLTDKREKYYDGK--GIGCYMFRIDDFDVVDATMHGNAARF 3685
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRC 831
INHSCEPNCYSRVI+V G +HI+IFA R I + EELTYDY+F D +L C CG RC
Sbjct: 3686 INHSCEPNCYSRVINVEGQKHIVIFALRKIYRGEELTYDYKFPIEDASNKLGCNCGAKRC 3745
Query: 832 RGVVN 836
R +N
Sbjct: 3746 RRFLN 3750
>gi|225430418|ref|XP_002283013.1| PREDICTED: histone-lysine N-methyltransferase ATX1-like [Vitis
vinifera]
Length = 496
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 119/378 (31%), Positives = 169/378 (44%), Gaps = 54/378 (14%)
Query: 468 TTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNT 527
+T +W H+ C +W P T +V M D ++ S+ R ++CSIC G+CI+C
Sbjct: 164 STVKQWVHMVCGLWTPGTRCPNVDTMSAFD-VSGASRPRANVICSICNRPGGSCIKCRVL 222
Query: 528 TCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVD 587
C V +HP CA GL L S E +++ + C H + L D
Sbjct: 223 NCLVPFHPWCAHRKGL-----------LQSEVEGVDNENVGFYGRCMLHAAHPSCELDSD 271
Query: 588 ERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVG 647
++ CAR+E Y +GRK+ F N +
Sbjct: 272 PINIETDS------TGEKELTCARTEGY-----KGRKQEG----------FRHNLNFQ-- 308
Query: 648 GYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADK----YKHMKETFRKRLAFGKS 703
NG G +P + + LH + + K YK K K L KS
Sbjct: 309 ---SNGNGGCLVPQ-----EQLNAWLHINGQKSCTKGQKEFARYKQAKGW--KHLVVYKS 358
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS---LVGAGTYMFRIDDER 760
GIH G++ G MV+EY GE+V +AD+RE + Y FRID E
Sbjct: 359 GIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQSGRKLQYKTACYFFRIDKEH 418
Query: 761 VIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE 820
+IDATR G IA +NHSC PNC ++VISV ++ ++ FA+RDI EE+TYDY F DE
Sbjct: 419 IIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEITYDYHFNHEDE 478
Query: 821 --QLACYCGFPRCRGVVN 836
++ C+C CR +N
Sbjct: 479 GKKIPCFCNSRNCRRYLN 496
>gi|363742545|ref|XP_417896.3| PREDICTED: histone-lysine N-methyltransferase MLL [Gallus gallus]
Length = 3871
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 109/158 (68%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 3716 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREK 3775
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3776 Y-YDSK-GIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3833
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3834 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3871
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLL-CSICGVSYGACIQCSNTTCRV 531
W H+ CA+W E D ++ + ++ R K L C C S GA + C T+C
Sbjct: 1779 WTHVNCALWSAEVFEDDDGSLKNV----HMAVIRGKQLRCEFCQKS-GATVGCCLTSCTS 1833
Query: 532 AYHPLCARAAGLCVELEDE 550
YH +C+RA CV L+D+
Sbjct: 1834 NYHFMCSRAKN-CVFLDDK 1851
>gi|410911878|ref|XP_003969417.1| PREDICTED: uncharacterized protein LOC101064190 [Takifugu rubripes]
Length = 2720
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/236 (41%), Positives = 135/236 (57%), Gaps = 24/236 (10%)
Query: 603 PSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
P NPSGCARSE Y+ R+ + A+ + L P + G + + S
Sbjct: 2507 PINPSGCARSEVYS---RKATFDMFNFLASQHREL-----PDITGPFEEEDDDFPLKSSR 2558
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
R S+ L MA +++H+++ ++ + +S IHG G+F K AG+MV
Sbjct: 2559 RATSSE------------LPMAMRFRHLEKISKEAVGVYRSEIHGRGLFCKRNIEAGEMV 2606
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY G ++R + D+R+ + G G YMFRIDD V+DAT G+ A INHSCEPNC
Sbjct: 2607 IEYAGTVIRAVLTDKRQKYYDGK--GIGCYMFRIDDFDVVDATMQGNAARFINHSCEPNC 2664
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
YSRVI+V+G +HI+IFA R I + EELTYDY+F D++ L C CG RCRG +N
Sbjct: 2665 YSRVINVDGRKHIVIFALRKIYRGEELTYDYKFPIEDDESKLHCNCGTRRCRGSLN 2720
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 63/150 (42%), Gaps = 35/150 (23%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDR 506
C LC G ++ P+ GR WAH+ C++W E + ++ ++R
Sbjct: 1382 CTLCQHYGDSV-PSEAGRLLYMGQNEWAHVNCSLWSAEVYEENGALLQVHSAVSRGRH-- 1438
Query: 507 WKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQC 566
L C CG S GA + C TC+ +H +CAR CV +D
Sbjct: 1439 --LRCDHCGQS-GATVGCCLATCQSNFHFMCARVQN-CVFQQDRK--------------- 1479
Query: 567 IRLLSFCKKHKQPLNDRLAVDERLVQVTRR 596
+C KH+ ++D+ + + +V+RR
Sbjct: 1480 ----VYCHKHRDLVSDKKVIGKGF-EVSRR 1504
>gi|224083075|ref|XP_002188579.1| PREDICTED: histone-lysine N-methyltransferase MLL [Taeniopygia
guttata]
Length = 3849
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 109/158 (68%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 3694 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREK 3753
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3754 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3811
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3812 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3849
>gi|4127850|emb|CAA09454.1| MLL protein [Gallus gallus]
Length = 945
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 109/158 (68%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 790 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREK 849
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 850 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 907
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 908 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 945
>gi|449267369|gb|EMC78314.1| Histone-lysine N-methyltransferase HRX, partial [Columba livia]
Length = 3786
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 109/158 (68%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 3631 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREK 3690
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3691 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3748
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3749 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3786
Score = 39.3 bits (90), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLL-CSICGVSYGACIQCSNTTCRV 531
W H+ CA+W E D ++ + ++ R K L C C S GA + C T+C
Sbjct: 1705 WTHVNCALWSAEVFEDDDGSLKNV----HMAVIRGKQLRCEFCQKS-GATVGCCLTSCTS 1759
Query: 532 AYHPLCARAAGLCVELEDE 550
YH +C+RA CV L+D+
Sbjct: 1760 NYHFMCSRAKN-CVFLDDK 1777
>gi|326933334|ref|XP_003212761.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Meleagris gallopavo]
Length = 3851
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 109/158 (68%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 3696 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREK 3755
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3756 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3813
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3814 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3851
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLL-CSICGVSYGACIQCSNTTCRV 531
W H+ CA+W E D ++ + ++ R K L C C S GA + C T+C
Sbjct: 1775 WTHVNCALWSAEVFEDDDGSLKNV----HMAVIRGKQLRCEFCQKS-GATVGCCLTSCTS 1829
Query: 532 AYHPLCARAAGLCVELEDE 550
YH +C+RA CV L+D+
Sbjct: 1830 NYHFMCSRAKN-CVFLDDK 1847
>gi|348533938|ref|XP_003454461.1| PREDICTED: hypothetical protein LOC100700132 [Oreochromis niloticus]
Length = 2924
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 98/245 (40%), Positives = 134/245 (54%), Gaps = 43/245 (17%)
Query: 603 PSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNG 653
P NPSGCARSE Y N+ + R+ P+ +G Y
Sbjct: 2712 PVNPSGCARSELYLRKSTFDMFNFLASQHRQLPD------------------IGPY---- 2749
Query: 654 LSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
+ + + ++ + SL L MA +++H+K+T ++ + +S IHG G+F K
Sbjct: 2750 --DDEEDEVLLKSTRRATSLE------LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCK 2801
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
AG+MVIEY G ++R + D+RE + G G YMFRIDD V+DAT G+ A
Sbjct: 2802 RNIEAGEMVIEYAGIVIRSVLTDKREKYYDGK--GIGCYMFRIDDFDVVDATMHGNAARF 2859
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRC 831
INHSCEPNCYSRVI+V G +HI+IFA R I + EELTYDY+F D +L C CG RC
Sbjct: 2860 INHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYDYKFPIEDASNKLNCNCGARRC 2919
Query: 832 RGVVN 836
R +N
Sbjct: 2920 RRFLN 2924
>gi|327280514|ref|XP_003224997.1| PREDICTED: hypothetical protein LOC100556600 [Anolis carolinensis]
Length = 2812
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 100/245 (40%), Positives = 135/245 (55%), Gaps = 43/245 (17%)
Query: 603 PSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNG 653
P NP GCAR+E Y N+ AS R+ E PY
Sbjct: 2600 PLNPHGCARAEVYSRKCTFDMFNFL-------------ASQHRVLPEGGPY--------- 2637
Query: 654 LSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
+++ ++ + SL L MA +++H+K+T ++ + +S IHG G+F K
Sbjct: 2638 --DEEEDEVQLKSTRRATSLE------LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCK 2689
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
AG+MVIEY+G ++R + D+RE + Y+S G G YMFRIDD V+DAT G+ A
Sbjct: 2690 RNIDAGEMVIEYSGIVIRSVLTDKREKY-YDS-KGIGCYMFRIDDFDVVDATMHGNAARF 2747
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRC 831
INHSCEPNCYSRVI V G +HI+IFA R I + EELTYDY+F D +L C CG RC
Sbjct: 2748 INHSCEPNCYSRVIHVEGQKHIVIFALRRIFRGEELTYDYKFPIEDAGSKLPCNCGAKRC 2807
Query: 832 RGVVN 836
R +N
Sbjct: 2808 RRFLN 2812
>gi|195501654|ref|XP_002097885.1| GE26460 [Drosophila yakuba]
gi|194183986|gb|EDW97597.1| GE26460 [Drosophila yakuba]
Length = 343
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 101/241 (41%), Positives = 133/241 (55%), Gaps = 44/241 (18%)
Query: 605 NPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
N CAR EPY ++ R RK+P ++FV QP S
Sbjct: 138 NAYDCARCEPYANRSEYDMFSWLASRHRKQP--------IQVFV--QP-----------S 176
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
N L R GS L MA KY+ +KET++ + +S IHG G++
Sbjct: 177 DNELVPRRGTGSN------------LPMAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKD 224
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+S G G YMF+IDD V+DAT G+ A IN
Sbjct: 225 IEAGEMVIEYAGELIRSTLTDKRERY-YDSR-GIGCYMFKIDDNLVVDATMRGNAARFIN 282
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
H CEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F +E++ C CG RCR +
Sbjct: 283 HCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEEEKIPCSCGSKRCRKYL 342
Query: 836 N 836
N
Sbjct: 343 N 343
>gi|328778088|ref|XP_392252.4| PREDICTED: histone-lysine N-methyltransferase trithorax [Apis
mellifera]
Length = 3195
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 107/162 (66%), Gaps = 2/162 (1%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R A L MA +++ +KET ++ + S IHG G+F AG+MVIEY GE++R S+
Sbjct: 3036 RVASTNLPMAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGEMVIEYAGEVIRASL 3095
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + Y+S G YMF+IDD V+DAT G+ A INHSCEPNCYSRV+ + G +H
Sbjct: 3096 TDKREKY-YDSK-NIGCYMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRVVDILGKKH 3153
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I+IFA R I Q EELTYDY+F D ++ C CG RCR +N
Sbjct: 3154 ILIFALRRINQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3195
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 44/101 (43%), Gaps = 21/101 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRVSK- 504
CCLC +G + T +GR W H CA+W E E IDG L V
Sbjct: 1223 CCLCKGLGDGPE-TKEGRLLYCGQNEWLHSNCALWSNEV-------FEEIDGSLQNVHSA 1274
Query: 505 -DRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
R +L+ CS CG GA + C C +H CAR GL
Sbjct: 1275 ISRGRLIRCSECG-KKGASVGCCAKNCNSTFHYPCARNVGL 1314
>gi|383861703|ref|XP_003706324.1| PREDICTED: uncharacterized protein LOC100882965 [Megachile rotundata]
Length = 3434
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 107/162 (66%), Gaps = 2/162 (1%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R A L MA +++ +KET ++ + S IHG G+F AG+MVIEY GE++R S+
Sbjct: 3275 RVASTNLPMAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGEMVIEYAGEVIRASL 3334
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + Y+S G YMF+IDD V+DAT G+ A INHSCEPNCYSRV+ + G +H
Sbjct: 3335 TDKREKY-YDSK-NIGCYMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRVVDILGKKH 3392
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I+IFA R I Q EELTYDY+F D ++ C CG RCR +N
Sbjct: 3393 ILIFALRRINQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3434
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 44/101 (43%), Gaps = 21/101 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRVSK- 504
CCLC +G + T +GR W H CA+W E E IDG L V
Sbjct: 1346 CCLCKGLGDGAE-TKEGRLLYCGQNEWVHSNCALWSNEV-------FEEIDGSLQNVHSA 1397
Query: 505 -DRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
R +L+ CS CG GA + C C +H CAR GL
Sbjct: 1398 ISRGRLIRCSECG-KKGASVGCCAKNCSSTFHYPCARNVGL 1437
>gi|296422581|ref|XP_002840838.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637063|emb|CAZ85029.1| unnamed protein product [Tuber melanosporum]
Length = 1200
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 91/241 (37%), Positives = 133/241 (55%), Gaps = 11/241 (4%)
Query: 605 NPSGCARSEPYNYF--GRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
NP+GCAR+E Y + P L A+ + P + N + S
Sbjct: 962 NPTGCARTEGYRRVPEAEKSMYLPHRLQLAAKVKSAANKIPEVTAAAVTNPTKNPSSTSR 1021
Query: 663 --RVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRA 718
RV + L+ S AD ++ +K+ +K + F +S IH +G++A A
Sbjct: 1022 MNRVNNRRLVVGLNNQKQMLSSDADVMRFNQLKKR-KKPVKFARSAIHNWGLYAMENISA 1080
Query: 719 GDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 778
GDM+IEY GE++R +AD RE S +G+ +Y+FRIDD VIDAT+AG IA INHSC
Sbjct: 1081 GDMIIEYVGEIIRQQVADMREKKYLKSGIGS-SYLFRIDDTTVIDATKAGGIARFINHSC 1139
Query: 779 EPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGFPRCRGVV 835
PNC +++I V G + I+I+A RDI++ EELTYDY+F +E++ C CG C+G +
Sbjct: 1140 TPNCTAKIIKVEGSKRIVIYALRDIRENEELTYDYKFERELESEERIPCLCGSSGCKGFL 1199
Query: 836 N 836
N
Sbjct: 1200 N 1200
>gi|328778989|ref|XP_395451.4| PREDICTED: hypothetical protein LOC411985 isoform 1 [Apis mellifera]
Length = 1503
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 95/244 (38%), Positives = 143/244 (58%), Gaps = 20/244 (8%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKR---LFVENQPYLVGGYCQNGLSGNTLPSIR 663
SG AR+E Y R + + + A S++R + N Y G NG NT +
Sbjct: 1266 SGSARTEGYYKVDVREKAKHKHHYAQSIQRSNDVEDNNGSYASGDGTMNGPKNNT----K 1321
Query: 664 VIGSKFSFSLHRDAPN----FLSM------ADKYKHMKETFRKR-LAFGKSGIHGFGIFA 712
+ K +L R+A + L+ +D K + FRK+ L F KSGIH +G+FA
Sbjct: 1322 ALTGKMQ-ALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFA 1380
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
P A +MVIEY G++VRP +AD RE + +G+ +Y+FRID + +IDAT+ G++A
Sbjct: 1381 MEPIAADEMVIEYVGQMVRPVVADLRESQYEATGIGS-SYLFRIDLDTIIDATKCGNLAR 1439
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCR 832
INHSC PNCY++VI++ + I+I++K+ I EE+TYDY+F D+++ C CG P+CR
Sbjct: 1440 FINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCR 1499
Query: 833 GVVN 836
G +N
Sbjct: 1500 GTLN 1503
>gi|307180358|gb|EFN68384.1| Histone-lysine N-methyltransferase trithorax [Camponotus floridanus]
Length = 3218
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 82/162 (50%), Positives = 107/162 (66%), Gaps = 2/162 (1%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R A L MA +++ +KET +K + S IHG G+F AG+MVIEY GE++R S+
Sbjct: 3059 RVASTNLPMAMRFRILKETSKKSVGVYHSHIHGRGLFCLRDIEAGEMVIEYAGEVIRSSL 3118
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + Y+S G YMF+IDD V+DAT G+ A INHSCEPNCYSRV+ + G +H
Sbjct: 3119 TDKREKY-YDSK-NIGCYMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRVVDILGKKH 3176
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I+IFA R I Q EELTYDY+F D ++ C CG RCR +N
Sbjct: 3177 ILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3218
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 43/101 (42%), Gaps = 21/101 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRVSK- 504
CCLC + + T +GR W H CA+W E E IDG L V
Sbjct: 1224 CCLCKGLSDGQE-TKEGRLFYCGQNEWVHANCALWSNEV-------FEEIDGSLQNVHSA 1275
Query: 505 -DRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
R +L+ C+ CG GA I C C +H CAR GL
Sbjct: 1276 ISRGRLIRCTECG-KKGASIGCCAKNCSNIFHFPCARNIGL 1315
>gi|296082099|emb|CBI21104.3| unnamed protein product [Vitis vinifera]
Length = 1111
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 121/387 (31%), Positives = 172/387 (44%), Gaps = 59/387 (15%)
Query: 468 TTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNT 527
+T +W H+ C +W P T +V M D ++ S+ R ++CSIC G+CI+C
Sbjct: 766 STVKQWVHMVCGLWTPGTRCPNVDTMSAFD-VSGASRPRANVICSICNRPGGSCIKCRVL 824
Query: 528 TCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVD 587
C V +HP CA GL L S E +++ + C H + L D
Sbjct: 825 NCLVPFHPWCAHRKGL-----------LQSEVEGVDNENVGFYGRCMLHAAHPSCELDSD 873
Query: 588 ERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVG 647
++ CAR+E Y +GRK+ F N +
Sbjct: 874 PINIETDS------TGEKELTCARTEGY-----KGRKQEG----------FRHNLNFQ-- 910
Query: 648 GYCQNGLSGNTLPS------IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFG 701
NG G +P + + G K S + P +Y KE R + A G
Sbjct: 911 ---SNGNGGCLVPQEQLNAWLHINGQK---SCTKGLPKTPISDVEYDCRKEFARYKQAKG 964
Query: 702 -------KSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS---LVGAGT 751
KSGIH G++ G MV+EY GE+V +AD+RE +
Sbjct: 965 WKHLVVYKSGIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQSGRKLQYKTAC 1024
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y FRID E +IDATR G IA +NHSC PNC ++VISV ++ ++ FA+RDI EE+TY
Sbjct: 1025 YFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEITY 1084
Query: 812 DYRFFSIDE--QLACYCGFPRCRGVVN 836
DY F DE ++ C+C CR +N
Sbjct: 1085 DYHFNHEDEGKKIPCFCNSRNCRRYLN 1111
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 31/67 (46%), Gaps = 3/67 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N L+C +C + VH CYG + V W C CR + C LC
Sbjct: 637 CCVCGSSNKDEINCLLECSRCLIRVHQACYG-VSRVPKGRWYCRPCRTSSKN--IVCVLC 693
Query: 459 PVVGGAM 465
GGAM
Sbjct: 694 GYGGGAM 700
>gi|241687915|ref|XP_002401626.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215504523|gb|EEC14017.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 1036
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 76/156 (48%), Positives = 107/156 (68%), Gaps = 2/156 (1%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K T ++ + +S IHG G++ K G+M+IEY GE++R ++ D+RE
Sbjct: 883 LPMAMRFRHLKNTAKEAVGVYRSSIHGRGLYCKRNIDGGEMIIEYAGEVIRAALTDKREK 942
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y S G G YMFRIDD V+DAT G+ A INHSCEPNCYS+VI+V+ +HI+IFA
Sbjct: 943 Y-YESK-GIGCYMFRIDDHEVVDATMHGNAARFINHSCEPNCYSKVITVDNKKHIVIFAL 1000
Query: 801 RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
R I + EELTYDY+F + ++ C CG RCR +N
Sbjct: 1001 RSILKGEELTYDYKFPIEEVKIPCSCGSRRCRKFLN 1036
>gi|301606679|ref|XP_002932944.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 1 [Xenopus
(Silurana) tropicalis]
Length = 3855
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 106/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F + AG+MVIEY+G ++R + D+RE
Sbjct: 3700 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCRRNIDAGEMVIEYSGNVIRSILTDKREK 3759
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI ++G +HI+IFA
Sbjct: 3760 YYDGK--GIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAM 3817
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +LAC CG +CR +N
Sbjct: 3818 RKIYRGEELTYDYKFPIEDANNKLACNCGTKKCRKFLN 3855
>gi|380013032|ref|XP_003690574.1| PREDICTED: uncharacterized protein LOC100867676 [Apis florea]
Length = 1500
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 95/244 (38%), Positives = 143/244 (58%), Gaps = 20/244 (8%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKR---LFVENQPYLVGGYCQNGLSGNTLPSIR 663
SG AR+E Y R + + + A S++R + N Y G NG NT +
Sbjct: 1263 SGSARTEGYYKVDVREKAKHKHHYAQSIQRSNDVEDNNGSYASGDGTMNGPKNNT----K 1318
Query: 664 VIGSKFSFSLHRDAPN----FLSM------ADKYKHMKETFRKR-LAFGKSGIHGFGIFA 712
+ K +L R+A + L+ +D K + FRK+ L F KSGIH +G+FA
Sbjct: 1319 ALTGKMQ-ALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFA 1377
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
P A +MVIEY G++VRP +AD RE + +G+ +Y+FRID + +IDAT+ G++A
Sbjct: 1378 MEPIAADEMVIEYVGQMVRPVVADLRESQYEATGIGS-SYLFRIDLDTIIDATKCGNLAR 1436
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCR 832
INHSC PNCY++VI++ + I+I++K+ I EE+TYDY+F D+++ C CG P+CR
Sbjct: 1437 FINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCR 1496
Query: 833 GVVN 836
G +N
Sbjct: 1497 GTLN 1500
>gi|301606681|ref|XP_002932945.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 3840
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 106/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F + AG+MVIEY+G ++R + D+RE
Sbjct: 3685 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCRRNIDAGEMVIEYSGNVIRSILTDKREK 3744
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI ++G +HI+IFA
Sbjct: 3745 YYDGK--GIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAM 3802
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +LAC CG +CR +N
Sbjct: 3803 RKIYRGEELTYDYKFPIEDANNKLACNCGTKKCRKFLN 3840
>gi|431908264|gb|ELK11862.1| Histone-lysine N-methyltransferase HRX [Pteropus alecto]
Length = 3459
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3304 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3363
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3364 Y-YDSK-GIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3421
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3422 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3459
>gi|340710026|ref|XP_003393599.1| PREDICTED: hypothetical protein LOC100646252 [Bombus terrestris]
Length = 3530
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 107/162 (66%), Gaps = 2/162 (1%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R A L MA +++ +KET ++ + S IHG G+F AG+MVIEY GE++R S+
Sbjct: 3371 RVASTNLPMAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGEMVIEYAGEVIRASL 3430
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + Y+S G YMF+IDD V+DAT G+ A INHSCEPNCYSRV+ + G +H
Sbjct: 3431 TDKREKY-YDSK-NIGCYMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRVVDILGKKH 3488
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I+IFA R I Q EELTYDY+F D ++ C CG RCR +N
Sbjct: 3489 ILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3530
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 43/101 (42%), Gaps = 21/101 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRVSK- 504
CCLC + + T +GR W H CA+W E E IDG L V
Sbjct: 1391 CCLCKGLSDGPE-TKEGRLLYCGQNEWVHSNCALWSNEV-------FEEIDGSLQNVHSA 1442
Query: 505 -DRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
R +L+ C+ CG GA + C C +H CAR GL
Sbjct: 1443 ISRGRLIRCAECG-KKGASVGCCAKNCNSTFHYPCARNVGL 1482
>gi|156353190|ref|XP_001622957.1| predicted protein [Nematostella vectensis]
gi|156209595|gb|EDO30857.1| predicted protein [Nematostella vectensis]
Length = 288
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 135/234 (57%), Gaps = 20/234 (8%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NP+GC R+EP++ RR + A A+ + P L Y + + ++
Sbjct: 73 NPTGCCRTEPHH---RRLMYDMFAFLASEYR-----TPPQLDDNYDELEAAEQAWGNMTT 124
Query: 665 IGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR--LAFGKSGIHGFGIFAKHPHRAGDMV 722
I + L MA +Y+H+K+ + +A +SGIHG G++ AG+MV
Sbjct: 125 IKRPTTLD--------LPMAMRYRHLKQNNYNKGTIAVFRSGIHGRGLYCTRHIAAGEMV 176
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY+G L+R ++ D+RE Y G G YMFRID V+DAT +G+ A INHSCEPNC
Sbjct: 177 IEYSGMLIRSTLTDKRE--AYYESKGIGCYMFRIDGTYVVDATTSGNAARFINHSCEPNC 234
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
YSRV++++G++ I+IFA + I + EELTYDY+F DE+L C+C RCR +N
Sbjct: 235 YSRVVTIDGNKKILIFASKSISRGEELTYDYKFPLEDEKLPCHCKSKRCRKYLN 288
>gi|297458806|ref|XP_585092.4| PREDICTED: histone-lysine N-methyltransferase MLL [Bos taurus]
Length = 3826
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3671 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3730
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3731 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3788
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3789 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3826
>gi|297482744|ref|XP_002693122.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Bos
taurus]
gi|296480196|tpg|DAA22311.1| TPA: myeloid/lymphoid or mixed-lineage leukemia-like [Bos taurus]
Length = 3821
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3666 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3725
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3726 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3783
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3784 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3821
>gi|440904942|gb|ELR55394.1| Histone-lysine N-methyltransferase MLL, partial [Bos grunniens mutus]
Length = 3846
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3691 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3750
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3751 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3808
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3809 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3846
>gi|426244626|ref|XP_004016122.1| PREDICTED: histone-lysine N-methyltransferase MLL [Ovis aries]
Length = 3710
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3555 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3614
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3615 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3672
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3673 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3710
>gi|351705860|gb|EHB08779.1| Histone-lysine N-methyltransferase HRX [Heterocephalus glaber]
Length = 3899
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3744 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3803
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3804 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3861
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3862 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3899
>gi|350413847|ref|XP_003490133.1| PREDICTED: hypothetical protein LOC100748492 [Bombus impatiens]
Length = 3522
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 107/162 (66%), Gaps = 2/162 (1%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R A L MA +++ +KET ++ + S IHG G+F AG+MVIEY GE++R S+
Sbjct: 3363 RVASTNLPMAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGEMVIEYAGEVIRASL 3422
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + Y+S G YMF+IDD V+DAT G+ A INHSCEPNCYSRV+ + G +H
Sbjct: 3423 TDKREKY-YDSK-NIGCYMFKIDDHLVVDATMKGNAARFINHSCEPNCYSRVVDILGKKH 3480
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I+IFA R I Q EELTYDY+F D ++ C CG RCR +N
Sbjct: 3481 ILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRRCRKYLN 3522
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 43/101 (42%), Gaps = 21/101 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRVSK- 504
CCLC + + T +GR W H CA+W E E IDG L V
Sbjct: 1391 CCLCKGLSDGPE-TKEGRLLYCGQNEWVHSNCALWSNEV-------FEEIDGSLQNVHSA 1442
Query: 505 -DRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
R +L+ C+ CG GA + C C +H CAR GL
Sbjct: 1443 ISRGRLIRCTECG-KKGASVGCCAKNCNSTFHYPCARNVGL 1482
>gi|345799715|ref|XP_536554.3| PREDICTED: histone-lysine N-methyltransferase MLL [Canis lupus
familiaris]
Length = 3829
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3674 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3733
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3734 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3791
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3792 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3829
>gi|410972021|ref|XP_003992459.1| PREDICTED: histone-lysine N-methyltransferase MLL [Felis catus]
Length = 3554
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3399 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3458
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3459 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3516
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3517 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3554
>gi|395848655|ref|XP_003796965.1| PREDICTED: histone-lysine N-methyltransferase MLL [Otolemur
garnettii]
Length = 4062
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3907 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3966
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3967 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 4024
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 4025 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 4062
>gi|426370676|ref|XP_004052287.1| PREDICTED: histone-lysine N-methyltransferase MLL [Gorilla gorilla
gorilla]
Length = 3837
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3682 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3741
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3742 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3799
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3800 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3837
>gi|344293012|ref|XP_003418218.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Loxodonta africana]
Length = 3962
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3807 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3866
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3867 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3924
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3925 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3962
>gi|444725290|gb|ELW65863.1| Histone-lysine N-methyltransferase MLL [Tupaia chinensis]
Length = 3806
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3651 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3710
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3711 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3768
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3769 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3806
>gi|355567103|gb|EHH23482.1| hypothetical protein EGK_06957, partial [Macaca mulatta]
Length = 3824
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3669 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3728
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3729 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3786
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3787 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3824
>gi|417414196|gb|JAA53397.1| Putative histone-lysine n-methyltransferase mll, partial [Desmodus
rotundus]
Length = 3966
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3811 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3870
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3871 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3928
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3929 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3966
>gi|156353192|ref|XP_001622958.1| predicted protein [Nematostella vectensis]
gi|156209596|gb|EDO30858.1| predicted protein [Nematostella vectensis]
Length = 314
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 135/234 (57%), Gaps = 20/234 (8%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NP+GC R+EP++ RR + A A+ + P L Y + + ++
Sbjct: 99 NPTGCCRTEPHH---RRLMYDMFAFLASEYR-----TPPQLDDNYDELEAAEQAWGNMTT 150
Query: 665 IGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR--LAFGKSGIHGFGIFAKHPHRAGDMV 722
I + L MA +Y+H+K+ + +A +SGIHG G++ AG+MV
Sbjct: 151 IKRPTTLD--------LPMAMRYRHLKQNNYNKGTIAVFRSGIHGRGLYCTRHIAAGEMV 202
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY+G L+R ++ D+RE Y G G YMFRID V+DAT +G+ A INHSCEPNC
Sbjct: 203 IEYSGMLIRSTLTDKRE--AYYESKGIGCYMFRIDGTYVVDATTSGNAARFINHSCEPNC 260
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
YSRV++++G++ I+IFA + I + EELTYDY+F DE+L C+C RCR +N
Sbjct: 261 YSRVVTIDGNKKILIFASKSISRGEELTYDYKFPLEDEKLPCHCKSKRCRKYLN 314
>gi|332208875|ref|XP_003253537.1| PREDICTED: histone-lysine N-methyltransferase MLL [Nomascus
leucogenys]
Length = 3968
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3813 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3872
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3873 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3930
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3931 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3968
>gi|119587788|gb|EAW67384.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_e [Homo sapiens]
Length = 3972
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3817 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3876
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3877 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3934
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3935 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3972
>gi|308199413|ref|NP_001184033.1| histone-lysine N-methyltransferase MLL isoform 1 precursor [Homo
sapiens]
Length = 3972
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3817 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3876
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3877 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3934
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3935 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3972
>gi|301785015|ref|XP_002927929.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Ailuropoda
melanoleuca]
Length = 3981
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3826 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3885
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3886 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3943
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3944 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3981
>gi|56550039|ref|NP_005924.2| histone-lysine N-methyltransferase MLL isoform 2 precursor [Homo
sapiens]
gi|146345435|sp|Q03164.5|MLL1_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=CXXC-type zinc finger protein
7; AltName: Full=Lysine N-methyltransferase 2A;
Short=KMT2A; AltName: Full=Trithorax-like protein;
AltName: Full=Zinc finger protein HRX; Contains: RecName:
Full=MLL cleavage product N320; AltName: Full=N-terminal
cleavage product of 320 kDa; Short=p320; Contains:
RecName: Full=MLL cleavage product C180; AltName:
Full=C-terminal cleavage product of 180 kDa; Short=p180
gi|34305635|gb|AAQ63624.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) [Homo sapiens]
Length = 3969
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3814 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3873
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3874 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3931
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3932 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3969
>gi|355752689|gb|EHH56809.1| hypothetical protein EGM_06289 [Macaca fascicularis]
Length = 3844
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3689 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3748
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3749 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3806
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3807 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3844
>gi|119587784|gb|EAW67380.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_a [Homo sapiens]
Length = 3969
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3814 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3873
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3874 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3931
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3932 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3969
>gi|354496911|ref|XP_003510567.1| PREDICTED: histone-lysine N-methyltransferase MLL [Cricetulus
griseus]
Length = 3907
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3752 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3811
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3812 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3869
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3870 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3907
>gi|297269329|ref|XP_001093874.2| PREDICTED: histone-lysine N-methyltransferase MLL [Macaca mulatta]
Length = 3986
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3831 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3890
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3891 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3948
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3949 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3986
>gi|432880997|ref|XP_004073754.1| PREDICTED: uncharacterized protein LOC101157226 [Oryzias latipes]
Length = 2812
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 81/158 (51%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+++T ++ + +S IHG G+F K AG+MVIEY G ++R + D+RE
Sbjct: 2657 LPMAMRFRHLEKTSKEAVGVYRSLIHGRGLFCKRNIEAGEMVIEYAGTVIRSVLTDKREK 2716
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+V+G +HI+IFA
Sbjct: 2717 FYDGK--GIGCYMFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFAL 2774
Query: 801 RDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
R I + EELTYDY+F DE +L C CG RCR +N
Sbjct: 2775 RKIYRGEELTYDYKFPIEDEDNKLHCNCGTRRCRRFLN 2812
>gi|390469747|ref|XP_002754504.2| PREDICTED: histone-lysine N-methyltransferase MLL [Callithrix
jacchus]
Length = 3994
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3839 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3898
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3899 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3956
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3957 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3994
>gi|397498815|ref|XP_003820170.1| PREDICTED: histone-lysine N-methyltransferase MLL [Pan paniscus]
Length = 4202
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 4047 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 4106
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 4107 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 4164
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 4165 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 4202
>gi|114640631|ref|XP_508792.2| PREDICTED: histone-lysine N-methyltransferase MLL [Pan troglodytes]
Length = 3969
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3814 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3873
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3874 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3931
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3932 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3969
>gi|402895434|ref|XP_003910832.1| PREDICTED: histone-lysine N-methyltransferase MLL [Papio anubis]
Length = 3968
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3813 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3872
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3873 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3930
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3931 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3968
>gi|1490271|emb|CAA93625.1| ALL-1 protein [Homo sapiens]
Length = 4005
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3850 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3909
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3910 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3967
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3968 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 4005
>gi|119587786|gb|EAW67382.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_c [Homo sapiens]
Length = 3130
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 2975 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3034
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3035 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3092
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3093 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3130
>gi|391325531|ref|XP_003737286.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Metaseiulus occidentalis]
Length = 976
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 137/239 (57%), Gaps = 29/239 (12%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE + + +++ K V+ Q Q+G S + V
Sbjct: 758 TGCARSEGFYKVDSQEKQQ--------YKASHVQTQ------EAQDGKSNKPV----VAA 799
Query: 667 SKFSFSLHRDAPNFLSM-------ADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRA 718
S+ S + L++ +D K + FRK+ L F KS IH +G+FA P A
Sbjct: 800 SQMSREARSNQRRLLTIFQETEVVSDLLKFNQLKFRKKQLKFAKSTIHDWGLFALEPIAA 859
Query: 719 GDMVIEYTGELVRPSIADRREHFIYNSLVGAGT-YMFRIDDERVIDATRAGSIAHLINHS 777
+MVIEY G+ VR S+AD RE + + VG G+ Y+FR+D E +IDAT+ G++A INHS
Sbjct: 860 DEMVIEYVGQSVRCSVADSRE--VQYTKVGIGSSYLFRVDHETIIDATKCGNLARFINHS 917
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY+RVI+V G + I+I++KRDI EE+TYDY+F + ++ C CG P+CRG +N
Sbjct: 918 CNPNCYARVITVEGQKKIVIYSKRDISVNEEITYDYKFPREEVKITCLCGTPQCRGYLN 976
>gi|149041498|gb|EDL95339.1| myeloid/lymphoid or mixed-lineage leukemia (mapped) [Rattus
norvegicus]
Length = 3725
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3570 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3629
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3630 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3687
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3688 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3725
>gi|395743560|ref|XP_002822597.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL [Pongo abelii]
Length = 4012
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3857 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3916
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3917 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3974
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3975 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 4012
>gi|350408006|ref|XP_003488269.1| PREDICTED: hypothetical protein LOC100743429 [Bombus impatiens]
Length = 1503
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 143/244 (58%), Gaps = 20/244 (8%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKR---LFVENQPYLVGGYCQNGLSGNTLPSIR 663
SG AR+E Y R + + + A S++R + N PY NG NT +
Sbjct: 1266 SGSARTEGYYKVDVREKAKHKHHYAQSIQRSNDVEDSNGPYAGSDGIMNGPKNNT----K 1321
Query: 664 VIGSKFSFSLHRDAPN----FLSM------ADKYKHMKETFRKR-LAFGKSGIHGFGIFA 712
+ K +L R+A + L+ +D K + FRK+ L F KSGIH +G+FA
Sbjct: 1322 ALTGKMQ-ALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFA 1380
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
P A +MVIEY G+++RP +AD RE + +G+ +Y+FRID + +IDAT+ G++A
Sbjct: 1381 MEPIAADEMVIEYVGQMIRPVVADLRESQYEATGIGS-SYLFRIDLDTIIDATKCGNLAR 1439
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCR 832
INHSC PNCY++VI++ + I+I++K+ I EE+TYDY+F D+++ C CG P+CR
Sbjct: 1440 FINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCR 1499
Query: 833 GVVN 836
G +N
Sbjct: 1500 GTLN 1503
>gi|403263194|ref|XP_003923935.1| PREDICTED: histone-lysine N-methyltransferase MLL [Saimiri
boliviensis boliviensis]
Length = 3985
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3830 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3889
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3890 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3947
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3948 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3985
>gi|119587787|gb|EAW67383.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_d [Homo sapiens]
Length = 4002
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3847 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3906
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3907 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3964
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3965 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 4002
>gi|432105765|gb|ELK31956.1| Histone-lysine N-methyltransferase MLL, partial [Myotis davidii]
Length = 3463
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3308 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3367
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3368 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3425
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3426 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3463
>gi|241612901|ref|XP_002407306.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215502770|gb|EEC12264.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 208
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 109/155 (70%), Gaps = 2/155 (1%)
Query: 683 MADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
++D K + FRK+ L F KS IH +G+FA P A +MVIEY G++VRP +ADRRE
Sbjct: 55 VSDLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQMVRPVMADRREQH 114
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + +Y+FR+D E +IDAT+ G++A INHSC PNCY++VI+V G + I+I++K+
Sbjct: 115 -YTQIGIGSSYLFRVDVETIIDATKCGNLARFINHSCNPNCYAKVITVEGQKKIVIYSKQ 173
Query: 802 DIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I EE+TYDY+F +E+++C CG P+CRG +N
Sbjct: 174 PINVNEEITYDYKFPLEEEKISCLCGAPQCRGFLN 208
>gi|688443|gb|AAA62593.1| All-1 protein, partial [Mus musculus]
Length = 3866
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3711 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3770
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3771 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3828
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3829 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3866
>gi|392350034|ref|XP_003750554.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Rattus
norvegicus]
Length = 3894
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3739 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3798
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3799 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3856
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3857 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3894
>gi|627837|pir||A48205 All-1 protein +GTE form - mouse (fragment)
Length = 3869
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3714 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3773
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3774 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3831
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3832 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3869
>gi|350588548|ref|XP_003357368.2| PREDICTED: histone-lysine N-methyltransferase MLL [Sus scrofa]
Length = 2525
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 2370 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 2429
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 2430 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 2487
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 2488 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 2525
>gi|392341954|ref|XP_003754471.1| PREDICTED: histone-lysine N-methyltransferase MLL [Rattus norvegicus]
Length = 3987
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3832 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3891
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3892 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3949
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3950 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3987
>gi|47225575|emb|CAG12058.1| unnamed protein product [Tetraodon nigroviridis]
Length = 480
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 99/245 (40%), Positives = 137/245 (55%), Gaps = 43/245 (17%)
Query: 603 PSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNG 653
P NPSGCARSE Y N+ + R+ P+ +G Y
Sbjct: 268 PVNPSGCARSELYLRKSTFDIFNFLASQHRQLPD------------------IGPY---- 305
Query: 654 LSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
+ + + ++ + SL L MA +++H+++T ++ + +S IHG G+F K
Sbjct: 306 --DDEEDEVLLKSTRRATSLE------LPMAMRFRHLEKTSKEAVGVYRSAIHGRGLFCK 357
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
AG+MVIEY G ++R + D+RE + Y+S G G YMFRIDD V+DAT G+ A
Sbjct: 358 RNIEAGEMVIEYAGIVIRSVLTDKREKY-YDS-KGIGCYMFRIDDFDVVDATMHGNAARF 415
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRC 831
INHSCEPNCYSRVI+V G +HI+IFA R I + EELTYDY+F D +L C CG RC
Sbjct: 416 INHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYDYKFPIEDASNKLNCNCGARRC 475
Query: 832 RGVVN 836
R +N
Sbjct: 476 RRFLN 480
>gi|148693675|gb|EDL25622.1| mCG1547 [Mus musculus]
Length = 3706
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3551 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3610
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3611 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3668
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3669 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3706
>gi|124486682|ref|NP_001074518.1| histone-lysine N-methyltransferase MLL [Mus musculus]
Length = 3963
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3808 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3867
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3868 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3925
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3926 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3963
>gi|62088596|dbj|BAD92745.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) variant [Homo sapiens]
Length = 2880
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 2725 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 2784
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 2785 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 2842
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 2843 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 2880
>gi|341940997|sp|P55200.3|MLL1_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=Zinc finger protein HRX;
Contains: RecName: Full=MLL cleavage product N320;
AltName: Full=N-terminal cleavage product of 320 kDa;
Short=p320; Contains: RecName: Full=MLL cleavage product
C180; AltName: Full=C-terminal cleavage product of 180
kDa; Short=p180
Length = 3966
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3811 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3870
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3871 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3928
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3929 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3966
>gi|348573849|ref|XP_003472703.1| PREDICTED: histone-lysine N-methyltransferase MLL-like, partial
[Cavia porcellus]
Length = 2799
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 2644 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 2703
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 2704 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 2761
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 2762 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 2799
>gi|328768995|gb|EGF79040.1| hypothetical protein BATDEDRAFT_35515 [Batrachochytrium dendrobatidis
JAM81]
Length = 1367
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 72/142 (50%), Positives = 103/142 (72%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+KR+ F KS IH +G+FA P A DMVIEY GE++R +AD RE S +G+ +Y+F
Sbjct: 1227 KKRIKFDKSIIHDWGLFAMEPIDANDMVIEYIGEIIRQKVADHREKLYEASGIGS-SYLF 1285
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D++ +IDAT+ G++A INH CEPNC ++VISV+G + I+I+A RDIK+ EELTYDY+
Sbjct: 1286 RVDEDTIIDATKTGNLARFINHCCEPNCNAKVISVDGTKRIVIYANRDIKEGEELTYDYK 1345
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F ++++ C CG CRG +N
Sbjct: 1346 FPIEEDKIPCLCGAVNCRGTLN 1367
>gi|334330381|ref|XP_001380704.2| PREDICTED: histone-lysine N-methyltransferase MLL [Monodelphis
domestica]
Length = 3960
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 106/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3805 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3864
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3865 Y-YES-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3922
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3923 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3960
>gi|184394|gb|AAA58669.1| HRX [Homo sapiens]
Length = 3969
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3814 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3873
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3874 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNRARFINHSCEPNCYSRVINIDGQKHIVIFAM 3931
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3932 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3969
>gi|395520196|ref|XP_003764223.1| PREDICTED: histone-lysine N-methyltransferase MLL [Sarcophilus
harrisii]
Length = 3995
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 106/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 3840 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 3899
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 3900 Y-YES-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 3957
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3958 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 3995
>gi|432909101|ref|XP_004078112.1| PREDICTED: uncharacterized protein LOC101174945 [Oryzias latipes]
Length = 3692
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 96/245 (39%), Positives = 133/245 (54%), Gaps = 43/245 (17%)
Query: 603 PSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNG 653
P NP+GCARSE Y N+ + R+ P+ +G Y
Sbjct: 3480 PVNPNGCARSELYLRKSTFDMFNFLASQHRQLPD------------------IGPY---- 3517
Query: 654 LSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAK 713
+ + + ++ + SL L MA +++H++ T ++ + +S IHG G+F K
Sbjct: 3518 --DDEEDEVLLKSTRRATSLE------LPMAMRFRHLERTSKEAVGVYRSAIHGRGLFCK 3569
Query: 714 HPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHL 773
AG+MVIEY G ++R + D+RE + G G YMFRIDD V+DAT G+ A
Sbjct: 3570 RNIEAGEMVIEYAGIVIRSVLTDKREKYYDGK--GIGCYMFRIDDFDVVDATMHGNAARF 3627
Query: 774 INHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRC 831
INHSCEPNCYSRVI+V G +HI+IFA R I + EELTYDY+F D +L C CG RC
Sbjct: 3628 INHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYDYKFPIEDASNKLNCNCGARRC 3687
Query: 832 RGVVN 836
R +N
Sbjct: 3688 RRFLN 3692
Score = 39.7 bits (91), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 65/150 (43%), Gaps = 34/150 (22%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDR 506
C LC G +KP GR WAH+ C +W E D + + + V++ R
Sbjct: 2152 CSLCQKYGD-LKPDEAGRLLYLGQNEWAHVNCCLWSAEVFEEDNGSL--LHVHSAVTRGR 2208
Query: 507 WKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQC 566
+ C C + GA + C T+C+ YH +CAR+ CV +D+ ++
Sbjct: 2209 L-MRCERCNKT-GATVGCCLTSCQSNYHFMCARSRQ-CV-FQDDKKV------------- 2251
Query: 567 IRLLSFCKKHKQPLNDRLAVDERLVQVTRR 596
+C KH+ ++ R+ + +V RR
Sbjct: 2252 -----YCYKHRHLISGRMITGQEF-EVNRR 2275
>gi|347967460|ref|XP_307938.5| AGAP002246-PA [Anopheles gambiae str. PEST]
gi|333466285|gb|EAA03662.5| AGAP002246-PA [Anopheles gambiae str. PEST]
Length = 2003
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 75/152 (49%), Positives = 107/152 (70%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KS IH +G+FA P A +MVIEY G++VRPS+AD RE Y
Sbjct: 1855 KFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMVRPSVADLRE-TKYE 1911
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
++ +Y+FRID E +IDAT+ G++A INHSC PNCY++VI++ ++ I+I++K+ I
Sbjct: 1912 AIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIG 1971
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F DE++ C CG P CRG +N
Sbjct: 1972 VNEEITYDYKFPLEDEKIPCLCGAPGCRGTLN 2003
>gi|195570949|ref|XP_002103466.1| GD20433 [Drosophila simulans]
gi|194199393|gb|EDX12969.1| GD20433 [Drosophila simulans]
Length = 152
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 80/154 (51%), Positives = 106/154 (68%), Gaps = 2/154 (1%)
Query: 683 MADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI 742
MA KY+ +KET++ + +S IHG G++ AG+MVIEY GEL+R ++ D+RE +
Sbjct: 1 MAMKYRTLKETYKDYVGVFRSHIHGRGLYCTKDIEAGEMVIEYAGELIRSTLTDKRERY- 59
Query: 743 YNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRD 802
Y+S G G YMF+IDD V+DAT G+ A INH CEPNCYS+V+ + G +HIIIFA R
Sbjct: 60 YDSR-GIGCYMFKIDDNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRR 118
Query: 803 IKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I Q EELTYDY+F DE++ C CG RCR +N
Sbjct: 119 IVQGEELTYDYKFPFEDEKIPCSCGSKRCRKYLN 152
>gi|348530102|ref|XP_003452550.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like
[Oreochromis niloticus]
Length = 399
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 105/277 (37%), Positives = 148/277 (53%), Gaps = 54/277 (19%)
Query: 571 SFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEP---------YNYFGRR 621
++CK+H+ + ++E L P NPSGCAR+E +N+ +
Sbjct: 166 AYCKRHRFRFHRCDDIEEEL------------PLNPSGCARAEVCTRKATFDMFNFLASQ 213
Query: 622 GRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFL 681
R+ P+ L F E + L S R R + L
Sbjct: 214 HRELPDILGP------FDEEEDEF------------PLKSSR-----------RATSSEL 244
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
MA +++H+++ ++ + +S IHG G+F K AG+MVIEY G ++R + D+RE +
Sbjct: 245 PMAMRFRHLEKISKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYAGTVIRAVLTDKREKY 304
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+V+G +HI+IFA R
Sbjct: 305 -YDS-KGIGCYMFRIDDFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALR 362
Query: 802 DIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
I + EELTYDY+F DE +L C CG RCR +N
Sbjct: 363 KIYRGEELTYDYKFPIEDESNKLLCNCGARRCRRYLN 399
>gi|307182871|gb|EFN69932.1| Histone-lysine N-methyltransferase SETD1B [Camponotus floridanus]
Length = 1422
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 143/244 (58%), Gaps = 20/244 (8%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKR---LFVENQPYLVGGYCQNGLSGNTLPSIR 663
SG AR+E Y R + + + A S++R + Y+ G NG N+ +
Sbjct: 1185 SGSARTEGYYKVDIREKAKHKHHYAQSIQRSNDVEDSTNSYIGGDSVMNGPKNNS----K 1240
Query: 664 VIGSKFSFSLHRDAPN----FLSM------ADKYKHMKETFRKR-LAFGKSGIHGFGIFA 712
+ K +L R+A + L+ +D K + FRK+ L F KSGIH +G+FA
Sbjct: 1241 ALTGKMQ-ALSREARSNQRRLLTAFGIDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFA 1299
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
P A +MVIEY G++VRP +AD RE + +G+ +Y+FRID + +IDAT+ G++A
Sbjct: 1300 MEPIAADEMVIEYVGQMVRPVVADLREAQYEATGIGS-SYLFRIDLDTIIDATKCGNLAR 1358
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCR 832
INHSC PNCY++VI++ + I+I++K+ I EE+TYDY+F D+++ C CG P+CR
Sbjct: 1359 FINHSCNPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPLEDDKIPCLCGAPQCR 1418
Query: 833 GVVN 836
G +N
Sbjct: 1419 GTLN 1422
>gi|260836403|ref|XP_002613195.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
gi|229298580|gb|EEN69204.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
Length = 313
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 135/237 (56%), Gaps = 26/237 (10%)
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
PP NP+GC R+E RR + + + A+ ++L P L + +
Sbjct: 101 PPVNPNGCCRAE---VLSRRSKFDMFSWLASQHRQL-----PVLDYNEEEEEVQHK---- 148
Query: 662 IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDM 721
S R L MA +++++++T R+ + +S IHG G+F K +G+M
Sbjct: 149 ----------STRRPTSMDLPMAMRFRYLRQTSREAVGVYRSPIHGRGLFCKRNIDSGEM 198
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEY G ++R + D+RE++ YNS G G YMFRIDD V+DAT G+ A INHSC+PN
Sbjct: 199 VIEYAGMVIRSVLTDKRENY-YNSK-GIGCYMFRIDDYEVVDATMHGNAARFINHSCDPN 256
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
CYSRVI V G +HI+IFA R I + EELTYDY+F D+ ++ C CG RCR +N
Sbjct: 257 CYSRVIQVEGKKHIVIFAMRKIYKGEELTYDYKFPIEDQNSKIDCTCGSKRCRKYLN 313
>gi|255557755|ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
gi|223540953|gb|EEF42511.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
Length = 1125
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 119/379 (31%), Positives = 166/379 (43%), Gaps = 62/379 (16%)
Query: 460 VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYG 519
+ G + T +W H+ C +W P T +V M D ++ S R ++CSIC G
Sbjct: 778 ITAGVLDSTVK-QWVHMVCGLWTPGTRCPNVNTMSAFD-VSGASCPRANVVCSICDRPGG 835
Query: 520 ACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQP 579
+CIQC C + +HP CA GL L S E +++ + C H
Sbjct: 836 SCIQCRVANCSIQFHPWCAHQKGL-----------LQSEAEGVDNENVGFYGRCVLHATY 884
Query: 580 LNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFV 639
A D + + P CAR+E Y +GRK + F
Sbjct: 885 PTIESACDSAIFEAG------YPAEKEVSCARTEGY-----KGRK----------RDGFW 923
Query: 640 ENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNF-------LSMADK-YKHMK 691
N G SG +P +F +H + L M++K Y K
Sbjct: 924 HNTNSQ-----SKGKSGCLVPQ-----EQFDAWVHINGQKSCAQGILKLPMSEKEYDCRK 973
Query: 692 ETFR-------KRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
E R K L KSGIH G++ G+MV+EY GE+V +AD+RE+ +
Sbjct: 974 EYTRYKQGKAWKHLVVYKSGIHALGLYTARFISRGEMVVEYVGEIVGLRVADKRENEYQS 1033
Query: 745 S---LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
+ Y FRID E +IDAT G IA +NHSC PNC ++VISV D+ ++ FA+R
Sbjct: 1034 GRKLQYKSACYFFRIDKENIIDATHKGGIARFVNHSCLPNCVAKVISVRNDKKVVFFAER 1093
Query: 802 DIKQWEELTYDYRFFSIDE 820
DI EE+TYDY F DE
Sbjct: 1094 DIYPGEEITYDYHFNHEDE 1112
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 36/72 (50%), Gaps = 4/72 (5%)
Query: 395 DLDK-CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPP 453
D+D CSVC + + N L+C +C + VH CYG + V W C CR A +
Sbjct: 613 DMDSFCSVCRSSNKDEVNCLLECRRCSIRVHQACYG-VSRVPKGHWYCRPCRTSAKD--I 669
Query: 454 PCCLCPVVGGAM 465
C LC GGAM
Sbjct: 670 VCVLCGYGGGAM 681
>gi|356562088|ref|XP_003549306.1| PREDICTED: uncharacterized protein LOC100816713 [Glycine max]
Length = 992
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 126/399 (31%), Positives = 174/399 (43%), Gaps = 75/399 (18%)
Query: 460 VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYG 519
+ G + PT +W H+ C +W P T +V M D ++ VS+ R ++CSIC G
Sbjct: 647 ITEGVLDPTVK-QWIHMVCGLWTPRTRCPNVDTMSAFD-VSGVSRPRADVVCSICNRWGG 704
Query: 520 ACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQP 579
+CI+C C V +HP CA L L S E D+ I C H
Sbjct: 705 SCIECRIADCSVKFHPWCAHQKNL-----------LQSETEGINDEKIGFYGRCMLH--- 750
Query: 580 LNDRLAVDERLVQVTRRCCDYIPPSNPSG--------CARSEPYNYFGRRGRKEPEALAA 631
+ RC P + G CAR E Y GRR
Sbjct: 751 ------------TIEPRCLFIYDPLDEIGSQEQKEFTCARVEGYK--GRRWDG------- 789
Query: 632 ASLKRLFVENQPYLVGGYCQNG--LSGNTLPS-IRVIGSKFSFSLHRDAPNFLSMADKYK 688
F NQ CQ G + L + I + G K + P F + ++
Sbjct: 790 ------FQNNQ-------CQGGCLVPEEQLNAWIHINGQKL---CSQGLPKFPDLDIEHD 833
Query: 689 HMKETFRKRLAFG-------KSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
KE R + A G KS IH G++ G+MV+EY GE+V +AD+RE
Sbjct: 834 CRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKE 893
Query: 742 IYNS---LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
+ + Y FRID E +IDATR G IA +NHSC PNC ++VI+V ++ ++
Sbjct: 894 YQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFL 953
Query: 799 AKRDIKQWEELTYDYRFFSIDE-QLACYCGFPRCRGVVN 836
A+RDI EE+TYDY F DE ++ CYC CR +N
Sbjct: 954 AERDIFPGEEITYDYHFNHEDEGKIPCYCYSKNCRRYMN 992
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 44/108 (40%), Gaps = 5/108 (4%)
Query: 358 ILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCD 417
++ +TK + LC +K R+ G+ + D C VC + N L+C
Sbjct: 453 VMDMTKCAQDQEPGLC--GTKSRNSIQGHTSISTINSDAF-CCVCRRSTNDKINCLLECS 509
Query: 418 KCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAM 465
+C + VH CYG W C CR + C LC GGAM
Sbjct: 510 RCLIRVHQACYGVSTLPKKSSWCCRPCRTNSKN--IACVLCGYGGGAM 555
>gi|356554223|ref|XP_003545448.1| PREDICTED: uncharacterized protein LOC100812602 [Glycine max]
Length = 1985
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 124/402 (30%), Positives = 172/402 (42%), Gaps = 81/402 (20%)
Query: 460 VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYG 519
+ + PT +W H+ C +W P T +V M D ++ VS+ R ++C IC G
Sbjct: 1640 ITEAVLDPTVK-QWIHMVCGLWTPGTRCPNVDTMSAFD-VSGVSRPRADVVCYICNRWGG 1697
Query: 520 ACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQP 579
+CI+C C + +HP CA L L S E +D+ I C H
Sbjct: 1698 SCIECRIADCSIKFHPWCAHQKNL-----------LQSETEGIDDEKIGFYGRCTLHI-- 1744
Query: 580 LNDRLAVDERLVQVTRRCCDYIPPSNPSG--------CARSEPYNYFGRRGRKEPEALAA 631
+ RC P + G CAR+E Y GRR
Sbjct: 1745 -------------IEPRCLPIYDPLDEIGSQEEKEFTCARAEGYK--GRRWDG------- 1782
Query: 632 ASLKRLFVENQPYLVGGYCQNGLSGNTLPS------IRVIGSKFSFSLHRDAPNFLSMAD 685
F NQ CQ G +P I + G K R P F +
Sbjct: 1783 ------FQNNQ-------CQGGC---LVPEEQLNAWIHINGQKL---CSRGLPKFPDLDI 1823
Query: 686 KYKHMKETFRKRLAFG-------KSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
++ KE R + A G KS IH G++ G+MV+EY GE+V +AD+R
Sbjct: 1824 EHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKR 1883
Query: 739 EHFIYNS---LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHI 795
E + Y FRID E +IDATR G IA +NHSC PNC ++VI+V ++ +
Sbjct: 1884 EKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKV 1943
Query: 796 IIFAKRDIKQWEELTYDYRFFSIDE-QLACYCGFPRCRGVVN 836
+ A+RDI EE+TYDY F DE ++ CYC CR +N
Sbjct: 1944 VFLAERDIFPGEEITYDYHFNHEDEGKIPCYCNSKNCRRYMN 1985
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 28/67 (41%), Gaps = 2/67 (2%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N L+C +C + VH CYG W C CR + C LC
Sbjct: 1486 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKN--IVCVLC 1543
Query: 459 PVVGGAM 465
GGAM
Sbjct: 1544 GYGGGAM 1550
>gi|157126650|ref|XP_001654691.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108873214|gb|EAT37439.1| AAEL010578-PA [Aedes aegypti]
Length = 172
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 80/162 (49%), Positives = 110/162 (67%), Gaps = 2/162 (1%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R + + L MA +Y+ +KET ++ + +S IHG G+F AG+MVIEY GEL+R ++
Sbjct: 13 RGSGSNLPMAMRYRTLKETSKESVGVYRSHIHGRGLFCNRDIEAGEMVIEYAGELIRSTL 72
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + Y+S G G YMF+ID+ V+DAT G+ A INHSCEPNCYS+V+ + G +H
Sbjct: 73 TDKRERY-YDSR-GIGCYMFKIDEHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKH 130
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
IIIFA R I Q EELTYDY+F D ++ C CG +CR +N
Sbjct: 131 IIIFALRRIVQGEELTYDYKFPFEDVKIPCSCGSKKCRKYLN 172
>gi|270015132|gb|EFA11580.1| trithorax [Tribolium castaneum]
Length = 2343
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 107/162 (66%), Gaps = 2/162 (1%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R + N L MA +++ +K T + + +S IH G+F AG+MVIEY+GE++R +
Sbjct: 2184 RGSVNNLPMAMRFRQLKLTSKYSVGVYRSKIHRRGLFCLRDFEAGEMVIEYSGEVIRSVL 2243
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + YNS G G YMFRIDD V+DAT G+ A INHSC+PNCYS+V+ + G +H
Sbjct: 2244 TDKREKY-YNSK-GIGCYMFRIDDNLVVDATMTGNAARFINHSCDPNCYSKVVEILGHKH 2301
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
IIIFA R I EELTYDY+F ++++ C CG RCR +N
Sbjct: 2302 IIIFALRRIICGEELTYDYKFPIEEDKIPCTCGTRRCRKFLN 2343
>gi|358333784|dbj|GAA31138.2| histone-lysine N-methyltransferase SETD1B [Clonorchis sinensis]
Length = 1685
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 96/260 (36%), Positives = 146/260 (56%), Gaps = 25/260 (9%)
Query: 601 IPPSNPSGCARSEPYNYF------------GRR-----GRKEPEALAAASLK-RLFVENQ 642
+PP + SGCAR++ + + GR GR+ P L A+++ +L V
Sbjct: 1427 LPPIHSSGCARTQGFYWMPTEERFRRAWSVGRSLIAEDGRRRPIPLMPATVEAQLGVNAV 1486
Query: 643 PYLVGGYCQNGLSGNTLPSIRVIGSKFS---FSLHRDAPNF--LSMADKYKHMKETFRKR 697
+GG ++ L+ + + ++F + R F + D K + FRK+
Sbjct: 1487 IQAIGGSIEDRLTEQASEAKKKQLTQFREARSAQRRLLAQFQDIETGDLLKFNQLKFRKK 1546
Query: 698 -LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRI 756
L F KS IH +G+ A P A +MVIEY G++VR S+A+ RE +G G+Y+FRI
Sbjct: 1547 QLIFAKSPIHAWGLIALEPIAAEEMVIEYVGQVVRKSVAELRERQYEAKGIG-GSYLFRI 1605
Query: 757 DDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFF 816
DD+ VIDAT G+ INHSC+PNCY+++I+V G + I+I++KRDI EE+TYDY+F
Sbjct: 1606 DDDFVIDATMCGNNGRFINHSCQPNCYAKIITVEGKKKIVIYSKRDINVMEEITYDYKFP 1665
Query: 817 SIDEQLACYCGFPRCRGVVN 836
+E++ C CG CRG +N
Sbjct: 1666 YEEEKIPCQCGASTCRGTLN 1685
>gi|91076142|ref|XP_970289.1| PREDICTED: similar to mixed-lineage leukemia protein, mll [Tribolium
castaneum]
Length = 1824
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 107/162 (66%), Gaps = 2/162 (1%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R + N L MA +++ +K T + + +S IH G+F AG+MVIEY+GE++R +
Sbjct: 1665 RGSVNNLPMAMRFRQLKLTSKYSVGVYRSKIHRRGLFCLRDFEAGEMVIEYSGEVIRSVL 1724
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + YNS G G YMFRIDD V+DAT G+ A INHSC+PNCYS+V+ + G +H
Sbjct: 1725 TDKREKY-YNSK-GIGCYMFRIDDNLVVDATMTGNAARFINHSCDPNCYSKVVEILGHKH 1782
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
IIIFA R I EELTYDY+F ++++ C CG RCR +N
Sbjct: 1783 IIIFALRRIICGEELTYDYKFPIEEDKIPCTCGTRRCRKFLN 1824
>gi|449679772|ref|XP_002161520.2| PREDICTED: histone-lysine N-methyltransferase MLL-like [Hydra
magnipapillata]
Length = 281
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 92/232 (39%), Positives = 128/232 (55%), Gaps = 30/232 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NP+GCARSE F + K+ A A+ ++ C N + V
Sbjct: 80 NPTGCARSE---IFFSKAEKDMFAWLASPYRK---------PPKSCANDTEEDV-----V 122
Query: 665 IGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIE 724
G P L +A KY+ +K + + +S I G G+F K AG+M+IE
Sbjct: 123 FG-----------PQELPVAMKYRLLKNLAKDNVGVYRSKIQGRGLFCKKTIEAGEMIIE 171
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y+GE +R S+ DRRE Y G G YMFRIDD V+DAT G+ A INHSCEPNC+S
Sbjct: 172 YSGEKIRSSLTDRREK--YYEKKGYGCYMFRIDDTDVVDATTKGNAARFINHSCEPNCFS 229
Query: 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
R+IS++G + III+A++ + EELTYDY+F D++L C+CG +CR +N
Sbjct: 230 RIISIDGCKKIIIYAQKRVTVGEELTYDYKFAIEDDKLPCFCGAKKCRKYLN 281
>gi|332016392|gb|EGI57305.1| Histone-lysine N-methyltransferase SETD1B [Acromyrmex echinatior]
Length = 1513
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 73/152 (48%), Positives = 108/152 (71%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KSGIH +G+FA P A +MVIEY G++VRP +AD RE
Sbjct: 1365 KFNQLK--FRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEYVGQMVRPVVADLREAQYEA 1422
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
+ +G+ +Y+FRID + +IDAT+ G++A INHSC PNCY++VI++ + I+I++K+ I
Sbjct: 1423 TGIGS-SYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIG 1481
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F D+++ C CG P+CRG +N
Sbjct: 1482 VNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1513
>gi|307195485|gb|EFN77371.1| Histone-lysine N-methyltransferase SETD1B [Harpegnathos saltator]
Length = 1490
Score = 157 bits (396), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 73/152 (48%), Positives = 108/152 (71%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KSGIH +G+FA P A +MVIEY G++VRP +AD RE
Sbjct: 1342 KFNQLK--FRKKQLKFAKSGIHDWGLFAMEPIAADEMVIEYVGQMVRPVVADLRESQYEA 1399
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
+ +G+ +Y+FRID + +IDAT+ G++A INHSC PNCY++VI++ + I+I++K+ I
Sbjct: 1400 TGIGS-SYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQPIG 1458
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F D+++ C CG P+CRG +N
Sbjct: 1459 VNEEITYDYKFPLEDDKIPCLCGAPQCRGTLN 1490
>gi|115495457|ref|NP_083550.2| histone-lysine N-methyltransferase MLL4 [Mus musculus]
gi|341940998|sp|O08550.3|MLL4_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 4
homolog; AltName: Full=Trithorax homolog 2; AltName:
Full=WW domain-binding protein 7; Short=WBP-7
Length = 2713
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 96/246 (39%), Positives = 131/246 (53%), Gaps = 43/246 (17%)
Query: 602 PPSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN 652
PP NP G AR+E Y N+ + R PE G C
Sbjct: 2500 PPLNPHGAARAEVYLRKCTFDMFNFLASQHRVLPE-------------------GATCDE 2540
Query: 653 GLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
+++ ++ + SL L MA +++H+K+T ++ + +S IHG G+F
Sbjct: 2541 EED-----EVQLRSTRRATSLE------LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFC 2589
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
K AG+MVIEY+G ++R + D+RE F G G YMFR+DD V+DAT G+ A
Sbjct: 2590 KRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGK--GIGCYMFRMDDFDVVDATMHGNAAR 2647
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPR 830
INHSCEPNC+SRVI V G +HI+IFA R I + EELTYDY+F D +L C CG R
Sbjct: 2648 FINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNCGAKR 2707
Query: 831 CRGVVN 836
CR +N
Sbjct: 2708 CRRFLN 2713
>gi|56744180|dbj|BAD81031.1| mixed lineage leukemia 2 [Mus musculus]
Length = 2713
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 96/246 (39%), Positives = 131/246 (53%), Gaps = 43/246 (17%)
Query: 602 PPSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN 652
PP NP G AR+E Y N+ + R PE G C
Sbjct: 2500 PPLNPHGAARAEVYLRKCTFDMFNFLASQHRVLPE-------------------GATCDE 2540
Query: 653 GLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
+++ ++ + SL L MA +++H+K+T ++ + +S IHG G+F
Sbjct: 2541 EED-----EVQLRSTRRATSLE------LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFC 2589
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
K AG+MVIEY+G ++R + D+RE F G G YMFR+DD V+DAT G+ A
Sbjct: 2590 KRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGK--GIGCYMFRMDDFDVVDATMHGNAAR 2647
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPR 830
INHSCEPNC+SRVI V G +HI+IFA R I + EELTYDY+F D +L C CG R
Sbjct: 2648 FINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNCGAKR 2707
Query: 831 CRGVVN 836
CR +N
Sbjct: 2708 CRRFLN 2713
>gi|156354350|ref|XP_001623359.1| predicted protein [Nematostella vectensis]
gi|156210050|gb|EDO31259.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 108/158 (68%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKR--LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
L MA +Y+H+K+ + +A +SGIHG G++ AG+MVIEY+G L+R ++ D+R
Sbjct: 156 LPMAMRYRHLKQNNYNKGTIAVFRSGIHGRGLYCTRHIAAGEMVIEYSGMLIRSTLTDKR 215
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E Y G G YMFRID V+DAT +G+ A INHSCEPNCYSRV++++G++ I+IF
Sbjct: 216 E--AYYESKGIGCYMFRIDGTYVVDATTSGNAARFINHSCEPNCYSRVVTIDGNKKILIF 273
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
A + I + EELTYDY+F DE+L C+C RCR +N
Sbjct: 274 ASKSISRGEELTYDYKFPLEDEKLPCHCKSKRCRKYLN 311
>gi|348563138|ref|XP_003467365.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Cavia
porcellus]
Length = 2692
Score = 156 bits (395), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2537 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2596
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2597 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2654
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2655 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2692
>gi|431918577|gb|ELK17795.1| Histone-lysine N-methyltransferase MLL4 [Pteropus alecto]
Length = 3017
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2862 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2921
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2922 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2979
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2980 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 3017
>gi|402905199|ref|XP_003915410.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Papio
anubis]
Length = 2716
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2561 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2620
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2621 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2678
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2679 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2716
>gi|327288610|ref|XP_003229019.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Anolis carolinensis]
Length = 3817
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 75/158 (47%), Positives = 106/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R + D+RE
Sbjct: 3662 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSILTDKREK 3721
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ + +G YMFRIDD V+DAT G+ A INHSCEPNC+SRVI+++G +HI+IFA
Sbjct: 3722 YYDSKDIGC--YMFRIDDAEVVDATMHGNAARFINHSCEPNCFSRVINIDGQKHIVIFAM 3779
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 3780 RKIFRGEELTYDYKFPIEDASNKLPCNCGAKKCRRFLN 3817
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLL-CSICGVSYGACIQCSNTTCRV 531
WAH+ CA+W E D ++ + ++ R K L C C GA + C T+C
Sbjct: 1801 WAHVNCALWSAEVFEDDGGSLKNV----HMALIRGKQLRCEFC-QRPGATVGCCLTSCTS 1855
Query: 532 AYHPLCARAAGLCVELEDE 550
YH +C+RA C+ LED+
Sbjct: 1856 NYHFMCSRAKN-CIFLEDK 1873
>gi|296233585|ref|XP_002807874.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4 [Callithrix jacchus]
Length = 2660
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2505 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2564
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2565 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2622
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2623 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2660
>gi|432100936|gb|ELK29286.1| Histone-lysine N-methyltransferase MLL4 [Myotis davidii]
Length = 2566
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2411 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2470
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2471 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2528
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2529 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2566
>gi|149056302|gb|EDM07733.1| rCG63528 [Rattus norvegicus]
Length = 2270
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2115 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2174
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2175 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2232
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2233 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2270
>gi|351711122|gb|EHB14041.1| Histone-lysine N-methyltransferase MLL4, partial [Heterocephalus
glaber]
Length = 2592
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2437 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2496
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2497 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2554
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2555 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2592
>gi|33990004|gb|AAH56344.1| Wbp7 protein, partial [Mus musculus]
Length = 2013
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 96/246 (39%), Positives = 131/246 (53%), Gaps = 43/246 (17%)
Query: 602 PPSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN 652
PP NP G AR+E Y N+ + R PE G C
Sbjct: 1800 PPLNPHGAARAEVYLRKCTFDMFNFLASQHRVLPE-------------------GATCDE 1840
Query: 653 GLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
+++ ++ + SL L MA +++H+K+T ++ + +S IHG G+F
Sbjct: 1841 EED-----EVQLRSTRRATSLE------LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFC 1889
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
K AG+MVIEY+G ++R + D+RE F G G YMFR+DD V+DAT G+ A
Sbjct: 1890 KRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGK--GIGCYMFRMDDFDVVDATMHGNAAR 1947
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPR 830
INHSCEPNC+SRVI V G +HI+IFA R I + EELTYDY+F D +L C CG R
Sbjct: 1948 FINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNCGAKR 2007
Query: 831 CRGVVN 836
CR +N
Sbjct: 2008 CRRFLN 2013
>gi|7662046|ref|NP_055542.1| histone-lysine N-methyltransferase MLL4 [Homo sapiens]
gi|12643900|sp|Q9UMN6.1|MLL4_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL4; AltName:
Full=Lysine N-methyltransferase 2B; Short=KMT2B; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein
4; AltName: Full=Trithorax homolog 2; AltName: Full=WW
domain-binding protein 7; Short=WBP-7
gi|5123787|emb|CAB45385.1| trithorax homologue 2 [Homo sapiens]
Length = 2715
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2560 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2619
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2620 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2677
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2678 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2715
>gi|332855019|ref|XP_512597.3| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Pan
troglodytes]
Length = 2526
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2371 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2430
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2431 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2488
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2489 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2526
>gi|312384250|gb|EFR29018.1| hypothetical protein AND_02361 [Anopheles darlingi]
Length = 2074
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 73/152 (48%), Positives = 107/152 (70%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KS IH +G+FA P A +MVIEY G++VRPS+AD RE Y
Sbjct: 1926 KFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMVRPSVADLRE-TKYE 1982
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
++ +Y+FRID E +IDAT+ G++A INHSC PNCY+++I++ ++ I+I++K+ I
Sbjct: 1983 AIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKIITIESEKKIVIYSKQPIG 2042
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F +E++ C CG P CRG +N
Sbjct: 2043 VNEEITYDYKFPLEEEKIRCLCGAPGCRGTLN 2074
>gi|5923931|gb|AAD56420.1|AF186605_1 MLL2 protein [Homo sapiens]
Length = 2605
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2450 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2509
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2510 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2567
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2568 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2605
>gi|335289510|ref|XP_003127115.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Sus scrofa]
Length = 2721
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2566 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2625
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2626 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2683
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2684 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2721
>gi|301771069|ref|XP_002920938.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Ailuropoda melanoleuca]
Length = 2611
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2456 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2515
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2516 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2573
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2574 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2611
>gi|344298323|ref|XP_003420843.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Loxodonta africana]
Length = 2200
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2045 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2104
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2105 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2162
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2163 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2200
>gi|322792358|gb|EFZ16342.1| hypothetical protein SINV_07789 [Solenopsis invicta]
Length = 3272
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 94/236 (39%), Positives = 133/236 (56%), Gaps = 30/236 (12%)
Query: 601 IPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLP 660
+P + +G AR+EP+ +GRK + + L +++P+ P
Sbjct: 3067 LPAACINGAARAEPF-----KGRKVHDMFSW-----LASQHRPH---------------P 3101
Query: 661 SIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGD 720
+I I S R A L MA +++ +KET + + S IHG G+F G+
Sbjct: 3102 NIIAISETES---RRAASTNLPMAMRFRILKETSKASVGVYYSRIHGRGLFCLRDIEPGE 3158
Query: 721 MVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEP 780
MVIEY GE++R S+ D+RE + Y+S G YMF+IDD V+DAT G+ A INHSCEP
Sbjct: 3159 MVIEYAGEVIRSSLTDKREKY-YDSK-NIGCYMFKIDDHLVVDATMKGNAARFINHSCEP 3216
Query: 781 NCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
NCYSRV+ + G +HI+IFA R I Q EELTYDY+F D ++ C CG +CR +N
Sbjct: 3217 NCYSRVVDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIPCTCGSRKCRKYLN 3272
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 44/101 (43%), Gaps = 21/101 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRVSK- 504
CCLC +G + T +GR W H CA+W E E IDG L V
Sbjct: 1334 CCLCKGLGDGHE-TKEGRLLYCGQNEWVHANCALWSNEV-------FEEIDGSLQNVHSA 1385
Query: 505 -DRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
R +L+ CS CG GA I C C +H CAR GL
Sbjct: 1386 ISRGRLIRCSECG-KKGASIGCCAKNCSNTFHFPCARNVGL 1425
>gi|441627688|ref|XP_003280142.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Nomascus leucogenys]
Length = 2433
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2278 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2337
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2338 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2395
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2396 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2433
>gi|71891784|dbj|BAA20763.3| KIAA0304 protein [Homo sapiens]
Length = 2415
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2260 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2319
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2320 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2377
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2378 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2415
>gi|62862148|ref|NP_001015221.1| Set1, isoform A [Drosophila melanogaster]
gi|62862150|ref|NP_001015222.1| Set1, isoform B [Drosophila melanogaster]
gi|161076059|ref|NP_001104406.1| Set1, isoform C [Drosophila melanogaster]
gi|281366745|ref|NP_001163846.1| Set1, isoform D [Drosophila melanogaster]
gi|281366747|ref|NP_001163847.1| Set1, isoform E [Drosophila melanogaster]
gi|281366749|ref|NP_001163848.1| Set1, isoform F [Drosophila melanogaster]
gi|281366751|ref|NP_001163849.1| Set1, isoform G [Drosophila melanogaster]
gi|281366753|ref|NP_001163850.1| Set1, isoform H [Drosophila melanogaster]
gi|281366755|ref|NP_001163851.1| Set1, isoform I [Drosophila melanogaster]
gi|51951109|gb|EAL24598.1| Set1, isoform A [Drosophila melanogaster]
gi|51951110|gb|EAL24599.1| Set1, isoform B [Drosophila melanogaster]
gi|158529717|gb|EDP28071.1| Set1, isoform C [Drosophila melanogaster]
gi|281309231|gb|EFA98694.1| Set1, isoform D [Drosophila melanogaster]
gi|281309232|gb|EFA98695.1| Set1, isoform E [Drosophila melanogaster]
gi|281309233|gb|EFA98696.1| Set1, isoform F [Drosophila melanogaster]
gi|281309234|gb|EFA98697.1| Set1, isoform G [Drosophila melanogaster]
gi|281309235|gb|EFA98698.1| Set1, isoform H [Drosophila melanogaster]
gi|281309236|gb|EFA98699.1| Set1, isoform I [Drosophila melanogaster]
Length = 1641
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 92/239 (38%), Positives = 139/239 (58%), Gaps = 16/239 (6%)
Query: 607 SGCARSEP-YNYFGRRGRKEPEALAAASLKRLFVEN---QPYLVGGYCQNGLSGNTLPSI 662
+GCAR+E Y R K A A+ + F E+ +P + + N L + +
Sbjct: 1410 TGCARTEGFYKLDVREKAKHKYHYAKANTEDSFNEDRSDEPTALTNHHHNKL----ISKM 1465
Query: 663 RVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
+ I + + R F SM + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 1466 QGISREARSNQRRLLTAFGSMGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIA 1523
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A INHS
Sbjct: 1524 ADEMVIEYVGQMIRPVVADLRE-TKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1582
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY++VI++ ++ I+I++K+ I EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1583 CNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1641
>gi|297276803|ref|XP_001112093.2| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Macaca
mulatta]
Length = 2789
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2634 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2693
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2694 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2751
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2752 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2789
>gi|403293026|ref|XP_003937525.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
boliviensis boliviensis]
Length = 2665
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2510 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2569
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2570 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2627
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2628 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2665
>gi|397490588|ref|XP_003816282.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL4-like [Pan paniscus]
Length = 2776
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2621 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2680
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2681 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2738
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2739 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2776
>gi|359318839|ref|XP_003432729.2| PREDICTED: histone-lysine N-methyltransferase MLL4, partial [Canis
lupus familiaris]
Length = 2713
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2558 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2617
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2618 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2675
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2676 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2713
>gi|426388428|ref|XP_004060643.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Gorilla
gorilla gorilla]
Length = 2536
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2381 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2440
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2441 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2498
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2499 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2536
>gi|444509617|gb|ELV09373.1| Histone-lysine N-methyltransferase MLL4, partial [Tupaia chinensis]
Length = 2209
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2054 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2113
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2114 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2171
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2172 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2209
>gi|358416718|ref|XP_003583467.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
Length = 2688
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2533 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2592
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2593 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2650
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2651 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2688
>gi|40225368|gb|AAH09337.2| MLL4 protein [Homo sapiens]
Length = 795
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 640 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 699
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 700 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 757
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 758 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 795
>gi|432875260|ref|XP_004072753.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Oryzias
latipes]
Length = 1547
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 129/231 (55%), Gaps = 11/231 (4%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+G ARSE + R+ E + S RL N P+ Q LPS G
Sbjct: 1327 TGSARSEGFYKISRK-----EKMKNLSNSRLST-NPPFTDA---QGARVPAQLPSSLRSG 1377
Query: 667 SKFSFSLHRDAPNFLSMADKYKHMKETFRK-RLAFGKSGIHGFGIFAKHPHRAGDMVIEY 725
S F R +F +D + + +RK R+ F +S IH +G+FA P A +MVIEY
Sbjct: 1378 SDFRSEQRRLLSSFSCDSDLVRFNQLKYRKKRIRFSRSHIHEWGLFAMEPIAADEMVIEY 1437
Query: 726 TGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSR 785
GE +R IADRRE +G G+Y+FRID + +IDAT+ G++A INHSC PNCY++
Sbjct: 1438 VGETIRQVIADRREQRYEEEGIG-GSYLFRIDQDTIIDATKCGNLARFINHSCNPNCYAK 1496
Query: 786 VISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
VISV + I+I++++ I EE+TY+Y+F D ++ C CG CRG +N
Sbjct: 1497 VISVESQKKIVIYSRQPISINEEITYNYKFPIEDTKIPCLCGAENCRGSLN 1547
>gi|359075420|ref|XP_003587289.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Bos taurus]
Length = 2711
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2556 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2615
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2616 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2673
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2674 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2711
>gi|440894918|gb|ELR47236.1| Histone-lysine N-methyltransferase MLL4, partial [Bos grunniens
mutus]
Length = 2524
Score = 155 bits (393), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2369 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2428
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2429 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2486
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2487 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2524
>gi|38565948|gb|AAH62210.1| Wbp7 protein, partial [Mus musculus]
Length = 442
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 127/246 (51%), Gaps = 43/246 (17%)
Query: 602 PPSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN 652
PP NP G AR+E Y N+ + R PE G C
Sbjct: 229 PPLNPHGAARAEVYLRKCTFDMFNFLASQHRVLPE-------------------GATCDE 269
Query: 653 GLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
L S R R L MA +++H+K+T ++ + +S IHG G+F
Sbjct: 270 EEDEVQLRSTR-----------RATSLELPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFC 318
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
K AG+MVIEY+G ++R + D+RE F G G YMFR+DD V+DAT G+ A
Sbjct: 319 KRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGK--GIGCYMFRMDDFDVVDATMHGNAAR 376
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPR 830
INHSCEPNC+SRVI V G +HI+IFA R I + EELTYDY+F D +L C CG R
Sbjct: 377 FINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNCGAKR 436
Query: 831 CRGVVN 836
CR +N
Sbjct: 437 CRRFLN 442
>gi|120974668|gb|ABM46716.1| MLL [Gorilla gorilla]
Length = 338
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 183 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 242
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 243 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 300
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 301 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 338
>gi|3309543|gb|AAC41377.1| MLL [Takifugu rubripes]
Length = 4498
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 104/158 (65%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L +A ++K +K T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 4343 LPLAVRFKQLKVTSKESVGVYRSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREK 4402
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+V+G +HI+IFA
Sbjct: 4403 YYDGK--GIGCYMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFAS 4460
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C C +CR +N
Sbjct: 4461 RRIYRGEELTYDYKFPIEDASSKLPCNCNSKKCRKFLN 4498
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 31/83 (37%), Gaps = 18/83 (21%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGV----LWLCNLCRPGAPEPPPP 454
C C + L+CDKCR H C G P +W+CN C
Sbjct: 1718 CQACGRQHQKTKQQLLECDKCRNSYHPECLGPNHPTRPTKKKRVWVCNKCV--------R 1769
Query: 455 CCLCPVVGGAMKPTT--DGRWAH 475
C C GA KP D +W+H
Sbjct: 1770 CKCC----GATKPGKSWDAQWSH 1788
Score = 39.3 bits (90), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVA 532
W H+ CA+W E D ++ + K +L C C + GA + C T+C
Sbjct: 2138 WTHINCALWSSEVFEDDEGSLKNVHTAVLRGK---QLHCQKCQLP-GATVSCCLTSCTSN 2193
Query: 533 YHPLCARAAGLCVELEDE 550
YH +CAR CV LED+
Sbjct: 2194 YHFMCARHC-HCVFLEDK 2210
>gi|410910074|ref|XP_003968515.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Takifugu rubripes]
Length = 4478
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 104/158 (65%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L +A ++K +K T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 4323 LPLAVRFKQLKVTSKESVGVYRSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREK 4382
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+V+G +HI+IFA
Sbjct: 4383 YYDGK--GIGCYMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFAS 4440
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C C +CR +N
Sbjct: 4441 RRIYRGEELTYDYKFPIEDASSKLPCNCNSKKCRKFLN 4478
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 32/83 (38%), Gaps = 19/83 (22%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGV----LWLCNLCRPGAPEPPPP 454
C C + L L+CDKCR H C G P +W+CN C
Sbjct: 1718 CQACGRQHQKTKQL-LECDKCRNSYHPECLGPNHPTRPTKKKRVWVCNKCV--------R 1768
Query: 455 CCLCPVVGGAMKPTT--DGRWAH 475
C C GA KP D +W+H
Sbjct: 1769 CKCC----GATKPGKSWDAQWSH 1787
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVA 532
W H+ CA+W E D ++ + K +L C C + GA + C T+C
Sbjct: 2143 WTHINCALWSSEVFEDDEGSLKNVHTAVLRGK---QLHCQKCQLP-GATVSCCLTSCTSN 2198
Query: 533 YHPLCARAAGLCVELEDE 550
YH +CAR CV LED+
Sbjct: 2199 YHFMCARHC-HCVFLEDK 2215
>gi|124111218|gb|ABM91999.1| MLL [Pan troglodytes]
Length = 338
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 94/246 (38%), Positives = 134/246 (54%), Gaps = 43/246 (17%)
Query: 602 PPSNPSGCARSEP---------YNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN 652
PP NP G AR+E +N+ + R+ PE Y N
Sbjct: 125 PPLNPHGSARAEVHLRQSAFDMFNFLASKHRQPPE---------------------YNPN 163
Query: 653 GLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
+++ ++ + S+ L M +++H+K+T ++ + +S IHG G+F
Sbjct: 164 DXXEE---EVQLKSARRATSMD------LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFC 214
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
K AG+MVIEY G ++R D+RE + Y+S G G YMFRIDD V+DAT G+ A
Sbjct: 215 KRNIDAGEMVIEYAGNVIRSIQTDKREKY-YDS-KGIGCYMFRIDDSEVVDATMHGNAAR 272
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPR 830
INHSCEPNCYSRVI+++G +HI+IFA R I + EELTYDY+F D +L C CG +
Sbjct: 273 FINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDASNKLPCNCGAKK 332
Query: 831 CRGVVN 836
CR +N
Sbjct: 333 CRKFLN 338
>gi|26006129|dbj|BAC41407.1| mKIAA0304 protein [Mus musculus]
Length = 1744
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 1589 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 1648
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 1649 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 1706
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 1707 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 1744
>gi|160333334|ref|NP_001103749.1| histone-lysine N-methyltransferase MLL [Danio rerio]
gi|158714185|gb|ABW79914.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4218
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 108/168 (64%), Gaps = 6/168 (3%)
Query: 672 SLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVR 731
S R L + +K++ +K+ R + +S IHG G+F + G+MVIEY+G ++R
Sbjct: 4054 SARRATSTDLPLPEKFRQLKKASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIR 4113
Query: 732 PSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG 791
+ D+RE + + G G YMFRIDD V+DAT G+ A INHSCEPNCYSRV++V+G
Sbjct: 4114 SVLTDKREKYYDDK--GIGCYMFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVVNVDG 4171
Query: 792 DEHIIIFAKRDIKQWEELTYDYRFFSIDE---QLACYCGFPRCRGVVN 836
+HI+IFA R I + EELTYDY+ F I+E +L C CG +CR +N
Sbjct: 4172 QKHIVIFATRKIYKGEELTYDYK-FPIEEPGNKLPCNCGAKKCRKFLN 4218
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 40/99 (40%), Gaps = 22/99 (22%)
Query: 387 RPVRVDWKD--LDKCSVCHM-DEEYQNN-LFLQCDKCRMMVHARCYGELEPVNGV----L 438
RP W++ +C CH+ +YQ L+CDKCR H C G P +
Sbjct: 1659 RPHDEQWENWCCRRCRFCHVCGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRV 1718
Query: 439 WLCNLCRPGAPEPPPPCCLCPVVGGAMKPTT--DGRWAH 475
W+C C C C GA KP D +W+H
Sbjct: 1719 WVCTKCV--------RCKSC----GATKPGKAWDAQWSH 1745
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 35/129 (27%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNR-----VSKDRWKLLCSICGVSYGACIQCSNT 527
WAH+ CA+W E E +DG + VS+ + +L C C GA + C T
Sbjct: 2093 WAHVNCALWSAEV-------YEDVDGALKNVHMAVSRGK-QLQCKNCH-KPGATVSCCMT 2143
Query: 528 TCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVD 587
+C YH +CAR C LED+ +C+ HK + + V
Sbjct: 2144 SCTNNYHFMCARQQ-QCAFLEDKK-------------------VYCQHHKDLVKGEV-VP 2182
Query: 588 ERLVQVTRR 596
E +VTRR
Sbjct: 2183 ESSFEVTRR 2191
>gi|380796799|gb|AFE70275.1| histone-lysine N-methyltransferase MLL4, partial [Macaca mulatta]
Length = 396
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 127/246 (51%), Gaps = 43/246 (17%)
Query: 602 PPSNPSGCARSEPY---------NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN 652
PP NP G AR+E Y N+ + R PE G C
Sbjct: 183 PPLNPHGAARAEVYLRKCTFDMFNFLASQHRVLPE-------------------GATCDE 223
Query: 653 GLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFA 712
L S R R L MA +++H+K+T ++ + +S IHG G+F
Sbjct: 224 EEDEVQLRSTR-----------RATSLELPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFC 272
Query: 713 KHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAH 772
K AG+MVIEY+G ++R + D+RE F G G YMFR+DD V+DAT G+ A
Sbjct: 273 KRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGK--GIGCYMFRMDDFDVVDATMHGNAAR 330
Query: 773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPR 830
INHSCEPNC+SRVI V G +HI+IFA R I + EELTYDY+F D +L C CG R
Sbjct: 331 FINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNKLPCNCGAKR 390
Query: 831 CRGVVN 836
CR +N
Sbjct: 391 CRRFLN 396
>gi|332025910|gb|EGI66066.1| Histone-lysine N-methyltransferase trithorax [Acromyrmex echinatior]
Length = 3452
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 84/191 (43%), Positives = 114/191 (59%), Gaps = 19/191 (9%)
Query: 663 RVIGSKFSF--SLHRDAPNFLS---------------MADKYKHMKETFRKRLAFGKSGI 705
R + FS+ S HR PN ++ MA +++ +KET + + S I
Sbjct: 3264 RKVHDMFSWLASQHRPHPNIITISETESRRAVSTNLPMAMRFRILKETSKASVGVYYSHI 3323
Query: 706 HGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDAT 765
HG G+F G+MVIEY GE++R S+ D+RE + Y+S G YMF+IDD V+DAT
Sbjct: 3324 HGRGLFCLRDIEPGEMVIEYAGEVIRSSLTDKREKY-YDSK-NIGCYMFKIDDHLVVDAT 3381
Query: 766 RAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACY 825
G+ A INHSCEPNCYSRV+ + G +HI+IFA R I Q EELTYDY+F D ++ C
Sbjct: 3382 MKGNAARFINHSCEPNCYSRVVDILGKKHILIFALRRIIQGEELTYDYKFPFEDIKIPCT 3441
Query: 826 CGFPRCRGVVN 836
CG +CR +N
Sbjct: 3442 CGSRKCRKYLN 3452
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 44/101 (43%), Gaps = 21/101 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRVSK- 504
CCLC +G + T +GR W H CA+W E E IDG L V
Sbjct: 1349 CCLCKGLGDGHE-TKEGRLLYCGQNEWVHANCALWSNEV-------FEEIDGSLQNVHSA 1400
Query: 505 -DRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
R +L+ CS CG GA I C C +H CAR GL
Sbjct: 1401 ISRGRLIRCSECG-KKGASIGCCAKNCSNTFHFPCARNIGL 1440
>gi|225380774|gb|ACN88688.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4219
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 77/168 (45%), Positives = 108/168 (64%), Gaps = 6/168 (3%)
Query: 672 SLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVR 731
S R L + +K++ +K+ R + +S IHG G+F + G+MVIEY+G ++R
Sbjct: 4055 SARRATSTDLPLPEKFRQLKKASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIR 4114
Query: 732 PSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG 791
+ D+RE + + G G YMFRIDD V+DAT G+ A INHSCEPNCYSRV++V+G
Sbjct: 4115 SVLTDKREKYYDDK--GIGCYMFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVVNVDG 4172
Query: 792 DEHIIIFAKRDIKQWEELTYDYRFFSIDE---QLACYCGFPRCRGVVN 836
+HI+IFA R I + EELTYDY+ F I+E +L C CG +CR +N
Sbjct: 4173 QKHIVIFATRKIYKGEELTYDYK-FPIEEPGNKLPCNCGAKKCRKFLN 4219
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 40/99 (40%), Gaps = 22/99 (22%)
Query: 387 RPVRVDWKD--LDKCSVCHM-DEEYQNN-LFLQCDKCRMMVHARCYGELEPVNGV----L 438
RP W++ +C CH+ +YQ L+CDKCR H C G P +
Sbjct: 1659 RPHDEQWENWCCRRCRFCHVCGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRV 1718
Query: 439 WLCNLCRPGAPEPPPPCCLCPVVGGAMKPTT--DGRWAH 475
W+C C C C GA KP D +W+H
Sbjct: 1719 WVCTKCV--------RCKSC----GATKPGKAWDAQWSH 1745
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 35/129 (27%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNR-----VSKDRWKLLCSICGVSYGACIQCSNT 527
WAH+ CA+W E E +DG + VS+ + +L C C GA + C T
Sbjct: 2093 WAHVNCALWSAEV-------YEDVDGALKNVHMAVSRGK-QLQCKNCH-KPGATVSCCMT 2143
Query: 528 TCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVD 587
+C YH +CAR C LED+ +C+ HK + + V
Sbjct: 2144 SCTNNYHFMCARQQ-QCAFLEDKK-------------------VYCQHHKDLVKGEV-VP 2182
Query: 588 ERLVQVTRR 596
E +VTRR
Sbjct: 2183 ESSFEVTRR 2191
>gi|242010887|ref|XP_002426189.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
gi|212510240|gb|EEB13451.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
Length = 574
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/152 (48%), Positives = 108/152 (71%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KS IH +G+FA P A +MVIEY G++VRP +AD RE
Sbjct: 426 KFNQLK--FRKKQLKFSKSDIHDWGLFAMEPIAADEMVIEYVGQMVRPFLADFREKEYEK 483
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
+G+ +Y+FRID E +IDAT+ G++A INHSC PNCY+++I++ G + I+I++K+DIK
Sbjct: 484 RGIGS-SYLFRIDLETIIDATKCGNLARFINHSCNPNCYAKIITIEGQKKIVIYSKKDIK 542
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F +E++ C CG +C+G +N
Sbjct: 543 VDEEITYDYKFPIEEEKIPCLCGAAQCKGYLN 574
>gi|395846912|ref|XP_003796132.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Otolemur
garnettii]
Length = 2714
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/158 (49%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 2559 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 2618
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 2619 YYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 2676
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 2677 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 2714
>gi|358054536|dbj|GAA99462.1| hypothetical protein E5Q_06161 [Mixia osmundae IAM 14324]
Length = 992
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/279 (34%), Positives = 139/279 (49%), Gaps = 26/279 (9%)
Query: 387 RPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446
+P V + +C++C E +N + CD C + VH CYG G WLC C
Sbjct: 135 KPAAVALAEDSRCAICDDGECENSNAIVFCDGCNLAVHQDCYGVPYIPEG-QWLCRKCT- 192
Query: 447 GAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDR 506
+P+ P C LCP GGA K TT +WAHL CAIWIPET +++V MEP+DG+N + K R
Sbjct: 193 VSPDKPVSCVLCPAEGGAFKQTTANQWAHLLCAIWIPETGISNVVYMEPVDGVNHIPKSR 252
Query: 507 WKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQC 566
WKL C +C GACIQC+N +C A+H CAR L ++L R D+D +++
Sbjct: 253 WKLQCYLCKRRVGACIQCANRSCYTAFHVTCAREYNLYLKL----RPVSAQADDDSKNE- 307
Query: 567 IRLLSFCKKHKQPLNDRLAVDER----LVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRG 622
++C +H Q + +DE V R +PPS R + +R
Sbjct: 308 ----AYCHRHHQ-VGSASEIDEESQDSFVASASRGSKLVPPS----LKRKRDESPVLQRR 358
Query: 623 RKEPEALAAASLKRLFVENQ---PYLVGGYCQNGLSGNT 658
+ P AAA + R + ++ P LV Y ++ T
Sbjct: 359 KANP---AAAKISRAYKKSYSQGPPLVPNYIHGRVAAYT 394
>gi|194380712|dbj|BAG58509.1| unnamed protein product [Homo sapiens]
Length = 323
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 168 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 227
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 228 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 285
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 286 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 323
>gi|330797279|ref|XP_003286689.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
gi|325083363|gb|EGC36818.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
Length = 1340
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 68/142 (47%), Positives = 99/142 (69%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
RK++ F +S IH +G+FA A DMVIEY GE++R +AD RE +G+ +Y+F
Sbjct: 1200 RKKIKFQRSDIHDWGLFAMETINAKDMVIEYIGEVIRQKVADEREKRYIKKGIGS-SYLF 1258
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+DD+ +IDAT G++A INH C+PNC ++V+++N + III+AKRDI EE+TYDY+
Sbjct: 1259 RVDDDTIIDATLKGNLARFINHCCDPNCIAKVLTINNQKKIIIYAKRDINIGEEITYDYK 1318
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F DE++ C C P+CRG +N
Sbjct: 1319 FPIEDEKIPCLCKSPKCRGTLN 1340
>gi|218202612|gb|EEC85039.1| hypothetical protein OsI_32352 [Oryza sativa Indica Group]
Length = 1741
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 171/372 (45%), Gaps = 54/372 (14%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+W H+ C +W P T + M D ++ S + CS+C + G+ + C + C V
Sbjct: 1417 QWVHVVCGLWTPGTKCPNTITMSAFD-ISGASPAKRNTECSMCNRTGGSFMGCRDVNCSV 1475
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLV 591
+HP CA GL L S E + ++ + C H +D V
Sbjct: 1476 LFHPWCAHQRGL-----------LQSEPEGEHNENVGFYGRCLDHAM-------LDPNHV 1517
Query: 592 QVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 651
+ C SN CAR+E + RGRK A S K P G C
Sbjct: 1518 NPKKECLR----SNDWTCARTEVF-----RGRKGDSFGANRSRK-------PEEKFGEC- 1560
Query: 652 NGLSGNTLPS-IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGI 710
+S + + IR+ GSK S R ++ YK +K K L KS IHG G+
Sbjct: 1561 -SVSQEQINAWIRINGSK---SCMRGQKEYV----HYKQLKGW--KHLVVYKSSIHGLGL 1610
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS----LVGAGTYMFRIDDERVIDATR 766
+ G MV++Y GE+V +AD+RE Y S + Y F+I E +IDATR
Sbjct: 1611 YTSEFIPRGSMVVQYVGEIVGQCVADKRE-IEYQSGKRQQYKSACYFFKIGKEHIIDATR 1669
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLAC 824
G IA INHSC+PNC ++VISV ++ ++ FA+R I EE+TYDY F DE ++ C
Sbjct: 1670 KGGIARFINHSCQPNCVAKVISVRNEKKVVFFAERHINPGEEITYDYHFNREDEGQRIPC 1729
Query: 825 YCGFPRCRGVVN 836
+C CR +N
Sbjct: 1730 FCRSRGCRRYLN 1741
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 32/67 (47%), Gaps = 3/67 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + +N ++C KC + VH CYG L+ G W C C+ + C LC
Sbjct: 1255 CCVCGISNLEPSNQLIECSKCFIKVHQACYGVLKVPRG-QWFCKPCKINTQD--TVCVLC 1311
Query: 459 PVVGGAM 465
GGAM
Sbjct: 1312 GYGGGAM 1318
>gi|328875054|gb|EGG23419.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1359
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 131/234 (55%), Gaps = 27/234 (11%)
Query: 603 PSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
P NPSGCARSEPY R + A+ A+ K V G +G+ + S
Sbjct: 1153 PVNPSGCARSEPYK---RMDLDKKIAMRLATTK----------VRGTSGDGVDSHIGKS- 1198
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
S++ L +D + + +K++ F +S IH +G+FA P A DMV
Sbjct: 1199 SGRSSRYESRLGQDV------------VMTSRKKKIKFERSLIHDWGLFALEPIYARDMV 1246
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +AD RE +G+ +Y+FRIDD+ +IDAT G+ A INH C+PNC
Sbjct: 1247 IEYIGEVIRQKVADEREKRYTKQGIGS-SYLFRIDDDTIIDATFKGNQARFINHCCDPNC 1305
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
++VI++ G + III+AKRDI EELTYDY+F D ++ C C +CRG +N
Sbjct: 1306 MAKVITMGGQKKIIIYAKRDINVGEELTYDYKFPIEDVKIPCLCKSAKCRGTLN 1359
>gi|222642073|gb|EEE70205.1| hypothetical protein OsJ_30300 [Oryza sativa Japonica Group]
Length = 1792
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 171/372 (45%), Gaps = 54/372 (14%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+W H+ C +W P T + M D ++ S + CS+C + G+ + C + C V
Sbjct: 1468 QWVHVVCGLWTPGTKCPNTITMSAFD-ISGASPAKRNTECSMCNRTGGSFMGCRDVNCSV 1526
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLV 591
+HP CA GL L S E + ++ + C H +D V
Sbjct: 1527 LFHPWCAHQRGL-----------LQSEPEGEHNENVGFYGRCLDHAM-------LDPNHV 1568
Query: 592 QVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 651
+ C SN CAR+E + RGRK A S K P G C
Sbjct: 1569 NPKKECLR----SNDWTCARTEVF-----RGRKGDSFGANRSRK-------PEEKFGEC- 1611
Query: 652 NGLSGNTLPS-IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGI 710
+S + + IR+ GSK S R ++ YK +K K L KS IHG G+
Sbjct: 1612 -SVSQEQINAWIRINGSK---SCMRGQKEYV----HYKQLKGW--KHLVVYKSSIHGLGL 1661
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS----LVGAGTYMFRIDDERVIDATR 766
+ G MV++Y GE+V +AD+RE Y S + Y F+I E +IDATR
Sbjct: 1662 YTSEFIPRGSMVVQYVGEIVGQCVADKRE-IEYQSGKRQQYKSACYFFKIGKEHIIDATR 1720
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLAC 824
G IA INHSC+PNC +++ISV ++ ++ FA+R I EE+TYDY F DE ++ C
Sbjct: 1721 KGGIARFINHSCQPNCVAKIISVRNEKKVVFFAERHINPGEEITYDYHFNREDEGQRIPC 1780
Query: 825 YCGFPRCRGVVN 836
+C CR +N
Sbjct: 1781 FCRSRGCRRYLN 1792
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 32/67 (47%), Gaps = 3/67 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + +N ++C KC + VH CYG L+ G W C C+ + C LC
Sbjct: 1306 CCVCGISNLEPSNQLIECSKCFIKVHQACYGVLKVPRG-QWFCKPCKINTQD--TVCVLC 1362
Query: 459 PVVGGAM 465
GGAM
Sbjct: 1363 GYGGGAM 1369
>gi|349604316|gb|AEP99904.1| Histone-lysine N-methyltransferase HRX-like protein, partial [Equus
caballus]
Length = 297
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 142 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 201
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 202 Y-YDS-KGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 259
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 260 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 297
>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
Length = 4387
Score = 154 bits (389), Expect = 2e-34, Method: Composition-based stats.
Identities = 76/160 (47%), Positives = 104/160 (65%), Gaps = 5/160 (3%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L A +++ +KET + + +S IHG G+F K GDMVIEY GE++R +AD+RE
Sbjct: 4229 LPPAMRFRQLKETSKASVGVYRSHIHGRGLFCKRDIEEGDMVIEYAGEVIRAVLADQREK 4288
Query: 741 FIYNSLVG----AGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHII 796
Y ++ G G YMFRIDD V+DAT G+ A INHSC+PNCYSRV+ ++G +HI+
Sbjct: 4289 -KYEAMSGRRGVGGCYMFRIDDNLVVDATLKGNAARFINHSCDPNCYSRVVDIHGHKHIL 4347
Query: 797 IFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
IFA R I EELTYDY+F + ++ C CG +CR +N
Sbjct: 4348 IFALRRITIGEELTYDYKFPFEEVKIPCTCGAKKCRKYLN 4387
Score = 44.7 bits (104), Expect = 0.23, Method: Composition-based stats.
Identities = 36/108 (33%), Positives = 45/108 (41%), Gaps = 22/108 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRVSK- 504
C LC G P +GR W H CA+W E E IDG L V
Sbjct: 1386 CVLCKTCGDG-SPAMEGRLLYCGQNDWIHANCALWSAEV-------FEEIDGSLQNVHSA 1437
Query: 505 -DRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDE 550
R K++ C+ C V GA + C C YH CAR A C ++D+
Sbjct: 1438 ISRGKMIKCAECEVK-GASVGCCAKNCSETYHYACARKA-TCAFMDDK 1483
>gi|348526824|ref|XP_003450919.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oreochromis
niloticus]
Length = 4517
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/164 (48%), Positives = 105/164 (64%), Gaps = 4/164 (2%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R A L +A ++K +K T ++ + +S IHG G+F K AG+MVIEY+G ++R +
Sbjct: 4356 RRASMELPLAVRFKQLKATSKETVGVYRSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVL 4415
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE + G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+V+G +H
Sbjct: 4416 TDKREKYYDGK--GIGCYMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVITVDGQKH 4473
Query: 795 IIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
I+IFA R I EELTYDY+F D +L C C +CR +N
Sbjct: 4474 IVIFASRRIYCGEELTYDYKFPIEDASNKLPCNCSAKKCRKFLN 4517
>gi|198454568|ref|XP_002137902.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
gi|198132853|gb|EDY68460.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
Length = 1755
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 91/239 (38%), Positives = 139/239 (58%), Gaps = 16/239 (6%)
Query: 607 SGCARSEPYNYFGRRGR-KEPEALAAASLKRLFVEN---QPYLVGGYCQNGLSGNTLPSI 662
+GCAR+E Y R + K A A++ E+ +P + + N L + +
Sbjct: 1524 TGCARTEGYYKLDVREKAKHKYHHAKANIDDAENEDRCDEPTALTNHHHNKL----ISKM 1579
Query: 663 RVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
+ I + + R F SM + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 1580 QGISREARSNQRRLLTAFGSMGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIA 1637
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A INHS
Sbjct: 1638 ADEMVIEYVGQMIRPVVADLRE-TKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1696
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY++VI++ ++ I+I++K+ I EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1697 CNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1755
>gi|432892259|ref|XP_004075732.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oryzias
latipes]
Length = 4536
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/164 (48%), Positives = 105/164 (64%), Gaps = 4/164 (2%)
Query: 675 RDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
R A L +A ++K +K ++ + +S IHG G+F K AG+MVIEY+G ++R +
Sbjct: 4375 RRASMELPLAVRFKQLKAMSKETVGVYRSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVL 4434
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
D+RE Y G G YMFRIDD V+DAT G+ A INHSCEPNCYSRV++V+G +H
Sbjct: 4435 TDKREK--YYDAKGIGCYMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVLTVDGQKH 4492
Query: 795 IIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
I+IFA R I EELTYDY+F D +L C CG +CR +N
Sbjct: 4493 IVIFASRRICCGEELTYDYKFPIEDASNKLPCNCGTKKCRKFLN 4536
>gi|195356446|ref|XP_002044683.1| GM18767 [Drosophila sechellia]
gi|194133849|gb|EDW55365.1| GM18767 [Drosophila sechellia]
Length = 1637
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 91/239 (38%), Positives = 139/239 (58%), Gaps = 16/239 (6%)
Query: 607 SGCARSEP-YNYFGRRGRKEPEALAAASLKRLFVEN---QPYLVGGYCQNGLSGNTLPSI 662
+GCAR+E Y R K A A+ + F E+ +P + + N L + +
Sbjct: 1406 TGCARTEGFYKLDVREKAKHKYHHAKANTEDSFNEDRSDEPTALTNHHHNKL----ISKM 1461
Query: 663 RVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
+ I + + R F SM + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 1462 QGISREARSNQRRLLTAFGSMGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIA 1519
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A INHS
Sbjct: 1520 ADEMVIEYVGQMIRPVVADLRE-TKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1578
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY++VI++ ++ I+I++K+ I EE+TYDY+F +E++ C CG CRG +N
Sbjct: 1579 CNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1637
>gi|157127309|ref|XP_001654916.1| hypothetical protein AaeL_AAEL010807 [Aedes aegypti]
gi|108872954|gb|EAT37179.1| AAEL010807-PA [Aedes aegypti]
Length = 1670
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 74/152 (48%), Positives = 106/152 (69%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KS IH +G+FA P A +MVIEY G++VRPS+AD RE Y
Sbjct: 1522 KFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMVRPSVADLRE-TKYE 1578
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
++ +Y+FRID E +IDAT+ G++A INHSC PNCY++VI++ ++ I+I++K+ I
Sbjct: 1579 AIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQAIG 1638
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1639 INEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1670
>gi|405967140|gb|EKC32340.1| Histone-lysine N-methyltransferase SETD1B-A [Crassostrea gigas]
Length = 1401
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 74/162 (45%), Positives = 105/162 (64%), Gaps = 9/162 (5%)
Query: 683 MADKYKHMKET--------FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
M Y H+ E+ +K+L F +SGIH +G+FA P A +MVIEY GE +R S+
Sbjct: 1241 MKKTYDHLLESDDPMGYWLRKKQLKFARSGIHDWGLFALEPIAADEMVIEYVGETLRQSL 1300
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
AD RE Y + +Y+FR+D E +IDAT+ G++A INH C PNCY+++I+V +
Sbjct: 1301 ADLREK-KYEAEGCGSSYLFRVDTETIIDATKCGNLARFINHCCNPNCYAKIITVESQKK 1359
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I+I++KRDI EE+TYDY+F DE++ C CG P CRG +N
Sbjct: 1360 IVIYSKRDIDVNEEITYDYKFPIEDEKIPCLCGAPNCRGTLN 1401
>gi|223365716|pdb|2W5Y|A Chain A, Binary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
Domain With The Cofactor Product S-Adenosylhomocysteine.
gi|223365717|pdb|2W5Z|A Chain A, Ternary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
Domain With The Cofactor Product S-Adenosylhomocysteine
And Histone Peptide
Length = 192
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 37 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 96
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 97 Y-YDSK-GIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 154
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 155 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 192
>gi|196014878|ref|XP_002117297.1| hypothetical protein TRIADDRAFT_32357 [Trichoplax adhaerens]
gi|190580050|gb|EDV20136.1| hypothetical protein TRIADDRAFT_32357 [Trichoplax adhaerens]
Length = 217
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 112/168 (66%), Gaps = 6/168 (3%)
Query: 669 FSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGE 728
SF DA + L + + +T +K+L FG+S IH +G+FA+ P A +MVIEY G+
Sbjct: 56 ISFFQASDAGDLLKL-----NQLKTRKKQLRFGRSQIHEWGLFAREPIAADEMVIEYVGQ 110
Query: 729 LVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVIS 788
+R ++AD REH Y + +Y+FRIDD +IDAT+ G++A INHSC PNCY+++IS
Sbjct: 111 TIRQTVADEREHR-YEKIGIGSSYLFRIDDNYIIDATKCGNLARFINHSCSPNCYAKIIS 169
Query: 789 VNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+ + I+I++K DI+ EE+TYDY+F D ++ C+CG +CRG +N
Sbjct: 170 LESQKKIVIYSKYDIQVNEEITYDYKFPIEDVKIPCHCGALQCRGALN 217
>gi|195156904|ref|XP_002019336.1| GL12290 [Drosophila persimilis]
gi|194115927|gb|EDW37970.1| GL12290 [Drosophila persimilis]
Length = 1548
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/239 (38%), Positives = 139/239 (58%), Gaps = 16/239 (6%)
Query: 607 SGCARSEPYNYFGRRGR-KEPEALAAASLKRLFVEN---QPYLVGGYCQNGLSGNTLPSI 662
+GCAR+E Y R + K A A++ E+ +P + + N L + +
Sbjct: 1317 TGCARTEGYYKLDVREKAKHKYHHAKANIDDAENEDRCDEPTALTNHHHNKL----ISKM 1372
Query: 663 RVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
+ I + + R F SM + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 1373 QGISREARSNQRRLLTAFGSMGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIA 1430
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A INHS
Sbjct: 1431 ADEMVIEYVGQMIRPVVADLRE-TKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1489
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY++VI++ ++ I+I++K+ I EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1490 CNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1548
>gi|357631650|gb|EHJ79119.1| hypothetical protein KGM_15585 [Danaus plexippus]
Length = 1491
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/152 (47%), Positives = 108/152 (71%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KSGIH +G+FA+ A +MVIEY G++VRP +AD RE
Sbjct: 1343 KFNQLK--FRKKQLKFAKSGIHDWGLFAQEAIAADEMVIEYVGQMVRPIVADVREAHYEA 1400
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
+ +G+ +Y+FRID + +IDAT+ G++A INHSC PNCY+++I++ + I+I++K+ I
Sbjct: 1401 TGIGS-SYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAKIITIESQKKIVIYSKQPIG 1459
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F DE++ C CG P+CRG +N
Sbjct: 1460 VDEEITYDYKFPLEDEKIPCLCGAPQCRGYLN 1491
>gi|226500304|ref|NP_001141695.1| uncharacterized protein LOC100273824 [Zea mays]
gi|194705578|gb|ACF86873.1| unknown [Zea mays]
Length = 418
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 114/372 (30%), Positives = 173/372 (46%), Gaps = 54/372 (14%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+W H+ C +W P T + M D ++ S + CSIC + G+ ++C C V
Sbjct: 94 QWVHMVCGLWTPGTKCPNDTTMSAFD-VSGASPAKRNTACSICDRTGGSFVKCRAVNCLV 152
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLV 591
+H CA GL L S E + ++ I C H + + +
Sbjct: 153 FFHAWCAHQRGL-----------LQSEPEGEHNENIGFYGRCVSHAIVFSSHVNPKKEYF 201
Query: 592 QVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 651
+ +N CAR+E + GR+G E + +S K+ + +
Sbjct: 202 R-----------NNNWTCARTEGFK--GRKG----EGFSDSSHKKYEEYSGEF------- 237
Query: 652 NGLSGNTLPS-IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGI 710
G+S + + +R+ GSK R ++ YK +K K L KSGIHG G+
Sbjct: 238 -GVSQEQINAWLRINGSK---PCGRGQKEYI----HYKQLKGW--KHLVVYKSGIHGLGL 287
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS----LVGAGTYMFRIDDERVIDATR 766
+ G MV+EY GE+V +ADRRE Y S + Y F+ID E +IDATR
Sbjct: 288 YTSVFIPRGSMVVEYVGEIVGQRVADRRE-IEYQSGKRQQYKSACYFFKIDREHIIDATR 346
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLAC 824
G IA +NHSC+PNC +++ISV ++ ++ FA+R I EE+TYDY F DE ++ C
Sbjct: 347 KGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYDYHFNREDEGQRILC 406
Query: 825 YCGFPRCRGVVN 836
+C CR +N
Sbjct: 407 FCRSRYCRRYLN 418
>gi|340373417|ref|XP_003385238.1| PREDICTED: hypothetical protein LOC100636150 [Amphimedon
queenslandica]
Length = 1053
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 71/168 (42%), Positives = 110/168 (65%), Gaps = 6/168 (3%)
Query: 674 HRDAPNFLSMADKYKHMKE-----TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGE 728
HR + + D + +++ +K+L F KS IH +G+FA A +MV+EY G+
Sbjct: 887 HRRVASLFTSGDVSRELQQYNQLKARKKQLTFAKSTIHNWGLFALETIPADEMVVEYIGQ 946
Query: 729 LVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVIS 788
+VR IAD RE +G+ +Y+FR+D + VIDAT++G+ A INH C+PNCY+++I+
Sbjct: 947 VVRHGIADERERRYEAQGIGS-SYLFRVDYDHVIDATKSGNFARFINHCCDPNCYAKIIT 1005
Query: 789 VNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
V + I+I++KRDI+ EE+TYDY+F DE++ C CG P+CRG +N
Sbjct: 1006 VGNQKKIVIYSKRDIRAGEEITYDYKFPIEDEKIPCLCGAPQCRGTLN 1053
>gi|326430870|gb|EGD76440.1| mixed-lineage leukemia protein [Salpingoeca sp. ATCC 50818]
Length = 2027
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 107/159 (67%), Gaps = 3/159 (1%)
Query: 679 NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
+ LS A +Y+ KE R+ + G S IHGFG++A+ G+M++EY GE++RP + D+R
Sbjct: 1871 SHLSDAMQYRQCKERMRETVRVGYSKIHGFGLYAQRDISGGEMIVEYVGEVIRPELTDKR 1930
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E F Y++ G+YMFRIDD++V+DAT G A +NHSC+PNC SR+IS + + I+I
Sbjct: 1931 EQF-YDAR-NMGSYMFRIDDKQVVDATLTGGQARFVNHSCDPNCISRIISTDRGKKIVIV 1988
Query: 799 AKRDIKQWEELTYDYRF-FSIDEQLACYCGFPRCRGVVN 836
AK+ I + +ELTYDY+F D ++ C CG CRG +N
Sbjct: 1989 AKQHICKGDELTYDYQFPLDDDNKVRCSCGAANCRGFMN 2027
Score = 40.0 bits (92), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 11/90 (12%)
Query: 455 CCLC-------PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRW 507
CCLC P GG + P W H+ CA+W E ++ +D R K
Sbjct: 1363 CCLCHQHGDLDPKDGGRLLPLEVDGWVHVNCALWSSEVFEDTDGTLQLLDSAIRRGK--- 1419
Query: 508 KLLCSICGVSYGACIQCSNTTCRVAYHPLC 537
++ C+ C + GA I C +++C+ +H C
Sbjct: 1420 RMRCTTCDHT-GATIGCCSSSCKGNFHFGC 1448
>gi|156393989|ref|XP_001636609.1| predicted protein [Nematostella vectensis]
gi|156223714|gb|EDO44546.1| predicted protein [Nematostella vectensis]
Length = 213
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 71/142 (50%), Positives = 101/142 (71%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L F KS IH +G+FA P A +MVIEY GE++R +IAD RE +G+ +YMF
Sbjct: 73 KKQLKFAKSSIHDWGLFALEPIAADEMVIEYVGEVIRQAIADYRERCYEERGIGS-SYMF 131
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ +IDAT G+ A INH C+PNCY++VI+V + I+I++KRDI+ EE+TYDY+
Sbjct: 132 RLDETTIIDATTMGNFARFINHCCDPNCYAKVIAVENMKKIVIYSKRDIQVDEEITYDYK 191
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F DE++ C CG P+CRG +N
Sbjct: 192 FPIEDEKIPCLCGAPQCRGTLN 213
>gi|121483959|gb|ABM54292.1| MLL [Pan paniscus]
Length = 162
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 107/158 (67%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M +++H+K+T ++ + +S IHG G+F K AG+MVIEY G ++R D+RE
Sbjct: 7 LPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREK 66
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+ Y+S G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA
Sbjct: 67 Y-YDSK-GIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAM 124
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG +CR +N
Sbjct: 125 RKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN 162
>gi|115480613|ref|NP_001063900.1| Os09g0556700 [Oryza sativa Japonica Group]
gi|113632133|dbj|BAF25814.1| Os09g0556700 [Oryza sativa Japonica Group]
Length = 424
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 171/372 (45%), Gaps = 54/372 (14%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+W H+ C +W P T + M D ++ S + CS+C + G+ + C + C V
Sbjct: 100 QWVHVVCGLWTPGTKCPNTITMSAFD-ISGASPAKRNTECSMCNRTGGSFMGCRDVNCSV 158
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLV 591
+HP CA GL L S E + ++ + C H +D V
Sbjct: 159 LFHPWCAHQRGL-----------LQSEPEGEHNENVGFYGRCLDH-------AMLDPNHV 200
Query: 592 QVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 651
+ C SN CAR+E + RGRK A S K P G C
Sbjct: 201 NPKKECLR----SNDWTCARTEVF-----RGRKGDSFGANRSRK-------PEEKFGECS 244
Query: 652 NGLSGNTLPS-IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGI 710
+S + + IR+ GSK S R ++ YK +K K L KS IHG G+
Sbjct: 245 --VSQEQINAWIRINGSK---SCMRGQKEYV----HYKQLKGW--KHLVVYKSSIHGLGL 293
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS----LVGAGTYMFRIDDERVIDATR 766
+ G MV++Y GE+V +AD+RE Y S + Y F+I E +IDATR
Sbjct: 294 YTSEFIPRGSMVVQYVGEIVGQCVADKRE-IEYQSGKRQQYKSACYFFKIGKEHIIDATR 352
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLAC 824
G IA INHSC+PNC +++ISV ++ ++ FA+R I EE+TYDY F DE ++ C
Sbjct: 353 KGGIARFINHSCQPNCVAKIISVRNEKKVVFFAERHINPGEEITYDYHFNREDEGQRIPC 412
Query: 825 YCGFPRCRGVVN 836
+C CR +N
Sbjct: 413 FCRSRGCRRYLN 424
>gi|320168697|gb|EFW45596.1| Setd1a protein [Capsaspora owczarzaki ATCC 30864]
Length = 1312
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 68/147 (46%), Positives = 104/147 (70%), Gaps = 1/147 (0%)
Query: 690 MKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGA 749
M ++ +K L F +SGIH FG+F++ A D+VIEY GE++R SI+D REH +G+
Sbjct: 1167 MLKSRKKLLKFQRSGIHAFGLFSQENISANDLVIEYVGEVIRQSISDIREHHYERRGIGS 1226
Query: 750 GTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEEL 809
+Y FRID++ V+DAT G++A +NH CEPNCY+++I V+G + I+I++KRDIK+ EE+
Sbjct: 1227 -SYFFRIDEDHVVDATYKGNLARFMNHCCEPNCYAKIIMVDGHQRIVIYSKRDIKKGEEI 1285
Query: 810 TYDYRFFSIDEQLACYCGFPRCRGVVN 836
TYDY+F + ++ C CG C+ +N
Sbjct: 1286 TYDYKFPYEENKIPCLCGAVNCKKFLN 1312
>gi|189238620|ref|XP_969339.2| PREDICTED: similar to CG40351 CG40351-PC [Tribolium castaneum]
gi|270009170|gb|EFA05618.1| hypothetical protein TcasGA2_TC015826 [Tribolium castaneum]
Length = 1268
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 129/230 (56%), Gaps = 14/230 (6%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCAR+E Y + + + A S + +N P GLS + R +
Sbjct: 1053 TGCARTEGYYKIAAHEKAKYKYHHAKS-HAIVSQNAPV----SKMQGLSREARSNQRRLL 1107
Query: 667 SKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYT 726
+ F D F + + KH+K F KS IH +G+FA P A +MVIEY
Sbjct: 1108 AAFGGDTDSDLLKFNQLKFRKKHLK--------FAKSAIHDWGLFAMEPIAADEMVIEYV 1159
Query: 727 GELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRV 786
G++VR S+AD RE + +G+ +Y+FRID E +IDAT+ G++A INHSC PNCY++V
Sbjct: 1160 GQMVRHSVADLRERKYEATGIGS-SYLFRIDLENIIDATKCGNLARFINHSCNPNCYAKV 1218
Query: 787 ISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I++ + I+I++K+ I EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1219 ITIESQKKIVIYSKQSIGVNEEITYDYKFPIEDEKIPCLCGAATCRGTLN 1268
>gi|299117155|emb|CBN75119.1| histone methyltransferase [Ectocarpus siliculosus]
Length = 2067
Score = 153 bits (386), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 68/148 (45%), Positives = 106/148 (71%), Gaps = 4/148 (2%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
++ RK+L FG+S +H +G+FA P AGD+VIEY GE++ ++AD+RE + +G+
Sbjct: 1921 QSRRKKLRFGRSSVHAWGVFADEPIAAGDLVIEYRGEIIGNAVADKREKQYEDMQIGS-D 1979
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
YMFR+D++ V+DAT GS+A INHSC+P+C +R+I+V G + I+I+A+RD+ EEL+Y
Sbjct: 1980 YMFRVDEDTVVDATFKGSLARFINHSCDPSCTTRIITVEGSKKIVIYAERDVAMGEELSY 2039
Query: 812 DYRFFSIDEQLA---CYCGFPRCRGVVN 836
DY+F ++ A C+CG +CRG +N
Sbjct: 2040 DYKFPPEPDEAARVPCHCGSEKCRGFIN 2067
>gi|211828317|gb|AAH07353.3| MLL4 protein [Homo sapiens]
Length = 172
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 17 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 76
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 77 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 134
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 135 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 172
>gi|443723097|gb|ELU11678.1| hypothetical protein CAPTEDRAFT_179499 [Capitella teleta]
Length = 323
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 126/238 (52%), Gaps = 22/238 (9%)
Query: 599 DYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNT 658
D P NPSG ARSE G RK E + +L+ Y +
Sbjct: 108 DIEPDINPSGSARSE-----GFSSRKPFEMFS-------------FLLSKY--RPFPKTS 147
Query: 659 LPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRA 718
R++ S R L MA +++++KE R + +S IHG G+F+
Sbjct: 148 AKVDRLLPQLGSSRYERMPSQDLPMAMRFRYLKEDARDNVVVYRSLIHGRGLFSIRDFDV 207
Query: 719 GDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 778
G+M+IEY G ++R S+ D RE Y G G YMFRIDD VIDAT G+ A INHSC
Sbjct: 208 GEMIIEYAGMVIRSSLTDSREK--YYETKGIGCYMFRIDDHFVIDATVNGNAARFINHSC 265
Query: 779 EPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+PNC+SRVI V+G +HIIIFA + +K EELTYDY+F D ++ C C RCR +N
Sbjct: 266 DPNCFSRVIDVSGVKHIIIFALKAVKAGEELTYDYKFPYEDVKIPCTCQAKRCRKYLN 323
>gi|255539394|ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
gi|223551463|gb|EEF52949.1| set domain protein, putative [Ricinus communis]
Length = 1258
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 89/233 (38%), Positives = 127/233 (54%), Gaps = 13/233 (5%)
Query: 608 GCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGS 667
GCARS + + A ++ + + Y V + LS + S R
Sbjct: 1035 GCARSSITGWEWHKWSHSASPADRARVRGIHCLHANYSVSEAYTSQLSNGKVLSARTNRV 1094
Query: 668 KFSFSLHRDAPNFLSMADKYKHMKET----FRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 723
K N L+ A+ +K T +KRL F +S IH +G+ A P A D VI
Sbjct: 1095 KMR--------NLLAAAEGADLLKATQLKARKKRLRFQQSKIHDWGLVALEPIEAEDFVI 1146
Query: 724 EYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 783
EY GEL+RP I+D RE +Y + +Y+FR+DD V+DAT+ G +A INHSCEPNCY
Sbjct: 1147 EYVGELIRPRISDIRER-LYEKMGIGSSYLFRLDDGYVVDATKRGGVARFINHSCEPNCY 1205
Query: 784 SRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
++VISV G + I I+AKR I EE+TY+Y+F ++++ C CG +CRG +N
Sbjct: 1206 TKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKKIPCNCGSRKCRGSLN 1258
>gi|148692059|gb|EDL24006.1| WW domain binding protein 7 [Mus musculus]
Length = 209
Score = 152 bits (385), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 105/158 (66%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 54 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 113
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 114 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 171
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR +N
Sbjct: 172 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFLN 209
>gi|15217143|gb|AAK92531.1|AF401284_1 trithorax 3 [Arabidopsis thaliana]
Length = 330
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 164/354 (46%), Gaps = 54/354 (15%)
Query: 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCC 456
++C+VC E+++ N + C++C++ VH CYG + + W+C C P+ CC
Sbjct: 6 ERCAVCRWVEDWEENKMIICNRCQVAVHQECYGVSKSQDFTSWVCRACE--TPDIERDCC 63
Query: 457 LCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICG 515
LCPV GGA+KP+ +G W H+ CA + PE + + MEP GL ++ + + +C+IC
Sbjct: 64 LCPVKGGALKPSDVEGLWVHVTCAWFRPEVGFLNHENMEPAVGLFKIPANSFLKVCTICK 123
Query: 516 VSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKK 575
++G+C+ C C +H +CA AG +EL E + Q R +C
Sbjct: 124 QTHGSCVHCCK--CATHFHAMCASRAGYNMELH---------CLEKNGVQRTRKSVYCSF 172
Query: 576 HKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGR---------RGRKEP 626
H++P D + V PS G +R+ N +GR + K P
Sbjct: 173 HRKPDPDSVVVVH-------------TPSGVFG-SRNLLQNQYGRAKGSRLVLTKKMKLP 218
Query: 627 --EALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRD-------- 676
+ A R+F L C+ NT + I + H
Sbjct: 219 GFQTQTQAEQSRVF----DSLSAARCRIYSRSNTKIDLEAISHRLKGPSHHSLSASENLN 274
Query: 677 ---APNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTG 727
A +F S ++ KH++ T R+ FGKSGIHG+G+FA+ + G+M+IEY G
Sbjct: 275 SFKATDFTSFRERLKHLQRTENFRVCFGKSGIHGWGLFARKSIQEGEMIIEYRG 328
>gi|390351134|ref|XP_003727587.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Strongylocentrotus purpuratus]
Length = 282
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 108/155 (69%), Gaps = 2/155 (1%)
Query: 683 MADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
+++ +K + FRK+ + F KS IHG+G++A P A +MVIEY GE VR SIAD RE
Sbjct: 129 VSNLFKFNQLQFRKKDIKFCKSSIHGWGLYAMEPIAADEMVIEYVGESVRQSIADSREK- 187
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + +Y+FRID +IDAT++G++A INHSC PNCY+++I+V ++ I+I++K+
Sbjct: 188 AYERMGIGSSYLFRIDAVTIIDATKSGNLARFINHSCNPNCYAKIITVESEKKIVIYSKQ 247
Query: 802 DIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I +E+TYDY+F DE+++C CG +CRG +N
Sbjct: 248 TINVGDEITYDYKFPIEDEKISCLCGAAQCRGTLN 282
>gi|242045554|ref|XP_002460648.1| hypothetical protein SORBIDRAFT_02g032470 [Sorghum bicolor]
gi|241924025|gb|EER97169.1| hypothetical protein SORBIDRAFT_02g032470 [Sorghum bicolor]
Length = 1658
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 170/371 (45%), Gaps = 52/371 (14%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+W H+ C +W P T + M D ++ S + CSIC + G+ I+C + C V
Sbjct: 1334 QWVHMVCGLWTPGTKCPNATTMSAFD-VSGASPAKRNTACSICDRTGGSFIKCRDVNCSV 1392
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLV 591
+H CA GL L S E + ++ + C H + + + +
Sbjct: 1393 LFHAWCAHQRGL-----------LQSEPEGEHNENVGFYGRCMDHTIDFSSHVDPKKEYL 1441
Query: 592 QVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 651
+ SN CAR+E + GR+G E +S K+ + + V
Sbjct: 1442 R-----------SNNWTCARTEGFK--GRKG----EGFFDSSRKKYEEYSGEFSVSQEQI 1484
Query: 652 NGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIF 711
N +R+ GSK R ++ YK +K L KSGIHG G++
Sbjct: 1485 NAW-------LRINGSK---PCGRGQKEYI----HYKQLKGW--NHLVVYKSGIHGLGLY 1528
Query: 712 AKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS----LVGAGTYMFRIDDERVIDATRA 767
G MV+EY GE+V +AD+RE Y S + Y F+ID E ++DATR
Sbjct: 1529 TSVFIPRGSMVVEYVGEIVGQRVADKRE-IEYQSGKRQQYKSACYFFKIDREHIVDATRK 1587
Query: 768 GSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACY 825
G IA +NHSC+PNC +++IS+ ++ ++ FA+R I EE+TYDY F DE ++ C+
Sbjct: 1588 GGIARFVNHSCQPNCVAKIISIRNEKKVMFFAERHINPGEEITYDYHFNREDEGQRIPCF 1647
Query: 826 CGFPRCRGVVN 836
C CR +N
Sbjct: 1648 CRSRYCRRYLN 1658
Score = 42.7 bits (99), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 32/67 (47%), Gaps = 3/67 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N ++C KC + VH CYG L+ G W C C+ A C LC
Sbjct: 1192 CCVCAISDLEPCNRLIECSKCYIKVHQACYGVLKVPRG-QWFCRPCK--ANTMNTVCVLC 1248
Query: 459 PVVGGAM 465
GGAM
Sbjct: 1249 GYGGGAM 1255
>gi|331214149|ref|XP_003319756.1| Setd1a protein [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 1014
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 131/218 (60%), Gaps = 9/218 (4%)
Query: 626 PEALAAASLKRLFVENQPYLVGGYCQNG----LSGNTLPSIRVIGSKFSFSLHRD-APNF 680
P +A +L R N + QN LS P+ + S + + AP+
Sbjct: 799 PTQFSALALSRSTRVNSRRFILNMEQNKKASILSSGNNPANDSLDSSLQLQANPNPAPSL 858
Query: 681 LSMAD--KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
S+AD K+ ++ T +K+L F +S IH +G++A AG+MVIEY GE++R ++ADRR
Sbjct: 859 DSVADVLKFNQLR-TRKKQLKFSRSPIHDWGLYAMETIPAGEMVIEYVGEVIRQAVADRR 917
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E +Y + +Y+FR+DD+ V+DAT+ G++ LINH C PNC +++I++NG++ I+I+
Sbjct: 918 EK-LYERMGIGSSYLFRVDDDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIY 976
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
AK I+ +E+TYDY F + ++ C CG +C+G +N
Sbjct: 977 AKVTIELGDEVTYDYHFPKEEVKIPCLCGSVKCKGTLN 1014
>gi|393244480|gb|EJD51992.1| histone methyltransferase [Auricularia delicata TFB-10046 SS5]
Length = 153
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 71/145 (48%), Positives = 104/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K+L F +S IH +G++A G+MVIEY GE+VR IADRRE +G+ +
Sbjct: 10 QTRKKQLRFARSPIHDWGLYAMERIARGEMVIEYVGEIVRAPIADRREKLYERQGIGS-S 68
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID+E V+DAT+ G++ LINHSC+PNC +++ISVNG + I+I+AK+DI+ +ELTY
Sbjct: 69 YLFRIDEELVVDATKKGNLGRLINHSCDPNCTAKIISVNGVKKIVIYAKQDIELGDELTY 128
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F + ++ C CG +CRG +N
Sbjct: 129 DYHFPREEAKIPCLCGAAKCRGFLN 153
>gi|430813239|emb|CCJ29409.1| unnamed protein product [Pneumocystis jirovecii]
Length = 375
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 69/142 (48%), Positives = 101/142 (71%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L F KS IH +G++A GDMVIEY GE+VR ++AD RE +G+ +Y+F
Sbjct: 235 KKQLKFSKSPIHNWGLYAMEHIDMGDMVIEYVGEIVRQTVADIRERQYERQGIGS-SYLF 293
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RIDD+ V+DAT+ G+IA INHSC+P+C +++I V G++ I+I+A RDI++ EE+TYDY+
Sbjct: 294 RIDDDTVVDATKKGNIARFINHSCDPSCTAKIIRVEGEKKIVIYAHRDIEKGEEITYDYK 353
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 354 FPIEDVKIPCLCGAKACRGTLN 375
>gi|12847486|dbj|BAB27589.1| unnamed protein product [Mus musculus]
Length = 209
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/158 (50%), Positives = 104/158 (65%), Gaps = 4/158 (2%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L MA +++H+K+T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 54 LPMAMRFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREK 113
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 114 FYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFAL 171
Query: 801 RDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
R I + EELTYDY+F D +L C CG RCR N
Sbjct: 172 RRILRGEELTYDYKFPIEDASNKLPCNCGAKRCRRFFN 209
>gi|403159096|ref|XP_003890756.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
gi|375166585|gb|EHS63201.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
Length = 1502
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 131/218 (60%), Gaps = 9/218 (4%)
Query: 626 PEALAAASLKRLFVENQPYLVGGYCQNG----LSGNTLPSIRVIGSKFSFSLHRD-APNF 680
P +A +L R N + QN LS P+ + S + + AP+
Sbjct: 1287 PTQFSALALSRSTRVNSRRFILNMEQNKKASILSSGNNPANDSLDSSLQLQANPNPAPSL 1346
Query: 681 LSMAD--KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
S+AD K+ ++ T +K+L F +S IH +G++A AG+MVIEY GE++R ++ADRR
Sbjct: 1347 DSVADVLKFNQLR-TRKKQLKFSRSPIHDWGLYAMETIPAGEMVIEYVGEVIRQAVADRR 1405
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E +Y + +Y+FR+DD+ V+DAT+ G++ LINH C PNC +++I++NG++ I+I+
Sbjct: 1406 EK-LYERMGIGSSYLFRVDDDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIY 1464
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
AK I+ +E+TYDY F + ++ C CG +C+G +N
Sbjct: 1465 AKVTIELGDEVTYDYHFPKEEVKIPCLCGSVKCKGTLN 1502
>gi|414590164|tpg|DAA40735.1| TPA: putative trithorax-like family protein [Zea mays]
Length = 1591
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 114/372 (30%), Positives = 173/372 (46%), Gaps = 54/372 (14%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+W H+ C +W P T + M D ++ S + CSIC + G+ ++C C V
Sbjct: 1267 QWVHMVCGLWTPGTKCPNDTTMSAFD-VSGASPAKRNTACSICDRTGGSFVKCRAVNCLV 1325
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLV 591
+H CA GL L S E + ++ I C H + + +
Sbjct: 1326 FFHAWCAHQRGL-----------LQSEPEGEHNENIGFYGRCVSHAIVFSSHVNPKKEYF 1374
Query: 592 QVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 651
+ +N CAR+E + GR+G E + +S K+ + +
Sbjct: 1375 R-----------NNNWTCARTEGFK--GRKG----EGFSDSSHKKYEEYSGEF------- 1410
Query: 652 NGLSGNTLPS-IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGI 710
G+S + + +R+ GSK R ++ YK +K K L KSGIHG G+
Sbjct: 1411 -GVSQEQINAWLRINGSK---PCGRGQKEYI----HYKQLKGW--KHLVVYKSGIHGLGL 1460
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS----LVGAGTYMFRIDDERVIDATR 766
+ G MV+EY GE+V +ADRRE Y S + Y F+ID E +IDATR
Sbjct: 1461 YTSVFIPRGSMVVEYVGEIVGQRVADRRE-IEYQSGKRQQYKSACYFFKIDREHIIDATR 1519
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLAC 824
G IA +NHSC+PNC +++ISV ++ ++ FA+R I EE+TYDY F DE ++ C
Sbjct: 1520 KGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYDYHFNREDEGQRILC 1579
Query: 825 YCGFPRCRGVVN 836
+C CR +N
Sbjct: 1580 FCRSRYCRRYLN 1591
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 32/67 (47%), Gaps = 3/67 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N ++C KC + VH CYG L+ G W C C+ + C LC
Sbjct: 1125 CCVCGISDLEPCNRLIECSKCYIKVHQACYGVLKVPRG-QWFCRPCKNNTMD--TVCVLC 1181
Query: 459 PVVGGAM 465
GGAM
Sbjct: 1182 GYGGGAM 1188
>gi|195453659|ref|XP_002073883.1| GK12911 [Drosophila willistoni]
gi|194169968|gb|EDW84869.1| GK12911 [Drosophila willistoni]
Length = 1765
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 138/239 (57%), Gaps = 16/239 (6%)
Query: 607 SGCARSEPYNYFGRRGR-KEPEALAAASLKRLFVEN---QPYLVGGYCQNGLSGNTLPSI 662
+GCAR+E Y R + K A+ + E+ +P + + N L + +
Sbjct: 1534 TGCARTEGYYKLDVREKAKHKYHHTKANADDVQNEDRCDEPTALTNHHHNKL----ISKM 1589
Query: 663 RVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
+ I + + R F SM + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 1590 QGISREARSNQRRLLTAFGSMGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIA 1647
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A INHS
Sbjct: 1648 ADEMVIEYVGQMIRPVVADLRE-TKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1706
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY++VI++ ++ I+I++K+ I EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1707 CNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1765
>gi|158286859|ref|XP_001237160.2| AGAP006776-PA [Anopheles gambiae str. PEST]
gi|157020669|gb|EAU77706.2| AGAP006776-PA [Anopheles gambiae str. PEST]
Length = 4422
Score = 151 bits (382), Expect = 1e-33, Method: Composition-based stats.
Identities = 75/181 (41%), Positives = 101/181 (55%), Gaps = 14/181 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG +G WLC C G P C LC
Sbjct: 273 CDVCRSPDSEEANEMVFCDNCNICVHQACYGITNIPSG-QWLCRTCSMGQK---PKCVLC 328
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P +GGAMK T G +WAH++CA+WIPE + V RMEPI ++ + RW L+C++C
Sbjct: 329 PNMGGAMKSTRSGQKWAHVSCALWIPEVSIGSVDRMEPITKISNIPSSRWALVCALCRER 388
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS TC+ AYH CA GL + + DE+ ED ++L S+C+KH
Sbjct: 389 VGACIQCSVKTCKTAYHVTCAFQHGL--------EMRAIIEDENAEDG-VKLRSYCQKHG 439
Query: 578 Q 578
+
Sbjct: 440 E 440
>gi|414590165|tpg|DAA40736.1| TPA: putative trithorax-like family protein [Zea mays]
Length = 1566
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 114/372 (30%), Positives = 173/372 (46%), Gaps = 54/372 (14%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+W H+ C +W P T + M D ++ S + CSIC + G+ ++C C V
Sbjct: 1242 QWVHMVCGLWTPGTKCPNDTTMSAFD-VSGASPAKRNTACSICDRTGGSFVKCRAVNCLV 1300
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLV 591
+H CA GL L S E + ++ I C H + + +
Sbjct: 1301 FFHAWCAHQRGL-----------LQSEPEGEHNENIGFYGRCVSHAIVFSSHVNPKKEYF 1349
Query: 592 QVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQ 651
+ +N CAR+E + GR+G E + +S K+ + +
Sbjct: 1350 R-----------NNNWTCARTEGFK--GRKG----EGFSDSSHKKYEEYSGEF------- 1385
Query: 652 NGLSGNTLPS-IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGI 710
G+S + + +R+ GSK R ++ YK +K K L KSGIHG G+
Sbjct: 1386 -GVSQEQINAWLRINGSK---PCGRGQKEYI----HYKQLKGW--KHLVVYKSGIHGLGL 1435
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS----LVGAGTYMFRIDDERVIDATR 766
+ G MV+EY GE+V +ADRRE Y S + Y F+ID E +IDATR
Sbjct: 1436 YTSVFIPRGSMVVEYVGEIVGQRVADRRE-IEYQSGKRQQYKSACYFFKIDREHIIDATR 1494
Query: 767 AGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLAC 824
G IA +NHSC+PNC +++ISV ++ ++ FA+R I EE+TYDY F DE ++ C
Sbjct: 1495 KGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYDYHFNREDEGQRILC 1554
Query: 825 YCGFPRCRGVVN 836
+C CR +N
Sbjct: 1555 FCRSRYCRRYLN 1566
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 32/67 (47%), Gaps = 3/67 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N ++C KC + VH CYG L+ G W C C+ + C LC
Sbjct: 1100 CCVCGISDLEPCNRLIECSKCYIKVHQACYGVLKVPRG-QWFCRPCKNNTMD--TVCVLC 1156
Query: 459 PVVGGAM 465
GGAM
Sbjct: 1157 GYGGGAM 1163
>gi|326670188|ref|XP_699940.5| PREDICTED: histone-lysine N-methyltransferase SETD1B [Danio rerio]
Length = 753
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 128/231 (55%), Gaps = 18/231 (7%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE Y + +R E L +R E Q S + G
Sbjct: 540 TGCARSEGYYFISKR-----EKLQYLCNERTVSEE----FTADSQVSHSSRS-------G 583
Query: 667 SKFSFSLHRDAPNFLSMADKYKHMKETFRK-RLAFGKSGIHGFGIFAKHPHRAGDMVIEY 725
S+ R +F +D K + FRK RL FGKS IH +G+FA+ P A +M+IEY
Sbjct: 584 SELRAEQRRLLSSFSCDSDLLKFNQLKFRKKRLRFGKSRIHDWGLFAEEPIAADEMIIEY 643
Query: 726 TGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSR 785
G+ +R IAD RE +G+ +Y+FR+D + +IDAT+ G++A INHSC PNCY++
Sbjct: 644 VGQSIRQVIADMRERRYETEGIGS-SYLFRVDHDTIIDATKCGNLARFINHSCNPNCYAK 702
Query: 786 VISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
VI+V + I+I++++ I EE+TYDY+F DE++ C C CRG +N
Sbjct: 703 VITVEAQKKIVIYSRQPITVNEEITYDYKFPIEDEKIPCLCAAENCRGTLN 753
>gi|157734198|gb|ABV68922.1| SDG25 [Arabidopsis thaliana]
Length = 1388
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/162 (46%), Positives = 105/162 (64%), Gaps = 5/162 (3%)
Query: 679 NFLSMADKYKHMK----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ AD +K + +K L F +S IH +G+ A P A D VIEY GEL+R SI
Sbjct: 1228 NLLAAADGADVLKMSQLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSI 1287
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
++ RE Y + +Y+FR+DD V+DAT+ G IA INHSCEPNCY+++ISV G +
Sbjct: 1288 SEIRER-QYEKMGIGSSYLFRLDDGYVLDATKRGGIARFINHSCEPNCYTKIISVEGKKK 1346
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I I+AKR I EE++Y+Y+F D+++ C CG P+CRG +N
Sbjct: 1347 IFIYAKRHIDAGEEISYNYKFPLEDDKIPCNCGAPKCRGSLN 1388
>gi|410922130|ref|XP_003974536.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Takifugu
rubripes]
Length = 1302
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 92/241 (38%), Positives = 133/241 (55%), Gaps = 25/241 (10%)
Query: 603 PSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQ------PYLVGGYCQNGLSG 656
P + +G ARSE F R K+ KR ++ N P G G++G
Sbjct: 1080 PQHKTGSARSE---GFYRTSTKD---------KRTYMNNTELSAELPSTQG----TGVAG 1123
Query: 657 NTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLA-FGKSGIHGFGIFAKHP 715
S+R GS F R +F +D K + FRK++ F S IH +G+FA P
Sbjct: 1124 QQSTSLR-WGSDFRLEQRRLLSSFNCDSDLLKFNQLKFRKKMTRFSPSHIHEWGLFAMEP 1182
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
A +MV+EY GE++R IAD RE S + + +YMFRID E +IDAT+ G++A IN
Sbjct: 1183 IAAEEMVMEYVGEIIRQVIADMREQRYEESGIRS-SYMFRIDQETIIDATKCGNVARFIN 1241
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSC PNCY+++I+V + I+I++++ I EE+TYDY+F S D ++ C C CRG +
Sbjct: 1242 HSCNPNCYAKIITVESQKKIVIYSRQPISINEEITYDYKFPSEDTKIPCLCRATGCRGSL 1301
Query: 836 N 836
N
Sbjct: 1302 N 1302
>gi|194898301|ref|XP_001978769.1| GG11901 [Drosophila erecta]
gi|190650472|gb|EDV47727.1| GG11901 [Drosophila erecta]
Length = 1626
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 138/239 (57%), Gaps = 16/239 (6%)
Query: 607 SGCARSEP-YNYFGRRGRKEPEALAAASLKRLFVEN---QPYLVGGYCQNGLSGNTLPSI 662
+GCAR+E Y R K A A+ + E+ +P + + N L + +
Sbjct: 1395 TGCARTEGFYKLDVREKAKHKYHHAKANTEDSINEDRSDEPTALTNHHHNKL----ISKM 1450
Query: 663 RVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
+ I + + R F SM + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 1451 QGISREARSNQRRLLTAFGSMGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIA 1508
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A INHS
Sbjct: 1509 ADEMVIEYVGQMIRPVVADLRE-TKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1567
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY++VI++ ++ I+I++K+ I EE+TYDY+F +E++ C CG CRG +N
Sbjct: 1568 CNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1626
>gi|198414837|ref|XP_002125245.1| PREDICTED: similar to AGAP002246-PA, partial [Ciona intestinalis]
Length = 754
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 132/232 (56%), Gaps = 8/232 (3%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE + + L L +N P G S +T
Sbjct: 529 TGCARSEGFYKVNDNTK-----LIQRKLMYRQADNTPQPKGRSKYANESASTSGVATEKS 583
Query: 667 SKFSFSLHRDAPNFLSMA-DKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIE 724
+ + R A F S A D K+ + +RK+ + F +S IHG+G+FA+ A +MVIE
Sbjct: 584 RETRHMMRRIASEFGSDASDLLKYNQLMYRKKSVKFKRSHIHGWGLFAEETIGADEMVIE 643
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y G+LVR IADRRE +G+ +Y+FRID + +IDAT+ G+ A +NHSC P+CY+
Sbjct: 644 YVGQLVRSLIADRREVDYTRRGIGS-SYLFRIDSDHIIDATKCGNFARFMNHSCNPSCYA 702
Query: 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+VI+V+G + I+I++K IK +E+TYDY+F D ++ C+CG P CRG +N
Sbjct: 703 KVIAVDGAKKIVIYSKDTIKPTDEITYDYKFPIEDVKIPCFCGAPNCRGTLN 754
>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
finger protein HRX) (ALL-1) (Trithorax-like protein)
(Lysine N-methyltransferase 2A) (CXXC-type zinc finger
protein 7) [Ciona intestinalis]
Length = 3406
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 86/235 (36%), Positives = 132/235 (56%), Gaps = 29/235 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NP GCAR+E Y +GR +P+ + + K + P ++ L
Sbjct: 3198 NPHGCARAEQY-----KGRSKPDMFSFLASKHRY---PPVYDPDRQEDDLQ--------- 3240
Query: 665 IGSKFSFSLHRDAPNF-LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 723
++R A + L MA +++ ++ R + +S IHG G++ K +G+M++
Sbjct: 3241 -------QINRRATSCDLPMAMRFRQLRHLTRDSVGVYRSTIHGRGLYCKRDFDSGEMIM 3293
Query: 724 EYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 783
EYTG+++R + D+RE + + +G YMFR+DD V+DAT GS A INHSC+PNCY
Sbjct: 3294 EYTGQIIRQELTDKREKYYESKSIGC--YMFRMDDFYVVDATVLGSGARFINHSCDPNCY 3351
Query: 784 SRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
SR++ G +HI+IFA R+I + EELTYDY+F DE ++AC CG CR +N
Sbjct: 3352 SRIVQFEGKKHIVIFALREIYKGEELTYDYKFPIEDENHKIACTCGARLCRKFMN 3406
>gi|402081815|gb|EJT76960.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1319
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/275 (36%), Positives = 146/275 (53%), Gaps = 30/275 (10%)
Query: 583 RLAVDERLVQVTRRCCD--YIPPSNPSGCARSE--------------PYNYFGRRGRKEP 626
RL VD ++ + + Y+P NP+G AR+E P++ ++ R+E
Sbjct: 1054 RLNVDPKMGGAKQPKIEGYYVP--NPTGSARTEGVKKILNSEKSKYLPHHIKVKKAREER 1111
Query: 627 EALAA-ASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD 685
+A A A K L V P +N +S T + R + L+ +D
Sbjct: 1112 QARAINAPSKTLTVVKVP------DENNMSRGTSRANRANHRRVVADLNDQKKTLGQDSD 1165
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
+ + RK+ + F +S IH +G++A DM+IEY GE VRPS+A RE
Sbjct: 1166 VLRFNQLRKRKKPVKFARSAIHNWGLYAMENIPKDDMIIEYVGEEVRPSVAKVREARYLK 1225
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
S +G+ TY+FRIDDE VIDAT+ G IA INHSC PNC +++I V G + I+I+A RDI
Sbjct: 1226 SGIGS-TYLFRIDDEAVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIG 1284
Query: 805 QWEELTYDYRFFSIDEQ---LACYCGFPRCRGVVN 836
Q EELTYDY+F ++Q + C CG C+G +N
Sbjct: 1285 QNEELTYDYKFEPEEDQKDRVPCLCGTTACKGFLN 1319
>gi|47225089|emb|CAF97504.1| unnamed protein product [Tetraodon nigroviridis]
Length = 352
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/159 (49%), Positives = 105/159 (66%), Gaps = 6/159 (3%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
+ + ++K +K T ++ + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 197 VPLTVRFKQLKVTSKESVGVYRSPIHGRGLFCKKTIEAGEMVIEYSGNVIRSVLTDKREK 256
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
Y G G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+V+G +HI+IFA
Sbjct: 257 --YYDGKGIGCYMFRIDDYEVVDATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFAS 314
Query: 801 RDIKQWEELTYDYRFFSIDE---QLACYCGFPRCRGVVN 836
R I Q EELTYDY+ F I+E +L C C +CR +N
Sbjct: 315 RRIYQGEELTYDYK-FPIEEASSKLPCNCNSKKCRKFLN 352
>gi|443726566|gb|ELU13685.1| hypothetical protein CAPTEDRAFT_150651 [Capitella teleta]
Length = 292
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 91/249 (36%), Positives = 134/249 (53%), Gaps = 34/249 (13%)
Query: 599 DYIPPSNPSGCARSEPYNYFGR----------RGRKEPEALAAASLKRLFVENQPYLVGG 648
D + ++ +GCAR+E Y RG E +A AA K++ + +
Sbjct: 67 DEVMVAHKTGCARTEGYYKISHQEKHGYLKDFRGVNERDARDAALKKKVQISREARSESR 126
Query: 649 YCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSGIHG 707
QN L +++G + D K + FRK+ L F KSGIH
Sbjct: 127 RWQNVL--------QLMGE--------------DIGDLLKFNQLKFRKKNLKFAKSGIHD 164
Query: 708 FGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRA 767
+G+FA P G+MVIEY G +VR S AD RE Y ++ +Y+FRID + +IDAT+
Sbjct: 165 WGLFALEPIAEGEMVIEYVGAVVRQSTADLREK-KYEAMGIGSSYLFRIDHDLIIDATKC 223
Query: 768 GSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCG 827
G++A INHSC PNC +++I+V + I+I+++RDI EE+TYDY+F DE++ C CG
Sbjct: 224 GNLARFINHSCNPNCVAKIITVESHKKIVIYSRRDIGVNEEITYDYKFPLEDEKIPCLCG 283
Query: 828 FPRCRGVVN 836
CRG +N
Sbjct: 284 TSACRGTLN 292
>gi|336368694|gb|EGN97037.1| hypothetical protein SERLA73DRAFT_58213 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1092
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 73/150 (48%), Positives = 92/150 (61%), Gaps = 2/150 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E +N + CD C + VH CYG G WLC C +PE P C LC
Sbjct: 134 CAICDDSEGENSNAIVFCDGCNLAVHQDCYGVPYIPEGQ-WLCRKCT-VSPEIPVSCILC 191
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G WAHL CAIWIPET + + MEPI G++++SK RWKL CSICG+
Sbjct: 192 PNEGGAFKQTVFGDWAHLLCAIWIPETRVANEVFMEPITGVDKISKQRWKLKCSICGIRE 251
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQCS T+C +A+H CAR L + ++
Sbjct: 252 GACIQCSKTSCFLAFHATCARKEKLLMPMK 281
>gi|449458127|ref|XP_004146799.1| PREDICTED: uncharacterized protein LOC101220062 [Cucumis sativus]
Length = 1289
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/162 (46%), Positives = 104/162 (64%), Gaps = 5/162 (3%)
Query: 679 NFLSMADKYKHMK----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ AD +K + +KRL F +S IH +G+ A P A D VIEY GEL+RP I
Sbjct: 1129 NLLAAADGADLLKASQLKARKKRLRFQRSKIHDWGLVALEPIEAEDFVIEYVGELIRPRI 1188
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
+D RE Y + +Y+FR+DD V+DAT+ G +A INHSCEPNCY++VI+V G +
Sbjct: 1189 SDIRER-QYEKMGIGSSYLFRLDDGYVVDATKRGGVARFINHSCEPNCYTKVITVEGQKK 1247
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I I+AKR I EE+TY+Y+F ++++ C C RCRG +N
Sbjct: 1248 IFIYAKRHISAGEEITYNYKFPLEEKKIPCNCRSRRCRGSLN 1289
>gi|195062427|ref|XP_001996188.1| GH22347 [Drosophila grimshawi]
gi|193899683|gb|EDV98549.1| GH22347 [Drosophila grimshawi]
Length = 1714
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 137/239 (57%), Gaps = 16/239 (6%)
Query: 607 SGCARSEP-YNYFGRRGRKEPEALAAASLKRLFVEN---QPYLVGGYCQNGLSGNTLPSI 662
+GCAR+E Y R K A A+ E+ +P + + N L + +
Sbjct: 1483 TGCARTEGFYKLDVREKAKHKYHHAKANTHDAQNEDRCDEPTALTNHHHNKL----ISKM 1538
Query: 663 RVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
+ I + + R F SM + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 1539 QGISREARSNQRRLLTAFGSMGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIA 1596
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A INHS
Sbjct: 1597 ADEMVIEYVGQMIRPVVADLRE-TKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1655
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY++VI++ ++ I+I++K+ I EE+TYDY+F +E++ C CG CRG +N
Sbjct: 1656 CNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1714
>gi|357154561|ref|XP_003576824.1| PREDICTED: uncharacterized protein LOC100834435 [Brachypodium
distachyon]
Length = 973
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 178/379 (46%), Gaps = 59/379 (15%)
Query: 472 RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRV 531
+W H+ C +W P T M D ++ + CS+C + G+ + C + +C V
Sbjct: 640 QWIHVVCGLWTPGTKCPKSTTMCAFD-ISGALPAKENSACSVCNRTGGSFMMCRDISCSV 698
Query: 532 AYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ-PLNDRLAVDERL 590
+H CA GL ++ E E LN + I C H +DR+ ++
Sbjct: 699 MFHSWCAHQRGL-LQSEPEGELN----------ENIGFYGRCLNHATLEYSDRVNPEKEH 747
Query: 591 VQVTRRCCDYIPPSNPSGCARSEPYNYFGRRG--------RKEPEALAAASLKRLFVENQ 642
++ SN CAR+E + GR+G +K E+ + S+ + +
Sbjct: 748 LR-----------SNDWTCARTEGFK--GRKGEGWSGSNYKKPQESYSECSVSQEQINAW 794
Query: 643 PYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGK 702
+ G + P IR G +F + F+S+ YK +K +K L K
Sbjct: 795 LRING----------SKPCIR--GQVTTFMI---LYVFISI---YKQLKG--QKHLVVYK 834
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE---HFIYNSLVGAGTYMFRIDDE 759
SGIHG G++ G MVIEY GE+V +AD+RE H + Y F+ID E
Sbjct: 835 SGIHGLGLYTSEFIPRGSMVIEYVGEIVGQRVADKREIEYHSGKRQQYKSVCYFFKIDRE 894
Query: 760 RVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID 819
+IDAT+ G +A INHSC+PNC +++ISV ++ ++ F++R I EE+TYDY F D
Sbjct: 895 HIIDATQKGGVARFINHSCQPNCVAKIISVRNEKKVVFFSERQINPGEEITYDYHFTQED 954
Query: 820 E--QLACYCGFPRCRGVVN 836
E ++ C+C CR +N
Sbjct: 955 EGQRIPCFCRSRSCRRYLN 973
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 30/69 (43%), Gaps = 1/69 (1%)
Query: 397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCC 456
D C VC N +QC KC + VH CYG L+ G W C C+ C
Sbjct: 495 DSCCVCGSPCLEPCNQLIQCSKCYIKVHQACYGVLKVPRG-QWFCRPCKTKTNTEDMVCV 553
Query: 457 LCPVVGGAM 465
LC GGAM
Sbjct: 554 LCGYGGGAM 562
>gi|299746032|ref|XP_002910994.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
gi|298406870|gb|EFI27500.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
Length = 1614
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 103/146 (70%), Gaps = 2/146 (1%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F +S IH +G++A G+MVIEY GE++R +AD+RE +G+ +
Sbjct: 1470 QTRKKHLRFSRSPIHDWGLYAMEKISKGEMVIEYVGEVIRAQVADKREKTYEKQGIGS-S 1528
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID+E V+DAT+ G++ LINHSC+PNC +++I+++G + I+I+AK+DI+ EE+TY
Sbjct: 1529 YLFRIDEEFVVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGEEITY 1588
Query: 812 DYRF-FSIDEQLACYCGFPRCRGVVN 836
DY F D ++ C CG RCRG +N
Sbjct: 1589 DYHFPIEQDNKIPCLCGSARCRGYLN 1614
>gi|340381930|ref|XP_003389474.1| PREDICTED: histone-lysine N-methyltransferase trithorax-like
[Amphimedon queenslandica]
Length = 192
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 74/158 (46%), Positives = 104/158 (65%), Gaps = 2/158 (1%)
Query: 679 NFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
N + +A +Y+ + +R+ + S IHG G+F +GDMVIEY G ++R ++ D R
Sbjct: 37 NDVPVAMRYRACQRMYRENVGVFGSHIHGLGLFCLQEIDSGDMVIEYAGTVIRSTLTDYR 96
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E F Y S G G YMFRID + V+DAT +G++A INHSCEPNCYS+V++V+G + I+IF
Sbjct: 97 ERF-YESR-GIGCYMFRIDSDEVVDATMSGNMARFINHSCEPNCYSKVVAVDGQKKIMIF 154
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
A R I EELTYDY+F + ++ C CG RCR +N
Sbjct: 155 ALRRIVPGEELTYDYKFPIEEAKIPCKCGSARCRKTLN 192
>gi|194764460|ref|XP_001964347.1| GF23123 [Drosophila ananassae]
gi|190614619|gb|EDV30143.1| GF23123 [Drosophila ananassae]
Length = 236
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 90/241 (37%), Positives = 138/241 (57%), Gaps = 16/241 (6%)
Query: 605 NPSGCARSEP-YNYFGRRGRKEPEALAAASLKRLFVENQ---PYLVGGYCQNGLSGNTLP 660
+ +GCAR+E Y R K A A+ L E++ P + + N L +
Sbjct: 3 HKTGCARTEGFYKLDVREKAKHKYHHAKANADDLLNEDRCDDPTALTNHHHNKL----IS 58
Query: 661 SIRVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHP 715
++ I + + R F S+ + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 59 KMQGISREARSNQRRLLTAFGSIGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEP 116
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A IN
Sbjct: 117 IAADEMVIEYVGQMIRPVVADLRES-KYEAIGIGSSYLFRIDMETIIDATKCGNLARFIN 175
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVV 835
HSC PNCY++VI++ ++ I+I++K+ I EE+TYDY+F DE++ C C CRG +
Sbjct: 176 HSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCAAQGCRGTL 235
Query: 836 N 836
N
Sbjct: 236 N 236
>gi|336372757|gb|EGO01096.1| hypothetical protein SERLA73DRAFT_50848 [Serpula lacrymans var.
lacrymans S7.3]
Length = 260
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 104/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F +S IH +G++A G+MVIEY GE++R +AD+RE +G+ +
Sbjct: 117 QTRKKHLRFARSPIHDWGLYAMERISRGEMVIEYVGEVIRAQVADKREKVYERQGIGS-S 175
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID++ V+DAT+ G++ LINHSC+PNC +++I++NG++ I+I+AK+DI+ EE+TY
Sbjct: 176 YLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGEEITY 235
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +++ C CG +CRG +N
Sbjct: 236 DYHFPIEQDKIPCLCGSAKCRGYLN 260
>gi|195388606|ref|XP_002052970.1| GJ23622 [Drosophila virilis]
gi|194151056|gb|EDW66490.1| GJ23622 [Drosophila virilis]
Length = 1687
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 71/152 (46%), Positives = 105/152 (69%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KS IH +G+FA P A +MVIEY G+++RP +AD RE Y
Sbjct: 1539 KFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRE-TKYE 1595
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
++ +Y+FRID E +IDAT+ G++A INHSC PNCY++VI++ ++ I+I++K+ I
Sbjct: 1596 AIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIG 1655
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F +E++ C CG CRG +N
Sbjct: 1656 INEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1687
>gi|19075312|ref|NP_587812.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe
972h-]
gi|74698592|sp|Q9Y7R4.1|SET1_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component set1; AltName:
Full=Lysine N-methyltransferase 2; AltName: Full=SET
domain-containing protein 1; AltName: Full=Set1 complex
component set1; Short=Set1C component set1; AltName:
Full=Spset1
gi|4704279|emb|CAB41652.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe]
Length = 920
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 69/142 (48%), Positives = 93/142 (65%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG S IH G+FA DMVIEY GE++R +AD RE +G +Y+F
Sbjct: 780 KKQLHFGPSRIHTLGLFAMENIDKNDMVIEYIGEIIRQRVADNREKNYVREGIG-DSYLF 838
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID++ ++DAT+ G+IA INHSC PNC +R+I V G I+I+A RDI EELTYDY+
Sbjct: 839 RIDEDVIVDATKKGNIARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDYK 898
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F +++ C CG P CRG +N
Sbjct: 899 FPEEADKIPCLCGAPTCRGYLN 920
>gi|401888120|gb|EJT52085.1| bromodomain and PHD finger-containing protein 3 [Trichosporon
asahii var. asahii CBS 2479]
Length = 1111
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 76/179 (42%), Positives = 101/179 (56%), Gaps = 15/179 (8%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KC++C E +N + CD C + VH CYG + + WLC C +PE P C
Sbjct: 129 KCAICDDGEGENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCLF 186
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G WAHL CAIWIPE + + MEP++G++ V K RWKL CS+C
Sbjct: 187 CPNEGGAFKQTTTGHWAHLLCAIWIPELGVGNAIYMEPVEGVDMVPKSRWKLHCSLCKER 246
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC N +C A+H CAR AGL + ++ L+ D +L ++C+KH
Sbjct: 247 VGACIQCENKSCFTAFHVTCARQAGLLMSMK------LMGADG-------QLKAYCEKH 292
>gi|260791327|ref|XP_002590691.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
gi|229275887|gb|EEN46702.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
Length = 2482
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 67/142 (47%), Positives = 100/142 (70%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L+F KS IH +G+FA P A +MVIEY G+ +R +IAD RE +G+ +Y+F
Sbjct: 2342 KKQLSFRKSRIHDWGLFALEPIAAEEMVIEYVGQCIRQTIADERERRYEEQGIGS-SYLF 2400
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY+++I+V G + I+I+++RDI EE+TYDY+
Sbjct: 2401 RVDHDMIIDATKNGNLARFINHCCNPNCYAKIITVEGYKKIVIYSRRDIAVNEEITYDYK 2460
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F DE++ C CG CRG +N
Sbjct: 2461 FPIEDEKIPCLCGAENCRGTLN 2482
>gi|195496958|ref|XP_002095897.1| GE25383 [Drosophila yakuba]
gi|194181998|gb|EDW95609.1| GE25383 [Drosophila yakuba]
Length = 1628
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 71/152 (46%), Positives = 105/152 (69%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FRK+ L F KS IH +G+FA P A +MVIEY G+++RP +AD RE Y
Sbjct: 1480 KFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRE-TKYE 1536
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
++ +Y+FRID E +IDAT+ G++A INHSC PNCY++VI++ ++ I+I++K+ I
Sbjct: 1537 AIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIG 1596
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F +E++ C CG CRG +N
Sbjct: 1597 INEEITYDYKFPLEEEKIPCLCGAQGCRGTLN 1628
>gi|115489550|ref|NP_001067262.1| Os12g0613200 [Oryza sativa Japonica Group]
gi|108862955|gb|ABA99391.2| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113649769|dbj|BAF30281.1| Os12g0613200 [Oryza sativa Japonica Group]
Length = 1212
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 109/162 (67%), Gaps = 5/162 (3%)
Query: 679 NFLSMADKYKHMK----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ A+ +K ++ +KRL F +S IH +G+ A A D VIEY GEL+R +
Sbjct: 1052 NLLAAAEGTDLLKIMQSKSRKKRLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQV 1111
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
+D RE S +G+ +Y+FR+DD+ V+DAT+ G +A INHSC+PNCY++VI+V G +
Sbjct: 1112 SDIREDQYEKSGIGS-SYLFRLDDDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKK 1170
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I+I+AKR I EELTY+Y+F ++++ C+CG RCRG +N
Sbjct: 1171 IVIYAKRRIYAGEELTYNYKFPLEEKKIPCHCGSQRCRGSMN 1212
>gi|356558250|ref|XP_003547420.1| PREDICTED: uncharacterized protein LOC100806034 [Glycine max]
Length = 1300
Score = 149 bits (375), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 71/142 (50%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K L F +S IH +G+ A P A D VIEY GEL+RP I+D RE Y + +Y+F
Sbjct: 1160 KKHLRFQRSKIHDWGLVALEPIEAEDFVIEYIGELIRPRISDIRER-QYEKMGIGSSYLF 1218
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+DD V+DAT+ G IA INHSCEPNCY++VISV G + I I+AKR I EE+TY+Y+
Sbjct: 1219 RLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1278
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F ++++ C CG +CRG +N
Sbjct: 1279 FPLEEKKIPCNCGSRKCRGSLN 1300
>gi|356532622|ref|XP_003534870.1| PREDICTED: uncharacterized protein LOC100805708 [Glycine max]
Length = 1213
Score = 149 bits (375), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 70/142 (49%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K L F +S IH +G+ A P A D VIEY GEL+RP I+D RE Y + +Y+F
Sbjct: 1073 KKHLRFQRSKIHDWGLLALEPIEAEDFVIEYIGELIRPRISDIRER-QYEKMGIGSSYLF 1131
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+DD V+DAT+ G IA +NHSCEPNCY++VISV G + I I+AKR I EE+TY+Y+
Sbjct: 1132 RLDDGYVVDATKRGGIARFVNHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYK 1191
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F ++++ C CG +CRG +N
Sbjct: 1192 FPLEEKKIPCNCGSRKCRGSLN 1213
>gi|406699157|gb|EKD02370.1| Bromodomain and PHD finger-containing protein 3 [Trichosporon
asahii var. asahii CBS 8904]
Length = 1090
Score = 149 bits (375), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 76/179 (42%), Positives = 101/179 (56%), Gaps = 15/179 (8%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KC++C E +N + CD C + VH CYG + + WLC C +PE P C
Sbjct: 129 KCAICDDGEGENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCLF 186
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G WAHL CAIWIPE + + MEP++G++ V K RWKL CS+C
Sbjct: 187 CPNEGGAFKQTTTGHWAHLLCAIWIPELGVGNAIYMEPVEGVDMVPKSRWKLHCSLCKER 246
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC N +C A+H CAR AGL + ++ L+ D +L ++C+KH
Sbjct: 247 VGACIQCENKSCFTAFHVTCARQAGLLMSMK------LMGADG-------QLKAYCEKH 292
>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 977
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 69/142 (48%), Positives = 95/142 (66%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG S IH G+FA DMVIEY GE+VR +AD RE +G +Y+F
Sbjct: 837 KKQLRFGPSRIHTLGLFAMENIDKNDMVIEYVGEIVRQRVADTRERKYVREGIG-DSYLF 895
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID + ++DAT+ G+IA INHSC PNC +++I V G + I+I+A RDI++ EELTYDY+
Sbjct: 896 RIDKDAIVDATKKGNIARFINHSCAPNCIAKIIRVEGHQKIVIYADRDIEEGEELTYDYK 955
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F +++ C CG P CRG +N
Sbjct: 956 FPEEVDKIPCLCGAPTCRGYLN 977
>gi|302682536|ref|XP_003030949.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
gi|300104641|gb|EFI96046.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
Length = 171
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 104/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F +S IH +G++A G+MVIEY GE++R +AD+RE +G+ +
Sbjct: 28 QTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQVADKREATYERQGIGS-S 86
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID+E V+DAT+ G++ LINHSC+PNC +++I++NG++ I+I+AKRDI+ +E+TY
Sbjct: 87 YLFRIDEEIVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKRDIELGDEITY 146
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +++ C CG +CRG +N
Sbjct: 147 DYHFPFEQDKIPCLCGTAKCRGFLN 171
>gi|328701191|ref|XP_003241521.1| PREDICTED: hypothetical protein LOC100573227 [Acyrthosiphon pisum]
Length = 1315
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 67/142 (47%), Positives = 100/142 (70%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FGKS IH +G+FA A +MVIEY G++VRP +AD RE + +G+ +Y+F
Sbjct: 1175 KKQLKFGKSAIHDWGLFAMESIAADEMVIEYVGQMVRPVVADLRERQYEATGIGS-SYLF 1233
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID + +IDAT+ G++A INHSC PNCY+++I ++G + I+I++K+ I EE+TYDY+
Sbjct: 1234 RIDLDTIIDATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYK 1293
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1294 FPLEDNKIPCLCGTHCCRGTLN 1315
>gi|452822785|gb|EME29801.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 [Galdieria
sulphuraria]
Length = 969
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/231 (38%), Positives = 127/231 (54%), Gaps = 26/231 (11%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
S CAR EPY G R + +PE +K + +N G C ++ N +R
Sbjct: 764 STCARLEPYR--GFRMKTKPEV----EIKSVITQN------GLCLPSMNKNVFNDVRFSD 811
Query: 667 SKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYT 726
+ + + Y+ M+ +R+R +SGI G G++ + VIEY
Sbjct: 812 QQIQ-----------NYSTSYRRMRRRWRQRCKVKQSGIQGLGLYTLENLPDEEFVIEYA 860
Query: 727 GELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRV 786
GEL+RP IAD RE F +G YMFR++D+ ++DAT G+ A INHSCEPNC S++
Sbjct: 861 GELIRPVIADIREKFYDRRKIGC--YMFRLNDDFIVDATMKGNYARFINHSCEPNCRSKI 918
Query: 787 ISVNGDEHII-IFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I+V+GD+ +I IFAKR+I EELTYDY+F E + C CG P CRG +N
Sbjct: 919 ITVDGDKQVIGIFAKRNIAAGEELTYDYQFEEFGETIPCNCGAPNCRGKMN 969
>gi|343427780|emb|CBQ71306.1| related to Peregrin (Bromodomain and PHD finger-containing protein
1) [Sporisorium reilianum SRZ2]
Length = 1220
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/193 (40%), Positives = 107/193 (55%), Gaps = 5/193 (2%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KC++C E +N + CD C + VH CYG + + WLC C +P+ C L
Sbjct: 157 KCAICDDGECENSNAIVFCDGCNLAVHQDCYG-IPYIPEGQWLCRKCT-VSPDRAVSCIL 214
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G+WAHL CA+WIPET +++ MEPID + R+ K RWKL C +C
Sbjct: 215 CPHEGGAFKQTTAGKWAHLLCAMWIPETGVSNPVYMEPIDSVERIPKARWKLQCYLCRYR 274
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDD-EDQCIRLLSFCKKH 576
GACIQC N +C A+H CAR AGL E + L D D+ +D+ +L C
Sbjct: 275 MGACIQCDNRSCFTAFHVTCARKAGLLFRTERTRVSHHLYDDSDNSDDEGAEVLRACCHR 334
Query: 577 KQPLN--DRLAVD 587
P++ D+ +D
Sbjct: 335 HMPVDMRDQFKID 347
>gi|321466655|gb|EFX77649.1| hypothetical protein DAPPUDRAFT_105888 [Daphnia pulex]
Length = 208
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/164 (45%), Positives = 104/164 (63%), Gaps = 4/164 (2%)
Query: 675 RDAPNF--LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRP 732
R NF L M ++KH+++ + + +S IHG G+F K G+MV+EY G+++R
Sbjct: 47 RRVTNFELLPMTVRFKHLRQVAKNSVGVYRSHIHGRGLFCKRDIECGEMVVEYAGQVIRS 106
Query: 733 SIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGD 792
+ D+RE G G YMFRID++ V+DAT G+ A INHSCEPNCYSR++ + G
Sbjct: 107 VLCDKREK--EYEAKGMGCYMFRIDEQTVVDATVHGNAARFINHSCEPNCYSRIVDIFGK 164
Query: 793 EHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+HIIIFA R I + EELTYDY+F D ++ C CG +CR +N
Sbjct: 165 KHIIIFALRRILRGEELTYDYKFPLEDVKIRCTCGSRKCRKYLN 208
>gi|328711160|ref|XP_001945277.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
[Acyrthosiphon pisum]
Length = 1322
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 67/142 (47%), Positives = 100/142 (70%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FGKS IH +G+FA A +MVIEY G++VRP +AD RE + +G+ +Y+F
Sbjct: 1182 KKQLKFGKSAIHDWGLFAMESIAADEMVIEYVGQMVRPVVADLRERQYEATGIGS-SYLF 1240
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID + +IDAT+ G++A INHSC PNCY+++I ++G + I+I++K+ I EE+TYDY+
Sbjct: 1241 RIDLDTIIDATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYK 1300
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1301 FPLEDNKIPCLCGTHCCRGTLN 1322
>gi|392560212|gb|EIW53395.1| hypothetical protein TRAVEDRAFT_154887 [Trametes versicolor FP-101664
SS1]
Length = 1014
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 106/151 (70%), Gaps = 2/151 (1%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
KY + +T +K L F +S IH +G++A G+MVIEY GE++R +AD+RE
Sbjct: 866 KYNQL-QTRKKHLRFARSPIHDWGLYAMEKISRGEMVIEYVGEVIRAQVADKREKAYERQ 924
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FRID++ V+DAT+ G++ LINHSC+PNC +++I+++G++ I+I+AK+DI+
Sbjct: 925 GIGS-SYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIEL 983
Query: 806 WEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
E+TYDY F +++ C CG +CRG +N
Sbjct: 984 GSEITYDYHFPIEQDKIPCLCGSAKCRGTLN 1014
>gi|395329295|gb|EJF61682.1| hypothetical protein DICSQDRAFT_85722 [Dichomitus squalens LYAD-421
SS1]
Length = 1095
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 106/151 (70%), Gaps = 2/151 (1%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
KY + +T +K L F +S IH +G++A GD+VIEY GE++R +AD+RE
Sbjct: 947 KYNQL-QTRKKHLRFARSPIHDWGLYAMEKINRGDLVIEYVGEVIRAQVADKREKAYERQ 1005
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FRID++ V+DAT+ G++ LINHSC+PNC +++I+++G++ I+I+AK+DI+
Sbjct: 1006 GIGS-SYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIEL 1064
Query: 806 WEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
E+TYDY F +++ C CG +CRG +N
Sbjct: 1065 GSEITYDYHFPIEQDKIPCLCGSAKCRGTLN 1095
>gi|405121297|gb|AFR96066.1| bromodomain and PHD finger-containing protein 3 [Cryptococcus
neoformans var. grubii H99]
Length = 1188
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/185 (43%), Positives = 101/185 (54%), Gaps = 16/185 (8%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KCS+C E +N + CD C + VH CYG + + WLC C +PE P C
Sbjct: 126 KCSICDDGEGENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCIF 183
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G+WAHL CAIWIPET L + MEP++ + V K RWKL+CS+C
Sbjct: 184 CPNEGGAFKQTTTGQWAHLLCAIWIPETTLGNSIYMEPVESVELVPKGRWKLVCSLCKER 243
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQC N C A+H CAR GL ++ SL D L ++C KH
Sbjct: 244 VGACIQCENRNCFTAFHVTCARQLGLLQSMK--------SLTTDGT-----LHAYCHKH- 289
Query: 578 QPLND 582
PL++
Sbjct: 290 MPLDE 294
>gi|363742984|ref|XP_419262.3| PREDICTED: LOW QUALITY PROTEIN: bromodomain and PHD
finger-containing protein 3 [Gallus gallus]
Length = 1174
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/184 (41%), Positives = 100/184 (54%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCMDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EPI+G+N + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPIEGINNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E +N + +R ++C+
Sbjct: 333 MGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSINGTTF-------TVRKTAYCES 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|406863054|gb|EKD16102.1| SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1280
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 90/255 (35%), Positives = 139/255 (54%), Gaps = 22/255 (8%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P N +GCAR+E P++ +R R+E +A A+ + + E
Sbjct: 1030 YVP--NATGCARTEEFKKILNSEKSLYLPHHIKVKRAREERQAQASRAGRDPAAEAALAA 1087
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLA-FGKSG 704
++ GN+ + RV +++ ++ N D + + R++L F +S
Sbjct: 1088 KLVAEKSVARGNSRAN-RVTQRRYATDVNEQKKNLGGDVDVLRFNQLQKRQKLVNFERSA 1146
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G+FA DM+IEY GE VR ++AD REH S +G+ +Y+FRID+ VIDA
Sbjct: 1147 IHKWGLFAMENINQNDMIIEYVGEKVRQTVADLREHRYLKSGIGS-SYLFRIDESTVIDA 1205
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQ 821
T+ G IA INHSC PNC +++I+V + I+I+A RDI Q EELTYDY+F ++
Sbjct: 1206 TKKGGIARFINHSCMPNCTAKIITVEKGKRIVIYALRDIAQNEELTYDYKFEREIGSTDR 1265
Query: 822 LACYCGFPRCRGVVN 836
+ C CG C+G +N
Sbjct: 1266 IPCLCGTAACKGFLN 1280
>gi|392572964|gb|EIW66107.1| hypothetical protein TREMEDRAFT_74868 [Tremella mesenterica DSM
1558]
Length = 1413
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 106/194 (54%), Gaps = 20/194 (10%)
Query: 383 PGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCN 442
P + PV + +C+VC E +N + CD C + VH CYG G WLC
Sbjct: 132 PASHLPV-----EDSRCAVCDDGEGENSNAIVFCDGCNLAVHQDCYGVPYIPEG-QWLCR 185
Query: 443 LCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRV 502
C +PE P C CP GGA K TT G WAHL CAIWIPE + + MEP+DG+ +
Sbjct: 186 KCT-VSPENPVSCLFCPNEGGAFKQTTTGHWAHLLCAIWIPELIVGNPIYMEPVDGVETI 244
Query: 503 SKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDD 562
K+RWKL CS+C GACIQC++ +C VA+H CAR GL L+S +
Sbjct: 245 PKNRWKLTCSLCREKVGACIQCADRSCFVAFHVTCARQHGL-----------LMSNRTHN 293
Query: 563 EDQCIRLLSFCKKH 576
D+ ++ ++C+KH
Sbjct: 294 TDELLK--AYCQKH 305
>gi|326933544|ref|XP_003212862.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Meleagris gallopavo]
Length = 1167
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/184 (41%), Positives = 100/184 (54%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCMDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSHPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EPI+G+N + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPIEGINNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E +N + +R ++C+
Sbjct: 333 MGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSINGTTF-------TVRKTAYCES 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|391333600|ref|XP_003741200.1| PREDICTED: uncharacterized protein LOC100900769 [Metaseiulus
occidentalis]
Length = 748
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 105/156 (67%), Gaps = 2/156 (1%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
L M ++ +K+T R+ + +S IH G+FAK AG+MVIEY+GEL+R + D+RE
Sbjct: 595 LHMVMQFYSLKDTARQTVGVFRSKIHRRGVFAKRAIDAGEMVIEYSGELIRAVLTDKREQ 654
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+Y + G YMF+ID++ V+DAT G+ A INHSC+PNCYS+ I + G +HI+I+++
Sbjct: 655 -LYKAR-GIDCYMFKIDEDEVVDATMHGNAARFINHSCDPNCYSKCIEIFGKKHIVIYSQ 712
Query: 801 RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+ IK EELTYDY+F D ++ C CG +CR +N
Sbjct: 713 KRIKVGEELTYDYKFPKEDVKVPCTCGARKCRRYLN 748
>gi|393214462|gb|EJC99954.1| hypothetical protein FOMMEDRAFT_127365 [Fomitiporia mediterranea
MF3/22]
Length = 1195
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/178 (41%), Positives = 100/178 (56%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 132 CAICDDAEGENANAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCILC 189
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G WAHL CAIW+PET + + MEPI G+++++K RWKL CS+CG+
Sbjct: 190 PNEGGAFKQTVSGDWAHLLCAIWVPETRVANDVFMEPITGVDKINKQRWKLKCSLCGIRE 249
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS ++C +A+H CAR L L+ + + L ++C+KH
Sbjct: 250 GACIQCSKSSCFLAFHATCARRQKL-----------LMPMKSAHGSEPATLAAYCEKH 296
>gi|157107709|ref|XP_001649902.1| phd finger protein [Aedes aegypti]
gi|108879514|gb|EAT43739.1| AAEL004834-PA [Aedes aegypti]
Length = 2274
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 98/179 (54%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG + + W+C C G P C LC
Sbjct: 243 CDVCRSPDSEEANEMVFCDNCNICVHQACYG-ITNIPSGQWMCRTCSMGQK---PDCVLC 298
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + V RMEPI ++ + RW L+C +C
Sbjct: 299 PNKGGAMKSTRSGQKWAHVSCALWIPEVSIGSVDRMEPITKISSIPPSRWALICVLCRER 358
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS TC+ AYH CA GL + + DE+ ED ++L S+C+KH
Sbjct: 359 VGACIQCSVKTCKTAYHVTCAFQHGL--------EMRAIIEDENAED-GVKLRSYCQKH 408
>gi|307105681|gb|EFN53929.1| hypothetical protein CHLNCDRAFT_13553, partial [Chlorella
variabilis]
Length = 152
Score = 147 bits (371), Expect = 3e-32, Method: Composition-based stats.
Identities = 69/153 (45%), Positives = 97/153 (63%), Gaps = 5/153 (3%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC+ ++E + + +QC KC++ VH CYG + +G LWLC++C + E PPPC LC
Sbjct: 1 CHVCNEEDEDEEDFIVQCGKCKLFVHMCCYGIRQAPHGKLWLCDVCSL-SLEKPPPCALC 59
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGL----NRVSKDRWKLLCSIC 514
PV+GGA+K TT GRWAH CA+W+PET L + GL +V + R++L C +C
Sbjct: 60 PVLGGALKRTTCGRWAHPTCALWLPETSLDATASHHLLHGLVQGVQKVHRSRFQLTCQLC 119
Query: 515 GVSYGACIQCSNTTCRVAYHPLCARAAGLCVEL 547
+GAC+QC T C +HPLCAR AG +E+
Sbjct: 120 RQQHGACMQCCETRCYAGFHPLCARQAGYQMEV 152
>gi|302666919|ref|XP_003025054.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
gi|291189136|gb|EFE44443.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
Length = 1376
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 91/260 (35%), Positives = 143/260 (55%), Gaps = 30/260 (11%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NPSG AR++ P+ ++ R+E EALA + +
Sbjct: 1124 YVP--NPSGSARTDGRKRILESEKSKYLPHRIKVQKAREEREALAKSDPTAAADAARVIA 1181
Query: 646 VGGYCQNGLSG---NTLPSIRVIGSKFSFSLHRDA-PNFLSMAD--KYKHMKETFRKRLA 699
++ L+ N + + R+I + + A P D ++ +K+ +K +
Sbjct: 1182 AKTLSKSTLTSSRSNRVNNRRLIAD---INAQKQALPTQSGEGDVLRFNQLKKR-KKPVR 1237
Query: 700 FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDE 759
F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+
Sbjct: 1238 FARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDEN 1296
Query: 760 RVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---F 816
VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+F +
Sbjct: 1297 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1356
Query: 817 SIDEQLACYCGFPRCRGVVN 836
D+++ C CG C+G +N
Sbjct: 1357 DSDDRIPCLCGSTGCKGFLN 1376
>gi|113206662|gb|ABI34482.1| SET domain containing 1Bb [Danio rerio]
Length = 175
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 74/172 (43%), Positives = 109/172 (63%), Gaps = 2/172 (1%)
Query: 666 GSKFSFSLHRDAPNFLSMADKYKHMKETFRK-RLAFGKSGIHGFGIFAKHPHRAGDMVIE 724
GS+ R +F +D K + FRK RL FGKS IH +G+FA+ P A +M+IE
Sbjct: 5 GSELRAEQRRLLSSFSCDSDLLKFNQLKFRKKRLRFGKSRIHDWGLFAEEPIAADEMIIE 64
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y G+ +R IAD RE +G+ +Y+FR+D + +IDAT+ G++A INHSC PNCY+
Sbjct: 65 YVGQSIRQVIADMRERRYETEGIGS-SYLFRVDHDTIIDATKCGNLARFINHSCNPNCYA 123
Query: 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+VI+V + I+I++++ I EE+TYDY+F DE++ C C CRG +N
Sbjct: 124 KVITVEAQKKIVIYSRQPITVNEEITYDYKFPIEDEKIPCLCAAENCRGTLN 175
>gi|71023893|ref|XP_762176.1| hypothetical protein UM06029.1 [Ustilago maydis 521]
gi|46101634|gb|EAK86867.1| hypothetical protein UM06029.1 [Ustilago maydis 521]
Length = 1227
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/170 (44%), Positives = 96/170 (56%), Gaps = 2/170 (1%)
Query: 395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPP 454
D KC++C E +N + CD C + VH CYG + + WLC C +P+
Sbjct: 154 DDSKCAICDDGECENSNAIVFCDGCNLAVHQDCYG-IPYIPEGQWLCRKCT-VSPDRAVS 211
Query: 455 CCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSIC 514
C LCP GGA K TT G+WAHL CA+WIPET +++ MEPID + R+ K RWKL C +C
Sbjct: 212 CILCPHEGGAFKQTTTGKWAHLLCAMWIPETGVSNPVYMEPIDSVERIPKARWKLQCYLC 271
Query: 515 GVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDED 564
GACIQC N +C A+H CAR AGL E + L D D+ D
Sbjct: 272 RYRMGACIQCDNRSCFTAFHVTCARKAGLLFRTERTRVSHHLYDDSDNSD 321
>gi|324507672|gb|ADY43247.1| Histone-lysine N-methyltransferase set-2 [Ascaris suum]
Length = 539
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 107/155 (69%), Gaps = 3/155 (1%)
Query: 684 ADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI 742
+D +K + +RK++ F +S IHG+G++A P +M++EY G+ +RP++AD RE
Sbjct: 386 SDFFKVNQLKYRKKMIKFARSRIHGWGLYALEPITPDEMIVEYVGQKIRPTVADEREKRY 445
Query: 743 YNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRD 802
+G+ +Y+FRID + VIDAT G+ A INHSC+PNCY++V+ V+G++ I+I++K
Sbjct: 446 IRKGMGS-SYLFRIDSDNVIDATNMGNFARFINHSCQPNCYAKVVVVDGEKRIVIYSKTP 504
Query: 803 IKQWEELTYDYRF-FSIDEQLACYCGFPRCRGVVN 836
I + +E+TYDY+F ++++ C CG P CRG +N
Sbjct: 505 INKGDEITYDYKFPIEEEDKIDCLCGAPSCRGTLN 539
>gi|390594432|gb|EIN03843.1| hypothetical protein PUNSTDRAFT_146824 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1177
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 69/141 (48%), Positives = 86/141 (60%), Gaps = 2/141 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 133 CAICDDSEGENMNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPEIPVSCILC 190
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G WAHL CAIW+PET + + MEPI G+ ++ K RWKL CS+C V
Sbjct: 191 PNEGGAFKQTVTGDWAHLLCAIWVPETRVANEVFMEPITGVEKIPKQRWKLKCSLCDVRE 250
Query: 519 GACIQCSNTTCRVAYHPLCAR 539
GACIQCS ++C VA+HP CAR
Sbjct: 251 GACIQCSKSSCFVAFHPTCAR 271
>gi|303313714|ref|XP_003066866.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106533|gb|EER24721.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 1271
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 107/316 (33%), Positives = 162/316 (51%), Gaps = 35/316 (11%)
Query: 547 LEDEDRLNLL--SLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPS 604
++D++ +N L +L E L ++ KHK+ D V R Y P
Sbjct: 965 IKDDEDINFLKKALSEHLAADIGNLAAWTWKHKEIKAINRGGDRGPVHSETRVDGYYVP- 1023
Query: 605 NPSGCARSEPYNYFGRRGRKEPEA---------LAAASLKRLF-VENQPYLVGGYCQ--- 651
NPSG AR+E GR+ +E E + A +RL +N P+
Sbjct: 1024 NPSGSARTE-----GRKRIRESEKSKYLPHRIKVQKAREERLAKAKNDPHAAAAEAARLL 1078
Query: 652 --NGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD------KYKHMKETFRKRLAFGKS 703
LS +T S RV + ++ L M + ++ +K+ +K + F +S
Sbjct: 1079 AAKSLSKSTSRSTRVNNRRLIADINAQK-QALPMQNGDSDVLRFNQLKKR-KKPVRFARS 1136
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+ VID
Sbjct: 1137 AIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVID 1195
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V+G + I+I+A RDI + EELTYDY+F + D+
Sbjct: 1196 ATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDD 1255
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1256 RIPCLCGSAGCKGFLN 1271
>gi|321260108|ref|XP_003194774.1| bromodomain and PHD finger-containing protein 3 [Cryptococcus
gattii WM276]
gi|317461246|gb|ADV22987.1| Bromodomain and PHD finger-containing protein 3, putative
[Cryptococcus gattii WM276]
Length = 1170
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 80/185 (43%), Positives = 100/185 (54%), Gaps = 16/185 (8%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KCS+C E +N + CD C + VH CYG + + WLC C +PE P C
Sbjct: 126 KCSICDDGEGENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCIF 183
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G WAHL CAIWIPET L + MEP++ + V K RWKL+CS+C
Sbjct: 184 CPNEGGAFKQTTTGHWAHLLCAIWIPETTLGNSIYMEPVESVELVPKGRWKLVCSLCKER 243
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQC N C A+H CAR GL ++ SL D L ++C KH
Sbjct: 244 IGACIQCENRNCFTAFHVTCARQLGLLQSMK--------SLTTDGT-----LHAYCHKH- 289
Query: 578 QPLND 582
PL++
Sbjct: 290 MPLDE 294
>gi|320032561|gb|EFW14513.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 1271
Score = 147 bits (370), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 107/316 (33%), Positives = 162/316 (51%), Gaps = 35/316 (11%)
Query: 547 LEDEDRLNLL--SLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPS 604
++D++ +N L +L E L ++ KHK+ D V R Y P
Sbjct: 965 IKDDEDINFLKKALSEHLAADIGNLAAWTWKHKEIKAINRGGDRGPVHSETRVDGYYVP- 1023
Query: 605 NPSGCARSEPYNYFGRRGRKEPEA---------LAAASLKRLF-VENQPYLVGGYCQ--- 651
NPSG AR+E GR+ +E E + A +RL +N P+
Sbjct: 1024 NPSGSARTE-----GRKRIRESEKSKYLPHRIKVQKAREERLAKAKNDPHAAAAEAARLL 1078
Query: 652 --NGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD------KYKHMKETFRKRLAFGKS 703
LS +T S RV + ++ L M + ++ +K+ +K + F +S
Sbjct: 1079 AAKSLSKSTSRSTRVNNRRLIADINAQK-QALPMQNGDSDVLRFNQLKKR-KKPVRFARS 1136
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+ VID
Sbjct: 1137 AIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVID 1195
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V+G + I+I+A RDI + EELTYDY+F + D+
Sbjct: 1196 ATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDD 1255
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1256 RIPCLCGSAGCKGFLN 1271
>gi|241554585|ref|XP_002399516.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215501703|gb|EEC11197.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 544
Score = 147 bits (370), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 71/147 (48%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 690 MKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGA 749
+K++ + + +S IHG G+++K AG+MVIEY G++VR + D+RE +Y S G
Sbjct: 400 LKDSAKATVGVYRSLIHGRGLYSKRNIDAGEMVIEYAGQVVRSVLTDKRER-LYESR-GI 457
Query: 750 GTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEEL 809
G YMFR+D+++V+DAT G+ A INHSC+PNCYS+VI+V G +HIII+A R I + EEL
Sbjct: 458 GCYMFRMDEDQVVDATTHGNAARFINHSCDPNCYSKVIAVFGQKHIIIYALRKIYKGEEL 517
Query: 810 TYDYRFFSIDEQLACYCGFPRCRGVVN 836
TYDY+F + ++ C CG RCR +N
Sbjct: 518 TYDYKFPKEEVKIPCSCGARRCRKFLN 544
>gi|66805051|ref|XP_636258.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74852293|sp|Q54HS3.1|SET1_DICDI RecName: Full=Histone-lysine N-methyltransferase set1; AltName:
Full=Histone H3 lysine 4 methyltransferase; AltName:
Full=SET domain-containing protein 1
gi|60464689|gb|EAL62816.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 1486
Score = 147 bits (370), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 66/142 (46%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
RKR+ F +S IH +G+FA A DMVIEY GE++R +AD RE +G+ +Y+F
Sbjct: 1346 RKRIKFERSDIHDWGLFAMETISAKDMVIEYIGEVIRQKVADEREKRYVKKGIGS-SYLF 1404
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+DD+ +IDAT G++A INH C+PNC ++V+++ + III+AKRDI EE+TYDY+
Sbjct: 1405 RVDDDTIIDATFKGNLARFINHCCDPNCIAKVLTIGNQKKIIIYAKRDINIGEEITYDYK 1464
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C C P+CR +N
Sbjct: 1465 FPIEDVKIPCLCKSPKCRQTLN 1486
>gi|50292043|ref|XP_448454.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527766|emb|CAG61415.1| unnamed protein product [Candida glabrata]
Length = 761
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 71/181 (39%), Positives = 95/181 (52%), Gaps = 5/181 (2%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC N+ + CD C + VH CYG + G WLC C + C C
Sbjct: 260 CAVCDGTVSTTTNMIVFCDGCDIAVHQECYGIVFIPEG-QWLCRRCFI-SRNKQVNCVTC 317
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GA K T G WAH+ CA+WIPE ++ MEPI+G+ ++K RWKL+C IC +
Sbjct: 318 PSTTGAFKQTHTGSWAHVLCALWIPELVFANLHYMEPIEGVENINKSRWKLVCYICKLRV 377
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ 578
GACIQCSN C AYH CA+ AGLC++ D ++ + + S+C KH
Sbjct: 378 GACIQCSNKNCFAAYHVTCAKRAGLCLDTHD---TSIAEMASKHYQMHHHVTSYCDKHSP 434
Query: 579 P 579
P
Sbjct: 435 P 435
>gi|432887915|ref|XP_004074975.1| PREDICTED: uncharacterized protein LOC101162384 [Oryzias latipes]
Length = 1787
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 126/231 (54%), Gaps = 11/231 (4%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE G + + + RL E P G + S R G
Sbjct: 1567 TGCARSE-----GYYKIDKKDKIKYLQSSRLLSEEPPVDTQGMS---IPAQVQASSRA-G 1617
Query: 667 SKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEY 725
S+ R +F +D K + FRK+ + F KS IH +G+FA P A +MVIEY
Sbjct: 1618 SERRSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSNIHDWGLFAMEPIAADEMVIEY 1677
Query: 726 TGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSR 785
G+ +R IAD RE +G+ +YMFR+D + +IDAT+ G+ A INHSC PNCY++
Sbjct: 1678 VGQNIRQVIADMREKRYEEEGIGS-SYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAK 1736
Query: 786 VISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
VI+V + I+I++++ I EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1737 VITVESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1787
>gi|119185079|ref|XP_001243361.1| hypothetical protein CIMG_07257 [Coccidioides immitis RS]
gi|121936913|sp|Q1DR06.1|SET1_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|392866240|gb|EAS28850.2| histone-lysine N-methyltransferase, H3 lysine-4 specific
[Coccidioides immitis RS]
Length = 1271
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 95/261 (36%), Positives = 142/261 (54%), Gaps = 34/261 (13%)
Query: 600 YIPPSNPSGCARSEPYNYFGRRGRKEPEA---------LAAASLKRLF-VENQPYLVGGY 649
Y+P NPSG AR+E GR+ +E E + A +RL +N P+
Sbjct: 1021 YVP--NPSGSARTE-----GRKRIRESEKSKYLPHRIKVQKAREERLAKAKNDPHAAAAE 1073
Query: 650 CQ-----NGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD------KYKHMKETFRKRL 698
LS +T S RV + ++ L M + ++ +K+ +K +
Sbjct: 1074 AARLLAAKSLSKSTSRSTRVNNRRLIADINAQK-QALPMQNGDSDVLRFNQLKKR-KKPV 1131
Query: 699 AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDD 758
F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+
Sbjct: 1132 RFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDE 1190
Query: 759 ERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF--- 815
VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI + EELTYDY+F
Sbjct: 1191 NTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFERE 1250
Query: 816 FSIDEQLACYCGFPRCRGVVN 836
+ D+++ C CG C+G +N
Sbjct: 1251 WDSDDRIPCLCGSAGCKGFLN 1271
>gi|409047697|gb|EKM57176.1| hypothetical protein PHACADRAFT_142398 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1389
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 63/145 (43%), Positives = 102/145 (70%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F +S IH +G++A G+MVIEY GE++R +AD+RE +G+ +
Sbjct: 1246 QTRKKHLRFARSPIHDWGLYAMEKISRGEMVIEYVGEIIRAQVADKREKAYERQGIGS-S 1304
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID++ V+DAT+ G++ LINHSC+PNC +++I++N ++ I+I+AK+DI+ E+TY
Sbjct: 1305 YLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINSEKKIVIYAKQDIELGSEITY 1364
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +++ C CG +CRG +N
Sbjct: 1365 DYHFPIEQDKIPCLCGSAKCRGYLN 1389
>gi|134113070|ref|XP_774811.1| hypothetical protein CNBF2410 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257457|gb|EAL20164.1| hypothetical protein CNBF2410 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1120
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/185 (43%), Positives = 100/185 (54%), Gaps = 16/185 (8%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KCS+C E +N + CD C + VH CYG + + WLC C +PE P C
Sbjct: 126 KCSICDDGEGENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCIF 183
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G WAHL CAIWIPET L + MEP++ + V K RWKL+CS+C
Sbjct: 184 CPNEGGAFKQTTTGHWAHLLCAIWIPETTLGNSIYMEPVESVELVPKGRWKLVCSLCKER 243
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQC N C A+H CAR GL ++ SL D L ++C KH
Sbjct: 244 VGACIQCENRNCFTAFHVTCARQLGLLKSMK--------SLTTDGT-----LHAYCHKH- 289
Query: 578 QPLND 582
PL++
Sbjct: 290 MPLDE 294
>gi|449016155|dbj|BAM79557.1| unknown RNA binding protein [Cyanidioschyzon merolae strain 10D]
Length = 1151
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 97/283 (34%), Positives = 145/283 (51%), Gaps = 27/283 (9%)
Query: 557 SLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYN 616
+LD+D D R ++H P + L D + + ++ + PP+N +GCAR+E Y
Sbjct: 893 TLDQDKSDIATRY----REHWSPYLETL-FDTHGICIKQKLQEPEPPANSTGCARTEGYA 947
Query: 617 YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRD 676
F R E +A R Q + + + +L RV + ++
Sbjct: 948 AFSR----ELKAF------RWRPPKQVIIPADEARRWSATGSLRGTRVEQRQLLSAVSDR 997
Query: 677 APNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIAD 736
L Y +E F L F +SGIHG+G++A AGD +IEY GELVR ++AD
Sbjct: 998 EQTVL-----YLQQRERF---LRFYRSGIHGYGLWALEDIPAGDYIIEYRGELVRSAVAD 1049
Query: 737 RREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHII 796
RE +G ++MFRID + V+DAT GS+A +NHSC+PN +R++ + G HI+
Sbjct: 1050 LRERAYRQQGMG-DSFMFRIDADTVVDATHIGSVARFVNHSCDPNAIARIVQLGGASHIL 1108
Query: 797 IFAKRDIKQWEELTYDYRFFSID---EQLACYCGFPRCRGVVN 836
++KR I EE+TYDY F D E++ C CG P CR +N
Sbjct: 1109 FYSKRSICVGEEITYDYNFDIEDDASEKVPCLCGAPNCRQYLN 1151
>gi|367024877|ref|XP_003661723.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
gi|347008991|gb|AEO56478.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
Length = 1260
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 91/256 (35%), Positives = 135/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E +A S K +
Sbjct: 1010 YVP--NPTGCARTEGVKKILNSEKSKYLPHHLKVKKAREERQAQNGKSAKDSVLAAAEAA 1067
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKS 703
GN+ + R +F L+ +D ++ +K+ +K + F +S
Sbjct: 1068 RLAAESLVAKGNSR-ANRANNRRFVADLNDQRKTLGQDSDVLRFNQLKKR-KKPVKFARS 1125
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A DM+IEY GE VR IA+ REH S +G+ +Y+FRIDD VID
Sbjct: 1126 AIHNWGLYAMENIPKDDMIIEYVGEEVRQQIAELREHRYLKSGIGS-SYLFRIDDNTVID 1184
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F +
Sbjct: 1185 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTD 1244
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1245 RIPCLCGTAACKGFLN 1260
>gi|224085193|ref|XP_002196597.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Taeniopygia guttata]
Length = 1118
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/184 (41%), Positives = 100/184 (54%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCMDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EPI+G+N + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPIEGINNIPPARWKLTCCICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E +N + +R ++C+
Sbjct: 333 MGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSINGTTF-------TVRKTAYCES 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|392564953|gb|EIW58130.1| hypothetical protein TRAVEDRAFT_71843 [Trametes versicolor
FP-101664 SS1]
Length = 1468
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 73/178 (41%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 135 CAICDDSEGENTNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCILC 192
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G W HL CAIW+PET + + MEPI G+ R+SK RW+L CS+C V
Sbjct: 193 PNEGGAFKQTVSGDWVHLLCAIWVPETAVANDVFMEPITGVERISKQRWRLRCSVCDVRE 252
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GAC+QCS +C VA+H CAR L L+ + + L +C+KH
Sbjct: 253 GACVQCSKASCFVAFHATCARKEKL-----------LMPMKATQGSEAPTLACYCEKH 299
>gi|58268350|ref|XP_571331.1| Bromodomain and PHD finger-containing protein 3 [Cryptococcus
neoformans var. neoformans JEC21]
gi|57227566|gb|AAW44024.1| Bromodomain and PHD finger-containing protein 3, putative
[Cryptococcus neoformans var. neoformans JEC21]
Length = 1064
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/185 (43%), Positives = 100/185 (54%), Gaps = 16/185 (8%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KCS+C E +N + CD C + VH CYG + + WLC C +PE P C
Sbjct: 126 KCSICDDGEGENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCIF 183
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G WAHL CAIWIPET L + MEP++ + V K RWKL+CS+C
Sbjct: 184 CPNEGGAFKQTTTGHWAHLLCAIWIPETTLGNSIYMEPVESVELVPKGRWKLVCSLCKER 243
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQC N C A+H CAR GL ++ SL D L ++C KH
Sbjct: 244 VGACIQCENRNCFTAFHVTCARQLGLLKSMK--------SLTTDGT-----LHAYCHKH- 289
Query: 578 QPLND 582
PL++
Sbjct: 290 MPLDE 294
>gi|212543321|ref|XP_002151815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066722|gb|EEA20815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 1188
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 88/255 (34%), Positives = 140/255 (54%), Gaps = 22/255 (8%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
++P N +G AR+E P+ ++ R+E EALA + + + V++
Sbjct: 938 FVP--NSTGSARTEGKKRIFEAEKSKYLPHRIKVQKAREEREALAQSDPQSVLVDSAKAG 995
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSM-ADKYKHMKETFRKRLAFGKSG 704
N S +T R + + + P A ++ +K+ +K + F +S
Sbjct: 996 PTKPVNNSTSRSTRVQNRRLAADINAQKQVLPPQGGDGDALRFNQLKKR-KKPVRFARSA 1054
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+ VIDA
Sbjct: 1055 IHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENAVIDA 1113
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQ 821
T+ G IA INHSC PNC +++I V+G + I+I+A RDI + EELTYDY+F + +++
Sbjct: 1114 TKRGGIARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDSEDR 1173
Query: 822 LACYCGFPRCRGVVN 836
+ C CG C+G +N
Sbjct: 1174 IPCLCGSAGCKGFLN 1188
>gi|312371947|gb|EFR20005.1| hypothetical protein AND_20789 [Anopheles darlingi]
Length = 4717
Score = 146 bits (369), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 119/220 (54%), Gaps = 44/220 (20%)
Query: 605 NPSGCARSEPYN---------YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLS 655
N G AR EPY+ + R RK+P + A ++ + + +
Sbjct: 4505 NAYGTARCEPYSSRSEYDMFSWLASRHRKQPMPIVAQTIDDTVIPRRG-----------T 4553
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHP 715
G+ LP MA +Y+ +KE+ ++ + +S IHG G+F
Sbjct: 4554 GSNLP----------------------MAMRYRTLKESSKESVGVYRSHIHGRGLFCNRD 4591
Query: 716 HRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLIN 775
AG+MVIEY GEL+R ++ D+RE + Y+ G G YMF+ID+ V+DAT G+ A IN
Sbjct: 4592 IEAGEMVIEYAGELIRSTLTDKRERY-YDGR-GIGCYMFKIDENFVVDATMRGNAARFIN 4649
Query: 776 HSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF 815
HSCEPNCYS+V+ + G +HIIIFA R I Q EELTYDY+F
Sbjct: 4650 HSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKF 4689
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 48/108 (44%), Gaps = 22/108 (20%)
Query: 455 CCLCPVVGGAMKPTTDGR--------WAHLACAIWIPETCLTDVKRMEPIDG-LNRV--S 503
C LC G M P + R W H+ CA+W E E IDG L V +
Sbjct: 2088 CMLCKQQGEGM-PLHESRLLFCGQNNWVHINCALWSAEV-------FEEIDGSLQNVHSA 2139
Query: 504 KDRWKLL-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDE 550
R +L+ C CGV GA + C+ C YH CAR G CV + D+
Sbjct: 2140 ASRGRLIKCCHCGVK-GATVGCNVKNCGEHYHFPCARRIG-CVFMLDK 2185
>gi|357161607|ref|XP_003579145.1| PREDICTED: uncharacterized protein LOC100843412 [Brachypodium
distachyon]
Length = 1194
Score = 146 bits (369), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 69/142 (48%), Positives = 100/142 (70%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+KRL F +S IH +G+ A A D VIEY GEL+R ++D RE S +G+ +Y+F
Sbjct: 1054 KKRLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRPVSDIREAQYEKSGIGS-SYLF 1112
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+DD+ V+DAT+ G +A INHSCEPNCY++VI+V+G + I I++KR I EELTY+Y+
Sbjct: 1113 RLDDDYVVDATKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYSKRRIYAGEELTYNYK 1172
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F ++++ C+CG RCRG +N
Sbjct: 1173 FPLEEKKIPCHCGSLRCRGSMN 1194
>gi|353243391|emb|CCA74938.1| related to regulatory protein SET1 [Piriformospora indica DSM 11827]
Length = 1224
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 103/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
++ +K+L F +S IH +G++A G+MVIEY GE++R +A++RE S +G+ +
Sbjct: 1081 QSRKKQLQFARSPIHDWGLYALERIPKGEMVIEYVGEVIRQQVAEKRERAYERSGIGS-S 1139
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRIDD+ V+DAT+ G++ LINHSC+PNC +++I++ G + I+I+AK DI +E+TY
Sbjct: 1140 YLFRIDDDLVVDATKIGNLGRLINHSCDPNCTAKIITIGGQKKIVIYAKVDIHPGDEVTY 1199
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +E++ C CG +CRG +N
Sbjct: 1200 DYHFPIENEKIPCLCGAAKCRGFLN 1224
>gi|296805347|ref|XP_002843498.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238844800|gb|EEQ34462.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 1344
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 92/259 (35%), Positives = 141/259 (54%), Gaps = 30/259 (11%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+G AR++ P+ ++ R+E EALA + E L
Sbjct: 1094 YVP--NPTGSARTDGRKRILESEKSKYLPHRIKVQKAREEREALAKND-PQATTEASRVL 1150
Query: 646 VGGYCQNGLSG--NTLPSIRVIGSKFSFSLHRDA-PNFLSMAD--KYKHMKETFRKRLAF 700
S N + + R+I + + A P D ++ +K+ +K + F
Sbjct: 1151 AAKTISKSTSSRSNRVNNRRLIAD---INAQKQALPTQSGEGDVLRFNQLKKR-KKPVRF 1206
Query: 701 GKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDER 760
+S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+
Sbjct: 1207 ARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENT 1265
Query: 761 VIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FS 817
VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+F +
Sbjct: 1266 VIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWD 1325
Query: 818 IDEQLACYCGFPRCRGVVN 836
D+++ C CG C+G +N
Sbjct: 1326 SDDRIPCLCGSTGCKGFLN 1344
>gi|358342811|dbj|GAA39455.2| histone-lysine N-methyltransferase MLL3 [Clonorchis sinensis]
Length = 1443
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 124/238 (52%), Gaps = 27/238 (11%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NP+GCARSEP R R +A L+ + N P
Sbjct: 1227 NPTGCARSEPKLRTHVRRRPTGDASQHGQLRNAAIAN------------------PMSTP 1268
Query: 665 IGSKFSFSLHRDAPNFLSMA--DKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ S S ++ A +FLS++ +Y+ ++ +R + +S I G G++A +
Sbjct: 1269 VRSGLSSPVYSTAASFLSVSKSQQYRRLRLEWRYNVILARSQIQGLGLYAARNLSKHAFI 1328
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GEL+R + +RRE L G YMFR+D++ ++DAT G +A INHSCEPNC
Sbjct: 1329 IEYLGELIRNEVGNRRERLY--ELQNRGIYMFRVDEDSIVDATMCGGLARYINHSCEPNC 1386
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ----LACYCGFPRCRGVVN 836
+ V++ HIII A RDI++ EELTYDY+ F I+E+ + C CG P CR +N
Sbjct: 1387 TAEVLNCENGSHIIIIASRDIEKGEELTYDYK-FDIEEERGDRIPCLCGAPNCRRWMN 1443
>gi|281201147|gb|EFA75361.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 1458
Score = 146 bits (368), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 65/142 (45%), Positives = 95/142 (66%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K++ F +S IH +G+FA P A DMVIEY GE++R +AD RE +G+ +Y+F
Sbjct: 1318 KKKIKFSRSAIHDWGLFALEPITAKDMVIEYIGEVIRQKVADEREKRYTKKGIGS-SYLF 1376
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RIDD+ +IDAT G++A INH C+PNC +++I + I+I+AKRDI EE+TYDY+
Sbjct: 1377 RIDDDTIIDATFKGNLARFINHCCDPNCIAKIIQTGNQKKIVIYAKRDIMIGEEITYDYK 1436
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C C +CRG +N
Sbjct: 1437 FPIEDVKIPCLCKSAKCRGTLN 1458
>gi|390605099|gb|EIN14490.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 164
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 104/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F +S IH +G++A G+MVIEY GE++R +AD+RE +G+ +
Sbjct: 21 QTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQVADKREKAYERQGIGS-S 79
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID++ V+DAT+ G++ LINHSC+PNC +++I++NG++ I+I+AK+DI+ +E+TY
Sbjct: 80 YLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGDEITY 139
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +++ C CG +CRG +N
Sbjct: 140 DYHFPIEQDKIPCLCGSAKCRGYLN 164
>gi|402221447|gb|EJU01516.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 164
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 69/142 (48%), Positives = 97/142 (68%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K L F +S IH +G++A AGDMVIEY GE+VR +AD+RE +G+ +Y+F
Sbjct: 24 KKELKFARSPIHDWGLYALEYIPAGDMVIEYVGEVVRQQVADKREKVYERQGIGS-SYLF 82
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RIDD+ V+DAT G+I LINHSC PNC +R+I++N + I+I+AK I+ EE+TYDY
Sbjct: 83 RIDDDLVVDATMKGNIGRLINHSCSPNCTARIITINSSKKIVIYAKTPIEPGEEITYDYH 142
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F E++ C CG +CRG +N
Sbjct: 143 FPIEQEKIPCLCGSEKCRGFLN 164
>gi|170591502|ref|XP_001900509.1| SET domain containing protein [Brugia malayi]
gi|158592121|gb|EDP30723.1| SET domain containing protein [Brugia malayi]
Length = 1056
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 108/154 (70%), Gaps = 2/154 (1%)
Query: 684 ADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI 742
+D +K + FRK+L F +S IHG+G++A +M++EY G+ +RP++AD RE
Sbjct: 904 SDIFKVNQLKFRKKLIKFARSRIHGWGLYALEAIAPDEMIVEYIGQKIRPTVADEREKRY 963
Query: 743 YNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRD 802
+G+ +Y+FRID + VIDAT+ G++A INHSC+PNCY++++ V+G++ I+I++K
Sbjct: 964 ERRGMGS-SYLFRIDSDNVIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLA 1022
Query: 803 IKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I + +E+TYDY+F ++++ C CG P CRG +N
Sbjct: 1023 INKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 1056
>gi|388857302|emb|CCF49144.1| related to Peregrin (Bromodomain and PHD finger-containing protein
1) [Ustilago hordei]
Length = 1261
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 78/193 (40%), Positives = 105/193 (54%), Gaps = 5/193 (2%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KC++C E +N + CD C + VH CYG + + WLC C +P+ C L
Sbjct: 155 KCAICDDGECENSNAIVFCDGCNLAVHQDCYG-IPYIPEGQWLCRKCT-VSPDRAVSCIL 212
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G+WAHL CA+WIPET +++ MEPID + R+ K RWKL C +C
Sbjct: 213 CPHEGGAFKQTTTGKWAHLLCAMWIPETGVSNPVYMEPIDSIERIPKARWKLQCYLCRYR 272
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLL--SLDEDDEDQCIRLLSFCKK 575
GACIQC N +C A+H CAR AGL + L LD D++ L + C +
Sbjct: 273 MGACIQCDNRSCFTAFHVTCARQAGLLFRTVRTRMAHHLYEDLDNSDDEGAEVLRACCHR 332
Query: 576 HKQP-LNDRLAVD 587
H + D+ +D
Sbjct: 333 HMPADMRDQFKID 345
>gi|367037743|ref|XP_003649252.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
NRRL 8126]
gi|346996513|gb|AEO62916.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
NRRL 8126]
Length = 1286
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 135/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E +A + K +
Sbjct: 1036 YVP--NPTGCARTEGVKKILNSEKSKYLPHHLMVQKAREERQAQTGKNAKDAALAAAEAA 1093
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKS 703
GN+ + R +F L+ +D ++ +K+ +K + F +S
Sbjct: 1094 RLAAESLVAKGNSR-ANRANNRRFVADLNDQRKTLGQDSDVLRFNQLKKR-KKPVKFDRS 1151
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A DM+IEY GE VR IA+ REH S +G+ +Y+FRIDD VID
Sbjct: 1152 AIHNWGLYAMENIPKDDMIIEYVGEEVRQQIAEIREHRYLKSGIGS-SYLFRIDDNTVID 1210
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F +
Sbjct: 1211 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTD 1270
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1271 RIPCLCGTAACKGFLN 1286
>gi|443894422|dbj|GAC71770.1| histone H3 (Lys4) methyltransferase complex, subunit SET1 [Pseudozyma
antarctica T-34]
Length = 1366
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 70/150 (46%), Positives = 101/150 (67%), Gaps = 5/150 (3%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K+ ++ T +K+L F KS IH +G++A AGDMVIEY GE+VR +AD RE Y
Sbjct: 1216 KFNQLR-TRKKQLKFAKSPIHDWGLYAMELIPAGDMVIEYVGEVVRQQVADEREK-QYER 1273
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
TY+FR+DD+ V+DAT G+IA L+NH C PNC ++++++NG++ I++FAK I+
Sbjct: 1274 QGNFSTYLFRVDDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRP 1333
Query: 806 WEELTYDYRFFSI---DEQLACYCGFPRCR 832
EELTYDY+F S ++ + C CG P CR
Sbjct: 1334 GEELTYDYKFQSSADDEDAIPCLCGSPGCR 1363
>gi|315045626|ref|XP_003172188.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
gi|311342574|gb|EFR01777.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
Length = 1334
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 87/259 (33%), Positives = 140/259 (54%), Gaps = 28/259 (10%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+G AR++ P+ ++ R+E EALA +
Sbjct: 1082 YVP--NPTGSARTDGRKRILESEKSKYLPHRIKVQKAREEREALAKNDPAAADAAARVIA 1139
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD-----KYKHMKETFRKRLAF 700
++ L+ + S RV + ++ + + ++ +K+ +K + F
Sbjct: 1140 TKTISKSTLTSSR--SNRVNNRRLVADINAQKQALPTQSGEGDVLRFNQLKKR-KKPVRF 1196
Query: 701 GKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDER 760
+S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+
Sbjct: 1197 ARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENT 1255
Query: 761 VIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FS 817
VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+F +
Sbjct: 1256 VIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWD 1315
Query: 818 IDEQLACYCGFPRCRGVVN 836
D+++ C CG C+G +N
Sbjct: 1316 SDDRIPCLCGSTGCKGFLN 1334
>gi|348523333|ref|XP_003449178.1| PREDICTED: bromodomain and PHD finger-containing protein 3
[Oreochromis niloticus]
Length = 1232
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 76/191 (39%), Positives = 103/191 (53%), Gaps = 8/191 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE +N+ L CD C + VH CYG G WLC C +P+ P C LC
Sbjct: 238 CCVCLDDECLNSNVILFCDSCNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPQKPVDCVLC 295
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EP++G+N + RWKL C +C
Sbjct: 296 PNRGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPVEGVNNIPPARWKLTCYLCKQKG 355
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E N+ + ++ +FC+ H
Sbjct: 356 RGASIQCHKANCYTAFHVTCAQRAGLFMKIEPVRETNI-----NGTTFSVKKTAFCEAHS 410
Query: 578 QPLNDRLAVDE 588
P + + DE
Sbjct: 411 PPGQEPGSDDE 421
>gi|392590566|gb|EIW79895.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 160
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 104/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F +S IH +G++A G+MVIEY GE+VR +AD+RE +G+ +
Sbjct: 17 QTRKKHLRFARSPIHDWGLYAMERVSRGEMVIEYVGEVVRAQVADKREKAYERQGIGS-S 75
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID++ V+DAT+ G++ LINHSC+PNC +R+I+++G++ I+I+AK+DI+ +E+TY
Sbjct: 76 YLFRIDEDLVVDATKKGNLGRLINHSCDPNCTARIITISGEKKIVIYAKQDIELGDEITY 135
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +++ C CG +CRG +N
Sbjct: 136 DYHFPIEQDKIPCLCGSAKCRGYLN 160
>gi|295913201|gb|ADG57859.1| transcription factor [Lycoris longituba]
Length = 164
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 72/142 (50%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+KRL F +S IH +G+ A P + D VIEY GELVR I+D RE Y + +Y+F
Sbjct: 24 KKRLRFQRSNIHDWGLVALEPIESEDFVIEYVGELVRRQISDIRE-CQYEKMGIGSSYLF 82
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+DD V+DAT+ G IA INHSCEPNCY++VI+V G + I I+AKR I EELTY+Y+
Sbjct: 83 RLDDGYVVDATKRGGIARFINHSCEPNCYTKVITVEGQKKIFIYAKRHIHAGEELTYNYK 142
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F ++++ C CG RCRG +N
Sbjct: 143 FPLEEQKILCNCGSKRCRGSMN 164
>gi|327304525|ref|XP_003236954.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
gi|326459952|gb|EGD85405.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
Length = 1337
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1194 KKPVRFARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1252
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1253 RIDENTVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1312
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1313 FEREWDSDDRIPCLCGSTGCKGFLN 1337
>gi|242016925|ref|XP_002428945.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
gi|212513774|gb|EEB16207.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
Length = 6073
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 125/238 (52%), Gaps = 33/238 (13%)
Query: 605 NPSGCARSEP--YNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTL-P- 660
NP+GCARSEP N + R + A S LF + P GLS L P
Sbjct: 5863 NPTGCARSEPRQKNLLPYK-RPHTQRTATTSRPNLFASSLP---------GLSSEILCPY 5912
Query: 661 SIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGD 720
S + + SK S +YK MK +R + +S I G G++A
Sbjct: 5913 SKQFVHSKSS---------------QYKKMKLEWRNNVYLARSKIQGLGLYAARDLERHT 5957
Query: 721 MVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEP 780
MVIEY GE++R +++ RE Y + G YMFR+D+ERVIDAT G +A INHSC P
Sbjct: 5958 MVIEYIGEIIRHELSESREKQ-YEAR-NRGIYMFRLDEERVIDATICGGLARYINHSCNP 6015
Query: 781 NCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
NC + ++ V+ D IIIFAKR I + EEL YDY+F D+Q + C CG P CR +N
Sbjct: 6016 NCVAEIVEVDRDLRIIIFAKRRITRGEELAYDYKFDIEDDQHKIPCLCGAPNCRKWMN 6073
>gi|348516272|ref|XP_003445663.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Oreochromis niloticus]
Length = 595
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 85/235 (36%), Positives = 130/235 (55%), Gaps = 11/235 (4%)
Query: 603 PSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
P + +G ARSE + R+ + + RL E G C + S+
Sbjct: 371 PHHRTGSARSEGFYKISRKDK-----MRYLHNTRLAAELPSTSAQGTC---IPAQQPTSL 422
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFR-KRLAFGKSGIHGFGIFAKHPHRAGDM 721
R GS F R +F +D K + FR KR+ F +S IH +G+FA P A +M
Sbjct: 423 RA-GSDFRCEQRRLLSSFSCDSDLVKFNQLKFRRKRIRFSRSYIHEWGLFAMEPIAADEM 481
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEY G+++R IAD RE +G+ +Y+FR+D + +IDAT+ G++A INHSC PN
Sbjct: 482 VIEYVGQIIRQVIADMREQRYEEEGIGS-SYLFRVDQDTIIDATKCGNLARFINHSCNPN 540
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
CY+++I+V + I+I++++ I EE+TYDY+F + ++ C CG CRG +N
Sbjct: 541 CYAKIITVESQKKIVIYSRQPININEEITYDYKFPIEETKIPCLCGADGCRGSLN 595
>gi|195059316|ref|XP_001995609.1| GH17848 [Drosophila grimshawi]
gi|193896395|gb|EDV95261.1| GH17848 [Drosophila grimshawi]
Length = 2535
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2326 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCTTQRMANSAAIA 2367
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2368 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2421
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2422 IEYIGEVIRTEVSEIREKQ-YES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2479
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 2480 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2535
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 34/145 (23%), Positives = 60/145 (41%), Gaps = 27/145 (18%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G++ L++G+++ E F +I+P GY R + P Y V E
Sbjct: 2170 VGNMTFLNVGQLLPHQLEAFHTSHYIYPIGYKVSRYYWCTRRPNRRCRY---VCSIAEVG 2226
Query: 289 IRPLFRVTLDNGE------QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE- 341
RP FR+ + G ++ STPS W +I I+ + K+ +
Sbjct: 2227 CRPEFRIQVHEGHDKEPDREYRDSTPSGVWQQILQPIQRLRQ-----------VHKLLQL 2275
Query: 342 -----SGSDMFGFSNPEVMKLILGL 361
SG D+FG + P +++++ L
Sbjct: 2276 FPQHISGEDLFGLTEPAIVRILESL 2300
>gi|156842174|ref|XP_001644456.1| hypothetical protein Kpol_520p19 [Vanderwaltozyma polyspora DSM
70294]
gi|156115099|gb|EDO16598.1| hypothetical protein Kpol_520p19 [Vanderwaltozyma polyspora DSM
70294]
Length = 725
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/178 (41%), Positives = 98/178 (55%), Gaps = 5/178 (2%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC+ E +N + CD C + VH CYG + G WLC LC + C LC
Sbjct: 237 CAVCNETESTNSNAIVFCDGCDVAVHQECYGIVFIPEG-QWLCRLCLV-SKNRKVNCALC 294
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GA K T G WAH+ CAIWIPE ++ MEPI+G+ + K RWKL C IC
Sbjct: 295 PSHTGAFKQTDAGAWAHVICAIWIPELYFANLNYMEPIEGIQNIHKSRWKLNCYICDQKV 354
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
G+CIQCSN C AYH CA+ A LC+ ++ ++ + ++ D I+ S+C KH
Sbjct: 355 GSCIQCSNKNCFTAYHVTCAKRASLCINF-NKTPVSTIVQNQMSSDNMIQ--SYCDKH 409
>gi|432940862|ref|XP_004082744.1| PREDICTED: protein Jade-1-like [Oryzias latipes]
Length = 886
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 77/179 (43%), Positives = 102/179 (56%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + NN + CDKC + VH CYG ++ V WLC +C G P C LC
Sbjct: 197 CDVCQSPDGEDNNEMVFCDKCNICVHQACYG-IQKVPQGSWLCRICALGIL---PKCQLC 252
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + + ++MEPI ++ + +RW L+C +C
Sbjct: 253 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGNPEKMEPITNVSHIPSNRWALICCLCKEK 312
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS CR A+H C GL LE +N + L EDDE ++ SFC KH
Sbjct: 313 AGACIQCSAKNCRTAFHVTC----GLQANLE----MNTI-LTEDDE---VKFKSFCPKH 359
>gi|154278862|ref|XP_001540244.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
gi|150412187|gb|EDN07574.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
Length = 1266
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1123 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1181
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1182 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1241
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1242 FEREWDSDDRIPCLCGSTGCKGFLN 1266
>gi|389746109|gb|EIM87289.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 191
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 103/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F +S IH +G++A G+MVIEY GE++R IAD+RE +G+ +
Sbjct: 48 QTRKKHLRFARSPIHDWGLYAMERIARGEMVIEYVGEVIRAQIADKREKAYERQGIGS-S 106
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID++ V+DAT+ G++ LINHSC+PNC +++I++ G++ I+I+AK+DI+ +E+TY
Sbjct: 107 YLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITILGEKKIVIYAKQDIELGDEITY 166
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +++ C CG RCRG +N
Sbjct: 167 DYHFPIEQDKIPCLCGSARCRGYLN 191
>gi|444318419|ref|XP_004179867.1| hypothetical protein TBLA_0C05500 [Tetrapisispora blattae CBS 6284]
gi|387512908|emb|CCH60348.1| hypothetical protein TBLA_0C05500 [Tetrapisispora blattae CBS 6284]
Length = 1216
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +MVIEY GE +R +++ RE +G+ +Y+F
Sbjct: 1073 KKPVTFARSAIHNWGLYALEPITAKEMVIEYVGERIRQPVSEMREKRYIKGGIGS-SYLF 1131
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSCEP+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 1132 RIDENTVIDATKRGGIARFINHSCEPSCTAKIIKVGGMKRIVIYALRDIGLNEELTYDYK 1191
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L CYCG P C+G +N
Sbjct: 1192 FEREIDAEERLPCYCGAPSCKGFLN 1216
>gi|182892006|gb|AAI65673.1| Phf17 protein [Danio rerio]
Length = 829
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 109/200 (54%), Gaps = 18/200 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 199 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGIF---PKCHLC 254
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + + ++MEPI ++ + +RW L+C +C
Sbjct: 255 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGNPEKMEPITNVSHIPSNRWALICCLCKEK 314
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +CRVA+H C GL ++N + L E DE ++ SFC KH
Sbjct: 315 TGACIQCSAKSCRVAFHVTCGLHCGL--------KMNTI-LTEADE---VKFKSFCPKHS 362
Query: 578 Q-PLNDRLAVDERLVQVTRR 596
N+ D+R V+V R
Sbjct: 363 GLDWNEEEGDDDRPVKVPTR 382
>gi|41054211|ref|NP_956099.1| protein Jade-1 [Danio rerio]
gi|82177005|sp|Q803A0.1|JADE1_DANRE RecName: Full=Protein Jade-1; AltName: Full=PHD finger protein 17
gi|28422365|gb|AAH46874.1| PHD finger protein 17 [Danio rerio]
gi|40389475|tpe|CAE30491.1| TPA: putative Jade1 protein [Danio rerio]
Length = 829
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 109/200 (54%), Gaps = 18/200 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 199 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGIF---PKCHLC 254
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + + ++MEPI ++ + +RW L+C +C
Sbjct: 255 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGNPEKMEPITNVSHIPSNRWALICCLCKEK 314
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +CRVA+H C GL ++N + L E DE ++ SFC KH
Sbjct: 315 TGACIQCSAKSCRVAFHVTCGLHCGL--------KMNTI-LTEADE---VKFKSFCPKHS 362
Query: 578 Q-PLNDRLAVDERLVQVTRR 596
N+ D+R V+V R
Sbjct: 363 GLDWNEEEGDDDRPVKVPTR 382
>gi|358337310|dbj|GAA34342.2| bromodomain and PHD finger-containing protein 1 [Clonorchis
sinensis]
Length = 1388
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/182 (40%), Positives = 101/182 (55%), Gaps = 12/182 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
CS+C N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 318 CSICQDGSCENTNVILFCDVCNLAVHQECYGVPYVPEGP-WLCRKCL-HSPSEPVSCVLC 375
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T+D RWAH+ C +W+PE ++ +EP++G++R++ RW+L C IC +
Sbjct: 376 PNTGGAFKKTSDDRWAHVICGLWVPEVMFANLTFLEPLEGIDRIAPARWRLQCFICKQRN 435
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQC T+C A+H CA+ AGL +++ED D D ++ IR +FC H
Sbjct: 436 AGACIQCHKTSCYRAFHVTCAQQAGLYMKIEDTD---------DPKEAGIRKNAFCDLHC 486
Query: 578 QP 579
P
Sbjct: 487 PP 488
>gi|326472906|gb|EGD96915.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 1330
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1187 KKPVRFARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1245
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1246 RIDENTVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1305
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1306 FEREWDSDDRIPCLCGSTGCKGFLN 1330
>gi|261201264|ref|XP_002627032.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239592091|gb|EEQ74672.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239611745|gb|EEQ88732.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
Length = 1259
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1116 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1174
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1175 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1234
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1235 FEREWDSDDRIPCLCGSTGCKGFLN 1259
>gi|325089235|gb|EGC42545.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 1267
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1124 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1182
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1183 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1242
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1243 FEREWDSDDRIPCLCGSTGCKGFLN 1267
>gi|348520762|ref|XP_003447896.1| PREDICTED: protein Jade-1-like [Oreochromis niloticus]
Length = 892
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 101/179 (56%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + NN + CDKC + VH CYG ++ V WLC +C G P C LC
Sbjct: 207 CDVCQSPDGEDNNEMVFCDKCNICVHQACYG-IQKVPKGSWLCRICALGIL---PKCQLC 262
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + + ++MEPI ++ + +RW L+C +C
Sbjct: 263 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGNPEKMEPITNVSHIPSNRWALICCLCKEK 322
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS CR A+H C GL LE +N + D+D+ ++ S+C KH
Sbjct: 323 TGACIQCSAKNCRTAFHVTC----GLHANLE----MNTILTDDDE----VKFKSYCPKH 369
>gi|225554361|gb|EEH02660.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
Length = 1267
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1124 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1182
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1183 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1242
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1243 FEREWDSDDRIPCLCGSTGCKGFLN 1267
>gi|326477398|gb|EGE01408.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS 127.97]
Length = 1331
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1188 KKPVRFARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1246
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1247 RIDENTVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1306
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1307 FEREWDSDDRIPCLCGSTGCKGFLN 1331
>gi|328858772|gb|EGG07883.1| hypothetical protein MELLADRAFT_74594 [Melampsora larici-populina
98AG31]
Length = 191
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 118/186 (63%), Gaps = 12/186 (6%)
Query: 661 SIRVIGSKFSFSLHRDAP----NFLSMAD------KYKHMKETFRKRLAFGKSGIHGFGI 710
S RV +F ++ ++ + ++ AD K+ ++ T +K+L F +S IH +G+
Sbjct: 8 STRVNSRRFVINMEQNKKATGVSLVNTADAVADVLKFNQLR-TRKKQLKFARSPIHDWGL 66
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSI 770
+A G+MVIEY GE++R ++ADRRE Y + +Y+FR+DD+ V+DAT+ G++
Sbjct: 67 YAMEMIPVGEMVIEYVGEVIRQAVADRREK-AYERMGIGSSYLFRVDDDLVVDATKKGNL 125
Query: 771 AHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPR 830
LINH C PNC +++I++NG++ I+I+AK I+ +E+TYDY F D ++ C CG +
Sbjct: 126 GRLINHCCAPNCTAKIITINGEKKIVIYAKATIELGDEVTYDYHFPKEDVKIPCLCGSSK 185
Query: 831 CRGVVN 836
C+G +N
Sbjct: 186 CKGTLN 191
>gi|388580816|gb|EIM21128.1| histone methyltransferase, partial [Wallemia sebi CBS 633.66]
Length = 144
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 67/144 (46%), Positives = 99/144 (68%), Gaps = 1/144 (0%)
Query: 693 TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTY 752
T +K L F KS IH +G+++ G+MVIEY GE+VR +ADRRE +G+ +Y
Sbjct: 2 TRKKELRFAKSSIHSWGLYSCQVIPKGEMVIEYVGEVVRQQVADRREKAYEKQGIGS-SY 60
Query: 753 MFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYD 812
+F+IDD+ ++DAT GS+A LINH C+PNC +++I++ G++ III+AK +I +E+TYD
Sbjct: 61 LFKIDDDNIVDATMKGSVARLINHCCQPNCTAKIITILGEKKIIIYAKTEISPGDEITYD 120
Query: 813 YRFFSIDEQLACYCGFPRCRGVVN 836
Y F DE++ C CG CRG +N
Sbjct: 121 YHFPIEDEKIPCLCGVEGCRGSLN 144
>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
Length = 5215
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 88/235 (37%), Positives = 122/235 (51%), Gaps = 23/235 (9%)
Query: 605 NPSGCARSEPYNYFGRR-GRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIR 663
NPSG AR+EP + R + +S++ LF P L N L P +
Sbjct: 5001 NPSGAARTEPRSRSQLHWKRPHTQRTGGSSVRPLFGPT-PALTNN---NTLGDGACPYTK 5056
Query: 664 VIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 723
+H S + +YK MK+ +R + +S I G G++A MVI
Sbjct: 5057 QF-------VH-------SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVI 5102
Query: 724 EYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 783
EY GE++R +A+ RE G YMFR+D+ERV+DAT G +A INHSC PNC
Sbjct: 5103 EYIGEIIRTELAETREKKY--EAKNRGIYMFRLDEERVVDATLCGGLARYINHSCNPNCV 5160
Query: 784 SRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
+ + V+ D IIIFAKR I++ EEL YDY+F D+Q ++C CG P CR +N
Sbjct: 5161 AETVEVDRDYRIIIFAKRRIQRGEELAYDYKFDIEDDQHKISCMCGAPNCRKWMN 5215
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 65/143 (45%), Gaps = 25/143 (17%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G L L++G+++ + F + +I+P GY +R + S+ R N + E+
Sbjct: 4847 VGSLIFLNVGQLLPHQLQNFHNQNYIYPIGYKIIRFYWSM---RHLNKRCKYICSIHENG 4903
Query: 289 IRPLFRV----TLDNGEQFTGSTPSTCWSKICMKI----REGQNNTSDDFSAEGAAEKIS 340
RP F+V TL+N + STP W +I +I RE Q +
Sbjct: 4904 GRPEFKVVVQETLENDVELKDSTPKAVWQRILEQIAAMRRENQ-----------CVQLFP 4952
Query: 341 E--SGSDMFGFSNPEVMKLILGL 361
+ SG D+FG + P V++++ L
Sbjct: 4953 QFVSGEDLFGLTEPAVVRVLESL 4975
>gi|389739145|gb|EIM80339.1| hypothetical protein STEHIDRAFT_142791 [Stereum hirsutum FP-91666
SS1]
Length = 1616
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/178 (41%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 133 CAICDDPEGENTNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCILC 190
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T W HL CAIW+PET + + MEP+ G+ ++SK RWKL C IC V
Sbjct: 191 PNEGGAFKQTVSSEWVHLLCAIWVPETRVANEVFMEPVTGIEKISKQRWKLKCQICDVRE 250
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS T+C +A+H CAR L + ++ L L QC FC++H
Sbjct: 251 GACIQCSKTSCFLAFHATCARKDKLLMPMKASSGLEAPML------QC-----FCERH 297
>gi|170087556|ref|XP_001875001.1| bromodomain containing protein [Laccaria bicolor S238N-H82]
gi|164650201|gb|EDR14442.1| bromodomain containing protein [Laccaria bicolor S238N-H82]
Length = 1181
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/178 (41%), Positives = 95/178 (53%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E +N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 135 CAICDDSEGENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVQCILC 192
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G W HL CAIW+PET + + MEPI G+ +VSK RWKL CSIC
Sbjct: 193 PNEGGAFKQTVHGEWVHLLCAIWVPETRVANEVFMEPITGIEKVSKQRWKLKCSICDYRG 252
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC+ T+C A+H CAR LL + + + L +C++H
Sbjct: 253 GACIQCAKTSCFTAFHATCARREKF-----------LLPMKTTQGSEPVTLTCYCERH 299
>gi|255938628|ref|XP_002560084.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584705|emb|CAP74231.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1202
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1059 KKPVRFARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1117
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1118 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1177
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1178 FEREWDSDDRIPCLCGSTGCKGFLN 1202
>gi|407921620|gb|EKG14761.1| hypothetical protein MPH_08036 [Macrophomina phaseolina MS6]
Length = 1167
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 100/288 (34%), Positives = 152/288 (52%), Gaps = 27/288 (9%)
Query: 569 LLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSE--------------P 614
L ++ +KH + LN L E + + DY P NP+GCAR+E P
Sbjct: 887 LFAYKEKHVKKLNTGL---EGVAYKEVKIEDYYKP-NPTGCARTEGVKKILESEKSKYLP 942
Query: 615 YNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSIRVIGSKFSFS 672
+ ++ R+E EALA A+ + V N + R++ +
Sbjct: 943 HRIRVQKAREEREALANANPAAAAQMAKAAAIPVPKTSTTTSRSNRANNRRLVNEISTQK 1002
Query: 673 LHRDAPNFLSMADKYKHMKETFRKRLA-FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVR 731
A ++A ++ +K+ RK+L F +S IH +G++A+ A DM+IEY GE VR
Sbjct: 1003 QALAADGSDNIAIRFNQLKK--RKKLVKFDRSAIHNWGLYAQEDISANDMIIEYVGEKVR 1060
Query: 732 PSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG 791
+AD RE Y+ +Y+FRID++ V+DAT+ G IA INHSC PNC +++I V+G
Sbjct: 1061 QKVADIRE-IKYDKQGVGSSYLFRIDEDSVVDATKKGGIARFINHSCSPNCTAKIIRVDG 1119
Query: 792 DEHIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
+ I+I+A RDIK EELTYDY+F D+++ C CG C+G +N
Sbjct: 1120 TKRIVIYALRDIKTNEELTYDYKFEREIGSDDRIPCLCGSVNCKGFLN 1167
>gi|115400872|ref|XP_001216024.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
gi|114189965|gb|EAU31665.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
Length = 1230
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1087 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGS-SYLF 1145
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1146 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1205
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1206 FEREWDSDDRIPCLCGSTGCKGFLN 1230
>gi|432118432|gb|ELK38086.1| Protein Jade-3 [Myotis davidii]
Length = 1177
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/181 (44%), Positives = 98/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC E N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 708 CDVCRSPESEDGNDMVFCDKCNICVHQACYGILKIPEGS-WLCRSCVVGVPAQ---CVLC 763
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C +C V
Sbjct: 764 PKKGGAMKTTRTGTKWAHVSCALWIPEVSIGCPERMEPITKISHIPPSRWALVCYLCKVK 823
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS TC A+H CA L ++ LDEDDE +R SFC KH
Sbjct: 824 TGACIQCSVKTCTTAFHVTCAFEHNLEMK---------TILDEDDE---VRFKSFCLKHG 871
Query: 578 Q 578
Q
Sbjct: 872 Q 872
>gi|443898988|dbj|GAC76321.1| PHD finger protein BR140/LIN-49 [Pseudozyma antarctica T-34]
Length = 1202
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/151 (45%), Positives = 89/151 (58%), Gaps = 2/151 (1%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KC++C E +N + CD C + VH CYG + + WLC C +P+ C L
Sbjct: 157 KCAICDDGEAENSNAIVFCDGCNLAVHQDCYG-IPYIPEGQWLCRKCT-VSPDRAVSCLL 214
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA K TT G+WAHL CA+W+PET +++ MEPID + R+ K RWKL C +C
Sbjct: 215 CPHEGGAFKQTTTGKWAHLLCAMWVPETGVSNPVYMEPIDSVERIPKARWKLQCYLCRHR 274
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC N +C A+H CAR AGL E
Sbjct: 275 MGACIQCDNRSCFTAFHVTCARKAGLLFRTE 305
>gi|358373521|dbj|GAA90119.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 1239
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1096 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1154
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1155 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1214
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1215 FEREWDSDDRIPCLCGSTGCKGFLN 1239
>gi|121709862|ref|XP_001272547.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400697|gb|EAW11121.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 1232
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1089 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGS-SYLF 1147
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1148 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1207
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1208 FEREWDSDDRIPCLCGSTGCKGFLN 1232
>gi|391863483|gb|EIT72791.1| histone H3 (Lys4) methyltransferase complex, subunit SET1
[Aspergillus oryzae 3.042]
Length = 1223
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/154 (44%), Positives = 104/154 (67%), Gaps = 5/154 (3%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
++ +K+ +K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S
Sbjct: 1072 RFNQLKKR-KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKS 1130
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FRID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++
Sbjct: 1131 GIGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIER 1189
Query: 806 WEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
EELTYDY+F + D+++ C CG C+G +N
Sbjct: 1190 DEELTYDYKFEREWDSDDRIPCLCGSTGCKGFLN 1223
>gi|169769549|ref|XP_001819244.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
oryzae RIB40]
gi|121933328|sp|Q2UMH3.1|SET1_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|83767103|dbj|BAE57242.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1229
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/154 (44%), Positives = 104/154 (67%), Gaps = 5/154 (3%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
++ +K+ +K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S
Sbjct: 1078 RFNQLKKR-KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKS 1136
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FRID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++
Sbjct: 1137 GIGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIER 1195
Query: 806 WEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
EELTYDY+F + D+++ C CG C+G +N
Sbjct: 1196 DEELTYDYKFEREWDSDDRIPCLCGSTGCKGFLN 1229
>gi|317037780|ref|XP_001399137.2| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
niger CBS 513.88]
Length = 1239
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1096 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1154
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1155 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1214
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1215 FEREWDSDDRIPCLCGSTGCKGFLN 1239
>gi|134084734|emb|CAK43391.1| unnamed protein product [Aspergillus niger]
Length = 1079
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 936 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 994
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 995 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1054
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1055 FEREWDSDDRIPCLCGSTGCKGFLN 1079
>gi|348532887|ref|XP_003453937.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Oreochromis niloticus]
Length = 1846
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 126/231 (54%), Gaps = 11/231 (4%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE G + + + RL E P G + S R G
Sbjct: 1626 TGCARSE-----GYYKIDKKDKIKYLQSTRLQSEEPPVDTQGMS---IPAQVHASTRA-G 1676
Query: 667 SKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEY 725
S+ R +F +D K + FRK+ + F KS IH +G+FA P A +MVIEY
Sbjct: 1677 SERRSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSHIHDWGLFAMEPIAADEMVIEY 1736
Query: 726 TGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSR 785
G+ +R IAD RE +G+ +YMFR+D + +IDAT+ G+ A INHSC PNCY++
Sbjct: 1737 VGQNIRQVIADMREKRYEEEGIGS-SYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAK 1795
Query: 786 VISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
VI+V + I+I++++ I EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1796 VITVESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1846
>gi|440632035|gb|ELR01954.1| hypothetical protein GMDG_05127 [Geomyces destructans 20631-21]
Length = 1301
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 90/255 (35%), Positives = 133/255 (52%), Gaps = 22/255 (8%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P N +G AR+E P+ +R R+E +A A K E+
Sbjct: 1051 YVP--NETGSARTEGTTKILNSEKSKYLPHRIKVQRDREERQARAKRDGKEDPSEDA-QA 1107
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSG 704
+S ++ + RV +F L+ AD + + RK+ + F +S
Sbjct: 1108 TQASADKLVSKDSSRANRVNNRRFVADLNDQKKTLGGEADALRFNQLKKRKKPVKFARSA 1167
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G++A DM+IEY GE +R S+AD RE S +G+ +Y+FRID+ V+DA
Sbjct: 1168 IHNWGLYAMENIAMNDMIIEYVGEKLRQSVADLRERIYLKSGIGS-SYLFRIDENTVVDA 1226
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQ 821
T+ G IA INHSC PNC +++I V G I+I+A RDIK EELTYDY+F D++
Sbjct: 1227 TKRGGIARFINHSCMPNCTAKIIKVEGTRRIVIYALRDIKLNEELTYDYKFEREIGSDDR 1286
Query: 822 LACYCGFPRCRGVVN 836
+ C CG C+G +N
Sbjct: 1287 IPCLCGTVACKGFLN 1301
>gi|149465920|ref|XP_001518528.1| PREDICTED: bromodomain and PHD finger-containing protein 3, partial
[Ornithorhynchus anatinus]
Length = 1059
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 100/184 (54%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 222 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 279
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EP++G++ + RWKL C IC
Sbjct: 280 PNEGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPVEGIDNIPPARWKLTCYICKQKG 339
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R ++C+
Sbjct: 340 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTTF-------TVRKTAYCEA 392
Query: 576 HKQP 579
H P
Sbjct: 393 HSPP 396
>gi|154311696|ref|XP_001555177.1| hypothetical protein BC1G_06307 [Botryotinia fuckeliana B05.10]
Length = 451
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 133/249 (53%), Gaps = 22/249 (8%)
Query: 607 SGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQN 652
+GCAR+E P+ +R R+E EA A + K + V +
Sbjct: 206 TGCARTEGTKKILNSEKSLYLPHRIKVQRAREEREAQAKRAGKDV-VAEAAEAAKIAAEK 264
Query: 653 GLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKSGIHGFGI 710
L+ + + RV ++ L+ D ++ +K+ +K + F +S IH +G+
Sbjct: 265 LLARTSGRATRVNNRRYVADLNDQKKTLGGDTDVLRFNQLKKR-KKPVKFARSAIHNWGL 323
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSI 770
+A DM+IEY GE VR +AD RE+ S +G+ +Y+FRID+ VIDAT+ G I
Sbjct: 324 YAMENIAMNDMIIEYVGEKVRQQVADLRENRYLKSGIGS-SYLFRIDENTVIDATKKGGI 382
Query: 771 AHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCG 827
A INHSC PNC +++I+V + I+I+A RDI Q EELTYDY+F +++ C CG
Sbjct: 383 ARFINHSCMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCG 442
Query: 828 FPRCRGVVN 836
P C+G +N
Sbjct: 443 TPACKGFLN 451
>gi|350630881|gb|EHA19253.1| hypothetical protein ASPNIDRAFT_56859 [Aspergillus niger ATCC 1015]
Length = 1101
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 958 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1016
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1017 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1076
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1077 FEREWDSDDRIPCLCGSTGCKGFLN 1101
>gi|159490078|ref|XP_001703016.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158270923|gb|EDO96754.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 2983
Score = 144 bits (362), Expect = 3e-31, Method: Composition-based stats.
Identities = 75/169 (44%), Positives = 96/169 (56%), Gaps = 10/169 (5%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLC 444
YRPVR W D+CSVC + + ++ + CD C + VH CYG E P +WLC C
Sbjct: 784 YRPVRARWAG-DRCSVCDSEIDADSDRLVSCDCCGITVHQICYGITEAPGLDEMWLCRAC 842
Query: 445 R----PGAPEPPPPCCLCPVVGGAMKPTT--DGRWAHLACAIWIPETCLTDVKRMEPIDG 498
+PP CCLCPV GGA+KPTT G WAH+ C W+PE D EP+D
Sbjct: 843 ELRQPGPPGQPPAQCCLCPVAGGALKPTTLGPGTWAHMVCLNWLPELTCGDPITGEPVDN 902
Query: 499 LNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVEL 547
+ + ++RW+L C IC GA IQC+ C AYHPLC R AGL +E+
Sbjct: 903 IPGIQRERWELSCCICKQRMGAKIQCA--LCYQAYHPLCGRMAGLHMEV 949
Score = 110 bits (274), Expect = 4e-21, Method: Composition-based stats.
Identities = 59/153 (38%), Positives = 87/153 (56%), Gaps = 9/153 (5%)
Query: 693 TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN-----SLV 747
T R R+ FGKSGIHG+G+FA+ +V E+ GE+VRP +A+ RE +
Sbjct: 2831 TERARITFGKSGIHGWGVFARVDIPQDAVVTEFRGEVVRPVLAEMRERRYRAAGRAWACA 2890
Query: 748 GAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHI-IIFAKR-DIKQ 805
G Y+F + E V+D+T G NHSC P+ Y++V+ G + + F R DI+
Sbjct: 2891 GKDCYLFHVSRELVLDSTHLGHYGRFANHSCSPSLYTKVLEFEGGRRVRLAFCARVDIRA 2950
Query: 806 WEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
+ELT+DYRF + ++AC CG P C+G +N
Sbjct: 2951 GQELTFDYRFKEEEGSAKVACRCGAPNCKGTLN 2983
>gi|224061805|ref|XP_002300607.1| SET domain protein [Populus trichocarpa]
gi|222842333|gb|EEE79880.1| SET domain protein [Populus trichocarpa]
Length = 1390
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/153 (47%), Positives = 99/153 (64%), Gaps = 5/153 (3%)
Query: 679 NFLSMADKYKHMKET----FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ A+ +K T +KRL F +S IH +G+ A A D VIEY GEL+RP I
Sbjct: 1139 NLLAAAEGVDLLKATQLKARKKRLCFQRSKIHDWGLVALESIEAEDFVIEYVGELIRPQI 1198
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
+D RE +Y + +Y+FR+DD V+DAT+ G IA INHSCEPNCY++VISV G +
Sbjct: 1199 SDIRER-LYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKVISVEGQKK 1257
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCG 827
I I+AKR I EE+TY+Y+F D+++ C CG
Sbjct: 1258 IFIYAKRYIAAGEEITYNYKFPLEDKKIPCNCG 1290
>gi|6850865|emb|CAB71104.1| putative protein [Arabidopsis thaliana]
Length = 902
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 127/492 (25%), Positives = 210/492 (42%), Gaps = 103/492 (20%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PVR W ++C+VC E+++ N + C++C++ VH CYG + + W+C
Sbjct: 477 LEEKYEPVRAKWT-TERCAVCRWVEDWEENKMIICNRCQVAVHQECYGVSKSQDLTSWVC 535
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
C P+ CCLCPV GGA+KP+ +G W H+ CA + PE + + MEP GL
Sbjct: 536 RACE--TPDIERDCCLCPVKGGALKPSDVEGLWVHVTCAWFRPEVGFLNHENMEPAVGLF 593
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
++ + + ++G+C+ C C +H +CA AG +E ++LL + +
Sbjct: 594 KIPANSFLK-------THGSCVHCCK--CATHFHAMCASRAGYNME-----EVSLLFISQ 639
Query: 561 DDEDQCIRLLS----FCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYN 616
D D + + + F ++ A RLV + +P ++E
Sbjct: 640 PDPDSVVVVHTPSGVFGSRNLLQNQYGRAKGSRLVLTKKM---KLPGFQTQ--TQAEQSR 694
Query: 617 YFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSG-NTLPSIRV----------- 664
F + ++ K++ +E + + G + LS L S +
Sbjct: 695 VFDSLSAARCRIYSRSNTKKIDLEAISHRLKGPSHHSLSAIENLNSFKASFSFRAPFMSV 754
Query: 665 ---IGSKFSFSL-----------HRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGI 710
+G+ FS L H++A +F S ++ KH+++ F +S +
Sbjct: 755 FCFLGATFSEYLRKILISIYLVTHQEA-DFTSFRERLKHLQKIF-------ESALVNLAY 806
Query: 711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSI 770
D +IEY G VR S+AD RE + G Y
Sbjct: 807 M--------DGIIEYRGVKVRRSVADLREANYRSQ--GKDCY------------------ 838
Query: 771 AHLINHSCEPNCYSRVISVNG--DEHIIIFAKRDIKQWEELTYDYRFFSIDE----QLAC 824
C PNCY+R++S+ D I++ AK ++ EELTYDY F +DE ++ C
Sbjct: 839 -------CMPNCYARIVSMGDGEDNRIVLIAKTNVAAGEELTYDY-LFEVDESEEIKVPC 890
Query: 825 YCGFPRCRGVVN 836
C P CR +N
Sbjct: 891 LCKAPNCRKFMN 902
>gi|426196466|gb|EKV46394.1| hypothetical protein AGABI2DRAFT_185833 [Agaricus bisporus var.
bisporus H97]
Length = 1243
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 89/150 (59%), Gaps = 2/150 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E +N + CD C + VH CYG G WLC C +PE P C LC
Sbjct: 134 CAICDDSEGENSNAIVFCDGCNLAVHQDCYGVPYIPEG-QWLCRKCT-VSPENPVQCILC 191
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K TT+G W HL CAIW+PET + + MEP+ G ++SK RW+L CSIC +
Sbjct: 192 PNEGGAFKQTTNGDWVHLLCAIWVPETRVVNEVFMEPVSGAEKISKQRWRLRCSICDIRE 251
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC+ T+C +A+H CAR L ++
Sbjct: 252 GACIQCAKTSCFLAFHATCARKEKLLSSMK 281
>gi|321472797|gb|EFX83766.1| hypothetical protein DAPPUDRAFT_301653 [Daphnia pulex]
Length = 303
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 70/152 (46%), Positives = 104/152 (68%), Gaps = 4/152 (2%)
Query: 686 KYKHMKETFR-KRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
K+ +K FR +L FGKS IH +G+FA A +MVIEY G++VRP +AD RE Y
Sbjct: 155 KFNQLK--FRGAKLKFGKSSIHDWGLFAVESIAADEMVIEYVGQVVRPVLADLRET-QYE 211
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
++ +Y+FR+D E +IDAT+ G++A INHSC PNCY+RVI++ + I+I++K+ I
Sbjct: 212 AVGIGSSYLFRVDLEYIIDATKCGNLARFINHSCNPNCYARVITIESQKKIVIYSKQPIG 271
Query: 805 QWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F ++++ C CG +CRG +N
Sbjct: 272 VGEEITYDYKFPIEEDKIICLCGSSQCRGTLN 303
>gi|123703948|ref|NP_001038599.2| histone-lysine N-methyltransferase SETD1B-A [Danio rerio]
gi|166977691|sp|Q1LY77.2|SE1BA_DANRE RecName: Full=Histone-lysine N-methyltransferase SETD1B-A; AltName:
Full=SET domain-containing protein 1B-A
gi|123293815|emb|CAK10781.2| novel protein [Danio rerio]
Length = 1844
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 129/235 (54%), Gaps = 19/235 (8%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE Y ++ + +K L N L G ++P+ +
Sbjct: 1624 TGCARSEGYYKIDKKDK----------MKYL---NSSRLQSEEPDVDTQGKSIPAQPQVS 1670
Query: 667 SKFSFSLHRDAPNFLSM----ADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDM 721
++ + LS +D K + FRK+ + F +S IH +G+FA P A +M
Sbjct: 1671 TRAGSERRSEQRRLLSSFSCDSDLLKFNQLKFRKKKIRFCRSHIHDWGLFAMEPIAADEM 1730
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEY G+ +R IAD RE + +G+ +YMFR+D + +IDAT+ G+ A INHSC PN
Sbjct: 1731 VIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTIIDATKCGNFARFINHSCNPN 1789
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
CY++VI+V + I+I++++ I EE+TYDY+F DE++ C CG CRG +N
Sbjct: 1790 CYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1844
>gi|409081233|gb|EKM81592.1| hypothetical protein AGABI1DRAFT_118703 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1241
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 89/150 (59%), Gaps = 2/150 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E +N + CD C + VH CYG G WLC C +PE P C LC
Sbjct: 134 CAICDDSEGENSNAIVFCDGCNLAVHQDCYGVPYIPEG-QWLCRKCT-VSPENPVQCILC 191
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K TT+G W HL CAIW+PET + + MEP+ G ++SK RW+L CSIC +
Sbjct: 192 PNEGGAFKQTTNGDWVHLLCAIWVPETRVVNEVFMEPVSGAEKISKQRWRLRCSICDIRE 251
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC+ T+C +A+H CAR L ++
Sbjct: 252 GACIQCAKTSCFLAFHATCARKEKLLSSMK 281
>gi|402593200|gb|EJW87127.1| SET domain-containing protein, partial [Wuchereria bancrofti]
Length = 602
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/234 (34%), Positives = 134/234 (57%), Gaps = 21/234 (8%)
Query: 607 SGCARSEPY---NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIR 663
SGCAR+E Y +Y +RG + + E YLV + R
Sbjct: 386 SGCARTEGYYKLSYKEKRGVLRRPDVFLTEINERDDEKARYLV----------QSTREAR 435
Query: 664 VIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMV 722
+ + S+ + +D +K + FRK+L F +S IHG+G++A +M+
Sbjct: 436 SMNRRLLTSMGDTS------SDIFKVNQLKFRKKLIKFARSRIHGWGLYALEVIAPDEMI 489
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
+EY G+ +RP++AD RE +G+ +Y+FRID + VIDAT+ G++A INHSC+PNC
Sbjct: 490 VEYIGQKIRPTVADEREKRYERRGMGS-SYLFRIDSDNVIDATQMGNLARFINHSCQPNC 548
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
Y++++ V+G++ I+I++K I + +E+TYDY+F ++++ C CG P CRG +N
Sbjct: 549 YAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 602
>gi|432109745|gb|ELK33804.1| Bromodomain and PHD finger-containing protein 3, partial [Myotis
davidii]
Length = 1185
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 198 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 255
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 256 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 315
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R ++C+
Sbjct: 316 LGAAIQCQKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTTF-------TVRKTAYCEA 368
Query: 576 HKQP 579
H P
Sbjct: 369 HSPP 372
>gi|384485229|gb|EIE77409.1| hypothetical protein RO3G_02113 [Rhizopus delemar RA 99-880]
Length = 1187
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 97/178 (54%), Gaps = 12/178 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC E +N + CD C + VH CYG + + WLC C+ AP P C C
Sbjct: 212 CAVCDDTEVENSNAIVFCDGCNVAVHQDCYG-IPYIPEGQWLCKKCQ-IAPNEPVSCIFC 269
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GA K TTD WAHL CAIWIPE L + MEPID +++V K RW+L C IC
Sbjct: 270 PNKDGAFKQTTDDLWAHLLCAIWIPEVRLKNTVYMEPIDYVDKVPKGRWRLTCCICKKRQ 329
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC N C A+H CA+AAGL ++++ + N I L ++C KH
Sbjct: 330 GACIQCDNKHCFSAFHVTCAKAAGLSMKMKLQTTQN----------GGIILNAYCDKH 377
>gi|340514680|gb|EGR44940.1| predicted protein [Trichoderma reesei QM6a]
Length = 1236
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/256 (35%), Positives = 137/256 (53%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NPSGCAR+E P++ ++ R+E EA + + + +
Sbjct: 986 YVP--NPSGCARTEGAKKILNSEKSKYLPHHIKVQKAREEREANSKKTGRDATIAANEAA 1043
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSG 704
+ GN+ + R ++ L+ +D +K + RK+ + F +S
Sbjct: 1044 KLAAEKLIAKGNSR-ANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSA 1102
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G++A DM+IEY GE VR IA+ RE+ S +G+ +Y+FRIDD VIDA
Sbjct: 1103 IHNWGLYAMENINKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGS-SYLFRIDDNTVIDA 1161
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFF----SIDE 820
T+ G IA INHSC PNC +++I V G + I+I+A RDI EELTYDY+F S+D
Sbjct: 1162 TKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLD- 1220
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1221 RIPCLCGTAACKGFLN 1236
>gi|242220128|ref|XP_002475834.1| predicted protein [Postia placenta Mad-698-R]
gi|220724937|gb|EED78949.1| predicted protein [Postia placenta Mad-698-R]
Length = 1481
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 72/178 (40%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 165 CAICDDSEGENTNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCILC 222
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G W HL CAIW+PET + + MEPI G++++SK RW+L CSIC V
Sbjct: 223 PNEGGAFKQTAHGDWVHLLCAIWVPETRVANDVFMEPITGIDKISKQRWRLKCSICDVRE 282
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GAC+QC+ +C +A+H CAR L L+ + + L FC+KH
Sbjct: 283 GACVQCTKASCFLAFHATCARKEKL-----------LMPMKATQGSEAPTLACFCEKH 329
>gi|336472713|gb|EGO60873.1| hypothetical protein NEUTE1DRAFT_144212 [Neurospora tetrasperma FGSC
2508]
Length = 1282
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 134/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E E A
Sbjct: 1032 YVP--NPTGCARTEGVKKILNSEKSKYLPHHIKVKKAREEREKNAKNGNTNSVAAAAEAA 1089
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKS 703
GN+ + RV ++ ++ NF +D ++ +K+ +K + F +S
Sbjct: 1090 RLAADSLVAKGNSR-ANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKR-KKPVKFARS 1147
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A DM+IEY GE VR IA+ RE S +G+ +Y+FRIDD VID
Sbjct: 1148 AIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGS-SYLFRIDDNTVID 1206
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F +
Sbjct: 1207 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTD 1266
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1267 RIPCLCGTAACKGFLN 1282
>gi|242786320|ref|XP_002480782.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218720929|gb|EED20348.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 1155
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 88/255 (34%), Positives = 140/255 (54%), Gaps = 22/255 (8%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
++P N +G AR+E P+ ++ R+E EALA + + + V++
Sbjct: 905 FVP--NSTGSARTEGKKRIFEAEKSKYLPHRIKVQKAREEREALAQSDPQSVPVDSGKAG 962
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSM-ADKYKHMKETFRKRLAFGKSG 704
N S +T R + + + P A ++ +K+ +K + F +S
Sbjct: 963 PTKPVNNSTSRSTRVQNRRLAADINAQKQVLPPQGGDGDALRFNQLKKR-KKPVRFARSA 1021
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+ VIDA
Sbjct: 1022 IHNWGLYAEENIAANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENAVIDA 1080
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQ 821
T+ G IA INHSC PNC +++I V+G + I+I+A RDI + EELTYDY+F + +++
Sbjct: 1081 TKRGGIARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDSEDR 1140
Query: 822 LACYCGFPRCRGVVN 836
+ C CG C+G +N
Sbjct: 1141 IPCLCGSAGCKGFLN 1155
>gi|224003381|ref|XP_002291362.1| set domain-containing protein [Thalassiosira pseudonana CCMP1335]
gi|220973138|gb|EED91469.1| set domain-containing protein, partial [Thalassiosira pseudonana
CCMP1335]
Length = 189
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/180 (42%), Positives = 111/180 (61%), Gaps = 8/180 (4%)
Query: 664 VIGSKFSFSLHRDAPNFLSMADKYKHMKETFRK----RLAFGKSGIHGFGIFAKHPHRAG 719
V G S R + S+ D K + + RL F KS IHG+G+FA+ AG
Sbjct: 5 VTGRAVRASQRRMLKSLASLGDASKTVDRLVSRDREERLRFDKSLIHGWGVFAEEAINAG 64
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
DM+IEY GEL+ ++AD+RE + +G+ YMFRID+ V DAT+ G++A IN SC
Sbjct: 65 DMIIEYRGELIGNAVADKRELEYERAKIGS-DYMFRIDEFWVCDATKLGNVARFINASCC 123
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
PNCY+++I+ N + I+IFAK++I++ EEL YDY+F ++++ CYCG CRG +N
Sbjct: 124 PNCYTKIITANETKRIVIFAKKNIQRGEELCYDYKFQMEHEKEKRIPCYCGSSECRGFMN 183
>gi|410899066|ref|XP_003963018.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Takifugu rubripes]
Length = 1207
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 99/179 (55%), Gaps = 8/179 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE +N+ L CD C + VH CYG G WLC C +P+ P C LC
Sbjct: 223 CCVCLDDECLNSNVILFCDSCNLAVHQECYGVPYIPEG-QWLCRCCL-QSPQKPVDCVLC 280
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EPI+G+ + RWKL C +C
Sbjct: 281 PNHGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPIEGVGNIPTARWKLTCYLCKQKG 340
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQC C A+H CA+ AGL +++E +N+ + ++ +FC+ H
Sbjct: 341 HGASIQCQKANCYTAFHVTCAQRAGLFMKIEPVREINV-----NGTTFSVKKTAFCEAH 394
>gi|395534025|ref|XP_003769049.1| PREDICTED: bromodomain and PHD finger-containing protein 3
[Sarcophilus harrisii]
Length = 1184
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCMDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTTF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|118404602|ref|NP_001072649.1| histone-lysine N-methyltransferase SETD1B [Xenopus (Silurana)
tropicalis]
gi|123884540|sp|Q08D57.1|SET1B_XENTR RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|115312893|gb|AAI23933.1| hypothetical protein MGC145850 [Xenopus (Silurana) tropicalis]
Length = 1956
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 127/235 (54%), Gaps = 17/235 (7%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE Y ++ + L R E P G + S R G
Sbjct: 1734 TGCARSEGYYKIDKKDK-----LKYLINNRSLTEELPIDTQG---KSIPAQPQASTRA-G 1784
Query: 667 SKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDM 721
S+ R +F D K+ +K FRK+ L F KS IH +G+FA P A +M
Sbjct: 1785 SERRSEQRRLLSSFTGSCDSDLLKFNQLK--FRKKKLRFCKSHIHDWGLFAMEPIIADEM 1842
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEY G+ +R IAD RE + +G+ +YMFR+D + +IDAT+ G+ A INHSC PN
Sbjct: 1843 VIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTIIDATKCGNFARFINHSCNPN 1901
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
CY++VI+V + I+I++K+ I EE+TYDY+F D ++ C CG CRG +N
Sbjct: 1902 CYAKVITVESQKKIVIYSKQYINVNEEITYDYKFPIEDVKIPCLCGAENCRGTLN 1956
>gi|327283585|ref|XP_003226521.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Anolis carolinensis]
Length = 1159
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCMDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSHPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G+ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIENIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R ++C+
Sbjct: 333 MGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTTF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|126309915|ref|XP_001378780.1| PREDICTED: bromodomain and PHD finger-containing protein 3
[Monodelphis domestica]
Length = 1184
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCMDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQQAGLFMKIEPMRETSLNGTTF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|350294046|gb|EGZ75131.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Neurospora
tetrasperma FGSC 2509]
Length = 1313
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 134/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E E A
Sbjct: 1063 YVP--NPTGCARTEGVKKILNSEKSKYLPHHIKVKKAREEREKNAKNGNTNSVAAAAEAA 1120
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKS 703
GN+ + RV ++ ++ NF +D ++ +K+ +K + F +S
Sbjct: 1121 RLAADSLVAKGNSR-ANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKR-KKPVKFARS 1178
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A DM+IEY GE VR IA+ RE S +G+ +Y+FRIDD VID
Sbjct: 1179 AIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGS-SYLFRIDDNTVID 1237
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F +
Sbjct: 1238 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTD 1297
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1298 RIPCLCGTAACKGFLN 1313
>gi|74697791|sp|Q8X0S9.1|SET1_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|18376303|emb|CAD21415.1| related to regulatory protein SET1 [Neurospora crassa]
Length = 1313
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 134/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E E A
Sbjct: 1063 YVP--NPTGCARTEGVKKILNSEKSKYLPHHIKVKKAREEREKNAKNGNTNSVAAAAEAA 1120
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKS 703
GN+ + RV ++ ++ NF +D ++ +K+ +K + F +S
Sbjct: 1121 RLAADSLVAKGNSR-ANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKR-KKPVKFARS 1178
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A DM+IEY GE VR IA+ RE S +G+ +Y+FRIDD VID
Sbjct: 1179 AIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGS-SYLFRIDDNTVID 1237
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F +
Sbjct: 1238 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTD 1297
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1298 RIPCLCGTAACKGFLN 1313
>gi|388581460|gb|EIM21768.1| hypothetical protein WALSEDRAFT_68696 [Wallemia sebi CBS 633.66]
Length = 891
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 75/198 (37%), Positives = 106/198 (53%), Gaps = 15/198 (7%)
Query: 380 RDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLW 439
+ LP + + + + C +C+ E +N + CD C + VH CYG + + W
Sbjct: 71 KKLPASH--TKAHFNEDSTCVICNDSECDNSNAIVFCDGCNLAVHQDCYG-IPYIPEGQW 127
Query: 440 LCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGL 499
LC C +PE P C LCP GGA K T G WAH+ CA WIPET L + EP++G+
Sbjct: 128 LCRKCT-VSPENPVSCVLCPNEGGAFKQTNSGAWAHVLCANWIPETGLANPVYQEPVEGI 186
Query: 500 NRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLD 559
+++ K RWKL C IC GACIQC + +C VA HP CA+ GL + + NL
Sbjct: 187 DKIPKSRWKLNCYICKEKMGACIQCDDRSCFVAMHPTCAKNFGLLCKTK-----NL---- 237
Query: 560 EDDEDQCIRLLSFCKKHK 577
+DQ I + + C +H+
Sbjct: 238 --PDDQTIIMRALCHRHR 253
>gi|383848022|ref|XP_003699651.1| PREDICTED: uncharacterized protein LOC100881339 [Megachile rotundata]
Length = 4805
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 97/157 (61%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +A+ RE
Sbjct: 4651 SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQ 4710
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + G YMFR+D+ERV+DAT G +A INHSC PNC + ++ V D IIIFAKR
Sbjct: 4711 -YEAR-NRGIYMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKR 4768
Query: 802 DIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
I + EEL YDY+F D+Q +AC CG P CR +N
Sbjct: 4769 RISRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 4805
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 62/142 (43%), Gaps = 23/142 (16%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPR-----VCNSYKMEVLR 283
G L LS+G+++ F +I+P GY VR + S+ P VC+ + +
Sbjct: 4443 VGSLIFLSVGQLLPHQLANFHTPNYIYPVGYKIVRFYWSMRRPNKRCRYVCSIHDVSG-- 4500
Query: 284 DTESKIRPLFRV----TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKI 339
RP FRV L + +TP WS+I + E + T S + +
Sbjct: 4501 ------RPEFRVLVQEPLQEDVELRDATPRAVWSRILEPLAELRRTT---HSVQLFPRYV 4551
Query: 340 SESGSDMFGFSNPEVMKLILGL 361
S G D+FG + P V++++ L
Sbjct: 4552 S--GEDLFGLTEPAVVRVLESL 4571
>gi|355702679|gb|AES02012.1| myeloid/lymphoid or mixed-lineage leukemia [Mustela putorius furo]
Length = 374
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 74/150 (49%), Positives = 100/150 (66%), Gaps = 5/150 (3%)
Query: 681 LSMADKYKHMKETFRKRLA-FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE 739
L M +++H+K+T ++ + + +S IHG G+F K AG+MVIEY+G ++R + D+RE
Sbjct: 226 LPMPMRFRHLKKTSKEAVGVYSRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKRE 285
Query: 740 HFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFA 799
F G G YMFR+DD V+DAT G+ A INHSCEPNC+SRVI V G +HI+IFA
Sbjct: 286 KFYDGK--GIGCYMFRMDDFDVVDATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFA 343
Query: 800 KRDIKQWEELTYDYRFFSID--EQLACYCG 827
R I + EELTYDY+F D +L C CG
Sbjct: 344 LRRILRGEELTYDYKFPIEDASNKLPCNCG 373
>gi|328776663|ref|XP_394941.4| PREDICTED: hypothetical protein LOC411466 [Apis mellifera]
Length = 4678
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 97/157 (61%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +A+ RE
Sbjct: 4524 SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQ 4583
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + G YMFR+D+ERV+DAT G +A INHSC PNC + ++ V D IIIFAKR
Sbjct: 4584 -YEAR-NRGIYMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKR 4641
Query: 802 DIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
I + EEL YDY+F D+Q +AC CG P CR +N
Sbjct: 4642 RISRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 4678
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 63/142 (44%), Gaps = 23/142 (16%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPR-----VCNSYKMEVLR 283
G L LS+G+++ F +I+P GY VR + S+ P VC+ + +
Sbjct: 4316 VGSLIFLSVGQLLPHQLANFHTPNYIYPVGYKIVRFYWSMRRPNKRCRYVCSIHDVSG-- 4373
Query: 284 DTESKIRPLFRV----TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKI 339
RP FRV L + +TP WS+I + E + +T S + +
Sbjct: 4374 ------RPEFRVLVQEPLQEDVELRDATPRAVWSRILEPLAELRRST---HSVQLFPRYV 4424
Query: 340 SESGSDMFGFSNPEVMKLILGL 361
S G D+FG + P V++++ L
Sbjct: 4425 S--GEDLFGLTEPAVVRVLESL 4444
>gi|340368978|ref|XP_003383026.1| PREDICTED: peregrin-like [Amphimedon queenslandica]
Length = 1083
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 69/150 (46%), Positives = 86/150 (57%), Gaps = 2/150 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E + N L CD C + VH CYG G WLC C +P C LC
Sbjct: 300 CCICNDGECHNTNAILFCDMCNLAVHQECYGVPYIPEG-QWLCRRCL-QSPSRSVDCVLC 357
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T DGRW+H+ C +WIPE + +EPIDG+N V RWKLLC IC
Sbjct: 358 PNKGGAFKQTIDGRWSHVICGLWIPEIQFANPVFLEPIDGINDVPSARWKLLCYICRKRT 417
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC+ C VA+H CA+ A LC+++E
Sbjct: 418 GACIQCAKANCYVAFHVTCAQQANLCMKIE 447
>gi|119467882|ref|XP_001257747.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405899|gb|EAW15850.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 1241
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 98/145 (67%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1098 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGS-SYLF 1156
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI + EELTYDY+
Sbjct: 1157 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYK 1216
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1217 FEREWDSDDRIPCLCGSTGCKGFLN 1241
>gi|410898832|ref|XP_003962901.1| PREDICTED: protein Jade-1-like [Takifugu rubripes]
Length = 851
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 75/179 (41%), Positives = 102/179 (56%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + NN + CDKC + VH CYG ++ V WLC +C G P C LC
Sbjct: 198 CDVCQSPDGEDNNEMVFCDKCNICVHQACYG-IQKVPKGSWLCRICALGIL---PKCQLC 253
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + + ++MEPI ++++ +RW L+C +C
Sbjct: 254 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGNPEKMEPITNVSQIPSNRWALICCLCKEK 313
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS CR A+H C GL LE +N + L DDE ++ S+C KH
Sbjct: 314 SGACIQCSAKNCRTAFHVTC----GLHASLE----MNTI-LTADDE---VKFKSYCPKH 360
>gi|327348240|gb|EGE77097.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 1280
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +A+ RE S +G+ +Y+F
Sbjct: 1137 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVANMRERRYLKSGIGS-SYLF 1195
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1196 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1255
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1256 FEREWDSDDRIPCLCGSTGCKGFLN 1280
>gi|225380776|gb|ACN88689.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 148
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/151 (49%), Positives = 100/151 (66%), Gaps = 6/151 (3%)
Query: 689 HMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVG 748
+K+ R + +S IHG G+F + G+MVIEY+G ++R + D+RE + + G
Sbjct: 1 QLKKASRDAVGVYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYDDK--G 58
Query: 749 AGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEE 808
G YMFRIDD V+DAT G+ A INHSCEPNCYSRVI+V+G +HI+IFA R I + EE
Sbjct: 59 IGCYMFRIDDYEVVDATIHGNSARFINHSCEPNCYSRVINVDGRKHIVIFATRKIYKGEE 118
Query: 809 LTYDYRFFSIDE---QLACYCGFPRCRGVVN 836
LTYDY+ F I+E +L C CG +CR +N
Sbjct: 119 LTYDYK-FPIEEPGNKLPCNCGAKKCRKFLN 148
>gi|322792928|gb|EFZ16758.1| hypothetical protein SINV_09282 [Solenopsis invicta]
Length = 3429
Score = 143 bits (360), Expect = 5e-31, Method: Composition-based stats.
Identities = 71/157 (45%), Positives = 95/157 (60%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +A+ RE
Sbjct: 3275 SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREK- 3333
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
G YMFR+D+ERV+DAT G +A INHSC PNC + ++ V D IIIFAKR
Sbjct: 3334 -QYEARNRGIYMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKR 3392
Query: 802 DIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
I + EEL YDY+F D+Q +AC CG P CR +N
Sbjct: 3393 RISRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 3429
Score = 47.4 bits (111), Expect = 0.032, Method: Composition-based stats.
Identities = 39/136 (28%), Positives = 63/136 (46%), Gaps = 13/136 (9%)
Query: 231 GDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKI 289
G L LS+G+++ + F +I+P GY VR + S+ P Y + D
Sbjct: 3046 GSLIFLSVGQLLPHQLQNFHTPNYIYPVGYKIVRFYWSMRRPNKRCRYVCSI-HDVSG-- 3102
Query: 290 RPLFRV----TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSD 345
RP FRV L + +TP WS+I + E + NT+ S + + SG D
Sbjct: 3103 RPEFRVLVQEPLQEDVELRDATPRAVWSRILEPLAELRRNTN---SVQLFPRYV--SGED 3157
Query: 346 MFGFSNPEVMKLILGL 361
+FG + P V++++ L
Sbjct: 3158 LFGLTEPAVVRVLESL 3173
>gi|195392728|ref|XP_002055009.1| GJ19139 [Drosophila virilis]
gi|194149519|gb|EDW65210.1| GJ19139 [Drosophila virilis]
Length = 2531
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2322 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 2363
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2364 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2417
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2418 IEYIGEVIRTEVSEIREKQ-YES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2475
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F D+ ++ C CG P CR +N
Sbjct: 2476 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2531
>gi|432864388|ref|XP_004070297.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Oryzias latipes]
Length = 1199
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/185 (40%), Positives = 100/185 (54%), Gaps = 8/185 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE +N+ L CD C + VH CYG G WLC C +P+ P C LC
Sbjct: 231 CCVCLDDECLNSNVILFCDSCNLAVHQECYGVPYIPEG-QWLCRCCL-QSPQKPIDCVLC 288
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EP++G++ + RWKL C +C
Sbjct: 289 PNRGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPVEGVSNIPPARWKLTCYLCKQKG 348
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E N+ + ++ +FC+ H
Sbjct: 349 RGASIQCHKANCYTAFHVTCAQRAGLFMKIEPVRETNV-----NGTTFSVKKTAFCEAHS 403
Query: 578 QPLND 582
P D
Sbjct: 404 PPGQD 408
>gi|164426120|ref|XP_961572.2| hypothetical protein NCU01206 [Neurospora crassa OR74A]
gi|157071206|gb|EAA32336.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1150
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 134/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E E A
Sbjct: 900 YVP--NPTGCARTEGVKKILNSEKSKYLPHHIKVKKAREEREKNAKNGNTNSVAAAAEAA 957
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKS 703
GN+ + RV ++ ++ NF +D ++ +K+ +K + F +S
Sbjct: 958 RLAADSLVAKGNSRAN-RVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKR-KKPVKFARS 1015
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A DM+IEY GE VR IA+ RE S +G+ +Y+FRIDD VID
Sbjct: 1016 AIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGS-SYLFRIDDNTVID 1074
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F +
Sbjct: 1075 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTD 1134
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1135 RIPCLCGTAACKGFLN 1150
>gi|24639197|ref|NP_525040.2| trithorax-related, isoform C [Drosophila melanogaster]
gi|22831528|gb|AAF45684.2| trithorax-related, isoform C [Drosophila melanogaster]
Length = 2410
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2201 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 2242
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2243 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2296
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2297 IEYIGEVIRTEVSEIREKQ-YESK-NRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2354
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 2355 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2410
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 34/148 (22%), Positives = 63/148 (42%), Gaps = 33/148 (22%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G++ L++G+++ E F +I+P GY R + + P Y + E+
Sbjct: 2045 VGNMTFLNVGQLLPHQLEAFHTPHYIYPIGYKVSRYYWCVRRPNRRCRYICSI---AEAG 2101
Query: 289 IRPLFRVTL-DNGE-----QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE- 341
+P FR+ + D G+ +F GS+PS W +I I K+ +
Sbjct: 2102 CKPEFRIQVQDAGDKEPEREFRGSSPSAVWQQILQPITR--------------LRKVHKW 2147
Query: 342 --------SGSDMFGFSNPEVMKLILGL 361
SG D+FG + P +++++ L
Sbjct: 2148 LQLFPQHISGEDLFGLTEPAIVRILESL 2175
>gi|50293843|ref|XP_449333.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637287|sp|Q6FKB1.1|SET1_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49528646|emb|CAG62307.1| unnamed protein product [Candida glabrata]
Length = 1111
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +MVIEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 968 KKPVTFARSAIHNWGLYALEPINAKEMVIEYVGERIRQPVAEMRERRYIKNGIGS-SYLF 1026
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH CEP+C +++I V G I+I+A RDI EELTYDY+
Sbjct: 1027 RIDEHTVIDATKKGGIARFINHCCEPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYK 1086
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 1087 FERETDAEERLPCLCGAPSCKGFLN 1111
>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
Length = 5713
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 97/157 (61%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +A+ RE
Sbjct: 5559 SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQ 5618
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + G YMFR+D+ERV+DAT G +A INHSC PNC + ++ V D IIIFAKR
Sbjct: 5619 -YEAR-NRGIYMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKR 5676
Query: 802 DIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
I + EEL YDY+F D+Q +AC CG P CR +N
Sbjct: 5677 RISRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5713
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 63/142 (44%), Gaps = 23/142 (16%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPR-----VCNSYKMEVLR 283
G L LS+G+++ F +I+P GY VR + S+ P VC+ + +
Sbjct: 5351 VGSLIFLSVGQLLPHQLANFHTPNYIYPVGYKIVRFYWSMRRPNKRCRYVCSIHDVSG-- 5408
Query: 284 DTESKIRPLFRV----TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKI 339
RP FRV L + +TP WS+I + E + +T S + +
Sbjct: 5409 ------RPEFRVLVQEPLQEDVELRDATPRAVWSRILEPLAELRRST---HSVQLFPRYV 5459
Query: 340 SESGSDMFGFSNPEVMKLILGL 361
S G D+FG + P V++++ L
Sbjct: 5460 S--GEDLFGLTEPAVVRVLESL 5479
>gi|428171302|gb|EKX40220.1| hypothetical protein GUITHDRAFT_75734 [Guillardia theta CCMP2712]
Length = 156
Score = 143 bits (360), Expect = 5e-31, Method: Composition-based stats.
Identities = 68/158 (43%), Positives = 97/158 (61%), Gaps = 13/158 (8%)
Query: 690 MKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGA 749
MK +RL G+SGI G+GIFA+ P G+MV+EYTGE +RP +AD RE G
Sbjct: 1 MKANQNRRLVLGRSGIEGYGIFARQPIFKGEMVVEYTGEAIRPVVADVREE--QYEAAGL 58
Query: 750 GTYMFRIDD-----------ERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
GTY +R+++ ++DAT +I H INH C+PNC ++++ +NG III
Sbjct: 59 GTYFWRLEEYLGAGEPPEGRAAIVDATIRHNIGHYINHCCDPNCEAKILKINGQRRIIIS 118
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
A D++ EELTYDY+ D+++ C+CG P CRG +N
Sbjct: 119 AIHDVQFGEELTYDYKLPFEDKKIPCHCGAPTCRGTMN 156
>gi|395325651|gb|EJF58070.1| hypothetical protein DICSQDRAFT_182787 [Dichomitus squalens
LYAD-421 SS1]
Length = 1592
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/186 (38%), Positives = 97/186 (52%), Gaps = 13/186 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 133 CAICDDSEGENANAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCILC 190
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G W HL CAIW+PET + + MEPI G+ ++SK RW+L CSIC +
Sbjct: 191 PNEGGAFKQTVSGDWVHLLCAIWVPETAVANEVFMEPITGVEKISKQRWRLRCSICDEKH 250
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ 578
GACIQC+ +C A+H CAR L L+ + + L +C+KH
Sbjct: 251 GACIQCTKPSCFTAFHATCARKEKL-----------LMPMKASQGSEAPVLACYCEKHLP 299
Query: 579 PLNDRL 584
P ++
Sbjct: 300 PEQQKV 305
>gi|384499027|gb|EIE89518.1| hypothetical protein RO3G_14229 [Rhizopus delemar RA 99-880]
Length = 1674
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 102/151 (67%), Gaps = 2/151 (1%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K+ +K + +K+L F KS IH +G+FA+ DMVIEY GE++R +A+ RE
Sbjct: 1526 KFNQLK-SRKKQLKFAKSPIHDWGLFAEEHIDVNDMVIEYVGEMIRQQVAEEREKQYERC 1584
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FR+DD+ VIDAT+ GSIA INH C PNC +++I+V+ + I+I+A RDI+
Sbjct: 1585 GIGS-SYLFRVDDDTVIDATKRGSIARFINHCCSPNCSAKIITVDKQKKIVIYANRDIEP 1643
Query: 806 WEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+F E++ C CG C+G +N
Sbjct: 1644 GEEITYDYKFPIEAEKIPCLCGSKFCKGTLN 1674
>gi|159124080|gb|EDP49198.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 1241
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 98/145 (67%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1098 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGS-SYLF 1156
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI + EELTYDY+
Sbjct: 1157 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYK 1216
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1217 FEREWDSDDRIPCLCGSTGCKGFLN 1241
>gi|70991351|ref|XP_750524.1| SET domain protein [Aspergillus fumigatus Af293]
gi|74671075|sp|Q4WNH8.1|SET1_ASPFU RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component set1; AltName:
Full=SET domain-containing protein 1
gi|66848157|gb|EAL88486.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 1241
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 68/145 (46%), Positives = 98/145 (67%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1098 KKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGS-SYLF 1156
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI + EELTYDY+
Sbjct: 1157 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYK 1216
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1217 FEREWDSDDRIPCLCGSTGCKGFLN 1241
>gi|194912727|ref|XP_001982563.1| GG12672 [Drosophila erecta]
gi|190648239|gb|EDV45532.1| GG12672 [Drosophila erecta]
Length = 2406
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2197 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 2238
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2239 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2292
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2293 IEYIGEVIRTEVSEIREKQ-YESK-NRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2350
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 2351 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2406
>gi|28571451|ref|NP_726773.2| trithorax-related, isoform D [Drosophila melanogaster]
gi|74865454|sp|Q8IRW8.2|TRR_DROME RecName: Full=Histone-lysine N-methyltransferase trr; AltName:
Full=Lysine N-methyltransferase 2C; AltName:
Full=Trithorax-related protein
gi|28381556|gb|AAN09063.2| trithorax-related, isoform D [Drosophila melanogaster]
Length = 2431
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2222 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 2263
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2264 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2317
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2318 IEYIGEVIRTEVSEIREKQ-YESK-NRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2375
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 2376 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2431
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 32/148 (21%), Positives = 61/148 (41%), Gaps = 33/148 (22%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G++ L++G+++ E F +I+P GY R + + P Y + E+
Sbjct: 2066 VGNMTFLNVGQLLPHQLEAFHTPHYIYPIGYKVSRYYWCVRRPNRRCRYICSI---AEAG 2122
Query: 289 IRPLFRVTLDNG------EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE- 341
+P FR+ + + +F GS+PS W +I I K+ +
Sbjct: 2123 CKPEFRIQVQDAGDKEPEREFRGSSPSAVWQQILQPITR--------------LRKVHKW 2168
Query: 342 --------SGSDMFGFSNPEVMKLILGL 361
SG D+FG + P +++++ L
Sbjct: 2169 LQLFPQHISGEDLFGLTEPAIVRILESL 2196
>gi|3256105|emb|CAA15944.1| EG:63B12.3 [Drosophila melanogaster]
Length = 2422
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2213 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 2254
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2255 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2308
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2309 IEYIGEVIRTEVSEIREKQ-YESK-NRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2366
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 2367 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2422
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 32/148 (21%), Positives = 61/148 (41%), Gaps = 33/148 (22%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G++ L++G+++ E F +I+P GY R + + P Y + E+
Sbjct: 2057 VGNMTFLNVGQLLPHQLEAFHTPHYIYPIGYKVSRYYWCVRRPNRRCRYICSI---AEAG 2113
Query: 289 IRPLFRVTLDNG------EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE- 341
+P FR+ + + +F GS+PS W +I I K+ +
Sbjct: 2114 CKPEFRIQVQDAGDKEPEREFRGSSPSAVWQQILQPITR--------------LRKVHKW 2159
Query: 342 --------SGSDMFGFSNPEVMKLILGL 361
SG D+FG + P +++++ L
Sbjct: 2160 LQLFPQHISGEDLFGLTEPAIVRILESL 2187
>gi|312091131|ref|XP_003146871.1| histone methyltransferase [Loa loa]
gi|307757965|gb|EFO17199.1| histone methyltransferase [Loa loa]
Length = 278
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 109/156 (69%), Gaps = 2/156 (1%)
Query: 682 SMADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
+ +D +K + FRK+L F +S IHG+G++A +M++EY G+ +RP++AD RE
Sbjct: 124 TSSDIFKVNQLKFRKKLIKFARSRIHGWGLYALEAIAPDEMIVEYIGQKIRPTVADEREK 183
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+G+ +Y+FRID + VIDAT+ G++A INHSC+PNCY++++ V+G++ I+I++K
Sbjct: 184 RYERRGMGS-SYLFRIDSDNVIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSK 242
Query: 801 RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I + +E+TYDY+F ++++ C CG P CRG +N
Sbjct: 243 LAINKGDEITYDYKFPIEEDKIDCLCGAPGCRGSLN 278
>gi|195132807|ref|XP_002010834.1| GI21761 [Drosophila mojavensis]
gi|193907622|gb|EDW06489.1| GI21761 [Drosophila mojavensis]
Length = 2748
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2539 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 2580
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2581 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2634
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2635 IEYIGEVIRTEVSEIREKQ-YES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2692
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F D+ ++ C CG P CR +N
Sbjct: 2693 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2748
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 63/141 (44%), Gaps = 19/141 (13%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G++ L++G+++ E F +I+P GYT R + + P Y V E
Sbjct: 2383 VGNMTFLNVGQLLPHQLEAFHTPHYIYPIGYTVSRYYWCVRRPNRRCRY---VCTIAEVG 2439
Query: 289 IRPLFRVTLDNGE------QFTGSTPSTCWSKICMKIRE--GQNNTSDDFSAEGAAEKIS 340
RP FR+ + +G +F STPS W +I I+ ++N F
Sbjct: 2440 CRPEFRIQVHDGNDKEPDREFRDSTPSAVWQQILQPIQRLRKEHNWLQLFPQHI------ 2493
Query: 341 ESGSDMFGFSNPEVMKLILGL 361
SG D+FG + P +++++ L
Sbjct: 2494 -SGEDLFGLTEPAIVRILESL 2513
>gi|71015569|ref|XP_758824.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
gi|74702458|sp|Q4PB36.1|SET1_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46098614|gb|EAK83847.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
Length = 1468
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 69/143 (48%), Positives = 97/143 (67%), Gaps = 4/143 (2%)
Query: 693 TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTY 752
T +K+L F KS IH +G++A AGDMVIEY GE+VR +AD RE Y TY
Sbjct: 1324 TRKKQLKFAKSPIHDWGLYAMELIPAGDMVIEYVGEVVRQQVADEREK-QYERQGNFSTY 1382
Query: 753 MFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYD 812
+FR+DD+ V+DAT G+IA L+NH C PNC ++++++NG++ I++FAK I+ EELTYD
Sbjct: 1383 LFRVDDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYD 1442
Query: 813 YRFFSI---DEQLACYCGFPRCR 832
Y+F S ++ + C CG P CR
Sbjct: 1443 YKFQSSADDEDAIPCLCGSPGCR 1465
>gi|409078063|gb|EKM78427.1| hypothetical protein AGABI1DRAFT_41599 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426194069|gb|EKV44001.1| histone methyltransferase [Agaricus bisporus var. bisporus H97]
Length = 163
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 104/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F KS IH +G++A G+MVIEY GE++R ++AD+RE +G+ +
Sbjct: 20 QTRKKHLRFAKSPIHDWGLYAMERIARGEMVIEYVGEVIRAAVADKREKAYERQGIGS-S 78
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID++ V+DAT+ G++ LINHSC+PNC +++I+++G + I+I+AK+DI+ +E+TY
Sbjct: 79 YLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGDEITY 138
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +++ C CG +CRG +N
Sbjct: 139 DYHFPFEQDKIPCLCGSAKCRGFLN 163
>gi|164662150|ref|XP_001732197.1| hypothetical protein MGL_0790 [Malassezia globosa CBS 7966]
gi|159106099|gb|EDP44983.1| hypothetical protein MGL_0790 [Malassezia globosa CBS 7966]
Length = 574
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/198 (38%), Positives = 108/198 (54%), Gaps = 9/198 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +P P C LC
Sbjct: 106 CALCDDSECENLNAIVFCDGCNLAVHQDCYG-VPFIPEGQWLCRKCT-VSPNRPVSCALC 163
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T DG WAHL CA+WIPET +++ MEPIDG+N + K RW+L C +C +
Sbjct: 164 PQEGGAFKQTIDGTWAHLLCAMWIPETGVSNSVYMEPIDGINAIPKARWRLRCYLCQSRH 223
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ 578
GACIQC + +C A+H +CAR AGL + + E+ + + ++C H
Sbjct: 224 GACIQCEHRSCFTAFHVMCARRAGLLSHAHGQHEM------EEQDTKPDGPAAYCHHHLP 277
Query: 579 PLNDRLAVDERLVQVTRR 596
P ++ A+ ER+ RR
Sbjct: 278 PA-EKAALLERVRSNKRR 294
>gi|336259450|ref|XP_003344526.1| hypothetical protein SMAC_07534 [Sordaria macrospora k-hell]
gi|380093240|emb|CCC08898.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1314
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 134/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E E A
Sbjct: 1064 YVP--NPTGCARTEGVKKILNSEKSKYLPHHIKVKKAREEREKNAKNGNINSVAAAAEAA 1121
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKS 703
GN+ + RV ++ ++ NF +D ++ +K+ +K + F +S
Sbjct: 1122 RLAADSLVAKGNSR-ANRVNNRRYVAEINDQRKNFGQDSDVLRFNQLKKR-KKPVKFARS 1179
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A DM+IEY GE VR IA+ RE S +G+ +Y+FRIDD VID
Sbjct: 1180 AIHNWGLYAMENINKDDMIIEYVGEEVRQQIAELREARYLKSGIGS-SYLFRIDDNTVID 1238
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F +
Sbjct: 1239 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTD 1298
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1299 RIPCLCGTAACKGFLN 1314
>gi|296422910|ref|XP_002841001.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637229|emb|CAZ85192.1| unnamed protein product [Tuber melanosporum]
Length = 1313
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/179 (41%), Positives = 94/179 (52%), Gaps = 9/179 (5%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KC +C E +N + CD C + VH CYG G WLC C P C
Sbjct: 509 KCVICDDGECENSNAIVFCDGCNLAVHQECYGVPHIPEG-QWLCRKCL-AIPNKTANCIF 566
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GA K TT+ +WAHL CAIWIPE L + MEP+DG++ V K RWKL C IC
Sbjct: 567 CPNTDGAFKQTTNTKWAHLLCAIWIPEVILGNTSYMEPVDGMDSVPKSRWKLSCYICKQK 626
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCSN C +A+H C R A L +++++ SL + L ++C KH
Sbjct: 627 MGACIQCSNKNCFIAFHVTCGRRARLSMKMKN-------SLGTGALMETSALKAYCDKH 678
>gi|366999050|ref|XP_003684261.1| hypothetical protein TPHA_0B01550 [Tetrapisispora phaffii CBS 4417]
gi|357522557|emb|CCE61827.1| hypothetical protein TPHA_0B01550 [Tetrapisispora phaffii CBS 4417]
Length = 734
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 96/178 (53%), Gaps = 5/178 (2%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC E Q N + CD C + VH CYG + G WLC LC + + C C
Sbjct: 254 CAVCGGGESTQTNAIVFCDGCNLAVHQECYGIIFIPEGQ-WLCRLCLV-SKDRKVDCLFC 311
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GA K T G WAH+ CA+W+PE ++ MEPI+G+ ++K RW+L C IC
Sbjct: 312 PSTTGAFKQTDTGSWAHVVCALWLPELYFANLNYMEPIEGMKNINKSRWRLNCYICDQKI 371
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCSN C AYH CA+ + L + + + + S+ ++ + + SFC KH
Sbjct: 372 GACIQCSNKNCFTAYHVTCAKRSNLYMSF---NNIPVSSVAQNQTVNDLTIESFCDKH 426
>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
Length = 5619
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 97/157 (61%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +A+ RE
Sbjct: 5465 SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQ 5524
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + G YMFR+D+ERV+DAT G +A INHSC PNC + ++ V D IIIFAKR
Sbjct: 5525 -YEAR-NRGIYMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKR 5582
Query: 802 DIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
I + EEL YDY+F D+Q +AC CG P CR +N
Sbjct: 5583 RISRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5619
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 63/142 (44%), Gaps = 23/142 (16%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPR-----VCNSYKMEVLR 283
G L LS+G+++ F +I+P GY VR + S+ P VC+ + +
Sbjct: 5257 VGSLIFLSVGQLLPHQLANFHTPNYIYPVGYKIVRFYWSMRRPNKRCRYVCSIHDVSG-- 5314
Query: 284 DTESKIRPLFRVTLDNGEQ----FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKI 339
RP FRV + Q +TP WS+I + E + +T S + +
Sbjct: 5315 ------RPEFRVLVQEPSQEDVELRDATPRAVWSRILEPLAELRRST---HSVQLFPRYV 5365
Query: 340 SESGSDMFGFSNPEVMKLILGL 361
S G D+FG + P V++++ L
Sbjct: 5366 S--GEDLFGLTEPAVVRVLESL 5385
>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
Length = 5622
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 97/157 (61%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +A+ RE
Sbjct: 5468 SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQ 5527
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + G YMFR+D+ERV+DAT G +A INHSC PNC + ++ V D IIIFAKR
Sbjct: 5528 -YEAR-NRGIYMFRLDEERVVDATLCGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKR 5585
Query: 802 DIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
I + EEL YDY+F D+Q +AC CG P CR +N
Sbjct: 5586 RISRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5622
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 63/142 (44%), Gaps = 23/142 (16%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPR-----VCNSYKMEVLR 283
G L LS+G+++ F +I+P GY VR + S+ P VC+ + +
Sbjct: 5260 VGSLIFLSVGQLLPHQLANFHTPNYIYPVGYKIVRFYWSMRRPNKRCRYVCSIHDVSG-- 5317
Query: 284 DTESKIRPLFRV----TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKI 339
RP FRV L + +TP WS+I + E + +T S + +
Sbjct: 5318 ------RPEFRVLVQEPLQEDVELRDATPRAVWSRILEPLAELRRST---HSVQLFPRYV 5368
Query: 340 SESGSDMFGFSNPEVMKLILGL 361
S G D+FG + P V++++ L
Sbjct: 5369 S--GEDLFGLTEPAVVRVLESL 5388
>gi|116199091|ref|XP_001225357.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
gi|121922631|sp|Q2GWF3.1|SET1_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|88178980|gb|EAQ86448.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
Length = 1076
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 89/256 (34%), Positives = 135/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E +A + K +
Sbjct: 826 YVP--NPTGCARAEGVKKILNSEKSKYLPHHIKVKKAREERQAQNGKNAKDSVLAAAEAA 883
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMAD--KYKHMKETFRKRLAFGKS 703
GN+ + R +F L+ +D ++ +K+ +K + F +S
Sbjct: 884 RLAAESLVAKGNSRAN-RANNRRFVADLNDQRKTLGQDSDVLRFNQLKKR-KKPVKFARS 941
Query: 704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID 763
IH +G++A DM+IEY GE VR IA+ RE+ S +G+ +Y+FRIDD VID
Sbjct: 942 AIHNWGLYAMENIPKDDMIIEYVGEEVRQQIAELRENRYLKSGIGS-SYLFRIDDNTVID 1000
Query: 764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDE 820
AT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F +
Sbjct: 1001 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTD 1060
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1061 RIPCLCGTAACKGFLN 1076
>gi|307219250|ref|NP_001090707.2| PHD finger protein 17 [Xenopus (Silurana) tropicalis]
Length = 831
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/187 (41%), Positives = 100/187 (53%), Gaps = 20/187 (10%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 205 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 260
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L CS+C
Sbjct: 261 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALFCSLCNEK 320
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH- 576
GACIQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 321 LGACIQCSIKNCRTAFHVTCAFDHGLEMK---------TILTEEDE---VKFKSYCPKHG 368
Query: 577 --KQPLN 581
K+P N
Sbjct: 369 STKKPEN 375
>gi|148224982|ref|NP_001085440.1| protein Jade-1 [Xenopus laevis]
gi|82184662|sp|Q6GQJ2.1|JADE1_XENLA RecName: Full=Protein Jade-1; AltName: Full=PHD finger protein 17
gi|49119086|gb|AAH72750.1| MGC79115 protein [Xenopus laevis]
Length = 827
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 96/179 (53%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + +RW LLCS+C
Sbjct: 259 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSNRWALLCSLCNEK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS CR A+H CA GL + + ED+ ++ S+C KH
Sbjct: 319 VGACIQCSIKNCRTAFHVTCAFDHGL--------EMKTILTQEDE----VKFKSYCPKH 365
>gi|198467813|ref|XP_001354516.2| GA17728 [Drosophila pseudoobscura pseudoobscura]
gi|198146119|gb|EAL31569.2| GA17728 [Drosophila pseudoobscura pseudoobscura]
Length = 2566
Score = 142 bits (358), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2357 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSASIA 2398
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2399 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2452
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2453 IEYIGEVIRTEVSEIREKQ-YES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2510
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F D+ ++ C CG P CR +N
Sbjct: 2511 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2566
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 33/148 (22%), Positives = 61/148 (41%), Gaps = 33/148 (22%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G++ L++G+++ E F FI+P GY R + + P Y + E+
Sbjct: 2201 VGNMTFLNVGQLLPHQLEAFHTPHFIYPIGYKVSRYYWCVRRPNRRCRYICSI---AEAG 2257
Query: 289 IRPLFRVTLDNGE------QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE- 341
+P FR+ + +G +F STP+ W +I I K+ +
Sbjct: 2258 CKPEFRIVVQDGSDKEPEREFQASTPTGVWQQILQPITR--------------LRKVHKW 2303
Query: 342 --------SGSDMFGFSNPEVMKLILGL 361
SG D+FG + P +++++ L
Sbjct: 2304 LQLFPQHISGEDLFGLTEPAIVRILESL 2331
>gi|198282069|ref|NP_001128279.1| bromodomain and PHD finger containing, 3 [Xenopus (Silurana)
tropicalis]
gi|197245583|gb|AAI68469.1| LOC779492 protein [Xenopus (Silurana) tropicalis]
Length = 983
Score = 142 bits (358), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 77/184 (41%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N L CD C + VH CYG G WLC C +P P C LC
Sbjct: 219 CCVCLDDECHNSNAILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSKPVSCVLC 276
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EP++G+N + RWKL C +C
Sbjct: 277 PNQGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPVEGVNNIPSARWKLTCYLCKQKG 336
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R +FC+
Sbjct: 337 RGAAIQCHKVNCYTAFHVTCAQRAGLFMKVEPVRETGLNGTTFT-------VRKTAFCEL 389
Query: 576 HKQP 579
H P
Sbjct: 390 HCPP 393
>gi|388853505|emb|CCF52904.1| related to regulatory protein SET1 [Ustilago hordei]
Length = 1489
Score = 142 bits (358), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 70/143 (48%), Positives = 96/143 (67%), Gaps = 4/143 (2%)
Query: 693 TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTY 752
T +K+L F KS IH +G++A AGDMVIEY GE+VR +AD RE Y TY
Sbjct: 1345 TRKKQLKFAKSPIHDWGLYAMEYIPAGDMVIEYVGEMVRQQVADNREK-QYERQGNFSTY 1403
Query: 753 MFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYD 812
+FR+DD+ V+DAT G+IA L+NH C PNC +++++VNG++ I++FAK IK EELTYD
Sbjct: 1404 LFRVDDDLVVDATHKGNIARLMNHCCTPNCNAKILTVNGEKRIVLFAKSPIKAGEELTYD 1463
Query: 813 YRFFSI---DEQLACYCGFPRCR 832
Y+F S ++ + C CG CR
Sbjct: 1464 YKFQSSADDEDAIPCLCGSDGCR 1486
>gi|238879404|gb|EEQ43042.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1040
Score = 142 bits (358), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 897 KKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGS-SYLF 955
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RIDD VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 956 RIDDNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1015
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1016 FERETNDEERIRCLCGAPGCKGYLN 1040
>gi|47206036|emb|CAF91716.1| unnamed protein product [Tetraodon nigroviridis]
Length = 716
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/190 (38%), Positives = 103/190 (54%), Gaps = 8/190 (4%)
Query: 388 PVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPG 447
P + +D C VC DE +N+ L CD C + VH CYG G WLC C
Sbjct: 219 PAQNTIEDDAFCCVCLDDECLNSNVILFCDSCNLAVHQECYGVPYIPEG-QWLCRCCL-Q 276
Query: 448 APEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRW 507
+P+ P C LCP GGA K T+DGRWAH+ CAIWIPE C ++ +EP++G+ + RW
Sbjct: 277 SPQKPVDCVLCPNRGGAFKQTSDGRWAHVVCAIWIPEVCFSNTVFLEPVEGVGNIPTARW 336
Query: 508 KLLCSICGVS-YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQC 566
KL C +C GA IQC C A+H CA+ AGL +++E +N+ +
Sbjct: 337 KLTCCLCKQKGRGASIQCHKANCYTAFHVTCAQRAGLFMKIEPVREINV-----NGTTFS 391
Query: 567 IRLLSFCKKH 576
++ +FC+ H
Sbjct: 392 VKKTAFCEAH 401
>gi|68473736|ref|XP_718971.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|68473945|ref|XP_718869.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|74586641|sp|Q5ABG1.1|SET1_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46440662|gb|EAK99965.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|46440768|gb|EAL00070.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
Length = 1040
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 897 KKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGS-SYLF 955
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RIDD VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 956 RIDDNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1015
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1016 FERETNDEERIRCLCGAPGCKGYLN 1040
>gi|345496695|ref|XP_001602710.2| PREDICTED: hypothetical protein LOC100118826 [Nasonia vitripennis]
Length = 3084
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/179 (41%), Positives = 98/179 (54%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG +G WLC C G P C LC
Sbjct: 253 CDVCRSPDSEEGNEMVFCDFCNICVHQACYGITSIPDGS-WLCRTCSLGQR---PECVLC 308
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + V++MEPI ++ + RW L+C +C
Sbjct: 309 PNPGGAMKSTRSGQKWAHVSCALWIPEVSIGCVEKMEPITKISSIPPSRWALICVLCRER 368
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS TC+ AYH CA GL + + DE ED ++L S+C+KH
Sbjct: 369 VGACIQCSIKTCKTAYHVTCAFKHGL--------EMKAIIEDEQAED-GVKLRSYCQKH 418
>gi|170095481|ref|XP_001878961.1| histone methyltransferase [Laccaria bicolor S238N-H82]
gi|164646265|gb|EDR10511.1| histone methyltransferase [Laccaria bicolor S238N-H82]
Length = 144
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 104/145 (71%), Gaps = 1/145 (0%)
Query: 692 ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT 751
+T +K L F +S IH +G++A G+MVIEY GE++R +A++RE +G+ +
Sbjct: 1 QTRKKHLRFARSPIHDWGLYAMEKISKGEMVIEYVGEVIRAQVAEKREKTYERQGIGS-S 59
Query: 752 YMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTY 811
Y+FRID++ V+DAT+ G++ LINHSC+PNC +++I+++G++ I+I+AK+DI+ +E+TY
Sbjct: 60 YLFRIDEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGDEITY 119
Query: 812 DYRFFSIDEQLACYCGFPRCRGVVN 836
DY F +++ C CG +CRG +N
Sbjct: 120 DYHFPFEQDKILCLCGSVKCRGFLN 144
>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
Length = 3474
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 95/157 (60%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +A+ RE
Sbjct: 3320 SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSELAETREK- 3378
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
G YMFR+D+ERVIDAT G +A INHSC PNC + ++ V D IIIFAKR
Sbjct: 3379 -QYEARNRGIYMFRLDEERVIDATLCGGLARYINHSCNPNCVAEIVEVERDFRIIIFAKR 3437
Query: 802 DIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
I + EEL YDY+F D+Q +AC CG P CR +N
Sbjct: 3438 RISRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 3474
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 63/137 (45%), Gaps = 13/137 (9%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G L LS+G+++ + F +I+P GY VR + S+ P Y V +
Sbjct: 3090 VGSLIFLSVGQLLPHQLQNFHTPNYIYPVGYKIVRFYWSMRRPNKRCRY---VCSIHDVS 3146
Query: 289 IRPLFRV----TLDNGEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGS 344
RP FRV L + +TP WS+I + E + +T+ S + +S G
Sbjct: 3147 GRPEFRVLVQEPLQEDVELRDATPRAVWSRILEPLAELRRSTN---SVQLFPRYVS--GE 3201
Query: 345 DMFGFSNPEVMKLILGL 361
D+FG + P V++++ L
Sbjct: 3202 DLFGLTEPAVVRVLESL 3218
>gi|149243887|ref|XP_001526541.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146448935|gb|EDK43191.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1156
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 98/145 (67%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 1013 KKPVTFARSSIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLRTGIGS-SYLF 1071
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V+G + I+I+A RDI+ EELTYDY+
Sbjct: 1072 RIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYK 1131
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + DE++ C CG P C+G +N
Sbjct: 1132 FERETNDDERIRCLCGAPGCKGFLN 1156
>gi|255711468|ref|XP_002552017.1| KLTH0B05280p [Lachancea thermotolerans]
gi|238933395|emb|CAR21579.1| KLTH0B05280p [Lachancea thermotolerans CBS 6340]
Length = 986
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 98/145 (67%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE++R +A+ RE S +G+ +Y+F
Sbjct: 843 KKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGEILRQPVAEMRERTYLKSGIGS-SYLF 901
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+P+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 902 RVDESTVIDATKKGGIARFINHCCDPSCTAKIIRVGGRKRIVIYALRDIAANEELTYDYK 961
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C+CG P C+G +N
Sbjct: 962 FERETDDEERLPCFCGAPTCKGFLN 986
>gi|118763646|gb|AAI28634.1| phf17 protein [Xenopus (Silurana) tropicalis]
Length = 782
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/187 (41%), Positives = 100/187 (53%), Gaps = 20/187 (10%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 205 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 260
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L CS+C
Sbjct: 261 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALFCSLCNEK 320
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH- 576
GACIQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 321 LGACIQCSIKNCRTAFHVTCAFDHGLEMK---------TILTEEDE---VKFKSYCPKHG 368
Query: 577 --KQPLN 581
K+P N
Sbjct: 369 STKKPEN 375
>gi|195404288|ref|XP_002060445.1| GJ19853 [Drosophila virilis]
gi|194156330|gb|EDW71514.1| GJ19853 [Drosophila virilis]
Length = 1720
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 1511 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 1552
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 1553 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 1606
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 1607 IEYIGEVIRTEVSEIREKQ-YES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 1664
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F D+ ++ C CG P CR +N
Sbjct: 1665 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 1720
>gi|313239304|emb|CBY14252.1| unnamed protein product [Oikopleura dioica]
Length = 2763
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 114/206 (55%), Gaps = 24/206 (11%)
Query: 647 GGYCQNGLSGNTLPSIRVIGSKFSF--SLHRDA----PNF-------LSMADKYKHMKET 693
G + G SG PS FSF S HR NF L + +++ M ET
Sbjct: 2566 GCWRAQGWSGKRNPS-----DMFSFLNSTHRSGLTHDQNFRPTMETDLPLTMRFRKMVET 2620
Query: 694 FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYM 753
R L S IHG G+FA+ P+ G++VIEY+G ++R + D RE + + G GTYM
Sbjct: 2621 SRNSLQVLPSAIHGRGLFARRPYEPGELVIEYSGTVIRGELCDMREKYYDDR--GIGTYM 2678
Query: 754 FRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY 813
FR+DD+ V+DAT G A INHSC PNC S++I+V+ +HI I A R I EELTYDY
Sbjct: 2679 FRVDDDLVVDATMEGGRARFINHSCGPNCLSKIITVDSRKHICIIAGRFIDFGEELTYDY 2738
Query: 814 RF---FSIDEQLACYCGFPRCRGVVN 836
+F F E++ C C P CR +N
Sbjct: 2739 KFDRDFG-SERIECGCQAPNCRRFMN 2763
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 39/100 (39%), Gaps = 7/100 (7%)
Query: 453 PPCCLCPVVGGAMKPTTDGR--WAHLACAIWIPETCLTDVKRMEPIDGLNR---VSKDRW 507
P CCLC + G+ WAH AC +W T I G + S R
Sbjct: 364 PACCLCTAKPSYARLIYLGQDEWAHAACVLWAGRLANTKGNYSSMIRGYSHGIGPSAARE 423
Query: 508 KLLCSICGVSYGACI--QCSNTTCRVAYHPLCARAAGLCV 545
+ C CG S + I +C+ C + H CAR G V
Sbjct: 424 EESCRHCGYSIRSAISVKCAKEGCNLVMHFFCARLTGWLV 463
>gi|343429488|emb|CBQ73061.1| related to regulatory protein SET1 [Sporisorium reilianum SRZ2]
Length = 1453
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/143 (48%), Positives = 97/143 (67%), Gaps = 4/143 (2%)
Query: 693 TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTY 752
T +K+L F KS IH +G++A AGDMVIEY GE+VR +AD RE Y TY
Sbjct: 1309 TRKKQLKFAKSPIHDWGLYAMEFIPAGDMVIEYVGEVVRQQVADEREK-QYERQGNFSTY 1367
Query: 753 MFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYD 812
+FR+DD+ V+DAT G+IA L+NH C PNC ++++++NG++ I++FAK I+ EELTYD
Sbjct: 1368 LFRVDDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRAGEELTYD 1427
Query: 813 YRFFSI---DEQLACYCGFPRCR 832
Y+F S ++ + C CG P CR
Sbjct: 1428 YKFQSSADDEDAIPCLCGSPGCR 1450
>gi|28277052|gb|AAH44818.1| Mll1 protein, partial [Mus musculus]
Length = 142
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/137 (54%), Positives = 94/137 (68%), Gaps = 4/137 (2%)
Query: 702 KSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERV 761
+S IHG G+F K AG+MVIEY G ++R D+RE + Y+S G G YMFRIDD V
Sbjct: 8 RSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKY-YDSK-GIGCYMFRIDDSEV 65
Query: 762 IDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID-- 819
+DAT G+ A INHSCEPNCYSRVI+++G +HI+IFA R I + EELTYDY+F D
Sbjct: 66 VDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDAS 125
Query: 820 EQLACYCGFPRCRGVVN 836
+L C CG +CR +N
Sbjct: 126 NKLPCNCGAKKCRKFLN 142
>gi|390461571|ref|XP_002746520.2| PREDICTED: bromodomain and PHD finger-containing protein 3
[Callithrix jacchus]
Length = 1101
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|403261686|ref|XP_003923245.1| PREDICTED: bromodomain and PHD finger-containing protein 3 [Saimiri
boliviensis boliviensis]
Length = 1205
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG + + WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYG-VPYIPEGQWLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|195011749|ref|XP_001983299.1| GH15663 [Drosophila grimshawi]
gi|193896781|gb|EDV95647.1| GH15663 [Drosophila grimshawi]
Length = 3547
Score = 141 bits (356), Expect = 1e-30, Method: Composition-based stats.
Identities = 71/179 (39%), Positives = 97/179 (54%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG + + WLC C G P C LC
Sbjct: 338 CDVCRSPDSEEANEMVFCDNCNICVHQACYG-ITAIPSGQWLCRTCSMGIK---PDCVLC 393
Query: 459 PVVGGAMKPTTDGR-WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK G+ WAH++CA+WIPE + V RMEPI ++ + RW L+C +C
Sbjct: 394 PNKGGAMKSNKSGKHWAHVSCALWIPEVSIGCVDRMEPITKISSIPHSRWALICVLCRER 453
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
G+CIQCS TC+ AYH CA GL E R ++E + + ++L S+C+KH
Sbjct: 454 VGSCIQCSVKTCKTAYHVTCAFQHGL------EMR---AIIEEGNSEDGVKLRSYCQKH 503
>gi|410959026|ref|XP_003986113.1| PREDICTED: bromodomain and PHD finger-containing protein 3 isoform
1 [Felis catus]
Length = 936
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 387
Query: 578 QP 579
P
Sbjct: 388 PP 389
>gi|194767856|ref|XP_001966030.1| GF19475 [Drosophila ananassae]
gi|190622915|gb|EDV38439.1| GF19475 [Drosophila ananassae]
Length = 2490
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 2281 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 2322
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 2323 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 2376
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 2377 IEYIGEVIRTEVSEIREKQ-YESK-NRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 2434
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F D+ ++ C CG P CR +N
Sbjct: 2435 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 2490
>gi|410209438|gb|JAA01938.1| bromodomain and PHD finger containing, 3 [Pan troglodytes]
gi|410264976|gb|JAA20454.1| bromodomain and PHD finger containing, 3 [Pan troglodytes]
gi|410348494|gb|JAA40851.1| bromodomain and PHD finger containing, 3 [Pan troglodytes]
Length = 1205
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|109659078|gb|AAI17388.1| BRPF3 protein [Homo sapiens]
gi|219518057|gb|AAI43918.1| BRPF3 protein [Homo sapiens]
Length = 935
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 387
Query: 578 QP 579
P
Sbjct: 388 PP 389
>gi|147899914|ref|NP_001087630.1| histone-lysine N-methyltransferase SETD1B [Xenopus laevis]
gi|82234463|sp|Q66J90.1|SET1B_XENLA RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|51703454|gb|AAH81016.1| MGC81602 protein [Xenopus laevis]
Length = 1938
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 87/235 (37%), Positives = 127/235 (54%), Gaps = 17/235 (7%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE Y ++ + L R + P G + S R G
Sbjct: 1716 TGCARSEGYYKIDKKDK-----LKYLINNRSLADEPPIDTQG---KSIPAQPQASTRA-G 1766
Query: 667 SKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDM 721
S+ R +F D K+ +K FRK+ + F KS IH +G+FA P A +M
Sbjct: 1767 SERRSEQRRLLSSFTGSCDSDLLKFNQLK--FRKKKIRFCKSHIHDWGLFAMEPIVADEM 1824
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEY G+ +R IAD RE + +G+ +YMFR+D + +IDAT+ G+ A INHSC PN
Sbjct: 1825 VIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTIIDATKCGNFARFINHSCNPN 1883
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
CY++V++V + I+I++K+ I EE+TYDY+F D ++ C CG CRG +N
Sbjct: 1884 CYAKVVTVESQKKIVIYSKQYINVNEEITYDYKFPIEDVKIPCLCGAENCRGTLN 1938
>gi|114607127|ref|XP_001172835.1| PREDICTED: bromodomain and PHD finger-containing protein 3 isoform
3 [Pan troglodytes]
gi|410300152|gb|JAA28676.1| bromodomain and PHD finger containing, 3 [Pan troglodytes]
Length = 1205
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|395832260|ref|XP_003789191.1| PREDICTED: bromodomain and PHD finger-containing protein 3
[Otolemur garnettii]
Length = 1205
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCALC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|50546869|ref|XP_500904.1| YALI0B14883p [Yarrowia lipolytica]
gi|74689791|sp|Q6CEK8.1|SET1_YARLI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49646770|emb|CAG83155.1| YALI0B14883p [Yarrowia lipolytica CLIB122]
Length = 1170
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 66/144 (45%), Positives = 96/144 (66%), Gaps = 3/144 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE+VR IAD RE S +G+ +Y+F
Sbjct: 1028 KKPVKFARSAIHNWGLYAIEPIAANEMIIEYVGEVVRQEIADLREARYMRSGIGS-SYLF 1086
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ V+DAT+ G IA INH C P+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 1087 RVDESTVVDATKRGGIARFINHCCTPSCTAKIIKVEGQKRIVIYASRDIAANEELTYDYK 1146
Query: 815 FFSI--DEQLACYCGFPRCRGVVN 836
F +E++ C CG P C+G +N
Sbjct: 1147 FEKEIGEERIPCLCGAPGCKGYLN 1170
>gi|397496259|ref|XP_003818959.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain and PHD
finger-containing protein 3 [Pan paniscus]
Length = 1205
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|380811766|gb|AFE77758.1| bromodomain and PHD finger-containing protein 3 [Macaca mulatta]
gi|383417557|gb|AFH31992.1| bromodomain and PHD finger-containing protein 3 [Macaca mulatta]
gi|384946470|gb|AFI36840.1| bromodomain and PHD finger-containing protein 3 [Macaca mulatta]
Length = 1205
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|355561634|gb|EHH18266.1| hypothetical protein EGK_14832 [Macaca mulatta]
Length = 1205
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|281338425|gb|EFB14009.1| hypothetical protein PANDA_002219 [Ailuropoda melanoleuca]
Length = 1206
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|345778685|ref|XP_538883.3| PREDICTED: bromodomain and PHD finger-containing protein 3 [Canis
lupus familiaris]
Length = 1207
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG + + WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYG-VPYIPEGQWLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|347827280|emb|CCD42977.1| similar to histone-lysine N-methyltransferase [Botryotinia
fuckeliana]
Length = 1265
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A DM+IEY GE VR +AD RE+ S +G+ +Y+F
Sbjct: 1122 KKPVKFARSAIHNWGLYAMENIAMNDMIIEYVGEKVRQQVADLRENRYLKSGIGS-SYLF 1180
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I+V + I+I+A RDI Q EELTYDY+
Sbjct: 1181 RIDENTVIDATKKGGIARFINHSCMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYK 1240
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +++ C CG P C+G +N
Sbjct: 1241 FEREIGSTDRIPCLCGTPACKGFLN 1265
>gi|426352889|ref|XP_004043936.1| PREDICTED: bromodomain and PHD finger-containing protein 3 [Gorilla
gorilla gorilla]
Length = 1205
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|156061849|ref|XP_001596847.1| hypothetical protein SS1G_03070 [Sclerotinia sclerotiorum 1980]
gi|154700471|gb|EDO00210.1| hypothetical protein SS1G_03070 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1264
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A DM+IEY GE VR +AD RE+ S +G+ +Y+F
Sbjct: 1121 KKPVKFARSAIHNWGLYAMENIAMNDMIIEYVGEKVRQQVADLRENRYLKSGIGS-SYLF 1179
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I+V + I+I+A RDI Q EELTYDY+
Sbjct: 1180 RIDENTVIDATKKGGIARFINHSCMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYK 1239
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +++ C CG P C+G +N
Sbjct: 1240 FEREIGSTDRIPCLCGTPACKGFLN 1264
>gi|444725564|gb|ELW66128.1| Bromodomain and PHD finger-containing protein 3 [Tupaia chinensis]
Length = 1330
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 342 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 399
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 400 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 459
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 460 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 514
Query: 578 QP 579
P
Sbjct: 515 PP 516
>gi|168273214|dbj|BAG10446.1| bromodomain and PHD finger-containing protein 3 [synthetic
construct]
Length = 1205
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|402866807|ref|XP_003897565.1| PREDICTED: bromodomain and PHD finger-containing protein 3 [Papio
anubis]
Length = 1205
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|332259687|ref|XP_003278916.1| PREDICTED: bromodomain and PHD finger-containing protein 3
[Nomascus leucogenys]
Length = 1205
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|148727368|ref|NP_056510.2| bromodomain and PHD finger-containing protein 3 [Homo sapiens]
gi|71153496|sp|Q9ULD4.2|BRPF3_HUMAN RecName: Full=Bromodomain and PHD finger-containing protein 3
gi|119624283|gb|EAX03878.1| bromodomain and PHD finger containing, 3, isoform CRA_b [Homo
sapiens]
Length = 1205
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|440910380|gb|ELR60182.1| Bromodomain and PHD finger-containing protein 3 [Bos grunniens
mutus]
Length = 1206
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|301756985|ref|XP_002914339.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Ailuropoda melanoleuca]
Length = 1207
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|119624285|gb|EAX03880.1| bromodomain and PHD finger containing, 3, isoform CRA_d [Homo
sapiens]
gi|119624287|gb|EAX03882.1| bromodomain and PHD finger containing, 3, isoform CRA_d [Homo
sapiens]
Length = 1204
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|297677962|ref|XP_002816851.1| PREDICTED: bromodomain and PHD finger-containing protein 3 isoform
2 [Pongo abelii]
Length = 1205
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|6331389|dbj|BAA86600.1| KIAA1286 protein [Homo sapiens]
Length = 1214
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 224 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 281
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 282 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 341
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 342 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 394
Query: 576 HKQP 579
H P
Sbjct: 395 HSPP 398
>gi|355748502|gb|EHH52985.1| hypothetical protein EGM_13536 [Macaca fascicularis]
Length = 1205
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|348576332|ref|XP_003473941.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Cavia porcellus]
Length = 1208
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|329663705|ref|NP_001192556.1| bromodomain and PHD finger-containing protein 3 [Bos taurus]
gi|296474528|tpg|DAA16643.1| TPA: CG1845-like [Bos taurus]
Length = 1207
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG + + WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYG-VPYIPEGQWLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|291396103|ref|XP_002714704.1| PREDICTED: bromodomain and PHD finger containing, 3 [Oryctolagus
cuniculus]
Length = 1207
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG + + WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYG-VPYIPEGQWLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|410959028|ref|XP_003986114.1| PREDICTED: bromodomain and PHD finger-containing protein 3 isoform
2 [Felis catus]
Length = 872
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 387
Query: 578 QP 579
P
Sbjct: 388 PP 389
>gi|256088989|ref|XP_002580602.1| bromodomain-containing nuclear protein 1 brd1 [Schistosoma mansoni]
gi|360042774|emb|CCD78184.1| putative bromodomain-containing nuclear protein 1, brd1
[Schistosoma mansoni]
Length = 701
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 79/211 (37%), Positives = 111/211 (52%), Gaps = 16/211 (7%)
Query: 370 SSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG 429
S+ C+ T D+ G + + D C+VC N+ L CD C + VH CYG
Sbjct: 310 SNGCETTDVLPDIGGNHSGIDED----AVCAVCQDGTCENTNVILFCDVCNLAVHQECYG 365
Query: 430 ELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTD 489
G WLC C +P P C LCP GGA K TTD RWAH+ C +W+PE +
Sbjct: 366 VPYVPEGP-WLCRKCL-HSPSEPVSCVLCPNRGGAFKKTTDDRWAHVICGLWVPEVMFAN 423
Query: 490 VKRMEPIDGLNRVSKDRWKLLCSICGV-SYGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
+ +EP++G++R++ RW+L C IC + GACIQC ++C A+H CA+ AGL +++E
Sbjct: 424 LTFLEPLEGIDRIAAARWRLQCFICKQRNVGACIQCHKSSCYRAFHVTCAQHAGLYMKIE 483
Query: 549 DEDRLNLLSLDEDDEDQCIRLLSFCKKHKQP 579
D D D IR +FC +H P
Sbjct: 484 HTD---------DPGDLGIRKSAFCDQHCPP 505
>gi|256078227|ref|XP_002575398.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
gi|353230389|emb|CCD76560.1| putative mixed-lineage leukemia protein, mll [Schistosoma mansoni]
Length = 3002
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/140 (50%), Positives = 91/140 (65%), Gaps = 2/140 (1%)
Query: 697 RLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRI 756
R++ S IHG G+FA R +MVIEYTGEL+R I D RE + G YMFRI
Sbjct: 2865 RVSVHPSRIHGRGLFALREFREDEMVIEYTGELIRSIICDARE--LKYRATGVDCYMFRI 2922
Query: 757 DDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFF 816
D + VIDAT AG+ A INH+C+PNCY++V+S++ +HI+I A+R I EELTYDYRF
Sbjct: 2923 DPDWVIDATYAGNAARFINHACDPNCYAKVVSIDDKKHIVILAQRKIYPGEELTYDYRFP 2982
Query: 817 SIDEQLACYCGFPRCRGVVN 836
++L C CG CR +N
Sbjct: 2983 KESDKLPCNCGSYSCRKYLN 3002
>gi|219521531|gb|AAI43919.1| BRPF3 protein [Homo sapiens]
Length = 871
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 387
Query: 578 QP 579
P
Sbjct: 388 PP 389
>gi|344264382|ref|XP_003404271.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain and PHD
finger-containing protein 3-like [Loxodonta africana]
Length = 1227
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|431916823|gb|ELK16583.1| Bromodomain and PHD finger-containing protein 3 [Pteropus alecto]
Length = 1082
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 95 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 152
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 153 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 212
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 213 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 267
Query: 578 QP 579
P
Sbjct: 268 PP 269
>gi|213406125|ref|XP_002173834.1| PHD finger protein c [Schizosaccharomyces japonicus yFS275]
gi|212001881|gb|EEB07541.1| PHD finger protein c [Schizosaccharomyces japonicus yFS275]
Length = 691
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/233 (35%), Positives = 117/233 (50%), Gaps = 22/233 (9%)
Query: 388 PVRVDWKDLD-KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP 446
PVR + ++LD CS+C+ E NN + CD C + VH CYG + V W C CR
Sbjct: 179 PVRTEVEELDGSCSICNESECEHNNAIVFCDSCNLAVHQNCYG-IPFVPEGQWFCKKCR- 236
Query: 447 GAPEPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDR 506
AP+ C CP GA + T DGRW H CA+ IPE DV R++ + + + K R
Sbjct: 237 IAPDQIISCVCCPDHEGAFRTTVDGRWCHTLCAMAIPEVWFHDVPRLDLVRNVPMIPKSR 296
Query: 507 WKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQC 566
WKL+CSIC +GAC+QC+N +C VA+H CAR AGL ++ S + D
Sbjct: 297 WKLVCSICKQRWGACVQCTNKSCYVAFHITCARRAGLYYKIHQH------SPNYDS---- 346
Query: 567 IRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFG 619
+ L +C KH + + + + L + D+ A S P+NY
Sbjct: 347 VELECYCDKHTSASHLHVGMHKLLPIARKYYTDF---------AASVPFNYLS 390
>gi|148685638|gb|EDL17585.1| mCG141846, isoform CRA_b [Mus musculus]
Length = 1006
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 866 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 924
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 925 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 984
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 985 FPLEDNKIPCLCGTESCRGSLN 1006
>gi|426250203|ref|XP_004018827.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain and PHD
finger-containing protein 3 [Ovis aries]
Length = 1212
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG + + WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYG-VPYIPEGQWLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 HKQP 579
H P
Sbjct: 386 HSPP 389
>gi|168027908|ref|XP_001766471.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
patens]
gi|162682380|gb|EDQ68799.1| histone-lysine N-methyltransferase [Physcomitrella patens subsp.
patens]
Length = 2933
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/386 (30%), Positives = 166/386 (43%), Gaps = 72/386 (18%)
Query: 465 MKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQC 524
+K +WAH+ CA+W+P T ++ M D ++ V+ K+LCSIC GACIQC
Sbjct: 2469 LKDVAAKQWAHMVCALWMPGTRCLNMGTMGVFD-VSNVNASLRKMLCSICHRRGGACIQC 2527
Query: 525 SNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDRL 584
C H CA GL L S D + C+ H D
Sbjct: 2528 RVPNCSTPIHVWCAHEKGL-----------LQSEIVQDGSNRVGFFGKCQTH----GDYC 2572
Query: 585 AVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPY 644
+ V C Y S C R+E Y +G+ E A A +R+ +E
Sbjct: 2573 GSESDDV-AKEGICTYTRES----CVRTEGY-----KGKLSLEERAKAQ-QRMTLEGGLA 2621
Query: 645 LVGGYCQNGLSGNTLPSIRVIGSKFSF-----SLHRDAPNFLSMADKYKHMKETFRKRLA 699
+ L + +R I S+ S +L D +L ++K K KRLA
Sbjct: 2622 VTPEQVTAALRID----VRKISSRRSLKPATSALKCDQREYL----RFKQKKGW--KRLA 2671
Query: 700 FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE---HFIYNSLVGAGTYMFRI 756
KSGIH G++ G++V+EY GE+V +AD+RE H Y+FRI
Sbjct: 2672 VYKSGIHALGLYTTDFIAEGEVVVEYVGEIVGSRVADKREAEYHSGKRLQYQGACYLFRI 2731
Query: 757 DDERVIDATRAGSIAHLINHSCE---------------------------PNCYSRVISV 789
D E++IDATR G IA +NHSC PNC ++VI V
Sbjct: 2732 DTEQIIDATRKGGIARFVNHSCSLCMCGILEIANTSGLLGYGYLIAWNSWPNCVAKVICV 2791
Query: 790 NGDEHIIIFAKRDIKQWEELTYDYRF 815
+ ++ FAKR+I EE+TYDY+F
Sbjct: 2792 ENLKKVVFFAKRNIYAAEEVTYDYKF 2817
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 35/85 (41%), Gaps = 4/85 (4%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KC VC + N L C +C + VH CYG + G W C C+ P C L
Sbjct: 2276 KCDVCGIFTSASYNKLLCCSRCPVKVHQACYGVPKIPKGP-WSCRTCKFKVTN--PICVL 2332
Query: 458 CPVVGGAMKPTTDGR-WAHLACAIW 481
C GGAM R + H +W
Sbjct: 2333 CGYGGGAMTRVHKSRSFCHGLLRVW 2357
>gi|366990781|ref|XP_003675158.1| hypothetical protein NCAS_0B07030 [Naumovozyma castellii CBS 4309]
gi|342301022|emb|CCC68787.1| hypothetical protein NCAS_0B07030 [Naumovozyma castellii CBS 4309]
Length = 1082
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE S +G+ +Y+F
Sbjct: 939 KKPVTFARSAIHNWGLYALEPIAAREMIIEYVGESIRQPVAEMRERRYLKSGIGS-SYLF 997
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+P+C +++I V G I+I+A RDI EELTYDY+
Sbjct: 998 RVDENNVIDATKKGGIARFINHCCDPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYK 1057
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L CYCG C+G +N
Sbjct: 1058 FERELDAEERLPCYCGSVNCKGFLN 1082
>gi|67539250|ref|XP_663399.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
gi|74680884|sp|Q5B0Y5.1|SET1_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|40743698|gb|EAA62888.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
gi|259484715|tpe|CBF81174.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC
2.1.1.43)(COMPASS component SET1)(SET domain-containing
protein 1) [Source:UniProtKB/Swiss-Prot;Acc:Q5B0Y5]
[Aspergillus nidulans FGSC A4]
Length = 1220
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A +M+IEY GE VR +AD RE S +G+ +Y+F
Sbjct: 1077 KKPVRFARSAIHNWGLYAEVNISANEMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLF 1135
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++ EELTYDY+
Sbjct: 1136 RIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYK 1195
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + D+++ C CG C+G +N
Sbjct: 1196 FEREWDSDDRIPCLCGSAGCKGFLN 1220
>gi|443717120|gb|ELU08315.1| hypothetical protein CAPTEDRAFT_106550 [Capitella teleta]
Length = 806
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/152 (47%), Positives = 90/152 (59%), Gaps = 5/152 (3%)
Query: 399 CSVCHMDEEYQN-NLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
CS+C MD E QN N+ L CD C + VH CYG G WLC C +P CCL
Sbjct: 260 CSIC-MDGECQNTNVILFCDMCNLAVHQECYGVPYIPEG-QWLCRRCL-QSPSRAVKCCL 316
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV- 516
CP GGA K T+ RWAH+ CA+WIPE ++ +EPID ++R+ RWKL C IC
Sbjct: 317 CPNKGGAFKQTSTSRWAHVVCALWIPEVAFANIVFLEPIDNVDRIPAARWKLSCYICKQR 376
Query: 517 SYGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC T C A+H CA+ AGL +++E
Sbjct: 377 GTGACIQCHKTNCYTAFHVTCAQQAGLYMKIE 408
>gi|29437232|gb|AAH49883.1| Setd1a protein [Mus musculus]
Length = 849
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 709 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 767
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 768 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 827
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 828 FPLEDNKIPCLCGTESCRGSLN 849
>gi|452820772|gb|EME27810.1| histone-lysine N-methyltransferase isoform 2 [Galdieria
sulphuraria]
Length = 797
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 99/144 (68%), Gaps = 3/144 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+KR+ FG+S IHG+G++A + +IEY GE++R I+D RE + +G +YMF
Sbjct: 655 KKRVRFGRSIIHGWGLYAMQDIEPNEFIIEYVGEIIRQKISDEREKRYFRQGIG-DSYMF 713
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D++++IDATR GS+A +NHSCE N +++I+++ + I+ ++KR I+ EE+TYDY+
Sbjct: 714 RLDEDQIIDATRKGSVARFVNHSCESNAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYK 773
Query: 815 FFSIDE--QLACYCGFPRCRGVVN 836
F + DE ++ C CG P CR +N
Sbjct: 774 FNTEDENNKILCLCGAPTCRKFLN 797
>gi|402223721|gb|EJU03785.1| hypothetical protein DACRYDRAFT_64824, partial [Dacryopinax sp.
DJM-731 SS1]
Length = 490
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 98/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHA---RCYGELEPVNGVLWLCNLCRPGAPEPPPPC 455
C+VC E +N + CD C + VH +CYG + + WLC C +PE P C
Sbjct: 61 CAVCDDGEGENSNAIVFCDGCNLAVHQGACQCYG-IPYIPEGQWLCRKCT-ISPENPVSC 118
Query: 456 CLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICG 515
LCP GA K T G+W HL CA+W PE +T MEPI+G+ R+SK RW+L CSICG
Sbjct: 119 VLCPAEAGAFKQTNTGKWVHLLCALWNPEVTVTKGAYMEPIEGIERISKPRWRLACSICG 178
Query: 516 VSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKK 575
+ GACIQC +C A+H CAR GL ++ S E++ L FC+K
Sbjct: 179 IKKGACIQCQKASCATAFHVTCARQEGLLGSMK--------SFAEEEHS----LRVFCEK 226
Query: 576 H 576
H
Sbjct: 227 H 227
>gi|195347745|ref|XP_002040412.1| GM18944 [Drosophila sechellia]
gi|194121840|gb|EDW43883.1| GM18944 [Drosophila sechellia]
Length = 898
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 689 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 730
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 731 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 784
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 785 IEYIGEVIRTEVSEIREK-QYES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 842
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 843 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 898
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/138 (21%), Positives = 59/138 (42%), Gaps = 25/138 (18%)
Query: 231 GDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKI 289
G++ L++G+++ E F +I+P GY R + + P Y + E+
Sbjct: 544 GNMTFLNVGQLLPHQLEAFHTPHYIYPIGYKVSRYYWCVRRPNRRCRYICSI---AEAGC 600
Query: 290 RPLFRVTLDNG------EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESG 343
+P FR+ + + +F GS+PS W +I I + +G
Sbjct: 601 KPEFRIQVQDAGDKEPEREFRGSSPSAVWQQILQPITRLRK---------------VHNG 645
Query: 344 SDMFGFSNPEVMKLILGL 361
D+FG + P +++++ L
Sbjct: 646 EDLFGLTEPAIVRILESL 663
>gi|302910631|ref|XP_003050330.1| histone H3 methyltransferase complex protein [Nectria haematococca
mpVI 77-13-4]
gi|256731267|gb|EEU44617.1| histone H3 methyltransferase complex protein [Nectria haematococca
mpVI 77-13-4]
Length = 1281
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/256 (35%), Positives = 135/256 (52%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E EA S K
Sbjct: 1031 YVP--NPTGCARTEGVKKILNSEKSKYLPHHIKVQKARQEREARFKKSGKDAAAAAAEAA 1088
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSG 704
+ GN+ + R ++ L+ +D +K + RK+ + F +S
Sbjct: 1089 KLAAEKLIAKGNSR-ANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSA 1147
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G++A DM+IEY GE VR IA+ RE+ S +G+ +Y+FRID+ VIDA
Sbjct: 1148 IHNWGLYAMENIAKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGS-SYLFRIDENTVIDA 1206
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF----FSIDE 820
T+ G IA INHSC PNC +++I V G + I+I+A RDI EELTYDY+F S+D
Sbjct: 1207 TKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLD- 1265
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1266 RIPCLCGTAACKGFLN 1281
>gi|133902336|gb|ABO41859.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4137
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/144 (46%), Positives = 94/144 (65%), Gaps = 2/144 (1%)
Query: 672 SLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVR 731
S R L + +K++ +K+ R + +S IHG G+F + G+MVIEY+G ++R
Sbjct: 3987 SARRATSTDLPLPEKFRQLKKASRDAVGAYRSAIHGRGLFCRKNIEPGEMVIEYSGNVIR 4046
Query: 732 PSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG 791
+ D+RE + + G G YMFRIDD V+DAT G+ A INHSCEPNCYS V++V+G
Sbjct: 4047 SVLTDKREKYYDDK--GIGCYMFRIDDYEVVDATIHGNSARFINHSCEPNCYSHVVNVDG 4104
Query: 792 DEHIIIFAKRDIKQWEELTYDYRF 815
+HI+IFA R I + EELTYDY+F
Sbjct: 4105 QKHIVIFATRRIYKGEELTYDYKF 4128
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 40/99 (40%), Gaps = 22/99 (22%)
Query: 387 RPVRVDWKD--LDKCSVCHM-DEEYQNN-LFLQCDKCRMMVHARCYGELEPVNGV----L 438
RP W++ +C CH+ +YQ L+CDKCR H C G P +
Sbjct: 1592 RPHDEQWENWCCRRCRFCHVCGRKYQKTKQLLECDKCRNSYHPECLGPNHPTRPTKKKRV 1651
Query: 439 WLCNLCRPGAPEPPPPCCLCPVVGGAMKPTT--DGRWAH 475
W+C C C C GA KP D +W+H
Sbjct: 1652 WVCTKCV--------RCKSC----GATKPGKAWDAQWSH 1678
>gi|342887802|gb|EGU87231.1| hypothetical protein FOXB_02213 [Fusarium oxysporum Fo5176]
Length = 1258
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 88/256 (34%), Positives = 135/256 (52%), Gaps = 28/256 (10%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P++ ++ R+E EA + +
Sbjct: 1012 YVP--NPTGCARTEGVKKILNSEKSKYLPHHIKVQKARQEREARNKSGKDAAADAAR--- 1066
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSG 704
+ + ++ R ++ L+ +D +K + RK+ + F +S
Sbjct: 1067 --IAIEKQAANDSERQKRATNRRYVQDLNDQKKTLGQDSDIFKFNQLKKRKKPVKFARSA 1124
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G++A DM+IEY GE VR IA+ RE+ S +G+ +Y+FRIDD VIDA
Sbjct: 1125 IHNWGLYAMENIAKDDMIIEYVGEQVRQQIAEIRENRYLKSGIGS-SYLFRIDDNTVIDA 1183
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF----FSIDE 820
T+ G IA INHSCEPNC +++I V G + I+I+A +DI EELTYDY+F S+D
Sbjct: 1184 TKKGGIARFINHSCEPNCTAKIIKVEGSKRIVIYALQDIAMSEELTYDYKFEREIGSLD- 1242
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1243 RIPCLCGTAACKGFLN 1258
>gi|119572565|gb|EAW52180.1| hCG1998636, isoform CRA_b [Homo sapiens]
Length = 1189
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1049 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1107
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1108 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1167
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1168 FPLEDNKIPCLCGTESCRGSLN 1189
>gi|148690644|gb|EDL22591.1| mCG18535 [Mus musculus]
Length = 1199
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 98/181 (54%), Gaps = 12/181 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTTF-------TVRKTAYCEA 385
Query: 576 H 576
H
Sbjct: 386 H 386
>gi|452820773|gb|EME27811.1| histone-lysine N-methyltransferase isoform 1 [Galdieria
sulphuraria]
Length = 769
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 99/144 (68%), Gaps = 3/144 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+KR+ FG+S IHG+G++A + +IEY GE++R I+D RE + +G +YMF
Sbjct: 627 KKRVRFGRSIIHGWGLYAMQDIEPNEFIIEYVGEIIRQKISDEREKRYFRQGIG-DSYMF 685
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D++++IDATR GS+A +NHSCE N +++I+++ + I+ ++KR I+ EE+TYDY+
Sbjct: 686 RLDEDQIIDATRKGSVARFVNHSCESNAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYK 745
Query: 815 FFSIDE--QLACYCGFPRCRGVVN 836
F + DE ++ C CG P CR +N
Sbjct: 746 FNTEDENNKILCLCGAPTCRKFLN 769
>gi|301090958|ref|XP_002895674.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
gi|262097084|gb|EEY55136.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
Length = 2943
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 102/158 (64%), Gaps = 3/158 (1%)
Query: 681 LSMADKYKHMKE-TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE 739
L + +Y+ ++ F +R+ KS IHG+G+F K P G M++EY G+++ +IAD RE
Sbjct: 2787 LPITMQYRELRRRPFDERMLVRKSSIHGYGLFLKEPVSDGQMIVEYQGQMINQTIADERE 2846
Query: 740 HFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDE-HIIIF 798
VG+ YMFR+D++ +IDATR G++A INHSC+P ++R+++V G E I+IF
Sbjct: 2847 RRYEEQGVGS-CYMFRLDEKTIIDATRCGNLARFINHSCDPKAFARIVAVEGGEKKIVIF 2905
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
AKR I +E+TYDY+F DE + C C P C G +N
Sbjct: 2906 AKRAIAVGDEVTYDYKFPIEDEAIRCDCNAPNCIGRMN 2943
>gi|124486783|ref|NP_001074784.1| bromodomain and PHD finger-containing protein 3 [Mus musculus]
gi|187956992|gb|AAI57916.1| Bromodomain and PHD finger containing, 3 [Mus musculus]
Length = 1204
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 12/181 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG + + WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYG-VPYIPEGQWLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTTF-------TVRKTAYCEA 385
Query: 576 H 576
H
Sbjct: 386 H 386
>gi|297791815|ref|XP_002863792.1| hypothetical protein ARALYDRAFT_494795 [Arabidopsis lyrata subsp.
lyrata]
gi|297309627|gb|EFH40051.1| hypothetical protein ARALYDRAFT_494795 [Arabidopsis lyrata subsp.
lyrata]
Length = 1414
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/153 (46%), Positives = 99/153 (64%), Gaps = 5/153 (3%)
Query: 679 NFLSMADKYKHMK----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ AD +K + +KRL F +S IH +G+ A P A D VIEY GEL+R SI
Sbjct: 1232 NLLAAADGADVLKISQLKARKKRLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSI 1291
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
++ RE Y + +Y+FR+DD V+DAT+ G IA INHSCEPNCY+++ISV G +
Sbjct: 1292 SEIRER-QYEKMGIGSSYLFRLDDGYVVDATKRGGIARFINHSCEPNCYTKIISVEGKKK 1350
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCG 827
I I+AKR I EE++Y+Y+F D+++ C CG
Sbjct: 1351 IFIYAKRHIDAGEEISYNYKFPLEDDKIPCNCG 1383
>gi|21064853|gb|AAM29656.1| SD13650p [Drosophila melanogaster]
Length = 1019
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 810 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 851
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 852 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 905
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 906 IEYIGEVIRTEVSEIREK-QYES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 963
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 964 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1019
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 62/138 (44%), Gaps = 15/138 (10%)
Query: 231 GDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKI 289
G++ L++G+++ E F +I+P GY R + + P Y + E+
Sbjct: 655 GNMTFLNVGQLLPHQLEAFHTPHYIYPIGYKVSRYYWCVRRPNRRCRYICSI---AEAGC 711
Query: 290 RPLFRVTLDNG------EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESG 343
+P FR+ + + +F GS+PS W +I I + + + I SG
Sbjct: 712 KPEFRIQVQDAGDKEPEREFRGSSPSAVWQQILQPITRLRKVHK---WLQLFPQHI--SG 766
Query: 344 SDMFGFSNPEVMKLILGL 361
D+FG + P +++++ L
Sbjct: 767 EDLFGLTEPAIVRILESL 784
>gi|301622880|ref|XP_002940755.1| PREDICTED: hypothetical protein LOC100487270 [Xenopus (Silurana)
tropicalis]
Length = 1817
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1677 KKKLRFGRSHIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYSQQGIGS-SYLF 1735
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1736 RMDQDTIIDATKCGNLARFINHCCSPNCYAKVITIESQKKIVIYSKQPIGINEEITYDYK 1795
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1796 FPLEDNKIPCLCGTENCRGTLN 1817
>gi|47217812|emb|CAG07226.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1884
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1744 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADNREKRYAQQGIGS-SYLF 1802
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1803 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQAIAVNEEITYDYK 1862
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F + ++ C CG CRG +N
Sbjct: 1863 FPLEENKIPCLCGTENCRGTLN 1884
>gi|195553639|ref|XP_002076709.1| GD11928 [Drosophila simulans]
gi|194202088|gb|EDX15664.1| GD11928 [Drosophila simulans]
Length = 374
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/258 (34%), Positives = 141/258 (54%), Gaps = 35/258 (13%)
Query: 607 SGCARSEPYNYFGRRGR-KEPEALAAASLKRLFVEN---QPYLVGGYCQNGLSGNTLPSI 662
+GCAR+E + R + K A A+ + F E+ +P + + N L + +
Sbjct: 124 TGCARTEGFYKLDVREKAKHKYHHAKANTEDSFNEDRSDEPTALTNHHHNKL----ISKM 179
Query: 663 RVIGSKFSFSLHRDAPNFLSMAD----KYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
+ I + + R F SM + K+ +K FRK+ L F KS IH +G+FA P
Sbjct: 180 QGISREARSNQRRLLTAFGSMGESELLKFNQLK--FRKKQLKFAKSAIHDWGLFAMEPIA 237
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+++RP +AD RE Y ++ +Y+FRID E +IDAT+ G++A INHS
Sbjct: 238 ADEMVIEYVGQMIRPVVADLRE-TKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 296
Query: 778 C-------------------EPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSI 818
C +PNCY++VI++ ++ I+I++K+ I EE+TYDY+F
Sbjct: 297 CNVSITGYFDIMLPTYNFVFQPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLE 356
Query: 819 DEQLACYCGFPRCRGVVN 836
+E++ C CG CRG +N
Sbjct: 357 EEKIPCLCGAQGCRGTLN 374
>gi|47223666|emb|CAF99275.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1830
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 125/231 (54%), Gaps = 11/231 (4%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIG 666
+GCARSE G + + + RL E P G + S R G
Sbjct: 1610 TGCARSE-----GYYKIDKKDKIKYLQSTRLQSEEPPVDTQGMS---IPAQVHASTRA-G 1660
Query: 667 SKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEY 725
S+ R +F +D K + FRK+ + F KS IH +G+FA P A +MVIEY
Sbjct: 1661 SERRSEQRRLLSSFACDSDLLKFNQLKFRKKKIRFCKSHIHDWGLFALEPIAADEMVIEY 1720
Query: 726 TGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSR 785
G+ +R IAD RE +G+ +YMFR+D + +IDAT+ G+ A INHSC PNCY++
Sbjct: 1721 VGQNIRQVIADMREKRYEEEGIGS-SYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAK 1779
Query: 786 VISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
VI+V + I+I++++ I EE+TYDY+F D ++ C CG CRG +N
Sbjct: 1780 VITVESQKKIVIYSRQPINVNEEITYDYKFPIEDVKIPCLCGAENCRGTLN 1830
>gi|195477920|ref|XP_002100342.1| GE16999 [Drosophila yakuba]
gi|194187866|gb|EDX01450.1| GE16999 [Drosophila yakuba]
Length = 1421
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 1212 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 1253
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 1254 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 1307
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 1308 IEYIGEVIRTEVSEIREK-QYES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 1365
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 1366 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1421
>gi|429862241|gb|ELA36898.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
Nara gc5]
Length = 1270
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 104/159 (65%), Gaps = 7/159 (4%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S A K+ +K+ +K + F +S IH +G++ + DM+IEY GE VR SI++ RE
Sbjct: 1115 SDAFKFNQLKKR-KKPVKFARSAIHNWGLYTEENINKDDMIIEYVGEQVRQSISEIREKR 1173
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
S +G+ +Y+FRIDD VIDAT+ G IA INHSC PNC +++I V+G + I+I+A R
Sbjct: 1174 YLKSGMGS-SYLFRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALR 1232
Query: 802 DIKQWEELTYDYRF----FSIDEQLACYCGFPRCRGVVN 836
DI Q EELTYDY+F S+D ++ C CG C+G +N
Sbjct: 1233 DIAQHEELTYDYKFEREIGSLD-RIPCLCGTAACKGFLN 1270
>gi|297290685|ref|XP_002803774.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Macaca mulatta]
Length = 1115
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 130 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 187
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 188 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 247
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 248 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 302
Query: 578 QP 579
P
Sbjct: 303 PP 304
>gi|16118405|gb|AAL12215.1| trithorax 4 [Arabidopsis thaliana]
Length = 285
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 45/303 (14%)
Query: 560 EDDEDQCIRLLSFCKKHKQPLNDRLAVDE--------------------RLVQVTRRCCD 599
E + Q +++S+C H+ P D + + + RL+ + R D
Sbjct: 2 EKNGQQITKMVSYCAYHRAPNPDNVLIIQTPSGAFSAKSLVQNEKKGGSRLISLIRED-D 60
Query: 600 YIPPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSG-NT 658
P N C +P++ R K S KR+ E P+ G + + T
Sbjct: 61 EAPAENTITC---DPFSAARCRVFKR----KINSKKRIEEEAIPHHTRGPRHHASTAVQT 113
Query: 659 LPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRA 718
L + R + + +F S ++ H++ T R+ FG+SGIHG+G+F + +
Sbjct: 114 LNTFRHVPE--------EPKSFSSFRERLHHLQRTEMDRVCFGRSGIHGWGLFGRRNIQE 165
Query: 719 GDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSC 778
G+MV+EY GE VR SIAD RE VG Y+F+I +E V+DAT G+IA LINHSC
Sbjct: 166 GEMVLEYRGEQVRGSIADLRE--ARYRRVGKDCYLFKISEELVVDATDKGNIARLINHSC 223
Query: 779 EPNCYSRVISVNGDE-HIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRG 833
PNCY+R++SV +E I++ AK ++ EELTYDY F DE ++ C C P CR
Sbjct: 224 TPNCYARIMSVGDEESRIVLIAKANVAVGEELTYDY-LFDPDEAEELKVPCLCKAPNCRK 282
Query: 834 VVN 836
+N
Sbjct: 283 FMN 285
>gi|384244738|gb|EIE18236.1| hypothetical protein COCSUDRAFT_10830, partial [Coccomyxa
subellipsoidea C-169]
Length = 151
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 92/153 (60%), Gaps = 6/153 (3%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLC--RPGAPEPPPPC 455
CSVC D ++ + + CD C + VH CYG E P +WLC C + P C
Sbjct: 1 CSVCDSDVDFDFDQLVSCDACGITVHQSCYGVAELPGVDDMWLCRACELKVRRDAKAPQC 60
Query: 456 CLCPVVGGAMKPTTD-GRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSIC 514
CLCPV GGA+KP TD G WAH AC WIPE + DV RMEP+ + + K+RW LLC IC
Sbjct: 61 CLCPVTGGALKPATDKGLWAHAACMQWIPEVTVEDVSRMEPVSHIKSIQKERWDLLCVIC 120
Query: 515 GVSYGACIQCSNTTCRVAYHPLCARAAGLCVEL 547
GA IQC T+C AYHPLCAR AGL +E+
Sbjct: 121 KQRVGAKIQC--TSCYTAYHPLCARIAGLHMEI 151
>gi|380494835|emb|CCF32851.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 1257
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 104/159 (65%), Gaps = 7/159 (4%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S A K+ +K+ +K + F +S IH +G++ + DM+IEY GE VR SI++ RE
Sbjct: 1102 SDAFKFNQLKKR-KKPVKFARSAIHNWGLYTEENINKDDMIIEYVGEQVRQSISEIREKR 1160
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
S +G+ +Y+FRIDD VIDAT+ G IA INHSC PNC +++I V+G + I+I+A R
Sbjct: 1161 YLKSGMGS-SYLFRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALR 1219
Query: 802 DIKQWEELTYDYRF----FSIDEQLACYCGFPRCRGVVN 836
DI Q EELTYDY+F S+D ++ C CG C+G +N
Sbjct: 1220 DIGQHEELTYDYKFEREIGSLD-RIPCLCGTAACKGFLN 1257
>gi|449666506|ref|XP_002161122.2| PREDICTED: uncharacterized protein LOC100198749 [Hydra
magnipapillata]
Length = 1403
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 67/142 (47%), Positives = 95/142 (66%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K L F KS IH +G+FA P A +MV EY G++VR +A+ RE +G+ +Y+F
Sbjct: 1263 KKALRFSKSSIHNWGLFACEPINADEMVCEYVGQMVRSIVAEIRERRYEKQGIGS-SYLF 1321
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + VIDAT+ G A INH C+PNCY++VI V G + I+I+++R IK EE+TYDY+
Sbjct: 1322 RLDSDSVIDATKDGCNARFINHCCDPNCYAKVILVEGAKKIVIYSRRAIKLGEEITYDYK 1381
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F DE++ C CG CRG +N
Sbjct: 1382 FPIEDEKIPCLCGAALCRGTLN 1403
>gi|348510108|ref|XP_003442588.1| PREDICTED: hypothetical protein LOC100694318 [Oreochromis niloticus]
Length = 2205
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 2065 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADNREKRYAQQGIGS-SYLF 2123
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 2124 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIAVNEEITYDYK 2183
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F + ++ C CG CRG +N
Sbjct: 2184 FPLEENKIPCLCGTENCRGTLN 2205
>gi|328792724|ref|XP_396822.4| PREDICTED: hypothetical protein LOC413377 [Apis mellifera]
Length = 1859
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 99/179 (55%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG +G WLC C + P C LC
Sbjct: 251 CDVCRSPDSEEGNEMVFCDCCNICVHQACYGITSIPDGS-WLCRTC---SLSQRPDCVLC 306
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + V+RMEPI ++ + + RW L+C +C
Sbjct: 307 PNKGGAMKCTRSGQKWAHVSCALWIPEVSIGCVERMEPITKISSIPQSRWALICVLCRER 366
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS TC+ AYH CA GL ++ ED + D ++L S+C+KH
Sbjct: 367 VGACIQCSIKTCKTAYHVTCAFKYGLEMKAIIEDEM---------ADDGVKLRSYCQKH 416
>gi|168037139|ref|XP_001771062.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
gi|162677595|gb|EDQ64063.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
Length = 2607
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 63/138 (45%), Positives = 93/138 (67%), Gaps = 1/138 (0%)
Query: 693 TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTY 752
+ +K L F +S IH +G+ A A D VIEY GE++R +++ RE Y + +Y
Sbjct: 2470 SMKKHLKFQRSKIHDWGLLALESIEAEDFVIEYVGEIIRRQVSNFRER-QYEIMGIGSSY 2528
Query: 753 MFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYD 812
+FR+DDE V+DAT+ G +A INHSC PNCY+++I+V G + ++I++KR I EELTYD
Sbjct: 2529 LFRVDDELVVDATQKGGLARFINHSCNPNCYTKIITVEGRKKVVIYSKRAIGAGEELTYD 2588
Query: 813 YRFFSIDEQLACYCGFPR 830
Y+F D+++ CYCG PR
Sbjct: 2589 YKFSLEDKKIPCYCGAPR 2606
>gi|9759476|dbj|BAB10481.1| unnamed protein product [Arabidopsis thaliana]
Length = 1421
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 99/155 (63%), Gaps = 5/155 (3%)
Query: 679 NFLSMADKYKHMK----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ AD +K + +K L F +S IH +G+ A P A D VIEY GEL+R SI
Sbjct: 1243 NLLAAADGADVLKMSQLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSI 1302
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
++ RE Y + +Y+FR+DD V+DAT+ G IA INHSCEPNCY+++ISV G +
Sbjct: 1303 SEIRER-QYEKMGIGSSYLFRLDDGYVLDATKRGGIARFINHSCEPNCYTKIISVEGKKK 1361
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFP 829
I I+AKR I EE++Y+Y+F D+++ C CG P
Sbjct: 1362 IFIYAKRHIDAGEEISYNYKFPLEDDKIPCNCGAP 1396
>gi|324499811|gb|ADY39929.1| Histone-lysine N-methyltransferase trr [Ascaris suum]
Length = 2347
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 90/250 (36%), Positives = 131/250 (52%), Gaps = 20/250 (8%)
Query: 605 NPSGCARSEPY--NYFGRRGRKEPEALAAASLKRLFV------ENQPYLVGGYCQNGLSG 656
NPSGCAR EP R + P A A + + + +++ G + NG+
Sbjct: 2100 NPSGCARCEPRFRTLIKHRHHRTPPAAVAPTQRSSSITDVVSRDSRTRGGGNWANNGVDA 2159
Query: 657 NTLPSIRVIGSKFSFSL---HRDAPNFLSMA--DKYKHMKETFRKRLAFGKSGIHGFGIF 711
+T + G L R P LS + +Y+ MK+ ++ + +S I G G++
Sbjct: 2160 STRAMLLASGLSPDMVLGPYARWEPGQLSGSYYTQYQKMKKEWKNNVYLARSRIQGLGLY 2219
Query: 712 AKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIA 771
AK M+IEY GE++R + + RE G YMFRID+ER++DAT AG A
Sbjct: 2220 AKRDIEMNAMIIEYKGEVIRSEVGEMREKKY--EAQNRGVYMFRIDEERLVDATMAGGPA 2277
Query: 772 HLINHSCEPNCYSRVISV--NGDE-HIIIFAKRDIKQWEELTYDYRFFSID--EQLACYC 826
INHSC+PNC +R+++ NGD+ III A R I EELTYDY+F D +++AC C
Sbjct: 2278 RYINHSCDPNCSTRIVASGPNGDDKKIIIIANRPISAGEELTYDYQFDIEDTADKIACLC 2337
Query: 827 GFPRCRGVVN 836
G P C+ +N
Sbjct: 2338 GAPNCQKWMN 2347
>gi|409041309|gb|EKM50795.1| hypothetical protein PHACADRAFT_213668 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1307
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/178 (39%), Positives = 93/178 (52%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 137 CAICDDSEGENTNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCILC 194
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G W HL CAIW+PET + + MEP+ G++R+ K RWKL C +C V
Sbjct: 195 PNEGGAFKQTVHGEWVHLLCAIWVPETRVANDVFMEPVTGVDRIPKQRWKLKCQLCDVRT 254
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC +C A+H CAR L L+ + + L ++C+KH
Sbjct: 255 GACIQCIKNSCFSAFHATCARKEKL-----------LMPMKASQGSEAPTLAAYCEKH 301
>gi|344228738|gb|EGV60624.1| histone H3-K4 methyltransferase Set1 [Candida tenuis ATCC 10573]
Length = 1037
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 98/145 (67%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K ++F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 894 KKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQQVAEHRERSYLKTGIGS-SYLF 952
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V+G + I+I+A RDI+ EELTYDY+
Sbjct: 953 RIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYK 1012
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + E++ C CG P C+G +N
Sbjct: 1013 FERETNDSERIRCLCGAPGCKGYLN 1037
>gi|310792530|gb|EFQ28057.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 1262
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 104/159 (65%), Gaps = 7/159 (4%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S A K+ +K+ +K + F +S IH +G++ + DM+IEY GE VR SI++ RE
Sbjct: 1107 SDAFKFNQLKKR-KKPVKFARSAIHNWGLYTEENINKDDMIIEYVGEQVRQSISEIREKR 1165
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
S +G+ +Y+FRIDD VIDAT+ G IA INHSC PNC +++I V+G + I+I+A R
Sbjct: 1166 YLKSGMGS-SYLFRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALR 1224
Query: 802 DIKQWEELTYDYRF----FSIDEQLACYCGFPRCRGVVN 836
DI Q EELTYDY+F S+D ++ C CG C+G +N
Sbjct: 1225 DIGQHEELTYDYKFEREIGSLD-RIPCLCGTAACKGFLN 1262
>gi|25395700|pir||H88444 protein C26E6.12 [imported] - Caenorhabditis elegans
Length = 1802
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 153/278 (55%), Gaps = 29/278 (10%)
Query: 563 EDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRG 622
ED +RL + K+ L D DE L V IP + +GC+R+ PY +
Sbjct: 1550 EDPLLRLNPI--RSKKGLPDAFYEDEELDGV-------IPVA--AGCSRARPYEKMTMKQ 1598
Query: 623 RKEPEALAAASLKRLFVENQPYLV-GGYCQNGLSGNTLPS--IRVIGSKFSFSLHRDAPN 679
++ + ++R E+ P + + + L S +R++ + SL DA N
Sbjct: 1599 KR-------SLVRRPDNESHPTAIFSERDETAIRHQHLASKDMRLLQRRLLTSLG-DANN 1650
Query: 680 FLSMADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
D +K + FRK++ F +S IHG+G++A +M++EY G+ +R +A+ R
Sbjct: 1651 -----DFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEER 1705
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E +G+ +Y+FRID VIDAT+ G+ A INHSC+PNCY++V+++ G++ I+I+
Sbjct: 1706 EKAYERRGIGS-SYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIY 1764
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
++ IK+ EE+TYDY+F D+++ C CG CRG +N
Sbjct: 1765 SRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1802
>gi|30694058|ref|NP_199055.2| histone-lysine N-methyltransferase SETD1 [Arabidopsis thaliana]
gi|332007422|gb|AED94805.1| histone-lysine N-methyltransferase SETD1 [Arabidopsis thaliana]
Length = 1423
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 99/155 (63%), Gaps = 5/155 (3%)
Query: 679 NFLSMADKYKHMK----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ AD +K + +K L F +S IH +G+ A P A D VIEY GEL+R SI
Sbjct: 1245 NLLAAADGADVLKMSQLKARKKHLRFQQSKIHDWGLVALEPIEAEDFVIEYVGELIRSSI 1304
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
++ RE Y + +Y+FR+DD V+DAT+ G IA INHSCEPNCY+++ISV G +
Sbjct: 1305 SEIRER-QYEKMGIGSSYLFRLDDGYVLDATKRGGIARFINHSCEPNCYTKIISVEGKKK 1363
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFP 829
I I+AKR I EE++Y+Y+F D+++ C CG P
Sbjct: 1364 IFIYAKRHIDAGEEISYNYKFPLEDDKIPCNCGAP 1398
>gi|406607680|emb|CCH40952.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 1071
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K ++F +S IH +G++A P +M+IEY GE +R +AD RE S +G+ +Y+F
Sbjct: 928 KKPVSFARSAIHNWGLYALEPIAQKEMIIEYVGERIRQQVADFREKAYLKSGIGS-SYLF 986
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C+P+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 987 RIDENTVIDATKKGGIARFINHCCQPSCTAKIIKVEGQKRIVIYALRDIGANEELTYDYK 1046
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1047 FERETNDNERVRCLCGAPGCKGYLN 1071
>gi|358349267|ref|XP_003638660.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
gi|355504595|gb|AES85798.1| Histone-lysine N-methyltransferase ATX3 [Medicago truncatula]
Length = 1149
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 78/169 (46%), Positives = 106/169 (62%), Gaps = 9/169 (5%)
Query: 673 LHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRP 732
LH DA F S + H+++T R+ FGKSGIHG+G+FA+ + GDMV+EY GE VR
Sbjct: 985 LH-DAEVFTSFKVRLHHLQKTENLRICFGKSGIHGWGLFARRDLQEGDMVVEYRGEQVRR 1043
Query: 733 SIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV-NG 791
S+AD RE + G Y+F+I +E VIDAT G+IA LINHSC PNCY+R++S+ N
Sbjct: 1044 SVADLREAKYRSE--GKDCYLFKISEEVVIDATHKGNIARLINHSCMPNCYARILSLGNQ 1101
Query: 792 DEHIIIFAKRDIKQWEELTYDYRFFSIDE----QLACYCGFPRCRGVVN 836
I++ AK ++ +ELTYDY F DE ++ C C CR +N
Sbjct: 1102 GNRIVLIAKTNVSAGQELTYDY-LFDPDEPEELKVPCLCRASNCRKFMN 1149
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 101/206 (49%), Gaps = 15/206 (7%)
Query: 382 LPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLC 441
L Y PV W ++C++C E+Y++N + C++C++ VH CYG + W+C
Sbjct: 548 LQEKYEPVHAKWTT-ERCAICRWIEDYEDNKIIICNRCQIAVHQECYGAKHVQDFTSWVC 606
Query: 442 NLCRPGAPEPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLN 500
+C P+ CCLCPV GGA+KPT + W H+ CA + E + + MEP G+
Sbjct: 607 RVCE--TPDVERECCLCPVKGGALKPTDVEMLWVHVTCAWFQREVVFKNPEVMEPALGIL 664
Query: 501 RVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDE 560
R+ + + C IC S+G+C C C +H +CA AG +EL +
Sbjct: 665 RIPPNSFVKTCVICKQSHGSCTSCCK--CSTHFHVMCASRAGYTMELHSMVK-------- 714
Query: 561 DDEDQCIRLLSFCKKHKQPLNDRLAV 586
+ + + +C H+ P D + V
Sbjct: 715 -NGTTITKKIIYCAVHRVPSPDSVLV 739
>gi|119572564|gb|EAW52179.1| hCG1998636, isoform CRA_a [Homo sapiens]
gi|119572567|gb|EAW52182.1| hCG1998636, isoform CRA_a [Homo sapiens]
Length = 1010
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 870 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 928
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 929 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 988
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 989 FPLEDNKIPCLCGTESCRGSLN 1010
>gi|348584344|ref|XP_003477932.1| PREDICTED: histone-lysine N-methyltransferase SETD1A-like [Cavia
porcellus]
Length = 1702
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1562 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1620
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1621 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1680
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1681 FPLEDNKIPCLCGTESCRGSLN 1702
>gi|34191762|gb|AAH33652.2| BRPF3 protein, partial [Homo sapiens]
Length = 602
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 191 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 248
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 249 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 308
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 309 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 363
Query: 578 QP 579
P
Sbjct: 364 PP 365
>gi|410903041|ref|XP_003965002.1| PREDICTED: uncharacterized protein LOC101071878 [Takifugu rubripes]
Length = 1776
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1636 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADNREKRYAQQGIGS-SYLF 1694
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1695 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQAIAVNEEITYDYK 1754
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F + ++ C CG CRG +N
Sbjct: 1755 FPLEENKIPCLCGTENCRGTLN 1776
>gi|17861882|gb|AAL39418.1| GM10003p [Drosophila melanogaster]
Length = 421
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 212 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSAAIA 253
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 254 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 307
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 308 IEYIGEVIRTEVSEIREK-QYESK-NRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 365
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 366 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 421
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 62/138 (44%), Gaps = 15/138 (10%)
Query: 231 GDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKI 289
G++ L++G+++ E F +I+P GY R + + P Y + E+
Sbjct: 57 GNMTFLNVGQLLPHQLEAFHTPHYIYPIGYKVSRYYWCVRRPNRRCRYICSI---AEAGC 113
Query: 290 RPLFRVTLDNG------EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESG 343
+P FR+ + + +F GS+PS W +I I + + + I SG
Sbjct: 114 KPEFRIQVQDAGDKEPEREFRGSSPSAVWQQILQPITRLRKVHK---WLQLFPQHI--SG 168
Query: 344 SDMFGFSNPEVMKLILGL 361
D+FG + P +++++ L
Sbjct: 169 EDLFGLTEPAIVRILESL 186
>gi|332262972|ref|XP_003280531.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Nomascus
leucogenys]
Length = 1675
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1535 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1593
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1594 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1653
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1654 FPLEDNKIPCLCGTESCRGSLN 1675
>gi|195170125|ref|XP_002025864.1| GL18348 [Drosophila persimilis]
gi|194110717|gb|EDW32760.1| GL18348 [Drosophila persimilis]
Length = 1046
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 122/236 (51%), Gaps = 30/236 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C N+
Sbjct: 837 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCSTQRMANSASIA 878
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
+ +S S + +YK MK+ +R + +S I G G++A M+
Sbjct: 879 GEVACPYSKQF------VHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMI 932
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC PNC
Sbjct: 933 IEYIGEVIRTEVSEIREK-QYES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNC 990
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ V+ D IIIFAKR I + EEL+YDY+F D+ ++ C CG P CR +N
Sbjct: 991 VTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCACGAPNCRKWMN 1046
>gi|6683126|dbj|BAA20797.2| KIAA0339 protein [Homo sapiens]
Length = 1709
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1569 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1627
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1628 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1687
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1688 FPLEDNKIPCLCGTESCRGSLN 1709
>gi|313247613|emb|CBY15785.1| unnamed protein product [Oikopleura dioica]
Length = 386
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 114/206 (55%), Gaps = 24/206 (11%)
Query: 647 GGYCQNGLSGNTLPSIRVIGSKFSF--SLHRDA----PNF-------LSMADKYKHMKET 693
G + G SG PS FSF S HR NF L + +++ M ET
Sbjct: 189 GCWRAQGWSGKRNPS-----DMFSFLNSTHRSGLTHDQNFRPTMETDLPLTMRFRKMVET 243
Query: 694 FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYM 753
R L S IHG G+FA+ P+ G++VIEY+G ++R + D RE + + G GTYM
Sbjct: 244 SRNSLQVLPSAIHGRGLFARRPYEPGELVIEYSGTVIRGELCDMREKYYDDR--GIGTYM 301
Query: 754 FRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY 813
FR+DD+ V+DAT G A INHSC PNC S++I+V+ +HI I A R I EELTYDY
Sbjct: 302 FRVDDDLVVDATMEGGRARFINHSCGPNCLSKIITVDSRKHICIIAGRFIDFGEELTYDY 361
Query: 814 RF---FSIDEQLACYCGFPRCRGVVN 836
+F F E++ C C P CR +N
Sbjct: 362 KFDRDFG-SERIECGCQAPNCRRFMN 386
>gi|297283866|ref|XP_002808342.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A-like [Macaca mulatta]
Length = 1704
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1564 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1622
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1623 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1682
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1683 FPLEDNKIPCLCGTESCRGSLN 1704
>gi|254585415|ref|XP_002498275.1| ZYRO0G06446p [Zygosaccharomyces rouxii]
gi|238941169|emb|CAR29342.1| ZYRO0G06446p [Zygosaccharomyces rouxii]
Length = 1015
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE S +G+ +Y+F
Sbjct: 872 KKPVTFARSTIHNWGLYALEPIAAKEMIIEYVGERIRQPVAEMREIRYLKSGIGS-SYLF 930
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH CEP+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 931 RIDENTVIDATKKGGIARFINHCCEPSCTAKIIKVGGMKRIVIYALRDIGANEELTYDYK 990
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 991 FEREIDAEERLPCLCGAPSCKGFLN 1015
>gi|296220008|ref|XP_002807469.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Callithrix jacchus]
Length = 1708
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1568 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1626
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1627 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1686
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1687 FPLEDNKIPCLCGTESCRGSLN 1708
>gi|351701131|gb|EHB04050.1| Protein Jade-1 [Heterocephalus glaber]
Length = 843
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDQGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|395845756|ref|XP_003795589.1| PREDICTED: protein Jade-1 isoform 1 [Otolemur garnettii]
gi|395845758|ref|XP_003795590.1| PREDICTED: protein Jade-1 isoform 2 [Otolemur garnettii]
Length = 842
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/209 (37%), Positives = 105/209 (50%), Gaps = 19/209 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKHS 369
Query: 578 QPLNDRLAVDERLVQVTRRCCDYIPPSNP 606
++ E Q PP NP
Sbjct: 370 SHRKPEESLGEGATQ--ENGAPECPPRNP 396
>gi|354497843|ref|XP_003511027.1| PREDICTED: histone-lysine N-methyltransferase SETD1A-like [Cricetulus
griseus]
gi|344247133|gb|EGW03237.1| Histone-lysine N-methyltransferase SETD1A [Cricetulus griseus]
Length = 1723
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1583 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1641
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1642 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1701
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1702 FPLEDNKIPCLCGTESCRGSLN 1723
>gi|119572569|gb|EAW52184.1| hCG1998636, isoform CRA_e [Homo sapiens]
Length = 1175
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1035 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1093
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1094 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1153
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1154 FPLEDNKIPCLCGTESCRGSLN 1175
>gi|55741677|ref|NP_055527.1| histone-lysine N-methyltransferase SETD1A [Homo sapiens]
gi|68052990|sp|O15047.3|SET1A_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD1A; AltName:
Full=Lysine N-methyltransferase 2F; AltName: Full=SET
domain-containing protein 1A; Short=hSET1A; AltName:
Full=Set1/Ash2 histone methyltransferase complex subunit
SET1
gi|168272962|dbj|BAG10320.1| SET domain-containing protein 1A [synthetic construct]
Length = 1707
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1567 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1625
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1626 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1685
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1686 FPLEDNKIPCLCGTESCRGSLN 1707
>gi|351711375|gb|EHB14294.1| Histone-lysine N-methyltransferase SETD1A [Heterocephalus glaber]
Length = 1502
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1362 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1420
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1421 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1480
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1481 FPLEDSKIPCLCGAESCRGSLN 1502
>gi|119508422|ref|NP_821172.2| SET domain containing 1A [Mus musculus]
Length = 1716
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1576 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1634
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1635 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1694
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1695 FPLEDNKIPCLCGTESCRGSLN 1716
>gi|357449257|ref|XP_003594905.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
gi|355483953|gb|AES65156.1| Histone-lysine N-methyltransferase SETD1B [Medicago truncatula]
Length = 1232
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 67/133 (50%), Positives = 89/133 (66%), Gaps = 1/133 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+KRL F +S IH +G+ A P A D VIEY GEL+RP I+D RE Y + +Y+F
Sbjct: 1076 KKRLRFQRSKIHDWGLVALEPIEAEDFVIEYIGELIRPRISDIRE-VQYEKMGIGSSYLF 1134
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+DD V+DAT+ G IA INHSCEPNCY +VIS G + I I+AKR I EE+TY+Y+
Sbjct: 1135 RLDDGYVVDATKRGGIARFINHSCEPNCYPKVISFEGQKKIFIYAKRHINAGEEITYNYK 1194
Query: 815 FFSIDEQLACYCG 827
F ++++ C CG
Sbjct: 1195 FPLEEKKIPCNCG 1207
>gi|242022466|ref|XP_002431661.1| PHD finger protein, putative [Pediculus humanus corporis]
gi|212516969|gb|EEB18923.1| PHD finger protein, putative [Pediculus humanus corporis]
Length = 2563
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 75/190 (39%), Positives = 102/190 (53%), Gaps = 18/190 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG + + WLC C A P C LC
Sbjct: 183 CDVCRSPDSEEGNEMVFCDACNICVHQACYG-ITSIPPGSWLCRTC---ALSKRPECVLC 238
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G WAH++CA+WIPE + V++MEPI ++ + + RW L+C +C
Sbjct: 239 PNKGGAMKCTKSGHEWAHVSCALWIPEVSIGCVEKMEPITKISNIPQSRWALICVLCRER 298
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVE--LEDEDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQCS TC+ AYH CA GL + +EDE + D ++L S+C+K
Sbjct: 299 VGACIQCSVKTCKTAYHVTCAFKHGLEMRAIIEDE-----------NADDGVKLRSYCQK 347
Query: 576 HKQPLNDRLA 585
H +R A
Sbjct: 348 HSVKKENRSA 357
>gi|452987901|gb|EME87656.1| hypothetical protein MYCFIDRAFT_212971 [Pseudocercospora fijiensis
CIRAD86]
Length = 954
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 103/199 (51%), Gaps = 11/199 (5%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KCS+C + N + CD C + VH CYG + + W C C+ P C
Sbjct: 425 KCSICDDGDCENANAIIFCDGCDLAVHQECYG-VPFIPEGQWFCRKCKEIG-RGTPTCIF 482
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP V GA K T+ RW+HL CAIWIPE + ++ MEPI +++V K RWKL C IC
Sbjct: 483 CPNVDGAFKQTSTLRWSHLLCAIWIPEVTIANMTFMEPIQDVDKVPKPRWKLSCYICEQK 542
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQC N TC A+H CAR A L ++++ + N S+D L +FC +H
Sbjct: 543 MGACIQCGNKTCYRAFHVTCARRARLFLKMKSQ---NQGSID------TTSLKAFCDRHV 593
Query: 578 QPLNDRLAVDERLVQVTRR 596
P R E +Q RR
Sbjct: 594 PPDWRRTHDTENAIQEARR 612
>gi|241948091|ref|XP_002416768.1| COMPASS complex histone methyltransferase subunit, putative;
histone-lysine n-methyltransferase, h3 lysine-4 specific,
putative [Candida dubliniensis CD36]
gi|223640106|emb|CAX44352.1| COMPASS complex histone methyltransferase subunit, putative [Candida
dubliniensis CD36]
Length = 1032
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 889 KKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGS-SYLF 947
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 948 RIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1007
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1008 FERETNDEERIRCLCGAPGCKGYLN 1032
>gi|114662089|ref|XP_001155055.1| PREDICTED: histone-lysine N-methyltransferase SETD1A isoform 2 [Pan
troglodytes]
gi|410289610|gb|JAA23405.1| SET domain containing 1A [Pan troglodytes]
Length = 1707
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1567 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1625
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1626 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1685
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1686 FPLEDNKIPCLCGTESCRGSLN 1707
>gi|402908170|ref|XP_003916826.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Papio anubis]
Length = 1707
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1567 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1625
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1626 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1685
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1686 FPLEDNKIPCLCGTESCRGSLN 1707
>gi|17552316|ref|NP_498041.1| Protein SET-2, isoform b [Caenorhabditis elegans]
gi|351058301|emb|CCD65735.1| Protein SET-2, isoform b [Caenorhabditis elegans]
Length = 739
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 153/278 (55%), Gaps = 29/278 (10%)
Query: 563 EDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRG 622
ED +RL + K+ L D DE L V IP + +GC+R+ PY +
Sbjct: 487 EDPLLRLNPI--RSKKGLPDAFYEDEELDGV-------IPVA--AGCSRARPYEKMTMKQ 535
Query: 623 RKEPEALAAASLKRLFVENQPYLV-GGYCQNGLSGNTLPS--IRVIGSKFSFSLHRDAPN 679
++ + ++R E+ P + + + L S +R++ + SL DA N
Sbjct: 536 KR-------SLVRRPDNESHPTAIFSERDETAIRHQHLASKDMRLLQRRLLTSLG-DANN 587
Query: 680 FLSMADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
D +K + FRK++ F +S IHG+G++A +M++EY G+ +R +A+ R
Sbjct: 588 -----DFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEER 642
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E +G+ +Y+FRID VIDAT+ G+ A INHSC+PNCY++V+++ G++ I+I+
Sbjct: 643 EKAYERRGIGS-SYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIY 701
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
++ IK+ EE+TYDY+F D+++ C CG CRG +N
Sbjct: 702 SRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 739
>gi|395845760|ref|XP_003795591.1| PREDICTED: protein Jade-1 isoform 3 [Otolemur garnettii]
Length = 830
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/209 (37%), Positives = 105/209 (50%), Gaps = 19/209 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKHS 357
Query: 578 QPLNDRLAVDERLVQVTRRCCDYIPPSNP 606
++ E Q PP NP
Sbjct: 358 SHRKPEESLGEGATQ--ENGAPECPPRNP 384
>gi|344294411|ref|XP_003418911.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A-like [Loxodonta africana]
Length = 1703
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1563 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1621
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1622 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1681
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1682 FPLEDNKIPCLCGTESCRGSLN 1703
>gi|194219010|ref|XP_001915080.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Equus caballus]
Length = 1707
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1567 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1625
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1626 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1685
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1686 FPLEDNKIPCLCGTESCRGSLN 1707
>gi|403276868|ref|XP_003930105.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Saimiri
boliviensis boliviensis]
Length = 1713
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1573 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1631
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1632 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1691
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1692 FPLEDNKIPCLCGTESCRGSLN 1713
>gi|410221312|gb|JAA07875.1| SET domain containing 1A [Pan troglodytes]
Length = 1707
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1567 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1625
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1626 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1685
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1686 FPLEDNKIPCLCGTESCRGSLN 1707
>gi|297674328|ref|XP_002815181.1| PREDICTED: protein Jade-1 isoform 2 [Pongo abelii]
gi|297674330|ref|XP_002815182.1| PREDICTED: protein Jade-1 isoform 3 [Pongo abelii]
Length = 842
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|440912082|gb|ELR61683.1| Protein Jade-1 [Bos grunniens mutus]
Length = 843
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|154422490|ref|XP_001584257.1| SET domain containing protein [Trichomonas vaginalis G3]
gi|121918503|gb|EAY23271.1| SET domain containing protein [Trichomonas vaginalis G3]
Length = 259
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 68/149 (45%), Positives = 100/149 (67%), Gaps = 3/149 (2%)
Query: 696 KRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFR 755
K L F KSGIH +G+F+ G+ ++EYTGELVR S+ + R+ + Y + G+Y+FR
Sbjct: 97 KYLRFEKSGIHLWGVFSACYFAPGEPIVEYTGELVRLSVTEARQKY-YETEGNHGSYIFR 155
Query: 756 IDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR- 814
+DD+ IDAT G IA +NHSC+PNC + V+ G HI+IFAK+ I+ +EELTYDY
Sbjct: 156 LDDDLYIDATHKGGIARFLNHSCDPNCKTCVVEAGGQRHIVIFAKKKIEPFEELTYDYNL 215
Query: 815 -FFSIDEQLACYCGFPRCRGVVNDTEAEE 842
+ S ++ + C CG P+CRG +N T+ ++
Sbjct: 216 PYESKEKAIVCLCGSPKCRGYLNYTDKKD 244
>gi|448106516|ref|XP_004200765.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|448109616|ref|XP_004201396.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|359382187|emb|CCE81024.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|359382952|emb|CCE80259.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
Length = 1062
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K ++F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 919 KKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQQVAEHRERSYLKTGIGS-SYLF 977
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V+G + I+I+A RDI + EELTYDY+
Sbjct: 978 RIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIDKNEELTYDYK 1037
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1038 FERETNDEERIRCLCGAPGCKGYLN 1062
>gi|329664196|ref|NP_001192362.1| histone-lysine N-methyltransferase SETD1A [Bos taurus]
Length = 1708
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1568 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1626
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1627 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1686
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1687 FPLEDNKIPCLCGTESCRGSLN 1708
>gi|20071601|gb|AAH27450.1| SETD1A protein [Homo sapiens]
Length = 469
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 329 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 387
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 388 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 447
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 448 FPLEDNKIPCLCGTESCRGSLN 469
>gi|426247055|ref|XP_004017302.1| PREDICTED: protein Jade-1 isoform 1 [Ovis aries]
Length = 843
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|17552318|ref|NP_498040.1| Protein SET-2, isoform a [Caenorhabditis elegans]
gi|30173238|sp|Q18221.2|SET2_CAEEL RecName: Full=Probable histone-lysine N-methyltransferase set-2;
AltName: Full=SET domain-containing protein 2
gi|351058300|emb|CCD65734.1| Protein SET-2, isoform a [Caenorhabditis elegans]
Length = 1507
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 153/278 (55%), Gaps = 29/278 (10%)
Query: 563 EDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRG 622
ED +RL + K+ L D DE L V IP + +GC+R+ PY +
Sbjct: 1255 EDPLLRLNPI--RSKKGLPDAFYEDEELDGV-------IPVA--AGCSRARPYEKMTMKQ 1303
Query: 623 RKEPEALAAASLKRLFVENQPYLV-GGYCQNGLSGNTLPS--IRVIGSKFSFSLHRDAPN 679
++ + ++R E+ P + + + L S +R++ + SL DA N
Sbjct: 1304 KR-------SLVRRPDNESHPTAIFSERDETAIRHQHLASKDMRLLQRRLLTSLG-DANN 1355
Query: 680 FLSMADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
D +K + FRK++ F +S IHG+G++A +M++EY G+ +R +A+ R
Sbjct: 1356 -----DFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEER 1410
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E +G+ +Y+FRID VIDAT+ G+ A INHSC+PNCY++V+++ G++ I+I+
Sbjct: 1411 EKAYERRGIGS-SYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIY 1469
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
++ IK+ EE+TYDY+F D+++ C CG CRG +N
Sbjct: 1470 SRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1507
>gi|40389479|tpe|CAE30493.1| TPA: putative Jade1 [Takifugu rubripes]
Length = 537
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 100/179 (55%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + NN + CDKC + VH CYG ++ V WLC +C G P C LC
Sbjct: 159 CDVCQSPDGEDNNEMVFCDKCNICVHQACYG-IQKVPKGSWLCRICALGIL---PKCQLC 214
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + + ++MEPI ++++ +RW L+C +C
Sbjct: 215 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGNPEKMEPITNVSQIPSNRWALICCLCKEK 274
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS CR A+H C A L +N + L DDE ++ S+C KH
Sbjct: 275 SGACIQCSAKNCRTAFHVTCGLHASL--------EMNTI-LTADDE---VKFKSYCPKH 321
>gi|17552320|ref|NP_498039.1| Protein SET-2, isoform c [Caenorhabditis elegans]
gi|351058302|emb|CCD65736.1| Protein SET-2, isoform c [Caenorhabditis elegans]
Length = 1510
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 153/278 (55%), Gaps = 29/278 (10%)
Query: 563 EDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPPSNPSGCARSEPYNYFGRRG 622
ED +RL + K+ L D DE L V IP + +GC+R+ PY +
Sbjct: 1258 EDPLLRLNPI--RSKKGLPDAFYEDEELDGV-------IPVA--AGCSRARPYEKMTMKQ 1306
Query: 623 RKEPEALAAASLKRLFVENQPYLV-GGYCQNGLSGNTLPS--IRVIGSKFSFSLHRDAPN 679
++ + ++R E+ P + + + L S +R++ + SL DA N
Sbjct: 1307 KR-------SLVRRPDNESHPTAIFSERDETAIRHQHLASKDMRLLQRRLLTSLG-DANN 1358
Query: 680 FLSMADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRR 738
D +K + FRK++ F +S IHG+G++A +M++EY G+ +R +A+ R
Sbjct: 1359 -----DFFKINQLKFRKKMIKFARSRIHGWGLYAMESIAPDEMIVEYIGQTIRSLVAEER 1413
Query: 739 EHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIF 798
E +G+ +Y+FRID VIDAT+ G+ A INHSC+PNCY++V+++ G++ I+I+
Sbjct: 1414 EKAYERRGIGS-SYLFRIDLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIY 1472
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
++ IK+ EE+TYDY+F D+++ C CG CRG +N
Sbjct: 1473 SRTIIKKGEEITYDYKFPIEDDKIDCLCGAKTCRGYLN 1510
>gi|395846353|ref|XP_003795872.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Otolemur
garnettii]
Length = 1706
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1566 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1624
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1625 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1684
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1685 FPLEDNKIPCLCGTESCRGSLN 1706
>gi|350581585|ref|XP_003481071.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Sus scrofa]
Length = 1550
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1410 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1468
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1469 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1528
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1529 FPLEDNKIPCLCGTESCRGSLN 1550
>gi|311262566|ref|XP_003129244.1| PREDICTED: protein Jade-1 isoform 1 [Sus scrofa]
gi|335293864|ref|XP_003357076.1| PREDICTED: protein Jade-1 isoform 2 [Sus scrofa]
Length = 842
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|431906860|gb|ELK10981.1| Histone-lysine N-methyltransferase SETD1A [Pteropus alecto]
Length = 1665
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1525 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1583
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1584 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1643
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1644 FPLEDNKIPCLCGTESCRGSLN 1665
>gi|126334524|ref|XP_001364653.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Monodelphis
domestica]
Length = 1706
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1566 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQVVADMREKRYVQEGIGS-SYLF 1624
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1625 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIEAQKKIVIYSKQPIGVDEEITYDYK 1684
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1685 FPLEDNKIPCLCGTESCRGSLN 1706
>gi|40556384|ref|NP_758507.3| protein Jade-1 [Mus musculus]
gi|194328765|ref|NP_001123656.1| protein Jade-1 [Mus musculus]
gi|194328767|ref|NP_001123657.1| protein Jade-1 [Mus musculus]
gi|194328769|ref|NP_001123658.1| protein Jade-1 [Mus musculus]
gi|116248177|sp|Q6ZPI0.2|JADE1_MOUSE RecName: Full=Protein Jade-1; AltName: Full=PHD finger protein 17
gi|23273896|gb|AAH20316.1| PHD finger protein 17 [Mus musculus]
gi|40389485|tpe|CAE30496.1| TPA: Jade1L protein [Mus musculus]
gi|74184642|dbj|BAE27932.1| unnamed protein product [Mus musculus]
Length = 834
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 207 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 262
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 263 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 322
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 323 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 369
>gi|444725782|gb|ELW66336.1| Histone-lysine N-methyltransferase SETD1A [Tupaia chinensis]
Length = 1344
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1204 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1262
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1263 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1322
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1323 FPLEDNKIPCLCGTESCRGSLN 1344
>gi|194385454|dbj|BAG65104.1| unnamed protein product [Homo sapiens]
Length = 842
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|26340050|dbj|BAC33688.1| unnamed protein product [Mus musculus]
Length = 834
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 207 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 262
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 263 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 322
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 323 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 369
>gi|426345459|ref|XP_004040429.1| PREDICTED: protein Jade-1 isoform 1 [Gorilla gorilla gorilla]
gi|426345461|ref|XP_004040430.1| PREDICTED: protein Jade-1 isoform 2 [Gorilla gorilla gorilla]
Length = 842
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|296473196|tpg|DAA15311.1| TPA: SET domain containing 1A [Bos taurus]
Length = 1708
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1568 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1626
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1627 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1686
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1687 FPLEDNKIPCLCGTESCRGSLN 1708
>gi|388453923|ref|NP_001253062.1| protein Jade-1 [Macaca mulatta]
gi|355687602|gb|EHH26186.1| hypothetical protein EGK_16088 [Macaca mulatta]
gi|355749563|gb|EHH53962.1| hypothetical protein EGM_14684 [Macaca fascicularis]
gi|380816672|gb|AFE80210.1| protein Jade-1 long isoform [Macaca mulatta]
gi|383421727|gb|AFH34077.1| protein Jade-1 long isoform [Macaca mulatta]
gi|384949518|gb|AFI38364.1| protein Jade-1 long isoform [Macaca mulatta]
Length = 843
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|27552784|gb|AAH42890.1| Setd1a protein, partial [Mus musculus]
Length = 458
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 318 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 376
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 377 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 436
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 437 FPLEDNKIPCLCGTESCRGSLN 458
>gi|426247057|ref|XP_004017303.1| PREDICTED: protein Jade-1 isoform 2 [Ovis aries]
Length = 831
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 356
>gi|40556393|ref|NP_955352.1| protein Jade-1 long isoform [Homo sapiens]
gi|114596011|ref|XP_001158765.1| PREDICTED: protein Jade-1 isoform 11 [Pan troglodytes]
gi|114596013|ref|XP_001158814.1| PREDICTED: protein Jade-1 isoform 12 [Pan troglodytes]
gi|397505202|ref|XP_003823159.1| PREDICTED: protein Jade-1 isoform 2 [Pan paniscus]
gi|74748786|sp|Q6IE81.1|JADE1_HUMAN RecName: Full=Protein Jade-1; AltName: Full=PHD finger protein 17
gi|40389495|tpe|CAE30500.1| TPA: JADE1L protein [Homo sapiens]
gi|119625576|gb|EAX05171.1| PHD finger protein 17, isoform CRA_b [Homo sapiens]
gi|119625580|gb|EAX05175.1| PHD finger protein 17, isoform CRA_b [Homo sapiens]
gi|410210268|gb|JAA02353.1| PHD finger protein 17 [Pan troglodytes]
gi|410260568|gb|JAA18250.1| PHD finger protein 17 [Pan troglodytes]
gi|410289292|gb|JAA23246.1| PHD finger protein 17 [Pan troglodytes]
gi|410337897|gb|JAA37895.1| PHD finger protein 17 [Pan troglodytes]
Length = 842
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|326666050|ref|XP_001920852.3| PREDICTED: hypothetical protein LOC556535 [Danio rerio]
Length = 2253
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 2113 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQSIRQMVADNREKRYAQEGIGS-SYLF 2171
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 2172 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYK 2231
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F + ++ C CG CRG +N
Sbjct: 2232 FPIEENKIPCLCGTESCRGTLN 2253
>gi|37360554|dbj|BAC98255.1| mKIAA1807 protein [Mus musculus]
Length = 850
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 223 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 278
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 279 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 338
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 339 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 385
>gi|402870446|ref|XP_003899232.1| PREDICTED: protein Jade-1 isoform 1 [Papio anubis]
gi|402870448|ref|XP_003899233.1| PREDICTED: protein Jade-1 isoform 2 [Papio anubis]
gi|402870450|ref|XP_003899234.1| PREDICTED: protein Jade-1 isoform 3 [Papio anubis]
Length = 843
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|332231039|ref|XP_003264701.1| PREDICTED: protein Jade-1 isoform 1 [Nomascus leucogenys]
gi|332231041|ref|XP_003264702.1| PREDICTED: protein Jade-1 isoform 2 [Nomascus leucogenys]
Length = 842
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|441618011|ref|XP_004088486.1| PREDICTED: protein Jade-1 [Nomascus leucogenys]
Length = 830
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 356
>gi|332021584|gb|EGI61949.1| PHD finger protein rhinoceros [Acromyrmex echinatior]
Length = 1438
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 99/179 (55%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG +G WLC C + P C LC
Sbjct: 251 CDVCRSPDSEEGNEMVFCDCCNICVHQACYGITSIPDGS-WLCRTC---SLSQRPDCVLC 306
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + V+RMEPI ++ + + RW L+C +C
Sbjct: 307 PNKGGAMKCTRSGQKWAHVSCALWIPEVSIGCVERMEPITKISSIPQSRWALICVLCRER 366
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS TC+ AYH CA GL ++ ED + D ++L S+C+KH
Sbjct: 367 VGACIQCSIKTCKTAYHVTCAFKYGLEMKAIIEDEM---------ADDGVKLRSYCQKH 416
>gi|195427992|ref|XP_002062059.1| GK16856 [Drosophila willistoni]
gi|194158144|gb|EDW73045.1| GK16856 [Drosophila willistoni]
Length = 3792
Score = 139 bits (351), Expect = 5e-30, Method: Composition-based stats.
Identities = 70/179 (39%), Positives = 97/179 (54%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG + + WLC C G P C LC
Sbjct: 355 CDVCRSPDSEEANEMVFCDNCNICVHQACYG-ITAIPSGQWLCRTCSMGIK---PDCVLC 410
Query: 459 PVVGGAMKPTTDGR-WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK G+ WAH++CA+WIPE + V RMEPI ++ + + RW L+C +C
Sbjct: 411 PNKGGAMKSNKSGKHWAHVSCALWIPEVSIGCVDRMEPITKISSIPQSRWSLICVLCRKR 470
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
G+CIQCS C+ AYH CA GL E R ++E + + ++L S+C+KH
Sbjct: 471 VGSCIQCSVKPCKTAYHVTCAFQHGL------EMR---AIIEEGNAEDGVKLRSYCQKH 520
>gi|395514884|ref|XP_003761641.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Sarcophilus
harrisii]
Length = 1691
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1551 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQVVADMREKRYVQEGIGS-SYLF 1609
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1610 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIEAQKKIVIYSKQPIGVDEEITYDYK 1669
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1670 FPLEDNKIPCLCGTESCRGSLN 1691
>gi|345801544|ref|XP_848999.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Canis lupus familiaris]
Length = 1713
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1573 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1631
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1632 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1691
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1692 FPLEDNKIPCLCGTESCRGSLN 1713
>gi|410984758|ref|XP_003998693.1| PREDICTED: histone-lysine N-methyltransferase SETD1A [Felis catus]
Length = 1708
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1568 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1626
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1627 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1686
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1687 FPLEDNKIPCLCGTESCRGSLN 1708
>gi|158287424|ref|XP_309452.4| AGAP011192-PA [Anopheles gambiae str. PEST]
gi|157019644|gb|EAA05242.4| AGAP011192-PA [Anopheles gambiae str. PEST]
Length = 2808
Score = 139 bits (351), Expect = 5e-30, Method: Composition-based stats.
Identities = 67/157 (42%), Positives = 100/157 (63%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +++ RE
Sbjct: 2654 SKSSQYKKMKQEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSEMREK- 2712
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
Y + G YMFR+D+ERV+DAT +G +A INHSC PNC + ++ V+ + IIIFAKR
Sbjct: 2713 QYEAR-NRGIYMFRLDEERVVDATLSGGLARYINHSCNPNCVTEIVEVDRELRIIIFAKR 2771
Query: 802 DIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
I + EEL+YDY+F D+ +++C CG P C+ +N
Sbjct: 2772 RINRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 2808
Score = 39.3 bits (90), Expect = 8.4, Method: Composition-based stats.
Identities = 28/107 (26%), Positives = 40/107 (37%), Gaps = 25/107 (23%)
Query: 446 PGAPEPPPPCCLCPVVGGAMKPTTDG----------RWAHLACAIWIPETCLTDVKRMEP 495
P P+ C C VG + DG +W HL CA+W E
Sbjct: 2288 PKMPDDTRRCIFCHTVGDGV---ADGPSRLLNYDVDKWVHLNCALWSDGV-------YET 2337
Query: 496 IDG----LNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCA 538
++G L + +C+ C + GA ++C T C YH CA
Sbjct: 2338 VNGALMNLENALQQSLTSVCAFCS-TLGATVKCFKTRCANVYHLSCA 2383
>gi|354485459|ref|XP_003504901.1| PREDICTED: protein Jade-1 [Cricetulus griseus]
gi|344250244|gb|EGW06348.1| Protein Jade-1 [Cricetulus griseus]
Length = 834
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|429329891|gb|AFZ81650.1| hypothetical protein BEWA_010670 [Babesia equi]
Length = 3609
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 696 KRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT-YMF 754
KRL S IHG G+F AG+ V+EY GEL+R I+D+RE S G G+ YMF
Sbjct: 3465 KRLDVKPSAIHGLGLFTTEDITAGEPVVEYVGELIRDIISDKREEIYSESQGGDGSCYMF 3524
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVN-GDEHIIIFAKRDIKQWEELTYDY 813
R+DDE ++DATR G+++ INHSC+PNC R+I+ G +HI++FAK D+K +E+TYDY
Sbjct: 3525 RLDDELIVDATRKGNMSRFINHSCDPNCLCRIITCEYGLKHIVVFAKSDLKAGDEVTYDY 3584
Query: 814 RFFSIDE--QLACYCGFPRCRGVVN 836
+F E +L C CG P C G +N
Sbjct: 3585 QFGVESETRKLQCLCGAPNCLGRMN 3609
>gi|426255271|ref|XP_004021280.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Ovis aries]
Length = 1394
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1254 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1312
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1313 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1372
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1373 FPLEDNKIPCLCGTESCRGSLN 1394
>gi|348582125|ref|XP_003476827.1| PREDICTED: protein Jade-1-like [Cavia porcellus]
Length = 843
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|358389897|gb|EHK27489.1| hypothetical protein TRIVIDRAFT_34353 [Trichoderma virens Gv29-8]
Length = 1221
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 90/256 (35%), Positives = 136/256 (53%), Gaps = 24/256 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P N SGCAR+E P++ ++ R+E EA + + + +
Sbjct: 971 YVP--NASGCARTEGAKKILNSEKSKYLPHHIKVQKAREEREANSKKTGRDATMAANEAA 1028
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSG 704
+ GN+ + R ++ L+ +D +K + RK+ + F +S
Sbjct: 1029 KLAAEKLIAKGNSR-ANRATNRRYVADLNDQKKTLGQDSDVFKFNQLKKRKKPVKFARSA 1087
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G++A DM+IEY GE VR IA+ RE+ S +G+ +Y+FRIDD VIDA
Sbjct: 1088 IHNWGLYAMENINKDDMIIEYVGEEVRQQIAEIRENRYLKSGIGS-SYLFRIDDNTVIDA 1146
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF----FSIDE 820
T+ G IA INHSC PNC +++I V G + I+I+A RDI EELTYDY+F S+D
Sbjct: 1147 TKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFEREIGSLD- 1205
Query: 821 QLACYCGFPRCRGVVN 836
++ C CG C+G +N
Sbjct: 1206 RIPCLCGTAACKGFLN 1221
>gi|119624286|gb|EAX03881.1| bromodomain and PHD finger containing, 3, isoform CRA_e [Homo
sapiens]
Length = 878
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 387
Query: 578 QP 579
P
Sbjct: 388 PP 389
>gi|297674332|ref|XP_002815183.1| PREDICTED: protein Jade-1 isoform 4 [Pongo abelii]
Length = 830
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 356
>gi|384484496|gb|EIE76676.1| hypothetical protein RO3G_01380 [Rhizopus delemar RA 99-880]
Length = 565
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 68/159 (42%), Positives = 104/159 (65%), Gaps = 6/159 (3%)
Query: 682 SMAD----KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADR 737
SMA+ K+ +K +KRL F KS IH +G++A A D+VIEY GE++R +A+
Sbjct: 409 SMAESDLLKFNQLKNR-KKRLIFDKSPIHDWGLYAGESIDAHDIVIEYIGEVIRQQVAEI 467
Query: 738 REHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIII 797
RE Y + +Y+FR+DD+ VIDAT+ G +A INH C PNC +++I+V+ + ++I
Sbjct: 468 REKH-YERIGIGSSYLFRVDDDMVIDATKKGGMARFINHCCTPNCSAKIITVDKQKKVVI 526
Query: 798 FAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+A RDI+ EE+TYDY+F E++ C+CG C+G +N
Sbjct: 527 YANRDIEPGEEITYDYKFPIEAEKIPCFCGSKFCKGSLN 565
>gi|338718061|ref|XP_001918146.2| PREDICTED: LOW QUALITY PROTEIN: bromodomain and PHD
finger-containing protein 3 [Equus caballus]
Length = 1207
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 69/151 (45%), Positives = 87/151 (57%), Gaps = 3/151 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 216 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 273
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 274 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 333
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GA IQC C A+H CA+ AGL +++E
Sbjct: 334 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIE 364
>gi|402870452|ref|XP_003899235.1| PREDICTED: protein Jade-1 isoform 4 [Papio anubis]
Length = 831
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 356
>gi|350407207|ref|XP_003488016.1| PREDICTED: hypothetical protein LOC100744014 [Bombus impatiens]
Length = 2766
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 98/179 (54%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG +G WLC C P C LC
Sbjct: 251 CDVCRSPDSEEGNEMVFCDCCNICVHQACYGITSIPDGS-WLCRTCSLSQR---PDCVLC 306
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + V+RMEPI ++ + + RW L+C +C
Sbjct: 307 PNKGGAMKCTRSGQKWAHVSCALWIPEVSIGCVERMEPITKISSIPQSRWALICVLCRER 366
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS TC+ AYH CA GL ++ ED + D ++L S+C+KH
Sbjct: 367 VGACIQCSIKTCKTAYHVTCAFKYGLEMKAIIEDEM---------ADDGVKLRSYCQKH 416
>gi|195564658|ref|XP_002105931.1| GD16408 [Drosophila simulans]
gi|194203296|gb|EDX16872.1| GD16408 [Drosophila simulans]
Length = 1076
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 130/239 (54%), Gaps = 10/239 (4%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVG----GYCQNGLSGNTLP 660
NPSG AR+EP RK A S + N + G Y + + +
Sbjct: 841 NPSGAARTEPKQRQLLVWRKPHTQRTAGSCSTQRMANSAAIAGEVACPYSKQFVHSKS-S 899
Query: 661 SIRVIGSKFSFSLHRDAPNFL-SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
+ + ++ +++ + F+ S + +YK MK+ +R + +S I G G++A
Sbjct: 900 QYKKMKQEWRNNVYLASKQFVHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKH 959
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
M+IEY GE++R +++ RE Y S G YMFR+D++RV+DAT +G +A INHSC
Sbjct: 960 TMIIEYIGEVIRTEVSEIREK-QYES-KNRGIYMFRLDEDRVVDATLSGGLARYINHSCN 1017
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
PNC + ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 1018 PNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 1076
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 32/141 (22%), Positives = 63/141 (44%), Gaps = 15/141 (10%)
Query: 228 YVFGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTE 286
+ G++ L++G+++ E F +I+P GY R + + P Y + E
Sbjct: 683 FRVGNMTFLNVGQLLPHQLEAFHTPHYIYPIGYKVSRYYWCVRRPNRRCRYICSI---AE 739
Query: 287 SKIRPLFRVTLDNG------EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKIS 340
+ +P FR+ + + +F GS+PS W +I I + + + I
Sbjct: 740 AGCKPEFRIQVQDAGDKEPEREFRGSSPSAVWQQILQPITRLRKVHK---WLQLFPQHI- 795
Query: 341 ESGSDMFGFSNPEVMKLILGL 361
SG D+FG + P +++++ L
Sbjct: 796 -SGEDLFGLTEPAIVRILESL 815
>gi|114596015|ref|XP_001158286.1| PREDICTED: protein Jade-1 isoform 4 [Pan troglodytes]
gi|34534189|dbj|BAC86931.1| unnamed protein product [Homo sapiens]
gi|119625577|gb|EAX05172.1| PHD finger protein 17, isoform CRA_c [Homo sapiens]
Length = 830
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 356
>gi|291401876|ref|XP_002717311.1| PREDICTED: PHD finger protein 17 [Oryctolagus cuniculus]
Length = 861
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 223 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 278
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 279 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 338
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 339 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 385
>gi|383856687|ref|XP_003703839.1| PREDICTED: uncharacterized protein LOC100877323 [Megachile
rotundata]
Length = 1885
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 99/179 (55%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG +G WLC C + P C LC
Sbjct: 251 CDVCRSPDSEEGNEMVFCDCCNICVHQACYGITSIPDGS-WLCRTC---SLSQRPDCVLC 306
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + V+RMEPI ++ + + RW L+C +C
Sbjct: 307 PNKGGAMKCTRSGQKWAHVSCALWIPEVSIGCVERMEPITKISSIPQSRWALICVLCRER 366
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS TC+ AYH CA GL ++ ED + D ++L S+C+KH
Sbjct: 367 VGACIQCSIKTCKTAYHVTCAFKYGLEMKAIIEDEM---------ADDGVKLRSYCQKH 416
>gi|426345463|ref|XP_004040431.1| PREDICTED: protein Jade-1 isoform 3 [Gorilla gorilla gorilla]
Length = 830
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 356
>gi|449668927|ref|XP_004206899.1| PREDICTED: LOW QUALITY PROTEIN: peregrin-like [Hydra
magnipapillata]
Length = 1259
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 66/150 (44%), Positives = 82/150 (54%), Gaps = 2/150 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
CS+C E +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 259 CSICCDGECSNSNVILFCDMCNLAVHQECYGVPYIPEGQ-WLCRRCF-RSPSKPVSCLLC 316
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GA K T WAH+ CA+WIPE C + +EPID + + RWKLLC IC
Sbjct: 317 PTKSGAFKQTDTNHWAHVVCALWIPEVCFANTVFLEPIDSIQEIPAARWKLLCYICKKKE 376
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC T C A+H CA+ GL +++E
Sbjct: 377 GACIQCFKTNCYTAFHVTCAQQGGLYMKIE 406
>gi|410956878|ref|XP_003985063.1| PREDICTED: protein Jade-1 isoform 3 [Felis catus]
Length = 831
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 356
>gi|354544237|emb|CCE40960.1| hypothetical protein CPAR2_109980 [Candida parapsilosis]
Length = 1042
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 899 KKPVTFARSTIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLRTGIGS-SYLF 957
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 958 RIDENTVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1017
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1018 FERETNDEERIRCLCGAPGCKGYLN 1042
>gi|344277238|ref|XP_003410410.1| PREDICTED: protein Jade-1 [Loxodonta africana]
Length = 874
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 237 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 292
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 293 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 352
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 353 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 399
>gi|410956874|ref|XP_003985061.1| PREDICTED: protein Jade-1 isoform 1 [Felis catus]
gi|410956876|ref|XP_003985062.1| PREDICTED: protein Jade-1 isoform 2 [Felis catus]
Length = 843
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|354493050|ref|XP_003508657.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Cricetulus griseus]
gi|344247509|gb|EGW03613.1| Bromodomain and PHD finger-containing protein 3 [Cricetulus
griseus]
Length = 1204
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 98/181 (54%), Gaps = 12/181 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG + + WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYG-VPYIPEGQWLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 H 576
H
Sbjct: 386 H 386
>gi|328714763|ref|XP_003245446.1| PREDICTED: hypothetical protein LOC100571907 [Acyrthosiphon pisum]
Length = 2204
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 121/234 (51%), Gaps = 25/234 (10%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NPSGCAR E GRR P L + + R F + G + P
Sbjct: 1994 NPSGCARCE-----GRRRLVLPGKLRS-HIPRSFTSSH-----GSSLLSQLDLSAPPTCP 2042
Query: 665 IGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIE 724
F +H S + +YK MK+ +R + +S I G G++A MVIE
Sbjct: 2043 YSKHF---VH-------SKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIE 2092
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y GE++R + D RE +Y + G YMFR+DD+RV+DAT +G +A INHSC PNC +
Sbjct: 2093 YIGEIIRSQLCDYREK-LYEA-KNRGIYMFRLDDDRVVDATISGGLARYINHSCNPNCVT 2150
Query: 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
+ V+ + IIIFAKR I + EEL YDY+F D+Q + C CG P CR +N
Sbjct: 2151 EKVEVDRELRIIIFAKRRIARGEELAYDYQFDIEDDQHKIPCNCGAPNCRKWMN 2204
>gi|119624284|gb|EAX03879.1| bromodomain and PHD finger containing, 3, isoform CRA_c [Homo
sapiens]
Length = 506
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 74/182 (40%), Positives = 98/182 (53%), Gaps = 8/182 (4%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 95 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCILC 152
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 153 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKG 212
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GA IQC C A+H CA+ AGL +++E +L + +R ++C+ H
Sbjct: 213 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSL-----NGTIFTVRKTAYCEAHS 267
Query: 578 QP 579
P
Sbjct: 268 PP 269
>gi|195125125|ref|XP_002007033.1| GI12708 [Drosophila mojavensis]
gi|193918642|gb|EDW17509.1| GI12708 [Drosophila mojavensis]
Length = 3514
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 97/179 (54%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG + + WLC C G P C LC
Sbjct: 315 CDVCRSPDSEEANEMVFCDNCNICVHQACYG-ITAIPSGQWLCRTCSMGIK---PECVLC 370
Query: 459 PVVGGAMKPTTDGR-WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK G+ WAH++CA+WIPE + V RMEPI ++ + + RW L+C +C
Sbjct: 371 PNKGGAMKSNKSGKHWAHVSCALWIPEVSIGCVDRMEPITKISSIPQSRWALICVLCRER 430
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
G+CIQCS TC+ AYH CA GL + ++E + + ++L S+C+KH
Sbjct: 431 VGSCIQCSVKTCKTAYHVTCAFQHGLEMR---------AIIEEGNAEDGVKLRSYCQKH 480
>gi|50312247|ref|XP_456155.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74636430|sp|Q6CIT4.1|SET1_KLULA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49645291|emb|CAG98863.1| KLLA0F24134p [Kluyveromyces lactis]
Length = 1000
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE S +G+ +Y+F
Sbjct: 857 KKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQPVAEMREKRYIKSGIGS-SYLF 915
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH CEP+C +++I V+G + I+I+A RDI EELTYDY+
Sbjct: 916 RIDENTVIDATKRGGIARFINHCCEPSCTAKIIKVDGRKRIVIYALRDIGTNEELTYDYK 975
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F E+L C CG P C+G +N
Sbjct: 976 FERETDEGERLPCLCGAPSCKGFLN 1000
>gi|326918480|ref|XP_003205516.1| PREDICTED: protein Jade-1-like [Meleagris gallopavo]
Length = 841
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 96/179 (53%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 205 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 260
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 261 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALICSLCNEK 320
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 321 VGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 367
>gi|296195582|ref|XP_002745403.1| PREDICTED: protein Jade-1 isoform 1 [Callithrix jacchus]
gi|390460412|ref|XP_003732479.1| PREDICTED: protein Jade-1 isoform 2 [Callithrix jacchus]
Length = 843
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVR---PRCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFERGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|171692915|ref|XP_001911382.1| hypothetical protein [Podospora anserina S mat+]
gi|170946406|emb|CAP73207.1| unnamed protein product [Podospora anserina S mat+]
Length = 1083
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 89/255 (34%), Positives = 133/255 (52%), Gaps = 22/255 (8%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P N +GCAR+E P++ ++ R+E +A S K V
Sbjct: 833 YVP--NTTGCARTEGVKKILNSEKSKYLPHHIKVQKAREERQAQNGKSGKD-SVLAAAEA 889
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSG 704
++ ++ T + R +F LH +D ++ + RK+ + F +S
Sbjct: 890 AKLAAESQVARGTSRANRANNRRFVAGLHDQIRGLGQDSDAFRFNQLKKRKKPVKFARSA 949
Query: 705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDA 764
IH +G++ DM+IEY GE VR IA+ RE S +G+ +Y+FRIDD VIDA
Sbjct: 950 IHNWGLYTMENIPKDDMIIEYVGEEVRQVIAELREARYLKSGIGS-SYLFRIDDNTVIDA 1008
Query: 765 TRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQ 821
T+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+F ++
Sbjct: 1009 TKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGATDR 1068
Query: 822 LACYCGFPRCRGVVN 836
+ C CG C+G +N
Sbjct: 1069 IPCLCGTAACKGFLN 1083
>gi|307186407|gb|EFN72041.1| PHD finger protein rhinoceros [Camponotus floridanus]
Length = 2950
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 98/179 (54%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG +G WLC C P C LC
Sbjct: 244 CDVCRSPDSEEGNEMVFCDCCNICVHQACYGITSIPDGS-WLCRTCSLSQR---PDCVLC 299
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + V+RMEPI ++ + + RW L+C +C
Sbjct: 300 PNKGGAMKCTRSGQKWAHVSCALWIPEVSIGCVERMEPITKISSIPQSRWALICVLCRER 359
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS TC+ AYH CA GL ++ ED + D ++L S+C+KH
Sbjct: 360 VGACIQCSIKTCKTAYHVTCAFKYGLEMKAIIEDEM---------ADDGVKLRSYCQKH 409
>gi|390460414|ref|XP_003732480.1| PREDICTED: protein Jade-1 isoform 3 [Callithrix jacchus]
Length = 831
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 194 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVR---PRCLLC 249
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 250 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 309
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 310 FGASIQCSVKNCRTAFHVTCAFERGLEMK---------TILAENDE---VKFKSYCPKH 356
>gi|195438882|ref|XP_002067361.1| GK16377 [Drosophila willistoni]
gi|194163446|gb|EDW78347.1| GK16377 [Drosophila willistoni]
Length = 2510
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 124/237 (52%), Gaps = 31/237 (13%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYL--VGGYCQNGLSGNTLPSI 662
NPSG AR+EP ++L V +P+ G C S + +
Sbjct: 2300 NPSGAARTEPKQ------------------RQLLVWRKPHTQRTAGSCS---STQRMANS 2338
Query: 663 RVIGSKFSFSLHRDAPNFL-SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDM 721
I + + + F+ S + +YK MK+ +R + +S I G G++A M
Sbjct: 2339 AAIAGEVACPYSKQ---FVHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTM 2395
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
+IEY GE++R +++ RE G YMFR+D++RV+DAT +G +A INHSC PN
Sbjct: 2396 IIEYIGEVIRTEVSEIREK--QYEAKNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPN 2453
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
C + ++ V+ D IIIFAKR I + EEL+YDY+F DE ++ C CG P CR +N
Sbjct: 2454 CVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNCRKWMN 2510
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 35/149 (23%), Positives = 62/149 (41%), Gaps = 34/149 (22%)
Query: 230 FGDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESK 288
G++ L++G+++ E F I+P GY R + L P Y + E+
Sbjct: 2143 VGNMTFLNVGQLLPHQLEAFHTPHHIYPIGYKVSRYYWCLRRPNRRCRYICSI---AEAG 2199
Query: 289 IRPLFRVTL-DNGE------QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE 341
RP FR+ + D G+ +F STP+ W +I I+ K+ +
Sbjct: 2200 CRPEFRILVQDTGDKAEPDREFRDSTPTAVWQQILQPIQR--------------LRKVHK 2245
Query: 342 ---------SGSDMFGFSNPEVMKLILGL 361
SG D+FG + P +++++ L
Sbjct: 2246 WLQLFPQHISGEDLFGLTEPAIVRILESL 2274
>gi|403271117|ref|XP_003927487.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Saimiri
boliviensis boliviensis]
Length = 206
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 85/196 (43%), Positives = 114/196 (58%), Gaps = 12/196 (6%)
Query: 643 PYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGK 702
P L GG SG +LPS +V S R L MA +++H+K+T ++ + +
Sbjct: 21 PTLPGGVS----SGKSLPSAQVQLR----STRRATSLELPMAMQFRHLKKTSKESVGVYR 72
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +F K AG+M+I+Y+G ++R + D+RE F G YMFR+DD V+
Sbjct: 73 STIHRRALFCKGDIDAGEMIIDYSGIVIRSVLTDKREKFYDGK--GIRCYMFRMDDFDVL 130
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--E 820
DAT G A INHSCEPNC+SRVI V G +HI+IFA R I + EELTYDY+F D
Sbjct: 131 DATMHGHAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASN 190
Query: 821 QLACYCGFPRCRGVVN 836
+L C G RCR +N
Sbjct: 191 KLPCNYGGKRCRRFLN 206
>gi|448520177|ref|XP_003868242.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis Co
90-125]
gi|380352581|emb|CCG22808.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis]
Length = 1038
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 895 KKPVTFARSTIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAEHREKSYLRTGIGS-SYLF 953
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 954 RIDENTVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1013
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1014 FERETNDEERIRCLCGAPGCKGYLN 1038
>gi|403271711|ref|XP_003927754.1| PREDICTED: protein Jade-1 isoform 1 [Saimiri boliviensis
boliviensis]
gi|403271713|ref|XP_003927755.1| PREDICTED: protein Jade-1 isoform 2 [Saimiri boliviensis
boliviensis]
Length = 842
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVR---PRCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFERGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|395854486|ref|XP_003799721.1| PREDICTED: protein Jade-3 [Otolemur garnettii]
Length = 846
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 77/181 (42%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 228 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 283
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C V
Sbjct: 284 PKKGGAMKTTRTGSKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCSLCKVK 343
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS TC A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 344 TGACIQCSIKTCITAFHVTCAFEHNLEMK---------TILDEGDE---VKFKSYCLKHS 391
Query: 578 Q 578
Q
Sbjct: 392 Q 392
>gi|119625575|gb|EAX05170.1| PHD finger protein 17, isoform CRA_a [Homo sapiens]
Length = 787
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 151 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 206
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 207 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 266
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 267 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 313
>gi|397472033|ref|XP_003807565.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A [Pan paniscus]
Length = 1479
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1339 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1397
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 1398 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 1457
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 1458 FPLEDNKIPCLCGTESCRGSLN 1479
>gi|194388362|dbj|BAG65565.1| unnamed protein product [Homo sapiens]
Length = 755
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 119 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 174
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 175 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 234
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 235 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 281
>gi|344304500|gb|EGW34732.1| hypothetical protein SPAPADRAFT_133304 [Spathaspora passalidarum NRRL
Y-27907]
Length = 1060
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A RE + +G+ +Y+F
Sbjct: 917 KKPVTFARSAIHNWGLYAMEPIAAKEMIIEYVGERIRQQVAQHREESYLKTGIGS-SYLF 975
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 976 RIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1035
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1036 FERETNDEERIRCLCGAPGCKGYLN 1060
>gi|281343485|gb|EFB19069.1| hypothetical protein PANDA_015059 [Ailuropoda melanoleuca]
Length = 825
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 188 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 243
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 244 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 303
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 304 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 350
>gi|14017831|dbj|BAB47436.1| KIAA1807 protein [Homo sapiens]
Length = 702
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 66 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 121
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 122 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 181
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 182 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 228
>gi|440470515|gb|ELQ39582.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
gi|440488496|gb|ELQ68221.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
Length = 1278
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 91/258 (35%), Positives = 141/258 (54%), Gaps = 38/258 (14%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P N +GCAR+E P++ ++ R+E +A A A+ +
Sbjct: 1038 YVP--NATGCARTEGVKKILNSEKSKYLPHHIKVKKAREERQAQAKAAPGKSLT-----A 1090
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFL----SMADKYKHMKETFRKRLAFG 701
V N +S + S+ + + HR A + S A ++ +K+ +K + F
Sbjct: 1091 VKVADDNSISRGS--------SRANRANHRRAMAEMNDQKSDALRFNQLKKR-KKPVKFE 1141
Query: 702 KSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERV 761
+S IH +G++A DM+IEY GE VRPS+A RE S +G+ +Y+FRID++ V
Sbjct: 1142 RSAIHNWGLYAMEHIPKDDMIIEYVGEEVRPSVAQVREARYDRSGIGS-SYLFRIDEDAV 1200
Query: 762 IDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSI 818
IDAT+ G IA INHSC PNC +++I V G + I+I+A RDI + EELTYDY+F
Sbjct: 1201 IDATKKGGIARFINHSCMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKE 1260
Query: 819 DEQLACYCGFPRCRGVVN 836
++++ C CG C+G +N
Sbjct: 1261 EDRVPCLCGTTNCKGFLN 1278
>gi|397505200|ref|XP_003823158.1| PREDICTED: protein Jade-1 isoform 1 [Pan paniscus]
Length = 864
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 228 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 283
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 284 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 343
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 344 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 390
>gi|359322015|ref|XP_003639753.1| PREDICTED: protein Jade-1 [Canis lupus familiaris]
gi|359322017|ref|XP_848505.3| PREDICTED: protein Jade-1 isoform 2 [Canis lupus familiaris]
Length = 844
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|358401203|gb|EHK50509.1| hypothetical protein TRIATDRAFT_171650, partial [Trichoderma
atroviride IMI 206040]
Length = 1241
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 100/155 (64%), Gaps = 7/155 (4%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K+ +K+ +K + F +S IH +G++A DM+IEY GE VR IA+ RE+ S
Sbjct: 1090 KFNQLKKR-KKPVKFARSAIHNWGLYAMENINKDDMIIEYVGEEVRQQIAEIRENRYLKS 1148
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FRIDD VIDAT+ G IA INHSC PNC +++I V G + I+I+A RDI
Sbjct: 1149 GIGS-SYLFRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAM 1207
Query: 806 WEELTYDYRFF----SIDEQLACYCGFPRCRGVVN 836
EELTYDY+F S+D ++ C CG C+G +N
Sbjct: 1208 NEELTYDYKFEREIGSLD-RIPCLCGTAACKGFLN 1241
>gi|389634753|ref|XP_003715029.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
gi|351647362|gb|EHA55222.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
Length = 1278
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 91/258 (35%), Positives = 141/258 (54%), Gaps = 38/258 (14%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P N +GCAR+E P++ ++ R+E +A A A+ +
Sbjct: 1038 YVP--NATGCARTEGVKKILNSEKSKYLPHHIKVKKAREERQAQAKAAPGKSLT-----A 1090
Query: 646 VGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFL----SMADKYKHMKETFRKRLAFG 701
V N +S + S+ + + HR A + S A ++ +K+ +K + F
Sbjct: 1091 VKVADDNSISRGS--------SRANRANHRRAMAEMNDQKSDALRFNQLKKR-KKPVKFE 1141
Query: 702 KSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERV 761
+S IH +G++A DM+IEY GE VRPS+A RE S +G+ +Y+FRID++ V
Sbjct: 1142 RSAIHNWGLYAMEHIPKDDMIIEYVGEEVRPSVAQVREARYDRSGIGS-SYLFRIDEDAV 1200
Query: 762 IDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSI 818
IDAT+ G IA INHSC PNC +++I V G + I+I+A RDI + EELTYDY+F
Sbjct: 1201 IDATKKGGIARFINHSCMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKE 1260
Query: 819 DEQLACYCGFPRCRGVVN 836
++++ C CG C+G +N
Sbjct: 1261 EDRVPCLCGTTNCKGFLN 1278
>gi|341896007|gb|EGT51942.1| hypothetical protein CAEBREN_26218 [Caenorhabditis brenneri]
Length = 1670
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 60/148 (40%), Positives = 101/148 (68%), Gaps = 1/148 (0%)
Query: 689 HMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVG 748
+M ++ +K + F +S IHG+G++A +M+IEY G+ +R +AD RE +G
Sbjct: 1524 NMLKSRKKMIKFARSRIHGYGLYAMETIAQDEMIIEYIGQKIRSLVADEREKAYERRGIG 1583
Query: 749 AGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEE 808
+ +Y+FRID+ VIDAT+ G+ A INHSC+PNCY++V+++ G++ I+I+++ I + EE
Sbjct: 1584 S-SYLFRIDEHTVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRSTINKGEE 1642
Query: 809 LTYDYRFFSIDEQLACYCGFPRCRGVVN 836
+TYDY+F ++++ C CG CRG +N
Sbjct: 1643 ITYDYKFPIEEDKIDCLCGAKTCRGYLN 1670
>gi|222617469|gb|EEE53601.1| hypothetical protein OsJ_36855 [Oryza sativa Japonica Group]
Length = 1165
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 104/156 (66%), Gaps = 5/156 (3%)
Query: 679 NFLSMADKYKHMK----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ A+ +K ++ +KRL F +S IH +G+ A A D VIEY GEL+R +
Sbjct: 1011 NLLAAAEGTDLLKIMQSKSRKKRLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQV 1070
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
+D RE S +G+ +Y+FR+DD+ V+DAT+ G +A INHSC+PNCY++VI+V G +
Sbjct: 1071 SDIREDQYEKSGIGS-SYLFRLDDDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKK 1129
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPR 830
I+I+AKR I EELTY+Y+F ++++ C+CG R
Sbjct: 1130 IVIYAKRRIYAGEELTYNYKFPLEEKKIPCHCGSQR 1165
>gi|218187240|gb|EEC69667.1| hypothetical protein OsI_39097 [Oryza sativa Indica Group]
Length = 1167
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 104/156 (66%), Gaps = 5/156 (3%)
Query: 679 NFLSMADKYKHMK----ETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ A+ +K ++ +KRL F +S IH +G+ A A D VIEY GEL+R +
Sbjct: 1013 NLLAAAEGTDLLKIMQSKSRKKRLRFQRSKIHEWGLVALESIDAEDFVIEYVGELIRRQV 1072
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
+D RE S +G+ +Y+FR+DD+ V+DAT+ G +A INHSC+PNCY++VI+V G +
Sbjct: 1073 SDIREDQYEKSGIGS-SYLFRLDDDYVVDATKRGGLARFINHSCDPNCYTKVITVEGQKK 1131
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPR 830
I+I+AKR I EELTY+Y+F ++++ C+CG R
Sbjct: 1132 IVIYAKRRIYAGEELTYNYKFPLEEKKIPCHCGSQR 1167
>gi|42538982|ref|NP_970614.1| protein Jade-3 [Danio rerio]
gi|82188675|sp|Q7ZVP1.1|JADE3_DANRE RecName: Full=Protein Jade-3; AltName: Full=PHD finger protein 16
gi|28277726|gb|AAH45468.1| PHD finger protein 16 [Danio rerio]
gi|40389477|tpe|CAE30492.1| TPA: putative Jade3 protein [Danio rerio]
gi|182890742|gb|AAI65248.1| Phf16 protein [Danio rerio]
Length = 795
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 73/182 (40%), Positives = 100/182 (54%), Gaps = 17/182 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG ++ +G WLC C G P C LC
Sbjct: 204 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGIVKVPDGN-WLCRTCVLGIT---PQCLLC 259
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 260 PKTGGAMKATRAGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWSLICSLCKLK 319
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS C + +H CA L ++ LDE DE ++ S+C KH
Sbjct: 320 TGACIQCSVKNCTIPFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 367
Query: 578 QP 579
+P
Sbjct: 368 KP 369
>gi|345567899|gb|EGX50801.1| hypothetical protein AOL_s00054g887 [Arthrobotrys oligospora ATCC
24927]
Length = 1338
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A +M+IEY GE+VR +AD RE S +G+ +Y+F
Sbjct: 1195 KKPVRFARSAIHNWGLYAMENISNNEMIIEYVGEIVRQQVADLREKNYLRSGIGS-SYLF 1253
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INHSC PNC +++I V G + I+I+A RDI + EELTYDY+
Sbjct: 1254 RIDETTVIDATKKGGIARFINHSCTPNCTAKIIKVEGTKRIVIYALRDIHKDEELTYDYK 1313
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E++ C CG C+G +N
Sbjct: 1314 FEREIDSEERIPCLCGSSGCKGFLN 1338
>gi|404247448|ref|NP_001258194.1| uncharacterized protein LOC299305 [Rattus norvegicus]
Length = 824
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 204 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKIPEGS-WLCRSCVLGIY---PQCVLC 259
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEP+ ++ + RW L+C++C +
Sbjct: 260 PKKGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPVTKISHIPPSRWALVCNLCKLK 319
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ SFC KH
Sbjct: 320 TGACIQCSVKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSFCLKHS 367
Query: 578 Q 578
Q
Sbjct: 368 Q 368
>gi|16307411|gb|AAH10250.1| Setd1a protein, partial [Mus musculus]
Length = 316
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 82/239 (34%), Positives = 131/239 (54%), Gaps = 21/239 (8%)
Query: 602 PPSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPS 661
P + +G ARSE Y ++ + + + S ++L G+T +
Sbjct: 95 PREHQTGSARSEGYYPISKKEKDKYLDVCPVSARQL----------------EGGDTQGT 138
Query: 662 IRVIGSKFSFS---LHRDAPNFLSMADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHR 717
RV+ + S L + + +D K + FRK+ L FG+S IH +G+FA P
Sbjct: 139 NRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIA 198
Query: 718 AGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS 777
A +MVIEY G+ +R +AD RE +G+ +Y+FR+D + +IDAT+ G++A INH
Sbjct: 199 ADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLFRVDHDTIIDATKCGNLARFINHC 257
Query: 778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
C PNCY++VI++ + I+I++K+ I EE+TYDY+F D ++ C CG CRG +N
Sbjct: 258 CTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 316
>gi|328767840|gb|EGF77888.1| hypothetical protein BATDEDRAFT_27217 [Batrachochytrium
dendrobatidis JAM81]
Length = 838
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 65/149 (43%), Positives = 85/149 (57%), Gaps = 2/149 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC E +N + CD C + VH CYG + + WLC C +PE P C C
Sbjct: 307 CAVCDDGEAENSNAIVFCDGCNLAVHQDCYG-VPFIPEGQWLCRKCM-LSPETPVSCLFC 364
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P+ GGA K TT +W HL CA+WIPE + + MEP++G + K RW+L+C IC Y
Sbjct: 365 PIKGGAFKQTTTNKWVHLNCAMWIPECHIANTVYMEPVEGTGNIPKSRWRLICYICRKRY 424
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVEL 547
GA IQCSN C V +H CAR A L + +
Sbjct: 425 GAPIQCSNKACFVPFHASCARKAKLFMRM 453
>gi|363756170|ref|XP_003648301.1| hypothetical protein Ecym_8199 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891501|gb|AET41484.1| Hypothetical protein Ecym_8199 [Eremothecium cymbalariae
DBVPG#7215]
Length = 995
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 98/145 (67%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K++ F +S IH +G++A P A +M+IEY GE++R +A+ RE+ S +G+ +Y+F
Sbjct: 852 KKQVTFARSPIHNWGLYALEPIAAKEMIIEYVGEMIRQPVAEMRENRYLKSGIGS-SYLF 910
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C+P+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 911 RIDEYTVIDATKKGGIARFINHCCDPSCTAKIIKVGGRKRIVIYALRDIAANEELTYDYK 970
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG C+G +N
Sbjct: 971 FEREVDDEERLPCLCGAATCKGFLN 995
>gi|40556368|ref|NP_955021.1| protein Jade-3 [Mus musculus]
gi|81891667|sp|Q6IE82.1|JADE3_MOUSE RecName: Full=Protein Jade-3; AltName: Full=PHD finger protein 16
gi|40389493|tpe|CAE30499.1| TPA: Jade3 protein [Mus musculus]
gi|120577662|gb|AAI30271.1| PHD finger protein 16 [Mus musculus]
Length = 823
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKIPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEP+ ++ + RW L+C++C +
Sbjct: 259 PKKGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPVTKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ SFC KH
Sbjct: 319 TGACIQCSVKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSFCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|157821409|ref|NP_001101085.1| bromodomain and PHD finger-containing protein 3 [Rattus norvegicus]
gi|149043488|gb|EDL96939.1| bromodomain and PHD finger containing, 3 (predicted) [Rattus
norvegicus]
Length = 1199
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 97/181 (53%), Gaps = 12/181 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 215 CCVCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 272
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DG WAH+ CAIWIPE C + +EPI+G++ + RWKL C IC
Sbjct: 273 PNKGGAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDSIPPARWKLTCYICKQKG 332
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN +R ++C+
Sbjct: 333 LGAAIQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIF-------TVRKTAYCEA 385
Query: 576 H 576
H
Sbjct: 386 H 386
>gi|194386878|dbj|BAG59805.1| unnamed protein product [Homo sapiens]
Length = 715
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|71051743|gb|AAH98812.1| RGD1311624 protein, partial [Rattus norvegicus]
Length = 216
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 76 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 134
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 135 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 194
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 195 FPLEDNKIPCLCGTESCRGSLN 216
>gi|254569422|ref|XP_002491821.1| hypothetical protein [Komagataella pastoris GS115]
gi|238031618|emb|CAY69541.1| hypothetical protein PAS_chr2-2_0494 [Komagataella pastoris GS115]
gi|328351679|emb|CCA38078.1| histone-lysine N-methyltransferase SETD1 [Komagataella pastoris CBS
7435]
Length = 1020
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 63/145 (43%), Positives = 99/145 (68%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K ++F +S IH +G++A P A +M+IEY GE++R +++ RE S +G+ +Y+F
Sbjct: 877 KKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGEILRQKVSEVREKKYLKSGIGS-SYLF 935
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D++ VIDAT+ G IA INH C+P+C +++I V G + I+I+A +DI EELTYDY+
Sbjct: 936 RVDEDTVIDATKKGGIARFINHCCQPSCTAKIIKVEGKKRIVIYALKDIAANEELTYDYK 995
Query: 815 FFSID---EQLACYCGFPRCRGVVN 836
F D E++ C CG P C+G +N
Sbjct: 996 FEREDNNEERIPCLCGVPGCKGYLN 1020
>gi|255730355|ref|XP_002550102.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
gi|240132059|gb|EER31617.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
Length = 1056
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 913 KKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQQVAEHREKSYLKTGIGS-SYLF 971
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 972 RIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVEGIKRIVIYALRDIEANEELTYDYK 1031
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1032 FERETNDEERIRCLCGAPGCKGYLN 1056
>gi|112418606|gb|AAI21971.1| LOC779492 protein [Xenopus (Silurana) tropicalis]
Length = 401
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 77/184 (41%), Positives = 99/184 (53%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE + +N L CD C + VH CYG G WLC C +P P C LC
Sbjct: 219 CCVCLDDECHNSNAILFCDICNLAVHQECYGVPYIPEG-QWLCRCCL-QSPSKPVSCVLC 276
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EP++G+N + RWKL C +C
Sbjct: 277 PNQGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPVEGVNNIPSARWKLTCYLCKQKG 336
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL +++E E LN + +R +FC+
Sbjct: 337 RGAAIQCHKVNCYTAFHVTCAQRAGLFMKVEPVRETGLNGTTFT-------VRKTAFCEL 389
Query: 576 HKQP 579
H P
Sbjct: 390 HCPP 393
>gi|186908735|gb|ACC94158.1| bromodomain- and PHD finger-containing 1 [Danio rerio]
Length = 1258
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C P +P C LC
Sbjct: 309 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRC-PQSPSRAVDCALC 366
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 367 PNKGGAFKQTDDSRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 426
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 427 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 479
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 480 HTPPGSAR 487
>gi|196004646|ref|XP_002112190.1| hypothetical protein TRIADDRAFT_50258 [Trichoplax adhaerens]
gi|190586089|gb|EDV26157.1| hypothetical protein TRIADDRAFT_50258 [Trichoplax adhaerens]
Length = 152
Score = 139 bits (349), Expect = 1e-29, Method: Composition-based stats.
Identities = 67/154 (43%), Positives = 96/154 (62%), Gaps = 2/154 (1%)
Query: 683 MADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI 742
MA +Y+H+ T ++ + +S IHG G+F K AG+M++EY G +R + D++E +
Sbjct: 1 MAMRYRHLAATAKETVGVYRSDIHGRGLFCKRIIEAGEMIMEYAGTKIRSILTDKQEKYY 60
Query: 743 YNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRD 802
G G YMFRID + VIDAT +G+ A INHSC+PNCY +++ ++ + IIIFA R
Sbjct: 61 EGK--GLGCYMFRIDRDEVIDATLSGNAARFINHSCDPNCYCKIVPIDNIKKIIIFALRR 118
Query: 803 IKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
I EELTYDY F D +L C C +CR +N
Sbjct: 119 IYPGEELTYDYNFAKEDIKLPCNCSSKKCRKFLN 152
>gi|405970535|gb|EKC35431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
Length = 1927
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 90/234 (38%), Positives = 121/234 (51%), Gaps = 29/234 (12%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NP+GCAR+EP R ++P L +A+ R P V G+SG+
Sbjct: 1721 NPTGCARTEPKL---RTHFRKPHTLQSANTSRSL----PQTV-----TGVSGD------- 1761
Query: 665 IGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIE 724
I S + S + +Y+ +K +R + G+S I G G+FA MVIE
Sbjct: 1762 INSPYMKQF------VHSKSAQYRKLKTEWRTNVFLGRSRIQGLGLFAARDLEKHTMVIE 1815
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y G L+R +A+R E +Y G YMFRID+E V+DAT AG A INHSC PNC +
Sbjct: 1816 YIGYLIRNEVANRTE-VVYEEQ-NRGVYMFRIDNETVVDATMAGGPARYINHSCNPNCVA 1873
Query: 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
V+ + + III R I + EELTYDY+F DEQ + C CG P CR +N
Sbjct: 1874 EVVPFDKESKIIIITNRRIPRGEELTYDYKFDFEDEQHKIPCCCGAPGCRKWMN 1927
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVA 532
+AHL CA+W E T + +D K K+ C+ CG+ GA + C N C
Sbjct: 1441 YAHLNCALWSSEVYETLNGALMNVDS---AYKRGLKIECTACGIK-GATVGCFNNRCPKY 1496
Query: 533 YHPLCARAAGLCVELEDEDRL 553
YH CA+ G C+ +D+ L
Sbjct: 1497 YHLGCAKKVG-CMFFQDKTIL 1516
>gi|365981399|ref|XP_003667533.1| hypothetical protein NDAI_0A01320 [Naumovozyma dairenensis CBS 421]
gi|343766299|emb|CCD22290.1| hypothetical protein NDAI_0A01320 [Naumovozyma dairenensis CBS 421]
Length = 831
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 99/179 (55%), Gaps = 5/179 (2%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYG-ELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
C++C+ N + CD C + VH CYG PV+ WLC C+ G +P C +
Sbjct: 242 CAICYGTNSDVTNTIVFCDGCNIAVHQECYGIVFIPVDS--WLCRRCQFGNNDPDIGCIV 299
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GA K T +G W H CA+W+PE ++ MEPI+G+ + RWKL C IC
Sbjct: 300 CPSKTGAFKMTDNGIWIHNICALWLPELYFANLHYMEPIEGIGNIPSSRWKLFCYICKKR 359
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC++ C +AYH CAR AGL ++ + + +++ S+ + +L SFC KH
Sbjct: 360 MGACIQCTHKNCFLAYHVTCARRAGLYLKWDRD--VSVGSVASNQVHLGNKLHSFCDKH 416
>gi|403160477|ref|XP_003890492.1| hypothetical protein PGTG_20786 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170263|gb|EHS64083.1| hypothetical protein PGTG_20786 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1979
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/192 (40%), Positives = 106/192 (55%), Gaps = 15/192 (7%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
+C++C + +N + CD C + VH CYG + + WLC C +P+ P C L
Sbjct: 198 RCAICEDGDTENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPDRPVTCEL 255
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GA K T++ +WAHL CAI IPET + + MEP+DG+ + K RWKL C IC +
Sbjct: 256 CPNSFGAFKQTSENKWAHLVCAIHIPETGVGNAMYMEPVDGVRCIPKQRWKLKCYICKKT 315
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLS---LDEDDEDQC-------- 566
GACIQC+N +C VAYH CA+ GL V+++ L + ++ D C
Sbjct: 316 VGACIQCANRSCCVAYHATCAQEVGLYVKMKPAGSLYPTTGARGTDEGPDSCAADHVGAD 375
Query: 567 -IRL-LSFCKKH 576
IRL SFC KH
Sbjct: 376 GIRLSQSFCDKH 387
>gi|380801021|gb|AFE72386.1| histone-lysine N-methyltransferase SETD1A, partial [Macaca mulatta]
Length = 310
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 170 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 228
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 229 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 288
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 289 FPLEDNKIPCLCGTESCRGSLN 310
>gi|312384477|gb|EFR29200.1| hypothetical protein AND_02075 [Anopheles darlingi]
Length = 5086
Score = 138 bits (348), Expect = 1e-29, Method: Composition-based stats.
Identities = 66/157 (42%), Positives = 97/157 (61%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK+ +R + +S I G G++A MVIEY GE++R +++ RE
Sbjct: 4932 SKSSQYKKMKQEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSELREK- 4990
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
G YMFR+D+ERV+DAT +G +A INHSC PNC + + V+ + IIIFAKR
Sbjct: 4991 -QYEAKNRGIYMFRLDEERVVDATLSGGLARYINHSCNPNCVTETVEVDRELRIIIFAKR 5049
Query: 802 DIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
I + EEL+YDY+F D+ +++C CG P C+ +N
Sbjct: 5050 RINRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 5086
>gi|432104072|gb|ELK30902.1| Protein Jade-1 [Myotis davidii]
Length = 688
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|37359846|dbj|BAC97901.1| mKIAA0215 protein [Mus musculus]
Length = 822
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 202 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKIPEGS-WLCRSCVLGIY---PQCVLC 257
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEP+ ++ + RW L+C++C +
Sbjct: 258 PKKGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPVTKISHIPPSRWALVCNLCKLK 317
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ SFC KH
Sbjct: 318 TGACIQCSVKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSFCLKHS 365
Query: 578 Q 578
Q
Sbjct: 366 Q 366
>gi|355756723|gb|EHH60331.1| hypothetical protein EGM_11666, partial [Macaca fascicularis]
Length = 256
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 63/142 (44%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 116 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 174
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 175 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 234
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D ++ C CG CRG +N
Sbjct: 235 FPLEDNKIPCLCGTESCRGSLN 256
>gi|363733267|ref|XP_003641225.1| PREDICTED: protein Jade-1-like [Gallus gallus]
Length = 891
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 96/179 (53%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 255 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 310
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 311 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALICSLCNEK 370
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 371 VGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 417
>gi|427788389|gb|JAA59646.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 1597
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/181 (38%), Positives = 99/181 (54%), Gaps = 15/181 (8%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD+C + VH CYG ++ + W+C C G PPC LC
Sbjct: 181 CDVCRSPDSEEGNEMVFCDQCDLCVHQACYG-IQRIPEGSWVCRTCALGIR---PPCVLC 236
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + V++MEPI ++++ RW L C +C
Sbjct: 237 PTRGGAMKSTRSGQKWAHVSCALWIPEVSIGCVEKMEPIMKISQIPPSRWALTCCLCRER 296
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS C+ AYH CA G L + + +D+ ++L SFC KH
Sbjct: 297 IGACIQCSVKACKRAYHVTCAFENG----------LEMKPIIDDNTVDEVKLKSFCPKHS 346
Query: 578 Q 578
+
Sbjct: 347 K 347
>gi|410988415|ref|XP_004000481.1| PREDICTED: protein Jade-3 [Felis catus]
Length = 823
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/181 (41%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C V
Sbjct: 259 PKRGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCSLCKVK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|374106286|gb|AEY95196.1| FABR136Wp [Ashbya gossypii FDAG1]
Length = 975
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE S +G+ +Y+F
Sbjct: 832 KKPVTFARSAIHNWGLYALEPISAKEMIIEYVGERIRQPVAEMREKRYLKSGIGS-SYLF 890
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+P+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 891 RVDESTVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYK 950
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 951 FERETDDEERLPCLCGAPNCKGFLN 975
>gi|302306708|ref|NP_983083.2| ABR136Wp [Ashbya gossypii ATCC 10895]
gi|442570023|sp|Q75D88.2|SET1_ASHGO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|299788647|gb|AAS50907.2| ABR136Wp [Ashbya gossypii ATCC 10895]
Length = 975
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/145 (44%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE S +G+ +Y+F
Sbjct: 832 KKPVTFARSAIHNWGLYALEPISAKEMIIEYVGERIRQPVAEMREKRYLKSGIGS-SYLF 890
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+P+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 891 RVDESTVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYK 950
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 951 FERETDDEERLPCLCGAPNCKGFLN 975
>gi|449278214|gb|EMC86148.1| Protein Jade-1, partial [Columba livia]
Length = 824
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 96/179 (53%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 187 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVR---PKCLLC 242
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 243 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 302
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 303 VGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 349
>gi|428181744|gb|EKX50607.1| hypothetical protein GUITHDRAFT_66907 [Guillardia theta CCMP2712]
Length = 182
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 95/151 (62%), Gaps = 2/151 (1%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K K+++ R +S IH G+FA DMVIEY GEL+RP + D R+
Sbjct: 34 KAKYLQLRSEPRTVVRRSPIHNLGLFATRRIDRNDMVIEYVGELIRPIVGDIRDDLALEK 93
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
G TYMFR+DD ++DA AG+ + INH C+PNCY +V++V+G +HI++FA RDI+
Sbjct: 94 --GKSTYMFRLDDNYIVDAMFAGNASRFINHCCDPNCYCQVVTVDGVKHIVLFAMRDIEA 151
Query: 806 WEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
EE+TYDY+ + ++ C+CG +CRG +N
Sbjct: 152 DEEITYDYKLPIEEVKVICHCGSAKCRGYMN 182
>gi|326669898|ref|XP_001919047.3| PREDICTED: bromodomain and PHD finger-containing protein 3 [Danio
rerio]
Length = 1207
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/186 (39%), Positives = 100/186 (53%), Gaps = 12/186 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE +N+ L CD C + VH CYG G WLC C +P P C LC
Sbjct: 235 CCVCLDDECLNSNVILFCDICNLAVHQECYGVPYIPEGQ-WLCRCCL-QSPSRPVDCVLC 292
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T+DGRWAH+ CAIWIPE C + +EPI+G++ + RWKL C +C
Sbjct: 293 PNRGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPIEGVDNIPPARWKLTCYLCKQKG 352
Query: 519 -GACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL ++++ E +N + ++ +FC+
Sbjct: 353 CGASIQCHKANCYTAFHVTCAQRAGLFMKIDPVRETTVNGTTF-------SVKKTAFCEA 405
Query: 576 HKQPLN 581
H +N
Sbjct: 406 HSPAVN 411
>gi|327274074|ref|XP_003221803.1| PREDICTED: protein Jade-1-like [Anolis carolinensis]
Length = 931
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/191 (38%), Positives = 100/191 (52%), Gaps = 20/191 (10%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 296 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 351
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 352 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALICSLCNEK 411
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH- 576
GA IQCS CR A+H CA GL + + D D+ ++ S+C KH
Sbjct: 412 IGASIQCSVKNCRTAFHVTCAFDRGL--------EMKTILADNDE----VKFKSYCPKHS 459
Query: 577 --KQPLNDRLA 585
K+P + L+
Sbjct: 460 STKKPDEENLS 470
>gi|449548376|gb|EMD39343.1| hypothetical protein CERSUDRAFT_112983 [Ceriporiopsis subvermispora
B]
Length = 1293
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/150 (44%), Positives = 86/150 (57%), Gaps = 2/150 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 133 CAICDDSEGENTNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVSCILC 190
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G W HL CAIW+PET + + MEPI G++++ K RWKL CS C V
Sbjct: 191 PNEGGAFKQTVHGDWVHLLCAIWVPETRVANDVFMEPITGVDKIPKQRWKLKCSRCDVKE 250
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC+ +C A+H CAR L + ++
Sbjct: 251 GACIQCTKASCFTAFHVTCARKEKLLMPMK 280
>gi|431917779|gb|ELK17021.1| Protein Jade-3 [Pteropus alecto]
Length = 988
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/191 (40%), Positives = 104/191 (54%), Gaps = 20/191 (10%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 367 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIR---PQCVLC 422
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 423 PKKGGAMKATKTGTKWAHVSCALWIPEVSIGCPERMEPITKISHIPPSRWALVCSLCKLK 482
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 483 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 530
Query: 578 QPLNDRLAVDE 588
Q +RL + E
Sbjct: 531 Q---NRLKLGE 538
>gi|348521834|ref|XP_003448431.1| PREDICTED: bromodomain-containing protein 1-like [Oreochromis
niloticus]
Length = 1164
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/222 (37%), Positives = 116/222 (52%), Gaps = 23/222 (10%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE +N+ L CD C + VH CYG + V WLC C +P P C LC
Sbjct: 239 CCVCLDDECLNSNVILFCDICNLAVHQECYG-VPYVPEGQWLCRCCL-QSPSRPVDCVLC 296
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EP++G++ + RWKL C +C
Sbjct: 297 PNRGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPVEGVSNIPPARWKLTCYLCKQKG 356
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL ++++ E +N + ++ +FC+
Sbjct: 357 RGASIQCHKANCYRAFHVTCAQKAGLFMKIDPVRETSVNGTTF-------SVKKTAFCEH 409
Query: 576 HKQPLNDRL------AVDERLV----QVTRRCCDYIPPSNPS 607
H P+ R +V+ RLV +R PPS+P+
Sbjct: 410 HS-PVGSRRDGSGDESVEGRLVGGRGNRGQRSYTQSPPSSPN 450
>gi|410080444|ref|XP_003957802.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
gi|372464389|emb|CCF58667.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
Length = 1133
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 990 KKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGS-SYLF 1048
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+P+C +++I V G I+I+A RDI + EELTYDY+
Sbjct: 1049 RVDENNVIDATKKGGIARFINHCCDPSCTAKIIKVGGKRRIVIYALRDIAKNEELTYDYK 1108
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 1109 FEREQDDEERLPCLCGAPNCKGFLN 1133
>gi|301780240|ref|XP_002925538.1| PREDICTED: protein Jade-1-like [Ailuropoda melanoleuca]
Length = 994
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 357 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 412
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 413 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 472
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 473 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 519
>gi|403179544|ref|XP_003888549.1| hypothetical protein PGTG_22694 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165103|gb|EHS62842.1| hypothetical protein PGTG_22694 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1912
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 92/151 (60%), Gaps = 2/151 (1%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
+C++C + +N + CD C + VH CYG + + WLC C +P+ P C L
Sbjct: 142 RCAICEDGDTENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPDRPVTCEL 199
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GA K T++ +WAHL CAI IPET + + MEP+DG+ + K RWKL C IC +
Sbjct: 200 CPNSFGAFKQTSENKWAHLVCAIHIPETGVGNAMYMEPVDGVRCIPKQRWKLKCYICKKT 259
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC+N +C VAYH CA+ GL V+++
Sbjct: 260 VGACIQCANRSCCVAYHATCAQEVGLYVKMK 290
>gi|403297395|ref|XP_003939551.1| PREDICTED: protein Jade-3 [Saimiri boliviensis boliviensis]
Length = 823
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSVKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|355757310|gb|EHH60835.1| PHD finger protein 16 [Macaca fascicularis]
Length = 823
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGAMKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|355711211|gb|AES03937.1| PHD finger protein 17 [Mustela putorius furo]
Length = 549
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 213 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 268
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 269 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 328
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 329 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 375
>gi|355704749|gb|EHH30674.1| PHD finger protein 16 [Macaca mulatta]
gi|380818382|gb|AFE81064.1| protein Jade-3 [Macaca mulatta]
gi|383423211|gb|AFH34819.1| protein Jade-3 [Macaca mulatta]
gi|384950584|gb|AFI38897.1| protein Jade-3 [Macaca mulatta]
Length = 823
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGAMKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|294658913|ref|XP_461254.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
gi|218511781|sp|Q6BKL7.2|SET1_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|202953480|emb|CAG89643.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
Length = 1088
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 63/145 (43%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K ++F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 945 KKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQQVAEHRERSYLKTGIGS-SYLF 1003
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ V+DAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 1004 RIDENTVVDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1063
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + E++ C CG P C+G +N
Sbjct: 1064 FEKETNDAERIRCLCGAPGCKGYLN 1088
>gi|406603115|emb|CCH45348.1| hypothetical protein BN7_4930 [Wickerhamomyces ciferrii]
Length = 720
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 64/151 (42%), Positives = 84/151 (55%), Gaps = 2/151 (1%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
+C+VC+ E +N + CD C + VH CYG + G WLC C C
Sbjct: 254 RCAVCNESECDNSNAIIFCDGCDIAVHQDCYGVIFIPEGQ-WLCRRCMISKKRKTR-CLF 311
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GA K T +G W+H+ CA+WIPE MEP++G + + K RWKL C IC
Sbjct: 312 CPSTTGAFKQTDNGLWSHVLCALWIPELYFASAGHMEPVEGFDAIPKGRWKLNCYICKQK 371
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC+N C A+HP CAR AGL +E++
Sbjct: 372 MGACIQCANRNCFTAFHPTCARRAGLFMEMK 402
>gi|297303703|ref|XP_001100171.2| PREDICTED: protein Jade-3 isoform 1 [Macaca mulatta]
gi|297303705|ref|XP_002806255.1| PREDICTED: protein Jade-3 isoform 2 [Macaca mulatta]
Length = 823
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGAMKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|402909982|ref|XP_003917673.1| PREDICTED: protein Jade-3 [Papio anubis]
Length = 823
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGAMKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|149744467|ref|XP_001491347.1| PREDICTED: protein Jade-3 [Equus caballus]
Length = 824
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/181 (41%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 204 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 259
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C V
Sbjct: 260 PKKGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCSLCKVK 319
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 320 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 367
Query: 578 Q 578
Q
Sbjct: 368 Q 368
>gi|40389487|tpe|CAE30497.1| TPA: Jade1S protein [Mus musculus]
Length = 510
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 207 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 262
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 263 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 322
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 323 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 369
>gi|449499720|ref|XP_004176758.1| PREDICTED: LOW QUALITY PROTEIN: protein Jade-1 [Taeniopygia
guttata]
Length = 971
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 96/179 (53%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 333 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVR---PKCLLC 388
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 389 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 448
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 449 VGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 495
>gi|299744607|ref|XP_001831141.2| bromodomain and PHD finger-containing protein 3 [Coprinopsis
cinerea okayama7#130]
gi|298406206|gb|EAU90763.2| bromodomain and PHD finger-containing protein 3 [Coprinopsis
cinerea okayama7#130]
Length = 1145
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/178 (40%), Positives = 94/178 (52%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 134 CAICDDSEGENTNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPENPVQCILC 191
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T G W HL CAIW+PET + + MEPI G ++SK RWKL CS+C
Sbjct: 192 PNEGGAFKQTVTGDWVHLLCAIWVPETRVANEVFMEPITGGEQISKQRWKLKCSLCEQRG 251
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC+ +C VA+H CAR L LL + + L ++C++H
Sbjct: 252 GACIQCAKPSCFVAFHTTCARQEKL-----------LLPMKSTPGAEPATLQAYCERH 298
>gi|353234765|emb|CCA66787.1| related to peregrin (bromodomain and PHD finger-containing protein
1) [Piriformospora indica DSM 11827]
Length = 1076
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/178 (41%), Positives = 96/178 (53%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC E +N + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 129 CAVCDDGEGENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPEAPVSCLLC 186
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K T+ G WAHL CAIWIPE + + MEPI+ + +SK RW+L CSIC
Sbjct: 187 PNEGGAFKQTSSGHWAHLLCAIWIPEVVVQNQVFMEPIEHIENISKSRWRLRCSICKEPK 246
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC +C A+H CAR L + D E+Q +R +FC++H
Sbjct: 247 GACIQCDIKSCYSAFHVSCARKQKF---------LCSMKTLPDQEEQPLR--AFCERH 293
>gi|393244276|gb|EJD51788.1| hypothetical protein AURDEDRAFT_82629, partial [Auricularia
delicata TFB-10046 SS5]
Length = 1070
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/178 (41%), Positives = 100/178 (56%), Gaps = 13/178 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E +N+ + CD C + VH CYG + + WLC C +PE P C LC
Sbjct: 135 CTICDDGEGENSNVIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCT-VSPESPVSCVLC 192
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GGA K TT G+W HL CAIWIPE + ++ MEPI+ +NR+ K R KL CSIC +
Sbjct: 193 PNEGGAFKQTTTGQWVHLLCAIWIPEVSVGNMTFMEPIEHVNRIPKSRLKLTCSICRLR- 251
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
G CIQC N +C A+H CAR L ++ +L +E +R +FC+KH
Sbjct: 252 GPCIQCDNKSCFAAFHVTCARQEKLLAPMK--------ALPGVEEAPPLR--AFCEKH 299
>gi|385305977|gb|EIF49918.1| putative compass histone methyltransferase subunit set1p [Dekkera
bruxellensis AWRI1499]
Length = 1104
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 63/145 (43%), Positives = 97/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+ EY GE +R +A+ RE S +G+ +Y+F
Sbjct: 961 QKPVQFARSAIHNWGLYAXEPIAAKEMIXEYVGERIRQKVAEVREKRYLKSGIGS-SYLF 1019
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+DD VIDA++ G IA INH C+P+C +++I V+G + I+I+A RDI EELTYDY+
Sbjct: 1020 RVDDNTVIDASKKGGIARFINHCCDPSCTAKIIKVDGKKRIVIYALRDIAANEELTYDYK 1079
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 1080 FEKETNPEERIPCLCGAPNCKGYLN 1104
>gi|156382583|ref|XP_001632632.1| predicted protein [Nematostella vectensis]
gi|156219691|gb|EDO40569.1| predicted protein [Nematostella vectensis]
Length = 813
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 66/150 (44%), Positives = 85/150 (56%), Gaps = 2/150 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG G WLC C +P C LC
Sbjct: 260 CCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEG-QWLCRRCL-QSPSRVVDCVLC 317
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GA K T DGRW H++CA+WIPE C + +EPID ++ + RWKL C IC
Sbjct: 318 PNKTGAFKQTDDGRWGHVSCALWIPEVCFANTVFLEPIDSIDNIPSARWKLTCYICKRRQ 377
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC T C A+H CA+ AGL +++E
Sbjct: 378 GACIQCFKTNCYTAFHVTCAQQAGLYMKIE 407
>gi|384486382|gb|EIE78562.1| hypothetical protein RO3G_03266 [Rhizopus delemar RA 99-880]
Length = 614
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/247 (35%), Positives = 116/247 (46%), Gaps = 30/247 (12%)
Query: 330 FSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPV 389
+ E EK+ E +D+F ++ K L K+ P K S+ P
Sbjct: 175 LNNERRKEKLGELNADLFENIIDQLEKEWFDLVKNLP-------KRVSEELTFPED---- 223
Query: 390 RVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAP 449
KC++C E +N + CD C + VH CYG + + WLC C +P
Sbjct: 224 -------SKCAICDDGECENSNAIVFCDGCNLAVHQDCYG-VPYIPEGQWLCRKCM-VSP 274
Query: 450 EPPPPCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKL 509
+ P C CP GGA K TT +W HL CAIWIPE L + MEPID + + K RWKL
Sbjct: 275 DKPVSCIFCPTEGGAFKQTTTNQWGHLLCAIWIPEVSLGNSVYMEPIDNIANIPKSRWKL 334
Query: 510 LCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRL 569
C IC GACIQC N C A+H CAR A L ++++ + DD + L
Sbjct: 335 TCYICRRRQGACIQCDNKHCFTAFHVTCARWARLYMKMKPPNS------HYDD----VAL 384
Query: 570 LSFCKKH 576
+FC KH
Sbjct: 385 KAFCDKH 391
>gi|41056011|ref|NP_957310.1| peregrin [Danio rerio]
gi|27881884|gb|AAH44418.1| Bromodomain and PHD finger containing, 1 [Danio rerio]
Length = 899
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C P +P C LC
Sbjct: 309 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRC-PQSPSRAVDCALC 366
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 367 PNKGGAFKQTDDSRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 426
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 427 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFS-------VRKTAYCDI 479
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 480 HTPPGSAR 487
>gi|19923609|ref|NP_079176.2| protein Jade-1 short isoform [Homo sapiens]
gi|114596017|ref|XP_001158066.1| PREDICTED: protein Jade-1 isoform 1 [Pan troglodytes]
gi|332231043|ref|XP_003264703.1| PREDICTED: protein Jade-1 isoform 3 [Nomascus leucogenys]
gi|14042423|dbj|BAB55239.1| unnamed protein product [Homo sapiens]
gi|21595374|gb|AAH32376.1| PHF17 protein [Homo sapiens]
gi|22324559|gb|AAM95612.1| PHD protein Jade-1 [Homo sapiens]
gi|119625578|gb|EAX05173.1| PHD finger protein 17, isoform CRA_d [Homo sapiens]
gi|119625581|gb|EAX05176.1| PHD finger protein 17, isoform CRA_d [Homo sapiens]
gi|123981338|gb|ABM82498.1| PHD finger protein 17 [synthetic construct]
gi|123996181|gb|ABM85692.1| PHD finger protein 17 [synthetic construct]
gi|307684870|dbj|BAJ20475.1| PHD finger protein 17 [synthetic construct]
gi|383408775|gb|AFH27601.1| protein Jade-1 short isoform [Macaca mulatta]
gi|384940482|gb|AFI33846.1| protein Jade-1 short isoform [Macaca mulatta]
gi|410210266|gb|JAA02352.1| PHD finger protein 17 [Pan troglodytes]
gi|410260570|gb|JAA18251.1| PHD finger protein 17 [Pan troglodytes]
gi|410289290|gb|JAA23245.1| PHD finger protein 17 [Pan troglodytes]
gi|410337895|gb|JAA37894.1| PHD finger protein 17 [Pan troglodytes]
Length = 509
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|260944792|ref|XP_002616694.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
gi|238850343|gb|EEQ39807.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
Length = 469
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 114/188 (60%), Gaps = 8/188 (4%)
Query: 656 GNTLPSIRVI---GSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSGIHGFGIF 711
GN+L S RV +F+ + + +D T RK+ +++ +S IH +G++
Sbjct: 283 GNSLQSSRVNRANNRRFAADISAQKQILSTESDILNLNALTKRKKPVSYARSAIHNWGLY 342
Query: 712 AKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIA 771
A P A +M+IEY GE +R +A+ RE + +G+ +Y+FRID+ VIDAT+ G IA
Sbjct: 343 ALEPIAAKEMIIEYVGESIRQQVAEHREKSYLRTGIGS-SYLFRIDENTVIDATKKGGIA 401
Query: 772 HLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSIDEQLACYCGF 828
INH C P+C +++I V+G + I+I+A RDI+ EELTYDY+F + E++ C CG
Sbjct: 402 RFINHCCNPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERETNDAERIRCLCGA 461
Query: 829 PRCRGVVN 836
P C+G +N
Sbjct: 462 PGCKGYLN 469
>gi|334331316|ref|XP_001365299.2| PREDICTED: protein Jade-1 [Monodelphis domestica]
Length = 877
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 96/179 (53%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 237 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 292
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 293 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 352
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 353 IGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 399
>gi|62752055|ref|NP_001015559.1| protein Jade-1 [Bos taurus]
gi|75057838|sp|Q5E9T7.1|JADE1_BOVIN RecName: Full=Protein Jade-1; AltName: Full=PHD finger protein 17
gi|59858031|gb|AAX08850.1| Jade1 protein short isoform [Bos taurus]
Length = 509
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/187 (39%), Positives = 101/187 (54%), Gaps = 17/187 (9%)
Query: 391 VDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPE 450
+++ + C VC + N + CDKC + VH CYG L+ G WLC C G
Sbjct: 198 IEYDEYVVCDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ- 255
Query: 451 PPPPCCLCPVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKL 509
P C LCP GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L
Sbjct: 256 --PKCLLCPKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWAL 313
Query: 510 LCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRL 569
+CS+C +GA IQCS CR A+H CA GL ++ L E+DE ++
Sbjct: 314 VCSLCNEKFGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKF 361
Query: 570 LSFCKKH 576
S+C KH
Sbjct: 362 KSYCPKH 368
>gi|296478725|tpg|DAA20840.1| TPA: protein Jade-1 [Bos taurus]
Length = 509
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|431899699|gb|ELK07653.1| Protein Jade-1 [Pteropus alecto]
Length = 670
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 36 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGLQ---PKCLLC 91
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 92 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 151
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 152 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 198
>gi|408391029|gb|EKJ70413.1| hypothetical protein FPSE_09407 [Fusarium pseudograminearum CS3096]
Length = 1263
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 98/154 (63%), Gaps = 5/154 (3%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K+ +K+ +K + F +S IH +G++A DM+IEY GE VR I++ RE+ S
Sbjct: 1112 KFNQLKKR-KKPVKFARSAIHNWGLYAMENIAKDDMIIEYVGEQVRQQISEIRENRYLKS 1170
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FRIDD VIDAT+ G IA INHSC PNC +++I V G + I+I+A RDI
Sbjct: 1171 GIGS-SYLFRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAL 1229
Query: 806 WEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
EELTYDY+F +++ C CG C+G +N
Sbjct: 1230 NEELTYDYKFEREIGSTDRIPCLCGTAACKGFLN 1263
>gi|358338843|dbj|GAA57433.1| histone-lysine N-methyltransferase trithorax, partial [Clonorchis
sinensis]
Length = 328
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/140 (50%), Positives = 90/140 (64%), Gaps = 2/140 (1%)
Query: 697 RLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRI 756
R+A S IHG G+FA R +MVIEY GEL+R + + RE I G YMFRI
Sbjct: 191 RVAVHPSHIHGRGLFALRGFREDEMVIEYMGELIRNFVCETRE--IRYRSAGVDCYMFRI 248
Query: 757 DDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFF 816
D + VIDAT AG+ A INHSC+PNCY++V++V+ +HI+I A+R I EELTYDYRF
Sbjct: 249 DSDLVIDATYAGNAARFINHSCDPNCYAKVVTVDDKKHIVILAQRRIYPGEELTYDYRFP 308
Query: 817 SIDEQLACYCGFPRCRGVVN 836
++L C CG CR +N
Sbjct: 309 KESDKLLCNCGSYNCRKYLN 328
>gi|320593249|gb|EFX05658.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 1450
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 94/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A DM+IEY G+ VR S+A RE + +G+ +Y+F
Sbjct: 1307 KKTVKFARSAIHNWGLYAMESIPKDDMIIEYVGQEVRQSVATIREKAYIRAGIGS-SYLF 1365
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RIDD VIDAT+ G IA INHSC PNC +++I V G + I+I+A RDI Q EELTYDY+
Sbjct: 1366 RIDDGTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYK 1425
Query: 815 FFSID---EQLACYCGFPRCRGVVN 836
F D +++ C CG C+G +N
Sbjct: 1426 FEPEDNPEDRVPCLCGTTACKGFLN 1450
>gi|405962579|gb|EKC28243.1| Peregrin [Crassostrea gigas]
Length = 1329
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 86/151 (56%), Gaps = 3/151 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E +N+ L CD C + VH CYG + + WLC C +P CCLC
Sbjct: 270 CNICQDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCCLC 327
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE + +EPID + + RWKL C IC
Sbjct: 328 PNKGGAFKQTDDGRWAHVVCALWIPEVGFANTVFLEPIDSFDNIPPARWKLSCYICKQRG 387
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC T C A+H CA+ AGL +++E
Sbjct: 388 KGACIQCHKTNCYTAFHVTCAQQAGLYMKIE 418
>gi|410516926|sp|Q4I5R3.2|SET1_GIBZE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
Length = 1263
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 98/154 (63%), Gaps = 5/154 (3%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K+ +K+ +K + F +S IH +G++A DM+IEY GE VR I++ RE+ S
Sbjct: 1112 KFNQLKKR-KKPVKFARSAIHNWGLYAMENIAKDDMIIEYVGEQVRQQISEIRENRYLKS 1170
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FRIDD VIDAT+ G IA INHSC PNC +++I V G + I+I+A RDI
Sbjct: 1171 GIGS-SYLFRIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAL 1229
Query: 806 WEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
EELTYDY+F +++ C CG C+G +N
Sbjct: 1230 NEELTYDYKFEREIGSTDRIPCLCGTAACKGFLN 1263
>gi|444721953|gb|ELW62660.1| Protein Jade-1 [Tupaia chinensis]
Length = 690
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 387 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 442
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 443 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 502
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 503 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 549
>gi|86129848|gb|ABC86576.1| KIAA0339 protein [Danio rerio]
Length = 406
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 130/235 (55%), Gaps = 22/235 (9%)
Query: 607 SGCARSEPYNYFGRRGRKEPEALAAASLKRLFVE-NQPYL---VGGYCQNGLSGNTLPSI 662
+GCARSE Y R+ K +++E +QP +G Y G S L
Sbjct: 189 TGCARSEGYYAISRKE------------KDVYLELDQPVTLREIGDYDTAG-SNRVLSER 235
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDM 721
R + ++ P + +D K + FRK+ L FG+S IH +G+FA P A +M
Sbjct: 236 RSEQRRLLSAI--GTPAVMD-SDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEM 292
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEY G+ +R +AD E +G+ +Y+FR+D + +IDAT+ G++A INH C PN
Sbjct: 293 VIEYVGQSIRQMVADNWEKRYAQEGIGS-SYLFRVDHDTIIDATKCGNLARFINHCCTPN 351
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
CY++VI++ + I+I++K+ I EE+TYDY+F + ++ C CG CRG +N
Sbjct: 352 CYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPIEENKIPCLCGTESCRGTLN 406
>gi|365983760|ref|XP_003668713.1| hypothetical protein NDAI_0B04360 [Naumovozyma dairenensis CBS 421]
gi|343767480|emb|CCD23470.1| hypothetical protein NDAI_0B04360 [Naumovozyma dairenensis CBS 421]
Length = 1105
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 63/145 (43%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R ++A+ RE + +G+ +Y+F
Sbjct: 962 KKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQAVAEMRERRYLKNGIGS-SYLF 1020
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+P+C +++I V G I+I+A RDI EELTYDY+
Sbjct: 1021 RVDENNVIDATKKGGIARFINHCCDPSCTAKIIKVGGRRRIVIYALRDIAANEELTYDYK 1080
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C+CG C+G +N
Sbjct: 1081 FEREIDAEERLPCHCGSVNCKGFLN 1105
>gi|427788533|gb|JAA59718.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 1073
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/158 (44%), Positives = 87/158 (55%), Gaps = 3/158 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C E +N L CD C + VH CYG G WLC C +P C LC
Sbjct: 249 CAICSDGECQNSNAILFCDMCNLAVHQECYGVPYIPEG-QWLCRRCL-QSPSRAVDCVLC 306
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+W+PE C + +EPID LN + RWKL C IC
Sbjct: 307 PNKGGAFKQTDDGRWAHVVCALWVPEVCFANTVFLEPIDSLNNIPAARWKLTCYICKQRG 366
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNL 555
GACIQC C A+H CA+ AGL + LE+ L++
Sbjct: 367 VGACIQCHRANCYTAFHVTCAQQAGLYMRLEEATGLHV 404
>gi|256076153|ref|XP_002574378.1| mixed-lineage leukemia protein mll [Schistosoma mansoni]
gi|350644878|emb|CCD60414.1| mixed-lineage leukemia protein, mll, putative [Schistosoma mansoni]
Length = 1560
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 117/239 (48%), Gaps = 24/239 (10%)
Query: 605 NPSGCARSEP--YNYFGRRGRKEPEALA---AASLKRLFVENQPYLVGGYCQNGLSGNTL 659
NPSGCARSEP Y R +++ SL + + P G C NG
Sbjct: 1339 NPSGCARSEPKMQTYVKRMHDAGEYSISFNHRTSLPGVRRLSHPSFNGTTCSNG------ 1392
Query: 660 PSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAG 719
V + H+ S + +Y+ +K + G+S I G G+FA
Sbjct: 1393 ----VYNIDTASKQHQS-----SRSQQYRKLKTEVNSNVILGRSRIQGLGLFAARDLEQQ 1443
Query: 720 DMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCE 779
MVIEY GEL+R IA++RE G YMFR+DD+ VIDAT G +A INHSC+
Sbjct: 1444 TMVIEYVGELIRLEIANKREKNY--EAHNRGIYMFRLDDDTVIDATVCGGLARYINHSCQ 1501
Query: 780 PNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID--EQLACYCGFPRCRGVVN 836
PNC + ++ G HI+I R IK+ EEL YDY F D +++ C C P CR +N
Sbjct: 1502 PNCLAEFVNFGGHSHIVIITNRRIKKGEELCYDYNFDLEDRSDKIPCLCRAPNCRKWMN 1560
>gi|209877148|ref|XP_002140016.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209555622|gb|EEA05667.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 2678
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/186 (40%), Positives = 111/186 (59%), Gaps = 5/186 (2%)
Query: 656 GNTLPSIRVIGSKFSFSLHRDAPNFLSMADKYKHMKETF-RKRLAFGKSGIHGFGIFAKH 714
NT ++ S R ++ A Y++++ KRL KS IHG+G+FAK
Sbjct: 2493 NNTFKHSNILNISNRISRRRTKAEDMTPAMAYRYLESLPPDKRLDVKKSKIHGYGLFAKE 2552
Query: 715 PHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT-YMFRIDDERVIDATRAGSIAHL 773
+ ++EY GE++R S+AD+RE G+ YMFR+D+ RVIDAT G++A
Sbjct: 2553 CIHPNEPIVEYVGEVIRNSVADKREKIYEQEGQRDGSCYMFRLDEHRVIDATNCGNLARF 2612
Query: 774 INHSCEPNCYSRVI-SVNGDEHIIIFAKRDIKQWEELTYDYRFF--SIDEQLACYCGFPR 830
+NH C+PNC +VI S +G++HI+IF+K +IK EE+TYDY+F E++ CYCG P
Sbjct: 2613 MNHCCQPNCVCKVITSDDGNKHIVIFSKTEIKPDEEVTYDYQFNLEEESEKIYCYCGAPN 2672
Query: 831 CRGVVN 836
C G +N
Sbjct: 2673 CLGRMN 2678
>gi|291407397|ref|XP_002719927.1| PREDICTED: PHD finger protein 16 [Oryctolagus cuniculus]
Length = 823
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/181 (41%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 259 PKKGGAMKSTRTGSKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCSLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFRSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|119625579|gb|EAX05174.1| PHD finger protein 17, isoform CRA_e [Homo sapiens]
Length = 495
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 192 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 247
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 248 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 307
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 308 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 354
>gi|451848788|gb|EMD62093.1| hypothetical protein COCSADRAFT_162605 [Cochliobolus sativus ND90Pr]
Length = 1180
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/146 (46%), Positives = 96/146 (65%), Gaps = 5/146 (3%)
Query: 695 RKRLA-FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYM 753
RK+L F +S IH +G++A+ P DM+IEY GE VR +AD RE VG+ +Y+
Sbjct: 1036 RKKLVKFDRSAIHNWGLYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGS-SYL 1094
Query: 754 FRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY 813
FRID++ VIDAT+ G IA INHSC PNC +++I V+ + I+I+A RDI+ EELTYDY
Sbjct: 1095 FRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRVDNTKRIVIYALRDIQSDEELTYDY 1154
Query: 814 RF---FSIDEQLACYCGFPRCRGVVN 836
+F +++ C CG C+G +N
Sbjct: 1155 KFEREIDATDRIPCLCGSIGCKGFLN 1180
>gi|451998610|gb|EMD91074.1| hypothetical protein COCHEDRAFT_1156416 [Cochliobolus heterostrophus
C5]
Length = 1251
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ P DM+IEY GE VR +AD RE VG+ +Y+F
Sbjct: 1108 KKLVKFDRSAIHNWGLYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGS-SYLF 1166
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID++ VIDAT+ G IA INHSC PNC +++I V+ + I+I+A RDI+ EELTYDY+
Sbjct: 1167 RIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRVDNTKRIVIYALRDIQSDEELTYDYK 1226
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +++ C CG C+G +N
Sbjct: 1227 FEREIDATDRIPCLCGSIGCKGFLN 1251
>gi|348667410|gb|EGZ07235.1| hypothetical protein PHYSODRAFT_306469 [Phytophthora sojae]
Length = 3106
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 101/158 (63%), Gaps = 3/158 (1%)
Query: 681 LSMADKYKHMKE-TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE 739
L + +Y+ ++ F +R+ KS IHG+G+F K G M++EY G+++ ++AD RE
Sbjct: 2950 LPITMQYRELRRRPFDERMLVRKSSIHGYGLFMKEAVSEGQMIVEYQGQMINQAVADERE 3009
Query: 740 HFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDE-HIIIF 798
VG+ YMFR+D++ +IDATR G++A INHSC+P ++R+++V G E I+IF
Sbjct: 3010 RRYEEQGVGS-CYMFRLDEKTIIDATRCGNLARFINHSCDPKAFARIVAVEGGEKKIVIF 3068
Query: 799 AKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
AKR I +E+TYDY+F DE + C C P C G +N
Sbjct: 3069 AKRAIAVGDEVTYDYKFPIEDEAIRCDCNAPNCIGRMN 3106
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 42/95 (44%), Gaps = 11/95 (11%)
Query: 455 CCLCPVVG-----GAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKL 509
C LC ++G G + T W H+ CAIW E + + +K R +L
Sbjct: 2295 CALCFMIGDNTACGRLLYTESETWVHVNCAIWSMEVYEDAAGVLHKCNK----AKHRSRL 2350
Query: 510 L-CSICGVSYGACIQCSNTTCRVAYHPLCARAAGL 543
+ C CG+ GA I C+ C YH CA AG+
Sbjct: 2351 VRCDACGL-MGATIGCTIARCTTHYHFPCAVDAGV 2384
>gi|324501479|gb|ADY40659.1| Protein Jade-3 [Ascaris suum]
Length = 856
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 102/192 (53%), Gaps = 12/192 (6%)
Query: 388 PVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPG 447
P + ++ + C +C + +++ + CD C + VH CYG L+ V WLC+ C
Sbjct: 366 PFQAEFDEDAFCDICRQTDYEEDDEIIFCDGCNVGVHQSCYG-LDSVPHDDWLCHACTLL 424
Query: 448 APEPPPPCCLCPVVGGAMKPTTDGR-WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDR 506
+ P C LCP+ GGAMK G+ WAH+ CA+WIPE DV EPI ++ + +R
Sbjct: 425 GYKAQPRCALCPLTGGAMKCMKGGKTWAHVVCALWIPEVRFGDVDHREPITNISDIPNER 484
Query: 507 WKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQC 566
W L CSIC GACIQCS +C A+H C G +++E + + D
Sbjct: 485 WALRCSICDTKQGACIQCSVKSCTTAFHVTCGLRKGQVMKIEHDSSV----------DDG 534
Query: 567 IRLLSFCKKHKQ 578
+R++S C+KH +
Sbjct: 535 VRMVSLCEKHSR 546
>gi|260794338|ref|XP_002592166.1| hypothetical protein BRAFLDRAFT_125138 [Branchiostoma floridae]
gi|229277381|gb|EEN48177.1| hypothetical protein BRAFLDRAFT_125138 [Branchiostoma floridae]
Length = 1891
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/235 (37%), Positives = 122/235 (51%), Gaps = 34/235 (14%)
Query: 605 NPSGCARSEP-YNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIR 663
NP+GCARSEP + RR P L +++ R Q L+G L
Sbjct: 1688 NPTGCARSEPKMRTYIRR----PHTLTSSNTSR------------SSQTTLTGELL---- 1727
Query: 664 VIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 723
S + + S + +Y+ +K+ +R + G+S I G G+FA MVI
Sbjct: 1728 ---SPYHKQFAQ------SKSAQYRKLKQEWRNNVVLGRSRIQGLGLFAAKDIDKHVMVI 1778
Query: 724 EYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 783
EY G ++R + ++REH IY G YMFRID + VIDAT AG A INHSC PNC
Sbjct: 1779 EYIGVIIRNEVCNKREH-IYEEQ-NRGVYMFRIDSDLVIDATLAGGPARYINHSCNPNCV 1836
Query: 784 SRVISVNGDEHIIIFAKRDIKQWEELTYDYRF-FSIDEQ-LACYCGFPRCRGVVN 836
+ V++ ++ III + R + + EELTYDY+F DEQ + C CG P CR +N
Sbjct: 1837 AEVVNFEKEQKIIIISSRRLSKGEELTYDYKFDIEDDEQKIPCCCGAPNCRKWMN 1891
>gi|195336239|ref|XP_002034749.1| GM14298 [Drosophila sechellia]
gi|194127842|gb|EDW49885.1| GM14298 [Drosophila sechellia]
Length = 3197
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/179 (37%), Positives = 96/179 (53%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG + + WLC C G P C LC
Sbjct: 269 CDVCRSPDSEEANEMVFCDNCNICVHQACYG-ITAIPSGQWLCRTCSMGIK---PDCVLC 324
Query: 459 PVVGGAMKPTTDGR-WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK G+ WAH++CA+WIPE + V RMEPI ++ + + RW L+C +C
Sbjct: 325 PNKGGAMKSNKSGKHWAHVSCALWIPEVSIGCVDRMEPITKISSIPQSRWSLICVLCRKR 384
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
G+CIQCS C+ AYH CA GL + ++E + + ++L S+C+KH
Sbjct: 385 VGSCIQCSVKPCKTAYHVTCAFQHGLEMR---------AIIEEGNAEDGVKLRSYCQKH 434
>gi|326928062|ref|XP_003210203.1| PREDICTED: LOW QUALITY PROTEIN: peregrin-like [Meleagris gallopavo]
Length = 1217
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/184 (39%), Positives = 95/184 (51%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 284 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 341
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 342 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 401
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 402 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGXNGTSF-------SVRQTAYCDI 454
Query: 576 HKQP 579
H P
Sbjct: 455 HTPP 458
>gi|324505992|gb|ADY42566.1| Protein Jade-3, partial [Ascaris suum]
Length = 822
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 102/192 (53%), Gaps = 12/192 (6%)
Query: 388 PVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPG 447
P + ++ + C +C + +++ + CD C + VH CYG L+ V WLC+ C
Sbjct: 332 PFQAEFDEDAFCDICRQTDYEEDDEIIFCDGCNVGVHQSCYG-LDSVPHDDWLCHACTLL 390
Query: 448 APEPPPPCCLCPVVGGAMKPTTDGR-WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDR 506
+ P C LCP+ GGAMK G+ WAH+ CA+WIPE DV EPI ++ + +R
Sbjct: 391 GYKAQPRCALCPLTGGAMKCMKGGKTWAHVVCALWIPEVRFGDVDHREPITNISDIPNER 450
Query: 507 WKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQC 566
W L CSIC GACIQCS +C A+H C G +++E + + D
Sbjct: 451 WALRCSICDTKQGACIQCSVKSCTTAFHVTCGLRKGQVMKIEHDSSV----------DDG 500
Query: 567 IRLLSFCKKHKQ 578
+R++S C+KH +
Sbjct: 501 VRMVSLCEKHSR 512
>gi|348527754|ref|XP_003451384.1| PREDICTED: protein Jade-3-like [Oreochromis niloticus]
Length = 770
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/183 (40%), Positives = 99/183 (54%), Gaps = 19/183 (10%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLCRPGAPEPPPPCCL 457
C VC + + N + CDKC + VH CYG ++ P+ WLC C G P C L
Sbjct: 205 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGIVKVPIGN--WLCRTCVLGID---PQCLL 259
Query: 458 CPVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV 516
CP GGAMK T G RWAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 260 CPQKGGAMKATRAGTRWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWSLICSLCKL 319
Query: 517 SYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS C +H CA L ++ LDE DE ++ S+C KH
Sbjct: 320 KTGACIQCSVKNCTTPFHVTCAFQHSLEMK---------TILDEGDE---VKFKSYCLKH 367
Query: 577 KQP 579
QP
Sbjct: 368 SQP 370
>gi|45552837|ref|NP_995944.1| rhinoceros, isoform B [Drosophila melanogaster]
gi|386770257|ref|NP_612007.2| rhinoceros, isoform C [Drosophila melanogaster]
gi|386770259|ref|NP_001246528.1| rhinoceros, isoform D [Drosophila melanogaster]
gi|74884270|sp|Q7YZH1.1|RNO_DROME RecName: Full=PHD finger protein rhinoceros
gi|32483351|gb|AAP84718.1| PHD zinc finger protein rhinoceros [Drosophila melanogaster]
gi|45445719|gb|AAS64921.1| rhinoceros, isoform B [Drosophila melanogaster]
gi|383291636|gb|AAF47343.2| rhinoceros, isoform C [Drosophila melanogaster]
gi|383291637|gb|AFH04199.1| rhinoceros, isoform D [Drosophila melanogaster]
Length = 3241
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/179 (37%), Positives = 96/179 (53%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG + + WLC C G P C LC
Sbjct: 315 CDVCRSPDSEEANEMVFCDNCNICVHQACYG-ITAIPSGQWLCRTCSMGIK---PDCVLC 370
Query: 459 PVVGGAMKPTTDGR-WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK G+ WAH++CA+WIPE + V RMEPI ++ + + RW L+C +C
Sbjct: 371 PNKGGAMKSNKSGKHWAHVSCALWIPEVSIGCVDRMEPITKISSIPQSRWSLICVLCRKR 430
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
G+CIQCS C+ AYH CA GL + ++E + + ++L S+C+KH
Sbjct: 431 VGSCIQCSVKPCKTAYHVTCAFQHGLEMR---------AIIEEGNAEDGVKLRSYCQKH 480
>gi|332231621|ref|XP_003264992.1| PREDICTED: peregrin isoform 3 [Nomascus leucogenys]
Length = 1119
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|297670744|ref|XP_002813518.1| PREDICTED: peregrin isoform 3 [Pongo abelii]
Length = 1119
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|367010698|ref|XP_003679850.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
gi|359747508|emb|CCE90639.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
Length = 1019
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE +G+ +Y+F
Sbjct: 876 KKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQPVAEMREIRYIKKGIGS-SYLF 934
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+P+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 935 RVDENTVIDATKKGGIARFINHCCDPSCTAKIIKVGGKKRIVIYALRDIAANEELTYDYK 994
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 995 FERETDDEERLPCLCGAPTCKGFLN 1019
>gi|426339321|ref|XP_004033599.1| PREDICTED: peregrin isoform 3 [Gorilla gorilla gorilla]
Length = 1119
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|328353814|emb|CCA40211.1| Peregrin [Komagataella pastoris CBS 7435]
Length = 643
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 102/200 (51%), Gaps = 11/200 (5%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC+ + NN+ + CD C + VH CYG G W C C A P C C
Sbjct: 196 CAVCNESDCDVNNVIIFCDGCNIAVHQECYGITFIPEGP-WFCRRC-IIAKGAPKRCQFC 253
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P V GA K T G W+H+ C +WI E + MEPI+G + + RWKL C IC +
Sbjct: 254 PSVTGAFKQTDTGSWSHIICGLWINELYFANPIYMEPIEGTQLIPRSRWKLKCFICKLKI 313
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ 578
GACIQCSN C AYH CA+ AGL ++L + L + L+S+C KH
Sbjct: 314 GACIQCSNKNCFTAYHVTCAKRAGLFLDLSKGLQACL--------NNPRYLVSYCDKHSP 365
Query: 579 -PLNDRLAVDERLVQVTRRC 597
P N +L +D+ + + R C
Sbjct: 366 IPSNGKLDLDQGICRTRRYC 385
>gi|403217274|emb|CCK71769.1| hypothetical protein KNAG_0H03550 [Kazachstania naganishii CBS 8797]
Length = 1111
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 64/142 (45%), Positives = 95/142 (66%), Gaps = 4/142 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE S +G+ +Y+F
Sbjct: 970 KKPVTFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQPVAEMREMRYLKSGIGS-SYLF 1028
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+P+C +++I V G I+I+A RDI + EELTYDY+
Sbjct: 1029 RVDENNVIDATKRGGIARFINHCCDPSCTAKIIKVGGKRRIVIYALRDIGKNEELTYDYK 1088
Query: 815 F---FSIDEQLACYCGFPRCRG 833
F +E+L C+CG P C+G
Sbjct: 1089 FEREQDDEERLPCHCGAPTCKG 1110
>gi|403270350|ref|XP_003927149.1| PREDICTED: peregrin isoform 3 [Saimiri boliviensis boliviensis]
Length = 1119
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|397486341|ref|XP_003814288.1| PREDICTED: peregrin isoform 3 [Pan paniscus]
Length = 1119
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|402859462|ref|XP_003894178.1| PREDICTED: peregrin isoform 3 [Papio anubis]
Length = 1119
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFS-------VRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|367004711|ref|XP_003687088.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
gi|357525391|emb|CCE64654.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
Length = 1030
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 65/146 (44%), Positives = 96/146 (65%), Gaps = 6/146 (4%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE-HFIYNSLVGAGTYM 753
+K + F +S IH +G++A P A +M+IEY GE +R +++ RE +I N + +Y+
Sbjct: 887 KKPVTFARSAIHNWGLYALQPIAAKEMIIEYVGERIRQPVSEMREIRYIKNGI--GSSYL 944
Query: 754 FRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY 813
FRID+ VIDAT+ G IA INH C+P+C +++I V G + I+I+A RDI EELTYDY
Sbjct: 945 FRIDENTVIDATKKGGIARFINHCCDPSCTAKIIKVGGKKRIVIYALRDIDVNEELTYDY 1004
Query: 814 RFFSID---EQLACYCGFPRCRGVVN 836
+F + E+L C CG P C+G +N
Sbjct: 1005 KFEREEDDQERLPCLCGAPNCKGFLN 1030
>gi|297285317|ref|XP_002802755.1| PREDICTED: peregrin [Macaca mulatta]
Length = 1119
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|440896054|gb|ELR48091.1| Peregrin [Bos grunniens mutus]
Length = 1203
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|395733472|ref|XP_002813517.2| PREDICTED: peregrin isoform 2 [Pongo abelii]
Length = 1214
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|350591400|ref|XP_003483259.1| PREDICTED: peregrin [Sus scrofa]
Length = 1220
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|158255876|dbj|BAF83909.1| unnamed protein product [Homo sapiens]
Length = 823
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGA+K T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGALKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|7662006|ref|NP_055550.1| protein Jade-3 [Homo sapiens]
gi|117190334|ref|NP_001070913.1| protein Jade-3 [Homo sapiens]
gi|34098663|sp|Q92613.1|JADE3_HUMAN RecName: Full=Protein Jade-3; AltName: Full=PHD finger protein 16
gi|5805248|gb|AAD51905.1| unknown [Homo sapiens]
gi|40389499|tpe|CAE30502.1| TPA: JADE3 protein [Homo sapiens]
gi|109730315|gb|AAI13881.1| PHD finger protein 16 [Homo sapiens]
gi|109731640|gb|AAI14488.1| PHD finger protein 16 [Homo sapiens]
gi|119579681|gb|EAW59277.1| PHD finger protein 16, isoform CRA_a [Homo sapiens]
gi|119579682|gb|EAW59278.1| PHD finger protein 16, isoform CRA_a [Homo sapiens]
gi|168274481|dbj|BAG09660.1| PHD finger protein 16 [synthetic construct]
Length = 823
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGA+K T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGALKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|332231619|ref|XP_003264991.1| PREDICTED: peregrin isoform 2 [Nomascus leucogenys]
Length = 1220
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|426395709|ref|XP_004064104.1| PREDICTED: protein Jade-3 [Gorilla gorilla gorilla]
Length = 823
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGA+K T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGALKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|396457972|ref|XP_003833599.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
gi|312210147|emb|CBX90234.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
Length = 1372
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 101/155 (65%), Gaps = 7/155 (4%)
Query: 686 KYKHMKETFRKRLA-FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYN 744
++ +K+ RK+L F +S IH +G++A+ A DM+IEY GE VR +AD RE
Sbjct: 1221 RFNQLKK--RKKLVKFDRSAIHNWGLYAEENIAANDMIIEYVGEKVRQRVADLREAKYDQ 1278
Query: 745 SLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIK 804
VG+ +Y+FRID++ VIDAT+ G IA INHSC PNC +++I V+ + I+I+A RDI
Sbjct: 1279 QGVGS-SYLFRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRVDTSKRIVIYALRDIS 1337
Query: 805 QWEELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
Q EELTYDY+F +++ C CG C+G +N
Sbjct: 1338 QDEELTYDYKFEREMDATDRIPCLCGSIGCKGFLN 1372
>gi|395753895|ref|XP_002831611.2| PREDICTED: LOW QUALITY PROTEIN: protein Jade-3 [Pongo abelii]
Length = 859
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 239 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 294
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGA+K T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 295 PKKGGALKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 354
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 355 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 402
Query: 578 Q 578
Q
Sbjct: 403 Q 403
>gi|357627321|gb|EHJ77057.1| putative PHD finger protein [Danaus plexippus]
Length = 2594
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 71/181 (39%), Positives = 96/181 (53%), Gaps = 18/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CD C + VH CYG + + WLC C G P C LC
Sbjct: 182 CDVCRSPDSEDGNEMVFCDSCNICVHQACYG-ITVIPDGQWLCRPCGAGIR---PTCVLC 237
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P +GGAMK T G +WAH++C +WIPE + ++MEPI + + RW L+C +C
Sbjct: 238 PNLGGAMKCTPSGHKWAHVSCVLWIPEVSIGCAEKMEPITKITSIPASRWSLVCVLCRER 297
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVE--LEDEDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQCS TC+ AYH CA GL + +EDE + D ++L S+C+K
Sbjct: 298 KGACIQCSVKTCKTAYHVTCAFKHGLEMRAIIEDE-----------NADDGVKLRSYCQK 346
Query: 576 H 576
H
Sbjct: 347 H 347
>gi|291412484|ref|XP_002722501.1| PREDICTED: bromodomain and PHD finger-containing protein 1 isoform
2 [Oryctolagus cuniculus]
Length = 1213
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|441673872|ref|XP_004092473.1| PREDICTED: LOW QUALITY PROTEIN: protein Jade-3 [Nomascus
leucogenys]
Length = 823
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGA+K T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGALKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|189205485|ref|XP_001939077.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187975170|gb|EDU41796.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1274
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 68/146 (46%), Positives = 95/146 (65%), Gaps = 5/146 (3%)
Query: 695 RKRLA-FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYM 753
RK+L F +S IH +G++A+ P DM+IEY GE VR +AD RE VG+ +Y+
Sbjct: 1130 RKKLVKFDRSAIHNWGLYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGS-SYL 1188
Query: 754 FRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY 813
FRID++ VIDAT+ G IA INHSC PNC +++I V+ + I+I+A RDI EELTYDY
Sbjct: 1189 FRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRVDNTKRIVIYALRDINSDEELTYDY 1248
Query: 814 RF---FSIDEQLACYCGFPRCRGVVN 836
+F +++ C CG C+G +N
Sbjct: 1249 KFEREMDATDRIPCLCGSIGCKGFLN 1274
>gi|350591398|ref|XP_001928420.3| PREDICTED: peregrin isoform 2 [Sus scrofa]
Length = 1214
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|410951644|ref|XP_003982503.1| PREDICTED: peregrin isoform 1 [Felis catus]
Length = 1214
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|403270346|ref|XP_003927147.1| PREDICTED: peregrin isoform 1 [Saimiri boliviensis boliviensis]
Length = 1214
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|332231617|ref|XP_003264990.1| PREDICTED: peregrin isoform 1 [Nomascus leucogenys]
Length = 1214
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|397476684|ref|XP_003809723.1| PREDICTED: protein Jade-3 [Pan paniscus]
gi|410212160|gb|JAA03299.1| PHD finger protein 16 [Pan troglodytes]
gi|410303128|gb|JAA30164.1| PHD finger protein 16 [Pan troglodytes]
Length = 823
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGA+K T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKKGGALKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|296225800|ref|XP_002758655.1| PREDICTED: peregrin isoform 3 [Callithrix jacchus]
Length = 1119
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|74224125|dbj|BAE33691.1| unnamed protein product [Mus musculus]
Length = 1247
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 445
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 446 HTPPGSAR 453
>gi|402859460|ref|XP_003894177.1| PREDICTED: peregrin isoform 2 [Papio anubis]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|402859458|ref|XP_003894176.1| PREDICTED: peregrin isoform 1 [Papio anubis]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|383421479|gb|AFH33953.1| peregrin isoform 2 [Macaca mulatta]
Length = 1213
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|287580776|gb|ADC41872.1| MIP14416p [Drosophila melanogaster]
Length = 1149
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 68/179 (37%), Positives = 96/179 (53%), Gaps = 14/179 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD C + VH CYG + + WLC C G P C LC
Sbjct: 274 CDVCRSPDSEEANEMVFCDNCNICVHQACYG-ITAIPSGQWLCRTCSMGIK---PDCVLC 329
Query: 459 PVVGGAMKPTTDGR-WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK G+ WAH++CA+WIPE + V RMEPI ++ + + RW L+C +C
Sbjct: 330 PNKGGAMKSNKSGKHWAHVSCALWIPEVSIGCVDRMEPITKISSIPQSRWSLICVLCRKR 389
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
G+CIQCS C+ AYH CA GL + ++E + + ++L S+C+KH
Sbjct: 390 VGSCIQCSVKPCKTAYHVTCAFQHGLEMR---------AIIEEGNAEDGVKLRSYCQKH 439
>gi|254574498|ref|XP_002494358.1| Subunit of the NuA3 histone acetyltransferase complex that
acetylates histone H3 [Komagataella pastoris GS115]
gi|238034157|emb|CAY72179.1| Subunit of the NuA3 histone acetyltransferase complex that
acetylates histone H3 [Komagataella pastoris GS115]
Length = 622
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 102/200 (51%), Gaps = 11/200 (5%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC+ + NN+ + CD C + VH CYG G W C C A P C C
Sbjct: 175 CAVCNESDCDVNNVIIFCDGCNIAVHQECYGITFIPEGP-WFCRRC-IIAKGAPKRCQFC 232
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P V GA K T G W+H+ C +WI E + MEPI+G + + RWKL C IC +
Sbjct: 233 PSVTGAFKQTDTGSWSHIICGLWINELYFANPIYMEPIEGTQLIPRSRWKLKCFICKLKI 292
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ 578
GACIQCSN C AYH CA+ AGL ++L + L + L+S+C KH
Sbjct: 293 GACIQCSNKNCFTAYHVTCAKRAGLFLDLSKGLQACL--------NNPRYLVSYCDKHSP 344
Query: 579 -PLNDRLAVDERLVQVTRRC 597
P N +L +D+ + + R C
Sbjct: 345 IPSNGKLDLDQGICRTRRYC 364
>gi|109034690|ref|XP_001094152.1| PREDICTED: peregrin isoform 4 [Macaca mulatta]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|19584408|emb|CAD28495.1| hypothetical protein [Homo sapiens]
gi|119584380|gb|EAW63976.1| bromodomain and PHD finger containing, 1, isoform CRA_e [Homo
sapiens]
Length = 1213
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|73984715|ref|XP_861729.1| PREDICTED: peregrin isoform 4 [Canis lupus familiaris]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|30794210|ref|NP_084454.1| peregrin [Mus musculus]
gi|28277047|gb|AAH46521.1| Bromodomain and PHD finger containing, 1 [Mus musculus]
Length = 1246
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 445
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 446 HTPPGSAR 453
>gi|297670740|ref|XP_002813516.1| PREDICTED: peregrin isoform 1 [Pongo abelii]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|296225796|ref|XP_002758653.1| PREDICTED: peregrin isoform 1 [Callithrix jacchus]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|40788914|dbj|BAA13205.2| KIAA0215 [Homo sapiens]
Length = 857
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 100/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 237 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIY---PQCVLC 292
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGA+K T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 293 PKKGGALKTTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCNLCKLK 352
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA GL ++ LDE DE ++ S+C KH
Sbjct: 353 TGACIQCSIKSCITAFHVTCAFEHGLEMK---------TILDEGDE---VKFKSYCLKHS 400
Query: 578 Q 578
Q
Sbjct: 401 Q 401
>gi|296225798|ref|XP_002758654.1| PREDICTED: peregrin isoform 2 [Callithrix jacchus]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|146422003|ref|XP_001486944.1| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
Length = 1055
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 912 KKPVTFARSAIHNWGLYALESIAAKEMIIEYVGESIRQQVAEHREKSYLKTGIGS-SYLF 970
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 971 RIDENSVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1030
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + DE++ C CG P C+G +N
Sbjct: 1031 FERETNDDERIRCLCGAPGCKGYLN 1055
>gi|51173720|ref|NP_001003694.1| peregrin isoform 1 [Homo sapiens]
gi|426339319|ref|XP_004033598.1| PREDICTED: peregrin isoform 2 [Gorilla gorilla gorilla]
gi|31753086|gb|AAH53851.1| Bromodomain and PHD finger containing, 1 [Homo sapiens]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|426249198|ref|XP_004018337.1| PREDICTED: peregrin isoform 2 [Ovis aries]
Length = 1213
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|397486337|ref|XP_003814286.1| PREDICTED: peregrin isoform 1 [Pan paniscus]
gi|410209968|gb|JAA02203.1| bromodomain and PHD finger containing, 1 [Pan troglodytes]
gi|410266264|gb|JAA21098.1| bromodomain and PHD finger containing, 1 [Pan troglodytes]
gi|410292246|gb|JAA24723.1| bromodomain and PHD finger containing, 1 [Pan troglodytes]
gi|410335057|gb|JAA36475.1| bromodomain and PHD finger containing, 1 [Pan troglodytes]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|109034684|ref|XP_001094039.1| PREDICTED: peregrin isoform 3 [Macaca mulatta]
gi|355559465|gb|EHH16193.1| Bromodomain and PHD finger-containing protein 1 [Macaca mulatta]
gi|355746540|gb|EHH51154.1| Bromodomain and PHD finger-containing protein 1 [Macaca
fascicularis]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|51173722|ref|NP_004625.2| peregrin isoform 2 [Homo sapiens]
gi|426339317|ref|XP_004033597.1| PREDICTED: peregrin isoform 1 [Gorilla gorilla gorilla]
gi|116241271|sp|P55201.2|BRPF1_HUMAN RecName: Full=Peregrin; AltName: Full=Bromodomain and PHD
finger-containing protein 1; AltName: Full=Protein Br140
gi|6630865|gb|AAF19605.1| putative 8-hydroxyguanine DNA glycosylase [Homo sapiens]
gi|119584377|gb|EAW63973.1| bromodomain and PHD finger containing, 1, isoform CRA_b [Homo
sapiens]
gi|168275726|dbj|BAG10583.1| peregrin [synthetic construct]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|403214155|emb|CCK68656.1| hypothetical protein KNAG_0B02140 [Kazachstania naganishii CBS
8797]
Length = 810
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 7/179 (3%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPP-CCL 457
CS+C+ E N + CD C + VH CYG + G WLC C G E P C +
Sbjct: 220 CSICNGVETTHNTIVF-CDCCNLAVHQDCYGVIFIPTGP-WLCRACLQGKFESKRPRCAV 277
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP VGGA+K +T G W H++CA+WI E C + EPI+G++R+ RW+L C +C
Sbjct: 278 CPEVGGALKQSTCGSWVHVSCAVWINELCFGNWHYAEPIEGIDRIPLSRWRLNCYLCKQR 337
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC N C VAYH CAR GL + L ++L+ + L SFC +H
Sbjct: 338 TGACIQCCNRNCFVAYHVSCARRVGLDMTPLVTGSLAEMALNNGERS----LESFCDRH 392
>gi|397486339|ref|XP_003814287.1| PREDICTED: peregrin isoform 2 [Pan paniscus]
gi|410266266|gb|JAA21099.1| bromodomain and PHD finger containing, 1 [Pan troglodytes]
gi|410292248|gb|JAA24724.1| bromodomain and PHD finger containing, 1 [Pan troglodytes]
gi|410335059|gb|JAA36476.1| bromodomain and PHD finger containing, 1 [Pan troglodytes]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|395824508|ref|XP_003785505.1| PREDICTED: peregrin [Otolemur garnettii]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|332816021|ref|XP_003309650.1| PREDICTED: LOW QUALITY PROTEIN: peregrin [Pan troglodytes]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|348556652|ref|XP_003464135.1| PREDICTED: peregrin-like isoform 1 [Cavia porcellus]
Length = 1213
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|344276039|ref|XP_003409817.1| PREDICTED: peregrin isoform 1 [Loxodonta africana]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|330906701|ref|XP_003295568.1| hypothetical protein PTT_01686 [Pyrenophora teres f. teres 0-1]
gi|311333048|gb|EFQ96340.1| hypothetical protein PTT_01686 [Pyrenophora teres f. teres 0-1]
Length = 1272
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 94/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ P DM+IEY GE VR +AD RE VG+ +Y+F
Sbjct: 1129 KKLVKFDRSAIHNWGLYAQEPIAVNDMIIEYVGEKVRQRVADLREAKYDMQGVGS-SYLF 1187
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID++ VIDAT+ G IA INHSC PNC +++I V+ + I+I+A RDI EELTYDY+
Sbjct: 1188 RIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRVDNTKRIVIYALRDINSDEELTYDYK 1247
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +++ C CG C+G +N
Sbjct: 1248 FEREMDATDRIPCLCGSIGCKGFLN 1272
>gi|291412482|ref|XP_002722500.1| PREDICTED: bromodomain and PHD finger-containing protein 1 isoform
1 [Oryctolagus cuniculus]
Length = 1219
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|190344535|gb|EDK36223.2| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
Length = 1055
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 912 KKPVTFARSAIHNWGLYALESIAAKEMIIEYVGESIRQQVAEHREKSYLKTGIGS-SYLF 970
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V G + I+I+A RDI+ EELTYDY+
Sbjct: 971 RIDENSVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYK 1030
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + DE++ C CG P C+G +N
Sbjct: 1031 FERETNDDERIRCLCGAPGCKGYLN 1055
>gi|149036900|gb|EDL91518.1| rCG56048 [Rattus norvegicus]
Length = 1212
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 445
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 446 HTPPGSAR 453
>gi|119584379|gb|EAW63975.1| bromodomain and PHD finger containing, 1, isoform CRA_d [Homo
sapiens]
Length = 1219
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|410951646|ref|XP_003982504.1| PREDICTED: peregrin isoform 2 [Felis catus]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|354468925|ref|XP_003496900.1| PREDICTED: peregrin-like [Cricetulus griseus]
Length = 1218
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 445
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 446 HTPPGSAR 453
>gi|348556654|ref|XP_003464136.1| PREDICTED: peregrin-like isoform 2 [Cavia porcellus]
Length = 1219
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|329663383|ref|NP_001193018.1| peregrin [Bos taurus]
gi|296475103|tpg|DAA17218.1| TPA: bromodomain and PHD finger containing, 1-like isoform 1 [Bos
taurus]
Length = 1219
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|148667032|gb|EDK99448.1| bromodomain and PHD finger containing, 1 [Mus musculus]
gi|187952873|gb|AAI38362.1| Brpf1 protein [Mus musculus]
Length = 1212
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 445
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 446 HTPPGSAR 453
>gi|149240463|ref|XP_001526107.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450230|gb|EDK44486.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 572
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 102/199 (51%), Gaps = 10/199 (5%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
+C+VC+ + N + CD C + VH CYG G WLC C + C
Sbjct: 44 RCAVCNDSDCDNTNAIVFCDGCDIAVHQECYGVAFIPEGS-WLCRKCMINKNKSVTECVF 102
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GA K + W+H+ CA+WI E + MEPI+G++ + K RWKL C IC
Sbjct: 103 CPSTTGAFKQLDNSLWSHVVCALWINELYFANPIYMEPIEGIDAIPKSRWKLACYICKQR 162
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQC+N C +AYH CA+ AGL +E+ + L + + L SFC+KH
Sbjct: 163 VGACIQCANRNCFLAYHVTCAKRAGLYMEMTQGMKGALTNK--------LTLKSFCEKHS 214
Query: 578 QPLNDRLAVDERLVQVTRR 596
PL D V + ++ TRR
Sbjct: 215 PPLFDTGGV-TKGIERTRR 232
>gi|71895457|ref|NP_001026635.1| peregrin [Gallus gallus]
gi|60098759|emb|CAH65210.1| hypothetical protein RCJMB04_8a18 [Gallus gallus]
Length = 1218
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 85/151 (56%), Gaps = 3/151 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 283 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 340
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 341 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 400
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC C A+H CA+ AGL +++E
Sbjct: 401 SGACIQCHKANCYTAFHVTCAQQAGLYMKME 431
>gi|426249196|ref|XP_004018336.1| PREDICTED: peregrin isoform 1 [Ovis aries]
Length = 1219
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|296475104|tpg|DAA17219.1| TPA: bromodomain and PHD finger containing, 1-like isoform 2 [Bos
taurus]
Length = 1213
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|403270348|ref|XP_003927148.1| PREDICTED: peregrin isoform 2 [Saimiri boliviensis boliviensis]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|344276041|ref|XP_003409818.1| PREDICTED: peregrin isoform 2 [Loxodonta africana]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|395516558|ref|XP_003762454.1| PREDICTED: peregrin [Sarcophilus harrisii]
Length = 1212
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 73/184 (39%), Positives = 95/184 (51%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 279 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 336
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 337 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 396
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 397 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 449
Query: 576 HKQP 579
H P
Sbjct: 450 HTPP 453
>gi|73984725|ref|XP_850615.1| PREDICTED: peregrin isoform 2 [Canis lupus familiaris]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|301779339|ref|XP_002925081.1| PREDICTED: peregrin-like isoform 2 [Ailuropoda melanoleuca]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|300797262|ref|NP_001178501.1| peregrin [Rattus norvegicus]
Length = 1246
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 445
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 446 HTPPGSAR 453
>gi|301779337|ref|XP_002925080.1| PREDICTED: peregrin-like isoform 1 [Ailuropoda melanoleuca]
gi|281354158|gb|EFB29742.1| hypothetical protein PANDA_014517 [Ailuropoda melanoleuca]
Length = 1220
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|432110870|gb|ELK34344.1| Peregrin [Myotis davidii]
Length = 1219
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|390346709|ref|XP_798723.3| PREDICTED: uncharacterized protein LOC594182 [Strongylocentrotus
purpuratus]
Length = 1216
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 71/179 (39%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG + G W+C C G P C LC
Sbjct: 175 CDVCRAPDSEEGNEMVFCDKCDICVHQACYGIVNVPEGS-WMCRTCALGIQ---PLCILC 230
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
+ GGAMK T G +W+H++CA+W+PE + V+RMEPI ++++ RW L+C C
Sbjct: 231 GIKGGAMKSTRSGTKWSHVSCALWVPEVSIGCVERMEPITKISQIPASRWALICVQCRER 290
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS TC++AYH C GL E + L DE+ +R S+C KH
Sbjct: 291 TGACIQCSVKTCKIAYHVTCGFENGL------EMKTYL------DEEADVRFRSYCSKH 337
>gi|338714466|ref|XP_003363085.1| PREDICTED: peregrin isoform 2 [Equus caballus]
Length = 1214
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|219519842|gb|AAI45257.1| Brpf1 protein [Mus musculus]
Length = 1218
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 445
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 446 HTPPGSAR 453
>gi|426257137|ref|XP_004022191.1| PREDICTED: protein Jade-3 [Ovis aries]
Length = 823
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 259 PKKGGAMKSTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCSLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LD+ DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHSLEMK---------TILDKGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|149728307|ref|XP_001495114.1| PREDICTED: peregrin isoform 1 [Equus caballus]
Length = 1220
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|378725927|gb|EHY52386.1| histone-lysine N-methyltransferase SETD1 [Exophiala dermatitidis
NIH/UT8656]
Length = 1277
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G+++ + +M+IEY GE +R IAD RE S +G+ +Y+F
Sbjct: 1134 KKPVKFARSAIHNWGLYSLERIQTSEMIIEYVGEKIRQEIADLREIKYTESGIGS-SYLF 1192
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ V+DAT+ G IA INHSC PNC +++I V G + I+I+A RDI++ EELTYDY+
Sbjct: 1193 RIDEGTVVDATKKGGIARFINHSCSPNCTAKIIRVGGTKRIVIYALRDIEKDEELTYDYK 1252
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F D+++ C CG C+G +N
Sbjct: 1253 FEREIDSDDRIPCLCGSAVCKGFLN 1277
>gi|444722826|gb|ELW63501.1| Peregrin [Tupaia chinensis]
Length = 1505
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 96/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG G WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEGQ-WLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFS-------VRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|119584378|gb|EAW63974.1| bromodomain and PHD finger containing, 1, isoform CRA_c [Homo
sapiens]
Length = 1247
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFS-------VRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|118404942|ref|NP_001072493.1| protein Jade-3 [Xenopus (Silurana) tropicalis]
gi|116248179|sp|Q0P4S5.1|JADE3_XENTR RecName: Full=Protein Jade-3; AltName: Full=PHD finger protein 16
gi|112419349|gb|AAI21925.1| hypothetical protein MGC146209 [Xenopus (Silurana) tropicalis]
Length = 817
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 73/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD+C + VH CYG L+ G WLC C G P C LC
Sbjct: 205 CDVCRSPDSEEGNDMVFCDRCNICVHQACYGILKVPEGS-WLCRTCVLGLH---PQCILC 260
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 261 PKTGGAMKATRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCSLCKLK 320
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 321 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 368
Query: 578 Q 578
+
Sbjct: 369 K 369
>gi|351710981|gb|EHB13900.1| Peregrin [Heterocephalus glaber]
Length = 1213
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|224071200|ref|XP_002193972.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Taeniopygia
guttata]
Length = 2004
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1872 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1930
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1931 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1990
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1991 PCLCGSENCRGTLN 2004
>gi|431899908|gb|ELK07855.1| Peregrin [Pteropus alecto]
Length = 1241
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|301778787|ref|XP_002924825.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1A-like [Ailuropoda melanoleuca]
Length = 1594
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 97/143 (67%), Gaps = 2/143 (1%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 1453 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 1511
Query: 755 RIDDERVIDATRAGSIAHLINHSC-EPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY 813
R+D + +IDAT+ G++A INH C +PNCY++VI++ + I+I++K+ I EE+TYDY
Sbjct: 1512 RVDHDTIIDATKCGNLARFINHCCMQPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDY 1571
Query: 814 RFFSIDEQLACYCGFPRCRGVVN 836
+F D ++ C CG CRG +N
Sbjct: 1572 KFPLEDNKIPCLCGTESCRGSLN 1594
>gi|71897211|ref|NP_001025832.1| histone-lysine N-methyltransferase SETD1B [Gallus gallus]
gi|82231199|sp|Q5F3P8.1|SET1B_CHICK RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|60098811|emb|CAH65236.1| hypothetical protein RCJMB04_10j6 [Gallus gallus]
Length = 2008
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1876 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1934
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1935 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1994
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1995 PCLCGSENCRGTLN 2008
>gi|149063329|gb|EDM13652.1| rCG21620 [Rattus norvegicus]
Length = 1091
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 959 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1017
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1018 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1077
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1078 PCLCGSENCRGTLN 1091
>gi|449474115|ref|XP_002186959.2| PREDICTED: peregrin [Taeniopygia guttata]
Length = 935
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 84/151 (55%), Gaps = 3/151 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG G WLC C +P C LC
Sbjct: 60 CCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEGQ-WLCRRCL-QSPSRAVDCALC 117
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 118 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 177
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC C A+H CA+ AGL +++E
Sbjct: 178 SGACIQCHKANCYTAFHVTCAQQAGLYMKME 208
>gi|190352|gb|AAB02119.1| Br140 [Homo sapiens]
Length = 1214
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLEVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|395527056|ref|XP_003765667.1| PREDICTED: protein Jade-3 [Sarcophilus harrisii]
Length = 858
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 236 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRTCVLGIH---PQCLLC 291
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 292 PKRGGAMKATRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWSLVCSLCKLK 351
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 352 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 399
Query: 578 Q 578
+
Sbjct: 400 K 400
>gi|291412486|ref|XP_002722502.1| PREDICTED: bromodomain and PHD finger-containing protein 1 isoform
3 [Oryctolagus cuniculus]
Length = 1245
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 447 HTPPGSAR 454
>gi|395542689|ref|XP_003773258.1| PREDICTED: protein Jade-1, partial [Sarcophilus harrisii]
Length = 704
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 96/179 (53%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 243 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 298
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 299 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALICSLCNEK 358
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 359 IGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 405
>gi|150866258|ref|XP_001385792.2| histone methyltransferase involved in gene regulation
[Scheffersomyces stipitis CBS 6054]
gi|149387514|gb|ABN67763.2| histone methyltransferase involved in gene regulation
[Scheffersomyces stipitis CBS 6054]
Length = 1055
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 63/145 (43%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K ++F +S IH +G++A P A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 912 KKPVSFARSAIHNWGLYALEPIAAKEMIIEYVGESIRQQVAEHREKSYLKTGIGS-SYLF 970
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDAT+ G IA INH C P+C +++I V+ + I+I+A RDI EELTYDY+
Sbjct: 971 RIDENTVIDATKKGGIARFINHCCSPSCTAKIIKVDNQKRIVIYALRDIDANEELTYDYK 1030
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + E++ C CG P C+G +N
Sbjct: 1031 FERETNDAERIRCLCGAPGCKGYLN 1055
>gi|242084688|ref|XP_002442769.1| hypothetical protein SORBIDRAFT_08g002530 [Sorghum bicolor]
gi|241943462|gb|EES16607.1| hypothetical protein SORBIDRAFT_08g002530 [Sorghum bicolor]
Length = 1033
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 103/156 (66%), Gaps = 5/156 (3%)
Query: 679 NFLSMADKYKHMKET----FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ A+ + +K T +KRL F +S IH +G+ A A D VIEY G+L+ +
Sbjct: 879 NLLAAAEGAELLKITQMKARKKRLRFQRSKIHEWGLVALELIEAEDFVIEYVGQLIHRRV 938
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
+D RE S +G+ +Y+FR+DD+ V+DAT+ G +A INHSCEPNCY++VI+V+G +
Sbjct: 939 SDIRESQYEKSGIGS-SYLFRLDDDFVVDATKRGGLARFINHSCEPNCYTKVITVDGQKK 997
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPR 830
I I+AKR I EE+TY+Y+F +E++ C+CG R
Sbjct: 998 IFIYAKRRIYAGEEITYNYKFPLEEEKIPCHCGSRR 1033
>gi|334327124|ref|XP_003340832.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like, partial [Monodelphis domestica]
Length = 1723
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1591 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1649
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1650 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIEDVKI 1709
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1710 PCLCGSENCRGTLN 1723
>gi|119618696|gb|EAW98290.1| hCG1812756 [Homo sapiens]
Length = 1048
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 916 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 974
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 975 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1034
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1035 PCLCGSENCRGTLN 1048
>gi|355718717|gb|AES06362.1| SET domain containing 1A [Mustela putorius furo]
Length = 296
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 62/139 (44%), Positives = 94/139 (67%), Gaps = 1/139 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K+L FG+S IH +G+FA P A +MVIEY G+ +R +AD RE +G+ +Y+F
Sbjct: 157 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGS-SYLF 215
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D + +IDAT+ G++A INH C PNCY++VI++ + I+I++K+ I EE+TYDY+
Sbjct: 216 RVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYK 275
Query: 815 FFSIDEQLACYCGFPRCRG 833
F D ++ C CG CRG
Sbjct: 276 FPLEDNKIPCLCGTESCRG 294
>gi|326436327|gb|EGD81897.1| hypothetical protein PTSG_11893 [Salpingoeca sp. ATCC 50818]
Length = 296
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 69/144 (47%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
RK L F +S IH +G+FA+ P ++VIEY GE+VR ++A+ RE +G+ +Y+F
Sbjct: 154 RKLLRFARSMIHEWGLFAQEPIDKDELVIEYVGEIVRQTVAEDRERRYARIGIGS-SYLF 212
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID++ VIDATR GSIA INHSC+ NCY++V+SV+G + I I++KR I EE+TYDY+
Sbjct: 213 RIDEDYVIDATRMGSIARFINHSCDANCYAQVVSVDGKKRIGIYSKRPIAANEEITYDYK 272
Query: 815 FFSID--EQLACYCGFPRCRGVVN 836
F + ++ C+CG CRG +N
Sbjct: 273 FPREEGPNKIPCFCGARTCRGTLN 296
>gi|74217519|dbj|BAC31528.2| unnamed protein product [Mus musculus]
Length = 807
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 84/151 (55%), Gaps = 3/151 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG G WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEG-QWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC C A+H CA+ AGL +++E
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKME 423
>gi|57111725|ref|XP_538010.1| PREDICTED: protein Jade-3 [Canis lupus familiaris]
Length = 823
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKRGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|345323698|ref|XP_003430739.1| PREDICTED: LOW QUALITY PROTEIN: peregrin-like [Ornithorhynchus
anatinus]
Length = 1158
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 73/184 (39%), Positives = 95/184 (51%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 243 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 300
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 301 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 360
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 361 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFS-------VRKTAYCDI 413
Query: 576 HKQP 579
H P
Sbjct: 414 HTPP 417
>gi|327266272|ref|XP_003217930.1| PREDICTED: peregrin-like [Anolis carolinensis]
Length = 1205
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 73/184 (39%), Positives = 95/184 (51%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 276 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 333
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 334 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 393
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 394 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 446
Query: 576 HKQP 579
H P
Sbjct: 447 HTPP 450
>gi|432856468|ref|XP_004068436.1| PREDICTED: protein Jade-3-like [Oryzias latipes]
Length = 895
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 75/183 (40%), Positives = 100/183 (54%), Gaps = 19/183 (10%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLCRPGAPEPPPPCCL 457
C VC + + N + CDKC + VH CYG ++ PV WLC C G P C L
Sbjct: 205 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGIVKVPVGN--WLCRTCVLGIL---PQCLL 259
Query: 458 CPVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV 516
CP GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 260 CPQKGGAMKATRAGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWSLICSLCKL 319
Query: 517 SYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQCS C + +H CA L ++ LDE DE ++ SFC KH
Sbjct: 320 KTGACIQCSVKNCTIPFHVTCAFEHSLEMK---------TILDEGDE---VKFKSFCLKH 367
Query: 577 KQP 579
+P
Sbjct: 368 SKP 370
>gi|348554403|ref|XP_003463015.1| PREDICTED: hypothetical protein LOC100714908 [Cavia porcellus]
Length = 1931
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1799 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1857
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1858 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIEDVKI 1917
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1918 PCLCGSENCRGTLN 1931
>gi|440904112|gb|ELR54672.1| Protein Jade-3 [Bos grunniens mutus]
Length = 823
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIY---PQCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 259 PKKGGAMKSTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCSLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LD+ DE ++ S+C KH
Sbjct: 319 TGACIQCSIKSCITAFHVTCAFEHSLEMK---------TILDKGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|410919935|ref|XP_003973439.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Takifugu rubripes]
Length = 1169
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 82/220 (37%), Positives = 113/220 (51%), Gaps = 21/220 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE +N+ L CD C + VH CYG + V WLC C +P P C LC
Sbjct: 237 CCVCLDDECLNSNVILFCDICNLAVHQECYG-VPYVPEGQWLCRCCL-QSPSRPVDCVLC 294
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EP++G+ + RWKL C +C
Sbjct: 295 PNRGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPVEGIKNIPPARWKLTCYLCKQKG 354
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL ++++ E +N + ++ +FC+
Sbjct: 355 RGASIQCHKANCYRAFHVTCAQKAGLFMKIDPVRETGVNGTTFS-------VKKTAFCEH 407
Query: 576 HKQPLNDR-----LAVDERLV----QVTRRCCDYIPPSNP 606
H + R +V+ RLV +R PPS+P
Sbjct: 408 HSPAGSHRDGSGDESVEGRLVGGRGNRGQRSYTQSPPSSP 447
>gi|432857915|ref|XP_004068789.1| PREDICTED: bromodomain and PHD finger-containing protein 3-like
[Oryzias latipes]
Length = 1229
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 105/195 (53%), Gaps = 15/195 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE +N+ L CD C + VH CYG + V WLC C +P P C LC
Sbjct: 253 CCVCLDDECLNSNVILFCDICNLAVHQECYG-VPYVPEGQWLCRCCL-QSPSRPVDCVLC 310
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EPI+G+ + RWKL C +C
Sbjct: 311 PNRGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPIEGVKNIPPARWKLTCYLCKQKG 370
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL ++++ E +N + ++ +FC +
Sbjct: 371 RGASIQCHKANCYRAFHVTCAQKAGLFMKIDPVRETGVNGTTFS-------VKKTAFC-E 422
Query: 576 HKQPLNDRL--AVDE 588
H P+ R +VDE
Sbjct: 423 HHSPVGSRRDGSVDE 437
>gi|169601854|ref|XP_001794349.1| hypothetical protein SNOG_03803 [Phaeosphaeria nodorum SN15]
gi|111067888|gb|EAT89008.1| hypothetical protein SNOG_03803 [Phaeosphaeria nodorum SN15]
Length = 1168
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 95/145 (65%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A+ A DM+IEY GE VR +AD RE VG+ +Y+F
Sbjct: 1025 KKLVKFDRSAIHNWGLYAQENIVANDMIIEYVGEKVRQRVADLREVRYDQQGVGS-SYLF 1083
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID++ VIDAT+ G IA INHSC PNC +++I V+ + I+I+A RDI Q EELTYDY+
Sbjct: 1084 RIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRVDNTKRIVIYALRDIGQDEELTYDYK 1143
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +++ C CG C+G +N
Sbjct: 1144 FEREMDATDRIPCLCGSVGCKGFLN 1168
>gi|441630858|ref|XP_003280765.2| PREDICTED: uncharacterized protein LOC100584028 [Nomascus leucogenys]
Length = 1863
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1731 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1789
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1790 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1849
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1850 PCLCGSENCRGTLN 1863
>gi|426374487|ref|XP_004054104.1| PREDICTED: uncharacterized protein LOC101124677 [Gorilla gorilla
gorilla]
Length = 1922
Score = 135 bits (341), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1790 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1848
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1849 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1908
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1909 PCLCGSENCRGTLN 1922
>gi|344250082|gb|EGW06186.1| Peregrin [Cricetulus griseus]
Length = 771
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 97/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 275 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 332
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 333 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 392
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 393 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFS-------VRKTAYCDI 445
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 446 HTPPGSAR 453
>gi|291237507|ref|XP_002738687.1| PREDICTED: bromodomain containing protein 1-like [Saccoglossus
kowalevskii]
Length = 1328
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 84/151 (55%), Gaps = 3/151 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 277 CCICSDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 334
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DG WAH+ CA+WIPE C + +EPID ++ + RWKL C IC
Sbjct: 335 PNKGGAFKQTDDGHWAHVVCALWIPEVCFANTVFLEPIDSIDHIPSARWKLTCYICKQRG 394
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC C A+H CA+ AGL +++E
Sbjct: 395 TGACIQCHKANCYTAFHVTCAQQAGLYMKME 425
>gi|443918070|gb|ELU38649.1| Setd1a protein [Rhizoctonia solani AG-1 IA]
Length = 425
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 66/166 (39%), Positives = 110/166 (66%), Gaps = 11/166 (6%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K+ ++ T +K+L F +S IH +G++A P AGDMVIEY GE++R +AD+RE + +
Sbjct: 206 KFNQLR-TRKKQLTFARSPIHDWGLYAAEPIPAGDMVIEYVGEVIRQQVADKREKYYEKT 264
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FR+DD+ V+DAT+ G++ LINH C PNC +++I++NG++ I+I+AK +I
Sbjct: 265 GIGS-SYLFRVDDDSVVDATKKGNLGRLINHCCAPNCTAKIITINGEKKIVIYAKTNIDV 323
Query: 806 WEELTYDYR--FFSIDEQLACYCGFPRCRGVVNDTEAEEQVAKLYA 849
+E+TY R F +D QL P C+ ++ + ++ KLY
Sbjct: 324 GDEITYGTRDVSFDLDRQL---IRTPTCQIII----SRSKIKKLYT 362
>gi|345321023|ref|XP_001506028.2| PREDICTED: hypothetical protein LOC100074411 [Ornithorhynchus
anatinus]
Length = 1258
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1126 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1184
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1185 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1244
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1245 PCLCGSENCRGTLN 1258
>gi|344297409|ref|XP_003420391.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Loxodonta africana]
Length = 1750
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1618 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1676
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1677 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1736
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1737 PCLCGSENCRGTLN 1750
>gi|410976579|ref|XP_003994695.1| PREDICTED: uncharacterized protein LOC101096419 [Felis catus]
Length = 1919
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1787 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1845
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1846 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1905
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1906 PCLCGSENCRGTLN 1919
>gi|301754587|ref|XP_002913168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Ailuropoda melanoleuca]
Length = 1805
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1673 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1731
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1732 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1791
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1792 PCLCGSENCRGTLN 1805
>gi|210032580|ref|NP_055863.1| histone-lysine N-methyltransferase SETD1B [Homo sapiens]
gi|166977692|sp|Q9UPS6.2|SET1B_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=Lysine N-methyltransferase 2G; AltName: Full=SET
domain-containing protein 1B; Short=hSET1B
Length = 1923
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1791 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1849
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1850 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1909
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1910 PCLCGSENCRGTLN 1923
>gi|366989279|ref|XP_003674407.1| hypothetical protein NCAS_0A14700 [Naumovozyma castellii CBS 4309]
gi|342300270|emb|CCC68028.1| hypothetical protein NCAS_0A14700 [Naumovozyma castellii CBS 4309]
Length = 989
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 97/179 (54%), Gaps = 6/179 (3%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYG-ELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
C++C+ E N + C+ C + VH CYG PV WLC C A C +
Sbjct: 248 CAICYGTESDDTNAIVFCEGCDIAVHQECYGIVFIPVGP--WLCRRCH-LATNYKINCLV 304
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GA K T G W H CA+WIPE ++ MEPI+G+ +SK RWKL+C IC
Sbjct: 305 CPSDTGAFKQTDTGVWIHSICALWIPELYFANLHYMEPIEGVANISKSRWKLVCYICKRK 364
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC++ C VAYH CAR AGL ++ + + L + ++ + +L SFC KH
Sbjct: 365 MGACIQCTHRNCFVAYHVTCARRAGLYLKWDKD--LTVGAVASNQVHLGNKLHSFCDKH 421
>gi|334192482|gb|AEG67286.1| histone-lysine N-methyltransferase [Homo sapiens]
Length = 1966
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1834 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1892
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1893 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1952
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1953 PCLCGSENCRGTLN 1966
>gi|410047437|ref|XP_003314036.2| PREDICTED: uncharacterized protein LOC473295, partial [Pan
troglodytes]
Length = 1955
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1823 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1881
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1882 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1941
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1942 PCLCGSENCRGTLN 1955
>gi|403281795|ref|XP_003932362.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Saimiri
boliviensis boliviensis]
Length = 1823
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1691 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1749
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1750 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1809
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1810 PCLCGSENCRGTLN 1823
>gi|444321116|ref|XP_004181214.1| hypothetical protein TBLA_0F01530 [Tetrapisispora blattae CBS 6284]
gi|387514258|emb|CCH61695.1| hypothetical protein TBLA_0F01530 [Tetrapisispora blattae CBS 6284]
Length = 790
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 65/149 (43%), Positives = 84/149 (56%), Gaps = 2/149 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C++C NN + CD C + VH CYG + G WLC C + C LC
Sbjct: 255 CAICDGAYSDNNNAIVFCDGCDIAVHQECYGIVFIPEGQ-WLCRKCL-FSKNMKVNCLLC 312
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GA K T G+WAH+ C++WIPE +V MEPI+GL ++K RWKL+C IC
Sbjct: 313 PSHTGAFKQTDVGKWAHVLCSLWIPELYFANVNYMEPIEGLEYIAKSRWKLVCYICEQRV 372
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVEL 547
GACIQCSN C +YH CA+ AGL ++
Sbjct: 373 GACIQCSNKNCFRSYHVTCAKRAGLYLKF 401
>gi|313220754|emb|CBY31596.1| unnamed protein product [Oikopleura dioica]
Length = 1087
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 70/180 (38%), Positives = 98/180 (54%), Gaps = 11/180 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC+ E N L CD C + VH CYG + + WLC C+ +P P C LC
Sbjct: 318 CCVCNDGECTNTNAILFCDLCNLAVHQECYG-VPYIPEGQWLCRRCQ-FSPSRPVDCVLC 375
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P + GA K T D RW H+ CA+W+PE + +EPIDG ++V K RW L C IC +
Sbjct: 376 PSLNGAFKQTHDNRWCHVVCALWVPEVSFQNPVFLEPIDGFSKVPKARWNLKCYICKKA- 434
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVEL--EDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC+ C A+H CA+ AGL +E+ + D +N ++ ++ + +FC H
Sbjct: 435 GACIQCNKNACYTAFHVTCAQQAGLYMEMKVQKSDGINGITSNK------VTQTAFCHNH 488
>gi|348523507|ref|XP_003449265.1| PREDICTED: bromodomain-containing protein 1-like [Oreochromis
niloticus]
Length = 1058
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 84/151 (55%), Gaps = 3/151 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C + +N+ L CD C + VH CYG G WLC C P P C C
Sbjct: 218 CCICMDGDGADSNVILFCDSCNIAVHQECYGVPYIPEG-QWLCRHCL-QCPTRPAECVFC 275
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA+K T D RW H+ACA+W+PE +D +EPIDG+ + RWKL C +C
Sbjct: 276 PNRGGALKKTDDDRWGHVACALWVPEVGFSDTVFIEPIDGVRNIPPARWKLTCYLCKEKG 335
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC C A+H CA+ AGLC+++E
Sbjct: 336 VGACIQCDKINCYTAFHVSCAQKAGLCMKME 366
>gi|119923239|ref|XP_583146.3| PREDICTED: protein Jade-3 [Bos taurus]
gi|297493079|ref|XP_002700112.1| PREDICTED: protein Jade-3 [Bos taurus]
gi|296470791|tpg|DAA12906.1| TPA: PHD finger protein 16 [Bos taurus]
Length = 785
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 165 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIY---PQCLLC 220
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 221 PKKGGAMKSTKTGTKWAHVSCALWIPEVSIACPERMEPITKISHIPPSRWALVCSLCKLK 280
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LD+ DE ++ S+C KH
Sbjct: 281 TGACIQCSIKSCITAFHVTCAFEHSLEMK---------TILDKGDE---VKFKSYCLKHS 328
Query: 578 Q 578
Q
Sbjct: 329 Q 329
>gi|392352531|ref|XP_003751234.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Rattus
norvegicus]
Length = 1900
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1768 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1826
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1827 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1886
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1887 PCLCGSENCRGTLN 1900
>gi|351698529|gb|EHB01448.1| Histone-lysine N-methyltransferase SETD1B [Heterocephalus glaber]
Length = 1486
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1354 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1412
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1413 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1472
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1473 PCLCGSENCRGTLN 1486
>gi|326913705|ref|XP_003203175.1| PREDICTED: protein Jade-3-like [Meleagris gallopavo]
Length = 812
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 78/198 (39%), Positives = 105/198 (53%), Gaps = 18/198 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 195 CDVCRSPDSEDGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRTCVLGIH---PQCLLC 250
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 251 PKRGGAMKATRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCSLCKLK 310
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LD+ DE ++ S+C KH
Sbjct: 311 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDDGDE---VKFKSYCLKHS 358
Query: 578 QPLNDRLA-VDERLVQVT 594
+ + L VDE +T
Sbjct: 359 KNKQNSLPDVDEHPKSIT 376
>gi|320166419|gb|EFW43318.1| mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864]
Length = 1858
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 98/156 (62%), Gaps = 2/156 (1%)
Query: 681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREH 740
+ MA +Y+ +++ +R +S IHG G++A MV+EY GE +R + D RE
Sbjct: 1705 MPMAMQYRVLRDNYRTYSVVRRSPIHGCGLYAARRLEKDSMVVEYMGERIRDILTDYRER 1764
Query: 741 FIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAK 800
+Y++ G G YMFRIDD+ +IDAT + A +NHSCEPNCY+R+++ +G + II FA
Sbjct: 1765 -MYDAR-GIGCYMFRIDDDYIIDATMKANQARFMNHSCEPNCYTRIVNPDGVKRIIYFAS 1822
Query: 801 RDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN 836
R + + EELT DY+ D ++ CYCG CRG +N
Sbjct: 1823 RVVLEGEELTVDYKMPIEDVKIPCYCGTRSCRGSMN 1858
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 54/117 (46%), Gaps = 15/117 (12%)
Query: 446 PGAPEPPPPCCLCPVVGGAMKPTTDGR--------WAHLACAIWIPE-TCLTDVKRMEPI 496
P A E P C LC V G + P GR WAHL CAIW E +CL D +
Sbjct: 1129 PTAVEDPRHCSLCSVAGDS-PPEESGRLLSVGDLGWAHLNCAIWSSEVSCLDDGH----L 1183
Query: 497 DGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRL 553
DG+ + C CG + GA I C+ C++ YH CAR A CV L + L
Sbjct: 1184 DGVTAALSRSRAMKCHHCGKT-GATIGCAKPRCQLNYHFPCARHARGCVLLTSKTLL 1239
>gi|211830050|gb|AAH38367.2| Setd1b protein [Mus musculus]
Length = 1103
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 971 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1029
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1030 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1089
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1090 PCLCGSENCRGTLN 1103
>gi|413916020|gb|AFW55952.1| putative SET-domain containing protein family [Zea mays]
Length = 710
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 103/156 (66%), Gaps = 5/156 (3%)
Query: 679 NFLSMADKYKHMKET----FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSI 734
N L+ A+ + +K T +KRL F +S IH +G+ A A D VIEY G+L+ +
Sbjct: 556 NLLAAAEGSELLKITQMKARKKRLRFQRSKIHDWGLVALESIEAEDFVIEYVGQLIHRRV 615
Query: 735 ADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEH 794
+D RE S +G+ +Y+FR+DD+ V+DAT+ G +A INHSCEPNCY++VI+V+G +
Sbjct: 616 SDIRESQYEKSGIGS-SYLFRLDDDFVVDATKRGGLARFINHSCEPNCYTKVITVDGQKK 674
Query: 795 IIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPR 830
I I+AKR I EE+TY+Y+F ++++ C+CG R
Sbjct: 675 IFIYAKRRIYAGEEITYNYKFPLEEKKIPCHCGSRR 710
>gi|311276207|ref|XP_003135098.1| PREDICTED: protein Jade-3 [Sus scrofa]
Length = 841
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 221 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 276
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 277 PKKGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCNLCKLK 336
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 337 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 384
Query: 578 Q 578
Q
Sbjct: 385 Q 385
>gi|399218649|emb|CCF75536.1| unnamed protein product [Babesia microti strain RI]
Length = 2180
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 75/162 (46%), Positives = 107/162 (66%), Gaps = 7/162 (4%)
Query: 681 LSMADKYKHMKETFR-KRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE 739
LS + Y++++ + RL S IHG G+FA ++G+ VIEY GEL+R I+D RE
Sbjct: 2020 LSSSMLYRYIQSLSQPSRLRVRGSPIHGIGLFANQLIQSGEPVIEYIGELIRDVISDVRE 2079
Query: 740 HFIYNSLVG--AGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISV-NGDEHII 796
F YN G YMF+++++ V+DATR G++A INHSCEPNC R+I+ +G +HI+
Sbjct: 2080 KF-YNDKWGIDGSCYMFKLNEQLVVDATRMGNMARFINHSCEPNCICRIITCESGLKHIV 2138
Query: 797 IFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
IFAKRDI EE+TY+Y+F E +L+C+CG P C G +N
Sbjct: 2139 IFAKRDINPGEEITYNYQFGIEAEAKKLSCHCGAPSCLGRMN 2180
>gi|348553549|ref|XP_003462589.1| PREDICTED: LOW QUALITY PROTEIN: protein Jade-3-like [Cavia
porcellus]
Length = 822
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 72/181 (39%), Positives = 98/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C P C LC
Sbjct: 203 CDVCQSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLSIQ---PQCVLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK G +W H++CA+WIPE + +RMEP+ L+ + RW L+CS+C +
Sbjct: 259 PKKGGAMKTNRTGTKWVHVSCALWIPEVSIACPERMEPVTKLSHIPPSRWALVCSLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C +A+H CA L ++ LDE+DE ++ S+C KH
Sbjct: 319 TGACIQCSVKSCIIAFHVTCAFEHNLEMK---------TILDEEDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|340959767|gb|EGS20948.1| hypothetical protein CTHT_0027870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1295
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/145 (45%), Positives = 93/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A +M+IEY GE VR IA+ RE S +G+ +Y+F
Sbjct: 1152 KKPVKFARSAIHNWGLYAMENIPKDEMIIEYVGEEVRQQIAELRESRYLKSGIGS-SYLF 1210
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RIDD VIDAT+ G IA INHSC PNC +++I V G + I+I+A RDI + EELTYDY+
Sbjct: 1211 RIDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAKNEELTYDYK 1270
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +++ C CG C+G +N
Sbjct: 1271 FERELGSADRIPCLCGTAACKGFLN 1295
>gi|402887949|ref|XP_003907341.1| PREDICTED: uncharacterized protein LOC101023789 [Papio anubis]
Length = 1927
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1795 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1853
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1854 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1913
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1914 PCLCGSENCRGTLN 1927
>gi|396578140|ref|NP_001035488.2| histone-lysine N-methyltransferase SETD1B [Mus musculus]
gi|166977693|sp|Q8CFT2.2|SET1B_MOUSE RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
Length = 1985
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1853 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1911
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1912 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1971
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1972 PCLCGSENCRGTLN 1985
>gi|301784188|ref|XP_002927508.1| PREDICTED: protein Jade-3-like [Ailuropoda melanoleuca]
Length = 825
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKRGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|392332670|ref|XP_003752655.1| PREDICTED: uncharacterized protein LOC100359816 [Rattus norvegicus]
Length = 2265
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 2133 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 2191
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 2192 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 2251
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 2252 PCLCGSENCRGTLN 2265
>gi|344292603|ref|XP_003418015.1| PREDICTED: LOW QUALITY PROTEIN: protein Jade-3-like [Loxodonta
africana]
Length = 823
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNVCVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKRGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCNLCKLK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 319 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|241998630|ref|XP_002433958.1| jade1l protein, putative [Ixodes scapularis]
gi|215495717|gb|EEC05358.1| jade1l protein, putative [Ixodes scapularis]
Length = 483
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 105/191 (54%), Gaps = 15/191 (7%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CD+C + VH CYG G W+C RP A PPC LC
Sbjct: 181 CDVCRSPDSEEGNEMVFCDQCDLCVHQACYGITRIPEGS-WVC---RPCALGIRPPCALC 236
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+W+PE + V++MEPI ++ + RW L C +C
Sbjct: 237 PARGGAMKSTRSGSKWAHVSCALWVPEVSIGCVEKMEPITKISEIPASRWALTCCLCRER 296
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS C+ AYH CA E+ L + ++ +++ + ++L SFC KH
Sbjct: 297 MGACIQCSVKACKRAYHVTCAF----------ENSLEMKAIIDENPEDGVKLRSFCPKHS 346
Query: 578 QPLNDRLAVDE 588
+ + + A+ +
Sbjct: 347 KKTHRKEALSD 357
>gi|323348281|gb|EGA82530.1| Set1p [Saccharomyces cerevisiae Lalvin QA23]
Length = 980
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 93/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 837 KKPVMFARSAIHNWGLYALDSIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGS-SYLF 895
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C PNC +++I V G I+I+A RDI EELTYDY+
Sbjct: 896 RVDENTVIDATKKGGIARFINHCCNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYK 955
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 956 FEREKDDEERLPCLCGAPNCKGFLN 980
>gi|5689489|dbj|BAA83028.1| KIAA1076 protein [Homo sapiens]
Length = 804
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 672 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 730
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 731 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 790
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 791 PCLCGSENCRGTLN 804
>gi|172087406|ref|XP_001913245.1| Brpf1 protein-like protein [Oikopleura dioica]
gi|42601372|gb|AAS21398.1| Brpf1 protein-like protein [Oikopleura dioica]
Length = 1062
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/180 (38%), Positives = 98/180 (54%), Gaps = 11/180 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC+ E N L CD C + VH CYG + + WLC C+ +P P C LC
Sbjct: 318 CCVCNDGECTNTNAILFCDLCNLAVHQECYG-VPYIPEGQWLCRRCQ-FSPSRPVDCVLC 375
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P + GA K T D RW H+ CA+W+PE + +EPIDG ++V K RW L C IC +
Sbjct: 376 PSLNGAFKQTHDNRWCHVVCALWVPEVSFQNPVFLEPIDGFSKVPKARWNLKCYICKKA- 434
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVEL--EDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC+ C A+H CA+ AGL +E+ + D +N ++ ++ + +FC H
Sbjct: 435 GACIQCNKNACYTAFHVTCAQQAGLYMEMKVQKSDGINGITSNK------VTQTAFCHNH 488
>gi|47219426|emb|CAG10790.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1776
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/236 (38%), Positives = 125/236 (52%), Gaps = 38/236 (16%)
Query: 603 PSNPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSI 662
P NP CARSE Y+ R+ + A+ + L P L G + + S
Sbjct: 1577 PINPKRCARSEIYS---RKATFDMFNFLASQHREL-----PDLTGPFEEEDDEFPLKSSR 1628
Query: 663 RVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMV 722
R S+ L MA +++H+++ ++ + +S IHG G+F K AG+MV
Sbjct: 1629 RATSSE------------LPMAMRFRHLEKISKEAVGVYRSDIHGRGLFCKRNIEAGEMV 1676
Query: 723 IEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNC 782
IEY G ++R + D+R+ F Y+S G G YMFRIDD V+DAT G+ A INHSCEPNC
Sbjct: 1677 IEYAGTVIRAVLTDKRQKF-YDS-KGIGCYMFRIDDFDVVDATMQGNAARFINHSCEPNC 1734
Query: 783 YSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
YSRVI +I + EELTYDY+F D++ L C CG RCRG +N
Sbjct: 1735 YSRVI--------------NIYRGEELTYDYKFPIEDDESKLHCNCGTRRCRGFLN 1776
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 70/187 (37%), Gaps = 49/187 (26%)
Query: 399 CSVCH--MDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNL------------- 443
C++CH D+ ++ +QC C +H C G+ +C+L
Sbjct: 306 CTICHKCYDDNMRHAEMIQCSACNHWIHYSC-------EGISVICSLRQSYRPLKNTAMH 358
Query: 444 ------CRPGAPEP------PPPCCL---------CPVVGGAMKPTTDGRWAHLACAIWI 482
RP +PE P C C V G + WAH+ C++W
Sbjct: 359 SGWRGATRPESPEAHRWSTLTVPSCFSQSSHNLMQCHVDAGRLLYMGQNEWAHVNCSLWS 418
Query: 483 PETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAG 542
E + ++ ++R L C CG S GA + C TC+ +H +CAR
Sbjct: 419 AEVYEENGALLQVHSAVSRGR----HLRCDHCGQS-GATVGCCLATCQSNFHFMCARVQN 473
Query: 543 LCVELED 549
CV +D
Sbjct: 474 -CVFQQD 479
>gi|151944065|gb|EDN62358.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 1080
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 94/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 937 KKPVMFARSAIHNWGLYALDSIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGS-SYLF 995
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+PNC +++I V G I+I+A RDI EELTYDY+
Sbjct: 996 RVDENTVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYK 1055
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 1056 FEREKDDEERLPCLCGAPNCKGFLN 1080
>gi|6321911|ref|NP_011987.1| Set1p [Saccharomyces cerevisiae S288c]
gi|731707|sp|P38827.1|SET1_YEAST RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=Lysine N-methyltransferase 2; AltName: Full=SET
domain-containing protein 1
gi|529135|gb|AAB68867.1| Set1p [Saccharomyces cerevisiae]
gi|190405898|gb|EDV09165.1| histone-lysine N-methyltransferase [Saccharomyces cerevisiae RM11-1a]
gi|285810026|tpg|DAA06813.1| TPA: Set1p [Saccharomyces cerevisiae S288c]
gi|392298926|gb|EIW10021.1| Set1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1080
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 94/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 937 KKPVMFARSAIHNWGLYALDSIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGS-SYLF 995
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+PNC +++I V G I+I+A RDI EELTYDY+
Sbjct: 996 RVDENTVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYK 1055
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 1056 FEREKDDEERLPCLCGAPNCKGFLN 1080
>gi|256271664|gb|EEU06704.1| Set1p [Saccharomyces cerevisiae JAY291]
Length = 1080
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 94/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 937 KKPVMFARSAIHNWGLYALDSIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGS-SYLF 995
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+PNC +++I V G I+I+A RDI EELTYDY+
Sbjct: 996 RVDENTVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYK 1055
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 1056 FEREKDDEERLPCLCGAPNCKGFLN 1080
>gi|349578671|dbj|GAA23836.1| K7_Set1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1080
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 94/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 937 KKPVMFARSAIHNWGLYALDSIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGS-SYLF 995
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+PNC +++I V G I+I+A RDI EELTYDY+
Sbjct: 996 RVDENTVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYK 1055
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 1056 FEREKDDEERLPCLCGAPNCKGFLN 1080
>gi|22760789|dbj|BAC11335.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 97/179 (54%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 36 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 91
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 92 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 151
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
+GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 152 FGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 198
>gi|302757968|ref|XP_002962407.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
gi|300169268|gb|EFJ35870.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
Length = 1326
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 89/142 (62%), Gaps = 1/142 (0%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K L F +S IH +G+ A D ++EY GE++RP +AD RE +G+ +Y F
Sbjct: 1186 KKLLKFQRSKIHAWGVVAMEVIEPEDFIVEYVGEVLRPKVADVREVRYLRQGLGS-SYFF 1244
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+ D VIDAT+ G + INHSCEPNCY ++I+V G + + I+A+ I ELTYDY+
Sbjct: 1245 RVGDGFVIDATQRGGLGRFINHSCEPNCYPKIITVEGQKRVFIYARTHIAPGTELTYDYK 1304
Query: 815 FFSIDEQLACYCGFPRCRGVVN 836
F D+++ C CG RCRG +N
Sbjct: 1305 FPHEDQKIPCLCGAERCRGFLN 1326
>gi|449298368|gb|EMC94383.1| hypothetical protein BAUCODRAFT_157996 [Baudoinia compniacensis
UAMH 10762]
Length = 1200
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 102/202 (50%), Gaps = 18/202 (8%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP---GAPEPPPP 454
KCS+C + N + CD C + VH CYG + + W C C+ G P
Sbjct: 434 KCSICDDGDCENANAIIFCDGCDLAVHQECYG-VPFIPEGQWFCRKCKEIGRGTPT---- 488
Query: 455 CCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSIC 514
C CP V GA K T RW+HL CAIWIPE + ++ MEPI +++V K RWKL C IC
Sbjct: 489 CIFCPNVDGAFKQTNTLRWSHLLCAIWIPEVTIANMTFMEPITDVDKVPKGRWKLNCYIC 548
Query: 515 GVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCK 574
GACIQC N C +A+H CAR A L ++++ + + + L +FC
Sbjct: 549 NQRMGACIQCGNKNCYLAFHVTCARRAKLFLKMKSQHQGGI---------DTTALKAFCD 599
Query: 575 KHKQPLNDRLA-VDERLVQVTR 595
KH P RL D +++ R
Sbjct: 600 KHVPPDWRRLHDTDHAIIEAKR 621
>gi|395513793|ref|XP_003761107.1| PREDICTED: uncharacterized protein LOC100928096 [Sarcophilus
harrisii]
Length = 1224
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1092 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1150
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1151 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 1210
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1211 PCLCGSENCRGTLN 1224
>gi|328705734|ref|XP_001948004.2| PREDICTED: hypothetical protein LOC100165704 [Acyrthosiphon pisum]
Length = 2352
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 72/183 (39%), Positives = 96/183 (52%), Gaps = 22/183 (12%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLC----RPGAPEPPPP 454
C VC + N + CD C + VH CYG + + WLC C RP
Sbjct: 249 CDVCRSPDSEDGNEMVFCDNCNICVHQACYG-ITTIPSGSWLCRTCTLRFRPE------- 300
Query: 455 CCLCPVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSI 513
C LCP GAMK T G +WAH++CA+WIPE + V++MEPI ++ + RW L+C +
Sbjct: 301 CVLCPNKNGAMKCTRSGHKWAHVSCALWIPEVSIGCVEKMEPITKISSIPPSRWALICVL 360
Query: 514 CGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFC 573
C GACIQCS TC+ AYH CA GL + + DE ED ++L S+C
Sbjct: 361 CRERVGACIQCSVKTCKTAYHVTCAFKHGL--------EMRAIIEDESAED-GVKLRSYC 411
Query: 574 KKH 576
+KH
Sbjct: 412 QKH 414
>gi|28972602|dbj|BAC65717.1| mKIAA1076 protein [Mus musculus]
Length = 855
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 723 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 781
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 782 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 841
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 842 PCLCGSENCRGTLN 855
>gi|334346973|ref|XP_001365577.2| PREDICTED: protein Jade-3 [Monodelphis domestica]
Length = 806
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 184 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRTCVLGIH---PQCLLC 239
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 240 PKRGGAMKATRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWSLVCSLCKLK 299
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LD+ DE ++ S+C KH
Sbjct: 300 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDDGDE---VKFKSYCLKHS 347
Query: 578 Q 578
+
Sbjct: 348 K 348
>gi|27371314|gb|AAH41681.1| Setd1b protein, partial [Mus musculus]
Length = 917
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 785 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 843
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 844 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 903
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 904 PCLCGSENCRGTLN 917
>gi|26251880|gb|AAH40775.1| Setd1b protein, partial [Mus musculus]
Length = 911
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 779 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 837
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 838 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 897
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 898 PCLCGSENCRGTLN 911
>gi|320580861|gb|EFW95083.1| histone-lysine n-methyltransferase, h3 lysine-4 specific, putative
[Ogataea parapolymorpha DL-1]
Length = 658
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 96/145 (66%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A P A +M+IEY GE +R +A+ RE S +G+ +Y+F
Sbjct: 515 KKPVQFARSAIHNWGLYALEPIAAKEMIIEYVGERIRQQVAEVREKKYLRSGIGS-SYLF 573
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
RID+ VIDA++ G IA INH C P+C +++I V G + I+I+A RDI EELTYDY+
Sbjct: 574 RIDENTVIDASKKGGIARFINHCCVPSCTAKIIKVEGKKRIVIYALRDIAANEELTYDYK 633
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F + +E++ C CG P C+G +N
Sbjct: 634 FERETNDEERIPCLCGAPGCKGYLN 658
>gi|281340768|gb|EFB16352.1| hypothetical protein PANDA_017294 [Ailuropoda melanoleuca]
Length = 785
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 99/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 163 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRSCVLGIH---PQCLLC 218
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 219 PKRGGAMKTTRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCNLCKLK 278
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LDE DE ++ S+C KH
Sbjct: 279 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDEGDE---VKFKSYCLKHS 326
Query: 578 Q 578
Q
Sbjct: 327 Q 327
>gi|449275739|gb|EMC84507.1| Protein Jade-3 [Columba livia]
Length = 820
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/198 (39%), Positives = 105/198 (53%), Gaps = 18/198 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 205 CDVCRSPDSEDGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRTCVLGIH---PQCLLC 260
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 261 PKRGGAMKATRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCSLCKLK 320
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LD+ DE ++ S+C KH
Sbjct: 321 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDDGDE---VKFKSYCLKHS 368
Query: 578 QPLNDRLA-VDERLVQVT 594
+ + L VDE V+
Sbjct: 369 KNKQNSLPDVDEHPKSVS 386
>gi|251823738|ref|NP_001156554.1| bromodomain and PHD finger containing, 1 [Oryzias latipes]
gi|237823392|dbj|BAH59433.1| bromodomain and PHD finger containing, 1 [Oryzias latipes]
Length = 1283
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 96/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 322 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 379
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 380 PNKGGAFKQTDDSRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 439
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 440 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 492
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 493 HTPPGSAR 500
>gi|297263735|ref|XP_002808043.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Macaca mulatta]
Length = 2216
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 2084 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 2142
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 2143 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 2202
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 2203 PCLCGSENCRGTLN 2216
>gi|444724926|gb|ELW65512.1| Histone-lysine N-methyltransferase SETD1B [Tupaia chinensis]
Length = 1554
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1422 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1480
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 1481 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIEDIKI 1540
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1541 PCLCGSENCRGTLN 1554
>gi|410899921|ref|XP_003963445.1| PREDICTED: peregrin-like isoform 1 [Takifugu rubripes]
Length = 1277
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 95/188 (50%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG G WLC C +P C LC
Sbjct: 322 CCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEG-QWLCRRCL-QSPSRAVDCALC 379
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 380 PNKGGAFKQTDDARWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 439
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 440 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 492
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 493 HTPPGSAR 500
>gi|355786615|gb|EHH66798.1| hypothetical protein EGM_03852, partial [Macaca fascicularis]
Length = 673
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 541 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 599
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 600 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 659
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 660 PCLCGSENCRGTLN 673
>gi|259146872|emb|CAY80128.1| Set1p [Saccharomyces cerevisiae EC1118]
Length = 1080
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 93/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 937 KKPVMFARSAIHNWGLYALDSIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGS-SYLF 995
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C PNC +++I V G I+I+A RDI EELTYDY+
Sbjct: 996 RVDENTVIDATKKGGIARFINHCCNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYK 1055
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 1056 FEREKDDEERLPCLCGAPNCKGFLN 1080
>gi|398394325|ref|XP_003850621.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
gi|339470500|gb|EGP85597.1| histone methyltransferase [Zymoseptoria tritici IPO323]
Length = 1163
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 85/257 (33%), Positives = 135/257 (52%), Gaps = 25/257 (9%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NP+GCAR+E P+ +R R+E +A A ++ +
Sbjct: 912 YVP--NPTGCARTEGFKKILNEEKSKYLPHRIKVQRAREERQAQATSNPTAAAEAAKIKE 969
Query: 646 VGGYCQNGLS-GNTLPSIRVIGS-KFSFSLHRDAPNFLSMADKYKHMKETFRKRLA-FGK 702
S N + + R++ A + +A ++ +K+ RK+L F +
Sbjct: 970 AEKIASTATSRSNRVNNRRLVNDINLQKQTLATANSDADVAIRFNQLKK--RKKLVKFDR 1027
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IHG+G++A+ D++IEY GE VR IAD RE VG+ +Y+FR+ D+ ++
Sbjct: 1028 SAIHGWGLYAEENIAVNDLIIEYVGEKVRQKIADLREIRYEKQGVGS-SYLFRMIDDEIV 1086
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF---FSID 819
DAT+ G IA INHSC PNC +++I V G I+I+A +DI + +ELTYDY+F
Sbjct: 1087 DATKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDYKFEREMDST 1146
Query: 820 EQLACYCGFPRCRGVVN 836
+++ C CG C+G +N
Sbjct: 1147 DRIPCLCGSANCKGFLN 1163
>gi|348502975|ref|XP_003439042.1| PREDICTED: peregrin isoform 2 [Oreochromis niloticus]
Length = 1286
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 96/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 324 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 381
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 382 PNKGGAFKQTDDARWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 441
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 442 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 494
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 495 HTPPGSAR 502
>gi|410899923|ref|XP_003963446.1| PREDICTED: peregrin-like isoform 2 [Takifugu rubripes]
Length = 1282
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 95/188 (50%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG G WLC C +P C LC
Sbjct: 322 CCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEG-QWLCRRCL-QSPSRAVDCALC 379
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 380 PNKGGAFKQTDDARWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 439
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 440 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 492
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 493 HTPPGSAR 500
>gi|417402077|gb|JAA47897.1| Putative phd finger protein [Desmodus rotundus]
Length = 509
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 96/179 (53%), Gaps = 17/179 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 206 CDVCQSPDGEDGNEMVFCDKCNICVHQACYGILKVPEGS-WLCRTCALGVQ---PKCLLC 261
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMKPT G +W H++CA+WIPE + ++MEPI ++ + RW L+CS+C
Sbjct: 262 PKKGGAMKPTRSGTKWVHVSCALWIPEVSIGSPEKMEPITKVSHIPSSRWALVCSLCNEK 321
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GA IQCS CR A+H CA GL ++ L E+DE ++ S+C KH
Sbjct: 322 CGASIQCSVKNCRTAFHVTCAFDRGLEMK---------TILAENDE---VKFKSYCPKH 368
>gi|432942400|ref|XP_004083000.1| PREDICTED: bromodomain-containing protein 1-like [Oryzias latipes]
Length = 1237
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/152 (45%), Positives = 88/152 (57%), Gaps = 5/152 (3%)
Query: 399 CSVCHMDEEYQN-NLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
CS+C MDEE N N L CD C + VH CYG G W C C +P P C L
Sbjct: 225 CSIC-MDEECHNSNAILFCDMCNVAVHQECYGVPYIPEG-QWHCRHCL-QSPTQPAGCIL 281
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GGA+K T D RW H+ CA+W+PE ++ +EPIDG++++ RWKL C IC
Sbjct: 282 CPNKGGAVKKTDDERWGHVVCALWVPEVGFSNTTFIEPIDGVSQIPPARWKLTCYICKEK 341
Query: 518 -YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC C A+H CA+ AGL +++E
Sbjct: 342 GVGACIQCHKANCYTAFHVTCAQKAGLFMKME 373
>gi|348502973|ref|XP_003439041.1| PREDICTED: peregrin isoform 1 [Oreochromis niloticus]
Length = 1281
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 95/188 (50%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG G WLC C +P C LC
Sbjct: 324 CCICNDGECQNSNVILFCDMCNLAVHQECYGVPYIPEG-QWLCRRCL-QSPSRAVDCALC 381
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 382 PNKGGAFKQTDDARWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 441
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 442 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 494
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 495 HTPPGSAR 502
>gi|157109807|ref|XP_001650833.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108878935|gb|EAT43160.1| AAEL005380-PA [Aedes aegypti]
Length = 2874
Score = 134 bits (338), Expect = 2e-28, Method: Composition-based stats.
Identities = 66/157 (42%), Positives = 95/157 (60%), Gaps = 4/157 (2%)
Query: 682 SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHF 741
S + +YK MK +R + +S I G G++A MVIEY GE++R +++ RE
Sbjct: 2720 SKSSQYKKMKLEWRNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRTEVSELREK- 2778
Query: 742 IYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKR 801
G YMFR+D++RV+DAT +G +A INHSC PNC + + V D IIIFAKR
Sbjct: 2779 -QYEARNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNCVTETVEVERDLRIIIFAKR 2837
Query: 802 DIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
I + EEL+YDY+F D+ +++C CG P C+ +N
Sbjct: 2838 RINRGEELSYDYKFDIEDDAHKISCMCGAPNCKKWMN 2874
>gi|47223204|emb|CAG11339.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1293
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 96/188 (51%), Gaps = 12/188 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 320 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 377
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 378 PNKGGAFKQTDDARWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 437
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 438 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSF-------SVRKTAYCDI 490
Query: 576 HKQPLNDR 583
H P + R
Sbjct: 491 HTPPGSAR 498
>gi|358058803|dbj|GAA95766.1| hypothetical protein E5Q_02423 [Mixia osmundae IAM 14324]
Length = 2083
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 97/146 (66%), Gaps = 4/146 (2%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K+ +K + +K+L F +S IH +G++A A DM+IEY GEL+R +AD+RE Y
Sbjct: 1379 KFNQLK-SRKKQLKFARSPIHDWGLYAMEHIPARDMIIEYVGELIRQQVADKREK-AYEK 1436
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+ +Y+FR+DD+ V+DAT+ G+ A LINH C PNC +R+I++ G + I+I+A DI+
Sbjct: 1437 MGIGSSYLFRVDDDLVVDATKKGTYARLINHCCAPNCTARIITIGGHKKIVIYALTDIEP 1496
Query: 806 WEELTYDYRFF--SIDEQLACYCGFP 829
+E+TYDY F S D ++ C CG P
Sbjct: 1497 GDEITYDYHFATESDDLKIPCLCGSP 1522
>gi|355564772|gb|EHH21272.1| hypothetical protein EGK_04290, partial [Macaca mulatta]
Length = 663
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 90/134 (67%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 531 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 589
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++K+ I EE+TYDY+F D ++
Sbjct: 590 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKI 649
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 650 PCLCGSENCRGTLN 663
>gi|444525923|gb|ELV14211.1| Protein Jade-3 [Tupaia chinensis]
Length = 483
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 101/181 (55%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 203 CDVCRSPDSEEGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRACVMGIH---PQCLLC 258
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+C++C +
Sbjct: 259 PKTGGAMKTTRTGDKWAHVSCALWIPEVNIACPERMEPITKISHIPPSRWALVCNLCKMK 318
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA E L + ++ DDED+ ++ S+C KH
Sbjct: 319 TGACIQCSVKSCITAFHVTCAF----------EHNLEMKTI-LDDEDE-VKFKSYCLKHS 366
Query: 578 Q 578
Q
Sbjct: 367 Q 367
>gi|307105150|gb|EFN53401.1| hypothetical protein CHLNCDRAFT_13462, partial [Chlorella
variabilis]
Length = 157
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/160 (46%), Positives = 96/160 (60%), Gaps = 6/160 (3%)
Query: 386 YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE-PVNGVLWLCNLC 444
Y+P+ V + ++C VC + +Y + + CD C + VH CYG +E P +WLC C
Sbjct: 1 YQPIAVRYSG-ERCCVCDTEADYDFDQLVGCDLCGITVHQSCYGIMELPGPDQMWLCRAC 59
Query: 445 RPGAP-EPPPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRV 502
+P P CC+CPVVGGA+KPT T G W H AC WIPE ++DV EPI+G+ +
Sbjct: 60 ELREDGKPAPQCCVCPVVGGALKPTSTRGLWCHSACLQWIPELSVSDVLAQEPIEGVRSI 119
Query: 503 SKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAG 542
K+RW LLC +C GA IQC C AYHPLCAR AG
Sbjct: 120 PKERWDLLCCVCKQRMGAKIQCE--ACFAAYHPLCARVAG 157
>gi|89273401|emb|CAJ82982.1| bromodomain and PHD finger containing, 1 [Xenopus (Silurana)
tropicalis]
Length = 484
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/184 (39%), Positives = 95/184 (51%), Gaps = 12/184 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C +C+ E +N+ L CD C + VH CYG + + WLC C +P C LC
Sbjct: 277 CCICNDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLCRRCL-QSPSRAVDCALC 334
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T DGRWAH+ CA+WIPE C + +EPID + + RWKL C IC
Sbjct: 335 PNKGGAFKQTDDGRWAHVVCALWIPEVCFANTVFLEPIDSIEHIPPARWKLTCYICKQRG 394
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GACIQC C A+H CA+ AGL +++E E N S +R ++C
Sbjct: 395 SGACIQCHKANCYTAFHVTCAQQAGLYMKMEPVRETGANGTSFS-------VRKTAYCDI 447
Query: 576 HKQP 579
H P
Sbjct: 448 HTPP 451
>gi|346322948|gb|EGX92546.1| histone-lysine N-methyltransferase [Cordyceps militaris CM01]
Length = 1151
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 99/155 (63%), Gaps = 7/155 (4%)
Query: 686 KYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNS 745
K+ +K+ +K + F +S IH +G++ DM+IEY GE VR I++ RE+ S
Sbjct: 1000 KFNQLKKR-KKPVKFARSAIHNWGLYTMEDIHKDDMIIEYVGEEVRQQISEIRENRYLKS 1058
Query: 746 LVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQ 805
+G+ +Y+FRID+ VIDAT+ G IA INHSC PNC +++I V G + I+I+A RDI
Sbjct: 1059 GIGS-SYLFRIDENTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDITT 1117
Query: 806 WEELTYDYRFF----SIDEQLACYCGFPRCRGVVN 836
EELTYDY+F S+D ++ C CG C+G +N
Sbjct: 1118 NEELTYDYKFEREIGSLD-RIPCLCGTAACKGFLN 1151
>gi|255712365|ref|XP_002552465.1| KLTH0C05522p [Lachancea thermotolerans]
gi|238933844|emb|CAR22027.1| KLTH0C05522p [Lachancea thermotolerans CBS 6340]
Length = 721
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/181 (38%), Positives = 92/181 (50%), Gaps = 5/181 (2%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C+VC E +N + CD C + VH CYG + G WLC C C C
Sbjct: 242 CAVCGGTECDNSNAIVFCDGCDVAVHQECYGVVFIPEG-QWLCRRCMISRNRKIN-CLFC 299
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY 518
P GA K T G W H+ C +WIPE + MEPI+G++ + + RWKL C IC
Sbjct: 300 PSHTGAFKQTDTGSWGHVVCGLWIPELYFVNSHYMEPIEGVDLIPRSRWKLTCYICRKKV 359
Query: 519 GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ 578
GACIQCSN C AYH CA+ + LC+ D +++ + ++LLSFC KH
Sbjct: 360 GACIQCSNKNCFCAYHVTCAKRSALCM---DFGSCSIIEASSNAIPPGLKLLSFCDKHSP 416
Query: 579 P 579
P
Sbjct: 417 P 417
>gi|363728949|ref|XP_416870.3| PREDICTED: protein Jade-3 [Gallus gallus]
Length = 812
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 77/192 (40%), Positives = 103/192 (53%), Gaps = 18/192 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 195 CDVCRSPDSEDGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRTCVLGIH---PQCLLC 250
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 251 PKRGGAMKATRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCSLCKLK 310
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LD+ DE ++ S+C KH
Sbjct: 311 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDDGDE---VKFKSYCLKHS 358
Query: 578 QPLNDRLA-VDE 588
+ + L VDE
Sbjct: 359 KNKQNSLPDVDE 370
>gi|427785297|gb|JAA58100.1| Putative histone-lysine n-methyltransferase mll3 [Rhipicephalus
pulchellus]
Length = 3936
Score = 134 bits (337), Expect = 2e-28, Method: Composition-based stats.
Identities = 83/235 (35%), Positives = 122/235 (51%), Gaps = 25/235 (10%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NP+GCARSEP R K P + ++ R P L L
Sbjct: 3724 NPTGCARSEPKL---RTHFKRPHTMHTSNTLR------PSL------QTLGAGGGGGGSC 3768
Query: 665 IGSKFSFSLHRDAPNFL-SMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVI 723
+ + ++ H F+ S + +Y+ MK +R + +SGI G G++A M+I
Sbjct: 3769 LEASSPYTKH-----FVHSKSSQYRRMKTEWRNNVYLARSGIQGLGLYAARDIECHTMII 3823
Query: 724 EYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY 783
EY G+L+R IA+R E IY + G YMFR+D+ RVIDAT +G +A INHSC PNC
Sbjct: 3824 EYIGQLIRNEIAERNEA-IYEAQ-NRGVYMFRLDENRVIDATLSGGLARYINHSCSPNCV 3881
Query: 784 SRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDE--QLACYCGFPRCRGVVN 836
+ ++ ++ + I+I A R I + EELTYDY+F D+ ++ C C CR +N
Sbjct: 3882 AELVQIDRENKILIIANRRITRGEELTYDYKFDYEDDGHKIPCLCSASNCRKWMN 3936
Score = 40.8 bits (94), Expect = 3.0, Method: Composition-based stats.
Identities = 39/163 (23%), Positives = 66/163 (40%), Gaps = 14/163 (8%)
Query: 231 GDLQILSLGKIVKDS-EYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLRDTESKI 289
G L LS+G+++ + F I+P GY VR + S+ + Y + D + +
Sbjct: 3572 GSLIFLSIGQLLPHQLQNFHTQNCIYPVGYKVVRIYWSMHKLGARSRYTCSI-HDVDGQP 3630
Query: 290 RPLFRVTLDNGEQ-FTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISE---SGSD 345
+ RV+ E+ T TP W K+ I + F E A K+ +G D
Sbjct: 3631 QFQIRVSYKGQEKVLTHRTPKGVWQKVLQPI--------ESFRRETQAIKVFSNFITGED 3682
Query: 346 MFGFSNPEVMKLILGLTKSRPTSKSSLCKLTSKYRDLPGGYRP 388
+FG + P ++++ L S + S +LP P
Sbjct: 3683 LFGLTEPAIVRITESLPGVETLSDYNFRYGRSPLLELPLAINP 3725
>gi|308487582|ref|XP_003105986.1| CRE-SET-2 protein [Caenorhabditis remanei]
gi|308254560|gb|EFO98512.1| CRE-SET-2 protein [Caenorhabditis remanei]
Length = 1505
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 80/249 (32%), Positives = 133/249 (53%), Gaps = 34/249 (13%)
Query: 608 GCARSEPYNYFGRRGRK----EPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIR 663
GCAR+ PY + ++ PE ++ S ++ + + N +R
Sbjct: 1271 GCARARPYKKMSMKQKRSLVRRPENESSVSTAIFSERDETAMRHQHLANK-------DMR 1323
Query: 664 VIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRL-AFGKSGIHGFGIFAKHPHRAGDMV 722
++ + SL DA N D +K + FRK++ F +S IHG+G++A +M+
Sbjct: 1324 LLQRRLLTSLG-DASN-----DFFKINQLKFRKKMIKFARSRIHGWGLYAMETIAQDEMI 1377
Query: 723 IEYTGELV---------------RPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRA 767
+EY G+ V R +AD RE +G+ +Y+FRID+ VIDAT+
Sbjct: 1378 VEYIGQTVIVFQNFSSILFHLQIRSLVADEREKAYERRGIGS-SYLFRIDENSVIDATKR 1436
Query: 768 GSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCG 827
G+ A INHSC+PNCY++V+++ G++ I+I+++ I + EE+TYDY+F D+++ C CG
Sbjct: 1437 GNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRSVINKGEEITYDYKFPIEDDKIDCLCG 1496
Query: 828 FPRCRGVVN 836
CRG +N
Sbjct: 1497 AKACRGYLN 1505
>gi|345490044|ref|XP_001603865.2| PREDICTED: LOW QUALITY PROTEIN: hypothetical protein LOC100120205
[Nasonia vitripennis]
Length = 5138
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 88/237 (37%), Positives = 118/237 (49%), Gaps = 32/237 (13%)
Query: 605 NPSGCARSEPY--NYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLP-S 661
NPSGCAR+EP N + R + A++S + ++ N P S
Sbjct: 4929 NPSGCARTEPRLRNQLPWK-RPHTQRTASSSARAIYAP---------ASNSSGELACPYS 4978
Query: 662 IRVIGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDM 721
+ + SK S +YK MK+ +R + +S I G G++A M
Sbjct: 4979 KQFVHSKSS---------------QYKKMKQDWRNNVFLARSKIQGLGLYAARDLEKHTM 5023
Query: 722 VIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPN 781
VIEY GE+VR +AD RE G YMFR+D+ RV+DAT G +A INHSC PN
Sbjct: 5024 VIEYIGEIVRNELADIREKQY--EAKNRGIYMFRLDENRVVDATLCGGLARYINHSCNPN 5081
Query: 782 CYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
C + V +IIFAKR I + EEL YDY+F D+Q +AC CG P CR +N
Sbjct: 5082 CVVENVEVERKLRLIIFAKRRILRGEELAYDYKFDIEDDQHKIACACGAPNCRKWMN 5138
>gi|453089066|gb|EMF17106.1| hypothetical protein SEPMUDRAFT_146201 [Mycosphaerella populorum
SO2202]
Length = 1177
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 94/179 (52%), Gaps = 11/179 (6%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
KCS+C + N + CD C + VH CYG + + W C C+ P C
Sbjct: 432 KCSICDDGDCENANAIIFCDGCDLAVHQECYG-VPFIPEGQWFCRKCKEIG-RGTPTCIF 489
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP V GA K T+ RW+HL CAIWIPE + ++ MEPI +++V K+RWKL C IC
Sbjct: 490 CPNVDGAFKQTSTLRWSHLLCAIWIPEVNIANMTFMEPIQDVDKVPKNRWKLSCYICNQK 549
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH 576
GACIQC N C A+H CAR A L ++++ + + + L +FC KH
Sbjct: 550 MGACIQCGNKNCYQAFHVTCARRARLFLKMKSQSQGGI---------DTTALKAFCDKH 599
>gi|123454343|ref|XP_001314927.1| SET domain containing protein [Trichomonas vaginalis G3]
gi|121897588|gb|EAY02704.1| SET domain containing protein [Trichomonas vaginalis G3]
Length = 486
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 90/141 (63%), Gaps = 3/141 (2%)
Query: 698 LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRID 757
+ F KS I GFG+ P R G+ VIEY GE++RP IAD+R+ Y + GTY+F+ D
Sbjct: 293 IRFEKSKIEGFGVKTTIPIRKGEKVIEYIGEVIRPIIADKRQ-INYEKMGNHGTYVFKAD 351
Query: 758 DERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR--F 815
+ +DAT G IA INHSC+PNC SR+I +NG +++ A +DI EELTYDY+ +
Sbjct: 352 SDHYLDATFRGGIARWINHSCDPNCESRIIKLNGRFAVVLVAIKDINPCEELTYDYKLPY 411
Query: 816 FSIDEQLACYCGFPRCRGVVN 836
D+ + C CG P CRG +N
Sbjct: 412 EPEDKAIKCLCGSPNCRGWLN 432
>gi|207344594|gb|EDZ71692.1| YHR119Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 1080
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 94/145 (64%), Gaps = 4/145 (2%)
Query: 695 RKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMF 754
+K + F +S IH +G++A A +M+IEY GE +R +A+ RE + +G+ +Y+F
Sbjct: 937 KKPVMFARSAIHNWGLYALDFIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGS-SYLF 995
Query: 755 RIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYR 814
R+D+ VIDAT+ G IA INH C+PNC +++I V G I+I+A RDI EELTYDY+
Sbjct: 996 RVDENTVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYK 1055
Query: 815 F---FSIDEQLACYCGFPRCRGVVN 836
F +E+L C CG P C+G +N
Sbjct: 1056 FEREKDDEERLPCLCGAPNCKGFLN 1080
>gi|224043022|ref|XP_002195036.1| PREDICTED: protein Jade-3 [Taeniopygia guttata]
Length = 819
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/181 (40%), Positives = 98/181 (54%), Gaps = 17/181 (9%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC + N + CDKC + VH CYG L+ G WLC C G P C LC
Sbjct: 204 CDVCRSPDSEDGNDMVFCDKCNICVHQACYGILKVPEGS-WLCRTCVLGIH---PQCLLC 259
Query: 459 PVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
P GGAMK T G +WAH++CA+WIPE + +RMEPI ++ + RW L+CS+C +
Sbjct: 260 PKRGGAMKATRTGTKWAHVSCALWIPEVSIACPERMEPITKVSHIPPSRWALVCSLCKLK 319
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHK 577
GACIQCS +C A+H CA L ++ LD+ DE ++ S+C KH
Sbjct: 320 TGACIQCSVKSCITAFHVTCAFEHSLEMK---------TILDDGDE---VKFKSYCLKHS 367
Query: 578 Q 578
+
Sbjct: 368 K 368
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 120/234 (51%), Gaps = 26/234 (11%)
Query: 605 NPSGCARSEPYNYFGRRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRV 664
NP+GCAR++P A++ + + QP V S ++ S +
Sbjct: 4663 NPAGCARAQPK--------------ASSPYTSVVIRPQPQAV--------SSSSARSNQN 4700
Query: 665 IGSKFSFSLHRDAPNFLSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIE 724
+ + H P + +Y+ MK +R + G+S I G G+FA MVIE
Sbjct: 4701 VAQGEGGAPHNKPPVVHPRSSQYRWMKSEWRANVYLGRSRIQGLGLFAARDIEKQTMVIE 4760
Query: 725 YTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS 784
Y G ++R +A R+E IY S A +MFRID E V+DAT G IA INHSC PNC +
Sbjct: 4761 YNGTILRNEVAIRKEK-IYRSKNRA-VFMFRIDSEHVVDATLNGGIARYINHSCAPNCVA 4818
Query: 785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQ--LACYCGFPRCRGVVN 836
V++ III R I++ EEL ++Y+F S++ Q +AC+CG P CR +N
Sbjct: 4819 EVVTFERGYKIIISCIRRIEKGEELCFNYQFESVEGQHKIACHCGAPECRKWIN 4872
>gi|47229814|emb|CAG07010.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1523
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/181 (39%), Positives = 98/181 (54%), Gaps = 12/181 (6%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
C VC DE +N+ L CD C + VH CYG + V WLC C +P P C LC
Sbjct: 230 CCVCLDDECLNSNVILFCDICNLAVHQECYG-VPYVPEGQWLCRCCL-QSPSRPVDCVLC 287
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS- 517
P GGA K T+DGRWAH+ CAIWIPE C + +EP++G+ + RWKL C +C
Sbjct: 288 PNRGGAFKQTSDGRWAHVVCAIWIPEVCFANTVFLEPVEGIKNIPPARWKLTCYLCKQKG 347
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELED--EDRLNLLSLDEDDEDQCIRLLSFCKK 575
GA IQC C A+H CA+ AGL ++++ E +N + ++ +FC+
Sbjct: 348 RGASIQCHKANCYRAFHVTCAQKAGLFMKIDPVRETGVNGTTFS-------VKKTAFCEH 400
Query: 576 H 576
H
Sbjct: 401 H 401
>gi|302501458|ref|XP_003012721.1| hypothetical protein ARB_00972 [Arthroderma benhamiae CBS 112371]
gi|291176281|gb|EFE32081.1| hypothetical protein ARB_00972 [Arthroderma benhamiae CBS 112371]
Length = 1363
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 143/281 (50%), Gaps = 51/281 (18%)
Query: 600 YIPPSNPSGCARSE--------------PYNYFGRRGRKEPEALAAASLKRLFVENQPYL 645
Y+P NPSG AR++ P+ ++ R+E EALA + +
Sbjct: 1090 YVP--NPSGSARTDGRKRILESEKSKYLPHRIKVQKAREEREALAKSDPTAAADAARVIA 1147
Query: 646 VGGYCQNGLSG---NTLPSIRVIGSKFSFSLHRDA-PNFLSMAD--KYKHMKETFRKRLA 699
++ ++ N + + R+I + + A P D ++ +K+ +K +
Sbjct: 1148 AKTLSKSTITSSRSNRVNNRRLIAD---INAQKQALPTQSGEGDVLRFNQLKKR-KKPVR 1203
Query: 700 FGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDE 759
F +S IH +G++A+ A DM+IEY GE VR +AD RE S +G+ +Y+FRID+
Sbjct: 1204 FARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDEN 1262
Query: 760 RVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQW------------- 806
VIDAT+ G IA INHSC PNC +++I V+G + I+I+A RDI++
Sbjct: 1263 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERGGLIHSKLLLSIYD 1322
Query: 807 --------EELTYDYRF---FSIDEQLACYCGFPRCRGVVN 836
EELTYDY+F + D+++ C CG C+G +N
Sbjct: 1323 RLLTFCVDEELTYDYKFEREWDSDDRIPCLCGSTGCKGFLN 1363
>gi|345564212|gb|EGX47192.1| hypothetical protein AOL_s00097g31 [Arthrobotrys oligospora ATCC
24927]
Length = 1479
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 84/153 (54%), Gaps = 1/153 (0%)
Query: 398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCL 457
+C +C E +N + CD C + VH CYG + + WLC C AP C
Sbjct: 386 RCQICDDGECENSNAIVFCDGCNIAVHQDCYG-VPFIPEGQWLCRRCSLLAPRREVSCIF 444
Query: 458 CPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVS 517
CP GA K T W+HL CAIWIPE ++++ MEP++G+ V K RWKL C IC
Sbjct: 445 CPNTDGAFKMTDSSLWSHLLCAIWIPEVTISNMVYMEPVEGVELVPKSRWKLHCYICKQR 504
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELEDE 550
GACIQCSN C +A+H CAR A L + + +
Sbjct: 505 MGACIQCSNKNCYLAFHVTCARKAKLFLSMRQQ 537
>gi|67586322|ref|XP_665182.1| SET-domain protein [Cryptosporidium hominis TU502]
gi|54655690|gb|EAL34953.1| SET-domain protein [Cryptosporidium hominis]
Length = 209
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/147 (45%), Positives = 98/147 (66%), Gaps = 4/147 (2%)
Query: 694 FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT-Y 752
+ KRL KS IHGFG+FAK + G+ +IEY GEL+R S+AD+RE ++ G+ Y
Sbjct: 63 YDKRLNIKKSSIHGFGLFAKELIKTGEPIIEYVGELIRNSVADKRESLYKSNGNRDGSCY 122
Query: 753 MFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG-DEHIIIFAKRDIKQWEELTY 811
MFR+D+ VIDAT G+ A +NH C+PN +VIS++ ++HI+IF+K+ I + EE+TY
Sbjct: 123 MFRLDESSVIDATNIGNHARFMNHCCDPNSICKVISIDSQNKHIVIFSKKTINKDEEITY 182
Query: 812 DYRFF--SIDEQLACYCGFPRCRGVVN 836
DY+F E++ C+CG C G +N
Sbjct: 183 DYQFNVEEASEKIICHCGASNCLGRMN 209
>gi|260798747|ref|XP_002594361.1| hypothetical protein BRAFLDRAFT_114040 [Branchiostoma floridae]
gi|229279595|gb|EEN50372.1| hypothetical protein BRAFLDRAFT_114040 [Branchiostoma floridae]
Length = 962
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 84/151 (55%), Gaps = 3/151 (1%)
Query: 399 CSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC 458
CS+C E +N+ L CD C + VH CYG + + WL C +P C LC
Sbjct: 269 CSICQDGECQNSNVILFCDMCNLAVHQECYG-VPYIPEGQWLXRRCL-QSPSRAVDCVLC 326
Query: 459 PVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-S 517
P GGA K T D RWAH+ CA+WIPE C + +EPID ++ + RWKL C IC
Sbjct: 327 PNKGGAFKQTDDARWAHVVCALWIPEVCFANTVFLEPIDSIDHIPTARWKLTCYICKQRG 386
Query: 518 YGACIQCSNTTCRVAYHPLCARAAGLCVELE 548
GACIQC C A+H CA+ AGL +++E
Sbjct: 387 VGACIQCHKANCYTAFHVTCAQHAGLYMKME 417
>gi|66359990|ref|XP_627173.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
gi|46228588|gb|EAK89458.1| multidomain chromatinic protein with the following architecture: 3x
PHD-bromo-3xPHD-SET domain and associated cysteine
cluster at the C-terminus [Cryptosporidium parvum Iowa
II]
Length = 2244
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/147 (45%), Positives = 98/147 (66%), Gaps = 4/147 (2%)
Query: 694 FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGT-Y 752
+ KRL KS IHGFG+FAK + G+ +IEY GEL+R S+AD+RE ++ G+ Y
Sbjct: 2098 YDKRLNIKKSSIHGFGLFAKELIKTGEPIIEYVGELIRNSVADKRESLYKSNGNRDGSCY 2157
Query: 753 MFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNG-DEHIIIFAKRDIKQWEELTY 811
MFR+D+ VIDAT G+ A +NH C+PN +VIS++ ++HI+IF+K+ I + EE+TY
Sbjct: 2158 MFRLDESSVIDATNIGNHARFMNHCCDPNSICKVISIDSQNKHIVIFSKKTINKDEEITY 2217
Query: 812 DYRFF--SIDEQLACYCGFPRCRGVVN 836
DY+F E++ C+CG C G +N
Sbjct: 2218 DYQFNVEEASEKIICHCGASNCLGRMN 2244
>gi|345791349|ref|XP_543382.3| PREDICTED: histone-lysine N-methyltransferase SETD1B [Canis lupus
familiaris]
Length = 1920
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 62/134 (46%), Positives = 89/134 (66%), Gaps = 1/134 (0%)
Query: 703 SGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVI 762
S IH +G+FA P A +MVIEY G+ +R IAD RE + +G+ +YMFR+D + +I
Sbjct: 1788 SHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGS-SYMFRVDHDTII 1846
Query: 763 DATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQL 822
DAT+ G+ A INHSC PNCY++VI+V + I+I++ + I EE+TYDY+F D ++
Sbjct: 1847 DATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSNQHINVNEEITYDYKFPIEDVKI 1906
Query: 823 ACYCGFPRCRGVVN 836
C CG CRG +N
Sbjct: 1907 PCLCGSENCRGTLN 1920
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.138 0.435
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,755,565,077
Number of Sequences: 23463169
Number of extensions: 661242451
Number of successful extensions: 1361708
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4393
Number of HSP's successfully gapped in prelim test: 3331
Number of HSP's that attempted gapping in prelim test: 1339023
Number of HSP's gapped (non-prelim): 13996
length of query: 860
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 708
effective length of database: 8,792,793,679
effective search space: 6225297924732
effective search space used: 6225297924732
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 82 (36.2 bits)